We define universal adversarial triggers: input-agnostic sequences of tokens that trigger a model to produce a specific prediction when ...
確定! 回上一頁