"That Is a Suspicious Reaction!": Interpreting Logits Variation to Detect NLP Adversarial Attacks Paper โข 2204.04636 โข Published Apr 10, 2022 โข 1