Return to Article Details
White-Box Attacks on Hate-speech BERT Classifiers in German with Explicit and Implicit Character Level Defense