
Wenkai Yang
pre-trained language model
pre-trained language models
backdoor attack
ood detection
natural language processing; backdoor attacks; stealthiness
backdoor defense
model fine-tuning
intermediate feature
6
presentations
7
number of views
6
citations
SHORT BIO
I am a second-year master student at Peking University.
Presentations

Fine-Tuning Deteriorates General Textual Out-of-Distribution Detection by Distorting Task-Agnostic Features
Sishuo Chen and 3 other authors

Well-Classified Examples are Underestimated in Classification with Deep Neural Networks
Guangxiang Zhao and 5 other authors

RAP: Robustness-Aware Perturbations for Defending against Backdoor Attacks on NLP Models
Wenkai Yang and 4 other authors

Rethinking Stealthiness of Backdoor Attack against NLP Models
Wenkai Yang and 4 other authors

Be Careful about Poisoned Word Embeddings: Exploring the Vulnerability of the Embedding Layers in NLP Models
Wenkai Yang and 5 other authors

Expose Backdoors on the Way: A Feature-Based Efficient Defense against Textual Backdoor Attacks
Sishuo Chen and 4 other authors