Fan Yin
instruction tuning
robustness
interpretability
large language models
contrastive learning
faithfulness
uncertainty estimation
model interpretation
sensitivity
graph
multi-step reasoning
task selection
post-hoc interpretations
adversarial example detection
amortization
7
presentations
4
number of views
SHORT BIO
Hi, I am a third-year PhD student in the Computer Science Department at the University of California, Los Angeles (UCLA), advised by Prof. Kai-Wei Chang. Previously, I received my B.S. degree in Computer Science from Peking University in 2020, where I worked with Prof. Xiaojun Wan. My research interests are robustness and interpretability for trustworthy NLP. My recent research tries to understand the characteristics of adversarial examples and connect them to the interpretability and debugging of model behaviors.