
Fan Yin

instruction tuning

robustness

interpretability

large language models

contrastive learning

faithfulness

uncertainty estimation

model interpretation

sensitivity

graph

multi-step reasoning

task selection

post-hoc interpretations

adversarial example detection

amortization

Presentations: 7

Number of views: 4

SHORT BIO

Hi, I am a third-year PhD student in the Computer Science department at the University of California, Los Angeles (UCLA), advised by Prof. Kai-Wei Chang. Previously, I received my B.S. degree in Computer Science from Peking University in 2020, where I worked with Prof. Xiaojun Wan. My research interests are robustness and interpretability for trustworthy NLP. My recent research tries to understand the characteristics of adversarial examples and connect them to the interpretability and debugging of model behavior.


