
3
presentations
SHORT BIO
I'm a second-year master student at Beijing University of Posts and Telecommunications, advised by Sen Su, I’m currently interested in LLM Safety, jailbreak and privacy specifically.
Presentations

Course-Correction: Safety Alignment Using Synthetic Preferences
Rongwu Xu and 8 other authors

Alignment-Enhanced Decoding: Defending via Token-Level Adaptive Refining of Probability Distributions
Quan Liu and 5 other authors

How Alignment and Jailbreak Work: Explain LLM Safety through Intermediate Hidden States
Zhenhong Zhou and 5 other authors