
Presentations

Sing it, Narrate it: Quality Musical Lyrics Translation
Ye Zhuorui and 2 other authors

Walking in Others' Shoes: How Perspective-Taking Guides Large Language Models in Reducing Toxicity and Bias
Rongwu Xu and 7 other authors

Knowledge Conflicts for LLMs: A Survey
Rongwu Xu and 6 other authors

Course-Correction: Safety Alignment Using Synthetic Preferences
Rongwu Xu and 8 other authors

Preemptive Answer “Attacks” on Chain-of-Thought Reasoning
Rongwu Xu and 2 other authors

How Alignment and Jailbreak Work: Explain LLM Safety through Intermediate Hidden States
Zhenhong Zhou and 5 other authors