
4
presentations
Presentations

HoneypotNet: Backdoor Attacks Against Model Extraction
Tianle Gu and 4 other authors

ESC-Eval: Evaluating Emotion Support Conversations in Large Language Models
Haiquan Zhao and 13 other authors

Flames: Benchmarking Value Alignment of LLMs in Chinese
Kexin Huang and 11 other authors

Fake Alignment: Are LLMs Really Aligned Well?
Yixu Wang and 9 other authors