
Hao Lang
Alibaba Group, China
ood detection
alignment
llms
rlhf
2
presentations
3
number of views
SHORT BIO
I am currently working in the Conversational AI Group of DAMO Academy, Alibaba. My research interests are in conversational AI, especially natural language understanding and learning and inference under distribution shift.
Presentations

Fine-Tuning Language Models with Reward Learning on Policy
Hao Lang and 2 other authors

Estimating Soft Labels for Out-of-Domain Intent Detection
Hao Lang