
Zhihong Shao
non-autoregressive generation
mutual information
large language models
code generation
mathematical reasoning
numerical reasoning
alignment
weakly supervised question answering
learning method
multi-answer qa
recall-then-verify framework
structured predictions
scalable oversight
process reward models
4
presentations
SHORT BIO
I’m a final-year Ph.D. student in Conversational AI Group, Department of Computer Science and Technology, Tsinghua University. I’m fortunate to be advised by Prof. Minlie Huang.
My interests are in natural language processing and deep learning. I am particularly interested in how we can build a robust and scalable AI system that can leverage diverse skills (e.g., tool use and reasoning) to aggregate possibly-heterogeneous information and answer natural language questions precisely regardless of their complexity.
Presentations

Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human Annotations
Peiyi Wang and 8 other authors

Learning Task Decomposition to Assist Humans in Competitive Programming
Jiaxin Wen and 5 other authors

A Mutual Information Maximization Approach for the Spurious Solution Problem in Weakly Supervised Question Answering
Zhihong Shao and 3 other authors

Chaining Simultaneous Thoughts for Numerical Reasoning
Zhihong Shao