
Liang Zhang
Ph.D. student @ Renmin University of China
information extraction
evaluation
text generation
question answering
fact-checking
image captioning
representation learning
video understanding
fine-grained
multilingual generation
large language model
multimodal reading comprehension
multi-modal pre-training
multilingual instruction following
large vision-language model
6 presentations
2 citations
SHORT BIO
Liang Zhang received his B.Sc. degree in computer science and technology from the China University of Mining and Technology, Beijing, in 2020. He is currently a Ph.D. student at the School of Information, Renmin University of China.
His research interests include multilingual machine learning, cross-modal retrieval, and multimodal reading comprehension.
Presentations

TinyChart: Efficient Chart Understanding with Program-of-Thoughts Learning and Visual Token Merging
Liang Zhang and 7 other authors

Respond in my Language: Mitigating Language Inconsistency in Response Generation based on Large Language Models
Liang Zhang and 4 other authors

Less is More: Mitigating Multimodal Hallucination from an EOS Decision Perspective
Zihao Yue and 2 other authors

InfoMetIC: An Informative Metric for Reference-free Image Caption Evaluation
Anwen Hu and 3 other authors

Accommodating Audio Modality in CLIP for Multimodal Processing
Ludan Ruan and 5 other authors

MPMQA: Multimodal Question Answering on Product Manuals
Liang Zhang and 4 other authors