profile picture

Liang Zhang

Ph.D. student @ Renmin University of China

information extraction

evaluation

text generation

question answering

fact-checking

image captioning

representation learning

video understanding

fine-grained

multilingual generation

large language model

multimodal reading comprehension

multi-modal pre-training

multilingual instruction following

large vision-language model

6

presentations

2

citations

SHORT BIO

Liang Zhang received his B.Sc. degree in computer science and technology from the China University of Mining and Technology, Beijing in 2020. He is currently a Ph.D. student at the School of Information, Renmin University of China.

His research interests include multilingual machine learning, cross-modal retrieval, and multimodal reading comprehension.

Presentations

TinyChart: Efficient Chart Understanding with Program-of-Thoughts Learning and Visual Token Merging

Liang Zhang and 7 other authors

Respond in my Language: Mitigating Language Inconsistency in Response Generation based on Large Language Models

Liang Zhang and 4 other authors

Less is More: Mitigating Multimodal Hallucination from an EOS Decision Perspective

Zihao Yue and 2 other authors

InfoMetIC: An Informative Metric for Reference-free Image Caption Evaluation

Anwen Hu and 3 other authors

Accommodating Audio Modality in CLIP for Multimodal Processing

Ludan Ruan and 5 other authors

MPMQA: Multimodal Question Answering on Product Manuals

Liang Zhang and 4 other authors

Stay up to date with the latest Underline news!

Select topic of interest (you can select more than one)

PRESENTATIONS

  • All Presentations
  • For Librarians
  • Resource Center
  • Free Trial
Underline Science, Inc.
1216 Broadway, 2nd Floor, New York, NY 10001, USA

© 2025 Underline - All rights reserved