
Chenhui Chu
speech translation
continual learning
multimodal
large language models
visual grounding
attention
named entity recognition
multimodal machine translation
speech processing
video
survey
hallucination
multilingualism
large language model
fine-tuning
9
presentations
SHORT BIO
Chenhui Chu received his B.S. in software engineering from Chongqing University in 2008, and his M.S. and Ph.D. in Informatics from Kyoto University in 2012 and 2015, respectively. He is currently a program-specific associate professor at Kyoto University. His research interests include natural language processing, particularly machine translation and multimodal machine learning.
Presentations

MELD-ST: An Emotion-aware Speech Translation Dataset
Sirou Chen and 6 other authors

MM-LLMs: Recent Advances in MultiModal Large Language Models
Duzhen Zhang and 6 other authors

Flexible Weight Tuning and Weight Fusion Strategies for Continual Named Entity Recognition
Yahan Yu and 3 other authors

Video-Helpful Multimodal Machine Translation
Yihang Li and 4 other authors

Vision-Enhanced Semantic Entity Recognition in Document Images via Visually-Asymmetric Consistency Learning
Hao Wang and 3 other authors

Towards Speech Dialogue Translation Mediating Speakers of Different Languages
Shuichiro Shimizu and 3 other authors

Flexible Visual Grounding
Yongmin Kim and 2 other authors

Attending Self-Attention: A Case Study ofVisually Grounded Supervision in Vision-and-Language Transformers
Jules Samaran and 4 other authors

WRIME: A New Dataset for Emotional Intensity Estimation with Subjective and Objective Annotations
Tomoyuki Kajiwara and 4 other authors