
Yuchen Hu
PhD Student @ Nanyang Technological University
audio-visual speech recognition
mutual information maximization
adversarial network
reinforcement leanring
modality-invariant representations
frame-level
noise-robustness
viseme-phoneme mapping
modality transfer
representation learning
multichannel multi-modal speech
4
presentations
13
number of views
SHORT BIO
I am a Ph.D. student at School of Computer Science and Engineering, Nanyang Technological University. My research focus on LLMs, speech processing and multimodal.
Presentations

Multichannel AV-wav2vec2: A Framework for Learning Multichannel Multi-Modal Speech Representation | VIDEO
Qiushi Zhu and 4 other authors

MIR-GAN: Refining Frame-Level Modality-Invariant Representations with Adversarial Network for Audio-Visual Speech Recognition
Yuchen Hu and 4 other authors

Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition
Yuchen Hu and 5 other authors

Leveraging Modality-specific Representations for Audio-visual Speech Recognition via Reinforcement Learning
Chen Chen and 5 other authors