profile picture

Yuchen Hu

PhD Student @ Nanyang Technological University

audio-visual speech recognition

mutual information maximization

adversarial network

reinforcement leanring

modality-invariant representations

frame-level

noise-robustness

viseme-phoneme mapping

modality transfer

representation learning

multichannel multi-modal speech

4

presentations

13

number of views

SHORT BIO

I am a Ph.D. student at School of Computer Science and Engineering, Nanyang Technological University. My research focus on LLMs, speech processing and multimodal.

Presentations

Multichannel AV-wav2vec2: A Framework for Learning Multichannel Multi-Modal Speech Representation | VIDEO

Qiushi Zhu and 4 other authors

MIR-GAN: Refining Frame-Level Modality-Invariant Representations with Adversarial Network for Audio-Visual Speech Recognition

Yuchen Hu and 4 other authors

Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition

Yuchen Hu and 5 other authors

Leveraging Modality-specific Representations for Audio-visual Speech Recognition via Reinforcement Learning

Chen Chen and 5 other authors

Stay up to date with the latest Underline news!

Select topic of interest (you can select more than one)

PRESENTATIONS

  • All Presentations
  • For Librarians
  • Resource Center
  • Free Trial
Underline Science, Inc.
1216 Broadway, 2nd Floor, New York, NY 10001, USA

© 2025 Underline - All rights reserved