profile picture

Yaoyuan Liang

Tsinghua University

visual grounding

detection

vision and language

transformer

video understanding

detr

vision-and-language

2

presentations

SHORT BIO

I am a Ph.D student in Shenzhen Key Laboratory of Ubiquitous Data Enabling, Tsinghua Shenzhen International Graduate School, Tsinghua University. Research interests: Multi-modal Learning, Vision-Language Understanding and LLM-enhanced Visual Grounding

Presentations

CoSTA: End-to-End Comprehensive Space-Time Entanglement for Spatio-Temporal Video Grounding

Yaoyuan Liang and 7 other authors

DQ-DETR: Dual Query Detection Transformer for Phrase Extraction and Grounding

Shilong Liu and 7 other authors

Stay up to date with the latest Underline news!

Select topic of interest (you can select more than one)

PRESENTATIONS

  • All Presentations
  • For Librarians
  • Resource Center
  • Free Trial
Underline Science, Inc.
1216 Broadway, 2nd Floor, New York, NY 10001, USA

© 2025 Underline - All rights reserved