
Haoxuan You
Doctoral student @ Columbia University
commonsense reasoning
pre-training
commonsense
qa
multimodal
vision-language
zero-shot learning
vision-language understanding
4 presentations
SHORT BIO
I am a fifth-year Ph.D. candidate at Columbia University focusing on commonsense reasoning with multimodal knowledge.
Presentations

UniFine: A Unified and Fine-grained Approach for Zero-shot Vision-Language Understanding
Rui Sun and 5 other authors

Understanding ME? Multimodal Evaluation for Fine-grained Visual Commonsense
Zhecan Wang and 5 other authors

Bridging the Gap between Recognition-level Pre-training and Commonsensical Vision-language Tasks
Yue Wan and 4 other authors

Unsupervised Vision-and-Language Pre-training Without Parallel Images and Captions
Liunian Harold Li and 5 other authors