
Hao Liu
deep neural networks
question answering
language and vision
self-supervised learning
contrastive learning
frequency
multi-modal vision
textual attribute recognition
masked image modeling
Presentations (4)

The Devil is in the Frequency: Geminated Gestalt Autoencoder for Self-Supervised Visual Pre-Training
Hao Liu and 6 other authors

Locate Then Generate: Bridging Vision and Language with Bounding Box for Scene-Text VQA
Yongxin Zhu and 6 other authors

TaCo: Textual Attribute Recognition via Contrastive Learning
Yiqing Hu and 5 other authors

Perceiving Stroke-Semantic Context: Hierarchical Contrastive Learning for Robust Scene Text Recognition
Hao Liu and 7 other authors