
Cairong Zhao
multi-modal vision
image and video retrieval
self-supervised learning
computer vision
person re-identification
speaker recognition
residual learning
speech and multimodality
representation learning for vision
membership inference attack
similarity distribution shift
knowledge distillation
anomaly segmentation
language and vision
5
presentations
22
number of views
Presentations

Self-Supervised Likelihood Estimation with Energy Guidance for Anomaly Segmentation in Urban Scenes | VIDEO
Tu Yuanpeng and 6 other authors

Cross-Modal Distillation for Speaker Recognition
Cairong Zhao and 5 other authors

Similarity Distribution based Membership Inference Attack on Person Re-identification
Cairong Zhao and 8 other authors

Learning Hierarchical Prompt with Structured Linguistic Knowledge for Vision-Language Models
Yubin Wang and 4 other authors

Diverse Person: Customize Your Own Dataset for Text-Based Person Search | VIDEO
Zifan Song and 2 other authors