Weidong Cai

Associate Professor @ The University of Sydney

3d computer vision

3d modeling

generative modeling

ml: applications

representation learning for vision

ml: multimodal learning

text-to-3d

language model safety

ml: deep neural architectures and foundation models

human-centered language modeling

context-aware model detoxification

balance of generation quality and detoxification

language model for children

stratified masking

segmentation

7 presentations

SHORT BIO

Dr. Weidong Cai is an Associate Professor in the School of Computer Science, Director of the Multimedia Laboratory, and Associate Director of the Biomedical & Multimedia Information Technology (BMIT) Research Group at the University of Sydney. He was a Lead Investigator / Visiting Professor on medical image analysis and medical computer vision at Harvard Medical School in 2014. He received his PhD degree in Computer Science from the Basser Department of Computer Science, The University of Sydney, in 2001. His research interests include image/video processing, medical image computing, computer vision, pattern recognition, machine learning, multimedia computing, and computational neuroscience. He has published more than 300 peer-reviewed papers in leading international journals and proceedings of top international conferences.

Presentations

CLIP-CID: Efficient CLIP Distillation via Cluster-Instance Discrimination

Xiang An and 7 other authors

Enhancing Advanced Visual Reasoning Ability of Large Language Models

Zhiyuan Li and 5 other authors

V2A-Mapper: A Lightweight Solution for Vision-to-Audio Generation by Connecting Foundation Models

Heng Wang and 4 other authors

PaintHuman: Towards High-Fidelity Text-to-3D Human Texturing via Denoised Score Distillation

Jianhui Yu and 5 other authors

Rethinking Rotation Invariance with Point Cloud Registration

Jianhui Yu and 2 other authors

PaRot: Patch-Wise Rotation-Invariant Network via Feature Disentanglement and Pose Restoration

Dingxin Zhang and 3 other authors

RWKV-CLIP: A Robust Vision-Language Representation Learner

Tiancheng Gu and 6 other authors
