
Xiaoshuai Sun
language and vision
multi-modal vision
cv
3d computer vision
image & video synthesis
human-object interaction detection
vision-language knowledge distillation
3d vision, multimodal, graph neural network
cv: language and vision; cv: multi-modal vision; cv: segmentation
zero-shot
cv: scene analysis & understanding; cv: multi-modal vision; ml: multimodal learning
segmentation
computational photography
9 presentations
SHORT BIO
Xiaoshuai Sun (Member, IEEE) received the B.S. degree in computer science from Harbin Engineering University, Harbin, China, in 2007, and the M.S. and Ph.D. degrees in computer science and technology from the Harbin Institute of Technology, Harbin, in 2009 and 2015, respectively. He was a Postdoctoral Research Fellow with the University of Queensland from 2015 to 2016. He served as a Lecturer with the Harbin Institute of Technology from 2016 to 2018. He is currently an Associate Professor with Xiamen University, China. He was a recipient of the Microsoft Research Asia Fellowship in 2011.
Presentations

3D-STMN: Dependency-Driven Superpoint-Text Matching Network for End-to-End 3D Referring Expression Segmentation
Changli Wu and 6 other authors

Towards Real-Time Panoptic Narrative Grounding by an End-to-End Grounding Network
Haowei Wang and 4 other authors

End-to-End Zero-Shot HOI Detection via Vision and Language Knowledge Distillation
Xiaoshuai Sun and 5 other authors

Improving Image Captioning by Leveraging Intra- and Inter-layer Global Representation in Transformer Network
Jiayi Ji and 7 other authors

Dual-Level Collaborative Transformer for Image Captioning
Yunpeng Luo and 7 other authors

Toward Open-Set Human-Object Interaction Detection
Mingrui Wu and 4 other authors

Towards Efficient Diffusion-Based Image Editing with Instant Attention Masks
Siyu Zou and 7 other authors

Improving Panoptic Narrative Grounding by Harnessing Semantic Relationships and Visual Confirmation
Tianyu Guo and 4 other authors

X-RefSeg3D: Enhancing Referring 3D Instance Segmentation via Structured Cross-Modal Graph Neural Networks
Zhipeng Qian and 3 other authors