Xiaoshuai Sun

language and vision

multi-modal vision

cv

3d computer vision

image & video synthesis

human-object interaction detection

vision-language knowledge distillation

3d vision, multimodal, graph neural network

cv: language and vision; cv: multi-modal vision; cv: segmentation

zero-shot

cv: scene analysis & understanding; cv: multi-modal vision; ml: multimodal learning

segmentation

computational photography

9 presentations

SHORT BIO

Xiaoshuai Sun (Member, IEEE) received the B.S. degree in computer science from Harbin Engineering University, Harbin, China, in 2007, and the M.S. and Ph.D. degrees in computer science and technology from the Harbin Institute of Technology, Harbin, in 2009 and 2015, respectively. He was a Postdoctoral Research Fellow with the University of Queensland from 2015 to 2016. He served as a Lecturer with the Harbin Institute of Technology from 2016 to 2018. He is currently an Associate Professor with Xiamen University, China. He was a recipient of the Microsoft Research Asia Fellowship in 2011.

Presentations

3D-STMN: Dependency-Driven Superpoint-Text Matching Network for End-to-End 3D Referring Expression Segmentation

Changli Wu and 6 other authors

Towards Real-Time Panoptic Narrative Grounding by an End-to-End Grounding Network

Haowei Wang and 4 other authors

End-to-End Zero-Shot HOI Detection via Vision and Language Knowledge Distillation

Xiaoshuai Sun and 5 other authors

Improving Image Captioning by Leveraging Intra- and Inter-layer Global Representation in Transformer Network

Jiayi Ji and 7 other authors

Dual-Level Collaborative Transformer for Image Captioning

Yunpeng Luo and 7 other authors

Toward Open-Set Human-Object Interaction Detection

Mingrui Wu and 4 other authors

Towards Efficient Diffusion-Based Image Editing with Instant Attention Masks

Siyu Zou and 7 other authors

Improving Panoptic Narrative Grounding by Harnessing Semantic Relationships and Visual Confirmation

Tianyu Guo and 4 other authors

X-RefSeg3D: Enhancing Referring 3D Instance Segmentation via Structured Cross-Modal Graph Neural Networks

Zhipeng Qian and 3 other authors
