
Difei Gao
National University of Singapore
continual learning
vqa
scene graph
industrial application
ai-assistance
video language understanding; approaches for computing efficiency; video moment retrieval via language; contrastive learning
task-oriented video question answering dataset
assembly-disassembly
common ground theory for industrial task collaboration
3 presentations
8 views
Presentations

GazeVQA: A Video Question Answering Dataset for Multiview Eye-Gaze Task-Oriented Collaborations
Muhammet Furkan ILASLAN and 7 other authors

CONE: An Efficient COarse-to-fiNE Alignment Framework for Long Video Temporal Grounding
Zhijian Hou and 8 other authors

Symbolic Replay: Scene Graph as Prompt for Continual Learning on VQA Task
Weixian Lei and 6 other authors