Shih-Fu Chang

large language model

pre-training

commonsense reasoning

multimodal

vision and language

factuality

commonsense

vision-language model

events

multimodality

qa

prompting

fake news detection

model evaluation

event extraction

19 presentations

17 views

Presentations

VIEWS: Entity-Aware News Video Captioning

Hammad Abdullah Ayyubi and 11 other authors

Training-free Deep Concept Injection Enables Language Models for Video Question Answering

Xudong Lin and 4 other authors

Personalized Video Comment Generation

Xudong Lin and 5 other authors

Do LVLMs Understand Charts? Analyzing and Correcting Factual Errors in Chart Captioning

Kung-Hsiang Huang and 7 other authors

IdealGPT: Iteratively Decomposing Vision and Language Reasoning via Large Language Models

Haoxuan You and 7 other authors

UniFine: A Unified and Fine-grained Approach for Zero-shot Vision-Language Understanding

Rui Sun and 5 other authors

Enhanced Chart Understanding via Visual Language Pre-training on Plot Table Pairs

Mingyang Zhou and 5 other authors

Video Event Extraction via Tracking Visual States of Arguments

Guang Yang and 5 other authors

Understanding ME? Multimodal Evaluation for Fine-grained Visual Commonsense

Zhecan Wang and 5 other authors

Bridging the Gap between Recognition-level Pre-training and Commonsensical Vision-language Tasks

Yue Wan and 4 other authors

Meta Faster R-CNN: Towards Accurate Few-Shot Object Detection with Attentive Feature Alignment

Guangxing Han and 4 other authors

SGEITL: Scene Graph Enhanced Image-Text Learning for Visual Commonsense Reasoning

Zhecan Wang and 7 other authors

MuMuQA: Multimedia Multi-Hop News Question Answering via Cross-Media Knowledge Extraction and Grounding

Revant Gangi Reddy and 11 other authors

Joint Multimedia Event Extraction from Video and Article

Brian Chen and 7 other authors

InfoSurgeon: Cross-Media Fine-grained Information Consistency Checking for Fake News Detection

Yi Fung and 8 other authors

Unsupervised Vision-and-Language Pre-training Without Parallel Images and Captions

Liunian Harold Li and 5 other authors

Underline Science, Inc.
1216 Broadway, 2nd Floor, New York, NY 10001, USA

© 2025 Underline - All rights reserved