AAAI 2026

January 23, 2026

Singapore, Singapore

We introduce FinMMDocR, a novel bilingual multimodal benchmark for evaluating multimodal large language models (MLLMs) on real-world financial numerical reasoning. Compared to existing benchmarks, our work delivers three major advancements. (1) Scenario Awareness: 57.9% of the 1,200 expert-annotated problems incorporate one of 9 types of implicit financial scenarios (e.g., Portfolio Management), challenging models to perform expert-level reasoning based on assumptions. (2) Document Understanding: 837 Chinese/English documents spanning 9 types (e.g., Company Research) average 50.8 pages with rich visual elements, significantly surpassing existing benchmarks in both breadth and depth of financial documents. (3) Multi-Step Computation: problems demand 11 reasoning steps on average (5.3 extraction steps plus 5.7 calculation steps), with 65.0% requiring cross-page evidence (2.4 pages on average). The best-performing MLLM achieves only 58.0% accuracy, and different retrieval-augmented generation (RAG) methods vary significantly in performance on this task. We expect FinMMDocR to drive improvements in MLLMs and reasoning-enhanced methods on complex multimodal reasoning tasks in real-world scenarios. Data and code are available in the supplementary material.
