DataMFM

Multimodal Document Understanding

Building on OmniDocBench and its upcoming extension OmniDocBench-Pro, the challenge provides a unified evaluation framework for document-centric multimodal tasks involving charts, tables, figures, layouts, and natural text.

--
Days
--
Hours
--
Minutes
--
Seconds

Overview

The DataMFM Challenge focuses on multimodal document understanding, a core challenge at the intersection of vision, language, and structured reasoning. Building on OmniDocBench and its upcoming extension OmniDocBench-Pro, the challenge provides a unified evaluation framework for document-centric multimodal tasks involving charts, tables, figures, layouts, and natural text.

This challenge is part of the Emerging Directions in Data for Multimodal Foundation Models (DataMFM) workshop at CVPR 2026, which examines research directions including web-scale to world-scale data, agentic and self-generated data pipelines, and principled data selection and mixture design.

Challenge Scope

Coming Soon: Detailed challenge tracks and evaluation metrics will be announced. The challenge will cover document-centric multimodal tasks involving charts, tables, figures, layouts, and natural text.

TRACK 01

Document VQA

Answer natural language questions about document images, requiring understanding of text, layout, and visual elements.

TBD
TRACK 02

Chart Understanding

Extract data and answer questions from charts and graphs, including bar charts, line plots, and pie charts.

TBD
TRACK 03

Table Recognition

Parse table structures and extract cell contents from complex document tables.

TBD

Timeline

Note: Challenge timeline will be announced soon. See the workshop dates for paper submission deadlines.

TBD
Challenge Announcement
TBD
Data Release
TBD
Submission Deadline
June 2026
Workshop @ CVPR

Leaderboard

Coming Soon: The leaderboard will be available once the challenge officially launches. Stay tuned!

Leaderboard Preview

RankTeamScoreDate
1
2
3

Dataset

The challenge data is based on OmniDocBench and its upcoming extension OmniDocBench-Pro. Download links and detailed documentation will be released when the challenge launches.

Submission

Submission Format

Submit your results as a JSON file:

{ "team_name": "Your Team Name", "method_name": "Your Method", "track": "document_vqa", "predictions": [ {"question_id": "q001", "answer": "..."}, {"question_id": "q002", "answer": "..."} ] }

How to Submit

  1. Prepare predictions in the format shown
  2. Register your team
  3. Upload your submission file
  4. View results on the leaderboard
Submit Now

Rules

Eligibility

Open to all researchers worldwide. No team size limit.

Submissions

Max 3 per day. Final evaluation allows 2 submissions.

Requirements

Top teams must submit technical report. External data must be disclosed.