DataMFM

Multimodal Document Understanding


Overview

The DataMFM Challenge focuses on multimodal document understanding, a core challenge at the intersection of vision, language, and structured reasoning. Building on OmniDocBench and its upcoming extension OmniDocBench-Pro, the challenge provides a unified evaluation framework for document-centric multimodal tasks involving charts, tables, figures, layouts, and natural text.

This challenge is part of the Emerging Directions in Data for Multimodal Foundation Models (DataMFM) workshop at CVPR 2026, which examines research directions including web-scale to world-scale data, agentic and self-generated data pipelines, and principled data selection and mixture design.

End-to-End Document Parsing

Given a document page image, produce a complete Markdown representation. The evaluation decomposes the output into three dimensions, following the OmniDocBench evaluation framework:

1. Text Recognition (Edit Distance ↓): Extract and recognize text from document images, including titles, text blocks, headers, and footers. Lower edit distance indicates better performance.

2. Table Recognition (TEDS ↑): Parse table structures and extract cell contents from complex document tables. Evaluated on structural and content accuracy.

3. Formula Recognition (CDM ↑): Recognize mathematical formulas and convert them to LaTeX. Evaluated by CDM (Character Detection Matching), which compares rendered formula images for fair assessment across diverse LaTeX representations.
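The text metric is a normalized edit distance. A minimal sketch of how such a metric is typically computed (illustrative only, not the official evaluation code):

```python
# Normalized character-level edit distance, the basis of the
# Text Recognition metric (lower is better).

def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance via dynamic programming."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, 1):
        curr = [i]
        for j, cb in enumerate(b, 1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution
        prev = curr
    return prev[-1]

def normalized_edit_distance(pred: str, ref: str) -> float:
    """Edit distance divided by the longer string's length, in [0, 1]."""
    if not pred and not ref:
        return 0.0
    return edit_distance(pred, ref) / max(len(pred), len(ref))
```

An identical prediction scores 0.0; a completely wrong one scores 1.0.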

Overall Score: Final rankings are determined by the Overall score:

Overall = ((1 − Text_Edit_Distance) × 100 + Table_TEDS + Formula_CDM) / 3
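The formula above can be written directly in code. A small sketch, assuming the edit distance is in [0, 1] while TEDS and CDM are percentages in [0, 100]:

```python
# Overall score per the challenge formula: average of three
# per-dimension scores, each on a 0-100 scale.

def overall_score(text_edit_distance: float,
                  table_teds: float,
                  formula_cdm: float) -> float:
    return ((1 - text_edit_distance) * 100 + table_teds + formula_cdm) / 3
```

For example, an edit distance of 0.1, TEDS of 85, and CDM of 75 gives (90 + 85 + 75) / 3 ≈ 83.33.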

Timeline

Note: Challenge timeline will be announced soon. See the workshop dates for paper submission deadlines.

TBD
Challenge Announcement
TBD
Data Release
TBD
Submission Deadline
June 2026
Workshop @ CVPR

Leaderboard

Coming Soon: The leaderboard will be available once the challenge officially launches. Stay tuned!

Leaderboard Preview

| Rank | Team | Text ED ↓ | Table TEDS ↑ | Formula CDM ↑ | Overall ↑ |
|------|------|-----------|--------------|---------------|-----------|
| 1    |      |           |              |               |           |
| 2    |      |           |              |               |           |
| 3    |      |           |              |               |           |

Dataset

The challenge data is based on OmniDocBench and its upcoming extension OmniDocBench-Pro. Download links and detailed documentation will be released when the challenge launches.

Submission

Submission Format

Submit predictions as Markdown files — one .md file per document page, packed in a .zip archive:

submission.zip
├── document_page_001.md   ← matches document_page_001.jpg
├── document_page_002.md
└── ...

─── Example .md content ───

# Section Title

Body text paragraph with standard Markdown formatting. Separate paragraphs with double newlines.

$$
\frac{\partial L}{\partial \theta} = \sum_{i=1}^{N} \nabla_\theta \ell(f(x_i), y_i)
$$

<table>
  <tr>
    <th>Model</th><th>Accuracy</th>
  </tr>
  <tr>
    <td>Baseline</td><td>82.3</td>
  </tr>
</table>
Format Rules:
Text: Standard Markdown. Paragraphs separated by double newlines (\n\n). Headings with #.
Formulas: Display formulas in $$...$$, inline in $...$. Content must be LaTeX.
Tables: HTML <table> format (recommended for merged cells) or Markdown pipe tables.
Order: Elements must appear in reading order — the eval script uses element position in the file as the predicted reading order.
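Since reading order is taken from element position in the file, it helps to know how a prediction splits into ordered elements. A minimal sketch (not the official eval script), assuming blocks are separated by double newlines as the rules above require:

```python
import re

# Split a Markdown prediction into blocks; file order is treated
# as the predicted reading order.

def split_elements(md: str) -> list[str]:
    """Return non-empty blocks separated by blank lines, in file order."""
    return [block.strip() for block in re.split(r"\n\s*\n", md) if block.strip()]
```

A heading, a paragraph, and a display formula in one file thus yield three ordered elements.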

How to Submit

  1. Run your model on all document page images to produce one .md file per page
  2. Name each file to match the image filename (e.g., image_001.jpg → image_001.md)
  3. Pack all .md files into a single .zip archive (flat structure, no subdirectories)
  4. Upload the .zip file to EvalAI
Submit on EvalAI
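Step 3 above can be sketched with the standard library. A hedged example, where the prediction directory and output path are placeholders:

```python
import zipfile
from pathlib import Path

# Pack all .md predictions into a flat zip archive (no subdirectories),
# as required by the submission format.

def pack_submission(pred_dir: str, out_zip: str) -> int:
    """Zip every .md file in pred_dir; returns the file count."""
    md_files = sorted(Path(pred_dir).glob("*.md"))
    with zipfile.ZipFile(out_zip, "w", zipfile.ZIP_DEFLATED) as zf:
        for md in md_files:
            zf.write(md, arcname=md.name)  # arcname=name keeps structure flat
    return len(md_files)
```

Passing only the filename as `arcname` ensures no directory prefixes leak into the archive.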

Rules

Eligibility

Open to all researchers worldwide. No team size limit.

Submissions

Max 3 per day. Final evaluation allows 2 submissions.

Requirements

Top teams must submit a technical report. Any external data used must be disclosed.