Document Parsing
This track focuses on recovering structured content from document pages, including natural text, tables, formulas, layout structure, and reading order.
Two Challenge Tracks: Document Parsing + Chart Understanding
The DataMFM Challenge uses two newly prepared challenge datasets based on OmniDocBench and ChartNet. Together, the two tracks target multimodal document understanding across natural text, tables, formulas, layouts, and charts.
The DataMFM Challenge focuses on multimodal document understanding at the intersection of vision, language, and structured reasoning. For this challenge, we prepare two new datasets based on OmniDocBench and ChartNet, corresponding to document parsing and chart understanding.
These two challenge components are designed to cover complementary capabilities: extracting structured content from document pages and understanding chart-centric visual information. This challenge is part of the Emerging Directions in Data for Multimodal Foundation Models (DataMFM) workshop at CVPR 2026.
The challenge scope covers two complementary tasks in multimodal document understanding: document parsing and chart understanding. Together, they evaluate how well models can extract, organize, and reason over structured information from complex visual documents.
This track focuses on recovering structured content from document pages, including natural text, tables, formulas, layout structure, and reading order.
This track focuses on two core chart understanding tasks: recovering structured chart data through chart-to-CSV extraction and generating grounded chart summaries through chart-to-summary generation.
Prize structure: Awards are granted per task for both the document parsing track and the chart understanding track.
$1,000 per task
$500 per task
$300 per task
Leaderboard: Results are updated periodically from successful public EvalAI submissions. Each table keeps the best public submission per team for the corresponding task.
Loading leaderboard data...
| Rank | Team | |||||
|---|---|---|---|---|---|---|
| Loading... | ||||||
| Rank | Team | |||||
|---|---|---|---|---|---|---|
| Loading... | ||||||
The challenge dataset is organized around two newly prepared components based on OmniDocBench and ChartNet. Public release materials are provided below for the document parsing and chart understanding tracks.
This newly prepared document parsing dataset is based on OmniDocBench and currently includes 1,005 page images organized under 89 document folders. It supports page-level parsing with supervision over text, tables, formulas, layout structure, and reading order.
This newly prepared chart understanding dataset is based on ChartNet and currently includes 3,807 chart images, with 2,000 synthetic samples and 1,807 real-world samples. For this challenge, the chart track focuses on two tasks: chart-to-CSV and chart-to-summary.
Current public note: submission files are organized to match the structure of the two released challenge datasets. The document parsing track uses flat, document-level Markdown files, while the chart understanding track uses split-based JSONL prediction files for the two supported tasks: chart-to-CSV and chart-to-summary.
For the document parsing track, submit one document-level .md file per document. The .zip archive should be flat and contain 89 Markdown files, with each filename matching the corresponding document UUID in the ground truth:
.md file per document, for 89 document-level Markdown files in total.00da2f04-2fd9-4ed2-a6e3-0839d06da691.md.\n\n). Headings with #.$$...$$, inline in $...$. Content must be LaTeX.<table> format is recommended for merged cells; Markdown pipe tables are also accepted..md file.
For the chart understanding track, organize submissions by split and task. Prediction files should follow the JSONL structure used by the current ChartNet-based evaluation pipeline for chart-to-CSV and chart-to-summary:
real/ and synthetic/ directories.chart2csv and chart2summary only.imagename plus the corresponding prediction field such as predicted_csv or predicted_summary.Open to all researchers worldwide. No team size limit.
Max 3 per day. Final evaluation allows 2 submissions.
Top teams must submit technical report. External data must be disclosed.