# Omni OCR Benchmark
A benchmarking tool that compares the OCR and data extraction capabilities of large multimodal models such as gpt-4o, evaluating both text and JSON extraction accuracy. The goal of this benchmark is to publish a comprehensive comparison of OCR accuracy across traditional OCR providers and multimodal language models. The evaluation dataset and methodology are all open source, and we encourage expanding this benchmark to encompass additional providers.
[**Open Source LLM Benchmark Results (Mar 2025)**](https://getomni.ai/blog/benchmarking-open-source-models-for-ocr) | [**Dataset**](https://huggingface.co/datasets/getomni-ai/ocr-benchmark)
[**Benchmark Results (Feb 2025)**](https://getomni.ai/ocr-benchmark) | [**Dataset**](https://huggingface.co/datasets/getomni-ai/ocr-benchmark)

## Methodology
The primary goal is to evaluate JSON extraction from documents. To evaluate this, the Omni benchmark runs **Document ⇒ OCR ⇒ Extraction**, measuring how well a model can OCR a page and return that content in a format that an LLM can parse.
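The Document ⇒ OCR ⇒ Extraction pipeline can be sketched as below. This is a minimal illustration, not the benchmark's actual implementation: `run_ocr` and `extract_json` are hypothetical stand-ins for calls to an OCR provider or multimodal model, and the toy invoice schema is an assumption for demonstration.

```python
# Hedged sketch of the Document -> OCR -> Extraction pipeline.
# run_ocr and extract_json are hypothetical placeholders; a real run
# would call an OCR provider or a multimodal model such as gpt-4o.

def run_ocr(document: bytes) -> str:
    """OCR step: turn a document page into text/markdown."""
    # Placeholder output standing in for a model response.
    return "Invoice #123\nTotal: $45.00"

def extract_json(text: str, schema: dict) -> dict:
    """Extraction step: parse OCR output into the target JSON schema."""
    result = {}
    for line in text.splitlines():
        if "invoice_number" in schema and line.startswith("Invoice #"):
            result["invoice_number"] = line.split("#", 1)[1].strip()
        elif "total" in schema and line.startswith("Total:"):
            result["total"] = line.split("$", 1)[1].strip()
    return result

def benchmark_document(document: bytes, schema: dict) -> dict:
    """Run the full Document -> OCR -> Extraction chain for one page."""
    text = run_ocr(document)
    return extract_json(text, schema)
```

Accuracy is then scored by comparing the extracted JSON against ground-truth annotations for each document.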

## Evaluation Metrics