Update README.md

This commit is contained in:
Odunayo Ogundepo
2026-01-25 11:41:24 +00:00
committed by system
parent 954f79f56a
commit 4738fd5472

View File

@@ -78,23 +78,26 @@ While improved performance on African languages was our priority, KarantaOCR **m
## Evaluation ## Evaluation
KarantaOCR is evaluated on the OLMOocr benchmark using pass-rate accuracy. Scores are reported as averages across JSONL files with 95% confidence intervals. KarantaOCR is evaluated on the OLMOocr benchmark using pass-rate accuracy. Scores are reported as averages across JSONL files with 95% confidence intervals.
In addition to OLMOocr benchmark, we also create a KarantaOCR-Bench, which focuses specifically on testing OCR extraction on special characters and diacritics.
### Results -- KarantaOCR-Bench ### Results -- KarantaOCR-Bench
... | JSONL File | KarantaOCR (3B) | RoLMOCR (7B) | NanoNetsOCR-2 (3B) | OLMOCR-2 (7B) |
| --------------- | ---------- | -------- | ------------- | -------- |
| Diacritics & Special Characters | **61.6** | 53.6 | 55.1 | 55.5
### Results -- OlmoOCR-Bench ### Results -- OlmoOCR-Bench
| JSONL File | KarantaOCR (3B) | RoLMOCR (7B) | NanoNetsOCR-2 (3B) | OLMOCR-1 (7B) | OLMOCR-2 (7B) | Mistral OCR API | DeepSeek-OCR (3B) | | JSONL File | KarantaOCR (3B) | RoLMOCR (7B) | NanoNetsOCR-2 (3B) | OLMOCR-1 (7B) | OLMOCR-2 (7B) | Mistral OCR API | DeepSeek-OCR (3B) |
| --------------- | ---------- | -------- | ------------- | -------- | -------- |-------- |-------- | | --------------- | ---------- | -------- | ------------- | -------- | -------- |-------- |-------- |
| arxiv_math | 74.2 | **76.8** | 73.7 | 63.3 | 83.0 | 77.2 | 77.2 | | arxiv_math | 74.2 | 76.8 | 73.7 | 63.3 | 83.0 | 77.2 | 77.2 |
| baseline | **99.4** | 97.9 | **99.5** | 97.9 | 99.7 | 99.4 | 99.8 | | baseline | 99.4 | 97.9 | 99.5 | 97.9 | 99.7 | 99.4 | 99.8 |
| headers_footers | **95.3** | 94.1 | 32.8 | 93.4 | 96.1 | 93.6 | 96.1 | | headers_footers | 95.3 | 94.1 | 32.8 | 93.4 | 96.1 | 93.6 | 96.1 |
| long_tiny_text | 72.2 | 61.3 | **92.1** | 54.8 | 81.9 | 77.1 | 79.4 | | long_tiny_text | 72.2 | 61.3 | 92.1 | 54.8 | 81.9 | 77.1 | 79.4 |
| multi_column | 75.6 | 70.0 | **82.5** | 67.6 | 83.7 | 71.3 | 66.4 | | multi_column | 75.6 | 70.0 | 82.5 | 67.6 | 83.7 | 71.3 | 66.4 |
| old_scans | 41.3 | 42.4 | 41.4 | 38.6 | 47.7 | 29.3 | 33.3 | | old_scans | 41.3 | 42.4 | 41.4 | 38.6 | 47.7 | 29.3 | 33.3 |
| old_scans_math | 70.3 | **80.1** | 44.1 | 67.5 | 82.3 | 67.5 | 73.6 | | old_scans_math | 70.3 | 80.1 | 44.1 | 67.5 | 82.3 | 67.5 | 73.6 |
| table_tests | 64.3 | 72.2 | **84.2** | 62.3 | 84.9 | 60.6 | 80.2 | | table_tests | 64.3 | 72.2 | 84.2 | 62.3 | 84.9 | 60.6 | 80.2 |
| Average | 74.1% | 74.4% | 68.8% | 68.2% | 82.4% | 72.0 | 75.7 | | Average | 74.1% | 74.4% | 68.8% | 68.2% | 82.4% | 72.0 | 75.7 |