Update README.md
This commit is contained in:
35
README.md
35
README.md
@@ -27,6 +27,26 @@ tags:
|
|||||||
|
|
||||||
# KarantaOCR: Efficient Document Processing for African Languages
|
# KarantaOCR: Efficient Document Processing for African Languages
|
||||||
|
|
||||||
|
## Table of Contents
|
||||||
|
|
||||||
|
- [Model Description](#model-description)
|
||||||
|
- [Training Data](#training-data)
|
||||||
|
- [Stage 1: General OCR Training](#stage-1-general-ocr-training)
|
||||||
|
- [Stage 2: African Language Fine-Tuning](#stage-2-african-language-fine-tuning)
|
||||||
|
- [Training Plots](#training-plots)
|
||||||
|
- [Capabilities](#capabilities)
|
||||||
|
- [Evaluation](#evaluation)
|
||||||
|
- [Results -- KarantaOCR-Bench](#results----karantaocr-bench)
|
||||||
|
- [Results -- OlmoOCR-Bench](#results----olmocr-bench)
|
||||||
|
- [How to Use](#how-to-use)
|
||||||
|
- [Load the Model and Processor](#load-the-model-and-processor)
|
||||||
|
- [Prepare a PDF Page for Inference](#prepare-a-pdf-page-for-inference)
|
||||||
|
- [Run OCR Inference](#run-ocr-inference)
|
||||||
|
- [End-to-End Example](#end-to-end-example)
|
||||||
|
- [Citation Information](#citation-information)
|
||||||
|
|
||||||
|
---
|
||||||
|
|
||||||
## Model Description
|
## Model Description
|
||||||
|
|
||||||
#### [Paper](....)
|
#### [Paper](....)
|
||||||
@@ -59,6 +79,21 @@ KarantaOCR was trained using a **two-stage curriculum fine-tuning strategy**.
|
|||||||
|
|
||||||
This stage emphasizes accurate transcription of **diacritics, special characters, and region-specific typography**.
|
This stage emphasizes accurate transcription of **diacritics, special characters, and region-specific typography**.
|
||||||
|
|
||||||
|
### Training Plots
|
||||||
|
|
||||||
|
<div align="left">
|
||||||
|
<img src="https://cdn-uploads.huggingface.co/production/uploads/604b97e27032db3f5e8d6e8e/5Jf26jOGs12rrMwy3hwrI.png" alt="Train Loss" width="600"/>
|
||||||
|
</div>
|
||||||
|
|
||||||
|
<div align="left">
|
||||||
|
<img src="https://cdn-uploads.huggingface.co/production/uploads/604b97e27032db3f5e8d6e8e/9z4t6so8DIaykrFQHs0Su.png" alt="Eval Loss" width="600"/>
|
||||||
|
</div>
|
||||||
|
|
||||||
|
<div align="left">
|
||||||
|
<img src="https://cdn-uploads.huggingface.co/production/uploads/604b97e27032db3f5e8d6e8e/-TJltGBXNFABTkShCyvsL.png" alt="Learning Rate" width="600"/>
|
||||||
|
</div>
|
||||||
|
|
||||||
|
|
||||||
---
|
---
|
||||||
|
|
||||||
## Capabilities
|
## Capabilities
|
||||||
|
|||||||
Reference in New Issue
Block a user