Update README.md
Browse files
README.md
CHANGED
@@ -25,6 +25,9 @@ tags:
|
|
25 |
> [!Note]
|
26 |
> The **Callisto-OCR3-2B-Instruct** model is a fine-tuned version of *Qwen2-VL-2B-Instruct*, specifically optimized for *messy handwriting recognition*, *Optical Character Recognition (OCR)*, *English language understanding*, and *math problem solving with LaTeX formatting*. This model integrates a conversational approach with visual and textual understanding to handle multi-modal tasks effectively.
|
27 |
|
|
|
|
|
|
|
28 |
#### Key Enhancements:
|
29 |
|
30 |
* **SoTA understanding of images of various resolution & ratio**: Callisto-OCR3 achieves state-of-the-art performance on visual understanding benchmarks, including MathVista, DocVQA, RealWorldQA, MTVQA, etc.
|
|
|
25 |
> [!Note]
|
26 |
> The **Callisto-OCR3-2B-Instruct** model is a fine-tuned version of *Qwen2-VL-2B-Instruct*, specifically optimized for *messy handwriting recognition*, *Optical Character Recognition (OCR)*, *English language understanding*, and *math problem solving with LaTeX formatting*. This model integrates a conversational approach with visual and textual understanding to handle multi-modal tasks effectively.
|
27 |
|
28 |
+
[](https://huggingface.co/prithivMLmods/Callisto-OCR3-2B-Instruct/blob/main/Callisto-OCR3-2B-Instruct-Demo/Callisto_OCR3_2B_Instruct.ipynb)
|
29 |
+
|
30 |
+
|
31 |
#### Key Enhancements:
|
32 |
|
33 |
* **SoTA understanding of images of various resolution & ratio**: Callisto-OCR3 achieves state-of-the-art performance on visual understanding benchmarks, including MathVista, DocVQA, RealWorldQA, MTVQA, etc.
|