Recognize text using Calamari OCR and the OCR-D framework
-
Updated
Jul 25, 2024 - Python
Recognize text using Calamari OCR and the OCR-D framework
Master repository which includes most other OCR-D repositories as submodules
Validate and transform various OCR file formats (hOCR, ALTO, PAGE, FineReader)
Tesseract Open Source OCR Engine (main repository)
The repo gt_structure_1_3 is part of the OCR-D Ground Truth Structure corpus. Only the structure of the printed page is annotated. The corpus was created as a result of the DFG project OCR-D.
The repo gt_structure_1_2 is part of the OCR-D Ground Truth Structure corpus. Only the structure of the printed page is annotated. The corpus was created as a result of the DFG project OCR-D.
About The repo gt_structure_1_4 is part of the OCR-D Ground Truth Structure corpus. Only the structure of the printed page is annotated. The corpus was created as a result of the DFG project OCR-D.
The repo gt_structure_1_1 is part of the OCR-D Ground Truth Structure corpus. Only the structure of the printed page is annotated. The corpus was created as a result of the DFG project OCR-D.
Metadata tool for Ground Truth datasets
A template for creating a ground truth repo with the various functions and features: such as metadata creation, data analysis and presentation.
a makefilization for OCR-D workflows, with configuration examples
XSLT and shell scripts for analyzing and creating GitHub pages of a ground truth repository. These are centrally managed and can be used by all repositories created with gt-repo-template (https://github.com/OCR-D/gt-repo-template).
Add a description, image, and links to the ocr-d topic page so that developers can more easily learn about it.
To associate your repository with the ocr-d topic, visit your repo's landing page and select "manage topics."