Handwritten text recognition

Dataset

IAM comprises 1,539 pages and 13,353 lines of handwritten English text.
CASIA-HWDB is an offline handwritten Chinese dataset, which contains about 5,090 pages and 1.35 million character samples of 7,356 classes (7,185 Chinese characters and 171 symbols).

For CASIA-HWDB

请直接告诉我，图片中的文字都是什么？

Results of IAM

Results of CASIA-HWDB

Illustration of handwritten text recognition. (a), (b), (c), (d) are samples of page-level IAM, line-level IAM, page-level CASIA-HWDB and line-level CASIA-HWDB, respectively. In the responses of GPT-4V, we highlight characters that match the GT in green and characters that do not match in red. For English text, GPT-4V demonstrates excellent performance. In contrast, for Chinese text, GPT-4V has generated a passage of text that is semantically coherent, but it is not associated with the ground truth text (GT).