Handwritten Korean Character Image Database PE92

Dae-Hwan KIM, Young-Sup HWANG, Sang-Tae PARK, Eun-Jung KIM, Sang-Hoon PAEK, Sung-Yang BANG

Summary:

The PE92 database project has two purposes. The first is to provide raw data to researchers so that they can concentrate primarily on developing character recognition algorithms. The second is to provide a standard handwritten character data set to prospective users as well as developers so that they can evaluate and compare the performance of character recognition systems objectively. We collected 100 handwritten image sets of the 2,350 Hangul characters that correspond to the character set specified in the Korean Standard KSC5601-1987 computer code. We tried to collect as many writing styles as possible. The first 70 sets were generated by more than 500 different writers, and each of the remaining 30 sets was written by one person. Writers wrote the characters in pre-specified boxes, and the database was created by scanning the data sheets with an image scanner. Each image is 100 × 100 pixels with 256 gray levels. Since each pixel needs one byte of memory, the entire PE92 database occupies about 2.3 GB. Finally, we obtained a raw data profile of PE92 by calculating various statistics of its image data.
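
As a quick check of the storage figure quoted in the summary, the arithmetic can be sketched as follows. The constant and function names are illustrative only, and the calculation assumes uncompressed storage with no per-file headers, which the summary does not state.

```python
# Figures taken from the summary; names are hypothetical.
NUM_SETS = 100          # handwritten image sets
CHARS_PER_SET = 2350    # Hangul characters per set
IMAGE_WIDTH = 100       # pixels
IMAGE_HEIGHT = 100      # pixels
BYTES_PER_PIXEL = 1     # 256 gray levels fit in one byte


def total_images() -> int:
    """Number of character images in the whole database."""
    return NUM_SETS * CHARS_PER_SET


def database_size_bytes() -> int:
    """Raw size in bytes, assuming no compression and no file headers."""
    return total_images() * IMAGE_WIDTH * IMAGE_HEIGHT * BYTES_PER_PIXEL


if __name__ == "__main__":
    print(f"images: {total_images():,}")
    print(f"raw size: {database_size_bytes() / 1e9:.2f} GB (1 GB = 10^9 bytes)")
```

Under these assumptions the database holds 235,000 images and 2.35 × 10^9 bytes of pixel data, which is consistent with the "about 2.3 GB" stated in the summary.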

Publication
IEICE TRANSACTIONS on Information Vol.E79-D No.7 pp.943-950
Publication Date
1996/07/25
Type of Manuscript
PAPER
Category
Image Processing, Computer Graphics and Pattern Recognition
