Handwritten Korean Character Image Database PE92

Dae-Hwan KIM; Young-Sup HWANG; Sang-Tae PARK; Eun-Jung KIM; Sang-Hoon PAEK; Sung-Yang BANG

Handwritten Korean Character Image Database PE92

Dae-Hwan KIM, Young-Sup HWANG, Sang-Tae PARK, Eun-Jung KIM, Sang-Hoon PAEK, Sung-Yang BANG

Full Text Views

0

Cite this

Summary :

The purposes of the current PE92 database project are twofold. One is to provide raw data to researchers so that they can concentrate their efforts primarily on the development of character recognition algorithms. The other is to provide a standard handwritten character data set to the perspective users as well as the developers so that they can evaluate and compare the performance of character recognition systems objectively. We collected 100 handwritten image sets of 2,350 Hanguel characters that correspond to the character set specified in Korean Standards KSC5601-1987 computer codes. We tried to collect as many writing styles as possible. The first 70 sets were generated by more than 500 different writers, and each of the remaining 30 sets was written by one person. Writers wrote down characters in the pre-specified boxes and the database was created by scanning the data sheets by an image scanner. The size of each image is 100100 pixels with 256 gray levels. Since each pixel needs one byte of memory, the size of the entire database PE92 turned out to be about 2.3 GB. Finally we obtained a raw data profile of PE92 by calculating various statistics of its image data.

Publication: IEICE TRANSACTIONS on Information Vol.E79-D No.7 pp.943-950

Publication Date: 1996/07/25

Publicized

Online ISSN

DOI

Type of Manuscript: PAPER

Category: Image Processing,Computer Graphics and Pattern Recognition

Cite this

Copy

Dae-Hwan KIM, Young-Sup HWANG, Sang-Tae PARK, Eun-Jung KIM, Sang-Hoon PAEK, Sung-Yang BANG, "Handwritten Korean Character Image Database PE92" in IEICE TRANSACTIONS on Information, vol. E79-D, no. 7, pp. 943-950, July 1996, doi: .
Abstract: The purposes of the current PE92 database project are twofold. One is to provide raw data to researchers so that they can concentrate their efforts primarily on the development of character recognition algorithms. The other is to provide a standard handwritten character data set to the perspective users as well as the developers so that they can evaluate and compare the performance of character recognition systems objectively. We collected 100 handwritten image sets of 2,350 Hanguel characters that correspond to the character set specified in Korean Standards KSC5601-1987 computer codes. We tried to collect as many writing styles as possible. The first 70 sets were generated by more than 500 different writers, and each of the remaining 30 sets was written by one person. Writers wrote down characters in the pre-specified boxes and the database was created by scanning the data sheets by an image scanner. The size of each image is 100100 pixels with 256 gray levels. Since each pixel needs one byte of memory, the size of the entire database PE92 turned out to be about 2.3 GB. Finally we obtained a raw data profile of PE92 by calculating various statistics of its image data.
URL: https://global.ieice.org/en_transactions/information/10.1587/e79-d_7_943/_p

Copy

@ARTICLE{e79-d_7_943,
author={Dae-Hwan KIM, Young-Sup HWANG, Sang-Tae PARK, Eun-Jung KIM, Sang-Hoon PAEK, Sung-Yang BANG, },
journal={IEICE TRANSACTIONS on Information},
title={Handwritten Korean Character Image Database PE92},
year={1996},
volume={E79-D},
number={7},
pages={943-950},
abstract={The purposes of the current PE92 database project are twofold. One is to provide raw data to researchers so that they can concentrate their efforts primarily on the development of character recognition algorithms. The other is to provide a standard handwritten character data set to the perspective users as well as the developers so that they can evaluate and compare the performance of character recognition systems objectively. We collected 100 handwritten image sets of 2,350 Hanguel characters that correspond to the character set specified in Korean Standards KSC5601-1987 computer codes. We tried to collect as many writing styles as possible. The first 70 sets were generated by more than 500 different writers, and each of the remaining 30 sets was written by one person. Writers wrote down characters in the pre-specified boxes and the database was created by scanning the data sheets by an image scanner. The size of each image is 100100 pixels with 256 gray levels. Since each pixel needs one byte of memory, the size of the entire database PE92 turned out to be about 2.3 GB. Finally we obtained a raw data profile of PE92 by calculating various statistics of its image data.},
keywords={},
doi={},
ISSN={},
month={July},}

Copy

TY - JOUR
TI - Handwritten Korean Character Image Database PE92
T2 - IEICE TRANSACTIONS on Information
SP - 943
EP - 950
AU - Dae-Hwan KIM
AU - Young-Sup HWANG
AU - Sang-Tae PARK
AU - Eun-Jung KIM
AU - Sang-Hoon PAEK
AU - Sung-Yang BANG
PY - 1996
DO -
JO - IEICE TRANSACTIONS on Information
SN -
VL - E79-D
IS - 7
JA - IEICE TRANSACTIONS on Information
Y1 - July 1996
AB - The purposes of the current PE92 database project are twofold. One is to provide raw data to researchers so that they can concentrate their efforts primarily on the development of character recognition algorithms. The other is to provide a standard handwritten character data set to the perspective users as well as the developers so that they can evaluate and compare the performance of character recognition systems objectively. We collected 100 handwritten image sets of 2,350 Hanguel characters that correspond to the character set specified in Korean Standards KSC5601-1987 computer codes. We tried to collect as many writing styles as possible. The first 70 sets were generated by more than 500 different writers, and each of the remaining 30 sets was written by one person. Writers wrote down characters in the pre-specified boxes and the database was created by scanning the data sheets by an image scanner. The size of each image is 100100 pixels with 256 gray levels. Since each pixel needs one byte of memory, the size of the entire database PE92 turned out to be about 2.3 GB. Finally we obtained a raw data profile of PE92 by calculating various statistics of its image data.
ER -