In this paper, we describe a two-phase method for biomedical named entity recognition consisting of term boundary detection and biomedical category labeling. The term boundary detection can be defined as a task to assign label sequences to a given sentence, and biomedical category labeling can be viewed as a local classification problem which does not need knowledge of the labels of other named entities in a sentence. The advantage of dividing the recognition process into two phases is that we can measure the effectiveness of models at each phase and select separately the appropriate model for each subtask. In order to obtain a better performance in biomedical named entity recognition, we conducted comparative experiments using several learning methods at each phase. Moreover, results by these machine learning based models are refined by rule-based postprocessing. We tested our methods on the JNLPBA 2004 shared task and the GENIA corpus.
The copyright of the original papers published on this site belongs to IEICE. Unauthorized use of the original or translated papers is prohibited. See IEICE Provisions on Copyright for details.
Copy
Seonho KIM, Juntae YOON, "Experimental Study on a Two Phase Method for Biomedical Named Entity Recognition" in IEICE TRANSACTIONS on Information,
vol. E90-D, no. 7, pp. 1103-1110, July 2007, doi: 10.1093/ietisy/e90-d.7.1103.
Abstract: In this paper, we describe a two-phase method for biomedical named entity recognition consisting of term boundary detection and biomedical category labeling. The term boundary detection can be defined as a task to assign label sequences to a given sentence, and biomedical category labeling can be viewed as a local classification problem which does not need knowledge of the labels of other named entities in a sentence. The advantage of dividing the recognition process into two phases is that we can measure the effectiveness of models at each phase and select separately the appropriate model for each subtask. In order to obtain a better performance in biomedical named entity recognition, we conducted comparative experiments using several learning methods at each phase. Moreover, results by these machine learning based models are refined by rule-based postprocessing. We tested our methods on the JNLPBA 2004 shared task and the GENIA corpus.
URL: https://global.ieice.org/en_transactions/information/10.1093/ietisy/e90-d.7.1103/_p
Copy
@ARTICLE{e90-d_7_1103,
author={Seonho KIM, Juntae YOON, },
journal={IEICE TRANSACTIONS on Information},
title={Experimental Study on a Two Phase Method for Biomedical Named Entity Recognition},
year={2007},
volume={E90-D},
number={7},
pages={1103-1110},
abstract={In this paper, we describe a two-phase method for biomedical named entity recognition consisting of term boundary detection and biomedical category labeling. The term boundary detection can be defined as a task to assign label sequences to a given sentence, and biomedical category labeling can be viewed as a local classification problem which does not need knowledge of the labels of other named entities in a sentence. The advantage of dividing the recognition process into two phases is that we can measure the effectiveness of models at each phase and select separately the appropriate model for each subtask. In order to obtain a better performance in biomedical named entity recognition, we conducted comparative experiments using several learning methods at each phase. Moreover, results by these machine learning based models are refined by rule-based postprocessing. We tested our methods on the JNLPBA 2004 shared task and the GENIA corpus.},
keywords={},
doi={10.1093/ietisy/e90-d.7.1103},
ISSN={1745-1361},
month={July},}
Copy
TY - JOUR
TI - Experimental Study on a Two Phase Method for Biomedical Named Entity Recognition
T2 - IEICE TRANSACTIONS on Information
SP - 1103
EP - 1110
AU - Seonho KIM
AU - Juntae YOON
PY - 2007
DO - 10.1093/ietisy/e90-d.7.1103
JO - IEICE TRANSACTIONS on Information
SN - 1745-1361
VL - E90-D
IS - 7
JA - IEICE TRANSACTIONS on Information
Y1 - July 2007
AB - In this paper, we describe a two-phase method for biomedical named entity recognition consisting of term boundary detection and biomedical category labeling. The term boundary detection can be defined as a task to assign label sequences to a given sentence, and biomedical category labeling can be viewed as a local classification problem which does not need knowledge of the labels of other named entities in a sentence. The advantage of dividing the recognition process into two phases is that we can measure the effectiveness of models at each phase and select separately the appropriate model for each subtask. In order to obtain a better performance in biomedical named entity recognition, we conducted comparative experiments using several learning methods at each phase. Moreover, results by these machine learning based models are refined by rule-based postprocessing. We tested our methods on the JNLPBA 2004 shared task and the GENIA corpus.
ER -