The search functionality is under construction.

IEICE TRANSACTIONS on Information

Extracting Knowledge Entities from Sci-Tech Intelligence Resources Based on BiLSTM and Conditional Random Field

Weizhi LIAO, Mingtong HUANG, Pan MA, Yu WANG

  • Full Text Views

    0

  • Cite this

Summary :

There are many knowledge entities in sci-tech intelligence resources. Extracting these knowledge entities is of great importance for building knowledge networks, exploring the relationship between knowledge, and optimizing search engines. Many existing methods, which are mainly based on rules and traditional machine learning, require significant human involvement, but still suffer from unsatisfactory extraction accuracy. This paper proposes a novel approach for knowledge entity extraction based on BiLSTM and conditional random field (CRF).A BiLSTM neural network to obtain the context information of sentences, and CRF is then employed to integrate global label information to achieve optimal labels. This approach does not require the manual construction of features, and outperforms conventional methods. In the experiments presented in this paper, the titles and abstracts of 20,000 items in the existing sci-tech literature are processed, of which 50,243 items are used to build benchmark datasets. Based on these datasets, comparative experiments are conducted to evaluate the effectiveness of the proposed approach. Knowledge entities are extracted and corresponding knowledge networks are established with a further elaboration on the correlation of two different types of knowledge entities. The proposed research has the potential to improve the quality of sci-tech information services.

Publication
IEICE TRANSACTIONS on Information Vol.E104-D No.8 pp.1214-1221
Publication Date
2021/08/01
Publicized
2021/04/22
Online ISSN
1745-1361
DOI
10.1587/transinf.2020BDP0007
Type of Manuscript
Special Section PAPER (Special Section on Computational Intelligence and Big Data for Scientific and Technological Resources and Services)
Category

Authors

Weizhi LIAO
  University of Electronic Science and Technology of China
Mingtong HUANG
  University of Electronic Science and Technology of China
Pan MA
  University of Electronic Science and Technology of China
Yu WANG
  University of Electronic Science and Technology of China

Keyword