Semantically Readable Distributed Representation Learning and Its Expandability Using a Word Semantic Vector Dictionary

Ikuo KESHI; Yu SUZUKI; Koichiro YOSHINO; Satoshi NAKAMURA

doi:10.1587/transinf.2017DAP0019

IEICE TRANSACTIONS on Information

Summary:

Distributed representations generated by neural networks have features whose meaning is difficult to understand. We propose a method that gives a specific meaning to each node of a hidden layer by introducing a manually created word semantic vector dictionary into the initial weights and by using paragraph vector models. We conducted experiments to test the hypotheses using a single-domain benchmark for Japanese Twitter sentiment analysis and then evaluated the expandability of the method using a diverse, large-scale benchmark. Moreover, we tested the domain independence of the method using a Wikipedia corpus. On the single-domain benchmark, the learned vectors outperformed the existing paragraph vector on the Twitter sentiment analysis task. We also assessed the readability of document embeddings (distributed representations of documents) in a user test. In this paper, readability means that people can understand the meaning of the heavily weighted features of a distributed representation. In the user test, 52.4% of the top five weighted hidden nodes were related to the tweets when one of the paragraph vector models learned the document embeddings. For the expandability evaluation, we improved the dictionary based on the results of the hypothesis test and examined the relationship between the readability of the learned word vectors and the task accuracy of Twitter sentiment analysis on the diverse, large-scale benchmark. We also conducted a word similarity task using the Wikipedia corpus to test domain independence. In the expandability evaluation, the method performed better than or comparably to the paragraph vector. Both the objective and subjective evaluations support the claim that each hidden node maintains a specific meaning. Thus, the proposed method succeeded in improving readability.
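
The core idea in the summary, seeding the initial weights from a hand-made dictionary so that each hidden node corresponds to a named feature, can be illustrated with a minimal Python sketch. This is not the authors' implementation: the feature words, the dictionary entries, the seed weight, and the function names `init_word_vectors` and `top_features` are all hypothetical, and the paragraph vector training step that would follow initialization is omitted.

```python
import random

# Hypothetical feature words, one per hidden node (not taken from the paper).
FEATURE_WORDS = ["joy", "anger", "food", "sports"]

# Hypothetical hand-made dictionary: word -> feature words it relates to.
SEMANTIC_DICT = {
    "delicious": ["joy", "food"],
    "furious": ["anger"],
    "soccer": ["sports", "joy"],
}


def init_word_vectors(vocab, feature_words, semantic_dict,
                      seed_weight=1.0, noise=0.01, seed=0):
    """Return {word: vector}; dictionary words get seed_weight on related nodes."""
    rng = random.Random(seed)
    index = {feat: i for i, feat in enumerate(feature_words)}
    vectors = {}
    for word in vocab:
        # Small random start for every node, as in ordinary initialization.
        vec = [rng.uniform(-noise, noise) for _ in feature_words]
        # Overwrite the nodes named by the dictionary with the seed weight.
        for feat in semantic_dict.get(word, []):
            vec[index[feat]] = seed_weight
        vectors[word] = vec
    return vectors


def top_features(vector, feature_words, k=5):
    """Names of the k hidden nodes with the largest weights."""
    ranked = sorted(range(len(vector)), key=lambda i: vector[i], reverse=True)
    return [feature_words[i] for i in ranked[:k]]


vocab = ["delicious", "furious", "soccer", "today"]
vectors = init_word_vectors(vocab, FEATURE_WORDS, SEMANTIC_DICT)
print(top_features(vectors["delicious"], FEATURE_WORDS, k=2))
```

Reading off the largest-weighted nodes by name, as `top_features` does, is the kind of inspection that the readability definition in the summary refers to; words absent from the dictionary (here "today") start from small random values only.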

Publication: IEICE TRANSACTIONS on Information Vol.E101-D No.4 pp.1066-1078

Publication Date: 2018/04/01

Publicized: 2018/01/18

Online ISSN: 1745-1361

DOI: 10.1587/transinf.2017DAP0019

Type of Manuscript: Special Section PAPER (Special Section on Data Engineering and Information Management)

Authors

Ikuo KESHI
  Nara Institute of Science and Technology
Yu SUZUKI
  Nara Institute of Science and Technology
Koichiro YOSHINO
  Nara Institute of Science and Technology
Satoshi NAKAMURA
  Nara Institute of Science and Technology

Keyword

distributed representation, word semantic vector dictionary, paragraph vector, word2vec, Twitter, sentiment analysis

Cite this

Ikuo KESHI, Yu SUZUKI, Koichiro YOSHINO, Satoshi NAKAMURA, "Semantically Readable Distributed Representation Learning and Its Expandability Using a Word Semantic Vector Dictionary" in IEICE TRANSACTIONS on Information, vol. E101-D, no. 4, pp. 1066-1078, April 2018, doi: 10.1587/transinf.2017DAP0019.
URL: https://global.ieice.org/en_transactions/information/10.1587/transinf.2017DAP0019/_p

@ARTICLE{e101-d_4_1066,
author={Ikuo KESHI and Yu SUZUKI and Koichiro YOSHINO and Satoshi NAKAMURA},
journal={IEICE TRANSACTIONS on Information},
title={Semantically Readable Distributed Representation Learning and Its Expandability Using a Word Semantic Vector Dictionary},
year={2018},
volume={E101-D},
number={4},
pages={1066-1078},
abstract={The problem with distributed representations generated by neural networks is that the meaning of the features is difficult to understand. We propose a new method that gives a specific meaning to each node of a hidden layer by introducing a manually created word semantic vector dictionary into the initial weights and by using paragraph vector models. We conducted experiments to test the hypotheses using a single domain benchmark for Japanese Twitter sentiment analysis and then evaluated the expandability of the method using a diverse and large-scale benchmark. Moreover, we tested the domain-independence of the method using a Wikipedia corpus. Our experimental results demonstrated that the learned vector is better than the performance of the existing paragraph vector in the evaluation of the Twitter sentiment analysis task using the single domain benchmark. Also, we determined the readability of document embeddings, which means distributed representations of documents, in a user test. The definition of readability in this paper is that people can understand the meaning of large weighted features of distributed representations. A total of 52.4% of the top five weighted hidden nodes were related to tweets where one of the paragraph vector models learned the document embeddings. For the expandability evaluation of the method, we improved the dictionary based on the results of the hypothesis test and examined the relationship of the readability of learned word vectors and the task accuracy of Twitter sentiment analysis using the diverse and large-scale benchmark. We also conducted a word similarity task using the Wikipedia corpus to test the domain-independence of the method. We found the expandability results of the method are better than or comparable to the performance of the paragraph vector. Also, the objective and subjective evaluation support each hidden node maintaining a specific meaning. Thus, the proposed method succeeded in improving readability.},
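keywords={distributed representation, word semantic vector dictionary, paragraph vector, word2vec, Twitter, sentiment analysis},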
doi={10.1587/transinf.2017DAP0019},
ISSN={1745-1361},
month={April},
}

TY - JOUR
TI - Semantically Readable Distributed Representation Learning and Its Expandability Using a Word Semantic Vector Dictionary
T2 - IEICE TRANSACTIONS on Information
SP - 1066
EP - 1078
AU - Ikuo KESHI
AU - Yu SUZUKI
AU - Koichiro YOSHINO
AU - Satoshi NAKAMURA
PY - 2018
DO - 10.1587/transinf.2017DAP0019
JO - IEICE TRANSACTIONS on Information
SN - 1745-1361
VL - E101-D
IS - 4
JA - IEICE TRANSACTIONS on Information
Y1 - April 2018
AB - The problem with distributed representations generated by neural networks is that the meaning of the features is difficult to understand. We propose a new method that gives a specific meaning to each node of a hidden layer by introducing a manually created word semantic vector dictionary into the initial weights and by using paragraph vector models. We conducted experiments to test the hypotheses using a single domain benchmark for Japanese Twitter sentiment analysis and then evaluated the expandability of the method using a diverse and large-scale benchmark. Moreover, we tested the domain-independence of the method using a Wikipedia corpus. Our experimental results demonstrated that the learned vector is better than the performance of the existing paragraph vector in the evaluation of the Twitter sentiment analysis task using the single domain benchmark. Also, we determined the readability of document embeddings, which means distributed representations of documents, in a user test. The definition of readability in this paper is that people can understand the meaning of large weighted features of distributed representations. A total of 52.4% of the top five weighted hidden nodes were related to tweets where one of the paragraph vector models learned the document embeddings. For the expandability evaluation of the method, we improved the dictionary based on the results of the hypothesis test and examined the relationship of the readability of learned word vectors and the task accuracy of Twitter sentiment analysis using the diverse and large-scale benchmark. We also conducted a word similarity task using the Wikipedia corpus to test the domain-independence of the method. We found the expandability results of the method are better than or comparable to the performance of the paragraph vector. Also, the objective and subjective evaluation support each hidden node maintaining a specific meaning. Thus, the proposed method succeeded in improving readability.
ER -