A Neural Knowledge Language Model

Ahn, Sungjin; Choi, Heeyoul; Pärnamaa, Tanel; Bengio, Yoshua

(Submitted on 1 Aug 2016)

Abstract: Communicating knowledge is a primary purpose of language. However, current language models have significant limitations in their ability to encode or decode knowledge. This is mainly because they acquire knowledge from statistical co-occurrences, even though most knowledge words are rarely observed named entities. In this paper, we propose a Neural Knowledge Language Model (NKLM), which combines symbolic knowledge provided by knowledge graphs with RNN language models. At each time step, the model predicts a fact on which the observed word is supposed to be based. Then, a word is either generated from the vocabulary or copied from the knowledge graph. We train and test the model on a new dataset, WikiFacts. In experiments, we show that the NKLM significantly improves perplexity while generating far fewer unknown words. In addition, we demonstrate that the sampled descriptions include named entities that used to be unknown words in RNN language models.
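
The generate-or-copy step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `next_word_distribution`, all scores, and the toy facts and vocabulary are hypothetical stand-ins for what a trained RNN would produce, and the sketch simply takes the highest-scoring fact rather than reproducing the paper's exact formulation.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))

def next_word_distribution(fact_scores, copy_logit, vocab_logits,
                           position_logits, facts, vocab):
    """Mix a vocabulary distribution with a copy distribution over one fact.

    Every score here is a hypothetical stand-in for an RNN output.
    """
    # Step 1: pick the fact the next word is most likely based on.
    fact_probs = softmax(fact_scores)
    best = max(range(len(facts)), key=lambda i: fact_probs[i])
    fact_words = facts[best]

    # Step 2: a binary gate weighs copying against generating.
    p_copy = sigmoid(copy_logit)

    dist = {}
    # Generate branch: softmax over the fixed vocabulary.
    for word, p in zip(vocab, softmax(vocab_logits)):
        dist[word] = dist.get(word, 0.0) + (1.0 - p_copy) * p
    # Copy branch: softmax over word positions inside the chosen fact.
    copy_probs = softmax(position_logits[:len(fact_words)])
    for word, p in zip(fact_words, copy_probs):
        dist[word] = dist.get(word, 0.0) + p_copy * p
    return best, dist

# Made-up toy data (not taken from WikiFacts).
facts = [["Barack", "Obama"], ["Honolulu"]]
vocab = ["was", "born", "in", "<unk>"]
best, dist = next_word_distribution(
    fact_scores=[0.1, 2.0], copy_logit=3.0,
    vocab_logits=[0.0, 0.0, 0.0, 0.0],
    position_logits=[0.0, 0.0], facts=facts, vocab=vocab)
print(best, max(dist, key=dist.get))
```

With a high copy gate, most of the probability mass lands on a word taken from the selected fact, so a rare named entity can be emitted even though it is absent from the fixed vocabulary.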

Subjects:	Computation and Language (cs.CL); Learning (cs.LG)
Cite as:	arXiv:1608.00318 [cs.CL]
	(or arXiv:1608.00318v1 [cs.CL] for this version)

Submission history

From: Sungjin Ahn
[v1] Mon, 1 Aug 2016 04:42:49 GMT (727kb,D)
