A Diversity-Promoting Objective Function for Neural Conversation Models

Li, Jiwei; Galley, Michel; Brockett, Chris; Gao, Jianfeng; Dolan, Bill

Full-text links:

Download:

(license)

Current browse context:

cs.CL

< prev | next >

new | recent | 1510

Change to browse by:

Computer Science > Computation and Language

Title:A Diversity-Promoting Objective Function for Neural Conversation Models

Authors:Jiwei Li, Michel Galley, Chris Brockett, Jianfeng Gao, Bill Dolan

(Submitted on 11 Oct 2015 (v1), last revised 10 Jun 2016 (this version, v3))

Abstract: Sequence-to-sequence neural network models for generation of conversational responses tend to generate safe, commonplace responses (e.g., "I don't know") regardless of the input. We suggest that the traditional objective function, i.e., the likelihood of output (response) given input (message) is unsuited to response generation tasks. Instead we propose using Maximum Mutual Information (MMI) as the objective function in neural models. Experimental results demonstrate that the proposed MMI models produce more diverse, interesting, and appropriate responses, yielding substantive gains in BLEU scores on two conversational datasets and in human evaluations.

Comments:	In. Proc of NAACL 2016
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:1510.03055 [cs.CL]
	(or arXiv:1510.03055v3 [cs.CL] for this version)

Try the Bibliographic Explorer
(can be disabled at any time)

EnableDon't show again

Bibliographic data

Submission history

From: Michel Galley [view email]
[v1] Sun, 11 Oct 2015 14:04:57 UTC (27 KB)
[v2] Thu, 7 Jan 2016 06:59:19 UTC (270 KB)
[v3] Fri, 10 Jun 2016 22:03:28 UTC (32 KB)

Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?) Browse v0.1 released 2018-10-22

arXiv.org > cs > arXiv:1510.03055