- Authors: Ryan Lowe, Michael Noseworthy, Iulian V. Serban, Nicolas Angelard-Gontier, Yoshua Bengio, Joelle Pineau
- ACL 2017 accepted paper (earlier version under review at ICLR 2017)
- Link: https://openreview.net/pdf?id=HJ5PIaseg
ADEM is an automatic evaluation model for dialogue response quality. It aims to capture semantic similarity beyond word-overlap metrics (e.g., BLEU, ROUGE, METEOR), which correlate poorly with human judgement, and it computes its score using additional information, the conversation context, alongside the reference response and the model response.
ADEM first learns vector representations of the dialogue context $c$, the reference response $r$, and the model response $\hat{r}$ with a hierarchical RNN encoder, and then scores the model response as

$$score(c, r, \hat{r}) = \frac{c^\top M \hat{r} + r^\top N \hat{r} - \alpha}{\beta}$$

where $M$, $N$ are learned parameter matrices initialized to the identity, and $\alpha$, $\beta$ are scalar constants that shift and scale the score into the human rating range.
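As a rough illustration of this scoring function, here is a minimal NumPy sketch. The embedding dimension, the random stand-ins for the encoder outputs `c`, `r`, `r_hat`, and the values of `alpha` and `beta` are assumptions for illustration, not the paper's actual implementation:

```python
import numpy as np

def adem_score(c, r, r_hat, M, N, alpha=2.0, beta=4.0):
    """Dot-product scoring of the model response against context and reference.

    c, r, r_hat: encoded context, reference response, model response (1-D vectors).
    M, N: learned square matrices, initialized to the identity.
    alpha, beta: scalar shift/scale constants (toy values, assumptions here).
    """
    return (c @ M @ r_hat + r @ N @ r_hat - alpha) / beta

dim = 8                                  # toy embedding size (assumption)
rng = np.random.default_rng(0)
c, r, r_hat = rng.normal(size=(3, dim))  # stand-ins for the RNN encoder outputs
M = np.eye(dim)                          # identity initialization, as in the paper
N = np.eye(dim)
print(adem_score(c, r, r_hat, M, N))
```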
ADEM is trained to minimize the squared error between the model predictions and the human scores, with L2 regularization:

$$\mathcal{L} = \sum_{i=1}^{K} \left[ score(c_i, r_i, \hat{r}_i) - human_i \right]^2 + \gamma \lVert \theta \rVert_2$$

where $\gamma$ is a scalar constant and $\theta = \{M, N\}$. The model is end-to-end differentiable and all parameters can be learned by backpropagation.
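Continuing the NumPy sketch above, the training objective could look roughly like the following; the batch of encoded triples, the human scores, and the regularization weight `gamma` are all made-up toy values, not the paper's settings:

```python
import numpy as np

def adem_loss(batch, human_scores, M, N, gamma=0.02):
    """Squared error between predicted and human scores, plus an L2 penalty on M and N.

    batch: list of (c, r, r_hat) encoded triples; human_scores: floats in [1, 5].
    gamma: regularization weight (toy value, an assumption).
    Reuses adem_score from the sketch above.
    """
    squared_error = sum(
        (adem_score(c, r, r_hat, M, N) - h) ** 2
        for (c, r, r_hat), h in zip(batch, human_scores)
    )
    l2_penalty = gamma * (np.linalg.norm(M) + np.linalg.norm(N))
    return squared_error + l2_penalty
```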