Towards a Human-like Open-Domain Chatbot

1116 · July 17, 2021, 1:01am

This paper presents the Meena chatbot trained as a 2.6B seq2seq model on 867M [context, response] pairs (341 GB of text, 40B words). Meena is shown to outperform recent industrial chatbots in a multi-turn open-domain setting, measured by their proposed human evaluation metric called Sensibleness and Specificity Average (SSA).

Comments

Though presented as a research paper, Meena is actually a good direction for industry too.
I’m looking forward to a breakthrough in graph encoders so that user profile can be easily integrated with these seq2seq models.
Usually seq2seq models have the same number of layers in encoders and decoders, why Evolved Transformer has so much less encoder layers (1 vs 13 decoder layers)?

Rating

5: Transformative: This paper is likely to change our field. It should be considered for a best paper award.
4.5: Exciting: It changed my thinking on this topic. I would fight for it to be accepted.
4: Strong: I learned a lot from it. I would like to see it accepted.
3.5: Leaning positive: It can be accepted more or less in its current form. However, the work it describes is not particularly exciting and/or inspiring, so it will not be a big loss if people don’t see it in this conference.
3: Ambivalent: It has merits (e.g., it reports state-of-the-art results, the idea is nice), but there are key weaknesses (e.g., I didn’t learn much from it, evaluation is not convincing, it describes incremental work). I believe it can significantly benefit from another round of revision, but I won’t object to accepting it if my co-reviewers are willing to champion it.
2.5: Leaning negative: I am leaning towards rejection, but I can be persuaded if my co-reviewers think otherwise.
2: Mediocre: I would rather not see it in the conference.
1.5: Weak: I am pretty confident that it should be rejected.
1: Poor: I would fight to have it rejected.

0 投票者