🤖 Edge#52: Google Meena That Can Chat About Anything

Read it without subscription

Jan 07, 2021

Recently, we have had a lot of new signups for TheSequence. Thank you for supporting our effort to improve awareness of AI research and technology. Below you will find an example of the Premium newsletter that is delivered to our paid subscribers every Tuesday and Thursday. We hope you will find such format of obtaining knowledge valuable and convenient for you. It takes less than 5 minutes to read, and you will learn a lot.

Yes, I want to join

💥 What’s New in AI: Google Meena is a Language Model That Can Chat About Anything

Natural Language Understanding (NLU) has been one of the most active areas of research of the last few years and has produced some of the most widely adopted AI systems to date. However, despite all the progress, most conversational systems remain highly constrained to a specific domain, which contrasts with our ability as humans to naturally converse about different topics. In NLU theory, those specialized conversational agents are known as closed-domain chatbots. The alternative is an emerging area of research known as open-domain chatbots. that focuses on building conversational agents that chat about virtually anything a user wants. If effective, open-domain chatbots might be a key piece in the journey to humanize computer interactions. In 2020, Google Research published a new paper introducing Meena, a new deep learning model that can power chatbots able to engage in conversations about any domain.

Despite the excitement around open-domain chatbots, the current implementation attempts still have weaknesses that prevent them from being generally useful: they often respond to open-ended input in ways that do not make sense, or with replies that are vague and generic. With Meena, Google tries to address some of these challenges by building an open-domain chatbot that can chat about almost anything.

Before building Meena, Google had to solve a non-trivial challenge that is often ignored in open-domain chatbot systems. A key criterion to evaluate the quality of an open-domain chatbot is the fact that its dialogs feel natural to humans. That idea seems intuitive but also incredibly subjective. How can we measure the human-likeness of a dialog? To address that challenge, Google started by introducing a new metric as the cornerstone of the Meena chatbot.

Sensibleness and Specificity Average

Sensibleness and Specificity Average (SSA) is a new metric for open-domain chatbots that captures basic but important attributes for human conversation. Specifically, SSA tries to quantify two key aspects of human-conversations:

1) making sense

2) being specific

Sensibleness arguably covers some of the most basic aspects of conversational human-likeness, such as common sense and logical coherence. Sensibleness also captures other important aspects of a chatbot, such as consistency. However, being sensible is not enough. A generic response (ex: I don’t know) can be sensible, but it is also boring and unspecific. Such responses are frequently generated by bots that are evaluated according to metrics like sensibleness alone. Specificity is the second metric that can help quantify the human-likeness of conversational interaction. For instance, if A says, “I love tennis,” and B responds, “That’s nice,” then the utterance should be marked, “not specific”. That reply could be used in dozens of different contexts. However, if B responds, “Me too, I can’t get enough of Roger Federer!” then it is marked as “specific” since it relates closely to what is being discussed.

SSA →f (Sensibleness, Specificity)

The actual mathematical formulation of the SSA metric is pretty sophisticated but the initial experiments conducted by Google showed a strong correlation with the human-likeness of a chatbot. The following figure shows this correlation for different chatbots (blue dots).