There are other types of seq2seq models such as LSTM (long short-term memory) and GRU (gated recurrent unit) networks. Sequence to sequence models are especially good for language-related tasks such as translation and question-answering. The Transformer, introduced for the first time in 2017, is a sequence-to-sequence (seq2seq) machine learning model, which means it takes a sequence of data as input (numbers, letters, words, pixels…) and outputs another sequence. According to the blog post and the paper published in the arXiv preprint server, Meena is based on the Evolved Transformer architecture. Like the many innovative language models that have been introduced in the past few years, Google’s Meena has a few interesting details. Now, the question is, how much does Meena, Google’s massive chatbot, move the needle in conversational AI? What’s under the hood? As for open-domain chatbots, AI agents that are supposed to discuss a wide range of topics, they either fail at generating relevant results or often provide vague answers that can be given to various questions, like a politician evading giving specific answers at a press conference. But so far, current AI can only engage in language-related tasks as long as the problem domain remains narrow and limited, such as answering simple queries with clear meanings or carrying out simple commands.Īdvanced language models such as OpenAI’s GPT-2 can produce remarkable text excerpts, but those excerpts quickly lose their coherence as they grow in length. Advances in natural language processing have also paved the way for the widespread use of AI assistants such as Alexa, Siri, and Cortana. In the past years, chatbots have found a niche in some domains such as banking and news. Until now, most efforts to create AI that can understand language, engage in meaningful conversations, and generate coherent excerpts of text have yielded poor-to-modest results. Making sense of language and engaging in conversations is one of the most complicated functions of the human brain. They sometimes say things that are inconsistent with what has been said so far, or lack common sense and basic knowledge about the world,” Google’s researcher wrote in a blog post. “Current open-domain chatbots have a critical flaw - they often don’t make sense. This week, Google introduced Meena, a chatbot that can “chat about… anything.” Meena is the latest of many efforts by large tech companies trying to solve one of the toughest challenges of artificial intelligence: language. This article is part of our reviews of AI research papers, a series of posts that explore the latest findings in artificial intelligence.
0 Comments
Leave a Reply. |
Details
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |