TEACh: Task-Driven Embodied Agents That Chat

Aishwarya Padmakumar; Jesse Thomason; Ayush Shrivastava; Patrick Lange; Anjali Narayan-Chen; Spandana Gella; Robinson Piramuthu; Gokhan Tur; Dilek Hakkani-Tur

doi:10.1609/aaai.v36i2.20097

Authors

Aishwarya Padmakumar, Amazon Alexa AI
Jesse Thomason, University of Southern California; Amazon Alexa AI
Ayush Shrivastava, University of Michigan
Patrick Lange, Amazon Alexa AI
Anjali Narayan-Chen, Amazon Alexa AI
Spandana Gella, Amazon Alexa AI
Robinson Piramuthu, Amazon Alexa AI
Gokhan Tur, Amazon Alexa AI
Dilek Hakkani-Tur, Amazon Alexa AI

Keywords:

Computer Vision (CV), Speech & Natural Language Processing (SNLP)

Abstract

Robots operating in human spaces must be able to engage in natural language interaction, both understanding and executing instructions, and using conversation to resolve ambiguity and correct mistakes. To study this, we introduce TEACh, a dataset of over 3,000 human-human, interactive dialogues to complete household tasks in simulation. A Commander with access to oracle information about a task communicates in natural language with a Follower. The Follower navigates through and interacts with the environment to complete tasks varying in complexity from "Make Coffee" to "Prepare Breakfast", asking questions and getting additional information from the Commander. We propose three benchmarks using TEACh to study embodied intelligence challenges, and we evaluate initial models' abilities in dialogue understanding, language grounding, and task execution.
