Источник

CVPR / Embodied AI

Дата публикации

16.06.2024

Авторы

Анастасия Иванова Алексей Ковалёв Александр Панов

Поделиться

AmbiK: Dataset of Ambiguous Tasks in Kitchen Environment

Аннотация

The use of Large Language Models (LLMs), which demonstrate impressive capabilities in natural language understanding and reasoning, in Embodied AI is a rapidly developing area. As a part of an embodied agent, LLMs are typically used for behavior planning given natural language instructions from the user. However, dealing with ambiguous instructions in real-world environments remains a challenge for LLMs. Various methods for task disambiguation have been proposed. However, it is difficult to compare them because they work with different data. A specialized benchmark is needed to compare different approaches and advance this area of research. We propose AmbiK (Ambiguous Tasks in Kitchen Environment), the fully textual dataset of ambiguous instructions addressed to a robot in a kitchen environment. AmbiK was collected with the assistance of LLMs and is human-validated. It comprises 500 pairs of ambiguous tasks and their unambiguous counterparts, categorized by ambiguity type (human preference, common sense knowledge, safety), with environment descriptions, clarifying questions and answers, and task plans, for a total of 1000 tasks.

Читать в источнике Cкачать pdf

BABILong: Testing the Limits of LLMs with Long Context Reasoning-in-a-Haystack

Юрий Куратов, Айдар Булатов, Пётр Анохин, Иван Родькин, Дмитрий Сорокин, Артем Сорокин, Михаил Бурцев

Читать источник

Leveraging Single and Multi-Task Reinforcement Learning algorithms for Autonomous Mobile Aloha Robot

Aditya Narendra, Дмитрий Макаров, Александр Панов

Читать источник

Reframing: Detector-Specific Prompt Tuning for Enhancing Open-Vocabulary Object Detection

Михаил Авшалумов, Зоя Воловикова, Дмитрий Юдин, Александр Панов

Читать источник

Latent State Space Quantization for Learning and Exploring Goals

Артем Латышев, Александр Панов

Читать источник

Soft Adaptive Segments for Bio-Inspired Temporal Memory

Артем Прохоренко, Евгений Дживеликян, Петр Кудеров, Александр Панов

Читать источник

Common Sense Plan Verification with Large Language Models

Данил Григорьев, Алексей Ковалёв, Александр Панов

Читать источник

Stabilizing Manipulator Trajectory via Collision-Aware Optimization

Елена Рублева, Константин Миронов, Александр Панов

Читать источник