Source

AGI

Publication date

24.05.2023

Authors

Кристина Саркисян, Александр Корчемный, Алексей Ковалёв, Александр Панов

Evaluation of Pretrained Large Language Models in Embodied Planning Tasks

Large language models, Plan generation, Planning for embodied agents

Abstract

Modern pretrained large language models (LLMs) are increasingly used in zero-shot or few-shot learning modes. Recent years have seen growing interest in applying such models to embodied artificial intelligence and robotics tasks. Given a task described in natural language, the agent needs to build a plan based on this prompt. The best solutions use LLMs through APIs or models that are not publicly available, making the results difficult to reproduce. In this paper, we use publicly available LLMs to build a plan for an embodied agent and evaluate them in three modes of operation: 1) the subtask evaluation mode, 2) the full autoregressive plan generation, and 3) the step-by-step autoregressive plan generation. We used two prompt settings: a prompt containing examples of one given task and a mixed prompt with examples of different tasks. Through extensive experiments, we have shown that the subtask evaluation mode, in most cases, outperforms others with a task-specific prompt, whereas the step-by-step autoregressive plan generation performs better in the mixed prompt setting.

Relational Object-Centric Actor-Critic

Леонид Угадяров, Виталий Воробьёв, Александр Панов

LookPlanGraph: Embodied instruction following method with VLM graph augmentation

Анатолий Онищенко, Алексей Ковалёв, Александр Панов

Accelerating Transformers in Online RL

Даниил Зелезецкий, Алексей Ковалёв, Александр Панов

Re:Frame - Retrieving Experience From Associative Memory

Даниил Зелезецкий, Егор Черепанов, Алексей Ковалёв, Александр Панов

Memory, Benchmark & Robots: A Benchmark for Solving Complex Tasks with Reinforcement Learning

Егор Черепанов, Никита Качаев, Алексей Ковалёв, Александр Панов

A New Perspective on Transformers in Online Reinforcement Learning for Continuous Control

Никита Качаев, Даниил Зелезецкий, Алексей Ковалёв, Александр Панов

POGEMA: A Benchmark Platform for Cooperative Multi-Agent Pathfinding

Алексей Скрынник, Антон Андрейчук, Анатолий Борзилов, Александр Чернявский, Константин Яковлев, Александр Панов
