Source

AAAI

Publication date

25.02.2025

Authors

Антон Андрейчук, Константин Яковлев, Александр Панов, Алексей Скрынник

MAPF-GPT: Imitation Learning for Multi-Agent Pathfinding at Scale

Abstract

Multi-agent pathfinding (MAPF) is a challenging computational problem that typically requires finding collision-free paths for multiple agents in a shared environment. Solving MAPF optimally is NP-hard, yet efficient solutions are critical for numerous applications, including automated warehouses and transportation systems. Recently, learning-based approaches to MAPF have gained attention, particularly those leveraging deep reinforcement learning. Following current trends in machine learning, we have created a foundation model for MAPF problems called MAPF-GPT. Using imitation learning, we have trained a policy on a set of pre-collected sub-optimal expert trajectories; the policy can generate actions under partial observability without additional heuristics, reward functions, or communication with other agents. The resulting MAPF-GPT model demonstrates zero-shot learning abilities when solving MAPF problem instances that were not present in the training dataset. We show that MAPF-GPT notably outperforms the current best-performing learnable MAPF solvers on a diverse range of problem instances and is computationally efficient in inference mode.
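
To make "collision-free" concrete, here is a minimal Python sketch of a conflict checker for a set of grid paths. It is not part of MAPF-GPT and is not taken from the paper. It assumes the definitions commonly used in the MAPF literature: a vertex conflict (two agents occupy the same cell at the same timestep) and an edge conflict (two agents swap cells between consecutive timesteps). The names `position_at` and `find_conflicts` and the path representation (a list of `(row, col)` cells, one per timestep) are hypothetical, as is the assumption that an agent stays at its goal once its path ends.

```python
from itertools import combinations


def position_at(path, t):
    # Assumption: an agent waits at its last cell after its path ends.
    return path[t] if t < len(path) else path[-1]


def find_conflicts(paths):
    """Return a list of (kind, agent_a, agent_b, timestep) tuples.

    kind is "vertex" when both agents occupy the same cell at `timestep`,
    and "edge" when they swap cells between `timestep` and `timestep + 1`.
    """
    conflicts = []
    horizon = max(len(p) for p in paths)
    for a, b in combinations(range(len(paths)), 2):
        for t in range(horizon):
            pa, pb = position_at(paths[a], t), position_at(paths[b], t)
            if pa == pb:
                conflicts.append(("vertex", a, b, t))
            if t + 1 < horizon:
                na = position_at(paths[a], t + 1)
                nb = position_at(paths[b], t + 1)
                # A swap: each agent moves into the other's previous cell.
                if pa != pb and na == pb and nb == pa:
                    conflicts.append(("edge", a, b, t))
    return conflicts


# Two agents walking toward each other along one corridor meet in the middle.
head_on = [[(0, 0), (0, 1), (0, 2)], [(0, 2), (0, 1), (0, 0)]]
# Two adjacent agents trying to exchange places.
swap = [[(0, 0), (0, 1)], [(0, 1), (0, 0)]]
# Two agents moving in parallel rows never interact.
parallel = [[(0, 0), (0, 1)], [(1, 0), (1, 1)]]
```

A set of paths is a valid MAPF solution under these assumptions when `find_conflicts` returns an empty list, as it does for `parallel`.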

CrafText Benchmark: Advancing Language Grounding in Complex Multimodal Open-Ended World

Зоя Воловикова, Петр Кудеров, Григорий Горбов, Александр Панов, Алексей Скрынник

IDAT: A Multi-Modal Dataset and Toolkit for Building and Evaluating Interactive Task-Solving Agents

Shrestha Mohanty, Negar Arabzadeh, Andrea Tupini, Yuxuan Sun, Алексей Скрынник, Артем Жолус, Marc-Alexandre Cote, Юлия Киселева

Intrinsic Motivation in Model-based Reinforcement Learning: A Brief Review

Артем Латышев, Александр Панов

Relational Object-Centric Actor-Critic

Леонид Угадяров, Виталий Воробьёв, Александр Панов

LookPlanGraph: Embodied instruction following method with VLM graph augmentation

Анатолий Онищенко, Алексей Ковалёв, Александр Панов

Workshop ICLR 2025: Accelerating Transformers in Online RL

Даниил Зелезецкий, Алексей Ковалёв, Александр Панов

Re:Frame - Retrieving Experience From Associative Memory

Даниил Зелезецкий, Егор Черепанов, Алексей Ковалёв, Александр Панов
