Источник

AAAI

Дата публикации

29.02.2024

Авторы

Алексей Скрынник Антон Андрейчук Константин Яковлев Александр Панов

Поделиться

Decentralized Monte Carlo Tree Search for Partially Observable Multi-agent Pathfinding

Аннотация

The Multi-Agent Pathfinding (MAPF) problem involves finding a set of conflict-free paths for a group of agents confined to a graph. In typical MAPF scenarios, the graph and the agents' starting and ending vertices are known beforehand, allowing the use of centralized planning algorithms. However, in this study, we focus on the decentralized MAPF setting, where the agents may observe the other agents only locally and are restricted in communications with each other. Specifically, we investigate the lifelong variant of MAPF, where new goals are continually assigned to the agents upon completion of previous ones. Drawing inspiration from the successful AlphaZero approach, we propose a decentralized multi-agent Monte Carlo Tree Search (MCTS) method for MAPF tasks. Our approach utilizes the agent's observations to recreate the intrinsic Markov decision process, which is then used for planning with a tailored for multi-agent tasks version of neural MCTS. The experimental results show that our approach outperforms state-of-the-art learnable MAPF solvers. The source code is available at this https URL: https://github.com/AIRI-Institute/mats-lp

Читать в источнике

Intrinsic Motivation in Model-based Reinforcement Learning: A Brief Review

Артем Латышев, Александр Панов

Читать источник

Relational Object-Centric Actor-Critic

Леонид Угадяров, Виталий Воробьёв, Александр Панов

Читать источник

LookPlanGraph: Embodied instruction following method with VLM graph augmentation

Анатолий Онищенко, Алексей Ковалёв, Александр Панов

Читать источник

Accelerating Transformers in Online RL

Даниил Зелезецкий, Алексей Ковалёв, Александр Панов

Читать источник

Re:Frame - Retrieving Experience From Associative Memory

Даниил Зелезецкий, Егор Черепанов, Алексей Ковалёв, Александр Панов

Читать источник

Memory, Benchmark & Robots: A Benchmark for Solving Complex Tasks with Reinforcement Learning

Егор Черепанов, Никита Качаев, Алексей Ковалёв, Александр Панов

Читать источник

A New Perspective on Transformers in Online Reinforcement Learning for Continuous Control

Никита Качаев, Даниил Зелезецкий, Алексей Ковалёв, Александр Панов

Читать источник