Relational Object-Centric Actor-Critic
Abstract
Advances in unsupervised object-centric representation learning have significantly improved the usefulness of such representations for downstream tasks. Recent works highlight that disentangled object representations can aid policy learning in image-based, object-centric reinforcement learning tasks. This paper proposes a novel object-centric reinforcement learning algorithm that integrates actor-critic and model-based approaches by incorporating an object-centric world model within the critic. The world model captures the environment’s data-generating process by predicting the next state and reward given the current state-action pair, where actions are interventions in the environment. In model-based reinforcement learning, world-model learning can be interpreted as a causal induction problem, in which the agent must learn the causal relationships underlying the environment’s dynamics. We evaluate our method in a simulated 3D robotic environment and a 2D environment with compositional structure. As baselines, we compare against object-centric, model-free actor-critic algorithms and a state-of-the-art monolithic model-based algorithm. While the baselines show comparable performance on the easier tasks, our approach outperforms them in more challenging scenarios with many objects or more complex dynamics.
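To make the described architecture concrete, below is a minimal, hypothetical sketch of an object-centric world model embedded inside a critic: the model maps per-object slot representations and an action to predicted next slots and a reward, and the critic combines the predicted reward with a value estimate of the predicted next state. This is not the authors' implementation; all module names, layer sizes, and the pairwise relational update are illustrative assumptions.

```python
# Illustrative sketch only, not the paper's code: an object-centric world model
# (slots + action -> next slots, reward) used inside a critic.
import torch
import torch.nn as nn


class ObjectCentricWorldModel(nn.Module):
    """Predicts next object slots and reward from (slots, action)."""

    def __init__(self, slot_dim: int, action_dim: int, hidden: int = 128):
        super().__init__()
        # Pairwise relational update: every slot receives messages from every other slot.
        self.pair_mlp = nn.Sequential(
            nn.Linear(2 * slot_dim + action_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, slot_dim),
        )
        # Reward head pools the predicted slots into a scalar reward.
        self.reward_head = nn.Sequential(
            nn.Linear(slot_dim, hidden), nn.ReLU(), nn.Linear(hidden, 1),
        )

    def forward(self, slots: torch.Tensor, action: torch.Tensor):
        # slots: (B, K, D), action: (B, A)
        B, K, D = slots.shape
        a = action.unsqueeze(1).unsqueeze(2).expand(B, K, K, -1)   # broadcast action to slot pairs
        src = slots.unsqueeze(2).expand(B, K, K, D)                # sender slots
        dst = slots.unsqueeze(1).expand(B, K, K, D)                # receiver slots
        messages = self.pair_mlp(torch.cat([src, dst, a], dim=-1))
        next_slots = slots + messages.sum(dim=2)                   # residual transition per slot
        reward = self.reward_head(next_slots.mean(dim=1)).squeeze(-1)
        return next_slots, reward


class WorldModelCritic(nn.Module):
    """Critic that scores (slots, action) via a one-step world-model rollout."""

    def __init__(self, world_model: ObjectCentricWorldModel, slot_dim: int, hidden: int = 128):
        super().__init__()
        self.world_model = world_model
        self.value_head = nn.Sequential(
            nn.Linear(slot_dim, hidden), nn.ReLU(), nn.Linear(hidden, 1),
        )

    def forward(self, slots: torch.Tensor, action: torch.Tensor, gamma: float = 0.99):
        next_slots, reward = self.world_model(slots, action)
        next_value = self.value_head(next_slots.mean(dim=1)).squeeze(-1)
        return reward + gamma * next_value                          # model-based action-value estimate


if __name__ == "__main__":
    wm = ObjectCentricWorldModel(slot_dim=32, action_dim=4)
    critic = WorldModelCritic(wm, slot_dim=32)
    slots = torch.randn(8, 5, 32)   # batch of 8 states, 5 object slots each
    action = torch.randn(8, 4)
    print(critic(slots, action).shape)  # torch.Size([8])
```

In this sketch the actor would be trained against the critic's model-based action-value estimate, while the world model itself would be fit on observed transitions and rewards; how these pieces are combined in practice is specified in the paper, not here.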