Source

ICML / NGSM

Publication date

27.03.2024

Authors

Алексей Староверов, Егор Черепанов, Дмитрий Юдин, Алексей Ковалёв, Александр Панов

Recurrent Action Transformer with Memory

Abstract

Recently, the use of transformers in offline reinforcement learning has become a rapidly developing area. This is due to their ability to treat the agent's trajectory in the environment as a sequence, thereby reducing the policy learning problem to sequence modeling. In environments where the agent's decisions depend on past events, it is essential to capture both the event itself and the decision point in the context of the model. However, the quadratic complexity of the attention mechanism limits the potential for context expansion. One solution to this problem is to enhance transformers with memory mechanisms. In this paper, we propose the Recurrent Action Transformer with Memory (RATE), a model that incorporates recurrent memory. To evaluate our model, we conducted extensive experiments in both memory-intensive environments (VizDoom-Two-Color, T-Maze) and classic Atari games and MuJoCo control environments. The results show that the use of memory can significantly improve performance in memory-intensive environments while maintaining or improving results in classic environments. We hope that our findings will stimulate research on memory mechanisms for transformers applicable to offline reinforcement learning.
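The context-length argument in the abstract can be illustrated with simple arithmetic. The sketch below is not taken from the paper: it assumes a generic segment-level scheme in which a long trajectory is split into fixed-length segments and each segment attends only over its own tokens plus a small number of memory tokens. The function names and the parameters `segment_len` and `n_memory` are hypothetical and chosen only for illustration.

```python
def full_attention_pairs(n_tokens: int) -> int:
    """Number of query-key pairs when attending over the whole sequence at once."""
    return n_tokens * n_tokens


def segmented_attention_pairs(n_tokens: int, segment_len: int, n_memory: int) -> int:
    """Query-key pairs when the sequence is processed segment by segment.

    Assumption (not from the paper): each segment attends over its own tokens
    plus `n_memory` memory tokens carried over from the previous segment.
    """
    total = 0
    remaining = n_tokens
    while remaining > 0:
        seg = min(segment_len, remaining)  # the last segment may be shorter
        total += (seg + n_memory) ** 2
        remaining -= seg
    return total


if __name__ == "__main__":
    n = 900
    print(full_attention_pairs(n))
    print(segmented_attention_pairs(n, segment_len=90, n_memory=10))
```

Under these assumptions the attention cost grows linearly with the number of segments instead of quadratically with the total trajectory length, which is the motivation for carrying information between segments through memory.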

IDAT: A Multi-Modal Dataset and Toolkit for Building and Evaluating Interactive Task-Solving Agents

Shrestha Mohanty, Negar Arabzadeh, Andrea Tupini, Yuxuan Sun, Алексей Скрынник, Артем Жолус, Marc-Alexandre Cote, Юлия Киселева

Intrinsic Motivation in Model-based Reinforcement Learning: A Brief Review

Артем Латышев, Александр Панов

Relational Object-Centric Actor-Critic

Леонид Угадяров, Виталий Воробьёв, Александр Панов

LookPlanGraph: Embodied instruction following method with VLM graph augmentation

Анатолий Онищенко, Алексей Ковалёв, Александр Панов

Workshop ICLR 2025 Accelerating Transformers in Online RL

Даниил Зелезецкий, Алексей Ковалёв, Александр Панов

GENATATOR: de novo Gene Annotation With DNA Language Model

Алексей Шмелёв, Artem Shadskiy, Юрий Куратов, Михаил Бурцев, Ольга Кардымон, Вениамин Фишман

Searching for Phenotypic Needles in Genomic Haystacks: DNA Language Models for Sex Prediction

Алла Чепурова, Юрий Куратов, Полина Белокопытова, Михаил Бурцев, Вениамин Фишман
