Source
ICONIP
Publication date
26.11.2023
Authors
Aleksandr Panov, Petr Kuderov, Zoya Volovikova

Interpreting Decision Process in Offline Reinforcement Learning for Interactive Recommendation Systems

Abstract

Recommendation systems, which predict relevant and appealing items for users on web platforms, often rely on static user interests, resulting in limited interactivity and adaptability. Reinforcement Learning (RL) offers a dynamic and adaptive alternative, but it brings its own challenges in this context. Interpreting the behavior of an RL agent within a recommendation system is difficult because of the vast and continuously evolving state and action spaces, non-stationary user preferences, and the implicit, delayed rewards associated with long-term user satisfaction.
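As a toy illustration of this framing (our own sketch, not the paper's implementation; the names `RecState` and `step` are hypothetical), interactive recommendation can be cast as an MDP whose state is the user's recent interaction history, whose action is the next item to recommend, and whose reward is implicit feedback that only loosely proxies long-term satisfaction:

```python
from dataclasses import dataclass, field
from typing import List, Tuple

# Hypothetical, minimal MDP view of interactive recommendation.
# State: recent interaction history; Action: an item id to recommend;
# Reward: implicit feedback (e.g., a click or a listen).

@dataclass
class RecState:
    history: List[int] = field(default_factory=list)  # recently consumed item ids

def step(state: RecState, action: int, clicked: bool) -> Tuple[RecState, float]:
    """One transition: recommend item `action`, observe implicit feedback."""
    reward = 1.0 if clicked else 0.0  # implicit, delayed signal in practice
    next_state = RecState(history=(state.history + [action])[-50:])  # truncated history
    return next_state, reward

# Example transition: recommend item 42, the user clicks it.
s1, r = step(RecState(), action=42, clicked=True)
```

Even this sketch exposes the interpretability problem: the effective state space grows with every new item and user, and a click reward says little about whether the user will stay satisfied over a long horizon.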

To address the inherent complexities of applying RL to recommendation systems, we propose a framework that includes novel metrics and a synthetic environment. The metrics assess how well an RL agent adapts, in real time, to dynamic user preferences. We apply the framework to LastFM datasets to interpret the metric outcomes, and we test hypotheses about MDP setups and algorithm choices by adjusting dataset parameters within the synthetic environment. This illustrates potential applications of our framework while highlighting the need for further research in this area.
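For intuition only, here is a minimal sketch (our own assumption, not the paper's actual environment or metrics) of a synthetic user whose hidden preference vector drifts each step, an incremental value-estimating agent, and a crude adaptability score that compares recent relevance to lifetime relevance:

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical synthetic environment: preferences over n_items drift every
# step, mimicking non-stationary interests. The agent keeps a running value
# estimate updated from observed feedback.
n_items, horizon, drift, lr = 100, 1000, 0.02, 0.1
prefs = rng.random(n_items)    # true (hidden) user preferences
estimate = np.zeros(n_items)   # agent's running value estimate

rewards = []
for t in range(horizon):
    # Epsilon-greedy stand-in for a real RL policy.
    action = int(np.argmax(estimate)) if rng.random() > 0.1 else int(rng.integers(n_items))
    reward = prefs[action]                                # implicit feedback proxy
    estimate[action] += lr * (reward - estimate[action])  # incremental update
    rewards.append(reward / prefs.max())                  # normalized relevance
    # Preference drift: old estimates gradually go stale.
    prefs = np.clip(prefs + drift * rng.standard_normal(n_items), 0.0, 1.0)

# Crude adaptability score: mean relevance in a recent window relative to the
# lifetime mean; values near or above 1 suggest the policy tracks the drift.
window = 100
print(f"adaptability ratio: {np.mean(rewards[-window:]) / np.mean(rewards):.2f}")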
