en

Об институте
Публикации
Блог
Карьера

en

Источник

IEEE Access

Дата публикации

13.06.2022

Авторы

Александр Панов

Алексей Староверов

Поделиться

Hierarchical Landmark Policy Optimization for Visual Indoor Navigation

Navigation, Neural networks, Robotics, Reinforcement learning, Complex Indoor environments

Аннотация

In this paper, we study the problem of visual indoor navigation to an object that is defined by its semantic category. Recent works have shown significant achievements in the end-to-end reinforcement learning approach and modular systems. However, both approaches need a big step forward to be robust and practically applicable. To solve the problem of insufficient exploration of the scenes and make exploration more semantically meaningful, we extend standard task formulation and give the agent easily accessible landmarks in the form of the room locations and those types. The availability of landmarks allows the agent to build a hierarchical policy structure and achieve a success rate of 63% on validation scenes in a photo-realistic Habitat simulator. In a hierarchy, a low level consists of separately trained RL skills and a high level deterministic policy, which decides which skill is needed at the moment. Also, in this paper, we show the possibility of transferring a trained policy to a real robot. After a bit of training on the reconstructed real scene, the robot shows up to 79% SPL when solving the task of navigating to an arbitrary object.

Читать в источнике

Похожие публикации

Decentralized Monte Carlo Tree Search for Partially Observable Multi-agent Pathfinding

Алексей Скрынник, Антон Андрейчук, Константин Яковлев, Александр Панов

Читать источник

TransPath: Learning Heuristics For Grid-Based Pathfinding via Transformers

Антон Андрейчук, Александр Панов, Константин Яковлев, Даниил Кириленко

Читать источник

Interactive Semantic Map Representation for Skill-Based Visual Object Navigation

Татьяна Земскова, Алексей Староверов, Кирилл Муравьев, Дмитрий Юдин, Александр Панов

Читать источник

Logic Journal of the IGPL

Sign-based image criteria for social interaction visual question answering

Анфиса Чуганская, Алексей Ковалёв, Александр Панов

Читать источник

The Shape of Learning: Anisotropy and Intrinsic Dimensions in Transformer-Based Models

Антон Разжигаев, Матвей Михальчук, Елизавета Гончарова, Иван Оселедец, Денис Димитров, Андрей Кузнецов

Читать источник

Improved Anonymous Multi-Agent Path Finding Algorithm

Zain Alabedeen Ali , Константин Яковлев

Читать источник

Learn to Follow: Decentralized Lifelong Multi-agent Pathfinding via Planning and Learning

Алексей Скрынник, Антон Андрейчук, Maria Nesterova, Константин Яковлев, Александр Панов

Читать источник

Институт искусственного интеллекта AIRI

Вы можете задать нам вопрос или предложить совместный проект в области ИИ

Об институте
Публикации
Блог
Карьера

partner@airi.net

По вопросам научного
сотрудничества и партнерства

pr@airi.net

Для журналистов и СМИ

people@airi.net

По вопросам, связанным с HR

© 2024, AIRI

Присоединяйтесь к AIRI в соцсетях

Имя Почта Обращение Я не робот Отправляя форму, я даю согласие на обработку моих персональных данных

Сообщение отправлено.

Спасибо!

Что-то пошло не так. Попробуйте снова

Об институте
Публикации
Блог
Карьера

Связаться

Присоединяйтесь к AIRI в соцсетях

Вы можете задать нам вопрос или предложить совместный проект в области ИИ

Имя Почта Обращение Я не робот Отправляя форму, я даю согласие на обработку моих персональных данных

Сообщение отправлено.

Спасибо!

Что-то пошло не так. Попробуйте снова

partner@airi.net

По вопросам научного
сотрудничества и партнерства

pr@airi.net

Для журналистов и СМИ