Source

PeerJ Computer Science

Publication date

31.01.2022

Authors

Илья Макаров, Мария Баханова, Сергей Николенко, Ольга Герасимова

Self-supervised recurrent depth estimation with attention mechanisms

Research article, Artificial intelligence, Computer vision, Robotics

Abstract

Depth estimation has been an essential task for many computer vision applications, especially in autonomous driving, where safety is paramount. Depth can be estimated not only with traditional supervised learning but also via a self-supervised approach that relies on camera motion and does not require ground truth depth maps. Recently, major improvements have been introduced to make self-supervised depth prediction more precise. However, most existing approaches still focus on single-frame depth estimation, even in the self-supervised setting. Since most methods can operate with frame sequences, we believe that the quality of current models can be significantly improved with the help of information about previous frames. In this work, we study different ways of integrating recurrent blocks and attention mechanisms into a common self-supervised depth estimation pipeline. We propose a set of modifications that utilize temporal information from previous frames and provide new neural network architectures for monocular depth estimation in a self-supervised manner. Our experiments on the KITTI dataset show that proposed modifications can be an effective tool for exploiting temporal information in a depth prediction pipeline.
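As a rough illustration of what "utilizing temporal information from previous frames" with an attention mechanism can mean, the sketch below blends a current-frame feature vector with feature vectors from earlier frames using scaled dot-product attention weights. It is not the architecture from the paper: the function names (`fuse_with_attention`, `softmax`, `dot`), the use of flat feature vectors instead of convolutional feature maps, and the choice of dot-product scoring are all assumptions made for the example.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def fuse_with_attention(current, previous):
    """Blend the current frame's features with those of earlier frames.

    Hypothetical helper: the current features act as the query, and every
    frame (earlier ones plus the current one) acts as both key and value.
    Returns the fused feature vector and the attention weights.
    """
    frames = previous + [current]
    scale = math.sqrt(len(current))
    weights = softmax([dot(current, f) / scale for f in frames])
    fused = [
        sum(w * f[i] for w, f in zip(weights, frames))
        for i in range(len(current))
    ]
    return fused, weights


# Toy usage: two earlier frames, one similar to the current frame, one not.
current = [1.0, 0.0]
previous = [[0.9, 0.1], [0.0, 1.0]]
fused, weights = fuse_with_attention(current, previous)
```

In this toy setup, earlier frames whose features resemble the current ones receive larger weights, so they contribute more to the fused representation that a depth decoder would then consume.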

Automatic Interpretation of Ancient Egyptian Texts for Education and Research

Максим Голядкин, Иннокентий Хумонен, I. Plevokas, Екатерина Буреева, Екатерина Александрова, Илья Макаров

Search Swarm: Multi-agent Large Language Models Framework for E-commerce Product Search

Нагим Исянбаев, Илья Макаров

Machine Learning Driven Optimization of Fe-Based TMCs for Photodynamic Therapy

Владимир Мануилов, Antonio Francés Monerris, Abdelazim Abdelgawwad, Daniel Escudero, Илья Макаров

ATGen: A Framework for Active Text Generation

Аким Цвигун, Даниил Васильев, Иван Цвигун, Иван Лысенко, Талгат Бектлеуов, Александр Медведев, Ульяна Виноградова, Никита Северин, Михаил Мозиков, Андрей Савченко, Ростислав Григорьев, Рамиль Кулеев, Федор Жданов, Артем Шелманов, Илья Макаров

WISP: Workframe for Interferogram Signal Phase-unwrapping

Тимофей Хирианов, Александра Хирианова, Егор Паркевич, Илья Макаров

MatMuls are Enough for Efficient and Performant Linear-Time Attention

Andrew Argatkiny, Илья Макаров

Optimizing state monitoring with domain degradation knowledge

Дмитрий Жевненко, Илья Макаров
