Multi-modal RGBD Attention Fusion for Dense Depth Estimation
With the development of autonomous vehicles and augmented reality devices, LiDAR sensors and cameras are becoming the main tools for object recognition. However, fusing information from multiple data sources remains a challenging task in computer vision. One of the most promising directions for such fusion is self-supervised training, yet previous works in this field have relied only on simple fusion mechanisms. In this paper, a novel model architecture with fusion blocks improved by attention mechanisms is presented, together with a comparison of the impact of input modalities and loss functions on the model. Experiments demonstrate the effectiveness of the presented fusion block at fusing LiDAR and camera data. The proposed neural network architecture and learning framework show promising results on the depth completion task.
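To make the idea of an attention-based fusion block concrete, the following is a minimal sketch of one common way such a block can combine sparse-depth (LiDAR) features with camera (RGB) features via scaled dot-product cross-attention. All names, shapes, and the residual-fusion choice are illustrative assumptions for exposition, not the paper's actual architecture.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def attention_fuse(rgb_feat, depth_feat):
    """Cross-attention fusion sketch: depth features attend to RGB features.

    rgb_feat, depth_feat: (N, d) arrays -- N spatial locations, d channels.
    (Illustrative only; a real block would use learned Q/K/V projections.)
    """
    d = rgb_feat.shape[1]
    # Scaled dot-product attention scores between modalities: (N, N).
    scores = depth_feat @ rgb_feat.T / np.sqrt(d)
    weights = softmax(scores, axis=-1)
    # Aggregate RGB features weighted by cross-modal attention: (N, d).
    attended = weights @ rgb_feat
    # Residual fusion: keep the depth stream and add attended RGB context.
    return depth_feat + attended

rng = np.random.default_rng(0)
rgb = rng.standard_normal((16, 8))
dep = rng.standard_normal((16, 8))
fused = attention_fuse(rgb, dep)
print(fused.shape)
```

Each output location mixes information from both modalities: the attention weights decide, per depth-feature location, which RGB locations contribute, which is the kind of learned (rather than fixed, e.g. concatenation-based) fusion the abstract contrasts with simpler mechanisms.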