Source

Logic Journal of the IGPL

Publication date

22.03.2024

Authors

Александр Панов

Анфиса Чуганская

Алексей Ковалёв

Sign-based image criteria for social interaction visual question answering

Abstract

Multi-modal tasks have started to play a significant role in artificial intelligence research. A particular example of this domain is visual–linguistic tasks, such as visual question answering. The progress of modern machine learning systems is determined, among other things, by the data on which these systems are trained. Most modern visual question answering data sets contain a limited range of question types that can be answered either by directly accessing the image itself or by using external data. At the same time, insufficient attention is paid to social interactions between people, which limits the scope of visual question answering systems. In this paper, we propose criteria, based on psychological research, for selecting images that are suitable for social interaction visual question answering and from which such questions can be composed. We believe this should serve the progress of visual question answering systems.

Model-based Policy Optimization using Symbolic World Model

Андрей Городецкий, Константин Миронов, Александр Панов

Instruction Following with Goal-Conditioned Reinforcement Learning in Virtual Environments

Зоя Воловикова, Алексей Скрынник, Петр Кудеров, Александр Панов

Generative models for grid-based and image-based pathfinding

Даниил Кирилленко, Антон Андрейчук, Александр Панов, Константин Яковлев

Skill Learning with Empowerment in Reinforcement Learning

Артем Латышев, Александр Панов

Hebbian spatial encoder with adaptive sparse connectivity

Петр Кудеров, Евгений Дживеликян, Александр Панов

OFMPNet: Deep end-to-end model for occupancy and flow prediction in urban environment

Youshaa Murhij, Дмитрий Юдин

FFStreams: Fast Search With Streams for Autonomous Maneuver Planning

Mais Jamal, Александр Панов
