Source
NAACL
DATE OF PUBLICATION
04/29/2025
Authors
Share
Through the Looking Glass: Common Sense Consistency Evaluation of Weird Images
Abstract
Measuring how real images look is a complextask in artificial intelligence research. For example,an image of a boy with a vacuum cleanerin a desert violates common sense. We introducea novel method, which we call Throughthe Looking Glass (TLG), to assess image commonsense consistency using Large Vision-Language Models (LVLMs) and Transformerbasedencoder. By leveraging LVLMs to extractatomic facts from these images, we obtaina mix of accurate facts. We proceed byfine-tuning a compact attention-pooling classifierover encoded atomic facts. Our TLG hasachieved a new state-of-the-art performanceon the WHOOPS! and WEIRD datasets whileleveraging a compact fine-tuning component.
Similar publications
You can ask us a question or suggest a joint project in the field of AI
partner@airi.net
For scientific cooperation and
partnership
partnership
pr@airi.net
For journalists and media