Source

ACL

Date of publication

05/21/2022

Authors

Artem Vazhentsev, Gleb Kuzmin, Artem Shelmanov, Akim Tsvigun, Evgenii Tsymbalov, Kirill Fedyanin, Maxim Panov, Alexander Panchenko, Gleb Gusev, Mikhail Burtsev, Manvel Avetisian, Leonid Zhukov

Uncertainty Estimation of Transformer Predictions for Misclassification Detection

Uncertainty estimation, Transformers, Mahalanobis distance, Dropout, Determinantal point process

Abstract

Uncertainty estimation (UE) of model predictions is a crucial step for a variety of tasks, such as active learning and the detection of misclassifications, adversarial attacks, and out-of-distribution inputs. Most work on modeling the uncertainty of deep neural networks evaluates methods on image classification tasks, and little attention has been paid to UE in natural language processing. To fill this gap, we perform a vast empirical investigation of state-of-the-art UE methods for Transformer models on misclassification detection in named entity recognition and text classification tasks. We also propose two computationally efficient modifications, one of which improves the state of the art and outperforms computationally intensive methods.
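
To make the misclassification detection setting concrete, here is a minimal sketch of a common baseline: score each prediction by its softmax response (one minus the maximum class probability) and measure how well that score separates wrong predictions from correct ones with ROC AUC. This is an illustration under assumptions, not the paper's proposed modifications or its evaluation protocol. The function names, toy logits, and gold labels below are all hypothetical.

```python
import math


def softmax(logits):
    """Convert raw logits to probabilities (shifted by the max for stability)."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def softmax_response_uncertainty(logits):
    """Baseline uncertainty score: 1 minus the maximum class probability."""
    return 1.0 - max(softmax(logits))


def roc_auc(scores, labels):
    """Probability that a random positive outscores a random negative.

    Ties count as half a win. Here label 1 marks a misclassified instance.
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("ROC AUC needs at least one positive and one negative")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical classifier outputs for four inputs and three classes.
logits_batch = [
    [4.0, 0.5, 0.1],  # confident prediction
    [0.9, 1.0, 0.8],  # nearly uniform prediction
    [0.2, 3.5, 0.3],  # confident prediction
    [1.2, 1.0, 1.1],  # nearly uniform prediction
]
gold = [0, 2, 1, 1]  # hypothetical gold labels

# Predicted class is the argmax of the logits.
predictions = [max(range(len(l)), key=l.__getitem__) for l in logits_batch]
# An instance is an "error" when the prediction disagrees with the gold label.
errors = [int(p != g) for p, g in zip(predictions, gold)]
uncertainties = [softmax_response_uncertainty(l) for l in logits_batch]

auc = roc_auc(uncertainties, errors)
print(f"errors={errors} misclassification-detection ROC AUC={auc:.2f}")
```

A higher ROC AUC means the uncertainty score ranks misclassified instances above correctly classified ones more reliably. Other scores, such as dropout-based or Mahalanobis-distance-based estimates, can be plugged in place of `softmax_response_uncertainty` and compared in the same way.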

Similar publications

BABILong: Testing the Limits of LLMs with Long Context Reasoning-in-a-Haystack

Yuri Kuratov, Aydar Bulatov, Petr Anokhin, Ivan Rodkin, Dmitry Sorokin, Artyom Sorokin, Mikhail Burtsev

EAI: Emotional Decision-Making of LLMs in Strategic Games and Ethical Dilemmas

Mikhail Mozikov, Nikita Severin, Valeria Bodishtianu, Maria Glushanina, Ivan Nasonov, Daniil Orekhov, Vladislav Pekhotin, Ivan Makovetskiy, Mikhail Baklashkin, Vasily Lavrentyev, Akim Tsvigun, Denis Turdakov, Tatyana Shavrina, Andrey Savchenko, Ilya Makarov

GENATATOR: de novo Gene Annotation With DNA Language Model

Aleksei Shmelev, Artem Shadskiy, Yuri Kuratov, Mikhail Burtsev, Olga Kardymon, Veniamin Fishman

Searching for Phenotypic Needles in Genomic Haystacks: DNA Language Models for Sex Prediction

Alla Chepurova, Yuri Kuratov, Polina Belokopytova, Mikhail Burtsev, Veniamin Fishman

SkipCLM: Enhancing Crosslingual Alignment of Decoder Transformer Models via Contrastive Learning and Skip Connection

Nikita Sushko, Alexander Panchenko, Elena Tutubalina

Inference-Time Selective Debiasing to Enhance Fairness in Text Classification Models

Gleb Kuzmin, Nemeesh Yadav, Ivan Smirnov, Timothy Baldwin, Artem Shelmanov

Through the Looking Glass: Common Sense Consistency Evaluation of Weird Images

Elisei Rykov, Ksenia Petrushina, Ksenia Titova, Anton Razzhigaev, Alexander Panchenko, Vasily Konovalov
