Source
ACL
DATE OF PUBLICATION
05/21/2022
Authors
Artem Shelmanov Mikhail Burtsev Kirill Fedyanin Maxim Panov Manvel Avetisian Leonid Zhukov Alexander Panchenko Artem Vazhentsev Gleb Kuzmin Akim Tsvigun Evgenii Tsymbalov Gleb Gusev
Share

Uncertainty Estimation of Transformer Predictions for Misclassification Detection

Abstract

Uncertainty estimation (UE) of model predictions is a crucial step for a variety of tasks such as active learning, misclassification / adversarial attack / out-of-distribution detection, etc. Most of the works on modeling the uncertainty of deep neural networks evaluate these methods on image classification tasks. Little attention has been paid to UE in natural language processing. To fill this gap, we perform a vast empirical investigation of state-of-the-art UE methods for Transformer models on misclassification detection in named entity recognition and text classification tasks and propose two computationally efficient modifications, one of which improves the state of the art and outperforms computationally intensive methods.

Join AIRI