Source
ACL
DATE OF PUBLICATION
05/21/2022
Authors
Artem Shelmanov
Mikhail Burtsev
Kirill Fedyanin
Maxim Panov
Manvel Avetisian
Leonid Zhukov
Alexander Panchenko
Artem Vazhentsev
Gleb Kuzmin
Akim Tsvigun
Evgenii Tsymbalov
Gleb Gusev
Share
Uncertainty Estimation of Transformer Predictions for Misclassification Detection
Abstract
Uncertainty estimation (UE) of model predictions is a crucial step for a variety of tasks such as active learning, misclassification / adversarial attack / out-of-distribution detection, etc. Most of the works on modeling the uncertainty of deep neural networks evaluate these methods on image classification tasks. Little attention has been paid to UE in natural language processing. To fill this gap, we perform a vast empirical investigation of state-of-the-art UE methods for Transformer models on misclassification detection in named entity recognition and text classification tasks and propose two computationally efficient modifications, one of which improves the state of the art and outperforms computationally intensive methods.
Similar publications
You can ask us a question or suggest a joint project in the field of AI
partner@airi.net
For scientific cooperation and
partnership
partnership
pr@airi.net
For journalists and media