Attention Understands Semantic Relations
Today, natural language processing relies heavily on pre-trained large language models. Although such models are criticized
for their poor interpretability, they still yield state-of-the-art solutions for a wide range of very different tasks. While many probing
studies have been conducted to measure models' awareness of grammatical knowledge, semantic probing is less common.
In this work, we introduce a probing pipeline to study how semantic relations are represented in transformer language
models. We show that on this task, attention scores are nearly as expressive as the layers' output activations, despite their lesser
ability to represent surface cues. This supports the hypothesis that attention mechanisms focus not only on syntactic
relational information but also on semantic information.
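The probing setup the abstract describes can be illustrated with a minimal sketch: a simple linear classifier is trained to predict a semantic relation label from a frozen model's attention scores. Everything below (feature layout, synthetic data, the binary label) is an assumption for illustration, not the paper's actual pipeline.

```python
# Minimal probing sketch (illustrative; data and feature layout are assumed,
# not taken from the paper). A linear probe is trained to predict whether a
# semantic relation holds for a word pair, using only attention scores.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score

rng = np.random.default_rng(0)

# Synthetic stand-in features: one attention-score vector per word pair,
# e.g. the score every head of every layer assigns between the two tokens
# (12 layers x 12 heads = 144 in a BERT-base-sized model).
n_pairs, n_heads = 1000, 144
X = rng.normal(size=(n_pairs, n_heads))
y = rng.integers(0, 2, size=n_pairs)  # binary label: relation holds or not

# Inject a recoverable signal into a few heads so the probe has
# something to find (stands in for heads that track the relation).
X[y == 1, :8] += 1.0

X_tr, X_te, y_tr, y_te = train_test_split(
    X, y, test_size=0.2, random_state=0
)
probe = LogisticRegression(max_iter=1000).fit(X_tr, y_tr)
acc = accuracy_score(y_te, probe.predict(X_te))
print(f"probe accuracy on attention scores: {acc:.2f}")
```

If the probe scores well above chance, the attention scores are taken to encode the relation; the paper's finding is that such probes come close to probes trained on the layers' output activations.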