Is language acquisition similar in language models and humans? A chronological probing study
Abstract
The probing methodology makes it possible to recover a partial picture of the linguistic phenomena encoded in the inner layers of a neural network, using external classifiers and statistical analysis.
Pretrained transformer-based language models are widely used for both natural language understanding (NLU) and natural language generation (NLG) tasks, making them the most common choice for downstream applications. However, no analysis has been carried out on whether such models are pretrained sufficiently or contain knowledge that correlates with linguistic theory.
We present a chronological probing study of transformer-based English models, MultiBERT and T5. We sequentially compare the information about language that the models learn during training on corpora. The results show that 1) linguistic information is acquired at the early stages of training; 2) both language models demonstrate the ability to capture features from various levels of language, including morphology, syntax, and even discourse, while they can also fail inconsistently on tasks that are perceived as easy.
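As a minimal sketch of the general probing setup described above (not the authors' code): hidden states are extracted from a pretrained transformer and an external linear classifier is trained to predict a linguistic property from them. The checkpoint name, layer index, pooling strategy, and toy data below are placeholder assumptions.

```python
# Illustrative probing sketch: extract one layer's hidden states from a
# pretrained transformer and fit an external linear classifier on them.
import torch
from sklearn.linear_model import LogisticRegression
from transformers import AutoModel, AutoTokenizer

model_name = "bert-base-uncased"  # placeholder checkpoint, not from the paper
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModel.from_pretrained(model_name, output_hidden_states=True)
model.eval()

# Toy probing task: one binary label per sentence (e.g. subject-verb agreement).
sentences = ["The cats sleep .", "The cat sleep .",
             "She writes well .", "She write well ."]
labels = [1, 0, 1, 0]

def sentence_embedding(text, layer=6):
    """Mean-pool token representations from a single hidden layer."""
    with torch.no_grad():
        inputs = tokenizer(text, return_tensors="pt")
        outputs = model(**inputs)
    # hidden_states is a tuple: (embeddings, layer_1, ..., layer_N)
    return outputs.hidden_states[layer][0].mean(dim=0).numpy()

features = [sentence_embedding(s) for s in sentences]

# The external probe; its accuracy indicates how much of the property
# is linearly decodable from that layer's representations.
probe = LogisticRegression(max_iter=1000).fit(features, labels)
print("train accuracy:", probe.score(features, labels))
```

Repeating this for every layer, and for checkpoints saved at successive training steps, yields the kind of chronological, layer-wise picture the study reports.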