Natural language is the most accessible and flexible source of data on human thinking. The result of the intellectual activity of many people, the text data, allows modern neural networks to generalize knowledge, acquire skills, and reproduce new texts for given tasks.
The NLP department combines research in the field of language modelling, benchmarks, and creating multilingual and multimodal models.
Multilingual Model Training
Models uniting texts, images, speech, sound
BlackboxNLP and low-resource languages problem solving
Large and reliable corpora collection in 60+ languages
Testing intellectual abilities of humans and models