BALI: Enhancing Biomedical Language Representations through Knowledge Graph and Language Model Alignment
Abstract
In recent years, there has been substantial progress in using pretrained Language Models (LMs) on a range of tasks aimed at improving the understanding of biomedical texts. Nonetheless, existing biomedical LLMs show limited comprehension of complex, domain-specific concept structures and the factual information encoded in biomedical Knowledge Graphs (KGs). In this work, we propose BALI (Biomedical Knowledge Graph and Language Model Alignment), a novel joint LM and KG pre-training method that augments an LM with external knowledge by simultaneously learning a dedicated KG encoder and aligning the representations of both the LM and the graph. For a given textual sequence, we link biomedical concept mentions to the Unified Medical Language System (UMLS) KG and use local KG subgraphs as cross-modal positive samples for these mentions. Our empirical findings indicate that applying our method to several leading biomedical LMs, such as PubMedBERT and BioLinkBERT, improves their performance on a range of language understanding tasks and the quality of entity representations, even with minimal pre-training on a small alignment dataset sourced from PubMed scientific abstracts.
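The abstract describes aligning LM mention representations with KG subgraph representations using cross-modal positive samples. The sketch below illustrates one common way such an alignment objective can be formulated, a symmetric InfoNCE-style contrastive loss over in-batch pairs; the function name, the temperature parameter, and the specific loss form are illustrative assumptions, as the abstract does not specify BALI's exact objective.

```python
# Minimal sketch of a cross-modal contrastive alignment objective between
# LM mention embeddings and KG subgraph embeddings. This is an assumed
# InfoNCE-style formulation, not necessarily BALI's exact loss.
import torch
import torch.nn.functional as F

def alignment_loss(mention_emb: torch.Tensor,
                   subgraph_emb: torch.Tensor,
                   temperature: float = 0.07) -> torch.Tensor:
    """Treats the i-th subgraph as the positive for the i-th mention;
    all other in-batch subgraphs serve as negatives, and vice versa."""
    # L2-normalize both modalities so dot products are cosine similarities.
    mention_emb = F.normalize(mention_emb, dim=-1)
    subgraph_emb = F.normalize(subgraph_emb, dim=-1)
    # Pairwise similarity matrix of shape (batch, batch).
    logits = mention_emb @ subgraph_emb.T / temperature
    targets = torch.arange(logits.size(0), device=logits.device)
    # Symmetric loss: mentions -> subgraphs and subgraphs -> mentions.
    return (F.cross_entropy(logits, targets) +
            F.cross_entropy(logits.T, targets)) / 2
```

Under this formulation, each training batch would pair the LM's embedding of a linked concept mention with the KG encoder's embedding of that concept's local UMLS subgraph, pulling matched pairs together while pushing apart mismatched ones.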