Source

ACL

DATE OF PUBLICATION

08/15/2024

Authors

Mikhail Burtsev

Yuri Kuratov

Aydar Bulatov

Alla Chepurova

Prompt Me One More Time: A Two-Step Knowledge Extraction Pipeline with Ontology-Based Verification, Workshop TextGraphs-17: Graph-based Methods for Natural Language Processing

Abstract

This study explores a method for extending real-world knowledge graphs (specifically, Wikidata) by extracting triplets from texts with the aid of Large Language Models (LLMs). We propose a two-step pipeline that includes the initial extraction of entity candidates, followed by their refinement and linkage to the canonical entities and relations of the knowledge graph. Finally, we utilize Wikidata relation constraints to select only verified triplets. We compare our approach to a model that was fine-tuned on a machine-generated dataset and demonstrate that it performs better on natural data. Our results suggest that LLM-based triplet extraction from texts, with subsequent verification, is a viable method for real-world applications.
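The flow described in the abstract (candidate extraction, linking to canonical identifiers, constraint-based verification) can be illustrated with a minimal sketch. This is not the authors' implementation: the LLM call is replaced by a hard-coded stub, linking is a plain dictionary lookup rather than an LLM refinement step, and the tiny entity, relation, and constraint tables are hand-written for illustration instead of being fetched from Wikidata. All function and variable names are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Triplet:
    subject: str
    relation: str
    obj: str


# Toy stand-ins for the knowledge graph. Hand-written for illustration,
# not fetched from Wikidata.
ENTITY_INDEX = {"douglas adams": "Q42", "cambridge": "Q350"}
RELATION_INDEX = {"born in": "P19", "place of birth": "P19"}
ENTITY_TYPES = {"Q42": {"Q5"}, "Q350": {"Q515"}}
# relation -> (allowed subject types, allowed object types)
RELATION_CONSTRAINTS = {"P19": ({"Q5"}, {"Q515"})}


def extract_candidates(text):
    """Step 1 (stub): a real system would prompt an LLM with `text` here."""
    return [
        Triplet("Douglas Adams", "born in", "Cambridge"),
        Triplet("Cambridge", "born in", "Douglas Adams"),  # plausible LLM error
    ]


def link(candidate):
    """Step 2 (simplified): map surface forms to canonical IDs.

    Returns None when any part of the triplet cannot be linked.
    """
    s = ENTITY_INDEX.get(candidate.subject.lower())
    r = RELATION_INDEX.get(candidate.relation.lower())
    o = ENTITY_INDEX.get(candidate.obj.lower())
    if s is None or r is None or o is None:
        return None
    return Triplet(s, r, o)


def verify(t):
    """Step 3: keep a triplet only if subject and object types fit the relation."""
    constraints = RELATION_CONSTRAINTS.get(t.relation)
    if constraints is None:
        return False  # assumption: relations without known constraints are rejected
    subj_allowed, obj_allowed = constraints
    subj_ok = bool(ENTITY_TYPES.get(t.subject, set()) & subj_allowed)
    obj_ok = bool(ENTITY_TYPES.get(t.obj, set()) & obj_allowed)
    return subj_ok and obj_ok


def run_pipeline(text):
    """Run extraction, linking, and verification in sequence."""
    linked = (link(c) for c in extract_candidates(text))
    return [t for t in linked if t is not None and verify(t)]


verified = run_pipeline("Douglas Adams was born in Cambridge.")
```

In this sketch the reversed candidate is dropped at the verification step because its subject is not typed as a human, which is the kind of filtering that relation constraints make possible. Rejecting relations with no known constraints is a conservative choice made here for simplicity; a real system might accept them or route them to a separate review step.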

Similar publications

Neural Computation

Associative Learning and Active Inference

Petr Anokhin, Artyom Sorokin, Mikhail Burtsev, Karl Friston

Biomedical Signal Processing and Control

Negligible effect of brain MRI data preprocessing for tumor segmentation

Ekaterina Kondratyeva, Polina Druzhinina, Alexandra Dalechina, Svetlana Zolotova, Andrey Golanov, Boris Shirokikh, Mikhail Belyaev, Anvar Kurmukov

Nucleic Acids Research

GENA-Web - GENomic Annotations Web Inference using DNA language models

Aleksei Shmelev, Maxim Petrov, Dmitry Penzar, Nikolay Akhmetyanov, Maksim Tavritskiy, Stepan Mamontov, Yuri Kuratov, Mikhail Burtsev, Olga Kardymon, Veniamin Fishman

User Modeling and User-Adapted Interaction

Federated privacy-preserving collaborative filtering for on-device next app prediction

Albert Sayapin, Gleb Balitskiy, Daniel Bershatsky, Aleksandr Katrutsa, Evgeny Frolov, Alexey Frolov, Ivan Oseledets, Vitaliy Kharin

Weak-to-Strong 3D Object Detection with X-Ray Distillation

Alexander Gambashidze, Aleksandr Dadukin, Maksim Golyadkin, Maria Razzhivina, Ilya Makarov

BioASQ at CLEF2024: The Twelfth Edition of the Large-Scale Biomedical Semantic Indexing and Question Answering Challenge

Anastasios Nentidis, Anastasia Krithara, Georgios Paliouras, Martin Krallinger, Luis Gasco Sánchez, Salvador Lima, Eulalia Farre, Natalia Loukachevitch, Vera Davydova, Elena Tutubalina

ICLR / Workshop

Recurrent memory augmentation of GENA-LM improves performance on long DNA sequence tasks

Yuri Kuratov, Aleksei Shmelev, Veniamin Fishman, Olga Kardymon, Mikhail Burtsev

AIRI Institute

You can ask us a question or suggest a joint project in the field of AI

event@airi.net

For event invitations

partner@airi.net

For scientific cooperation and partnership

pr@airi.net

For journalists and media

people@airi.net

For questions about employees and employment

© 2024, AIRI
