Source
ACL
DATE OF PUBLICATION
08/15/2024
Authors
Mikhail Burtsev
Yuri Kuratov
Aydar Bulatov
Alla Chepurova
Share
Prompt Me One More Time: A Two-Step Knowledge Extraction Pipeline with Ontology-Based Verification, Workshop TextGraphs-17: Graph-based Methods for Natural Language Processing
Abstract
This study explores a method for extending real-world knowledge graphs (specifically, Wikidata) by extracting triplets from texts with the aid of Large Language Models (LLMs). We propose a two-step pipeline that includes the initial extraction of entity candidates, followed by their refinement and linkage to the canonical entities and relations of the knowledge graph. Finally, we utilize Wikidata relation constraints to select only verified triplets. We compare our approach to a model that was fine-tuned on a machine-generated dataset and demonstrate that it performs better on natural data. Our results suggest that LLM-based triplet extraction from texts, with subsequent verification, is a viable method for real-world applications.
Similar publications
You can ask us a question or suggest a joint project in the field of AI
partner@airi.net
For scientific cooperation and
partnership
partnership
pr@airi.net
For journalists and media