ru

About
Publications
Blog
Careers

ru

Source

WSDM

DATE OF PUBLICATION

03/10/2025

Authors

Oleg Somov

Share

The generalization and error detection in LLM-based Text-to-SQL systems

Text-to-SQL, distribution shift, generalization, error detection

Abstract

Text-to-SQL systems streamline human-database interactions, improvingdata retrieval and decision-making. Although large languagemodels (LLMs) can now generate SQL code, challenges withgeneralization and uncontrolled generation hinder their use in production.Text-to-SQL tasks are particularly sensitive to distributionshifts, where performance declines with unfamiliar database elementsor novel queries. Effective systems must maintain quality,measured in terms of generalization (correct processing of noveluser requests) and error detection (identification of incorrect generations).This study empirically assesses LLM-based Text-to-SQLsystems limitations, defining reliable production scenarios. Currentcontributions include a cross-lingual generalization research,study on generative model generalization abilities and the qualityof selective classification for error detection risk under differentdistribution shifts in task of Text-to-SQL.

Full text

Similar publications

ShortPathQA: A Dataset for Controllable Fusion of Large Language Models with Knowledge Graphs

Mikhail Salnikov, Andrey Sakhovskiy, Irina Nikishina, Aida Usmanova, Angelie Kraft, Cedric Möller, Debayan Banerjee, Junbo Huang, Longquan Jiang, Rana Abdullah, Xi Yan, Elena Tutubalina, Ricardo Usbeck, Alexander Panchenko

SOURCE

The benefits of query-based KGQA systems for complex and temporal questions in LLM era

Artem Alekseev, Mikhail Chaichuk, Miron Butko, Alexander Panchenko, Elena Tutubalina, Oleg Somov

SOURCE

BALI: Enhancing Biomedical Language Representations through Knowledge Graph and Language Model Alignment

Andrey Sakhovskiy, Elena Tutubalina

SOURCE

Overview of the 10th Social Media Mining for Health (#SMM4H) and Health Real-World Data (HeaRD) Shared Tasks at ICWSM 2025

Graciela Gonzalez-Hernandez, Dongfang Xu, Takeshi Onishi, Guillermo Lopez-Garcia, Ivan Flores, Ari Klein, Abeed Sarker, Jeanne Powell, Swati Rajwal, Pierre Zweigenbaum, Lisa Raithel, Roland Roller, Philippe Thomas, Elena Tutubalina, Tirthankar Dasgupta, Manjira Sinha, Sudeshna Jana, Sedigh Khademi

SOURCE

SkipCLM: Enchancing Crosslingual Alignment of Decoder Transformer Models via Contrastive Learning and Skip Connection

Nikita Sushko, Alexander Panchenko, Elena Tutubalina

SOURCE

SynthDetoxM: Modern LLMs are Few-Shot Parallel Detoxification Data Annotators

Daniil Moskovskiy, Nikita Sushko, Sergey Pletenev, Alexander Panchenko, Elena Tutubalina

SOURCE

BioASQ at CLEF2025: The thirteenth edition of the large-scale biomedical semantic indexing and question answering challenge

Anastasios Nentidis, Georgios Katsimpras, Anastasia Krithara, Martin Krallinger, Miguel Rodriguez Ortega, Natalia Loukachevitch, Andrey Sakhovskiy, Elena Tutubalina, Grigorios Tsoumakas, George Giannakoulas, Alexandra Bekiaridou, Athanasios Samaras, Giorgio Maria Di Nunzio, Nicola Ferro, Stefano Marchesin, Laura Menotti, Gianmaria Silvello, Georgios Paliouras

SOURCE

AIRI Institute

You can ask us a question or suggest a joint project in the field of AI

About
Publications
Blog
Careers

event@airi.net

For events invitations

partner@airi.net

For scientific cooperation and
partnership

pr@airi.net

For journalists and media

people@airi.net

For any questions connected with
employees and employment

© 2025, AIRI

Join AIRI

Name Email Your message I'm not a robot By submitting the form, I consent to the processing of my personal data

Message sent.

Thank you!

Something went wrong. Try again

About
- Values
- Numbers
- Focus areas
- Research
- Partners
- Management
- Contacts
Publications
Blog
Careers

Contact us

Join AIRI

You can ask us a question or suggest a joint project in the field of AI

Name Email Your message I'm not a robot By submitting the form, I consent to the processing of my personal data

Message sent.

Thank you!

Something went wrong. Try again

partner@airi.net

For scientific cooperation and
partnership

pr@airi.net

For journalists and media