Source
CIKM
DATE OF PUBLICATION
08/05/2024
Authors
Ivan Oseledets, Evgeny Frolov, Gleb Mezentsev, Danil Gusak

RECE: Reduced Cross-Entropy Loss for Large-Catalogue Sequential Recommenders

Abstract

Scalability is a major challenge in modern recommender systems. In sequential recommendations, full Cross-Entropy (CE) loss achieves state-of-the-art recommendation quality but consumes excessive GPU memory with large item catalogs, limiting its practicality. Using a GPU-efficient locality-sensitive hashing-like algorithm for approximating the large tensor of logits, this paper introduces a novel RECE (REduced Cross-Entropy) loss. RECE significantly reduces memory consumption while allowing one to enjoy the state-of-the-art performance of full CE loss. Experimental results on various datasets show that RECE cuts training peak memory usage by up to 12 times compared to existing methods while retaining or exceeding the performance metrics of CE loss. The approach also opens up new possibilities for large-scale applications in other domains.
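As a rough illustration of the general idea (not the authors' implementation), the PyTorch sketch below approximates full cross-entropy by scoring only a hashed candidate subset of the catalog rather than all items. The random-hyperplane (SimHash) bucketing, the names simhash_buckets and reduced_ce_loss, and all tensor shapes are assumptions made for this sketch; the paper's GPU-efficient algorithm differs in its details.

    import torch
    import torch.nn.functional as F

    def simhash_buckets(x: torch.Tensor, planes: torch.Tensor) -> torch.Tensor:
        """Map each row of x to an integer bucket via its sign pattern on random hyperplanes."""
        bits = (x @ planes > 0).long()                       # (n, n_bits)
        powers = 2 ** torch.arange(planes.shape[1], device=x.device)
        return (bits * powers).sum(dim=-1)                   # (n,) bucket ids

    def reduced_ce_loss(queries, item_emb, targets, n_bits=8, max_cands=256):
        """Cross-entropy over LSH-selected candidates instead of the full catalog.

        A single hash table and a per-example Python loop are used for clarity,
        not efficiency; this is only a sketch of hashing-based negative selection.
        """
        d = queries.shape[-1]
        planes = torch.randn(d, n_bits, device=queries.device)
        q_buckets = simhash_buckets(queries, planes)         # (batch,)
        i_buckets = simhash_buckets(item_emb, planes)        # (n_items,)

        losses = []
        for q, qb, t in zip(queries, q_buckets, targets):
            # Candidates: items colliding with the query's bucket (likely large
            # logits), truncated for memory; the true target is always included.
            cands = (i_buckets == qb).nonzero(as_tuple=True)[0][:max_cands]
            cands = torch.unique(torch.cat([cands, t.view(1)]))
            logits = item_emb[cands] @ q                     # scores for candidates only
            label = (cands == t).nonzero(as_tuple=True)[0]
            losses.append(F.cross_entropy(logits.unsqueeze(0), label))
        return torch.stack(losses).mean()

    # Hypothetical usage: a batch of 32 sequence states against a 100,000-item catalog.
    queries = torch.randn(32, 64)
    item_emb = torch.randn(100_000, 64)
    targets = torch.randint(0, 100_000, (32,))
    loss = reduced_ce_loss(queries, item_emb, targets)

The memory saving comes from the logits tensor: per example, at most max_cands + 1 candidate scores are materialized instead of one score per catalog item.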
