Источник
GigaScience
Дата публикации
27.03.2023
Авторы
Манвел Аветисян Ольга Кардымон Вениамин Фишман Николай Чеканов Никита Баранов Elian Malkin Александр Лапин Мария Синдеева
Поделиться

Cell type-specific interpretation of noncoding variants using deep learning-based methods

Аннотация

Interpretation of non-coding genomic variants is one of the most important challenges in human genetics. Machine learning methods have emerged recently as a powerful tool to solve this problem. State-of-the-art approaches allow prediction of transcriptional and epigenetic effects caused by non-coding mutations. However, these approaches require specific experimental data for training and can not generalize across cell types where required features were not experimentally measured. We show here that available epigenetic characteristics of human cell types are extremely sparse, limiting those approaches that rely on specific epigenetic input. We propose a new neural network architecture, DeepCT, which can learn
complex interconnections of epigenetic features and infer unmeasured data from any available input. Furthermore, we show that DeepCT can learn cell type-specific properties, build biologically meaningful vector representations of cell types and utilize these representations to generate cell type-specific predictions of the effects of non-coding variations in the human genome.

Присоединяйтесь к AIRI в соцсетях