[an error occurred while processing this directive]

Hereditas(Beijing) ›› 2024, Vol. 46 ›› Issue (9): 701-715.doi: 10.16288/j.yczz.24-151

• Review • Previous Articles     Next Articles

Progress on deep learning in genomics

Yanchun Bao1,2(), Caixia Shi1, Chuanqiang Zhang3,4, Mingjuan Gu1, Lin Zhu1, Zaixia Liu1,2, Le Zhou1,2, Fengying Ma1,2, Risu Na1(), Wenguang Zhang5()   

  1. 1. College of Animal Science and Technology, Inner Mongolia Agricultural University, Hohhot 010018, China
    2. Inner Mongolia Engineering Research Center of Genomic Big Data for Agriculture, Hohhot 010018, China
    3. Inner Mongolia Saikexing Institute of Breeding and Reproductive Biotechnology in Domestic Animal, Hohhot 011517, China
    4. National Center of Technology Innovation for Dairy Industry, Hohhot 010080, China
    5. College of Life Sciences, Inner Mongolia Agricultural University, Hohhot 010021, China
  • Received:2024-05-27 Revised:2024-08-09 Online:2024-09-20 Published:2024-08-16
  • Contact: Risu Na, Wenguang Zhang E-mail:byc107054@163.com;narisu@swu.edu.cn;atcgnmbi@aliyun.com
  • Supported by:
    Natural Science Foundation of Inner Mongolia Autonomous Region(2021ZD05);Sub-theme of the National Dairy Technology Innovation Center Project(2023-JSGG-2);Basic Research Expenses of Universities Directly of Inner Mongolia Autonomous Region(BR221024);Basic Research Expenses of Universities Directly of Inner Mongolia Autonomous Region(BR221113);Funding for Double First-class Construction of Inner Mongolia Agricultural University(BZX202201);Funding for Double First-class Construction of Inner Mongolia Agricultural University(QF202206);Funding for Double First-class Construction of Inner Mongolia Agricultural University(NDYB2022-1)

Abstract:

With the rapid growth of data driven by high-throughput sequencing technologies, genomics has entered an era characterized by big data, which presents significant challenges for traditional bioinformatics methods in handling complex data patterns. At this critical juncture of technological progress, deep learning—an advanced artificial intelligence technology—offers powerful capabilities for data analysis and pattern recognition, revitalizing genomic research. In this review, we focus on four major deep learning models: Convolutional Neural Network(CNN), Recurrent Neural Network(RNN), Long Short-Term Memory(LSTM), and Generative Adversarial Network(GAN). We outline their core principles and provide a comprehensive review of their applications in DNA, RNA, and protein research over the past five years. Additionally, we also explore the use of deep learning in livestock genomics, highlighting its potential benefits and challenges in genetic trait analysis, disease prevention, and genetic enhancement. By delivering a thorough analysis, we aim to enhance precision and efficiency in genomic research through deep learning and offer a framework for developing and applying livestock genomic strategies, thereby advancing precision livestock farming and genetic breeding technologies.

Key words: deep learning, genome, CNN, RNN, LSTM, GAN