[an error occurred while processing this directive]

Hereditas(Beijing) ›› 2023, Vol. 45 ›› Issue (10): 922-932.doi: 10.16288/j.yczz.23-120

• Research Article • Previous Articles     Next Articles

Genomic prediction of pig growth traits based on machine learning

Dong Chen1,2,3(), Shujie Wang1,2,3, Zhenjian Zhao1,2,3, Xiang Ji1,2,3, Qi Shen1,2,3, Yang Yu1,2,3, Shengdi Cui1,2,3, Junge Wang1,2,3, Ziyang Chen1,2,3, Jinyong Wang4, Zongyi Guo4, Pingxian Wu4, Guoqing Tang1,2,3()   

  1. 1. Key Laboratory of Livestock and Poultry Multi-omics, Ministry of Agriculture and Rural Affairs, College of Animal Science and Technology, Sichuan Agricultural University, Chengdu 611130, China
    2. Farm Animal Genetic Resources Exploration and Innovation Key Laboratory of Sichuan Province, Sichuan Agricultural University, Chengdu 611130, China
    3. State Key Laboratory of Swine and Poultry Breeding Industry, College of Animal Science and Technology, Sichuan Agricultural University, Chengdu 611130, China
    4. National Center of Technology Innovation for Pigs, Chongqing 402460, China
  • Received:2023-04-26 Revised:2023-08-14 Online:2023-10-20 Published:2023-08-16
  • Contact: Guoqing Tang E-mail:1123278154@qq.com;tyq003@163.com
  • Supported by:
    Strategic Priority Research Program of the National Center of Technology Innovation for Pigs(NCTIP-XD/B01);Sichuan Science and Technology Program(2020YFN0024);Sichuan Science and Technology Program(2021ZDZX0008);Sichuan Science and Technology Program(2021YFYZ0030);Sichuan Innovation Team of Pig(sccxtd-2022-08)

Abstract:

This study aimed to assess and compare the performance of different machine learning models in predicting selected pig growth traits and genomic estimated breeding values (GEBV) using automated machine learning, with the goal of optimizing whole-genome evaluation methods in pig breeding. The research employed genomic information, pedigree matrices, fixed effects, and phenotype data from 9968 pigs across multiple companies to derive four optimal machine learning models: deep learning (DL), random forest (RF), gradient boosting machine (GBM), and extreme gradient boosting (XGB). Through 10-fold cross-validation, predictions were made for GEBV and phenotypes of pigs reaching weight milestones (100 kg and 115 kg) with adjustments for backfat and days to weight. The findings indicated that machine learning models exhibited higher accuracy in predicting GEBV compared to phenotypic traits. Notably, GBM demonstrated superior GEBV prediction accuracy, with values of 0.683, 0.710, 0.866, and 0.871 for B100, B115, D100, and D115, respectively, slightly outperforming other methods. In phenotype prediction, GBM emerged as the best-performing model for pigs with B100, B115, D100, and D115 traits, achieving prediction accuracies of 0.547, followed by DL at 0.547, and then XGB with accuracies of 0.672 and 0.670. In terms of model training time, RF required the most time, while GBM and DL fell in between, and XGB demonstrated the shortest training time. In summary, machine learning models obtained through automated techniques exhibited higher GEBV prediction accuracy compared to phenotypic traits. GBM emerged as the overall top performer in terms of prediction accuracy and training time efficiency, while XGB demonstrated the ability to train accurate prediction models within a short timeframe. RF, on the other hand, had longer training times and insufficient accuracy, rendering it unsuitable for predicting pig growth traits and GEBV.

Key words: genomic estimated breeding values, growth traits, automated machine learning, performance comparison