[an error occurred while processing this directive]

Hereditas(Beijing) ›› 2019, Vol. 41 ›› Issue (11): 1041-1049.doi: 10.16288/j.yczz.19-155

• Research Article • Previous Articles     Next Articles

MHC-I epitope presentation prediction based on transfer learning

Weipeng Hu1,2,3,Youping Li2,3,4,Xiuqing Zhang2,3,4()   

  1. 1. School of Biology and Biological Engineering, South China University of Technology, Guangzhou, 510006, China
    2. BGI-Shenzhen, Shenzhen 518083, China
    3. BGI-GenoImmune, Wuhan 4300794, China
    4. BGI Education Center, University of Chinese Academy of Sciences, Shenzhen 518083, China
  • Received:2019-06-21 Revised:2019-09-17 Online:2019-11-20 Published:2019-10-08
  • Contact: Zhang Xiuqing E-mail:zhangxq@genomics.cn
  • Supported by:
    Supported by the National Natural Science Foundation of China Nos.(81702826);Supported by the National Natural Science Foundation of China Nos.(81772910);Science, Technology and Innovation Commission of Shenzhen Municipality No.(JCYJ20170303151334808);and Shenzhen Municipal Government of China No.(20170731162715261)

Abstract:

Accurate epitope presentation prediction is a key procedure in tumour immunotherapies based on neoantigen for targeting T cell specific epitopes. Epitopes identified by mass spectrometry (MS) is valuable to train an epitope presentation prediction model. In spite of the accelerating accumulation of MS data, the number of epitopes that match most of human leukocyte antigens (HLAs) is relatively small, which makes it difficult to build a reliable prediction model. Therefore, this research attempted to use the transfer learning method to train a model to learn common features among the mixed allele specific epitopes. Then based on this pre-trained model, we used the allele-specific epitopes to train the final epitope presentation prediction model, termed Pluto. The average 0.1% positive predictive value (PPV) of Pluto outperformed the prediction model without pretraining with a margin of 0.078 on the same validation dataset. When evaluating Pluto on external HLA eluted ligand datasets, Pluto achieved an averaged 0.1% PPV of 0.4255, which is better than the prediction model without pretraining (0.3824) and other popular methods, including MixMHCpred (0.3369), NetMHCpan4.0-EL (0.4000), NetMHCpan4.0-BA (0.3188) and MHCflurry (0.3002). Moreover, when it comes to the evaluation of predicting immunogenicity, Pluto can identify more neoantigens than other tools. Pluto is publicly available at https://github.com/weipenegHU/Pluto.

Key words: immunotherapy, neoantigen, epitope presentation, deep learning, transfer learning