[an error occurred while processing this directive]

Hereditas(Beijing) ›› 2021, Vol. 43 ›› Issue (11): 1023-1037.doi: 10.16288/j.yczz.21-214

• Orginal Articles • Previous Articles     Next Articles

Pan-genome: setting a new standard for high-quality reference genomes

Peipei Bian(), Yu Zhang, Yu Jiang()   

  1. College of Animal Science and Technology, Northwest A&F University, Yangling 712100, China
  • Received:2021-08-26 Revised:2021-10-28 Online:2021-11-20 Published:2021-10-28
  • Contact: Jiang Yu E-mail:bppisc@163.com;yu.jiang@nwafu.edu.cn
  • Supported by:
    Supported by the National Natural Science Foundation of China No(31822052)

Abstract:

With the release of high-quality reference genomes assembled by long reads from the third-generation sequencing technology, as well as extensive re-sequencing and population genetic analysis, researchers found that a single reference genome does not represent the diversity within a species. The missing sequences on the reference genome result in an incomplete population genetic polymorphism map. The emergence of pan-genome can well repair the deficiency of single reference genome, which include core genome (responsible for basic biological functions and the main phenotypic characteristics within a species) and the variable genome (related to the genetic diversity or biological characteristics). According to the core and variable genome proportion, the types of pan-genomes can be either open or closed. Here, we review the current exploring of pan-genome for a range of species, to discuss the characteristics of pan-genome in various biological groups. The pan-genome of mammals are more likely closed, while the pan-genomes of microbes, angiosperms, and some invertebrates are likely non-closed. It is possible to complete the reference genome and obtain complete variation information through the pan-genomic study, which will contribute to the study of molecular mechanism for genetic diversity and phenotypic evolution.

Key words: pan-genome, presence and absence variations, core genome, variable genome