A platinum standard pan-genome resource that represents the population structure of Asian rice
Authors
Scientific Data volume 7, Article number: 113 (2020)
Abstract
As the human population grows from 7.8 billion to 10 billion over the next 30 years, breeders must do everything possible to create crops that are highly productive and nutritious, while simultaneously having less of an environmental footprint. Rice will play a critical role in meeting this demand and thus, knowledge of the full repertoire of genetic diversity that exists in germplasm banks across the globe is required. To meet this demand, we describe the generation, validation and preliminary analyses of transposable element and long-range structural variation content of 12 near-gap-free reference genome sequences (RefSeqs) from representatives of 12 of 15 subpopulations of cultivated Asian rice. When combined with 4 existing RefSeqs, that represent the 3 remaining rice subpopulations and the largest admixed population, this collection of 16 Platinum Standard RefSeqs (PSRefSeq) can be used as a template to map resequencing data to detect virtually all standing natural variation that exists in the pan-genome of cultivated Asian rice.
Measurement(s) | genome • DNA • sequence_assembly • sequence feature annotation • physical map |
Technology Type(s) | DNA sequencing • PacBio Sequel System • sequence assembly process • transposable elements annotation • Optical Mapping Illumina sequencing |
Factor Type(s) | Oryza sativa cv. variety |
Sample Characteristic - Organism | Oryza sativa |
Machine-accessible metadata file describing the reported data: https://doi.org/10.6084/m9.figshare.11950596