This page describes the new (December 2017) high-quality genome assembly for peanut (Arachis hypogaea), cultivar "Tifrunner," along with the assemblies for the two wild progenitors of cultivated peanut, A. duranensis and A. ipaensis. Tifrunner is an important U.S. variety, with good market and growth characteristics and resistance to several peanut diseases (early and late leaf spot and TSWV/spotted wilt).

These genome assemblies are products of the International Peanut Genome Initiative, produced to accelerate breeding progress and to get more productive, disease-resistant, stress-tolerant varieties to farmers.

Cultivated peanut, Arachis hypogaea cv. Tifrunner: assembly, annotation
Citation: Bertioli et al. 'The genome sequence of segmental allotetraploid peanut Arachis hypogaea.' Nature Genetics 2019 May;51(5):877-884.
Genome browser: GBrowse and JBrowse
Downloads: Arachis hypogaea cv. Tifrunner data store
GenBank: assembly GCA_003086295.2, downloads

Diploid progenitor Arachis duranensis: assembly, annotation
Citation: Bertioli et al. 'The genome sequences of Arachis duranensis and Arachis ipaensis, the diploid ancestors of cultivated peanut.' Nature Genetics 2016 Apr;48(4):438-446.
Genome browser: GBrowse and JBrowse
Downloads: Arachis duranensis data store
GenBank: assembly GCF_000817695.2, downloads

Diploid progenitor Arachis ipaensis: assembly, annotation
Citation: Bertioli et al. 'The genome sequences of Arachis duranensis and Arachis ipaensis, the diploid ancestors of cultivated peanut.' Nature Genetics 2016 Apr;48(4):438-446.
Genome browser: GBrowse and JBrowse
Downloads: Arachis ipaensis data store
GenBank: assembly GCF_000816755.2, downloads

Additional details about the A. hypogaea assembly:
The assembly size is 2,556 Mbp, which we estimate to span more than 99% of the actual genome. The scaffold N50 (a measure of assembly contiguity) is 135.2 Mbp, which is on the scale of complete peanut chromosomes. A total of 48.25x of PacBio sequence (average read length 11,525 bp) was used to generate the initial assembly, which was subsequently polished using Illumina sequences and Arrow. Homozygous SNPs and indels were corrected in the release sequence using ~40x of Illumina reads (2x250, 800 bp insert, library IDs ICIH and ICID). Synteny with the diploid A. duranensis and A. ipaensis, along with 1 genetic map and 2 synthetic maps (provided by David Bertioli), was used to identify misjoins in the raw assembly. The resulting assembly was then scaffolded using Hi-C data. After scaffolding, 6 additional breaks were made to resolve misjoins introduced during the scaffolding procedure.
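For readers unfamiliar with the N50 statistic mentioned above, the following is a minimal sketch of one common way to compute it: sort the sequence lengths from longest to shortest and report the length at which the running total first reaches half of the summed length. The function name `n50` and the toy lengths are illustrative assumptions, not the assembly's actual scaffold lengths or the pipeline used to produce the reported value.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    N50 is the length L such that sequences of length >= L together
    contain at least half of the total assembled bases.
    """
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length
    raise ValueError("no sequence lengths were given")


# Toy example (hypothetical lengths, not real peanut scaffolds).
toy_lengths = [80, 70, 50, 40, 30, 20, 10]
print(n50(toy_lengths))  # prints 70
```

In the toy example the total is 300, so the running sum reaches the halfway point of 150 at the second-longest sequence. An N50 close to chromosome length, as reported for this assembly, indicates that most of the sequence lies in chromosome-scale scaffolds.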

The original sequences were combined with the duplicated tetrasomic regions and joined together using 26 joins to create the 20 A. hypogaea chromosomes. During the construction of the chromosomes, all 500 bp scaffolded gaps were converted to 1,000 bp gaps, and the map joins that were added consisted of 10,000 bp gaps. Chromosomes were numbered Arahy.01-Arahy.20, where the A genome is represented by Arahy.01-Arahy.10 and the B genome by Arahy.11-Arahy.20. In total, 99.3% of the assembled sequence is contained in the chromosomes.
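The numbering scheme above can be applied programmatically when sorting features by subgenome. The sketch below assumes sequence identifiers are written exactly as on this page (for example "Arahy.03"); identifiers in the downloadable files may carry additional prefixes, in which case the pattern would need adjusting. The function name `subgenome` is a hypothetical helper, not part of any released tool.

```python
import re


def subgenome(chrom_name):
    """Map a chromosome name such as 'Arahy.03' to its subgenome.

    Follows the numbering described on this page:
    Arahy.01-Arahy.10 -> A genome, Arahy.11-Arahy.20 -> B genome.
    """
    match = re.fullmatch(r"Arahy\.(\d{2})", chrom_name)
    if match is None:
        raise ValueError(f"unrecognized chromosome name: {chrom_name!r}")
    number = int(match.group(1))
    if 1 <= number <= 10:
        return "A"
    if 11 <= number <= 20:
        return "B"
    raise ValueError(f"chromosome number out of range: {chrom_name!r}")


print(subgenome("Arahy.03"))  # prints A
print(subgenome("Arahy.17"))  # prints B
```

Names that do not match the pattern, such as unplaced scaffolds, raise an error instead of being silently assigned to a subgenome.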