This page describes the new (December 2017) high-quality genome assembly for peanut (Arachis hypogaea), cultivar "Tifrunner." Tifrunner is an important U.S. variety, with good market and growth characteristics and resistance to several peanut diseases (early and late leaf spot and TSWV/spotted wilt).

This is a project of the International Peanut Genome Initiative, in order to accelerate breeding progress and get more productive, disease-resistant, stress-tolerant varieties to farmers. The IPGI project has sequenced the genomes of the two diploid progenitors of cultivated peanut, as well as the genome of cultivated peanut itself.

Please note that this assembly is released under the Ft. Lauderdale agreement, in which this assembly is made available with the understanding that no group may publish research on the full genome until the first formal description is published by the IPGI group.

For cultivated peanut, Arachis hypogaea cv. Tifrunner: assembly (note usage agreement)
Genome browser: GBrowse and JBrowse
Downloads: Arachis hypogaea cv. Tifrunner data store

For the diploid progenitor Arachis duranensis: assembly, annotation
Genome browser: GBrowse and JBrowse
Downloads: Arachis duranensis data store
GenBank: Assembly GCF_000817695.2, downloads

For the diploid progenitor Arachis ipaensis: assembly, annotation
Genome browser: GBrowse and JBrowse
Downloads: Arachis ipaensis data store
GenBank: Assembly GCF_000816755.2, downloads

Additional details about the A. hypogaea assembly:
The assembly size is 2,556 Mbp, which we estimate to span more than 99% of the actually genome. The scaffold N50 (a measure of the assembly contiguity) is 135.2 MB (the scale of the complete peanut chromosomes). A total of 48.25x of PACBIO sequence (avg. read length of 11,525) was used to generate the initial assembly, which was subsequently polished using Illumina sequences and ARROW. Homozygous SNPs and INDELs were corrected in the release sequence using ~40x of illumina reads (2x250, 800bp insert, library ID ICIH and ICID). Synteny with the diploid A. duranensis and A. ipaensis, along with 1 genetic map and 2 synthetic maps (provided by David Bertioli) were used to identify misjoins in the raw assembly. The resulting assembly was then scaffolded using HiC data. Post scaffolding, 6 additional breaks were made to resolve misjoins introduced during the scaffolding procedure.

The original sequences were combined with the duplicated tetrasomic regions and joined together using 26 joins to create the 20 A. hypogaea chromosomes. During the construction of the chromosomes, all 500bp scaffolded gaps were converted to 1,000 bp gaps, and the map joins that were added consisted of 10,000 bp gaps. Chromosomes were numbered as Arahy.01-Arahy.20, where the A genome is represented as Arahy.01-Arahy.10 and the B genome is represented as Arahy.11-Arahy.20. 99.3% of the assembled sequence is contained in the chromosomes.