HPRC Data Release 2

Announcing the Human Pangenome Reference Consortium Data Release 2

May 12, 2025

We are excited to announce the second data release (Release 2) of the Human Pangenome Reference Consortium (HPRC) in pre-publication form. Release 2 includes sequencing data and high-quality phased genomes from over 200 individuals, a nearly fivefold increase over Release 1. This open-access release represents a substantial step forward in our effort to build a more complete, accurate, and representative human pangenome reference. It is also the first release to incorporate samples sequenced and assembled by international partners, including collaborators at the University of Tokyo in Japan and the Human Technopole in Italy. In addition to the National Human Genome Research Institute (NHGRI), the project has been generously supported by company partners, including PacBio, Oxford Nanopore Technologies, Illumina, Dovetail Genomics, Google, and Amazon Web Services (AWS).

Highlights of HPRC Release 2 Sequencing Data:

Highlights of HPRC Release 2 Assemblies:

These new assemblies and the underlying sequencing data are a cornerstone of the pangenome reference, enabling researchers to better identify and interpret genetic variation, particularly in complex and repetitive regions of the genome that were previously challenging to analyze with a single linear reference. Additional resources forming the complete second release of the Human Pangenome will soon follow, including:

Data Access:

All data from HPRC Release 2, including the new assemblies, is publicly available in our AWS S3 bucket as well as public nucleotide archives (INSDC). You can learn more about the data and access download links in a number of ways:

We encourage the community to explore and utilize this valuable resource to advance our understanding of human genetic variation and its impact on health and disease. For scientists interested in publishing using this pre-publication data, please see our consortium publication policy and best practices for using the data.