Sequencing data, assemblies, and pangenomes are stored in publicly accesible cloud buckets, the AnVIL data ecosystem, and in SRA/ENA/DDBJ.
- Data is uploaded to both AWS S3 and Google Cloud buckets. GitHub repositories have been created and include details about the data generation as well as index files with locations of the data stored in S3 and GCP. The HPRC S3 bucket does not charge egress fees making it a good option if you would like to download data to your local machine.
- AnVIL is a cloud environment that allows you to view the data organized in convenient data tables that refer to copies of the data in GCP. AnVIL also includes a workflow runner so you can analyze the data withough.
- All data is also uploaded to INSDCs (SRA/ENA/DDBJ) in BioProjects for sequencing data, assemblies, and pangenomes.