Data download

Representative genome sets

The representative genomes available here are non-redundant collections of genomes which include the highest quality genome from every specI species cluster. As many specI clusters could be assigned to a habitat, we also provide habitat specific sets of representative genomes.

TypeContigsGenesProteins
Representative genomes contigs.representatives.fasta.gz genes.representatives.fasta.gz proteins.representatives.fasta.gz
Aquatic aquatic.contigs.fa.gz aquatic.genes.fa.gz aquatic.proteins.fa.gz
Disease associated disease_associated.contigs.fa.gz disease_associated.genes.fa.gz disease_associated.proteins.fa.gz
Food associated food_associated.contigs.fa.gz food_associated.genes.fa.gz food_associated.proteins.fa.gz
Freshwater freshwater.contigs.fa.gz freshwater.genes.fa.gz freshwater.proteins.fa.gz
Host associated host_associated.contigs.fa.gz host_associated.genes.fa.gz host_associated.proteins.fa.gz
Host plant associated host_plant_associated.contigs.fa.gz host_plant_associated.genes.fa.gz host_plant_associated.proteins.fa.gz
Sediment mud sediment_mud.contigs.fa.gz sediment_mud.genes.fa.gz sediment_mud.proteins.fa.gz
Soil soil.contigs.fa.gz soil.genes.fa.gz soil.proteins.fa.gz

Other datasets

TypeFile
Habitats per isolateproGenomes2.1_habitat_isolates.tab
Habitats per specI clusterproGenomes2.1_habitat_specI.tab
Marker genesproGenomes2.1_markerGenes.tar.gz
SpecI clustering dataproGenomes2.1_specI_clustering.tab
NCBI taxonomyproGenomes2.1_specI_lineageNCBI.tab