The representative genomes available here are non-redundant collections of genomes which include the highest quality genome from every specI species cluster. As many specI clusters could be assigned to a habitat, we also provide habitat specific sets of representative genomes.
| Type | Contigs | Genes | Proteins |
|---|---|---|---|
| Representative genomes | contigs.representatives.fasta.gz | genes.representatives.fasta.gz | proteins.representatives.fasta.gz |
| Aquatic | aquatic.contigs.fa.gz | aquatic.genes.fa.gz | aquatic.proteins.fa.gz |
| Disease associated | disease_associated.contigs.fa.gz | disease_associated.genes.fa.gz | disease_associated.proteins.fa.gz |
| Food associated | food_associated.contigs.fa.gz | food_associated.genes.fa.gz | food_associated.proteins.fa.gz |
| Freshwater | freshwater.contigs.fa.gz | freshwater.genes.fa.gz | freshwater.proteins.fa.gz |
| Host associated | host_associated.contigs.fa.gz | host_associated.genes.fa.gz | host_associated.proteins.fa.gz |
| Host plant associated | host_plant_associated.contigs.fa.gz | host_plant_associated.genes.fa.gz | host_plant_associated.proteins.fa.gz |
| Sediment mud | sediment_mud.contigs.fa.gz | sediment_mud.genes.fa.gz | sediment_mud.proteins.fa.gz |
| Soil | soil.contigs.fa.gz | soil.genes.fa.gz | soil.proteins.fa.gz |
| Type | File |
|---|---|
| Habitats per isolate | proGenomes2.1_habitat_isolates.tab |
| Habitats per specI cluster | proGenomes2.1_habitat_specI.tab |
| Marker genes | proGenomes2.1_markerGenes.tar.gz |
| SpecI clustering data | proGenomes2.1_specI_clustering.tab |
| NCBI taxonomy | proGenomes2.1_specI_lineageNCBI.tab |