GFOBAP:FAA|Trees|Metadata

Orthogroups: Pangenome Matrices, Pangenome Dataframes and Fasta files

The core of the comparative genomics analysis presented in our articles relies in the computation of orthogroups across a set of genomes. In our case, because we had 9 different taxonomic groups, we computed 9 different sets of orthogroups using Orthofinder.

If you are interested in reproducing the orthogroups we constructed please refer to the following scripts and guide.

Scripts to compute orthogroups using Orthofinder and Diamond as a search engine

Given the orthogroups computed, we reconstructed pangenome matrices that depicts the abundance of each orthogroups across the genomes in a given dataset. The matrices files provided are tab delimited files with orthogroups as rows and genomes as columns.

Other relevant structures we share in this section are dataframes that show which coding sequences (CDS) in each genome belong to which orthogroup.

Finally, we also provide the fasta aminoacid file of each Orthogroup computed for all the 9 taxonomic groups analyzed. In the MAFFT|HMM section we utilized these files to reconstruct alignments and HMM profiles for the orthogroups.

The script utilized to compute the files described above can be found in the next link.

Scripts to compute pangenome matrices, dataframes and orthogroups fasta files

Reference

If you utilize any resource in this webpage please cite:
Levy, A, I Salas González, M. Mittelviefhaus, S Clingenpeel, S Herrera Paredes, J Miao, K Wang, G Devescovi, K Stillman, F Monteiro, BR Alvarez, DS Lundberg, T-Y Lu, S Lebeis, Z Jin, M McDonald, AP Klein, ME Feltcher, T Glavina del Rio, SR Grant, SL Doty, RE Ley, DA Pelletier, J Vorholt, SG Tringe†, T Woyke† and JL Dangl† (2017) Genomic determinants of bacterial adaptation to plants. Nature Genetics doi: 10.1038/s41588-017-0012-9.

Contact

Questions concerning the content in this website: isai@email.unc.edu

Design: TEMPLATED

Genomic features of

bacterial adaptation to plants

Dataset S6

Pangenome Matrices

Dataset S7

Pangenome Dataframes

Dataset S8

Orthogroups Fasta

Reference

Contact

Genomic features of bacterial adaptation to plants

Genomic features of

bacterial adaptation to plants