Create file with X% consensus annotations ("PCG pangenome"). Save to files parsed list in BiGG/KEGG/EC reaction format.
This script generates a list of core functions for a given phylogenetic core group (PCG). This list is considered that PCG's pangenome, defined as annotations present in at least 90% of available genomes for each PCG. The input for this tool is a path containing .tsv annotation files generated with eggNOG-mapper, for which a consensus annotation file will be generated. The core reactions can be retrieved in KEGG, EC and in BiGG format. A file containing descriptions corresponding to EC and KEGG entries is also generated.
More reactions can be added manually using tools like MetaNetX or by creating custom ko/EC files
This script reads the files (either BiGG or KEGG) from the first step and goes through each PCG, comparing its reactions with the reactions of other groups, and keeping track of the unique reactions for each group in a dictionary.
This folder contains the specific version of script #1 used at "Leveraging phylogenetic signal to unravel microbial community function and assembly rules" by Talavera-Marcos, Parras-Moltó and Aguirre de Cárcer