Performs mapping of residue conservation into PDB structures
To derive residue conservation scoring from a multiple alignment of homologous proteins and map it into the structure of one of the aligned sequences
- PDB file of the solved structure
- Protein sequences of homologs
- MUSCLE multiple alignment program
- Web server address: http://www.ebi.ac.uk/Tools/msa/muscle
- The source file can be downloaded from: http://www.drive5.com/muscle
- PyMOL molecular graphics program
- Conservation mapping program (installation is described below)
- Installation of the conservation mapping program
- Open Terminal (available only in Mac OS and Linux)
- Make sure that pip is installed by running this command in the Terminal:
- easy_install pip
- Install the conservation mapping program using pip with the following the Terminal:
- pip install git+https://github.com/joemarct/map-conservation.git
- Preparation of the structure and derived sequence
- Get the amino acid sequence of the solved structure in FASTA format. If you do not have it separately, you can extract the sequence from the PDB file by opening it in PyMOL and saving the molecule in FASTA format.
- If you have exported your sequence from PyMOL, check the sequence to make sure the written sequence matches with what is displayed in PyMOL. Correct the exported sequence, if necessary.
- Rename the PDB file into structure.pdb.
- Multiple sequence alignment
- Convert the sequence file of the homologs into FASTA format, if it is not in that format yet.
- Append the sequence of the solved structure into the sequence file of the homologs.
- Do a multiple alignment of the combined sequence file using MUSCLE. Manually check the alignment and remove odd sequences, if necessary. You may use Jalview for this (www.jalview.org).
- Rename the output alignment file into input_alignment.fasta.
- Residue conservation scoring and mapping
- Put the structure.pdb and input_alignment.fasta file into a single folder / directory.
- Open Terminal, change directory to the working folder with the following command:
- cd [path_to_the_directory]
- Execute the conservation mapping script by issuing the following command:
- map_conservation.py
- Drag the resulting conservation_mapping.py file into PyMOL. This will show the color mapping of the residue conservation. By default, the color spectrum is yellow-white-blue in order of increasing conservation. Hence, the least conserved residues are in yellow and the more conserved ones are in blue.