Supplementary Data for "Plastid-localized xanthorhodopsin increases diatom biomass and ecosystem productivity in iron-limited surface oceans"
The repository contains supplementary data and metadata used in the Nature Microbiology research paper Plastid-localized xanthorhodopsin increases diatom biomass and ecosystem productivity in iron-limited surface oceans reporting on a xanthorhodopsin variant FcR1 isolated from the polar diatom Fragilariopsis cylindrus.
A predicted (3D) structure of FcR1 was generated using ColabFold. The Protein Data Bank (PDB) was scanned for homologous structures using a Dali search. The experimental structure of PDB entry 4HYJ, a top five search hit was manually chosen based on highest quality according to PDB indicators and used for structural alignment with FcR1 using ChimeraX to position the retinal in the predicted FcR1 structure. The final FcR1 model was generated by removing the structure of PDB ID 4HYJ and visualized using ChimeraX.
A secondary structure of FcR1 with key residues highlighted was generated using Protter v1.0 and manually edited.
Rhodopsin sequences focusing on eukaryotic xanthorhodopsin protein sequences were collected from the scientific literature and supplemented with manually curated sequences from BlastP search results of the Fragilariopsis cylindrus Rhodopsin 1 (FcR1) protein sequence (UniProt: A0A1E7EXA4) against NCBI databases, a custom database from Beja Lab, as well as similar sequences at an amino acid sequence identity threshold of ≥50% based on their membership in the UniRef50 cluster built using FcR1 as seed sequence (UniRef50_A0A7S2RIV1). The selected set of proteins was further supplemented with a manually curated set of outgroup sequences including characterized members of the proteorhodopsin subfamily totalling 242 sequences.
Sequences were aligned using MAFFT with default parameters and positions with 25% or more gaps were masked. The maximum-likelihood (ML) reference reconstruction was built using RAxML v.8.277 under the LG+G4 model (selected using IQ-TREE with the corrected Akaike information criterion).
The polar diatom F. cylindrus was grown at different concentrations of silicate ranging from 0.01 - 100 µM to determine mean growth rates and its half-saturation constant Km for silicate.
Vector | Description | Reference |
---|---|---|
pGEMHE | Xenopus laevis expression vector pGEMHE containing 5’ and 3’UTRs from a Xenopus beta-globin gene | Kreig, P. A., and Melton, D.A. (1984), Nucl. Acids Res. 72, 7057-70; Liman, E. R. et al. (1992), Neuron 9(5): 861-71 |
pPha-T1 | Phaeodactylum tricornutum transformation vector containing fucoxanthin chlorophyll-binding protein (fcp) regulatory sequences to drive constitutive expression of bleomycin resistance protein (Ble) conferring zeocin resistance and gene of interest | GenBank:AF219942; Zaslavskaia et al. (2001), J. Phycol. 36(2): 379 |
StuI-GFP-pPha-T1 | Phaeodactylum tricornutum transformation vector for generation of eGFP fusion proteins. To generate GFP fusion constructs, the sequence of interest is to be cloned into the StuI restriction site at 5’ end of eGFP. Derivate of the pPha-T1 vector | Gruber et al. (2007), Plant Mol. Biol. 64(5): 519; Zaslavskaia et al. (2001), J. Phycol. 36(2): 379 |
pTpFcp-GFP/FcpNat | Thalassiosira pseudonana transformation vector for constitutive expression of eGFP fusion proteins | Poulsen et al. (2006), J. Phycol. 42(5):1059; Scheffel et al. (2011), PNAS 108(8):3175 |
data/FcR1_A0A1E7EXA4.cif
- AlphaFold2 structure prediction of FcR1 with retinal in crystallographic information file (CIF) formatdata/Xanthorhodopsins_Strauss_et_al2023_AlignmentWithGaps.fasta
- raw alignmentdata/Xanthorhodopsins_Strauss_et_al2023_MaskedAlignment.fasta
- masked alignmentdata/Xanthorhodopsins_Strauss_et_al2023_RAxML.tree
- Newick tree filedata/Strauss_SI_Fig1_SourceData.csv
- Growth rates of F. cylindrus at different silicate concentrationsdata/pGEMHE.gb
- Genbank flat file for Xenopus laevis oocyte expression vector pGEMHEdata/pPha-T1.gb
- Genbank flat file for Phaeodactylum tricornutum transformation vector pPha-T1data/StuI-GFP-pPha-T1.gb
- Genbank flat file for Phaeodactylum tricornutum transformation vector StuI-GFP-pPha-T1data/pTpFcpGFP-FcpNat.gb
- Genbank flat file for Thalassiosira pseudonana transformation vector pTpFCP-GFP/fcpNatmetadata/Xanthorhodopsins_Strauss_et_al2023_metadata.csv
- metadata file containing xanthorhodopsin sequence data
Column | Description |
---|---|
Name | Sequence ID/accession |
Accession | NCBI accession version |
Description | Sequence/NCBI definition line |
Organism | Organism |
Sequence | Amino acid sequence |
Taxonomy | Taxonomic lineage |
db_xref | NCBI taxonomy ID |
protein_family | Rhodopsin clade/family |
taxon_group | Taxonomic group |