Structural basis for molecular assembly of fucoxanthin chlorophyll a/c-binding proteins in a diatom photosystem I supercomplex
eLife Assessment
This important study presents a high-resolution cryoEM structure of the supercomplex between photosystem I (PSI) and fucoxanthin chlorophyll a/c-binding proteins (FCPs) from the model diatom Thalassiosira pseudonana CCMP1335, revealing subunits, protein:protein interactions and pigments not previously seen in other diatoms or red/green photosynthetic lineages. Combining structural, sequence and phylogenetic analyses, the authors provide convincing evidence of conserved motifs crucial for the binding of FCPIs, accompanied by interesting speculation about the mechanisms governing the assembly of PSI-FCPI supercomplexes in diatoms and their implications for related PSI-LHCI supercomplexes in plants. The findings set the stage for functional experiments that will further advance the fields of photosynthesis, bioenergy, ocean biogeochemistry and evolutionary relationships between photosynthetic organisms.
https://doi.org/10.7554/eLife.99858.3.sa0
Important: Findings that have theoretical or practical implications beyond a single subfield
Convincing: Appropriate and validated methodology in line with current state-of-the-art
During the peer-review process the editor and reviewers write an eLife Assessment that summarises the significance of the findings reported in the article (on a scale ranging from landmark to useful) and the strength of the evidence (on a scale ranging from exceptional to inadequate).
Abstract
Photosynthetic organisms exhibit remarkable diversity in their light-harvesting complexes (LHCs). LHCs are associated with photosystem I (PSI), forming a PSI-LHCI supercomplex. The number of LHCI subunits, along with their protein sequences and pigment compositions, has been found to differ greatly among the PSI-LHCI structures. However, the mechanisms by which LHCIs recognize their specific binding sites within the PSI core remain unclear. In this study, we determined the cryo-electron microscopy structure of a PSI supercomplex incorporating fucoxanthin chlorophyll a/c-binding proteins (FCPs), designated as PSI-FCPI, isolated from the diatom Thalassiosira pseudonana CCMP1335. Structural analysis of PSI-FCPI revealed five FCPI subunits associated with a PSI monomer; these subunits were identified as RedCAP, Lhcr3, Lhcq10, Lhcf10, and Lhcq8. Through structural and sequence analyses, we identified specific protein–protein interactions at the interfaces between FCPI and PSI subunits, as well as among FCPI subunits themselves. Comparative structural analyses of PSI-FCPI supercomplexes, combined with phylogenetic analysis of FCPs from T. pseudonana and the diatom Chaetoceros gracilis, underscore the evolutionary conservation of protein motifs crucial for the selective binding of individual FCPI subunits. These findings provide significant insights into the molecular mechanisms underlying the assembly and selective binding of FCPIs in diatoms.
Introduction
Oxygenic photosynthesis in cyanobacteria, algae, and land plants converts solar energy into chemical energy and releases molecular oxygen into the atmosphere (Blankenship, 2021). The conversion of light energy takes place within two multi-subunit membrane protein complexes, known as photosystem I (PSI) and photosystem II (PSII), which perform light harvesting, charge separation, and electron transfer reactions (Golbeck, 1992; Brettel and Leibl, 2001; Shen, 2015; Shevela et al., 2023). To optimize light energy capture, numerous light-harvesting antenna subunits are associated with the periphery of the PSI and PSII core complexes, transferring excitation energy to the respective photosystem cores (Blankenship, 2021). These light-harvesting antennae exhibit significant diversity among photosynthetic organisms, both in protein sequences and pigment compositions, and can be broadly categorized into two major groups: membrane proteins and water-soluble proteins (Blankenship, 2021).
The membrane protein category primarily consists of the light-harvesting complex (LHC) protein superfamily (Engelken et al., 2010; Sturm et al., 2013), which absorbs light energy through chlorophylls (Chls) and carotenoids (Cars). The number and types of Chls and Cars vary significantly among LHCs, which can be grouped into green and red lineages, leading to color diversity in photosynthetic organisms (Falkowski et al., 2004). The green lineage includes green algae and land plants, while the red lineage encompasses red algae, diatoms, haptophytes, cryptophytes, and dinoflagellates (Falkowski et al., 2004). LHCs specific to PSI (LHCIs) bind to a eukaryotic PSI monomer, forming a PSI-LHCI supercomplex (Hippler and Nelson, 2021; Shen, 2022), the structures of which have been revealed by cryo-electron microscopy (cryo-EM) in various eukaryotes (Hippler and Nelson, 2021; Shen, 2022). In the red lineage, the number of LHCIs and their protein sequences and pigment compositions exhibit considerable variation among the PSI-LHCI structures of red algae (Pi et al., 2018; Antoshvili et al., 2019; You et al., 2023; Kato et al., 2024), a diatom (Nagao et al., 2020a; Xu et al., 2020), a cryptophyte (Zhao et al., 2023), and dinoflagellates (Li et al., 2024; Zhao et al., 2024).
Recently, we demonstrated the conservation and diversity of LHCIs among red-lineage algae through structural and phylogenetic analyses of PSI-LHCI supercomplexes (Kato et al., 2024). That study revealed that while the binding sites of LHCIs to PSI were conserved to some extent among red-lineage algae, their evolutionary relationships were weak. It is known that LHCIs have similar overall protein structures across photosynthetic organisms, with particular similarity in their three-transmembrane helices, regardless of whether they belong to the green or red lineages (Hippler and Nelson, 2021; Shen, 2022). However, individual LHCIs have altered their sequences and structures to adapt their respective binding sites to the PSI cores during the assembly of PSI-LHCI supercomplexes. These observations raise a critical question: how do LHCIs recognize their binding sites in the PSI core?
Diatoms are among the most essential phytoplankton in aquatic environments, playing a crucial role in the global carbon cycle, supporting marine food webs, and contributing significantly to nutrient cycling, thus ensuring the health and sustainability of marine ecosystems (Field et al., 1998). Diatoms possess unique LHCs known as fucoxanthin Chl a/c-binding proteins (FCPs), which differ in pigment composition and amino acid sequences from the LHCs of land plants (Green and Durnford, 1996; Büchel, 2020; Wang and Shen, 2021). Previous studies have reported the isolation and structural characterization of PSI-FCPI supercomplexes from the diatom Chaetoceros gracilis (Nagao et al., 2020a; Xu et al., 2020; Ikeda et al., 2008; Nagao et al., 2019a; Nagao et al., 2019b; Nagao et al., 2019c; Nagao et al., 2019d; Nagao et al., 2020c). Kumazawa et al. showed significant diversity in FCPs between C. gracilis and Thalassiosira pseudonana, with 46 and 44 FCPs identified, respectively (Kumazawa et al., 2022). These FCPs are categorized into multiple, closely related subgroups (Kumazawa et al., 2022), and their amino acid sequences are not entirely identical between the two diatoms. Consequently, comparing FCPIs, including their amino acid residues and protein structures at similar binding sites in PSI-FCPIs, may provide molecular insights into how FCPIs interact with PSI. However, an overall structure of the T. pseudonana PSI-FCPI supercomplex has yet to be solved.
In this study, we solved the structure of the PSI-FCPI supercomplex from T. pseudonana CCMP1335 at a resolution of 2.30 Å by cryo-EM single-particle analysis. The structure reveals a PSI-monomer core and five FCPI subunits. Structural and sequence comparisons highlight unique protein–protein interactions between each FCPI subunit and PSI. Based on these findings, we discuss the molecular assembly and selective binding mechanisms of FCPI subunits in diatom species.
Results and discussion
Overall structure of the T. pseudonana PSI-FCPI supercomplex
The PSI-FCPI supercomplexes were purified from the diatom T. pseudonana CCMP1335 and analyzed by biochemical and spectroscopic techniques (Figure 1—figure supplement 1). Notably, the protein bands of PSI-FCPI closely resembled those reported in a previous study (Ikeda et al., 2013). Cryo-EM images of the PSI-FCPI supercomplex were obtained using a JEOL CRYO ARM 300 electron microscope operated at 300 kV. The final cryo-EM map was determined at a resolution of 2.30 Å with C1 symmetry (Figure 1—figure supplements 2 and 3, and Table 1), based on the ‘gold standard’ Fourier shell correlation (FSC) = 0.143 criterion (Figure 1—figure supplement 3A).
The atomic model of PSI-FCPI was built based on the cryo-EM map obtained (see Methods; Figure 1—figure supplement 3 and Tables 1–3). The structure reveals a monomeric PSI core associated with five FCPI subunits (Figure 1A, B). The five FCPI subunits were named FCPI-1–5 (Figure 1A), following the nomenclature of LHCI subunits in the PSI-LHCI structure of Cyanidium caldarium RK-1 (NIES-2137) (Kato et al., 2024). Specifically, the positions of FCPI-1 and FCPI-2 in the T. pseudonana PSI-FCPI structure (Figure 1A) correspond to those of LHCI-1 and LHCI-2 in the C. caldarium PSI-LHCI structure. The PSI core comprises 94 Chls a, 18 β-carotenes (BCRs), 1 zeaxanthin (ZXT), 3 [4Fe-4S] clusters, 2 phylloquinones, and 6 lipid molecules, whereas the 5 FCPI subunits include 45 Chls a, 7 Chls c, 2 BCRs, 15 fucoxanthins (Fxs), 7 diadinoxanthins (Ddxs), and 3 lipid molecules (Table 3).
Structure of the T. pseudonana PSI core
The PSI core contains 12 subunits, 11 of which are identified as PsaA, PsaB, PsaC, PsaD, PsaE, PsaF, PsaI, PsaJ, PsaL, PsaM, and Psa29 (Figure 1B). The remaining subunit could not be assigned due to insufficient map resolution and was therefore modeled as polyalanines (Figure 1—figure supplement 4A). This unidentified subunit, designated as Unknown, occupies the same site as Psa28 in the C. gracilis PSI-FCPI (Nagao et al., 2020a). The structural comparison reveals that Unknown closely resembles Psa28 in the C. gracilis PSI-FCPI (Figure 1—figure supplement 4B). Psa28, a novel subunit identified in the C. gracilis PSI-FCPI structure (Nagao et al., 2020a), follows the previously established nomenclature rule (Kashino et al., 2002). Historically, genes encoding PSI proteins have been designated as psaA, psaB, and so forth. PsaZ was identified in the PSI cores of Gloeobacter violaceus PCC 7421 (Inoue et al., 2004; Kato et al., 2022). Subsequent discoveries led to the designation of a new subunit as Psa27, which was identified in the PSI cores of Acaryochloris marina MBIC11017 (Tomo et al., 2008; Hamaguchi et al., 2021; Xu et al., 2021). Consequently, we designated this novel subunit as Psa28 (Nagao et al., 2020a). However, Xu et al. referred to this subunit as PsaR in the PSI-FCPI structure of C. gracilis (Xu et al., 2020).
Psa29 is newly identified in the T. pseudonana PSI-FCPI structure using ModelAngelo (Jamali et al., 2024) and the NCBI database (https://www.ncbi.nlm.nih.gov/) (Figure 2). The subunit corresponding to Psa29 was also observed previously in the C. gracilis PSI-FCPI structures (Nagao et al., 2020a; Xu et al., 2020), where it was modeled as polyalanines and referred to as either Unknown1 (Nagao et al., 2020a) or PsaS (Xu et al., 2020). Psa29 exhibits a unique structure distinct from the other PSI subunits in the T. pseudonana PSI-FCPI (Figure 2A and Figure 1—figure supplement 4C) and engages in multiple interactions with PsaB, PsaC, PsaD, and PsaL at distances of 2.5–3.2 Å (Figure 2B–G). Sequence analyses suggest that Psa29 has undergone evolutionary divergence between Bacillariophyceae (diatoms) and Bolidophyceae, the latter of which is a sister group of diatoms within Stramenopiles (Figure 2H), although this subunit has not been found in other organisms. The arrangement of PSI subunits in the T. pseudonana PSI-FCPI is virtually identical to that in the C. gracilis PSI-FCPI structures already reported (Nagao et al., 2020a; Xu et al., 2020). However, the functional and physiological roles of Psa29 remain unclear at present. It is evident that Psa29 does not have any pigments, quinones, or metal complexes, suggesting no contribution of Psa29 to electron transfer reactions within PSI. Further mutagenesis studies will be necessary to investigate the role of Psa29 in diatom photosynthesis.
The number and arrangement of Chls and Cars within the PSI core in the T. pseudonana PSI-FCPI structure (Figure 1—figure supplement 4D, E) are largely similar to those in the C. gracilis PSI-FCPI structure (Nagao et al., 2020a). However, Chl a102 of PsaI is found in the T. pseudonana PSI-FCPI structure but not in the C. gracilis PSI-FCPI structure (Nagao et al., 2020a), whereas Chl a844 of PsaA and BCR843 of PsaB are identified in the C. gracilis PSI-FCPI structure (Nagao et al., 2020a) but not in the T. pseudonana PSI-FCPI structure. One of the Car molecules in PsaJ is identified as ZXT103 in the T. pseudonana PSI-FCPI structure, while it is BCR103 in the C. gracilis PSI-FCPI structure (Nagao et al., 2020a).
Structure of the T. pseudonana FCPIs
Kumazawa et al. classified 44 Lhc genes in T. pseudonana, designating them as Lhcf, Lhcq, Lhcr, Lhcx, Lhcz, and CgLhcr9 homologs (Kumazawa et al., 2022). Based on this classification, the five FCPI subunits in the PSI-FCPI structure are identified as the products of five genes: RedCAP, Lhcr3, Lhcq10, Lhcf10, and Lhcq8, corresponding to FCPI-1–5, respectively (Figure 1A). It is important to note that RedCAP is not included among the 44 Lhc genes (Kumazawa et al., 2022) but is classified within the LHC protein superfamily (Engelken et al., 2010; Sturm et al., 2013). For the assignment of each FCPI subunit, we focused on characteristic amino acid residues derived from their cryo-EM map, especially S61/V62/Q63 in FCPI-1; A70/R71/W72 in FCPI-2; Y64/R65/E66 in FCPI-3; M63/R64/Y65 in FCPI-4; and A62/R63/R64 in FCPI-5 (Figure 1—figure supplement 5). The root mean square deviations of the structures between FCPI-4 and the other four FCPIs range from 1.91 to 3.73 Å (Table 4).
Each FCPI subunit binds several Chl and Car molecules: 7 Chls a/1 Chl c/2 Fxs/3 Ddxs/2 BCRs in FCPI-1; 10 Chls a/1 Chl c/3 Fxs/1 Ddx in FCPI-2; 7 Chls a/3 Chls c/2 Fxs/2 Ddxs in FCPI-3; 11 Chls a/2 Chls c/4 Fxs in FCPI-4; and 10 Chls a/4 Fxs/1 Ddx in FCPI-5 (Figure 1—figure supplement 6A–E and Table 3). The axial ligands of the central Mg atoms of Chls within each FCPI are primarily provided by the main and side chains of amino acid residues (Table 5). Potential excitation-energy-transfer pathways can be proposed based on the close physical interactions among Chls between FCPI-3 and PsaA, between FCPI-3 and PsaL, between FCPI-1 and PsaI, and between FCPI-2 and PsaB (Figure 1—figure supplement 7).
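The per-subunit pigment counts listed above can be cross-checked against the totals given for the five FCPI subunits in the overall-structure section. The following is a minimal bookkeeping sketch in Python, not part of the authors' workflow; the dictionary layout and the function name total_pigments are illustrative assumptions, while the numbers are taken from the text.

```python
from collections import Counter

# Per-subunit pigment counts as listed in the text (see also Table 3).
FCPI_PIGMENTS = {
    "FCPI-1": {"Chl a": 7, "Chl c": 1, "Fx": 2, "Ddx": 3, "BCR": 2},
    "FCPI-2": {"Chl a": 10, "Chl c": 1, "Fx": 3, "Ddx": 1},
    "FCPI-3": {"Chl a": 7, "Chl c": 3, "Fx": 2, "Ddx": 2},
    "FCPI-4": {"Chl a": 11, "Chl c": 2, "Fx": 4},
    "FCPI-5": {"Chl a": 10, "Fx": 4, "Ddx": 1},
}

# Totals for the five FCPI subunits as stated in the overall-structure section.
REPORTED_TOTALS = {"Chl a": 45, "Chl c": 7, "BCR": 2, "Fx": 15, "Ddx": 7}


def total_pigments(per_subunit):
    """Sum pigment counts over all subunits (hypothetical helper)."""
    totals = Counter()
    for counts in per_subunit.values():
        totals.update(counts)  # adds the counts key by key
    return dict(totals)


totals = total_pigments(FCPI_PIGMENTS)
print(totals)
print("matches reported totals:", totals == REPORTED_TOTALS)
```

With the counts above, the summed values agree with the reported totals of 45 Chls a, 7 Chls c, 2 BCRs, 15 Fxs, and 7 Ddxs.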
Structural characteristics of RedCAP and its evolutionary implications
Among the FCPI subunits, only FCPI-1 contains two BCRs in addition to Fxs and Ddxs (Figure 1—figure supplement 6A, F). This is the first report of BCR binding to FCPIs in diatoms. FCPI-1 is identified as RedCAP, a member of the LHC protein superfamily but distinct from the LHC protein family (Engelken et al., 2010; Sturm et al., 2013); however, the functional and physiological roles of RedCAP remain unknown. FCPI-1 is positioned near PsaB, PsaI, and PsaL through protein–protein interactions with these subunits at both the stromal and lumenal sides (Figure 3A). At the stromal side, I138 and S139 of FCPI-1 interact with K121, G122, and F125 of PsaL (Figure 3B), whereas at the lumenal side, multiple interactions occur between I109 of FCPI-1 and F5 of PsaI, between T105/L106/T108 of FCPI-1 and W92/P94/F96 of PsaB, and between E102/W103 of FCPI-1 and S71/I73 of PsaL (Figure 3C). The protein–protein interactions at the lumenal side (Figure 3C) appear to be caused by a loop structure of FCPI-1 from Q96 to T116 (pink in Figure 3D), which is unique to FCPI-1 but absent in the other four FCPI subunits (pink in Figure 3E). This loop structure is inserted into a cavity formed by PsaB, PsaI, and PsaL (Figure 3C, D). These findings indicate that the Q96–T116 loop of FCPI-1 specifically recognizes and binds to the cavity provided by the PSI subunits.
RedCAP of C. gracilis (CgRedCAP) was not identified in the C. gracilis PSI-FCPI structures (Nagao et al., 2020a; Xu et al., 2020). On the basis of sequence analysis, we previously proposed (Kato et al., 2024) that CgRedCAP may bind to the C. gracilis PSI core at a site similar to that of LHCI-1 in the PSI-LHCI of the red alga C. caldarium. This site corresponds to the FCPI-1 site in the PSI-FCPI of T. pseudonana in this study. A sequence alignment between RedCAP of T. pseudonana (TpRedCAP) and CgRedCAP is shown in Figure 3—figure supplement 1A, showing 72% sequence similarity. CgRedCAP contains a protein motif, Q106–I113 (QWGTLATI), corresponding to E102–I109 (EWGTLATI) in TpRedCAP (Figure 3C). These findings suggest the potential binding of CgRedCAP to PSI in C. gracilis at a position similar to FCPI-1 in the T. pseudonana PSI-FCPI structure. However, it remains unclear (1) whether CgRedCAP is indeed bound to the C. gracilis PSI-FCPI supercomplex and (2) whether a loop structure corresponding to the Q96–T116 loop of TpRedCAP exists in CgRedCAP. Further structural studies of the C. gracilis PSI-FCPI are required to elucidate the molecular assembly mechanism of diatom RedCAPs.
RedCAPs have been found in the structures of PSI-LHCI in the red alga Porphyridium purpureum (You et al., 2023) and a PSI supercomplex with alloxanthin Chl a/c-binding proteins (PSI-ACPI) in the cryptophyte Chroomonas placoidea (Zhao et al., 2023), as summarized in our previous study (Kato et al., 2024). Both P. purpureum RedCAP (PpRedCAP) and C. placoidea RedCAP (CpRedCAP) exhibit loop structures similar to the Q96–T116 loop in TpRedCAP observed in the present study (Figure 3—figure supplement 1B). Multiple sequence alignments of TpRedCAP with PpRedCAP and CpRedCAP are shown in Figure 3—figure supplement 1C, revealing sequence similarities of 39% and 60%, respectively. PpRedCAP contains a protein motif of V105–L112 (VWGPLAQL), while CpRedCAP has a protein motif of Q117–A124 (QWGPLASA). These motifs correspond to E102–I109 (EWGTLATI) in TpRedCAP; however, the sequence conservation between TpRedCAP and PpRedCAP/CpRedCAP is lower than between TpRedCAP and CgRedCAP. Among the four RedCAPs, the amino acids Trp, Gly, Leu, and Ala are conserved in the protein motifs (xWGxLAxx), implying that this conserved loop structure contributes to the binding of RedCAP to PSI across the red-lineage algae.
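The conserved pattern xWGxLAxx follows directly from a column-by-column comparison of the four eight-residue motifs quoted above. The sketch below illustrates that comparison in Python; it is not the alignment procedure used for Figure 3—figure supplement 1, and the function name consensus and the wildcard character are assumptions made for illustration.

```python
# Eight-residue motifs quoted in the text, keyed by protein.
MOTIFS = {
    "TpRedCAP": "EWGTLATI",  # E102-I109
    "CgRedCAP": "QWGTLATI",  # Q106-I113
    "PpRedCAP": "VWGPLAQL",  # V105-L112
    "CpRedCAP": "QWGPLASA",  # Q117-A124
}


def consensus(motifs, wildcard="x"):
    """Keep a residue where every motif agrees; otherwise emit the wildcard."""
    seqs = list(motifs)
    if len({len(s) for s in seqs}) != 1:
        raise ValueError("motifs must have equal length")
    # zip(*seqs) walks the motifs column by column.
    return "".join(
        col[0] if len(set(col)) == 1 else wildcard for col in zip(*seqs)
    )


print(consensus(MOTIFS.values()))  # prints xWGxLAxx
```

Positions 2, 3, 5, and 6 (Trp, Gly, Leu, Ala) are identical in all four motifs, whereas the remaining positions vary.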
Protein–protein interactions of the other FCPI subunits
FCPI-2 (Lhcr3) is positioned near PsaB and PsaM, engaging in protein–protein interactions with these subunits at distances of 3.0–4.3 Å at both the stromal and lumenal sides (Figure 4). The amino acid residues I63/T65/D66/Y69/W134/Y138/D140 of FCPI-2 are associated with W153/L154/K159/F160/W166 of PsaB at the stromal side (Figure 4B), while F116 and F120 of FCPI-2 interact with F5/I9/M12 of PsaM at the lumenal side (Figure 4C). The amino acid sequences corresponding to I63–Y69, F116–F120, and W134–D140 in Lhcr3 are not conserved in the Lhcr subfamily, comprising Lhcr1, Lhcr4, Lhcr7, Lhcr11, Lhcr12, Lhcr14, Lhcr17, Lhcr18, Lhcr19, and Lhcr20, as reported by Kumazawa et al., 2022 (Figure 4—figure supplement 1).
FCPI-3 (Lhcq10) is positioned near PsaL, with protein–protein interactions at distances of 2.3–4.2 Å at the stromal side (Figure 5A, B). The amino acid residues L126/I130/L142/Y146/W147/V148/W155 of FCPI-3 are associated with F4/K6/P20/S25/L26/L30 of PsaL (Figure 5B). Given the homology between TpLhcq10 and CgLhcr9 (Kumazawa et al., 2022), we compared the amino acid sequence of Lhcq10 with the Lhcq and Lhcr subfamilies in T. pseudonana (Figure 5—figure supplement 1A, B). The sequence L126–W155 of Lhcq10 is not conserved in the Lhcq subfamily, comprising Lhcq1, Lhcq2, Lhcq3, Lhcq4, Lhcq5, Lhcq6, Lhcq7, Lhcq8, and Lhcq9 (Figure 5—figure supplement 1A), nor in the Lhcr subfamily, comprising Lhcr1, Lhcr3, Lhcr4, Lhcr7, Lhcr11, Lhcr12, Lhcr14, Lhcr17, Lhcr18, Lhcr19, and Lhcr20, as reported by Kumazawa et al., 2022 (Figure 5—figure supplement 1B).
FCPI-4 (Lhcf10) is positioned near FCPI-5 through protein–protein interactions with it at distances of 2.6–3.6 Å at the lumenal side (Figure 5A, C). The amino acid residues Y196/P198/F199 of FCPI-4 interact with F82/F86/G87 of FCPI-5 (Figure 5C). The amino acid sequence Y196–F199 of Lhcf10 is not conserved in the Lhcf subfamily, comprising Lhcf1, Lhcf2, Lhcf3, Lhcf4, Lhcf5, Lhcf6, Lhcf7, Lhcf8, Lhcf9, Lhcf11, and Lhcf12, as reported by Kumazawa et al., 2022 (Figure 5—figure supplement 2).
FCPI-5 (Lhcq8) is positioned near PsaL and FCPI-4 through protein–protein interactions at distances of 2.6–4.1 Å at both the stromal and lumenal sides (Figure 5A, C, D). The amino acid residues P108/Q109/A112/I115 of FCPI-5 interact with F134/I137/S141 of PsaL at the lumenal side (Figure 5D). The interactions between FCPI-5 and FCPI-4 are shown in Figure 5C. The amino acid sequences F82–G87 and P107–I115 of Lhcq8 are not conserved in the Lhcq subfamily, comprising Lhcq1, Lhcq2, Lhcq3, Lhcq4, Lhcq5, Lhcq6, Lhcq7, Lhcq9, and Lhcq10, as reported by Kumazawa et al., 2022 (Figure 5—figure supplement 3A, B).
Molecular insights into the assembly of FCPIs in diatom PSI-FCPI supercomplexes
To evaluate the molecular assembly of FCPI subunits in the T. pseudonana PSI-FCPI structure, we focused on protein–protein interactions based on their close proximities (Figures 3—5) and the amino acid residues in non-conserved regions among 44 FCPs (Figure 4—figure supplement 1, Figure 5—figure supplements 1–3). This approach is based on the premise that selective associations of FCPIs with PSI require specific amino acid residues unique to each FCPI. Protein–protein interactions among FCPI subunits, as well as between FCPI and PSI subunits, occur at both the stromal and lumenal sides (Figures 3—5), and are likely recognized by unique amino acid residues of FCPIs that are not conserved in each LHC subfamily (Figure 4—figure supplement 1, Figure 5—figure supplements 1–3). Thus, the binding and assembly of each FCPI subunit to PSI are likely determined by the amino acid sequences within the loop regions of the 44 FCPs in T. pseudonana.
The diatom C. gracilis exhibits two distinct PSI-FCPI structures: one with 16 FCPI subunits (Nagao et al., 2020a) and the other with 24 FCPI subunits (Xu et al., 2020). These structural variations arise from changes in the antenna sizes of FCPIs within the C. gracilis PSI-FCPI supercomplexes, in response to varying growth conditions, especially CO2 concentrations and temperatures (Nagao et al., 2020b). Notably, the C. gracilis PSI-FCPI structure contains five FCPI subunits located at the same binding sites as FCPI-1–5 in the T. pseudonana PSI-FCPI structure (Figure 6A). A summary of the relationship between the Lhc genes encoding FCPs, the distinct gene RedCAP, and the binding positions of FCPI-1–5 in T. pseudonana and C. gracilis is shown in Figure 6B. The gene nomenclature for the C. gracilis FCPIs follows the conventions established by Kumazawa et al., 2022, as discussed in our recent study (Kato et al., 2024).
Phylogenetic analysis clearly showed that at the FCPI-1, 2, 3, and 5 sites in the T. pseudonana PSI-FCPI structure, TpRedCAP, TpLhcr3, TpLhcq10, and TpLhcq8 are orthologous to CgRedCAP, CgLhcr1, CgLhcr9, and CgLhcq12, respectively (Figure 6C). The characteristic protein loops of TpRedCAP and CgRedCAP likely participate in interactions with PSI at the FCPI-1 site, as noted above (Figure 3—figure supplement 1). At the FCPI-2 site, comparative analyses revealed that the amino acid residues facilitating interactions between TpLhcr3 and TpPsaB/TpPsaM closely parallel those observed in the CgLhcr1-CgPsaB and CgLhcr1-CgPsaM pairs (Figure 6—figure supplement 1). Similarly, a high degree of similarity characterized the residues involved in the interaction pairs of TpLhcq10-TpPsaL/CgLhcr9-CgPsaL at the FCPI-3 site (Figure 6—figure supplement 2A, B), as well as TpLhcq8-TpPsaL/CgLhcq12-CgPsaL at the FCPI-5 site (Figure 6—figure supplement 2C, D), underscoring the conserved nature of these interactions. However, TpLhcf10 is not homologous to CgLhcf3 (Figure 6C), despite both being located at the FCPI-4 site in their respective PSI-FCPI structures (Figure 6A). These findings suggest that the two diatoms possess both a conserved mechanism of protein–protein interactions across characteristic protein motifs between FCPI and PSI subunits, and a different mechanism of interactions among FCPIs.
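The site-by-site relationships described in this paragraph can be held in a small lookup table. The sketch below is one possible Python representation, assuming a hypothetical record type SiteRecord; the pairings and the orthology flags are those stated in the text, and the table makes no claim beyond them (for example, it does not assert that CgRedCAP has been observed in a C. gracilis structure).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SiteRecord:
    """One FCPI binding site with the proteins named for each diatom."""
    site: str
    tp_protein: str       # subunit at this site in T. pseudonana
    cg_counterpart: str   # C. gracilis protein named in the text
    orthologous: bool     # relationship reported from the phylogenetic analysis


SITES = [
    SiteRecord("FCPI-1", "TpRedCAP", "CgRedCAP", True),
    SiteRecord("FCPI-2", "TpLhcr3", "CgLhcr1", True),
    SiteRecord("FCPI-3", "TpLhcq10", "CgLhcr9", True),
    SiteRecord("FCPI-4", "TpLhcf10", "CgLhcf3", False),  # stated as not homologous
    SiteRecord("FCPI-5", "TpLhcq8", "CgLhcq12", True),
]

for rec in SITES:
    label = "orthologous" if rec.orthologous else "not homologous"
    print(f"{rec.site}: {rec.tp_protein} / {rec.cg_counterpart} ({label})")
```

Filtering the list on the orthologous field singles out FCPI-4 as the only site where the two diatoms use unrelated subunits.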
It is notable that the C. gracilis PSI-FCPI structure binds considerably more FCPI subunits than that of T. pseudonana: 16 or 24 subunits in C. gracilis, as reported in previous studies (Nagao et al., 2020a; Xu et al., 2020), versus 5 subunits in T. pseudonana in the present study. The reason for this difference remains unclear. One possibility is that some FCPI subunits are released during detergent solubilization in T. pseudonana, while they are retained in C. gracilis. Alternatively, the number of FCPI subunits may be inherently lower in T. pseudonana, which may reflect adaptations to different living environments. Further studies are needed to resolve this question.
Extension to molecular assembly of PSI-LHCI supercomplexes
The protein–protein interactions in diatom PSI-FCPI supercomplexes likely arise from the specific binding of FCPs selected from 44 TpFCPs and 46 CgFCPs, in addition to RedCAPs. As in a lock-and-key mechanism, one FCP cannot be substituted by another in forming the PSI-FCPI supercomplexes in the two diatoms; for example, TpLhcq10 binds specifically at the FCPI-3 site but not at other sites such as FCPI-2. This selective binding mechanism of FCPIs may dictate the molecular assembly of PSI-FCPI. Importantly, the selective binding of FCPIs was identified for the first time by comparing the structures of PSI-FCPI supercomplexes and the amino acid sequences of FCPIs between the two diatom species. This approach can be extended to the LHC protein superfamily in the green and red lineages, enabling comparisons of protein structures and sequences of PSI-LHCI supercomplexes among closely related species. This, in turn, lays the foundation for elucidating the underlying mechanism of PSI-LHCI supercomplex assembly. Thus, this study helps to address the evolutionary question of how LHCIs recognize their binding sites at PSI in photosynthetic organisms.
Methods
Cell growth and preparation of thylakoid membranes
The marine centric diatom T. pseudonana CCMP1335 was grown in artificial seawater supplemented with sodium metasilicate and KW21 (Nagao et al., 2007) at 20°C under a photosynthetic photon flux density of 30 μmol photons m−2 s−1 provided by white LED, with bubbling of air containing 3% (vol/vol) CO2. The cells were harvested by centrifugation, disrupted by agitation with glass beads (Nagao et al., 2017), and the thylakoid membranes were pelleted by further centrifugation. The resulting thylakoid membranes were suspended in 50 mM Mes-NaOH (pH 6.5) buffer containing 1 M betaine and 1 mM ethylenediaminetetraacetic acid (EDTA).
Purification of the PSI-FCPI supercomplex
Thylakoid membranes were solubilized with 1% (wt/vol) n-dodecyl-β-D-maltoside (β-DDM) at a Chl concentration of 0.5 mg ml−1 for 20 min on ice in the dark with gentle stirring. After centrifugation at 162,000 × g for 20 min at 4°C, the supernatant was loaded onto a Q-Sepharose anion-exchange column (1.6 cm inner diameter, 25 cm length) equilibrated with 20 mM Mes-NaOH (pH 6.5) buffer containing 0.2 M trehalose, 5 mM CaCl2, 10 mM MgCl2, and 0.03% β-DDM (buffer A). The column was washed with buffer A until the eluate became colorless. Elution was performed at a flow rate of 1.0 ml min−1 using a linear gradient of buffer A and buffer B (buffer A plus 500 mM NaCl) with the following time and gradient: 0–600 min, 0–60% buffer B; 600–800 min, 60–100% buffer B; 800–900 min, 100% buffer B. The PSI-FCPI-enriched fraction was eluted at 194–247 mM NaCl, then collected and subsequently loaded onto a linear gradient containing 10–40% (wt/vol) trehalose in 20 mM Mes-NaOH (pH 6.5) buffer containing 5 mM CaCl2, 10 mM MgCl2, 100 mM NaCl, and 0.03% β-DDM. After centrifugation at 154,000 × g for 18 hr at 4°C (P40ST rotor; Hitachi), a green fraction (Figure 1—figure supplement 1A) was collected and concentrated using a 150-kDa cut-off filter (Apollo; Orbital Biosciences) at 4000 × g. The concentrated samples were stored in liquid nitrogen until use.
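The NaCl concentration at any point of the elution program follows from the piecewise-linear gradient and the 500 mM NaCl added to buffer B. The Python sketch below works through that arithmetic; it assumes that buffer A contributes no NaCl (none is listed in its composition) and that the programmed gradient reaches the column without delay, and the function names are illustrative only.

```python
# Programmed gradient from the text: (start_min, end_min, start_%B, end_%B).
GRADIENT = [
    (0.0, 600.0, 0.0, 60.0),
    (600.0, 800.0, 60.0, 100.0),
    (800.0, 900.0, 100.0, 100.0),
]
NACL_IN_B_MM = 500.0  # buffer B = buffer A plus 500 mM NaCl


def percent_b(t_min):
    """Percentage of buffer B at a given time, by linear interpolation."""
    for t0, t1, b0, b1 in GRADIENT:
        if t0 <= t_min <= t1:
            return b0 + (b1 - b0) * (t_min - t0) / (t1 - t0)
    raise ValueError("time outside the programmed gradient")


def nacl_mm(t_min):
    """Nominal NaCl concentration (mM), assuming buffer A holds no NaCl."""
    return percent_b(t_min) * NACL_IN_B_MM / 100.0


def time_at_nacl(target_mm):
    """Time (min) at which a rising segment reaches the target NaCl level."""
    target_b = target_mm / NACL_IN_B_MM * 100.0
    for t0, t1, b0, b1 in GRADIENT:
        if b1 > b0 and b0 <= target_b <= b1:
            return t0 + (target_b - b0) / (b1 - b0) * (t1 - t0)
    raise ValueError("concentration not reached on a rising segment")


for mm in (194.0, 247.0):
    print(f"{mm:.0f} mM NaCl reached at about {time_at_nacl(mm):.0f} min")
```

Under these assumptions, the reported elution window of 194–247 mM NaCl corresponds to roughly 388–494 min of the first gradient segment, which at 1.0 ml min−1 is also the nominal volume in ml.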
Biochemical and spectroscopic analyses of the PSI-FCPI supercomplex
The polypeptide bands of PSI-FCPI were analyzed by sodium dodecyl sulfate–polyacrylamide gel electrophoresis with 16% (wt/vol) acrylamide and 7.5 M urea, following the method of Ikeuchi and Inoue, 1988 (Figure 1—figure supplement 1B, Figure 1—figure supplement 1—source data 1 and 2). The PSI-FCPI supercomplexes (4 µg of Chl) were solubilized in 3% lithium lauryl sulfate and 75 mM dithiothreitol for 10 min at 60°C, and then loaded onto the gel. A standard molecular weight marker (SP-0110; APRO Science) was used. The absorption spectrum of PSI-FCPI was measured at room temperature using a UV-Vis spectrophotometer (UV-2450; Shimadzu) (Figure 1—figure supplement 1C), and the fluorescence emission spectrum of PSI-FCPI was measured at 77 K upon excitation at 430 nm using a spectrofluorometer (RF-5300PC; Shimadzu) (Figure 1—figure supplement 1D). The pigment composition of PSI-FCPI was analyzed by high-performance liquid chromatography following the method of Nagao et al., 2013, and the elution profile was monitored at 440 nm (Figure 1—figure supplement 1E).
Cryo-EM data collection
A 3 μl aliquot of the T. pseudonana PSI-FCPI supercomplex (3.0 mg of Chl ml−1) in 20 mM Mes-NaOH (pH 6.5) buffer containing 0.5 M betaine, 5 mM CaCl2, 10 mM MgCl2, and 0.03% β-DDM was applied to Quantifoil R1.2/1.3 Cu 300 mesh grids in the chamber of an FEI Vitrobot Mark IV (Thermo Fisher Scientific). The grid was then blotted with filter paper for 4 s at 4°C under 100% humidity and plunged into liquid ethane cooled by liquid nitrogen. The frozen grid was transferred to a CRYO ARM 300 electron microscope (JEOL) equipped with a cold-field emission gun operated at 300 kV. Image stacks were collected from 5 × 5 holes per stage adjustment: the stage was moved to the central hole, and image shifts were applied to reach the surrounding holes while maintaining an axial coma-free condition. The images were recorded using an in-column energy filter with a slit width of 20 eV at a nominal magnification of ×60,000 on a direct electron detector (Gatan K3, AMETEK). The nominal defocus range was −1.8 to −1.2 μm, and the physical pixel size corresponded to 0.752 Å. Each image stack was exposed at a dose rate of 21.46 e− Å−2 s−1 for 2.33 s in CDS mode, with dose-fractionated 50 movie frames. A total of 8,950 image stacks were collected.
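The exposure settings above fix the accumulated dose per stack and per frame, and the pixel size fixes the Nyquist limit of the recorded images. The short Python sketch below works through this arithmetic using only the values quoted in the text; the variable names are illustrative.

```python
# Values quoted in the data-collection description.
DOSE_RATE = 21.46      # electrons per square angstrom per second
EXPOSURE_S = 2.33      # exposure time per image stack, in seconds
N_FRAMES = 50          # dose-fractionated movie frames per stack
PIXEL_SIZE_A = 0.752   # physical pixel size, in angstroms

total_dose = DOSE_RATE * EXPOSURE_S     # accumulated dose per stack
dose_per_frame = total_dose / N_FRAMES  # dose carried by each frame
nyquist_a = 2.0 * PIXEL_SIZE_A          # finest resolvable spacing

print(f"total dose     : {total_dose:.2f} e-/A^2")
print(f"dose per frame : {dose_per_frame:.2f} e-/A^2")
print(f"Nyquist limit  : {nyquist_a:.3f} A")
```

The stack therefore carries about 50 e− Å−2 in total, or about 1 e− Å−2 per frame, and the Nyquist limit of 1.504 Å lies well beyond the reported map resolution of 2.30 Å.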
Cryo-EM image processing
The resultant movie frames were aligned and summed using MotionCor2 (Zheng et al., 2017) to produce dose-weighted images. The contrast transfer function (CTF) estimation was performed using CTFFIND4 (Mindell and Grigorieff, 2003). All subsequent processes were carried out using RELION-4.0 (Kimanius et al., 2021). A total of 2,733,572 particles were automatically picked and subjected to reference-free 2D classification. From these, 1,132,721 particles were selected from well-defined 2D classes and further processed for 3D classification without imposing any symmetry. An initial model for the first 3D classification was generated de novo from the 2D classification. A 240-Å spherical mask was used during the 3D classification and refinement processes. As illustrated in Figure 1—figure supplement 2C, the final PSI-FCPI structure was reconstructed from 75,667 particles. The overall resolution of the cryo-EM map was determined to be 2.30 Å, based on the gold-standard FSC curve with a cut-off value of 0.143 (Figure 1—figure supplement 3A; Grigorieff and Harrison, 2011). Local resolutions were calculated using RELION (Figure 1—figure supplement 3C).
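The particle numbers reported at each stage give a sense of how strongly the classification steps narrowed the dataset. The following Python sketch computes the retention fractions from the three counts stated above; intermediate 3D classification rounds shown in Figure 1—figure supplement 2C are not itemized in the text and are therefore not included, and the helper name retention is an assumption.

```python
# Particle counts at the stages named in the text.
STAGES = [
    ("auto-picked", 2_733_572),
    ("after 2D classification", 1_132_721),
    ("final reconstruction", 75_667),
]


def retention(stages):
    """Return (name, count, fraction of previous stage, fraction of first stage)."""
    first = stages[0][1]
    prev = first
    rows = []
    for name, count in stages:
        rows.append((name, count, count / prev, count / first))
        prev = count
    return rows


for name, count, step, overall in retention(STAGES):
    print(f"{name:<26} {count:>9,d}  step {step:6.1%}  overall {overall:6.1%}")
```

Roughly 41% of the picked particles passed 2D classification, and a little under 3% of the picked particles contributed to the final map.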
Model building and refinement
Two types of cryo-EM maps were employed for the model building of the PSI-FCPI supercomplex: a postprocessed map and a denoised map generated using Topaz version 0.2.4 (Bepler et al., 2020). The denoised map was obtained by denoising the postprocessed map with a model trained for 100 epochs on the two half-maps. Initial models of each subunit in the PSI-FCPI supercomplex were generated by ModelAngelo (Jamali et al., 2024) and subsequently inspected and manually adjusted against the maps with Coot (Emsley et al., 2010). Each model was built based on interpretable features from the density maps at a contour level of 2.5 σ in both the denoised and postprocessed maps. For the assignment of Chls, Chls a and c were distinguished by inspecting the density corresponding to the phytol chain at the lowest contour level at which the Chl density did not merge with noise. All Chls c were assigned as Chl c1 because Chl c1 and Chl c2 could not be distinguished at the present resolution. For the assignment of Cars, Fx and Ddx were distinguished based on the density surrounding the head groups of Cars at the same threshold. The PSI-FCPI structure was refined using phenix.real_space_refine (Adams et al., 2010) and Servalcat (Yamashita et al., 2021), incorporating geometric restraints for protein-cofactor coordination. The final model was validated with MolProbity (Chen et al., 2010), EMRinger (Barad et al., 2015), and Q-score (Pintilie et al., 2020). The statistics for all data collection and structure refinement are summarized in Tables 1 and 2. All structural figures were prepared using PyMOL (Schrödinger, 2021), UCSF Chimera (Pettersen et al., 2004), and UCSF ChimeraX (Pettersen et al., 2021). Since the numbering of Chls, Cars, and other cofactors in this paper differs from that in the PDB data, the corresponding relationships are provided in Tables 6–8.
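As a rough illustration of what a contour level expressed in σ means, the sketch below converts a σ multiple into an absolute density threshold. It assumes the convention that the threshold is the map mean plus n standard deviations; viewers differ in detail (for a map with a mean close to zero this reduces to n × σ), and the text does not specify which convention was applied. The function name and the toy values are hypothetical.

```python
from statistics import fmean, pstdev


def sigma_contour(density_values, n_sigma=2.5):
    """Absolute density threshold for a contour level given in sigma units.

    Assumed convention: threshold = mean + n_sigma * standard deviation,
    computed over all supplied voxel values.
    """
    mu = fmean(density_values)
    sigma = pstdev(density_values)
    return mu + n_sigma * sigma


if __name__ == "__main__":
    # Hypothetical toy "map": eight voxel values with mean 1 and standard deviation 1.
    toy_map = [0.0, 2.0] * 4
    print(sigma_contour(toy_map, n_sigma=2.5))
```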
Phylogenetic analysis
Amino acid sequences were aligned using MAFFT L-INS-i v7.490 or MAFFT E-INS-i v7.520 (Katoh and Standley, 2013). The alignment was trimmed using ClipKIT v1.4.1 with the smart-gap mode. Phylogenetic trees were inferred using IQ-TREE 2 (Minh et al., 2020) with the model selected by ModelFinder (Kalyaanamoorthy et al., 2017). Ultrafast bootstrap approximation was performed with 1000 replicates (Hoang et al., 2018). The trees were visualized using iTOL v6 (Letunic and Bork, 2021).
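The alignment, trimming, and tree-inference steps can be sketched as a sequence of command lines. The sketch below only assembles the commands and does not run them. The file names and the function name are hypothetical, and the flags are taken from the tools' public documentation as we understand it (MAFFT `--localpair`/`--genafpair` with `--maxiterate 1000`, ClipKIT `-m smart-gap`, IQ-TREE 2 `-m MFP` and `-B`); they are assumptions, not the exact commands used in this study.

```python
def build_pipeline(fasta, prefix, strategy="linsi", n_bootstrap=1000):
    """Assemble (but do not run) commands for alignment, trimming, and tree inference.

    strategy: "linsi" for MAFFT L-INS-i, "einsi" for MAFFT E-INS-i.
    All file names derived from `prefix` are hypothetical.
    """
    if strategy == "linsi":
        mafft = ["mafft", "--localpair", "--maxiterate", "1000", fasta]
    elif strategy == "einsi":
        mafft = ["mafft", "--ep", "0", "--genafpair", "--maxiterate", "1000", fasta]
    else:
        raise ValueError(f"unknown MAFFT strategy: {strategy!r}")

    aligned = f"{prefix}.aln.fasta"      # MAFFT writes to stdout; redirect it to this file
    trimmed = f"{prefix}.trimmed.fasta"

    clipkit = ["clipkit", aligned, "-m", "smart-gap", "-o", trimmed]
    # -m MFP requests ModelFinder model selection; -B sets the ultrafast bootstrap replicates.
    iqtree = ["iqtree2", "-s", trimmed, "-m", "MFP", "-B", str(n_bootstrap), "--prefix", prefix]

    return {"mafft": mafft, "mafft_stdout": aligned, "clipkit": clipkit, "iqtree": iqtree}


if __name__ == "__main__":
    for name, cmd in build_pipeline("fcp.fasta", "fcp").items():
        print(name, cmd)
```

Each command list could be passed to `subprocess.run`, with the MAFFT standard output redirected to the file named under `mafft_stdout`.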
Data availability
Atomic coordinates, cryo-EM maps, and raw image data for the reported structure have been deposited in the Protein Data Bank under accession code 8XLS (https://www.rcsb.org/structure/8XLS), in the Electron Microscopy Data Bank under accession code EMD-38457 (https://www.ebi.ac.uk/emdb/EMD-38457), and in the Electron Microscopy Public Image Archive under accession code EMPIAR-12142 (https://www.ebi.ac.uk/empiar/EMPIAR-12142/), respectively.
- RCSB Protein Data Bank, ID 8XLS. PSI-FCPI of the diatom Thalassiosira pseudonana CCMP1335.
- Electron Microscopy Data Bank, ID EMD-38457. PSI-FCPI of the diatom Thalassiosira pseudonana CCMP1335.
- Electron Microscopy Public Image Archive, ID EMPIAR-12142. PSI-FCPI of the diatom Thalassiosira pseudonana CCMP1335.
References
- PHENIX: a comprehensive Python-based system for macromolecular structure solution. Acta Crystallographica. Section D, Biological Crystallography 66:213–221. https://doi.org/10.1107/S0907444909052925
- Structure and function of photosystem I in Cyanidioschyzon merolae. Photosynthesis Research 139:499–508. https://doi.org/10.1007/s11120-018-0501-4
- Topaz-Denoise: general deep denoising models for cryoEM and cryoET. Nature Communications 11:5208. https://doi.org/10.1038/s41467-020-18952-1
- Electron transfer in photosystem I. Biochimica et Biophysica Acta (BBA) - Bioenergetics 1507:100–114. https://doi.org/10.1016/S0005-2728(01)00202-X
- Light harvesting complexes in chlorophyll c-containing algae. Biochimica et Biophysica Acta (BBA) - Bioenergetics 1861:148027. https://doi.org/10.1016/j.bbabio.2019.05.003
- MolProbity: all-atom structure validation for macromolecular crystallography. Acta Crystallographica. Section D, Biological Crystallography 66:12–21. https://doi.org/10.1107/S0907444909042073
- Features and development of Coot. Acta Crystallographica. Section D, Biological Crystallography 66:486–501. https://doi.org/10.1107/S0907444910007493
- Structure and function of photosystem I. Annual Review of Plant Physiology and Plant Molecular Biology 43:293–324. https://doi.org/10.1146/annurev.arplant.43.1.293
- The chlorophyll-carotenoid proteins of oxygenic photosynthesis. Annual Review of Plant Physiology and Plant Molecular Biology 47:685–714. https://doi.org/10.1146/annurev.arplant.47.1.685
- Near-atomic resolution reconstructions of icosahedral viruses from electron cryo-microscopy. Current Opinion in Structural Biology 21:265–273. https://doi.org/10.1016/j.sbi.2011.01.008
- The plasticity of photosystem I. Plant & Cell Physiology 62:1073–1081. https://doi.org/10.1093/pcp/pcab046
- UFBoot2: improving the ultrafast bootstrap approximation. Molecular Biology and Evolution 35:518–522. https://doi.org/10.1093/molbev/msx281
- Photosystem I complexes associated with fucoxanthin-chlorophyll-binding proteins from a marine centric diatom, Chaetoceros gracilis. Biochimica et Biophysica Acta (BBA) - Bioenergetics 1777:351–361. https://doi.org/10.1016/j.bbabio.2008.01.011
- Two types of fucoxanthin-chlorophyll-binding proteins I tightly bound to the photosystem I core complex in marine centric diatoms. Biochimica et Biophysica Acta (BBA) - Bioenergetics 1827:529–539. https://doi.org/10.1016/j.bbabio.2013.02.003
- MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Molecular Biology and Evolution 30:772–780. https://doi.org/10.1093/molbev/mst010
- New tools for automated cryo-EM single-particle analysis in RELION-4.0. The Biochemical Journal 478:4169–4185. https://doi.org/10.1042/BCJ20210708
- Interactive Tree Of Life (iTOL) v5: an online tool for phylogenetic tree display and annotation. Nucleic Acids Research 49:W293–W296. https://doi.org/10.1093/nar/gkab301
- Accurate determination of local defocus and specimen tilt in electron microscopy. Journal of Structural Biology 142:334–347. https://doi.org/10.1016/s1047-8477(03)00069-8
- IQ-TREE 2: new models and efficient methods for phylogenetic inference in the genomic era. Molecular Biology and Evolution 37:1530–1534. https://doi.org/10.1093/molbev/msaa015
- High excitation energy quenching in fucoxanthin chlorophyll a/c-binding protein complexes from the diatom Chaetoceros gracilis. The Journal of Physical Chemistry. B 117:6888–6895. https://doi.org/10.1021/jp403923q
- Genetically introduced hydrogen bond interactions reveal an asymmetric charge distribution on the radical cation of the special-pair chlorophyll P680. The Journal of Biological Chemistry 292:7474–7486. https://doi.org/10.1074/jbc.M117.781062
- Ultrafast excitation energy dynamics in a diatom photosystem I-antenna complex: a femtosecond fluorescence upconversion study. The Journal of Physical Chemistry. B 123:2673–2678. https://doi.org/10.1021/acs.jpcb.8b12086
- Low-energy chlorophylls in fucoxanthin chlorophyll a/c-binding protein conduct excitation energy transfer to photosystem I in diatoms. The Journal of Physical Chemistry. B 123:66–70. https://doi.org/10.1021/acs.jpcb.8b09253
- pH-sensing machinery of excitation energy transfer in diatom PSI-FCPI complexes. The Journal of Physical Chemistry Letters 10:3531–3535. https://doi.org/10.1021/acs.jpclett.9b01314
- Effects of CO2 and temperature on photosynthetic performance in the diatom Chaetoceros gracilis. Photosynthesis Research 146:189–195. https://doi.org/10.1007/s11120-020-00729-8
- Excitation-energy transfer and quenching in diatom PSI-FCPI upon P700 cation formation. The Journal of Physical Chemistry. B 124:1481–1486. https://doi.org/10.1021/acs.jpcb.0c00715
- UCSF Chimera-A visualization system for exploratory research and analysis. Journal of Computational Chemistry 25:1605–1612. https://doi.org/10.1002/jcc.20084
- The structure of photosystem II and the mechanism of water oxidation in photosynthesis. Annual Review of Plant Biology 66:23–48. https://doi.org/10.1146/annurev-arplant-050312-120129
- Structure, function, and variations of the photosystem I-antenna supercomplex from different photosynthetic organisms (book chapter). In: Harris JR, Marles-Wright J, editors. Macromolecular Protein Complexes IV. Subcellular Biochemistry. Springer Cham. pp. 351–377. https://doi.org/10.1007/978-3-031-00793-4_11
- Solar energy conversion by photosystem II: principles and structures. Photosynthesis Research 156:279–307. https://doi.org/10.1007/s11120-022-00991-y
- Characterization of highly purified photosystem I complexes from the chlorophyll d-dominated cyanobacterium Acaryochloris marina MBIC 11017. The Journal of Biological Chemistry 283:18198–18209. https://doi.org/10.1074/jbc.M801805200
- Structure, organization and function of light-harvesting complexes associated with photosystem II (book chapter). In: Shen J-R, Satoh K, Allakhverdiev SI, editors. Photosynthesis: Molecular Approaches to Solar Energy Conversion. Springer Cham. pp. 163–194. https://doi.org/10.1007/978-3-030-67407-6_6
- A unique photosystem I reaction center from a chlorophyll d-containing cyanobacterium Acaryochloris marina. Journal of Integrative Plant Biology 63:1740–1752. https://doi.org/10.1111/jipb.13113
- Cryo-EM single-particle structure refinement and map calculation using Servalcat. Acta Crystallographica. Section D, Structural Biology 77:1282–1291. https://doi.org/10.1107/S2059798321009475
Article and author information
Funding
Japan Society for the Promotion of Science (JP22KJ2017)
- Minoru Kumazawa
Japan Society for the Promotion of Science (JP23K14211)
- Yoshiki Nakajima
Japan Society for the Promotion of Science (JP22H04916)
- Jian-Ren Shen
Japan Society for the Promotion of Science (JP23H02347)
- Kentaro Ifuku
Japan Society for the Promotion of Science (JP23H02423)
- Ryo Nagao
Takeda Science Foundation
- Koji Kato
- Ryo Nagao
The funders had no role in study design, data collection, and interpretation, or the decision to submit the work for publication.
Acknowledgements
We thank Kumiyo Kato and Satoko Kakiuchi for their assistance in this study. The cells of T. pseudonana CCMP1335 were provided by Prof. Yusuke Matsuda, Kwansei Gakuin University, Japan. Cryo-EM data were obtained using EM01CT and EM02CT of SPring-8 with the approval of the Japan Synchrotron Radiation Research Institute (JASRI Proposal No. 2022B2728 (J-RS) and No. 2023A2715 (YN)). This work was supported by JSPS KAKENHI grant Nos. JP22KJ2017 (MK), JP23K14211 (YN), JP22H04916 (J-RS), JP23H02347 (KI), and JP23H02423 (RN), the Takeda Science Foundation (RN, KK), and the Research Support Project for Life Science and Drug Discovery (Basis for Supporting Innovative Drug Discovery and Life Science Research (BINDS)) from AMED under support No. 4176 (J-RS).
Version history
- Preprint posted:
- Sent for peer review:
- Reviewed Preprint version 1:
- Reviewed Preprint version 2:
- Version of Record published:
Cite all versions
You can cite all versions using the DOI https://doi.org/10.7554/eLife.99858. This DOI represents all versions, and will always resolve to the latest one.
Copyright
© 2024, Kato, Nakajima, Xing et al.
This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.