Upstream open reading frames buffer translational variability during Drosophila evolution and development

Yuanqiang Sun; Yuange Duan; Peixiang Gao; Chenlu Liu; Kaichun Jin; Shengqian Dou; Wenxiong Tang; Hong Zhang; Jian Lu

doi:10.7554/eLife.104074.1

eLife Assessment

This study reveals the important role of upstream open reading frames (uORFs) in limiting the translational variability of downstream coding sequences. Through a combination of computational simulations, comparative analyses of translation efficiency across different developmental stages in two closely related Drosophila species, and manipulative, experimental validation of translation buffering by an uORF for a gene, the authors provide convincing evidence supporting their conclusions. This work will be of broad interest to molecular biologists and geneticists.

https://doi.org/10.7554/eLife.104074.1.sa2

Significance of findings

important: Findings that have theoretical or practical implications beyond a single subfield

landmark
fundamental
important
valuable
useful

Strength of evidence

convincing: Appropriate and validated methodology in line with current state-of-the-art

exceptional
compelling
convincing
solid
incomplete
inadequate

During the peer-review process the editor and reviewers write an eLife assessment that summarises the significance of the findings reported in the article (on a scale ranging from landmark to useful) and the strength of the evidence (on a scale ranging from exceptional to inadequate). Learn more about eLife assessments

Abstract

Protein abundance tends to be more evolutionarily conserved than mRNA levels both within and between species, yet the mechanisms underlying this phenomenon remain largely unknown. Upstream open reading frames (uORFs) are widespread cis-regulatory elements in eukaryotic genomes that regulate translation, but it remains unclear whether and how uORFs contribute to stabilizing protein levels. In this study, we performed ribosome translation simulations on mRNA to quantitatively assess the extent to which uORF translation influences the translational variability of downstream coding sequences (CDS) across varying contexts. Our simulations revealed that uORF translation dampens CDS translational variability, with buffering capacity increasing in proportion to uORF efficiency, length, and number. We then compared the translatomes at different developmental stages of two Drosophila species, demonstrating that uORFs buffer mRNA translation fluctuations during both evolution and development. Experimentally, deleting a uORF in the bcd gene—a prominent example of translational buffering—resulted in extensive changes in gene expression and phenotypes in Drosophila melanogaster. Additionally, we observed uORF-mediated buffering between primates and within human populations. Together, our results reveal a novel regulatory mechanism by which uORFs stabilize gene translation during development and across evolutionary time.

Introduction

Organisms have evolved various strategies for the spatiotemporal regulation of gene expression ^1–3. This is important because aberrant gene expression can result in phenotypic defects or diseases, while the variation and evolution of gene expression patterns frequently promote phenotypic diversification and adaptation ⁴. Although variations in mRNA abundance are widely observed within or between species, protein abundance tends to show stronger evolutionary constraint ^5,6, as observed in yeasts ^7–9, primates ^10,11, and other organisms ^12–14. Nevertheless, the molecular mechanisms by which the conservation of protein abundance across species is achieved are largely unknown ^5,6.

Eukaryotic mRNA translation is a crucial step in gene expression and is highly regulated by multilayered mechanisms ^15–17. Upstream open reading frames (uORFs), short open reading frames in the 5-terminal untranslated regions (5’UTRs) of eukaryotic mRNAs, play crucial roles in regulating mRNA translation. Approximately 50% of eukaryotic genes contain uORFs ¹⁸, and their evolution has been tightly shaped by natural selection ^19–23. The functions of uORFs have been explored in various contexts, including development ^23–30, disease ^31–35 and stress responses ^36–43. The prevailing consensus is that uORFs typically repress downstream coding sequence (CDS) translation by sequestering ribosomes, a process influenced by factors such as uORF length, position, and sequence context ^34,43–48. However, under stress conditions, certain uORFs can facilitate CDS translation by promoting ribosome reinitiation, illustrating their context-dependent functions ^{42,43,48–52}.

Gene expression noise, which arises from the inherent stochasticity of biological processes such as transcription and translation, is generally detrimental to organismal fitness ⁵³ and is primarily determined at the translational level ⁵⁴. Recent studies suggest uORFs might play essential roles in buffering translational noise and stabilizing protein expression. For example, Wu et al. (2022) demonstrated that uORFs reduce protein production rates to stabilize TOC1 protein levels, ensuring precise circadian clock function in plants ⁵⁵. Similarly, Bottorff et al. (2022) used a human cell reporter system to show that a ribosome stall in cytomegaloviral UL4 uORFs buffers against CDS translation reductions ⁵⁶. Under stress, translation initiation is typically downregulated, yet most human mRNAs resistant to this inhibition contain translated uORFs, with a single uORF often being sufficient for resistance ⁴². The computational model of Initiation Complexes Interference with Elongating Ribosomes (ICIER) suggests that derepression of downstream translation is a general mechanism of uORF-mediated stress resistance ⁴³. Despite these findings, the current understanding of uORFs in stabilizing translation is limited to single-gene cases or stressed conditions. It remains unclear whether and how uORFs affect gene translation variability on a genome-wide scale during evolution and development, and whether the identified mechanisms are universal or vary significantly among different taxa. To address these questions, a combination of modeling, genome-wide analyses, and comparative studies across species is required.

In this study, we first adapted the ICIER framework ⁴³ to simulate the translating ribosome on a mRNA to quantitatively measure the extent to which uORF translation reduces the translational variability of the downstream CDS under different translation contexts. We then compared the translatomes of two closely related Drosophila species, D. melanogaster and D. simulans, and further supported the notion that uORFs could buffer the fluctuations of CDS translation during the development and evolution of Drosophila. The patterns also reappeared among primates and human populations. We next knocked out the bcd uORF, a case showing significant buffering effect in our data, and observed wide changes in embryonic transcriptome and phenotypic defects in D. melanogaster. Together, our results demonstrate a novel role for uORFs by maintaining translation stabilization during Drosophila evolution and development.

Results

An extended ICIER model for quantifying uORF buffering in CDS translation

To quantitatively assess how uORF translation modulates the variability of downstream CDS translation, we adapted the ICIER model ⁴³, originally grounded in the totally asymmetric simple exclusion process (TASEP). TASEP has been extensively utilized to model the stochastic nature of ribosome movement along mRNA, capturing the effects of ribosome traffic jams, where ribosomes may slow down or stall when a site ahead is occupied ^57–60. The ICIER model extends this by simulating the interplay between scanning (40S) and elongating (80S) ribosomes, particularly focusing on how uORFs impact the overall translation efficiency of the main CDS ⁴³.

We extended the original ICIER model ⁴³ with several major modifications (Fig. 1A). First, while the original ICIER model only considered the scenario where the elongating ribosome (80S) causes downstream scanning ribosomes (40S) to dissociate from the mRNA when they move along the mRNA and collide, recent findings have shown that upstream dissociation can also play a critical role in uORF-mediated regulation ⁵⁶. To incorporate this, we accounted for more complex ribosome interactions, including three possible scenarios where the 80S collides with 40S: i) 80S only causes the downstream 40S to dissociate from the mRNA with a probability of K_down (“downstream dissociation”, K_down ranging from 0 to 1), following the original ICIER model; ii) the 80S only causes the upstream 40S to dissociate from the mRNA with a probability of K_up (“upstream dissociation”, K_up > 0 and K_down = 0); and iii) a combination of the downstream and upstream dissociation models (“double dissociation”, K_up > 0 and K_down > 0).

Modeling simulation of uORF-mediated translation buffering.
(A) Model schema of the modified ICIER model (on the top); the parameters are listed in the box below the schema. (B) Heatmap showing the CVs of CDS TE (*N_EC*) under different *I_CDS* (x-axis) and *I_uORF* (y-axis) combinations with a uniform distribution of *R_in* input and the downstream dissociation model. The left panels elicited by the dotted lines from specific squares of right heatmap were two examples showing the distribution of *N_EC* under *I_CDS* = 0.8 & *I_uORF* = 0 (top panel, without uORF) and *I_CDS* = 0.8 & *I_uORF* = 0.4 (bottom panel, with uORF). (C) Heatmap showing CVs of CDS TE (*N_EC*) under different *I_uORF* (x-axis) and *I_uORF* (y-axis) combinations with a uniform distribution of *R_in* input and the downstream dissociation model. The left panels elicited by the dotted lines from specific squares of right heatmap were two examples showing the distribution of *N_EC* under *L_uORF* = 2 & *I_uORF* = 0.2 (top panel) and *L_uORF* = 30 & *I_uORF* = 0.2 (bottom panel). (D) Heatmap showing median δ [*log*₂(Δ*N_EC*/Δ*N_EU*₌)] under different *I_CDS* (x-axis) and *I_uORF* (y-axis) combinations with a uniform distribution of *R_in* input and the downstream dissociation model. The left panel elicited by the dotted line from a specific square of right heatmap was an example showing the distribution of δ under *I_CDS* = 0.8 & *I_uORF* = 0.2. The vertical dashed line indicated the median value of δ.

Second, the original ICIER model only considered the ribosome collision and dissociation in the uORF and counted the 40S scanning ribosome escaping from the uORF as a proxy of the CDS translation rate. In our extended model, we also considered the ribosome collision and dissociation events in the CDS that is downstream of the uORF. We recorded the number of 80S ribosomes that completed translation at the stop codon of a CDS (N_EC) or uORF (N_Eu) during a given time interval, using these counts as proxies to quantify the translation efficiency (TE) of CDS or uORF. These indices allowed us to directly and quantitively measure the impact of uORF-mediated translational buffering.

Third, while the original ICIER model only considered the effect of a single uORF, we also considered the buffering effects of two uORFs, allowing us to test the possible combinatorial effects of uORF-mediated translation regulation in this more complex yet more common scenario, as previous studies have shown uORFs tend to be clustered in genes ¹⁸.

These extensions allow for a more comprehensive exploration of uORF-mediated translational buffering, offering deeper insights into how these regulatory elements might stabilize protein synthesis across varying translation contexts.

uORF-mediated buffering of CDS translation across different parameter settings

To systematically investigate the extent to which uORF translation modulates the variability of downstream CDS translation, we conducted simulations across various parameter settings using three dissociation models: upstream, downstream, and double dissociation, each reflecting different possible interactions between scanning and elongating ribosomes on mRNA. We considered a range of parameters crucial to the translation process (Table S1), including the length of the 5’ leader before the uORF (fixed at 150 nucleotides), the length of the uORF itself (ranging from 2 to 100 codons), the distance between the uORF stop codon and the CDS start codon (150 nucleotides), the length of the CDS (500 codons), and the length of the 3’ UTR (150 nucleotides). Additionally, we modeled the probabilities associated with ribosome movement and initiation, such as the probability of a 40S ribosome moving to the next nucleotide (v_s = 0.3), the probability of an 80S ribosome moving to the next position within the uORF (v_Eu = 0.3) and within the CDS (v_EC = 0.5), and the probability of loading a new 40S ribosome at the 5’ end of the mRNA (R_in). We also explored different probabilities of translation initiation at both the uORF start codon (I_uORF) and the CDS start codon (I_CDS). Key parameters were adapted from the original ICIER model ⁴³, ensuring a robust basis for comparison while allowing exploration of additional variables that influence uORF-mediated translational buffering.

In our simulations, R_in values were varied to simulate fluctuations in translational resources, such as ribosome availability, that could arise from genetic differences or environmental changes during evolution or development. By generating 1,000 R_in values following either uniform or exponential distributions (ranging from 0 to 0.1, Fig. S1A), we aimed to capture the natural variability in ribosome loading rates that might occur across different species, individuals, or developmental stages. These values were then fed into the simulation models to evaluate their impact on both the level and variability of translation efficiency for CDSs, with the number of ribosomes completing translation on CDSs (N_EC) in a given time interval used as proxies for translational efficiency (Fig. S1B).

Across all model settings, uORF translation (I_uORF > 0) consistently reduced CDS translation efficiency (N_EC) by about 30% to 80% as I_uORF increased from 0.1 to 0.5, compared to scenarios where the uORF was absent or untranslated (I_uORF = 0) (Fig. S2 and S3). This confirms the inhibitory effect of uORF on downstream CDS translation. Notably, the coefficient of variation (CV), a measurement of variability for translation efficiency (N_EC), was lower when the uORF was translated (Fig. 1B). These CV values further decreased by approximately 10% to 25% as I_uORF increased from 0.1 to 0.5 (Fig. 1B), and this buffering effect persisted across different parameter settings (Fig. S4 and S5). Moreover, the CV of translation efficiency (N_EC) further decreased by about 6% to 30% as uORF length (I_uORF) increased from 2 to 100 codons (Fig. 1C and S6-7). These simulation results indicate that uORF translation can reduce variability in downstream CDS translation, with the buffering capacity positively correlated with both uORF translation initiation efficiency and length under the applied simulation conditions.

To more quantitatively investigate the relationship between changes in uORF (N_EU) and CDS (N_EC) translation, for each parameter setting, we used the median value (R_in₂) of the 1000 R_in inputs as the baseline, calculating the corresponding N_EC₂ and N_EU₂values. We then calculated the difference in changes in N_EC (Δ N_EC) and N_EU (Δ N_EU) relative to N_EC₂ and N_EU₂ for each R_in value. Across different parameter settings, ΔN_EU was consistently and significantly positively correlated with ΔN_EC (P < 0.001, Spearman’s correlation) (Fig. S8-9), indicating that fluctuations of the ribosome loading rate influence the translation of both uORFs and CDSs in the same direction. Nevertheless, our simulations showed that variations in R_in led to a larger change in N_EU than in N_EC, as the median value of δ [defined by log₂(ΔN_EC/ΔN_EU)] was consistently less than 0 across various I_uORF and I_CDS combinations (Fig. 1D and Fig. S10-11). This finding suggests that uORFs buffer against upstream fluctuations, exhibiting greater translational fluctuations than downstream CDSs.

Simulations of mRNAs with two uORFs revealed patterns consistent with a buffering role for uORFs (Fig. S12). To compare the buffering effects of a single uORF versus two uORFs, we calculated the ratio of the CV of N_EC with two uORFs to that with a single uORF. A ratio less than 1 suggests that two uORFs provide greater buffering than a single uORF. For comparability, we examined the CV of N_EC where the I_uORF in the single-uORF model equals I_uORF₌ (I_uORF of the first uORF) in the two-uORF model, both ranging from 0 to 0.5. The CV ratio consistently remained below 1 across a range of I_uORF_> (I_uORF of the second uORF) (Fig. S13), indicating that two uORFs offer stronger buffering than a single uORF.

Collectively, these simulations collectively suggest: (1) uORF-mediated translational control buffers against CDS translation variability, (2) uORFs exhibit greater translational fluctuations than downstream CDSs, and (3) the buffering capacity of uORFs positively correlates with their translation initiation efficiency and length. Subsequently, we sought to validate these simulation results in a biological context by confirming the uORF-mediated buffering effect during organismal evolution and development, as both processes face environmental and/or genetic changes that frequently disturb mRNA translation.

Generating matched translatome data from two Drosophila species for comparative analysis

To validate the uORF-mediated translational buffering during Drosophila evolution, we performed a comparative analysis of the translatomes of two closely related Drosophila species, D. melanogaster and D. simulans, which diverged approximately 5.4 million years ago ⁶¹. We generated high-throughput sequencing data for D. simulans, including transcriptome (mRNA-Seq) and translatome (Ribo-Seq) profiles from various developmental stages and tissues, such as embryos at 0-2 h, 2-6 h, 6-12 h, and 12-24 h, third-instar larvae, P7-8 pupae, female and male bodies, and female and male heads. In total, we obtained approximately 786 million high-quality reads for D. simulans (Table S2). These datasets were designed to be directly comparable to the previously published D. melanogaster data ²³, with identical embryonic stages and tissue types. This comprehensive comparative translatome analysis, utilizing matched developmental stages and tissues between the two species, allowed us to test the uORF-mediated translational buffering effects during evolution.

Translational conservation and dominance of uORFs between Drosophila species

Given that the translation of uORFs is a crucial determinant for their functional impact, we first characterized the translational profiles of uORFs in D. melanogaster and D. simulans to explore the evolutionary roles of uORFs in gene regulation. We identified 18,412 canonical uORFs shared between the two species (referred to as conserved uORFs hereafter), 2,789 uORFs specific to D. melanogaster and 2,440 uORFs specific to D. simulans. The translational efficiencies (TEs) of the conserved uORFs were highly correlated between the two species across all developmental stages and tissues examined, with Spearman correlation coefficients ranging from 0.478 to 0.573 (Fig. 2A). Notably, conserved uORFs exhibited significantly higher translational efficiencies (TEs) compared to species-specific uORFs in both species. The median TE of conserved uORFs was 1.62 times that of non-conserved uORFs in D. simulans, while the corresponding ratio in D. melanogaster was 1.52 (Fig. 2B).

Conservation and translation of uORFs between *D. melanogaster* and *D. simulans*.
(A) Spearman’s correlation coefficients (*Rho*, represented by the bars) of conserved TEs between Dm (*D. melanogaster*) and Ds (*D. simulans*). ***, P < 0.001. Data for the female head sample is shown as an example in the right panel. The x- and y-axes represent the uORF TEs in Dm and Ds. (B) The median of TE of conserved and species-specific uORFs in each sample. Each dot represents the median TE of a sample for a specific uORF class. Data from the female head sample is shown as an example in the right panel. P values were obtained from Wilcoxon rank sum tests. ***, P < 0.001. (C) uORFs were ranked by decreasing TEs within each gene. The uORF with the highest TE within each gene was defined as the dominant uORF (#1). *_EC*2” represents the second highest uORF TE and the same goes for *_EC*3” and “>3”. Each dot represents the median TE of a sample. (D) Fraction of conserved uORFs among dominant uORFs and other translated uORFs in each sample. The paired samples in Dm and Ds were linked together. The P value was obtained by the paired Wilcoxon signed rank test. ***, P < 0.001. (E) Absolute values of the interspecific TE fold changes (log₂TE-FC) of dominant uORFs and the other translated uORFs in each sample. The paired samples in Dm and Ds were linked together. The median value of each sample is shown. The P value was obtained via the paired Wilcoxon signed rank test. ***, P < 0.001. Data from the female head sample were used as an example in the right panel.

In D. melanogaster, 7,259 (52.2%) genes had no uORFs, 2,687 (19.3%) had a single uORF, and 3,961 (28.5%) contained multiple uORFs. Among genes with multiple uORFs, one uORF generally emerged as dominant, displaying a higher TE than the others within the same gene (Fig. 2C). The median TE of the dominant uORF was 4.84 times that of the second-highest uORF within the same gene in D. melanogaster, and the corresponding ratio was 5.21 times in D. simulans (Fig. 2C). To assess the consistency of this dominance across different tissues and developmental stages, we identified 3,072 multiple-uORF genes in D. melanogaster with at least one translated uORF (TE > 0.1 in at least five stages/tissues). Of these, 569 genes consistently used the same dominant uORF across the measured samples, significantly higher than the number expected under randomness (5 genes, 95% confidence interval: 1-10) based on shuffling the TEs of uORFs 1,000 times (Fig. S14). This trend was also observed in D. simulans and persisted under different thresholds for defining “translated uORFs” (Fig. S14). Furthermore, the dominant uORFs showed a higher proportion of conserved uATGs than the other translated uORFs (median proportion 82.5% versus 78.8%, P < 0.001) (Fig. 2D). Additionally, TE fold-changes (|log₂TE-FC|) between the two species were, on an average, 23.2% smaller for dominant uORFs than for other conserved uORFs (Fig. 2E), suggesting that dominant uORFs are more likely under stronger stabilizing selection. These findings suggest that, in genes with multiple uORFs, the dominantly translated uORF may play a more important role in regulating CDS translation than the other uORFs.”

uORFs and CDSs show correlated translation differences between D. melanogaster and D. simulans

To investigate the relationship between translational changes in uORFs and their downstream CDSs across species, we analyzed the translation efficiency (TE) of uORFs and corresponding downstream CDSs in D. melanogaster and D. simulans. Consistent with our simulations that the translation of a uORF is tightly linked to that of downstream CDS, uORFs exhibited a significant positive correlation with the TE of their downstream CDSs in all samples analyzed (P < 0.001, Spearman’s correlation) (Fig. 3A). We then compared the interspecific TE change of a uORF (β_u = TE_uORF_,&.2/TE_uORF_,2CD) with that of its corresponding CDS (β_c = TE_CDS_,&.2/TE_CDS_,2CD) between D. melanogaster and D. simulans. We found β_u is significantly positively correlated with β_c across all samples (P values < 0.001, Spearman’s correlation) (Fig. 3B). These results align well with our simulations, which showed that fluctuations in translational factors (such as ribosomes) influence both uORF and CDS translation in the same direction (Fig. S8-S9).

uORFs reduce CDS translational divergence between *D. melanogaster* and *D. simulans*.
(A) The correlation of uORF TEs and the corresponding CDS TEs in 10 samples of Dm (*D. melanogaster*) and Ds (*D. simulans*). The bars represent Spearman’s correlation coefficient (*Rho*). In all samples, we obtained both P values < 0.001. Data for the female head sample of Dm and Ds are shown as examples in the right panel. (B) Correlations between interspecific uORF TE changes (log₂*β_u*) and CDS TE changes (log₂*β_C*) in 10 samples. The x-axis was divided into 50 equal bins with increasing *β_u*. Spearman’s correlation coefficients (*Rho*) are shown at the top left. ***, P < 0.001 in the correlation test. (C) Genes expressed in female heads (mRNA RPKM > 0.1 in both species) were classified into three classes according to whether a gene had a conserved and dominantly translated uORF or not. Boxplots showing interspecific CDS TE variability |*log*₂(*β_c*)| of different gene classes. P values were calculated using Wilcoxon rank sum tests between the neighboring groups. ***, P < 0.001. (D) Genes expressed in female heads were classified into three classes according to the length of translated uORFs. Boxplots showing interspecific CDS TE variability |*log*₂(*β_c*)| of different gene classes. P values were calculated using Wilcoxon rank sum tests between the neighboring groups. ***, P < 0.001.

uORFs buffer interspecific translational divergence of CDSs

While the direction of translational efficiency (TE) changes for uORFs and CDSs tends to be consistent, our simulations suggest that the magnitude of TE changes in CDSs is generally smaller due to the buffering effect of uORF translation (Fig. 1B, S4-5). To validate this, we first identified uORFs and CDSs with significant interspecific TE differences by assessing whether β_u or β_C significantly deviated from 1, using an established statistical framework ²³. This analysis uncovered 1,151 to 4,189 CDSs with significant interspecific TE changes (FDR < 0.05) (Table 1), with genes involved in development, morphogenesis, and differentiation being significantly enriched during embryonic stages (Fig. S15). Conversely, genes related to metabolism, response to stimuli, and signaling were enriched in larval, pupal, and adult stages (Fig. S15). Additionally, we identified 144 to 1,193 uORFs with significant TE differences between species, accounting for approximately 1-15% of expressed uORFs (Table 1). The smaller number of uORFs showing significant TE changes compared to CDSs between D. melanogaster and D. simulans likely reflects their shorter length and reduced statistical power, rather than indicating that uORFs are less variable in translation than CDSs.

Numbers of genes showing different magnitudes of TE changes between uORFs and CDS at the interspecific level.

To further investigate, we quantitatively compared the magnitude of interspecific TE changes between uORFs and their corresponding CDSs using a previous method ²³. We defined γ =β_c/β_u for a uORF-CDS pair within the same mRNA and tested whether γ was significantly different from 1 to identify pairs with differential TE changes between uORFs and CDSs (Fig. S16). When γ < 1 and β_u >1, or γ > 1 and β_$_u<1, it indicates that the TE change for the CDS is smaller than that for the uORF (Fig. S16). Among CDS-uORF pairs where β_u > 1, nearly all (8-487) showed a significant γ < 1 in each sample, except for one pair from the pupal stage where γ > 1 (Table 1). This suggests that the magnitude of TE changes in CDSs was generally smaller than in uORFs when uORF TE increased, and vice versa when uORF TE decreased (β_u < 1) (Table 1). These comparative translatome analyses indicate that uORFs buffered mRNA translation changes during the evolutionary divergence of D. melanogaster and D. simulans.

uORF buffering is influenced by its conservation, dominance, and length

To investigate how the conservation level and translation patterns of uORFs influence their buffering capacity on CDS translation, we categorized genes expressed in each pair of samples into three classes: Class I, genes with uORFs conserved and dominantly translated in both Drosophila species; Class II, genes with conserved uORFs translated in both species but not dominantly in at least one; and Class III, the remaining expressed genes. We then compared the interspecific translation efficiency (TE) changes of the CDS (|β_c|) across these three categories. Significant differences in |β_c| were observed, with a consistent hierarchy of Class I < II < III across all pairs of samples (Fig. 3C and S17). On average, Class I genes exhibited an average of 8.18% and 23.8% lower |β_c| values compared to Class II and Class III, respectively. This indicates that conserved and dominantly translated uORFs exert a stronger buffering effect on CDS translation.

To further validate the simulation results suggesting that longer uORFs have a stronger buffering effect (Fig. 1C and S6-7), we divided genes expressed in each pair of samples into three groups: those without translated uORFs (No), those with short uORFs (short, total length below the median), and those with long uORFs (long, total length above the median). Consistently, longer uORFs were associated with stronger buffering effects on CDS translation across all pairs of samples (Fig. 3D and S18). Specifically, genes with longer uORFs showed 12.7% and 26.5% lower |β_c| values compared to genes with short uORFs or no uORFs, respectively.

Overall, these findings underscore that the buffering capability of a uORF is positively correlated with its conservation level, translation dominance, and length.

uORFs buffer translational fluctuations during Drosophila development

Gene expression undergoes dynamic changes during Drosophila development, with significant alterations in the translation program to meet developmental demands, including shifts in ribosome loading rates ^24,62,63. Therefore, we extended our analysis to investigate the role of uORFs in buffering these translational fluctuations, hypothesizing that uORFs could mitigate them during development. According to our hypothesis, if a gene has a translated uORF in D. melanogaster but not in its orthologous gene in D. simulans, then the translation of this gene is likely more stable across developmental stages in D. melanogaster than its ortholog in D. simulans, and vice versa.

To test this hypothesis, we compared the CV of CDS TE across 10 developmental stages in D. melanogaster and in D. simulans, respectively. Genes with translated uORFs (TE>0.1 in at least one sample) in D. melanogaster exhibited significantly 22.5% smaller CVs in D. melanogaster than their orthologs lacking these uORFs in D. simulans, indicating more stable translation (Fig. 4A). Consistently, for genes with translated uORFs in D. simulans but not in D. melanogaster, the CVs were 13.3% lower in D. simulans compared to their orthologs lacking these uORFs in D. melanogaster (Fig. 4B). Consistent results were observed when a uORF is required to be translated (TE > 0.1) in all 10 samples (Fig. S19A), in the 4 embryonic stages (Fig. S19B) or in the 6 stages including embryos, larva, and pupa (Fig. S19C). Moreover, within each species, genes with translated uORFs also showed less variability in CDS TE across developmental stages compared to those without translated uORFs, with a 31.8% reduction in D. melanogaster and a 28.9% reduction in D. simulans (Fig. 4C). This effect was consistent across different thresholds for defining “translated uORFs” (Fig. S20).

uORFs could reduce CDS translational fluctuation during *Drosophila* development.
(A) The CV of TE_CDS across 10 Dm (*D. melanogaster*) samples and 10 Ds (*D. simulans*) samples. The selected gene with uORFs translated (TE > 0.1) in at least one Dm sample but its homologous gene without translated uORF in Ds samples. Each pair of dots linked by a gray line represents a pair of homologous genes in Dm and Ds. ***, P < 0.001, Wilcoxon signed-rank test. (B) The CV of TE_CDS across 10 Dm samples and 10 Ds samples. The selected gene with uORFs translated (TE > 0.1) in at least one Ds sample but its homologous gene without translated uORF in Dm samples. Each pair of dots linked by a gray line represents a pair of homologous genes in Dm and Ds. ***, P < 0.001, Wilcoxon signed-rank test. (C) Within each *Drosophila* species, the CV of TE_CDS of genes with translated uORFs compared to genes without the translated uORFs. The P values are obtained by the Wilcoxon rank sum test. ***, P < 0.001.

These results suggest that uORFs function as translational buffers, reducing gene translation fluctuations during Drosophila development.

Knocking out the uORF of bcd increased bcd CDS translation in D. melanogaster

After verifying the uORFs-mediated translational buffering during Drosophila evolution and development, we next aimed to directly explore the biological function of these buffering-capable uORFs in vivo. We first applied stringent criteria to identify uORFs with significant buffering effects on CDS translation between D. melanogaster and D. simulans. Specifically, we looked for 1) uORFs with significant TE changes (| log₂ (β_u)| > 1.5, adjusted P < 0.05), 2) negligible changes in its corresponding CDS translation (|log₂(β_u)| < 0.05, adjusted P > 0.05), and 3) a significant difference between the magnitude of these changes (|log₂(γ)| > 1.5, adjusted P < 0.05). We identified 131 uORF-CDS pairs in 103 genes that meet these criteria in at least one stage/tissue (Table S3), with a majority of the genes (67%, 69 out of 103) from embryonic stages (Fig. S21A), suggesting a crucial role for uORFs in maintaining translational stability during early development. Among these genes, one notable case is the bicoid (bcd) gene, a master regulator of anterior-posterior axis patterning during early embryogenesis ^64–66. The bicoid gene contains a 4-codon uORF (excluding the stop codon) in its 5’ UTR, and branch length score (BLS) analysis ¹⁸ showed that the start codon (uATG) of the bicoid uORF is highly conserved across the Drosophila phylogeny, with a BLS of 0.90 (on a scale from 0 to 1, where higher values indicate greater conservation) (Fig. 5A and S21B). Ribo-Seq data revealed that in 0-2 h embryos, the TE of the bcd uORF varied more than threefold, while the TE of the CDS was virtually the same between D. melanogaster and D. simulans (Fig. 5B and Table S3). This suggests that the bcd uORF buffers translation during early development.

The strong buffering uORF of *bcd* and it knockout.
(A) Multiple sequence alignment of the *bcd* uORF and partial CDS in *D. melanogaster* and 20 other *Drosophila* species. The uORF and CDS are boxed in green and purple, respectively. The start codons of the uORF and CDS are boxed in red. (B) The coverage of mRNA-Seq (top), Ribo-Seq (middle), and TEs (bottom) of the *bcd* uORF and CDS in 0-2 h embryos of *D. melanogaster* (red) and *D. simulans* (blue). The uORF and CDS are denoted at the lower panel with dark green triangles and purple boxes, respectively. The 2 dashed lines mark the CDS region. The uORF TE, CDS TE and their interspecific changes were labeled at the bottom. (C) Genotypes of WT and two uORF knock-out strains (uKO1 and uKO2) generated by CRISPR-Cas9 technology. The uORF is boxed in dark green, and the red ATG represents the start codon of the uORF in the *D. melanogaster* genome.

To investigate the regulatory role of the bcd uORF, we used CRISPR-Cas9 to knock out its start codon in D. melanogaster, generating two mutant homozygotes (uKO1/uKO1 and uKO2/uKO2) with a genetic background matched to that of the wild-type (WT) (Figs. 6A and S22). To determine whether these mutations enhanced bcd CDS translation, we performed ribosome fractionation followed by qPCR ^67–69, comparing bcd mRNA levels in polysome and monosome fractions (P-to-M ratio) from 0-2 h embryos of uKO1/uKO1, uKO2/uKO2, and WT flies (Fig. 6B). A larger P-to-M ratio means more mRNAs are enriched in the polysome fractions and bound by more ribosomes, thus indicative of higher translation efficiency. P-to-M ratios were higher in the mutants compared to WT at 29°C (Fig. 6C), with a similar but less pronounced trend at 25°C, where the difference between uKO2/uKO2 and WT was not statistically significant (Fig. 6C). These findings, along with the known impact of temperature on gene expression and phenotypic plasticity ^70,71, suggest that the regulatory function of the uORF and overall translation efficiency are temperature-sensitive, highlighting a complex interplay between environmental conditions and gene regulation. To further confirm the bcd uORF’s regulatory function, we conducted dual-luciferase reporter assays. The 5’UTR from bcd mutant (uKO1 or uKO2) or WT was cloned into a Renilla luciferase reporter construct (Fig. 6D). Luciferase activity was significantly higher in uKO1 and uKO2 compared to the WT 5’UTR, confirming the repressive role of the bcd uORF.

Knocking out the *bcd* uORF increases CDS translation and perturbs the transcriptome during *D. melanogaster* embryogenesis.
(A) Dual-luciferase assay for *bcd* WT uORF and mutated uORF. The reporter structures of the WT and uORF mutants are illustrated on the left. The uORF mutant sequence was the same as that in the fly mutant created with CRISPR-Cas9 technology. The relative activity of *Renilla* luciferase was normalized to that of firefly luciferase. Error bars represent the S.E. of six biological replicates. Asterisks indicate statistical significance (***, P < 0.001). (B) Two ribosome fractions (monosome and polysome) of 0-2 h embryos were separated in a sucrose density gradient. Relative RNA abundance in the monosome and polysome fractions was quantified by real-time quantitative PCR. (C) P-to-M ratio of *bcd* mRNA (*bcd* mRNA abundance in polysome fraction/*bcd* mRNA abundance in monosome fraction) at 25°C (left) and 29°C (right). The P-to-M ratios of mutants were normalized to WT controls at 25°C and at 29°C, respectively. Error bars represent the S.E. of six biological replicates. Asterisks indicate statistical significance (*, P < 0.05; **, P < 0.01; ***, P < 0.001; n.s., P > 0.05). (D) The number of DEGs in each stage and their intersection with each other at 25°C (top) and 29°C (bottom). (E) Gene ontology analysis of DEGs at 29°C in each stage. The biological process (BP) terms with q-values < 0.05 in each stage are indicated in red and others are indicated in white.

bcd uORF mutants show wide transcriptomic alteration during Drosophila embryogenesis

Since Bcd regulates the expression of many zygotic genes ^64–66, we anticipated that the increased translation of bcd resulting from disrupting the uORF would influence Drosophila transcriptomes and phenotypes. To verify this notion, we performed RNA sequencing on embryos from WT and uKO2/uKO2 flies at four developmental stages (0-2 h, 2-6 h, 6-12 h, and 12-24 h) under both 25°C and 29°C conditions, using two biological replicates (Table S4 and Fig. S23). Differential expression analysis revealed widespread alterations in gene expression between WT and mutant embryos, with the number of differentially expressed genes (DEGs) increasing over developmental time and at higher temperature. At 25°C, we identified 674, 817, 1047, and 2,041 DEGs in 0-2 h, 2-6 h, 6-12 h, and 12-24 h embryos, respectively; while at 29°C, we detected 3,884, 2,358, 2,901, and 4,164 DEGs in the corresponding stages (Fig. 6D). The majority of DEGs were stage-specific, with only a small fraction consistently differentially expressed across all four stages. Functional enrichment analysis of the DEGs revealed distinct biological pathways affected at each stage, including cell morphogenesis and pattern specification in 0-2 h embryos, metabolic processes and tissue development in 2-6 h and 6-12 h embryos, and mitochondrial respiration in 12-24 h embryos (Fig. 6E). Notably, direct targets of Bcd ⁷² were significantly enriched among the DEGs in three out of the four stages, with the exception of 2-6 h embryos (Fig. S24). RT-qPCR validation of 20 target genes of Bcd confirmed the reliability of the RNA-seq differential expression analysis (Fig. S25). Together, these findings demonstrate that disruption of the bcd uORF leads to widespread transcriptional changes during Drosophila development, affecting processes ranging from embryogenesis to postembryonic metabolism.

bcd uORF mutants display decreased hatching rates and starvation resistance

Given the widespread transcriptome alterations, we anticipated phenotypic abnormalities in the bcd uORF mutants. As expected, both uKO1/uKO1 and uKO2/uKO2 mutants exhibited significantly lower hatching rates compared to WT (P = 1.4×10⁻⁶ and 7.9×10⁻⁶, respectively, Wilcoxon rank sum test [WRST]; Fig. 7A). At 25°C, uKO1/uKO1 mutants produced fewer offspring than WT flies (P < 0.001, WRST, Fig. 7B). Given that bcd is a maternal gene, we expected reciprocal crosses between uKO1/uKO1 mutants and WT flies to produce different outcomes. Indeed, crossing uKO1/uKO1 males with WT females resulted in offspring numbers similar to those from WT crosses, while crossing uKO1/uKO1 females with WT males yielded offspring numbers comparable to crosses between uKO1/uKO1 mutants (Fig. 7B). This verified that the reduction in offspring is due to maternal defects in the uKO1/uKO1 mutants. Similar patterns were observed for uKO2/uKO2 mutants at 25°C (Fig. 7B). Notably, the fecundity reduction in uORF-KO mutants was more pronounced at 29°C (Fig. 7C). Furthermore, crossing uKO1/uKO1 and uKO2/uKO2 mutants produced significantly fewer progeny than WT flies (Fig. 7C), ruling out genetic background or off-target effects. Collectively, these data demonstrate that disrupting the bcd uORF significantly impaired hatchability and fertility.

Knockout of the *bcd* uORF reduces offspring number and starvation resistance.
(A) Comparison of the hatching rates (%) of mutant and WT offspring (n=20, Wilcoxon rank sum test; ***, P < 0.001). (B) The offspring number per maternal parent in different crosses over 10 days at 25°C. Asterisks indicate significant differences between various crosses and crosses of WT females with WT males (n=20, Wilcoxon rank sum test; *, P < 0.05; **, P < 0.01; ***, P < 0.001; n.s., P > 0.05). The different crosses were denoted as the x-axis labels. (C) The offspring number per maternal parent in different crosses over 10 days at 29°C. (D) Survival curves of WT and mutant adult flies of females (left) and males (right) under starvation conditions. The black line represents the WT, the red line represents the uKO1/uKO1 mutant, and the blue line represents the uKO2/uKO2 mutant. Asterisks indicate significant differences compared to the WT. (n=200, log-rank test; ***, P < 0.001; n.s., P > 0.05).

We also found both uKO1/uKO1 and uKO2/uKO2 female mutants perished significantly faster than WT flies under starvation conditions (Fig. 7D). Males showed similar tendencies, although the difference was not statistically significant for uKO1/uKO1 mutants (Fig. 7D). These data suggest that the knockout of bcd uORF diminished starvation resistance in adults, likely due to embryogenesis abnormalities induced by the bcd uORF deletion, even in those that successfully developed to adulthood.

Conservation of uORF-mediated translational buffering in primates

To explore the generality of uORF-mediated translational buffering across evolutionary clades, we analyzed previously published transcriptome and translatome data from three tissues (brain, liver, and testis) in humans and macaques ⁷³. We identified 33,680 canonical uORFs in humans and 29,516 in macaques, with 24,385 conserved between the two species. Despite the larger number of uORFs in primates compared to Drosophila due to differences in genome size and gene number, the median TE of conserved uORFs was 1.79 times that of non-conserved uORFs in humans, and the corresponding ratio was 3.43 in macaques (Fig. 8A and Fig. S26). TEs of uORFs were positively correlated between humans and macaques across all tissues (P < 0.001, Fig. 8B). Additionally, significant positive correlations were observed between the TEs of uORFs and their corresponding coding sequences (CDSs) in all tissues (Fig. S27). Although interspecific TE divergence of uORFs (β_u) and CDSs (β_C) were positively correlated (P < 0.001, Fig. 8C), uORFs generally exhibited larger divergence (Table S5). Notably, longer uORFs showed stronger buffering effects on CDS translation, reducing interspecific TE divergence by 12.9% and 38.0% compared to genes with short or no uORFs (Fig. 8D and Fig. S28).

uORFs function as translational buffers in primates.
(A) Boxplots showing the TEs of conserved and species-specific uORFs between Hs (*H. sapiens*) and Mm (*M. mulatta*). Data for the brain is shown as an example. Wilcoxon rank sum tests. ***, P < 0.001. (B) Spearman’s correlation coefficient (*Rho*) of uORFs’ TE between humans and macaques. The *Rho* values in the brain, liver, and testis were shown as bar plots. ***, P < 0.001. Data for the brain is shown as an example in the right panel. (C) Correlation between interspecific uORF TE changes (log₂*β_u*) and corresponding CDS TE changes (log₂*β_C*) in three tissues. The x-axis was divided into 50 equal bins with increasing *β_u*. (D) Genes expressed in brains were classified into three classes according to the total length of translated uORFs. Boxplots showing interspecific CDS TE variability |*log*₂(*β_c*)| of different gene classes. P values were calculated using Wilcoxon rank sum tests between the neighboring groups. ***, P < 0.001. (E) Genes expressed in brains (mRNA RPKM > 0.1 in both species) were classified into three classes according to whether a gene had a conserved and dominantly translated uORF (TE > 0.1) in both species or not. Boxplots showing interspecific CDS TE variability |*log*₂(*β_c*)| of different gene classes. P values were calculated using Wilcoxon rank sum tests between the neighboring groups. ***, P < 0.001. (F) Boxplot showing the coefficients of variation (CVs) of CDS TE among the 69 lymphoblastoid cell lines (LCLs). Expressed genes (mean mRNA RPKM > 0.1) were divided into 20 bins with increased mRNA expression levels. In each bin, the genes were divided into two fractions according to whether the gene had a translated uORF or not. Wilcoxon rank sum tests. *, P < 0.05; **, P < 0.01; ***, P < 0.001.

As in Drosophila, we categorized expressed human genes into three classes based on uORF conservation and translation: Class I, genes with conserved and dominantly translated uORFs in both humans and macaques; Class II, genes with conserved uORFs translated in both species but not dominantly in at least one; and Class III, the remaining expressed genes (Fig. 8E and Fig. S29). Consistent with findings in Drosophila, significant differences in |β_c| between humans and macaques were observed in the order of Class I < II < III, with Class I genes showing 9.8% and 17.1% lower |β_c| values compared to Class II and Class III genes, respectively (Fig. 8E and Fig. S29). These similarities between primates and Drosophila—two clades that diverged over 700 million years ago ⁷⁴ —suggest that uORF-mediated translational buffering is a widespread mechanism for stabilizing gene translation across evolutionary clades.

We also analyzed matched mRNA-Seq and Ribo-Seq data from 69 human lymphoblastoid cell lines ^75,76 to test whether uORFs buffer against translational variability across different individuals within humans. Genes with translated uORFs exhibited, on average, 8.65% lower CVs in CDS TE across individuals than genes without translated uORFs (Fig. 8F), with longer uORFs showing stronger buffering effects (Fig. S30). Collectively, these findings suggest that uORFs play a crucial role in reducing translational variability both across evolutionary clades and within species.

Discussion

Translational control is vital for maintaining protein homeostasis and cellular activities ^77,78. However, the mechanisms underlying the conservation of protein abundance across species remain largely unknown ^5,6. In this study, we extended the ICIER model and conducted simulations to explore uORFs’ regulatory roles in buffering translation during evolution. Our simulations demonstrated that uORF translation reduces variability in downstream CDS translation, with buffering capacity positively correlated with uORF translation efficiency, length, and number. Comparative translatome analyses across developmental stages of two Drosophila species provided evidence that uORFs mitigate interspecific differences in CDS translation. Similar patterns were observed between humans and macaques, and between human individuals, suggesting that uORF-mediated buffering is an evolutionarily conserved mechanism in animals. Additionally, in vivo experiments showed that knocking out the bcd uORF led to aberrant embryogenesis and altered starvation resistance, with more pronounced phenotypic effects at higher temperatures, supporting the role of uORFs in fine-tuning translation and phenotypic outcomes.

While the prevailing consensus on uORFs’ functions is that they repress the translation of downstream CDS by sequestering ribosomes ^34,43–48, recent studies based on suggest uORFs might play essential roles in buffering translational noise and stabilizing protein expression ^43,55,56. While these studies have primarily focused on the role of uORFs in single-gene cases or under stress conditions, our work expands this knowledge by demonstrating that uORFs act as a general mechanism for buffering translational variability on a genome-wide scale across multiple species and developmental stages. Our findings also suggest that uORF-mediated translational buffering is not merely a response to environmental stress but an evolutionarily conserved mechanism integral to maintaining protein homeostasis across species. Organisms can buffer phenotypic variation against environmental or genotypic perturbations through “canalization” ⁷⁹. The discovery that uORFs are crucial for Drosophila development links them to canalization, providing evidence that uORFs reduce interspecific CDS translation differences and enhance phenotypic stability. While previous studies primarily focused on the stabilization of transcriptional level ^80–84 or protein level ^{55,82,85–87}, our study opens new avenues for exploring how organisms maintain phenotypic robustness at the translational level through molecular mechanisms like uORF-mediated buffering. This study lays the groundwork for future research into the evolutionary and functional roles of uORFs, particularly in the context of canalization and adaptive evolution.

The original ICIER model was devised to elucidate how a single uORF can confer resistance to global translation inhibition on its host gene during stress conditions ⁴³. It proposed that an elongating ribosome (80S) causes downstream 40S subunits to dissociate from the mRNA after collision. Our revised model expands this scope to include bidirectional dissociation, where an 80S ribosome can release both upstream and downstream 40S subunits, reflecting recent findings that 80S primarily causes upstream 40S dissociation during 40S/80S collisions ⁵⁶. Mechanistically, increasing the loading rate of 40S ribosomes onto the mRNA (R_in) elevates the density of 40S and 80S ribosomes scanning the uORF, thereby raising the likelihood of 40S-80S collisions and subsequent 40S dissociation. This leads to a reduced flow of 40S ribosomes reaching the CDS relative to the initial increase in 40S at the 5’ end of the mRNA and uORF. Conversely, a decrease in 40S ribosome loading rate reduces the density of scanning ribosomes, diminishing the probability of 40S-80S collisions and 40S dissociation, and partially restoring the flow of 40S ribosomes to the CDS. Overall, the 40S-80S collision mechanism and subsequent 40S dissociation at the uORF moderate downstream CDS translation less than changes in ribosome loading at the 5’ end of the mRNA or uORF. The revised model underscores the analogy of uORFs as “molecular dams” adeptly modulating the stream of ribosomes and cushioning downstream CDS translation against fluctuations. Further, the revised model integrates ribosomal collisions and separations within the CDS located downstream of the uORF, shifting the focus from merely tracking 40S escape rates from the uORF to directly quantifying completed translations by 80S ribosomes at the terminus of both uORFs and CDSs. This methodology provides a more direct and quantitative assessment of CDS translation efficiency, offering new insights into the buffering role of uORFs. Additionally, our study broadens its perspective to account for the potential interactive effects of multiple uORFs within a gene, recognizing the prevalent clustering of these regulatory elements and their combinatorial influence on translational control. In essence, our study presents a nuanced and comprehensive expansion of the ICIER model, paving the way for a deeper understanding of uORFs as pivotal elements in the maintenance of protein synthesis stability under variable translational conditions. However, we realize the existence of alternative mechanisms governing uORF function, such as those found on traditional repression models ⁵⁵ or ribosome queuing and re-initiation strategies ⁵⁶. As such, further computational studies are needed to dissect and understand the full spectrum of uORF-mediated translational regulation.

The bcd gene, crucial for Drosophila embryogenesis, has been well studied ^{65,66,88–92}, but its uORF remains underexplored. Our study showed that deleting the bcd uORF led to abnormal phenotypes and reduced fitness, underscoring the importance of uORF-mediated translation. These effects were more severe under heat stress (29℃) than at normal temperatures (25℃), suggesting uORFs play a vital role in buffering phenotypic variation. Rescue experiments, typically used to confirm that phenotypic changes are due to specific genetic edits, faced challenges in our study and previous uORF-KO experiments in animals ^93,94 and plants ^95–98. We altered the start codon of the bcd uORF to minimize effects on the 5’ UTR, without impacting the coding sequence, making traditional rescue experiments impractical. To control for genetic background variations, we backcrossed two bcd uORF-KO mutant strains with w¹¹¹⁸ flies for nine generations, using w1118 as a control. Both uORF-KO mutants (uKO1/uKO1 and uKO2/uKO2) consistently showed reduced progeny, ruling out background or off-target effects. Crosses between the two mutants produced fewer offspring than WT crosses, confirming that the reduced progeny was due to the uORF deletions, not genetic background. As bcd is maternally inherited, we expected defective phenotypes only in offspring from mutant females crossed with wild-type males. This was confirmed: uKO1/uKO1 males crossed with wild-type females had normal offspring, while uKO1/uKO1 females crossed with wild-type males had reduced offspring, similar to mutant crosses. Similar results were observed for uKO2/uKO2 mutants. These findings confirm that disrupting bcd uORFs significantly reduces hatchability and fertility, effects not attributable to genetic background. Our results highlight the critical role of bcd uORFs in development and fecundity, and provide a basis for future studies on uORF regulation in other genes across species.

Taken together, this study demonstrates the role of uORF-mediated translational buffering in mitigating variability in gene translation during species divergence and development, using large-scale comparative transcriptome and translatome data. Our work addresses gaps in understanding the stabilization of gene expression at the translational level and offers new insights into the evolutionary and functional significance of uORFs.

Materials and Methods

Modeling the uORF-mediated buffering effect on CDS translation

We adapted the stochastic ICIER framework developed by Andreev et al. ⁴³ based on the TASEP model, with several major modifications. First, we concatenated a 500-codon CDS downstream of the uORF. A leaky scanning ribosome from the uORF would initiate translation at the CDS start codon with a probability of I_CDS. The probability of elongating ribosomes moving along the CDS (v_EC= 0.5) in each action was higher than that along uORFs (v_Eu= 0.3) in the model settings, considering that uORFs usually encode blocking peptides ^99–103 or contain stalling codons ^56,104,105. Second, we recorded the number of elongating ribosomes that completed translation at the stop codon of a CDS (N_EC) or uORF (N_Eu) during a given time period and regarded them as proxies for quantifying the CDS or uORF TE. Third, we considered three models for simulating the consequences of scanning ribosomes colliding with elongating ribosomes mentioned above: a downstream dissociation model, an upstream dissociation model, and a double dissociation model.

In detail, the mRNA molecule structure in the simulations consisted of five parts: a 5’-leader before the uORF (150 nucleotides), the uORF (ranging from 2 to 100 codons), a segment between the uORF and CDS (150 nucleotides), the CDS (500 codons) and the 3’UTR (150 nucleotides). The R_in determined the loading rate of the 40S scanning ribosomes on the mRNA 5’-terminus, and it roughly represented the availability of trans translational resources in the cell. We generated two sets of R_in values (Fig. S1A) to simulate the variation in the availability of translational resources (40S scanning ribosomes, etc.): i) 1000 values of R_in were generated from a random generator of a uniform distribution, U (0, 0.1); ii) 1000 values were first generated from a random generator of an exponential distribution, E (1), and then each of the 1000 values was divided by 70 to obtain 1000 values of R_in. I_uORF and I_CDS represent the probability of translation at the start codon of a uORF or CDS, respectively. We used different combinations of I_uORF and I_CDS to test how translational initiation strength influenced the buffering effect of uORFs. v_S determined the scanning rate of the 40S ribosome, and v_Euand v_EC determined the elongation rate of the 80S ribosome at a uORF or CDS, respectively. All the parameters used in our simulation are listed in Table S1.

The mRNA molecule was modeled as an array, and the value of each position in the array represented the occupation status of ribosomes along the mRNA molecule. The process of translation was simulated by a series of discrete actions referred to Andreev et al. ⁴³. Upon each action, there was a probability that certain events would occur, including the addition of a 40S ribosome to the 5’-terminus, the transformation of a 40S ribosome into an 80S ribosome at a CDS or uORF start codon, the movement of a 40S ribosome or 80S ribosome to the next codon in the CDS or uORF, etc. More specific operations in each action were described in Supplementary Materials and Methods. The simulation code is freely available at GitHub: https://github.com/lujlab/uORF_buffer.

After 1,000,000 actions, we recorded the number of 80S ribosomes that completed translation at the uORF (N_Eu) and CDS (N_EC) for each R_in input, which was used to represent the protein production rates (i.e. translation efficiency) of the uORF and CDS, respectively. To test the effects of different factors, we used various combinations of the parameters in Table S1 in the simulations. For a given distribution of R_in input (1000 values) and the dissociation model, we obtained the corresponding 1000 N_EC values and their CV values by calculating the ratio of the standard deviation to the mean of N_EC.

In the double-uORF simulation, we only adopted a uniform distribution of R_in input and the downstream dissociation model. Most parameters and simulation processes were similar to those in the single-uORF simulation, except that an additional uORF was introduced upstream of the CDS with an initiation probability of I_uORF₂.

Annotation of uORFs in D. melanogaster and D. simulans

We downloaded the whole-genome sequence alignment (maf) of D. melanogaster (dm6) and 26 other insect species from the University of California Santa Cruz (UCSC) genome browser (genome.ucsc.edu) ¹⁰⁶. To improve the genome correspondence between D. melanogaster and D. simulans, we adopted a newly published reference-quality genome of D. simulans (NCBI, ASM438218v1) ¹⁰⁷ to replace the original D. simulans genome in UCSC maf. We softly masked repetitive sequences in the new genome by RepeatMasker 4.1.1 (http://www.repeatmasker.org) and aligned it to dm6 with lastz ¹⁰⁸ in runLastzChain.sh following UCSC guidelines. The lastz alignment parameters and the scoring matrix were the same as the parameters of dm6 and droSim1 in UCSC. Chained alignments were processed into nets by the chainNet and netSyntenic programs. The alignment was integrated into the multiple alignments of 27 species with multiz ¹⁰⁹.

Based on the genome annotation of D. melanogaster (FlyBase r6.04, https://flybase.org/), we used the Galaxy platform to parse the multiple sequence alignments of 5’UTRs in Drosophila. The 5’UTR sequences of each annotated transcript from D. melanogaster and the corresponding sequences in D. simulans were extracted from the maf. The start codons of putative uORFs were identified by scanning all the ATG triplets (uATGs) within the 5’UTRs of D. melanogaster and D. simulans. uATGs that overlapped with any annotated CDS region were removed. The presence or absence of each uATG of D. melanogaster was determined at orthologous sites in D. simulans based on multiple genome alignments, and vice versa. The conserved uORF was defined as a uORF where its uATG was present in both D. melanogaster and the corresponding orthologous positions of D. simulans. For each protein-coding gene, we only considered the canonical transcript. For transcripts containing multiple uORFs, we defined the uORF that showed the highest TE as the dominant uORF. The branch length score (BLS) of the uATGs was calculated as we previously described ¹⁸.

Fly materials and general raising conditions

The sim4 strain of D. simulans was used to generate all the libraries of D. simulans in this study. All flies were raised on standard corn medium and grown in 12 h light: 12 h dark cycles at 25 °C for general conditions or at 29 °C for specific experimental design. The samples of embryos at different stages, larva, pupa, bodies and heads were collected following a previous protocol ²³.

Processing Drosophila mRNA-Seq and Ribo-Seq data

Ribo-Seq and matched mRNA-Seq libraries for different developmental stages and tissues of D. simulans were constructed as we previously described ²³ and sequenced on an Illumina HiSeq-2500 sequencer (run type: single-end; read length: 50 nt) according to the manufacturer’s protocol. The 3’ adaptor sequences (TGGAATTCTCGGGTGCCAAGG) were trimmed using Cutadapt 3.0 with default parameters ¹¹⁰, and the NGS reads were mapped to the genomes of yeast, Wolbachia, Drosophila viruses and the sequences of tRNAs, ribosomal RNAs (rRNAs), small nuclear RNAs (snRNAs) or small nucleolar RNAs (snoRNAs) of D. melanogaster (FlyBase r6.04) and D. simulans (FlyBase r1.3) using Bowtie2 version 2.2.3 ¹¹¹ with the parameters -p8 --local -k1. The mapped reads of these genomes/sequences were further removed in the downstream analysis.

After filtering, the mRNA-Seq and Ribo-Seq reads were mapped to the reference genomes of D. melanogaster (FlyBase, r6.04) and D. simulans (NCBI, ASM438218v1) respectively using the Spliced Transcripts Alignment to a Reference (STAR) algorithm ¹¹². For Ribo-Seq reads, we assigned a mapped RPF (27-34 nt in length) to its P-site using the psite script from Plastid ¹¹³. The uniquely mapped reads were extracted and then mapped to the CDS of D. melanogaster and D. simulans respectively using STAR ¹¹². The P-sites of RPF or mRNA-Seq reads that overlapped with a CDS were counted separately. The reads that were not mapped to CDSs were then mapped to the 5’UTRs of D. melanogaster and D. simulans. The P-sites of RPF or mRNA-Seq reads that overlapped with uORFs were counted separately.

The TE of a given uORF or CDS was calculated as the RPKM_P-site/RPKM_mRNA ratio. For a few uORFs with RPKM_mRNA = 0 but RPKM_P-site > 0, a pseudocount of 0.1 was added to both RPKM_mRNA and RPKM_P-site to avoid dividing a positive value by zero. In each sample, expressed genes were defined as the genes with a CDS RPKM_mRNA > 0.1 in both species, and translated uORFs were defined as uORFs with a TE > 0.1.

Testing the statistical significance of the difference in the interspecific TE change between a uORF and its downstream CDS

For this analysis, we adopted the methods developed by Zhang et al. ²³. Briefly, for the samples of female bodies or male bodies of D. melanogaster with biological replicates, we obtained the log₂(TE) and SE of the log₂(TE) of a uORF or CDS by contrasting the RPF counts against mRNA-Seq read counts using DESeq2. Then, we fitted the SE values against the normalized mRNA counts and log₂(TE) values using the gam function in the R package mgcv, with a log link to obtain the SE ∼ mRNA counts + log₂TE functions. For other samples without biological replicates, we estimated the SE of the log₂(TE) for a feature (CDS or uORF) by applying the fitted functions obtained based on the biological replicates of female and male bodies to the observed mRNA counts and log₂(TE) values. We identified uORFs whose TEs differed significantly between the paired samples of D. melanogaster and D. simulans by testing whether the value obtained for log₂ (β_u) = log₂ (TE_uORF,sim) – log₂ (TE_uORF,_mel) was significantly different from 0. Based on the SE of the log₂(TE) derived as described above, the SE of the log₂(β_u) can be derived as follows:

As the Wald statistic, , follows a standard normal distribution under the null hypothesis of log₂ (β_u) = 0, we calculated the P value as follows: . The identification of CDSs whose TEs differed significantly between the paired samples of D. melanogaster and D. simulans was similar to the procedures above.

Then, we defined (where β_C = TE_CDS,sim / TE_CDS,mel) and tested whether log₂ (γ) was significantly different from 0 to determine whether the magnitude of interspecific TE changes in CDSs and uORFs was significantly different. We obtained log₂TE_uORF,sim, log₂TE_oORF,mel, log₂TE_CDS,sim, and log₂TE_CDS,mel as described above and estimated SE_log₂TE_uORF,sim, SE_log₂TE_uORF,mel, SE_log₂TE_CDS,sim, and SE_log₂TE_CDS,mel based on the biological replicates of female and male bodies of D. melanogaster. Finally, log₂(γ) also follows a normal distribution with SE denoted as SE_log₂(γ) =

As the Wald statistic, , follows a standard normal distribution under the null hypothesis that log₂ (β_u) = 0, we calculated the P value as follows: .

Knocking out a uORF with CRISPR-Cas9 technology in D. melanogaster

We searched for possible sgRNA target sites near the uATG start codon of a uORF using the Benchling website (https://www.benchling.com/crispr/) to design optimal single guide RNA (sgRNA) sequences with high specificity and low off-target effects. We then synthesized single-stranded complementary DNAs (ssDNAs) and annealed them to obtain double-stranded DNA (dsDNA), which served as the template for sgRNA expression. The template sequences of the sgRNA used for bcd uATG-KO are listed in Table S6. The dsDNA was then ligated into the BbsI-digested pU6B vector. The pU6B-sgRNA plasmid was purified and injected into the embryos of transgenic Cas9 flies collected within one hour of laying at the Tsinghua Fly Center as described in Ni et al. ¹¹⁴. The injected embryos were kept at 25°C and 60% humidity until adulthood (G0). The G0 adult flies that hatched from injected embryos were individually crossed with other flies (y sc v) to increase the number of offspring. Then, the F1 progeny were crossed with flies carrying an appropriate balancer (Dr, e/TM3, Sb). After F2 spawning, the F1 individuals were screened for mutations of interest by genotyping. The primers used for genotyping are listed in Table S6. The F2 progeny whose parents showed positive genotyping results were then screened for the yellow⁻ gene to separate the chromosome carrying nos-Cas9. The screened F2 males were crossed with the flies containing the same balancer as above. After F2 genotyping, the progeny (F3) of positive F2 individuals showing the same mutation status were crossed individually to generate homozygous mutants in the F4 generation. The original homozygous mutants were sequentially outcrossed with w¹¹¹⁸ flies for 9 generations to purify the genetic background (Fig. S22B).

Ribosome fraction analysis by sucrose gradient fractionation and RT‒qPCR

The 0-2 h embryos obtained from w¹¹¹⁸, uKO1/uKO1, and uKO2/uKO2 mutants raised at 25°C and 29°C were collected and homogenized in a Dounce homogenizer with lysis buffer [50 mM Tris pH 7.5, 150 mM NaCl, 5 mM MgCl₂, 1% Triton X-100, 2 mM dithiothreitol (DTT), 20 U/ml SuperaseIn (Ambion), 0.5 tablets of proteinase inhibitor (Roche), 100 µg/ml emetine (Sigma Aldrich), and 50 µM guanosine 5′-[β,γ-imido]triphosphate trisodium salt hydrate (GMP-PNP) (Sigma Aldrich)] at 4°C. The lysates were clarified by centrifugation at 4°C and 20,000×g for 8 min, and the supernatants were transferred to new 1.5 ml tubes. 10-45% sucrose gradients were prepared in buffer (250 mM NaCl, 50 mM Tris pH 7.5, 15 mM MgCl₂, 0.5 mM DTT, 12 U/ml RNaseOUT, 0.5 tablets of protease inhibitor, and 20 µg/ml emetine) using a Gradient Master (Biocomp Instruments) in ULTRA-CLEAR Thinwall Tubes (Beckman Coulter). A sample volume of up to 500 µl was applied to the top of each gradient. After ultracentrifugation with a Hitachi P40ST rotor at 35,000 × rpm for 3 h at 4°C, the monosome and polysome fractions were collected, flash-frozen in liquid nitrogen, and stored at −80°C until further use.

The RNA in the monosome and polysome fractions was extracted separately using TRIzol reagent (Life Technologies, Inc.) and chloroform (Beijing Chemical Works) following the manufacturer’s instructions and were reverse transcribed into cDNA using the PrimeScript™ II 1st Strand cDNA Synthesis Kit (Takara). RT‒qPCR analysis of bcd cDNA and its targets was performed using PowerUp™ SYBR™ Green Master Mix (Thermo Fisher) following the manufacturer’s instructions with rp49 as an internal control. The primer sequences employed for RT-qPCR are listed in Table S6. For each sample, the ratio of bcd mRNA abundance in the polysome fraction to that in the monosome fraction was calculated as the P-to-M ratio. Six biological replicates were performed for each sample.

Dual-luciferase reporter assays

The wild-type (WT) 5’UTR of bcd was cloned from cDNA by PCR, and uATG mutations were introduced into 5’UTR using specific amplification primers (listed in Table S6). The WT and mutated 5’UTR sequences were ligated into a linearized reporter plasmid (psiCHECK-2 vector, Promega). The whole sequence of all the plasmids was validated by Sanger sequencing.

Drosophila S2 cells were cultured in Schneider’s Insect Medium (Sigma) plus 10% (by volume) heat-inactivated fetal bovine serum, 100 U/ml penicillin and 100 µg/ml streptomycin (Thermo Fisher) at 25℃ without CO₂ for 24 h to reach 1–2×10⁶cells/ml before further treatments. Plasmid transfection was conducted with Lipofectamine 3000 (L3000001, Thermo Fisher) according to the supplier’s protocol. The Renilla luciferase activity associated with WT or uORF-mutated 5’UTRs was measured according to the manual of the Dual-Luciferase Reporter Assay System (Promega) 32 h after transfection and was normalized to the activity of firefly luciferase.

mRNA-Seq in the embryos of mutant and WT flies

We collected 0-2 h, 2-6 h, 6-12 h, and 12-24 h embryos of w¹¹¹⁸ and uKO2/uKO2 mutants raised at 25°C and 29°C and conducted mRNA-Seq. Library construction and sequencing with PE150 were conducted by Annoroad on the Illumina Nova6000 platform. Two biological replicates were sequenced for each sample. The clean data were mapped to the reference genome of D. melanogaster (FlyBase, r6.04) using STAR ¹¹². Reads mapped to the exons of each gene were tabulated with htseq-count ¹¹⁵. The differentially expressed genes were identified using DESeq2 ¹¹⁶. The Gene Ontology (GO) analyses were conducted using the “clusterProfiler” package in R ¹¹⁷.

Measurement of embryo hatchability

We collected embryos from WT and mutant flies and manually seeded them in vials containing standard corn medium at a density of 30 embryos per vial and 20 vials per strain. The embryos were cultivated under standard conditions (60% humidity, 12 h light: 12 h dark cycles at 25°C). We counted the number of pupae in each vial after the completion of pupation and calculated the corresponding hatching rate = number_pupa/ 30

Quantification of offspring number per female fly

Newly hatched virgins were picked out and allowed to mature for two days in separate vials. They were then mated by placing one virgin female with three male flies for two days. After that, each female parent was transferred to a new vial to count the offspring number produced in each 10-day period at 25°C and 29°C. All assays were performed with 20 females per genotype.

Measurement of starvation resistance in adult flies

We selected 3- to 5-day-old adult males and females and placed them in the starvation medium (1.5% agar), with 10 flies per vial and 10 vials for both males and females from each strain. We made observations every 6 h or 12 h to count the number of deaths under starvation conditions until all flies had starved to death. The survival curves were plotted by the ggsurvplot package in R.

mRNA-Seq and Ribo-Seq data analysis in primates

The mRNA-Seq and Ribo-Seq data from the brains, livers, and testes of humans and macaques were downloaded from reference ⁷³ with accession number E-MTAB-7247 (ArrayExpress). The mRNA-Seq and Ribo-Seq data of human lymphoblastoid cell lines (LCL) from Yoruba individuals were downloaded from references ^75,76 under accession numbers GSE61742 and E-GEUV-1. The matched mRNA-Seq and Ribo-Seq libraries of 69 individuals were used.

The uORF annotation and downstream analysis procedures for the human and macaque data were similar to those applied in Drosophila as described above. The differential analysis of translational efficiency in humans and macaques was conducted by Xtail ¹¹⁸. In each pair of human-macaque samples, expressed genes were defined as the genes with a CDS RPKM_mRNA > 0.1 in both species. The translated uORFs in a sample were defined as uORFs with a TE > 0.1. For the human cell line data, expressed genes were defined as genes with a mean CDS RPKM_mRNA > 0.1 across the cell lines, and translated uORFs were defined as uORFs with a mean TE > 0.1

Acknowledgements

We thank Drs. Wei Xie, Weiwei Zhai, and Xionglei He for constructive comments and suggestions. This work was supported by grants from the Ministry of Science and Technology of the People’s Republic of China (2022YFE0132000), the Yunnan Provincial Science and Technology Project at Southwest United Graduate School (202302A0370006), the National Natural Science Foundation of China (32070597) and the Natural Science Foundation of Beijing (5212006). We thank the National Center for Protein Sciences at Peking University for their technical assistance. Some of the analyses were performed on the High-Performance Computing Platform of the Center for Life Sciences.

Declaration of interests

The authors declare no competing interests.

Data availability statement

All deep-sequencing data generated in this study, including single-ended mRNA-Seq and Ribo-Seq data of 10 developmental stages and tissues of Drosophila simulans and paired-end mRNA-Seq data of 0-2 h, 2-6 h, 6-12 h, and 12-24 h Drosophila melanogaster embryos, were deposited in the China National Genomics Data Center Genome Sequence Archive (GSA) under accession numbers CRA003198, CRA007425, and CRA007426. The mRNA-Seq and Ribo-Seq data for the different developmental stages and tissues of Drosophila melanogaster were published in our previous paper ²³ and were deposited in the Sequence Read Archive (SRA) under accession number SRP067542.

Supporting information

Supplementary figures and tables

Supplementary Table 3

References

1.
1. Buccitelli C.
2. Selbach M.
2020mRNAs, proteins and the emerging principles of gene expression controlNature Reviews Genetics 21:630–644https://doi.org/10.1038/s41576-020-0258-4
2.
1. Lee T.I.
2. Young R.A.
2013Transcriptional regulation and its misregulation in diseaseCell 152:1237–1251https://doi.org/10.1016/j.cell.2013.02.014
3.
1. MacNeil L.T.
2. Walhout A.J.M.
2011Gene regulatory networks and the role of robustness and stochasticity in the control of gene expressionGenome Research 21:645–657https://doi.org/10.1101/gr.097378.109
4.
1. Hill M.S.
2. Vande Zande P.
3. Wittkopp P.J.
2021Molecular and evolutionary processes generating variation in gene expressionNat Rev Genet 22:203–215https://doi.org/10.1038/s41576-020-00304-w
5.
1. Vogel C.
2013Protein Expression Under PressureScience 342:1052–1053https://doi.org/10.1126/science.1247833
6.
1. Signor S.A.
2. Nuzhdin S.V.
2018The Evolution of Gene Expression in cis and transTrends Genet 34:532–544https://doi.org/10.1016/j.tig.2018.03.007
7.
1. Artieri C.G.
2. Fraser H.B.
2014Evolution at two levels of gene expression in yeastGenome Res 24:411–421https://doi.org/10.1101/gr.165522.113
8.
1. McManus C.J.
2. May G.E.
3. Spealman P.
4. Shteyman A.
2014Ribosome profiling reveals post-transcriptional buffering of divergent gene expression in yeastGenome Res 24:422–430https://doi.org/10.1101/gr.164996.113
9.
1. Wang Z.
2. Sun X.
3. Zhao Y.
4. Guo X.
5. Jiang H.
6. Li H.
7. Gu Z.
2015Evolution of gene regulation during transcription and translationGenome Biol Evol 7:1155–1167https://doi.org/10.1093/gbe/evv059
10.
1. Khan Z.
2. Ford M.J.
3. Cusanovich D.A.
4. Mitrano A.
5. Pritchard J.K.
6. Gilad Y.
2013Primate transcript and protein expression levels evolve under compensatory selection pressuresScience 342:1100–1104https://doi.org/10.1126/science.1242379
11.
1. Wang S.H.
2. Hsiao C.J.
3. Khan Z.
4. Pritchard J.K.
2018Post-translational buffering leads to convergent protein expression levels between primatesGenome Biol 19:83https://doi.org/10.1186/s13059-018-1451-z
12.
1. Schrimpf S.P.
2. Weiss M.
3. Reiter L.
4. Ahrens C.H.
5. Jovanovic M.
6. Malmström J.
7. Brunner E.
8. Mohanty S.
9. Lercher M.J.
10. Hunziker P.E.
11. et al.
2009Comparative functional analysis of the Caenorhabditis elegans and Drosophila melanogaster proteomesPLoS Biol 7:e48https://doi.org/10.1371/journal.pbio.1000048
13.
1. Kusnadi E.P.
2. Timpone C.
3. Topisirovic I.
4. Larsson O.
5. Furic L.
2022Regulation of gene expression via translational bufferingBiochim Biophys Acta Mol Cell Res 1869:119140https://doi.org/10.1016/j.bbamcr.2021.119140
14.
1. Laurent J.M.
2. Vogel C.
3. Kwon T.
4. Craig S.A.
5. Boutz D.R.
6. Huse H.K.
7. Nozue K.
8. Walia H.
9. Whiteley M.
10. Ronald P.C.
11. Marcotte E.M.
2010Protein abundances are more conserved than mRNA abundances across diverse taxaProteomics 10:4209–4212https://doi.org/10.1002/pmic.201000327
15.
1. Teixeira F.K.
2. Lehmann R.
2019Translational Control during Developmental TransitionsCold Spring Harb Perspect Biol 11https://doi.org/10.1101/cshperspect.a032987
16.
1. Jackson R.J.
2. Hellen C.U.
3. Pestova T.V.
2010The mechanism of eukaryotic translation initiation and principles of its regulationNat Rev Mol Cell Biol 11:113–127https://doi.org/10.1038/nrm2838
17.
1. Sonenberg N.
2. Hinnebusch A.G.
2009Regulation of translation initiation in eukaryotes: mechanisms and biological targetsCell 136:731–745https://doi.org/10.1016/j.cell.2009.01.042
18.
1. Zhang H.
2. Wang Y.
3. Wu X.
4. Tang X.
5. Wu C.
6. Lu J.
2021Determinants of genome-wide distribution and evolution of uORFs in eukaryotesNature Communications 12:1076https://doi.org/10.1038/s41467-021-21394-y
19.
1. Zhang H.
2. Wang Y.
3. Lu J.
2019Function and Evolution of Upstream ORFs in EukaryotesTrends Biochem Sci 44:782–794https://doi.org/10.1016/j.tibs.2019.03.002
20.
1. Neafsey D.E.
2. Galagan J.E.
2007Dual modes of natural selection on upstream open reading framesMol Biol Evol 24:1744–1751https://doi.org/10.1093/molbev/msm093
21.
1. Churbanov A.
2. Rogozin I.B.
3. Babenko V.N.
4. Ali H.
5. Koonin E.V.
2005Evolutionary conservation suggests a regulatory function of AUG triplets in 5’-UTRs of eukaryotic genesNucleic Acids Res 33:5512–5520https://doi.org/10.1093/nar/gki847
22.
1. Resch A.M.
2. Ogurtsov A.Y.
3. Rogozin I.B.
4. Shabalina S.A.
5. Koonin E.V.
2009Evolution of alternative and constitutive regions of mammalian 5’UTRsBMC Genomics 10:162https://doi.org/10.1186/1471-2164-10-162
23.
1. Zhang H.
2. Dou S.
3. He F.
4. Luo J.
5. Wei L.
6. Lu J.
2018Genome-wide maps of ribosomal occupancy provide insights into adaptive evolution and regulatory roles of uORFs during Drosophila developmentPLoS biology 16:e2003903
24.
1. Malzer E.
2. Szajewska-Skuta M.
3. Dalton L.E.
4. Thomas S.E.
5. Hu N.
6. Skaer H.
7. Lomas D.A.
8. Crowther D.C.
9. Marciniak S.J.
2013Coordinate regulation of eIF2alpha phosphorylation by PPP1R15 and GCN2 is required during Drosophila developmentJ Cell Sci 126:1406–1415https://doi.org/10.1242/jcs.117614
25.
1. Komonyi O.
2. Papai G.
3. Enunlu I.
4. Muratoglu S.
5. Pankotai T.
6. Kopitova D.
7. Maroy P.
8. Udvardy A.
9. Boros I.
2005DTL, the Drosophila homolog of PIMT/Tgs1 nuclear receptor coactivator-interacting protein/RNA methyltransferase, has an essential role in developmentJ Biol Chem 280:12397–12404https://doi.org/10.1074/jbc.M409251200
26.
1. Medenbach J.
2. Seiler M.
3. Hentze M.W.
2011Translational control via protein-regulated upstream open reading framesCell 145:902–913https://doi.org/10.1016/j.cell.2011.05.005
27.
1. Chen J.
2. Tresenrider A.
3. Chia M.
4. McSwiggen D.T.
5. Spedale G.
6. Jorgensen V.
7. Liao H.
8. van Werven F.J.
9. Unal E.
2017Kinetochore inactivation by expression of a repressive mRNAElife 6https://doi.org/10.7554/eLife.27417
28.
1. Cheng Z.
2. Otto G.M.
3. Powers E.N.
4. Keskin A.
5. Mertins P.
6. Carr S.A.
7. Jovanovic M.
8. Brar G.A.
2018Pervasive, Coordinated Protein-Level Changes Driven by Transcript Isoform Switching during MeiosisCell 172:910–923https://doi.org/10.1016/j.cell.2018.01.035
29.
1. Kurihara Y.
2. Makita Y.
3. Kawashima M.
4. Fujita T.
5. Iwasaki S.
6. Matsui M.
2018Transcripts from downstream alternative transcription start sites evade uORF-mediated inhibition of gene expression in ArabidopsisProc Natl Acad Sci U S A 115:7831–7836https://doi.org/10.1073/pnas.1804971115
30.
1. Yang Y.F.
2. Zhang X.
3. Ma X.
4. Zhao T.
5. Sun Q.
6. Huan Q.
7. Wu S.
8. Du Z.
9. Qian W.
2017Trans-splicing enhances translational efficiency in C. elegansGenome Res 27:1525–1535https://doi.org/10.1101/gr.202150.115
31.
1. Wiestner A.
2. Schlemper R.J.
3. van der Maas A.P.
4. Skoda R.C.
1998An activating splice donor mutation in the thrombopoietin gene causes hereditary thrombocythaemiaNat Genet 18:49–52https://doi.org/10.1038/ng0198-49
32.
1. Liu L.
2. Dilworth D.
3. Gao L.
4. Monzon J.
5. Summers A.
6. Lassam N.
7. Hogg D.
1999Mutation of the CDKN2A 5’ UTR creates an aberrant initiation codon and predisposes to melanomaNat Genet 21:128–132https://doi.org/10.1038/5082
33.
1. Wen Y.
2. Liu Y.
3. Xu Y.
4. Zhao Y.
5. Hua R.
6. Wang K.
7. Sun M.
8. Li Y.
9. Yang S.
10. Zhang X.J.
11. et al.
2009Loss-of-function mutations of an inhibitory upstream ORF in the human hairless transcript cause Marie Unna hereditary hypotrichosisNat Genet 41:228–233https://doi.org/10.1038/ng.276
34.
1. Calvo S.E.
2. Pagliarini D.J.
3. Mootha V.K.
2009Upstream open reading frames cause widespread reduction of protein expression and are polymorphic among humansProc Natl Acad Sci U S A 106:7507–7512https://doi.org/10.1073/pnas.0810916106
35.
1. Lee D.S.M.
2. Park J.
3. Kromer A.
4. Baras A.
5. Rader D.J.
6. Ritchie M.D.
7. Ghanem L.R.
8. Barash Y.
2021Disrupting upstream translation in mRNAs is associated with human diseaseNat Commun 12:1515https://doi.org/10.1038/s41467-021-21812-1
36.
1. Costa-Mattioli M.
2. Walter P.
2020The integrated stress response: From mechanism to diseaseScience 368https://doi.org/10.1126/science.aat5314
37.
1. Hinnebusch A.G.
2005Translational regulation of GCN4 and the general amino acid control of yeastAnnu Rev Microbiol 59:407–450https://doi.org/10.1146/annurev.micro.59.031805.133833
38.
1. Lu P.D.
2. Harding H.P.
3. Ron D.
2004Translation reinitiation at alternative open reading frames regulates gene expression in an integrated stress responseJ Cell Biol 167:27–33https://doi.org/10.1083/jcb.200408003
39.
1. Bohlen J.
2. Harbrecht L.
3. Blanco S.
4. Clemm von Hohenberg K.
5. Fenzl K.
6. Kramer G.
7. Bukau B.
8. Teleman A.A.
2020DENR promotes translation reinitiation via ribosome recycling to drive expression of oncogenes including ATF4Nat Commun 11:4676https://doi.org/10.1038/s41467-020-18452-2
40.
1. Vasudevan D.
2. Neuman S.D.
3. Yang A.
4. Lough L.
5. Brown B.
6. Bashirullah A.
7. Cardozo T.
8. Ryoo H.D.
2020Translational induction of ATF4 during integrated stress response requires noncanonical initiation factors eIF2D and DENRNat Commun 11:4677https://doi.org/10.1038/s41467-020-18453-1
41.
1. Vattem K.M.
2. Wek R.C.
2004Reinitiation involving upstream ORFs regulates ATF4 mRNA translation in mammalian cellsProc Natl Acad Sci U S A 101:11269–11274https://doi.org/10.1073/pnas.0400541101
42.
1. Andreev D.E.
2. O’Connor P.B.
3. Fahey C.
4. Kenny E.M.
5. Terenin I.M.
6. Dmitriev S.E.
7. Cormican P.
8. Morris D.W.
9. Shatsky I.N.
10. Baranov P.V.
2015Translation of 5’ leaders is pervasive in genes resistant to eIF2 repressionElife 4:e03971https://doi.org/10.7554/eLife.03971
43.
1. Andreev D.E.
2. Arnold M.
3. Kiniry S.J.
4. Loughran G.
5. Michel A.M.
6. Rachinskii D.
7. Baranov P.V.
2018TASEP modelling provides a parsimonious explanation for the ability of a single uORF to derepress translation during the integrated stress responseElife 7https://doi.org/10.7554/eLife.32563
44.
1. Kozak M.
1989The scanning model for translation: an updateJ Cell Biol 108:229–241https://doi.org/10.1083/jcb.108.2.229
45.
1. Johnstone T.G.
2. Bazzini A.A.
3. Giraldez A.J.
2016Upstream ORFs are prevalent translational repressors in vertebratesEMBO J 35:706–723https://doi.org/10.15252/embj.201592759
46.
1. Hinnebusch A.G.
2. Ivanov I.P.
3. Sonenberg N.
2016Translational control by 5’-untranslated regions of eukaryotic mRNAsScience 352:1413–1416https://doi.org/10.1126/science.aad9868
47.
1. Morris D.R.
2. Geballe A.P.
2000Upstream open reading frames as regulators of mRNA translationMol Cell Biol 20:8635–8642https://doi.org/10.1128/MCB.20.23.8635-8642.2000
48.
1. Young S.K.
2. Wek R.C.
2016Upstream Open Reading Frames Differentially Regulate Gene-specific Translation in the Integrated Stress ResponseJ Biol Chem 291:16927–16935https://doi.org/10.1074/jbc.R116.733899
49.
1. Dever T.E.
2. Feng L.
3. Wek R.C.
4. Cigan A.M.
5. Donahue T.F.
6. Hinnebusch A.G.
1992Phosphorylation of initiation factor 2 alpha by protein kinase GCN2 mediates gene-specific translational control of GCN4 in yeastCell 68:585–596https://doi.org/10.1016/0092-8674(92)90193-g
50.
1. Palam L.R.
2. Baird T.D.
3. Wek R.C.
2011Phosphorylation of eIF2 facilitates ribosomal bypass of an inhibitory upstream ORF to enhance CHOP translationJ Biol Chem 286:10939–10949https://doi.org/10.1074/jbc.M110.216093
51.
1. Baird T.D.
2. Palam L.R.
3. Fusakio M.E.
4. Willy J.A.
5. Davis C.M.
6. McClintick J.N.
7. Anthony T.G.
8. Wek R.C.
2014Selective mRNA translation during eIF2 phosphorylation induces expression of IBTKalphaMol Biol Cell 25:1686–1697https://doi.org/10.1091/mbc.E14-02-0704
52.
1. Chen Y.J.
2. Tan B.C.
3. Cheng Y.Y.
4. Chen J.S.
5. Lee S.C.
2010Differential regulation of CHOP translation by phosphorylated eIF4E under stress conditionsNucleic Acids Res 38:764–777https://doi.org/10.1093/nar/gkp1034
53.
1. Fraser H.B.
2. Hirsh A.E.
3. Giaever G.
4. Kumm J.
5. Eisen M.B.
2004Noise minimization in eukaryotic gene expressionPLoS Biol 2:e137https://doi.org/10.1371/journal.pbio.0020137
54.
1. Thattai M.
2. van Oudenaarden A.
2001Intrinsic noise in gene regulatory networksProc Natl Acad Sci U S A 98:8614–8619https://doi.org/10.1073/pnas.151588598
55.
1. Wu H.W.
2. Fajiculay E.
3. Wu J.F.
4. Yan C.S.
5. Hsu C.P.
6. Wu S.H.
2022Noise reduction by upstream open reading framesNat Plants 8:474–480https://doi.org/10.1038/s41477-022-01136-8
56.
1. Bottorff T.A.
2. Park H.
3. Geballe A.P.
4. Subramaniam A.R.
2022Translational buffering by ribosome stalling in upstream open reading framesPLoS Genet 18:e1010460https://doi.org/10.1371/journal.pgen.1010460
57.
1. Ciandrini L.
2. Stansfield I.
3. Romano M.C.
2010Role of the particle’s stepping cycle in an asymmetric exclusion process: a model of mRNA translationPhys Rev E Stat Nonlin Soft Matter Phys 81:051904https://doi.org/10.1103/PhysRevE.81.051904
58.
1. Reuveni S.
2. Meilijson I.
3. Kupiec M.
4. Ruppin E.
5. Tuller T.
2011Genome-scale analysis of translation elongation with a ribosome flow modelPLoS Comput Biol 7:e1002127https://doi.org/10.1371/journal.pcbi.1002127
59.
1. von der Haar T.
2012Mathematical and Computational Modelling of Ribosomal Movement and Protein Synthesis: an overviewComput Struct Biotechnol J 1:e201204002https://doi.org/10.5936/csbj.201204002
60.
1. Zhao Y.B.
2. Krishnan J.
2014mRNA translation and protein synthesis: an analysis of different modelling methodologies and a new PBN based approachBMC Syst Biol 8:25https://doi.org/10.1186/1752-0509-8-25
61.
1. Tamura K.
2. Subramanian S.
3. Kumar S.
2004Temporal Patterns of Fruit Fly (Drosophila) Evolution Revealed by Mutation ClocksMolecular Biology and Evolution 21:36–44https://doi.org/10.1093/molbev/msg236
62.
1. Kronja I.
2. Yuan B.
3. Eichhorn S.W.
4. Dzeyk K.
5. Krijgsveld J.
6. Bartel D.P.
7. Orr-Weaver T.L.
2014Widespread changes in the posttranscriptional landscape at the Drosophila oocyte-to-embryo transitionCell Rep 7:1495–1508https://doi.org/10.1016/j.celrep.2014.05.002
63.
1. Qin X.
2. Ahn S.
3. Speed T.P.
4. Rubin G.M.
2007Global analyses of mRNA translational control during early Drosophila embryogenesisGenome Biol 8:R63https://doi.org/10.1186/gb-2007-8-4-r63
64.
1. Berleth T.
2. Burri M.
3. Thoma G.
4. Bopp D.
5. Richstein S.
6. Frigerio G.
7. Noll M.
8. Nusslein-Volhard C.
1988The role of localization of bicoid RNA in organizing the anterior pattern of the Drosophila embryoEMBO J 7:1749–1756
65.
1. Driever W.
2. Nusslein-Volhard C.
1988A gradient of bicoid protein in Drosophila embryosCell 54:83–93https://doi.org/10.1016/0092-8674(88)90182-1
66.
1. Driever W.
2. Nusslein-Volhard C.
1988The bicoid protein determines position in the Drosophila embryo in a concentration-dependent mannerCell 54:95–104https://doi.org/10.1016/0092-8674(88)90183-3
67.
1. Abdelmohsen K.
2. Panda A.C.
3. Kang M.J.
4. Guo R.
5. Kim J.
6. Grammatikakis I.
7. Yoon J.H.
8. Dudekula D.B.
9. Noh J.H.
10. Yang X.
11. et al.
20147SL RNA represses p53 translation by competing with HuRNucleic Acids Res 42:10099–10111https://doi.org/10.1093/nar/gku686
68.
1. Panda A.C.
2. Abdelmohsen K.
3. Martindale J.L.
4. Di Germanio C.
5. Yang X.
6. Grammatikakis I.
7. Noh J.H.
8. Zhang Y.
9. Lehrmann E.
10. Dudekula D.B.
11. et al.
2016Novel RNA-binding activity of MYF5 enhances Ccnd1/Cyclin D1 mRNA translation during myogenesisNucleic Acids Res 44:2393–2408https://doi.org/10.1093/nar/gkw023
69.
1. Panda A.C.
2. Martindale J.L.
3. Gorospe M.
2017Polysome Fractionation to Analyze mRNA Distribution ProfilesBio Protoc 7https://doi.org/10.21769/BioProtoc.2126
70.
1. Amourda C.
2. Chong J.
3. Saunders T.E.
2018MicroRNAs buffer genetic variation at specific temperatures during embryonic developmentbioRxiv 444810https://doi.org/10.1101/444810
71.
1. Lu G.
2. Zhao Y.
3. Chen Q.
4. Lin P.
5. Tang T.
6. Tang Z.
7. Liufu Z.
8. Wu C.-I.
2021When development is constantly but weakly perturbed - Canalization by microRNAsbioRxiv :2021.2009.2004.458966https://doi.org/10.1101/2021.09.04.458966
72.
1. Li X.Y.
2. MacArthur S.
3. Bourgon R.
4. Nix D.
5. Pollard D.A.
6. Iyer V.N.
7. Hechmer A.
8. Simirenko L.
9. Stapleton M.
10. Luengo Hendriks C.L.
11. et al.
2008Transcription factors bind thousands of active and inactive regions in the Drosophila blastodermPLoS Biol 6:e27https://doi.org/10.1371/journal.pbio.0060027
73.
1. Wang Z.Y.
2. Leushkin E.
3. Liechti A.
4. Ovchinnikova S.
5. Mößinger K.
6. Brüning T.
7. Rummel C.
8. Grützner F.
9. Cardoso-Moreira M.
10. Janich P.
11. et al.
2020Transcriptome and translatome co-evolution in mammalsNature 588:642–647https://doi.org/10.1038/s41586-020-2899-z
74.
1. Hedges S.B.
2. Dudley J.
3. Kumar S.
2006TimeTree: a public knowledge-base of divergence times among organismsBioinformatics 22:2971–2972https://doi.org/10.1093/bioinformatics/btl505
75.
1. Battle A.
2. Khan Z.
3. Wang S.H.
4. Mitrano A.
5. Ford M.J.
6. Pritchard J.K.
7. Gilad Y.
2015Genomic variation. Impact of regulatory variation from RNA to proteinScience 347:664–667https://doi.org/10.1126/science.1260793
76.
1. Lappalainen T.
2. Sammeth M.
3. Friedlander M.R.
4. t Hoen P.A.
5. Monlong J.
6. Rivas M.A.
7. Gonzalez-Porta M.
8. Kurbatova N.
9. Griebel T.
10. Ferreira P.G.
11. et al.
2013Transcriptome and genome sequencing uncovers functional variation in humansNature 501:506–511https://doi.org/10.1038/nature12531
77.
1. Stein K.C.
2. Frydman J.
2019The stop-and-go traffic regulating protein biogenesis: How translation kinetics controls proteostasisJournal of Biological Chemistry 294:2076–2084https://doi.org/10.1074/jbc.REV118.002814
78.
1. Sherman M.Y.
2. Qian S.B.
2013Less is more: improving proteostasis by translation slow downTrends Biochem Sci 38:585–591https://doi.org/10.1016/j.tibs.2013.09.003
79.
1. Waddington C.H.
1942CANALIZATION OF DEVELOPMENT AND THE INHERITANCE OF ACQUIRED CHARACTERSNature 150:563–565https://doi.org/10.1038/150563a0
80.
1. Payne J.L.
2. Wagner A.
2015Mechanisms of mutational robustness in transcriptional regulationFront Genet 6:322https://doi.org/10.3389/fgene.2015.00322
81.
1. Denby C.M.
2. Im J.H.
3. Yu R.C.
4. Pesce C.G.
5. Brem R.B.
2012Negative feedback confers mutational robustness in yeast transcription factor regulationProc Natl Acad Sci U S A 109:3874–3878https://doi.org/10.1073/pnas.1116360109
82.
1. Jarosz D.F.
2. Taipale M.
3. Lindquist S.
2010Protein homeostasis and the phenotypic manifestation of genetic diversity: principles and mechanismsAnnu Rev Genet 44:189–216https://doi.org/10.1146/annurev.genet.40.110405.090412
83.
1. Ebert M.S.
2. Sharp P.A.
2012Roles for microRNAs in conferring robustness to biological processesCell 149:515–524https://doi.org/10.1016/j.cell.2012.04.005
84.
1. Lu G.A.
2. Zhang J.
3. Zhao Y.
4. Chen Q.
5. Lin P.
6. Tang T.
7. Tang Z.
8. Wen H.
9. Liufu Z.
10. Wu C.I.
2023Canalization of Phenotypes-When the Transcriptome is Constantly but Weakly PerturbedMol Biol Evol 40https://doi.org/10.1093/molbev/msad005
85.
1. Alon U.
2007Network motifs: theory and experimental approachesNat Rev Genet 8:450–461https://doi.org/10.1038/nrg2102
86.
1. Somogyvari M.
2. Khatatneh S.
3. Soti C.
2022Hsp90: From Cellular to Organismal ProteostasisCells 11https://doi.org/10.3390/cells11162479
87.
1. Zabinsky R.A.
2. Mason G.A.
3. Queitsch C.
4. Jarosz D.F.
2019It’s not magic - Hsp90 and its effects on genetic and epigenetic variationSemin Cell Dev Biol 88:21–35https://doi.org/10.1016/j.semcdb.2018.05.015
88.
1. Cho P.F.
2. Poulin F.
3. Cho-Park Y.A.
4. Cho-Park I.B.
5. Chicoine J.D.
6. Lasko P.
7. Sonenberg N.
2005A new paradigm for translational control: inhibition via 5’-3’ mRNA tethering by Bicoid and the eIF4E cognate 4EHPCell 121:411–423https://doi.org/10.1016/j.cell.2005.02.024
89.
1. Singh A.P.
2. Wu P.
3. Ryabichko S.
4. Raimundo J.
5. Swan M.
6. Wieschaus E.
7. Gregor T.
8. Toettcher J.E.
2022Optogenetic control of the Bicoid morphogen reveals fast and slow modes of gap gene regulationCell Rep 38:110543https://doi.org/10.1016/j.celrep.2022.110543
90.
1. Hannon C.E.
2. Blythe S.A.
3. Wieschaus E.F.
2017Concentration dependent chromatin states induced by the bicoid morphogen gradientElife 6https://doi.org/10.7554/eLife.28275
91.
1. Struhl G.
2. Struhl K.
3. Macdonald P.M.
1989The gradient morphogen bicoid is a concentration-dependent transcriptional activatorCell 57:1259–1273https://doi.org/10.1016/0092-8674(89)90062-7
92.
1. Dubnau J.
2. Struhl G.
1996RNA recognition and translational regulation by a homeodomain proteinNature 379:694–699https://doi.org/10.1038/379694a0
93.
1. Wethmar K.
2. Begay V.
3. Smink J.J.
4. Zaragoza K.
5. Wiesenthal V.
6. Dorken B.
7. Calkhoven C.F.
8. Leutz A.
2010C/EBPbetaDeltauORF mice--a genetic model for uORF-mediated translational control in mammalsGenes Dev 24:15–20https://doi.org/10.1101/gad.557910
94.
1. Miyake T.
2. Inoue Y.
3. Shao X.
4. Seta T.
5. Aoki Y.
6. Nguyen Pham K.T.
7. Shichino Y.
8. Sasaki J.
9. Sasaki T.
10. Ikawa M.
11. et al.
2023Minimal upstream open reading frame of Per2 mediates phase fitness of the circadian clock to day/night physiological body temperature rhythmCell Rep 42:112157https://doi.org/10.1016/j.celrep.2023.112157
95.
1. Xing S.
2. Chen K.
3. Zhu H.
4. Zhang R.
5. Zhang H.
6. Li B.
7. Gao C.
2020Fine- tuning sugar content in strawberryGenome Biol 21:230https://doi.org/10.1186/s13059-020-02146-5
96.
1. Zhang H.
2. Si X.
3. Ji X.
4. Fan R.
5. Liu J.
6. Chen K.
7. Wang D.
8. Gao C.
2018Genome editing of upstream open reading frames enables translational control in plantsNat Biotechnol 36:894–898https://doi.org/10.1038/nbt.4202
97.
1. Si X.
2. Zhang H.
3. Wang Y.
4. Chen K.
5. Gao C.
2020Manipulating gene translation in plants by CRISPR-Cas9-mediated genome editing of upstream open reading framesNat Protoc 15:338–363https://doi.org/10.1038/s41596-019-0238-3
98.
1. Xue C.
2. Qiu F.
3. Wang Y.
4. Li B.
5. Zhao K.T.
6. Chen K.
7. Gao C.
2023Tuning plant phenotypes by precise, graded downregulation of gene expressionNat Biotechnol https://doi.org/10.1038/s41587-023-01707-w
99.
1. Ivanov I.P.
2. Shin B.S.
3. Loughran G.
4. Tzani I.
5. Young-Baird S.K.
6. Cao C.
7. Atkins J.F.
8. Dever T.E.
2018Polyamine Control of Translation Elongation Regulates Start Site Selection on Antizyme Inhibitor mRNA via Ribosome QueuingMol Cell 70:254–264https://doi.org/10.1016/j.molcel.2018.03.015
100.
1. Luo Z.
2. Sachs M.S.
1996Role of an upstream open reading frame in mediating arginine-specific translational control in Neurospora crassaJ Bacteriol 178:2172–2177https://doi.org/10.1128/jb.178.8.2172-2177.1996
101.
1. Lovett P.S.
2. Rogers E.J.
1996Ribosome regulation by the nascent peptideMicrobiol Rev 60:366–385https://doi.org/10.1128/mr.60.2.366-385.1996
102.
1. Vilela C.
2. McCarthy J.E.
2003Regulation of fungal gene expression via short open reading frames in the mRNA 5’untranslated regionMol Microbiol 49:859–867https://doi.org/10.1046/j.1365-2958.2003.03622.x
103.
1. Raney A.
2. Law G.L.
3. Mize G.J.
4. Morris D.R.
2002Regulated translation termination at the upstream open reading frame in s-adenosylmethionine decarboxylase mRNAJ Biol Chem 277:5988–5994https://doi.org/10.1074/jbc.M108375200
104.
1. Lin Y.
2. May G.E.
3. Kready H.
4. Nazzaro L.
5. Mao M.
6. Spealman P.
7. Creeger Y.
8. McManus C.J.
2019Impacts of uORF codon identity and position on translation regulationNucleic Acids Res 47:9358–9367https://doi.org/10.1093/nar/gkz681
105.
1. Meijer H.A.
2. Thomas A.A.
2003Ribosomes stalling on uORF1 in the Xenopus Cx41 5’ UTR inhibit downstream translation initiationNucleic Acids Res 31:3174–3184https://doi.org/10.1093/nar/gkg429
106.
1. Rosenbloom K.R.
2. Armstrong J.
3. Barber G.P.
4. Casper J.
5. Clawson H.
6. Diekhans M.
7. Dreszer T.R.
8. Fujita P.A.
9. Guruvadoo L.
10. Haeussler M.
11. et al.
2015The UCSC Genome Browser database: 2015 updateNucleic Acids Res 43:D670–681https://doi.org/10.1093/nar/gku1177
107.
1. Chakraborty M.
2. Chang C.H.
3. Khost D.E.
4. Vedanayagam J.
5. Adrion J.R.
6. Liao Y.
7. Montooth K.L.
8. Meiklejohn C.D.
9. Larracuente A.M.
10. Emerson J.J.
2021Evolution of genome structure in the Drosophila simulans species complexGenome Res 31:380–396https://doi.org/10.1101/gr.263442.120
108.
1. Chiaromonte F.
2. Yap V.B.
3. Miller W.
2002Scoring pairwise genomic sequence alignmentsPac Symp Biocomput :115–126https://doi.org/10.1142/9789812799623_0012
109.
1. Blanchette M.
2. Kent W.J.
3. Riemer C.
4. Elnitski L.
5. Smit A.F.
6. Roskin K.M.
7. Baertsch R.
8. Rosenbloom K.
9. Clawson H.
10. Green E.D.
11. et al.
2004Aligning multiple genomic sequences with the threaded blockset alignerGenome Res 14:708–715https://doi.org/10.1101/gr.1933104
110.
1. Martin M.
2011Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnetjournal 17:10–12
111.
1. Langmead B.
2. Salzberg S.L.
2012Fast gapped-read alignment with Bowtie 2Nat Methods 9:357–359https://doi.org/10.1038/nmeth.1923
112.
1. Dobin A.
2. Davis C.A.
3. Schlesinger F.
4. Drenkow J.
5. Zaleski C.
6. Jha S.
7. Batut P.
8. Chaisson M.
9. Gingeras T.R.
2013STAR: ultrafast universal RNA-seq alignerBioinformatics 29:15–21
113.
1. Dunn J.G.
2. Weissman J.S.
2016Plastid: nucleotide-resolution analysis of next- generation sequencing and genomics dataBMC Genomics 17:958https://doi.org/10.1186/s12864-016-3278-x
114.
1. Ni J.Q.
2. Zhou R.
3. Czech B.
4. Liu L.P.
5. Holderbaum L.
6. Yang-Zhou D.
7. Shim H.S.
8. Tao R.
9. Handler D.
10. Karpowicz P.
11. et al.
2011A genome-scale shRNA resource for transgenic RNAi in DrosophilaNat Methods 8:405–407https://doi.org/10.1038/nmeth.1592
115.
1. Anders S.
2. Pyl P.T.
3. Huber W.
2015HTSeq--a Python framework to work with high-throughput sequencing dataBioinformatics 31:166–169https://doi.org/10.1093/bioinformatics/btu638
116.
1. McCarthy D.J.
2. Chen Y.
3. Smyth G.K.
2012Differential expression analysis of multifactor RNA-Seq experiments with respect to biological variationNucleic Acids Res 40:4288–4297https://doi.org/10.1093/nar/gks042
117.
1. Yu G.
2. Wang L.G.
3. Han Y.
4. He Q.Y.
2012clusterProfiler: an R package for comparing biological themes among gene clustersOmics 16:284–287https://doi.org/10.1089/omi.2011.0118
118.
1. Xiao Z.
2. Zou Q.
3. Liu Y.
4. Yang X.
2016Genome-wide assessment of differential translations with ribosome profiling dataNat Commun 7:11194https://doi.org/10.1038/ncomms11194

Article and author information

Author information

Yuanqiang Sun
State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, School of Life Sciences, Peking University, Beijing, China
ORCID iD: 0009-0007-6967-2234
- Equal contribution
Yuange Duan
State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, School of Life Sciences, Peking University, Beijing, China
ORCID iD: 0000-0003-2311-9859
- Equal contribution
Peixiang Gao
State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, School of Life Sciences, Peking University, Beijing, China
Chenlu Liu
State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, School of Life Sciences, Peking University, Beijing, China
ORCID iD: 0000-0002-8993-0145
Kaichun Jin
State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, School of Life Sciences, Peking University, Beijing, China
Shengqian Dou
Eye Institute of Shandong First Medical University, State Key Laboratory Cultivation Base, Shandong Provincial Key Laboratory of Ophthalmology, Qingdao, China
Wenxiong Tang
State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, School of Life Sciences, Peking University, Beijing, China
Hong Zhang
College of Ecology, Lanzhou University, Lanzhou, China
Jian Lu
State Key Laboratory of Protein and Plant Gene Research, Center for Bioinformatics, School of Life Sciences, Peking University, Beijing, China
ORCID iD: 0000-0002-4409-1667
- For correspondence: luj@pku.edu.cn

Version history

Sent for peer review: November 13, 2024
Preprint posted: November 17, 2024
Reviewed Preprint version 1: January 6, 2025

Copyright

This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.

Metrics

views: 323
downloads: 23
citations: 0

Views, downloads and citations are aggregated across all versions of this paper published by eLife.

Significance of findings

Strength of evidence

Abstract

Introduction

Results

An extended ICIER model for quantifying uORF buffering in CDS translation

Modeling simulation of uORF-mediated translation buffering.

uORF-mediated buffering of CDS translation across different parameter settings

Generating matched translatome data from two Drosophila species for comparative analysis

Translational conservation and dominance of uORFs between Drosophila species

Conservation and translation of uORFs between D. melanogaster and D. simulans.

uORFs and CDSs show correlated translation differences between D. melanogaster and D. simulans

uORFs reduce CDS translational divergence between D. melanogaster and D. simulans.

uORFs buffer interspecific translational divergence of CDSs

Numbers of genes showing different magnitudes of TE changes between uORFs and CDS at the interspecific level.

uORF buffering is influenced by its conservation, dominance, and length

uORFs buffer translational fluctuations during Drosophila development

uORFs could reduce CDS translational fluctuation during Drosophila development.

Knocking out the uORF of bcd increased bcd CDS translation in D. melanogaster

The strong buffering uORF of bcd and it knockout.

Knocking out the bcd uORF increases CDS translation and perturbs the transcriptome during D. melanogaster embryogenesis.

bcd uORF mutants show wide transcriptomic alteration during Drosophila embryogenesis

bcd uORF mutants display decreased hatching rates and starvation resistance

Knockout of the bcd uORF reduces offspring number and starvation resistance.

Conservation of uORF-mediated translational buffering in primates

uORFs function as translational buffers in primates.

Discussion

Materials and Methods

Modeling the uORF-mediated buffering effect on CDS translation

Annotation of uORFs in D. melanogaster and D. simulans

Fly materials and general raising conditions

Processing Drosophila mRNA-Seq and Ribo-Seq data

Testing the statistical significance of the difference in the interspecific TE change between a uORF and its downstream CDS

Knocking out a uORF with CRISPR-Cas9 technology in D. melanogaster

Ribosome fraction analysis by sucrose gradient fractionation and RT‒qPCR

Dual-luciferase reporter assays

mRNA-Seq in the embryos of mutant and WT flies

Measurement of embryo hatchability

Quantification of offspring number per female fly

Measurement of starvation resistance in adult flies

mRNA-Seq and Ribo-Seq data analysis in primates

Acknowledgements

Declaration of interests

Data availability statement

Supporting information

References

Article and author information

Author information

Yuanqiang Sun#

Yuange Duan#

Peixiang Gao

Chenlu Liu

Kaichun Jin

Shengqian Dou

Wenxiong Tang

Hong Zhang

Jian Lu

Version history

Copyright

Metrics

Yuanqiang Sun

Yuange Duan