MicroRNA 3′-compensatory pairing occurs through two binding modes, with affinity shaped by nucleotide identity and position

Abstract
Editor's evaluation
Introduction
Results
Discussion
Materials and methods
Data availability
References
Article and author information
Metrics

Abstract

MicroRNAs (miRNAs), in association with Argonaute (AGO) proteins, direct repression by pairing to sites within mRNAs. Compared to pairing preferences of the miRNA seed region (nucleotides 2–8), preferences of the miRNA 3′ region are poorly understood, due to the sparsity of measured affinities for the many pairing possibilities. We used RNA bind-n-seq with purified AGO2–miRNA complexes to measure relative affinities of >1000 3′-pairing architectures for each miRNA. In some cases, optimal 3′ pairing increased affinity by >500 fold. Some miRNAs had two high-affinity 3′-pairing modes—one of which included additional nucleotides bridging seed and 3′ pairing to enable high-affinity pairing to miRNA nucleotide 11. The affinity of binding and the position of optimal pairing both tracked with the occurrence of G or oligo(G/C) nucleotides within the miRNA. These and other results advance understanding of miRNA targeting, providing insight into how optimal 3′ pairing is determined for each miRNA.

Editor's evaluation

This manuscript will be of interest to readers in the field of microRNA (miRNA) biology, particularly those interested in miRNA targeting. The authors interrogated non-canonical miRNA target recognition to a depth vastly exceeding any study to date. The results revealed unexpected, sequence-specific diversity in miRNA-targeting modes, providing new insights relevant for improved target prediction.

https://doi.org/10.7554/eLife.69803.sa0

Introduction

MicroRNAs (miRNAs) are ~22-nt regulatory RNAs that are processed from hairpin precursors. Upon processing, miRNAs associate with an Argonaute (AGO) protein and base-pair to sites within mRNAs to direct the destabilization and/or translational repression of these mRNA targets (Jonas and Izaurralde, 2015; Bartel, 2018). For most sites that confer repression in mammalian cells, pairing to miRNA nucleotides 2–7, referred to as the miRNA seed, is critical for target recognition, with an additional pair to miRNA position 8 or an A across from miRNA position 1 often enhancing targeting efficacy (Lewis et al., 2005; Bartel, 2009). Such sites with a perfect 6–8-nucleotide (nt) match to the miRNA seed region (Figure 1A, left) are heuristically predictive of repression, with longer sites being more effective than shorter ones and more sites being more effective than fewer sites (Grimson et al., 2007; Agarwal et al., 2015). In addition, contextual features extrinsic to a site itself can influence targeting efficacy (Brown et al., 2005; Ameres et al., 2007; Grimson et al., 2007; Kedde et al., 2007; Nielsen et al., 2007; Saetrom et al., 2007; Tafer et al., 2008; Kedde et al., 2010; Wan et al., 2014; Agarwal et al., 2015; McGeary et al., 2019).

Figure 1

Download asset Open asset

Features of miRNA 3′-compensatory sites characterized using AGO-RBNS.

(A) Pairing of typical canonical sites (left), 3′-supplementary, canonical sites (middle), and 3′-compensatory, noncanonical sites (right). Canonical sites contain contiguous complementarity (blue) to the seed (red). Sites with shifted complementarity (i.e., the 6mer-A1 and 6mer-m8 sites) are sometimes also classified as canonical sites. 3′-supplementary sites have pairing to the miRNA 3′ region, which supplements canonical seed pairing and is reported to be most effective if it centers on miRNA nucleotides 13–16 (green and orange). This 3′ pairing can supplement 8mer sites (as shown) as well as other canonical sites (not shown). 3′-compensatory sites resemble 3′-supplementary sites, except they lack perfect pairing to the seed and thus pairing to the 3′ region helps to compensate for this imperfect seed pairing. Vertical lines represent Watson–Crick pairing. (B) The architectures of 3′ sites. Three independent features define each architecture: (1) the length of 3′ pairing (left), measured as the number of contiguous base pairs to the miRNA 3′ region; (2) the position of 3′ pairing (middle-left), defined as the 5′-most miRNA nucleotide engaged in 3′ pairing; and (3) the offset between the seed pairing and 3′ pairing (middle), which specifies the number of unpaired nucleotides separating the seed- and 3′-paired segments in the target RNA relative to that in the miRNA. Mismatches to the seed pairing (middle-right) or within the 3′ pairing (right) can elaborate on these architectures, as can bulged nucleotides (not shown). (C) A programmed RNA library for using AGO-RBNS to examine 3′ pairing of let-7a. The library contains an 8-nt region with all 18 possible single-nucleotide mismatches (purple) to the let-7a seed (red), with 25 nt of random-sequence RNA upstream of this region and 5 nt of random-sequence RNA downstream. k-mer positions are numbered with respect to the programmed 8-nt mismatched site. B represents C, G, or U; D represents A, G, or U; V represents A, C, or G; N represents A, C, G, or U. The black vertical line depicts perfect pairing at position 8, and gray vertical lines indicate Watson–Crick matches at only five of the six seed positions. (D) The top 20 8-nt k-mers identified by AGO-RBNS performed with the highest concentration of AGO2–let-7a (840 pM) and the programmed library (100 nM). k-mers were ranked by the sum of their enrichments at the five positions of the library at which they were most enriched. Left, alignment of k-mers, indicating in pink nucleotides that were not Watson–Crick matches to the miRNA. Right, heat map showing k-mer enrichment at each position of the library, with pairing shown for the top 8-nt k-mer at the position of its greatest enrichment. Black vertical lines depict perfect Watson–Crick pairing, and gray vertical lines indicate Watson–Crick matches at only five of the six seed positions.

Pairing to the miRNA 3′ region, particularly pairing that includes miRNA nucleotides 13–16, can supplement perfect seed pairing to enhance targeting efficacy beyond that of seed pairing alone, and extensive pairing to the 3′ region can compensate for imperfect seed pairing to enable consequential repression (Brennecke et al., 2005; Lewis et al., 2005; Grimson et al., 2007). These two bipartite site types are referred to as 3′-supplementary and 3′-compensatory sites, respectively (Figure 1A, middle and right). Although 3′-supplementary sites are less common than sites with only a seed match, comprising ~5% of all conserved sites observed in mammals, thousands of sites with preferentially conserved 3′-supplementary pairing are present in human 3′ UTRs (Grimson et al., 2007; Friedman et al., 2009). Conserved 3′-compensatory sites are even less common, comprising only ~1.5% of all preferentially conserved sites observed in human 3′ UTRs (Friedman et al., 2009). Nonetheless, two instances of this relatively rare site type within the 3′ UTR of lin-41 mediate the extreme morphological and developmental defects by which the let-7 miRNA was discovered in C. elegans (Pasquinelli et al., 2000; Reinhart et al., 2000; Ecsedi et al., 2015). Moreover, the use of these 3′-compensatory sites rather than canonical sites for lin-41 repression is consequential; site mutations that create perfect seed pairing while maintaining the 3′ pairing cause precocious repression of the mRNA by other members of the let-7 seed family expressed during earlier larval stages (Brancati and Großhans, 2018). These results support the notion that 3′-compensatory sites enable differential target specificity between miRNAs that share a common seed sequence but differ within their 3′ regions (Brennecke et al., 2005; Lewis et al., 2005).

Although global analyses of site conservation and efficacy provide compelling evidence that pairing to the miRNA 3′ region is also utilized in mammalian cells (Friedman et al., 2009), these approaches have limitations for evaluating which 3′-pairing architectures are most effective, due to the vast number of 3′-pairing architectures that are possible for a single miRNA sequence. The pairing architecture of a 3′-compensatory site can be described by five characteristics: (1) the length of contiguous pairing between the site and the miRNA 3′ region, (2) the position of pairing to the miRNA 3′ region, as defined by the 5′-most miRNA nucleotide involved in 3′ pairing, (3) the difference between the number of unpaired target nucleotides and the number of unpaired miRNA nucleotides bridging the seed and 3′ pairing, hereafter referred to as the ‘3′-pairing offset,’ (4) the nature of the imperfect pairing to the seed, and (5) the nature of any imperfections in the 3′ pairing (Figure 1B). When considering only sites with perfect 3′ pairing with lengths ranging from 4 to 11 base pairs (bp) at all possible 3′ positions, offsets ranging from −4 to +16 nt, and seed pairing interrupted by one of 18 possible single mismatches (or wobbles) to the 6-nt seed, there are >16,000 possible variants to the site architecture. However, for each miRNA, most of these possibilities are not present even once in all the 3′ UTRs of a transcriptome. Thus, data from multiple miRNAs must be aggregated to observe a reliable signal of either efficacy or conservation, which prevents identification of miRNA-specific pairing preferences. Indeed, even when aggregating multiple miRNA-perturbation (e.g., transfection) datasets, which enables efficacy of 3′-supplementary sites to be detected (Grimson et al., 2007), a signal for the efficacy of 3′-compensatory sites has not been reported, underscoring the challenge of using global analyses of conservation or repression efficacy to determine which architectures are more effective than others.

The observation that miRNA targeting efficacy observed in the cell is largely a function of the affinity between AGO–miRNA complexes and their sites (McGeary et al., 2019) indicates that contributions of 3′ pairing to affinities measured in vitro can provide insight into biological targeting efficacy. Early measurements showed that pairing to positions 13–16 of let-7a imparts only a twofold increase in binding affinity, which led to the view that 3′-supplemental pairing contributes only modestly to affinity (Wee et al., 2012). Further measurements revealed some differences between miRNAs, with the observation that pairing to positions 13–16 of miR-21 increases affinity by 11-fold (Salomon et al., 2015), and a striking effect of longer pairing, with the observations that 10 bp of 3′-supplementary pairing to miR-122 and 9 bp of 3′-supplementary pairing (including a terminal G:U wobble) to miR-27a increases affinity by 20- and >400-fold, respectively (Sheu-Gruttadauria et al., 2019a). Other measurements illustrate the influence of the length of the target segment bridging the seed and 3′ pairing, with binding affinity varying ~10-fold as this length is varied over a range of 1–15 nt (Sheu-Gruttadauria et al., 2019b). Taken together, these reports demonstrate the potential for miRNA 3′ pairing to enable high-affinity binding, and also illustrate that the benefit of this pairing varies, depending on the miRNA sequence and 3′-pairing architecture. Understanding how these features together modulate the benefit of 3′ pairing will be possible only after acquiring many more measurements with multiple miRNA sequences.

Imaging-based, high-throughput single-molecule biochemistry has recently been applied to acquire affinity measurements for ~23,000 sites for each of two miRNAs (let-7a and miR-21), including many sites with 3′ pairing (Becker et al., 2019). These measurements revealed that miR-21 relies more on 3′ pairing when binding to a fully complementary target than does let-7a, that homopolymeric insertions are the least disruptive to binding when inserted between nucleotides 8 and 11 within the context of fully complementary binding, and that mismatches near the miRNA 3′ terminus (after position 16) decrease binding affinity but increase target slicing. However, because the design of target libraries was based primarily on fully complementary RNA targets to which varying extents of mismatched, bulged, and deleted nucleotides were introduced, only a small minority of the possible 3′-pairing architectures were queried.

RNA bind-n-seq (RBNS) enables unbiased, high-throughput assessment of binding sites embedded within a larger random-sequence context (Lambert et al., 2014; Dominguez et al., 2018). We recently adapted RBNS for the study of miRNA targeting, and we built an analysis pipeline enabling calculation of relative equilibrium dissociation constants (K_D values) for many thousands of different RNA k-mers ≤12 nt in length (McGeary et al., 2019). Here, we further adapted the AGO-RBNS protocol to enable examination of sites >12 nt in length, thereby enabling the high-throughput investigation of bipartite sites containing near-perfect seed pairing and 4–11 additional pairs to the miRNA 3′ region. We applied this modified protocol to the systematic interrogation of the contribution of 3′ pairing for three natural miRNA sequences and four synthetic derivatives. We also performed a massively parallel reporter assay, which confirmed that key observations derived from affinities measured in vitro apply also to repression in cells.

Results

RBNS measures affinities for many 3′-compensatory sites of let-7a

AGO-RBNS begins with a series of 4–6 binding reactions, each containing an RNA library at a fixed concentration and a purified AGO–miRNA complex at a variable concentration spanning a 100-fold range (McGeary et al., 2019). Each molecule of the RNA library has a central region of random-sequence nucleotides flanked by constant sequences on each side that enable preparation of sequencing libraries. Upon reaching binding equilibrium, each reaction is passed through a nitrocellulose membrane, which retains AGO–miRNA complexes and any library molecules that are bound to the complexes. These bound library molecules are isolated and subjected to high-throughput sequencing, along with the input RNA library. Binding of an individual k-mer can be detected as enrichment in the bound compared to input sequences, and relative K_D values can be estimated simultaneously for hundreds of thousands of different k-mers by fitting a biochemical model to k-mer fractional abundances from each of the bound libraries.

As originally implemented, AGO-RBNS cannot provide reliable information on sites with more than ~5 supplementary/compensatory pairs because such sites, which involve >12 bp of total pairing (Figure 1A, middle and right), are too rare in the sequences obtained from the input RNA library to enable accurate calculation of enrichment values. To overcome this constraint for sites to let-7a, a miRNA with physiologically relevant 3′ pairing (Pasquinelli et al., 2000; Reinhart et al., 2000; Brancati and Großhans, 2018), we used a library that contained a programmed region of imperfect seed pairing to let-7a, with 25 and 5 nt of random-sequence RNA separating the programmed region from the 5′ and 3′ constant sequences, respectively (Figure 1C). In each library molecule, this programmed region of imperfect seed pairing contained a let-7a 8mer site with a mismatch at one of its six seed nucleotides, such that each library molecule had one of 18 possible single-nucleotide seed mismatches (including wobbles) in approximately equal proportion. With this programmed region of imperfect seed pairing, each library contained 3′-compensatory sites at an ~250-fold greater frequency than expected for a fully randomized RNA library.

AGO-RBNS was performed using this programmed library and purified AGO2–let-7a, choosing AGO2 from among the four human AGO paralogs because of its relatively high expression (Völler et al., 2016; Müller et al., 2019) and for comparison to previous biochemical and structural studies that use human or mouse AGO2 (Schirle and MacRae, 2012; Wee et al., 2012; Schirle et al., 2014; Schirle et al., 2015; Chandradoss et al., 2015; Salomon et al., 2015; Klum et al., 2018; Becker et al., 2019; McGeary et al., 2019; Sheu-Gruttadauria et al., 2019a; Sheu-Gruttadauria et al., 2019b). For our initial analysis, we calculated the enrichment of all 8-nt k-mers at each position between the programmed region and the 5′-constant region of the library, after first removing reads with any of the six canonical sites to let-7a. The enriched k-mers had substantial complementarity to the 3′ region of let-7a (Figure 1D). The most enriched was AUACAACC—the perfect Watson–Crick match to positions 11–18 of let-7a (Figure 1D). This 8-nt 3′ site was most strongly enriched when starting at position 15 of the library, which suggested that an internal loop with two miRNA nucleotides (9 and 10) and six target-site nucleotides (positions 9–14) separating seed pairing and 3′ pairing was optimal (Figure 1D, top). Using our nomenclature (Figure 1B), this 3′ site was classified as a position-11 site with pairing length of 8 bp and offset of +4 nt. Note that here and throughout this study we refer to contiguous complementarity as ‘pairing,’ even though constraints imposed by AGO2 might prevent physical pairing from occurring at some complementary positions. This 8-nt, position-11 site was also ≥5-fold enriched at seven other neighboring offsets (corresponding to library positions 8–15), indicating that looping out 3–10 unpaired library nucleotides opposite miRNA nucleotides 9 and 10 was tolerated, albeit to varying degrees (Figure 1D).

The second-most enriched 8-nt k-mer was UACAACCU—the perfect Watson–Crick match to let-7a positions 10–17 (Figure 1D). This 3′ site had a maximal enrichment with five, rather than six, unpaired library nucleotides spanning the seed and 3′ pairing, with the distribution of enrichments shifted by 1 nt in comparison to that of the AUACAACC site. This 1-nt shift in the enrichment distribution corresponded with the 1-nt shift in site position (from 11 to 10 of the miRNA) to maintain an offset of +4 target nucleotides. Indeed, the next 18 most enriched 8-nt k-mers represented 3′ sites with pairing positions ranging from miRNA nucleotides 9–12, with enrichment distributions that correspondingly shifted to reflect an overall optimal offset of +4 target nucleotides (Figure 1D). Each had a contiguous stretch of 6–8 perfect Watson–Crick pairs to the let-7a 3′ region, usually including the ACAACC k-mer, which suggested that perfect pairing to let-7a positions 11–16, with a +4 nt offset, was particularly effective for enhancing site affinity.

let-7a has two distinct 3′-pairing modes

For a more comprehensive examination of 3′ sites of varied lengths, positions, and offsets (Figure 1B), we enumerated 3′ sites of lengths 4–11 nt that perfectly paired to the miRNA starting at any position downstream of nucleotide 8. For each length and position of 3′ pairing (e.g., for the 8mer-m11–18), we further enumerated all pairing offsets compatible with the 3′ site residing within the 25-nt random-sequence region upstream of the programmed site, converting each library position to an offset value based on the pairing position of each 3′ site (Figure 2A). For our initial K_D estimation and analyses, we pooled the reads for the 18 possible seed-mismatch types. This pooling increased the read counts for each 3′-pairing architecture, which enabled examination of sites as long as 11 nt, which in turn enabled analysis of 1006 distinct 3′-pairing architectures.

Figure 2 with 2 supplements see all

Download asset Open asset

Pairing to nucleotide 11 and a positive offset promote high-affinity binding to let-7a.in P.

(A) Correspondence of enrichment and relative K_D value of sites with the AUACAACC k-mer (the perfect match to miRNA positions 11–18) measured at each position in the programmed library. Each of these positions (upper x-axis) corresponds to the indicated offset (lower x-axis). For example, because this k-mer paired to miRNA positions 11–18, pairing beginning at k-mer position 11 had a 0-nt offset. The k-mer enrichments and their associated colors (top) correspond to those of the top row of Figure 1D. For details on how relative K_D values were calculated for each site possibility, see McGeary et al., 2019, Figure 1C–E and Materials and methods section 11. (B) Relative K_D values of let-7a 3′-compensatory sites that had optimally positioned 3′ pairing of 4 (orange) to 11 (dark blue) bp. For each length of 3′ pairing, the optimal position is shown in terms of its complementarity to let-7a (right). For each of the 3′-compensatory sites, the relative K_D value is plotted as a function of its offset (left), as done for sites with 8 bp of optimally positioned 3′ pairing in (A). Vertical lines indicate 95% confidence intervals. The dashed horizontal line indicates the geometric mean of the 18 relative K_D values of the seed mismatch sites, each calculated from reads with <4 nt of contiguous complementarity to the miRNA 3′ region. The horizontal blue and purple lines indicate the relative K_D values of the canonical 6mer and 8mer sites, respectively. The arrows at +2 and +4 nt mark a shift in the optimal offset observed with increasing 3′-pairing length. The asterisk denotes the anomalously low binding affinity measured for 3′ sites that pair contiguously with seed pairing (i.e., sites with pairing at position 9 with an offset of 0 nt). (C) The dependency of let-7a 3′-pairing affinity on pairing length, position, and offset. Each panel shows the relative K_D values of 3′-compensatory sites with 3′ pairing of a specified length over a range of positions and offsets. Each trend line is colored according to pairing position, spanning positions 9 (light violet) to 18 (red) when possible. The arrows between 0 and 1 nt and at +3 nt mark a shift in the optimal offset as the position of 3′ pairing shifted to include nucleotide 11 of let-7a. Otherwise, these panels are as in (B, left). (D) Schematics of the two 3′-binding modes. In the zero-offset binding mode (top), miRNA nucleotide 11 is inaccessible due to occlusion by the central region of the AGO protein. In the positive-offset binding mode (bottom), the longer stretch of bridging target nucleotides enables a conformation in which nucleotide 11 is available for pairing to the target RNA. Although not intended to accurately reflect the conformation of either binding mode, these schematics illustrate how a larger offset might enable pairing to a more centrally located miRNA nucleotide. (E) Affinity profile of the let-7a 3′ region. Each cell indicates the fold change in relative K_D attributed to a 3′ site with indicated length, position, and offset of pairing. Each row within a heat map corresponds to a different miRNA nucleotide at the start of the 3′ pairing, and each column corresponds to a different miRNA nucleotide at the end of the 3′ pairing. Each heat map shows the results for a different offset. The three diagrams indicate the fold-change values and architectures for 3′ sites pairing to miRNA nucleotides 13–16 with an offset of 0 nt (left), pairing to miRNA nucleotides 13–21 with an offset of 0 nt (middle), and pairing to miRNA nucleotides 10–20 with an offset of +4 nt (right). Gray boxes indicate pairing ranges that were either too short (<4 bp) or too long (>11 bp) for relative K_D values to be reliably calculated. Black vertical lines depict perfect Watson–Crick pairing, and gray vertical lines indicate Watson–Crick matches at only five of the six seed positions.

Simultaneous estimation of the fractional abundance of these sites in each of the AGO2–let-7a-bound libraries in comparison to that of the input library enabled calculation of their relative K_D values. As illustrated for the 8-nt k-mer identified as most enriched in the previous analysis (Figure 1D, top row), variation in K_D values qualitatively tracked with that of enrichment values but quantitatively differed due to the attenuating effects of background binding and site saturation on enrichment values (McGeary et al., 2019; Figure 2A). Relative K_D values corresponding to a broad spectrum of 3′-pairing architectures spanned a several hundred–fold range, with strong agreement observed between the results of replicate experiments performed independently with different preparations of both AGO2–let-7a and the let-7a programmed library (r² = 0.96, n = 1477; Figure 2—figure supplement 1A, left). Agreement between the two replicates was maintained, albeit to a lesser degree, when read counts for each 3′-pairing architecture were not pooled over the 18 seed-mismatched sites in the programmed region (r² = 0.78, n = 23,912; Figure 2—figure supplement 1A, right). Furthermore, for shorter 3′ sites, which could be analyzed using data from a standard AGO-RBNS experiment that used a non-programmed random-sequence library (McGeary et al., 2019), the relative K_D values determined from the programmed library correlated well with those determined from a random-sequence library (r² = 0.83, Figure 2—figure supplement 1B). Despite the overall correlation, a minor systematic difference in the values for the same sites determined from the two types of libraries was observed. This distortion was presumed to be due to the absence of library RNA molecules containing no site and was corrected accordingly (Figure 2—figure supplement 1B).

To investigate the interplay of pairing position, length, and offset, we identified the optimal 3′ sites of lengths 4–11 nt and, as in Figure 2A, examined the effect of varying offset on the affinity of each of these sites (Figure 2B). Nearly all possibilities examined had values readily distinguished from the log-averaged value for seed-mismatched sites alone, with compensatory pairing to miRNA nucleotides 11–16 at optimal offsets yielding binding affinities comparable to that of the canonical 6mer (Figure 2B, left). Further inspection of longer 3′ sites underscored the conclusion that pairing to the GGUUGU segment spanning positions 11–16 of let-7a is the most consequential for 3′-compensatory pairing, as all optimal pairing positions for 3′ sites ≥6 nt in length paired to this segment. Moreover, inspection of the optimal positions for shorter sites showed that pairing to the 5′ end of this segment (containing the sequence GGUU) was more impactful than pairing to its 3′ end (Figure 2B, right). In addition, increasing the length of pairing from 4 to 11 bp led not only to increased binding affinity at almost all offsets, as might have been expected, but also to a shift in the optimal offset, with a preferred offset of +2 nt when pairing with 4 bp compared to a preferred offset of +4 nt when pairing with 9–11 bp (Figure 2B, left).

To investigate further the interplay between affinity, pairing position, and pairing offset, we plotted the relative affinities of all possible positions and offsets for let-7a 3′ pairing of lengths ranging from 4 to 9 bp (Figure 2C). These plots revealed a striking change in the affinities and preferred offsets as pairing shifted from position 12 to position 11. For 3′ sites of each length, those that began at let-7a position 12 (dark blue points of Figure 2C) had intermediate affinity and optimal offsets of 0 or +1 nt, with clearly reduced affinity as offsets increased beyond +1 nt. At these offsets of 0 or +1 nt, 3′ sites that began at position 11 (dark purple points) had affinities similar to those that began at position 12. However, in stark contrast to the sites beginning at position 12, sites beginning at 11 had strikingly increased affinity at more positive offsets, with affinity peaking at offsets of +2 or +3 nt. Sites beginning at +10 were similar, with affinity peaking at an offset of +4 nt. These results suggested that pairing to position 11 in the central region of the miRNA is less accessible than pairing to position 12, and therefore a longer loop in the target sequence is required to bridge seed pairing with 3′ pairing that includes position 11 (Figure 2D). Nonetheless, when the increased offset enables pairing to position 11, substantially greater affinity can be achieved. We call this newly defined binding mode, which includes pairing to miRNA position 11 and greatly benefits from short positive offsets, the ‘positive-offset’ binding mode. Accordingly, the more conventional binding mode, which lacks pairing to position 11 and does not benefit from offsets greater than +1 nt, we call the ‘zero-offset’ binding mode.

Some of the weakest relative affinities were observed for extended 3′-pairing possibilities that began at position 9 with an offset of 0 nt (Figure 2B and C, asterisks). These weak values were attributable to AGO2-catalzyed slicing of molecules with extensive contiguous pairing, which would have depleted these molecules from our bound library. Supporting this idea, analogous sites with offsets of either −1 or +1 nt, which were expected to disrupt slicing due to single-nucleotide bulges in either the miRNA or the site, respectively, did not have aberrantly low relative affinities. This idea was also consistent with reports that AGO2 can slice sites that have a seed mismatch but are otherwise extensively paired to the guide RNA (Wee et al., 2012; Chen et al., 2017; Becker et al., 2019).

We next used heat maps to visualize the interplay between 3′-site position and pairing length at different offsets (Figure 2E). Within each heat map, a difference between adjacent cells corresponded to the difference in K_D fold change caused by the addition or removal of a pair at either the 5′ end (adjacent rows) or the 3′ end (adjacent columns) of the 3′ site, while maintaining the same offset. For example, in the heatmaps corresponding to offsets of +2 to +12 nt, the prominent contrast between the row corresponding to pairing beginning with nucleotide 11 and the row corresponding to pairing beginning with nucleotide 12 illustrated the strong benefit of pairing to G11 of let-7a (Figure 2E). At the optimal offset length of +4 nt, pairing to let-7a positions 10–20 conferred an ~380-fold increase in affinity over the average seed-mismatched site alone (Figure 2E), leading to an overall binding affinity rivaling that of the canonical 8mer (Figure 2B). The binding affinity of this site and all other sites decreased nearly uniformly as offset values increased beyond +4 nt. Binding affinity decreases were less uniform as offset values decreased to 0 and –2 nt, which reflected a switch from the positive-offset binding mode to the zero-offset binding mode, with a concomitant reduction in the benefit of pairing to nucleotide 11.

Previous low-throughput measurements of the benefit of 3′ pairing for let-7a examined the influence of pairing to miRNA positions 13–16 at an offset of 0 nt and found that this pairing confers a 1.6–2-fold increase in binding affinity (Wee et al., 2012; Salomon et al., 2015). Likewise, our measurements for this 4-nt 3′ site indicated that it conferred a 1.5-fold increase in affinity (Figure 2E). Furthermore, maintaining the offset of 0 nt and the pairing position of 13 and extending pairing to the very 3′ end of let-7a increased the binding affinity to only 3.1-fold (Figure 2E). These results highlight the importance of both a positive offset and pairing to position 11 of let-7a—two features that would have been difficult to identify without comprehensive investigation of the 3′-pairing preferences of this miRNA. Indeed, the importance of these two features is not revealed in an analysis of a dataset that reports the affinities of ~23,000 different sites to let-7a, because these ~23,000 sites were not designed to analyze the combined effects of varying both pairing position and pairing offset (Becker et al., 2019; Figure 2—figure supplement 2).

Pairing preferences of let-7a correspond with repression efficacy in cells

We next tested whether features associated with higher affinity also conferred greater repression in cells. Our analysis centered on 15 different 3′-compensatory sites, designed to test the consequences of changing the position, length, and/or offset of 3′ pairing (Figure 3A). These sites were each placed in the 3′ UTR of a reporter, at either an upstream position, a downstream position, or at both the upstream and the downstream positions (Figure 3A). For comparison, we analyzed five sites with only seed pairing and seven no-site sequences that had no more than five contiguous pairs to let-7 (Figure 3A). We also analyzed the dual 3′-compensatory sites that mediate lin-41 repression in C. elegans (Figure 3A; Reinhart et al., 2000). To account for the effects of local sequence context, sites were each placed within 14 different sequence contexts, one of which was the native sequence context of sites in the 3′ UTR of C. elegans lin-41 (Figure 3A).

Figure 3 with 2 supplements see all

Download asset Open asset

Interplay between the effects of length, position, and offset of 3′ pairing, as measured for let-7a by comparing efficacy of repression in cells.

(A) Design of reporter mRNAs. For the diagrams of 3′-compensatory sites and seed-matched sites, a large colored circle indicates a Watson–Crick match to let-7a, a smaller plum circle indicates a G wobble across from U at position 6 of let-7a, and a small gray circle indicates lack of complementarity. The diagram for the *lin-41* sites is as in Figure 1A. Gray nucleotides and small gray circles indicate positions allowed to vary in the 14 different contexts. Each of the nonzero offset possibilities (i.e. the +1_A, +1_U, +4_A, and +4_U) was formed by inserting the indicated nucleotides between the two nucleotides opposite those of let-7a positions 9 and 10. (B) Repression attributed to each site type after co-transfecting let-7a into F9 cells. F9 cells were chosen for this experiment because they endogenously express very little let-7 (Mayr et al., 2007). Changes observed upon let-7a co-transfection are plotted for reporters with the single-site configurations (top) and for those with the dual-site configuration (bottom). Mean values are represented by horizontal black lines. For the single-site analysis, changes associated with upstream-only and downstream-only site configurations were plotted separately to yield eight values spanning the four replicate experiments. Changes were normalized to the mean no-site value. (C) The relationship between repression observed in cells and relative K_D values derived from AGO-RBNS. The line represents a fitted model relating binding affinity to predicted repression (r², coefficient of determination of the model fit).

Plasmids designed to express these 952 different reporter variants were co-transfected into F9 cells with either a let-7a duplex, a control miR-1 duplex, or no miRNA duplex (the mock co-transfection), and accumulation of each variant in the presence of each co-transfected miRNA was monitored by high-throughput sequencing and compared to accumulation observed in the mock co-transfection (Figure 3B and Figure 3—figure supplement 1). Most of the conclusions regarding binding affinities inferred from AGO-RBNS also held with respect to repression in cells, including the marginal benefit of pairing to only nucleotides 13–16 of let-7a, the greater benefit of pairing to let-7a nucleotides 11–19 compared to nucleotides 13–21, the strong benefit of a positive offset when pairing to nucleotides 11–19 of let-7a but not when pairing to nucleotides 13–16 or 13–21, and the ability of extended 3′-compensatory pairing at a favorable position and offset to impart activity matching that of the canonical 8mer site (Figure 3B).

Among the sites with pairing to nucleotides 11–19 of let-7a, the most effective was the one with an offset of +4 nt formed by insertion of four consecutive A nucleotides within the segment of the target that linked seed and 3′ pairing (Figure 3B, +4_A). This site was more effective than the two possibilities with an offset of +1 formed by insertion of either a single A or U nucleotide (+1_A and +1_U, respectively; p < 0.02, Tukey’s range test), which were in turn more effective than the site that lacked an insertion (0) and thus had a 0-nt offset (Figure 3B, p < 10⁻⁴, Tukey’s range test). Although this rank ordering was consistent with that predicted from relative affinities, some quantitative disagreement with the relative affinities was observed, with the three sites with positive offsets performing substantially better than expected from their relative affinities (Figure 2C). Another notable divergence from the binding results was the poorer-than-expected efficacy of the site with a +4 nt offset formed by the insertion four consecutive U nucleotides (+4_U). Efficacy of this site was less than half of that of the other sites with positive offsets, and its greater efficacy over the site with a 0-nt offset was statistically significant only for the dual-site configuration (p < 10⁻⁴, Tukey’s range test). For the 3′-compensatory site with 9 nucleotides of complementarity starting at position 13, the efficacy of the +4_U variant was also less than that of the +4_A variant (Figure 3B, p < 0.01, Tukey’s range test). These results indicated that the primary nucleotide identity of the segment that links seed and 3′ pairing can modulate repression. One way this modulation might occur is through the action of RNA-binding proteins, many of which prefer short oligo(U) tracts (Dominguez et al., 2018; Van Nostrand et al., 2020), as binding of a protein to this segment would be expected to interfere with 3′ pairing.

Compared to the single-site configurations, the dual-site configuration yielded greater repression, with an average increase of ~2.8-fold (Figure 3B), implying some cooperativity in the action of the two sites (Grimson et al., 2007; Saetrom et al., 2007; Broderick et al., 2011; Briskin et al., 2020). The most effective synthetic 3′-compensatory sites tested (those with complementarity to nucleotides 11–19 and with a +1_A-nt or +4_A-nt offset) were more repressive on average than the two sites found within lin-41 mRNA of C. elegans (p < 10⁻³, 1.2-fold more repressive). The lin-41 sites, as well as the other effective sites, were all more effective when examined in the lin-41 local sequence context than they were in most of the other contexts (Figure 3—figure supplement 2).

Overall, we found that affinity observed in vitro corresponded well to repression observed in cells (Figure 3C, r² = 0.71). This correspondence for 3′ sites resembled that observed for seed-matched sites (McGeary et al., 2019) and provided counterevidence to a recent proposal that 3′ pairing might be preferentially destabilized in cells (Bibel et al., 2022). When framed in the context of the 3′-compensatory sites acting in the C. elegans lin-41 mRNA, our results indicate that the developing animal exploits 3′-compensatory pairing at a favorable position (position 11) of let-7, which is enhanced through the positive-offset binding mode to confer robust repression of the lin-41 mRNA, with repression further enhanced by favorable site context and some inter-site cooperativity.

Different miRNAs have distinct 3′-pairing preferences

The optimal 3′-pairing architecture for let-7a differed from that previously elucidated for miRNAs more generally (Grimson et al., 2007). When pooling repression and conservation data for 11 miRNAs, pairing to miRNA nucleotides 13–16, with an offset of 0 nt appears to be most consequential (Figure 1A; Grimson et al., 2007). Because the previous analysis represents the average of trends derived from multiple miRNAs, a diversity of miRNA-specific 3′-pairing preferences might explain this disagreement. We therefore measured the 3′-pairing profiles of two other well-studied miRNAs, miR-1 and miR-155, for comparison with the let-7a profile.

Stabilizing 3′ pairing was observed for both miR-1 (Figure 4A) and miR-155 (Figure 4B), with binding affinity increasing with the length of pairing, as observed for let-7a (Figure 2). However, the magnitude of increased binding affinity differed from that of let-7a and that of each other: the affinity of 3′ pairing to miR-1 was more modest, with 3′-compensatory sites seldomly reaching the affinity of its canonical 6mer site (Figure 4A), whereas for miR-155, they often reached the affinity of its canonical 8mer site, and in some cases increased affinity by >500-fold (Figure 4B). The positions of the best sites at each length also differed from those of let-7a. For miR-1, optimal 4-nt sites paired to miRNA nucleotides 12–15, and when considering optimal sites of increasing lengths, pairing extended continuously, primarily toward the 3′ end of the miRNA and never reaching miRNA nucleotide 10 (Figure 4A, right). By contrast, for miR-155, optimal 4-nt sites paired to miRNA nucleotides 13–16, and for optimal sites of increasing lengths, pairing sometimes shifted discontinuously and never included miRNA nucleotide 12 (Figure 4B, right).

Figure 4 with 1 supplement see all

Download asset Open asset

Relative affinity measurements of 3′-compensatory sites of miR-1 and miR-155.

(A) Relative K_D values of miR-1 3′-compensatory sites that had optimally positioned 3′ pairing of 4–11 bp. Otherwise, this panel is as in Figure 2B. (B) Relative K_D values of miR-155 3′-compensatory sites that had optimally positioned 3′ pairing of 4–11 bp. Otherwise, this panel is as in Figure 2B. (**C and D**) Affinity profiles of the 3′ regions of miR-1 (C) and miR-155 (D). Otherwise, these panels are as in Figure 2E.

Analysis of each of the optimal 3′ sites of miR-1 and miR-155 along the length of the random region indicated that, unlike sites for let-7a, those for neither of these two miRNAs underwent a significant shift in the preferred offset (Figure 4A and B, left). Nevertheless, the longer optimal sites of miR-1 extended to position 11, and their range of near-optimal offsets broadened to include values from 0 to +5 nt, consistent with contributions from both binding modes. The offset preferences of miR-155 also broadened with increased pairing. However, instead of coinciding with pairing at position 11, these broadened preferences coincided with pairing to the G19-G20-G21-G22 stretch near the 3′ end of miR-155.

In summary, the most optimal 3′ sites each paired to at least two nucleotides of the miRNA segment spanning positions 13–16, which was previously identified as most consequential for 3′ pairing, but frequently did not pair to the entire segment. Shorter optimal sites consistently preferred pairing to G nucleotides adjacent to miRNA positions 13–16. For example, shorter optimal sites to let-7a paired to the G11-G12 sequence element 5′ of this segment rather than to G15-U16 (Figure 2B, right), the optimal 4-nt site to miR-1 paired to G12 rather than to U16 (Figure 4A, right), and intermediate-length optimal sites to miR-155 paired to G19-G20-G21 rather than to G13-U14 (Figure 4B, right). These trends were also observed when examining many combinations of positions, lengths, and offsets for miR-1 and miR-155 (Figure 4—figure supplement 1). In aggregate, these results supported the report of an intrinsic preference for pairing to miRNA nucleotides 13–16 (Grimson et al., 2007) but also indicated that the miRNA sequence imparts additional preferences, resulting in unanticipated differences between the optimal sites of individual miRNAs. These sequence-specific preferences tended to favor pairing to G residues of the miRNA, which was presumably explained by the greater stability of G:C pairing over A:U pairing, although the presence of only a single C nucleotide prevented investigation of whether pairing to G was preferred over pairing to C. We also observed differences between miRNAs in the strength of 3′ pairing. Compared to 3′-site affinities observed for let-7a, affinities were substantially lower for miR-1 and substantially higher for miR-155 (median increase in affinity with 11 bp of 3′ pairing of 36-fold, 5.8-fold, and 133-fold for let-7a, miR-1 and miR-155, respectively). Thus, our results indicated that association of the guide RNA with the AGO protein does not fully standardize either the architecture of optimal 3′ pairing or the magnitude of its benefit.

Pairing and offset coefficients describe unique 3′-pairing profiles for each miRNA

To summarize the results for miR-1 and miR-155, we generated heat maps representing the binding affinity at all possible pairing positions for all pairing lengths of 4–11 bp, as a function of pairing offset (Figure 4C and D), as with let-7a (Figure 2E). The similarities observed between heat maps for the same miRNA at different offsets indicated that each change in offset altered the binding affinity of all 3′-pairing possibilities in a consistent manner, which in turn indicated that for each of the three miRNAs, the effect of pairing offset was largely independent of the effect of guide–target complementarity (Figures 2E and 4C, D). This overall independence was observed for let-7a, despite its two binding modes, because the contribution of the positive-offset binding mode, which had the higher affinities, dominated over that of the other binding mode.

To test this independence, we examined how well the affinities could be explained as the product of two coefficients, one representing the contribution of the pairing range, which was defined by pairing position and length (represented by the location of a cell within the heat maps of Figures 2E and 4C, D), and the other representing the contribution of the pairing offset. Our model fit the data well (r² = 0.92, 0.86, and 0.96 for let-7a, miR-1, and miR-155, respectively; Figure 5—figure supplement 1), and yielded a set of pairing and offset coefficients for each miRNA. Each pairing coefficient represented the maximum beneficial ∆G associated with complementarity to the corresponding range of miRNA nucleotides, and each offset coefficient represented the fraction of the maximum beneficial ∆G observed at each pairing offset (Figure 5A–C). For each miRNA, the pairing coefficients corresponded well with the affinities observed at the preferred offset (Figure 5A–C, comparison of right-most heat maps; r² = 0.98, 0.97, and 0.96, respectively). Moreover, these coefficients, which distilled the pairing preferences indicated by the 934, 1061, and 1180 relative K_D values measured for let-7a, miR-1, and miR-155, respectively, quantitatively captured the qualitative observations made earlier from analysis of subsets of the data.

Figure 5 with 8 supplements see all

Download asset Open asset

Distinct pairing-range, offset, and seed-mismatch preferences of different miRNAs.

(**A–C**) Model-based analyses of 3′-pairing preferences of let-7a (A), miR-1 (B), and miR-155 (C). For each miRNA, 3′-pairing affinities are described by a set of pairing coefficients (left) and offset coefficients (middle-left; dashed lines, 95% confidence interval), which when multiplied together (middle-right) approximated measured K_D fold-change values (right; let-7a values replotted from Figure 2E). The parameters were obtained by maximum-likelihood estimation with a nonlinear energy model. For both miR-1 (B) and miR-155 (C), the two pairing diagrams indicate the fold-change value and architecture for a 3′ site pairing to miRNA nucleotides 13–16 (top) in comparison to the fold-change value and architecture of the 3′ site with the greatest measured affinity (bottom) at their shared optimal offset of +1 nt. Pairing coefficients, model predictions, and K_D fold-change values of miR-1 were not calculated for pairing to miRNA positions 15–18 and 19–22 because these two segments were identical (gray boxes). (D) Predicted ∆G values of the 3′ sites with pairing ranges in (**A–C**). (E) The relationship between the model-derived pairing coefficients (**A–C**) and the predicted ∆G values (D). Points are colored according to pairing length, as in Figure 2B. To control for the trivial effect of increasing pairing length, pairing coefficients were divided by the geometric mean of all coefficients with the same length, and ∆G values of each length were normalized to the mean ∆G value of pairings with the same length. The gray region represents the 95% confidence interval of the relationship when fitting a linear model to the data (r², coefficient of determination), and the dashed line represents the predicted thermodynamic relationship given by K = e^−∆G/RT. (F) Distinct effects of seed mismatches on 3′-pairing affinities of let-7a, miR-1, and miR-155. For each miRNA, seed-mismatch coefficients were derived by maximum-likelihood estimation, fitting a nonlinear model to the K_D fold-change values observed when examining 3′-site enrichment separately for each of the 18 seed mismatches. The error bars indicate 95% confidence intervals. Wobble pairing in which the G was in either the miRNA or the target is indicated in blue and red, respectively. (G) Relationship between affinity of 3′-compensatory pairing and that of seed-site binding. For each seed mismatch, the coefficient from (F) is plotted as a function of the relative K_D value of that mismatch, as measured using results from the programmed libraries for let-7a (black), miR-1 (blue), and miR-155 (red). The dashed line shows the linear least-squares fit to the data, with the gray interval indicating the 95% confidence interval. (H) Relationship between affinity of 3′-supplementary pairing and that of seed-site binding. For each of the six seed-matched site types (Figure 1A, left) and for each of the six miRNAs (key), the relative affinity of the top quartile of all 4- and 5-nt 3′ sites with their preferred offsets is plotted as a function of the relative affinity of the seed-matched site. Relative affinities were measured from analysis of previous AGO-RBNS that used a random-sequence library (McGeary et al., 2019; Figure 5—figure supplement 8).

Because the pairing coefficients represented the thermodynamic benefit of each pairing possibility, we examined how well each set of pairing coefficients was explained by the nearest-neighbor model that predicts the stability of RNA hybridization in solution. To do so, we calculated the predicted ∆G value for each 3′ site (Figure 5D) and adjusted each value by subtracting the mean value for that length of pairing, which was done to remove the trivial effect of increasing pairing length (Figure 5E). When comparing these length-adjusted values with analogously adjusted pairing coefficients, we observed a strong relationship for both let-7a and miR-155, and a much weaker relationship for miR-1. Nevertheless, even when focusing on results for let-7a and miR-155, the apparent effect size was less than that expected by the relationship ∆G = −RT lnK (Figure 5E, dashed lines). Thus, as observed with the miRNA seed region (Salomon et al., 2015; McGeary et al., 2019), compared to RNA free in solution, association with AGO reduces the differences in binding energy observed when hybridizing to different miRNA 3′-end sequences.

This reduction in magnitude also applied to the overall contribution of 3′ pairing (Figure 5—figure supplement 2). For instance, although the >200-fold differences in binding affinity imparted by the top 11-nt 3′ sites of let-7a and miR-155 might seem large, the ∆G predicted for each of these sites was −14.8 kcal/mol and −20.1 kcal/mol, which corresponded to respective fold differences in affinity of 2.7 × 10¹⁰ and 1.5 × 10¹⁴. Presumably, the benefit of pairing to 3′ sites was mostly offset by the cost of disrupting favorable interactions between unpaired 3′ regions and AGO, as proposed in the context of fully paired sites (Tomari and Zamore, 2005). The magnitude of this inferred cost appeared specific to each miRNA, implying that AGO might have some sequence preferences when interacting with unpaired miRNA 3′ regions. For example, pairing to either nucleotides 9–19 of let-7a or nucleotides 11–21 of miR-1 was predicted to occur with equivalent ∆G values of −13.5 kcal/mol, yet the model-determined contributions of these sites were 160- and 14-fold, respectively (Figure 5—figure supplement 2A, left and middle).

Separating the comparison between K_D fold-change and predicted ∆G based on whether the contiguous range of pairing included the G11, G12, and G20 of let-7a, miR-1, and miR-155, respectively, revealed a cooperative benefit of pairing to these nucleotides (Figure 5—figure supplement 2B, C), such that their inclusion within the 3′ pairing enabled the other paired nucleotides to contribute more to the interaction. We also note that using the measured affinities rather than pairing coefficients did not increase agreement with ∆G (Figure 5—figure supplement 2D, E), suggesting that the use of the pairing coefficients did not lead to loss of information contained within the data from which they were generated.

We next used data obtained previously from fully randomized libraries (McGeary et al., 2019) to extend our analyses to miR-124, lsy-6, and miR-7 (Figure 5—figure supplement 3A–F), for 3′ sites as long as eight nt. This upper-bound of 8 nt was selected because pairing and offset coefficients calculated for 3′ sites of let-7a, miR-1, and miR-155 using results from fully randomized libraries agreed with those calculated using results from the respective programmed libraries, provided that the sites did not exceed 8 nt (Figure 5—figure supplement 3G, H). Like let-7a, miR-124 had both preferred pairing to position 11 and an optimal pairing offset of >2 nt (Figure 5—figure supplement 3D). To look for evidence of multiple binding modes, we repeated the analyses of both Figure 2B (for pairing lengths of 4–8 bp) and Figure 2C (for pairing lengths of 4 and 5 bp), using the prior AGO-RBNS data for miR-124, lsy-6, and miR-7 (Figure 5—figure supplement 4). For comparison, we also repeated these analyses using the prior AGO-RBNS data for let-7a, for which we had evidence of two binding modes from the programmed-library AGO-RBNS data. For each of the four miRNAs, we found evidence of the two binding modes. Both let-7a and miR-124 had the previously observed pattern, in which the positive-offset binding mode had binding affinity greater than that of the zero-offset binding mode (Figure 5—figure supplement 4A–D). However, lsy-6 and miR-7 had a different pattern, in which the binding affinities of both modes were similar (Figure 5—figure supplement 4E–H). Perhaps pairing to the G11-G12 dinucleotide found in both the let-7a and miR-124 sequences enabled the positive-offset binding mode to dominate over the zero-offset binding mode, whereas pairing to the single G11 found in lsy-6 and miR-7 added to site affinity but did not enable the positive-offset binding mode to dominate.

The analyses of miR-124 and lsy-6, which each had multiple C nucleotides in their 3′ region, allowed us to return to the question of whether pairing to miRNA G nucleotides might be favored over pairing to C nucleotides. Pairing to C15 of lsy-6 substantially added to binding affinity. For example, the 4.2-fold greater affinity of the position 12–15 site over the position 11–14 site indicated that pairing to C15 was favored over pairing to G11, and extending pairing from positions 11–14 to 11–15 increased affinity 8.2-fold (Figure 5—figure supplement 3E). Pairing to C13 was also somewhat preferred, as illustrated by the 1.8-fold greater affinity of the 13–17 site over the 14–18 site, and the 3.2-fold benefit of extending pairing from positions 14–18 to 13–18. However, pairing to C19-C20 of miR-124 did not seem to have the same impact as pairing to G19-G20 of miR-155, as illustrated by the negligible (0.9-fold) benefit of extending the miR-124 pairing from positions 13–18 to 13–20, compared to the 14-fold benefit for miR-155. These results supported the idea that pairing to a G in the miRNA 3′ region is generally favored over pairing to a C, although pairing to a C located within positions 13–16 of the 3′ region can be impactful.

The type of seed mismatch affects the affinity of 3′ pairing

To examine the influence of seed-mismatch position and identity, we analyzed the full set of 16,235, 18,076, and 19,666 relative K_D values of let-7a, miR-1, and miR-155, no longer combining read counts for the 18 possible seed-mismatch sites in the programmed library prior to K_D estimation. For each pairing, offset, and seed-mismatch possibility, the relative K_D value of the 3′-compensatory site was divided by that of its seed-mismatch site to generate a fold-change value representing the contribution of the 3′ site to affinity (Figure 5—figure supplements 5–7). These values revealed a striking effect of seed-mismatch identity on the benefit of 3′ pairing. This effect was of greater magnitude for more favorable 3′ sites, causing affinities to vary >10-fold for the most optimal sites to miR-155. To further study this effect, we expanded our model to include a seed-mismatch coefficient, such that each log₁₀(K_D fold change) value was described as the product of the pairing, offset, and seed-mismatch coefficients corresponding to its 3′-pairing architecture (Figure 5—figure supplements 5–7).

The affinity of seed-mismatch sites lacking 3′ pairing had little relationship with the influence of the mismatch on 3′-pairing affinity (Figure 5G). Likewise, examination of data from the six random-library AGO-RBNS experiments found no relationship between the affinities of canonical sites lacking 3′ pairing and the relative influence of each canonical site on the benefit of supplemental pairing (Figure 5H and Figure 5—figure supplement 8). Furthermore, the average effect of canonical-site type on 3′ binding affinity was small, with only six out of the 36 miRNA–site combinations having a >0.1 effect on log₁₀(K_D fold change), corresponding to an ~25% change in binding affinity (Figure 5H). Together, these results indicate that for 3′-supplementary pairing, the benefit of the 3′ pairing is largely the same between sites, but that for 3′-compensatory pairing, the potential benefit of 3′ pairing differs depending on the identity of the seed mismatch. This might be due to a differential ability of these mismatches to elicit a conformational change in AGO allowing pairing to the 3′ end (Schirle et al., 2014; Sheu-Gruttadauria et al., 2019b). Another potential contribution might stem from variation in elemental rate constants of seed-mismatch sites of similar affinity, whereby some sites have dwell times that are too short to establish pairing to the miRNA 3′ region.

When comparing the effects for guide–target nucleotide possibilities, strong trends did not emerge within miRNAs (e.g., when comparing the effects of mismatches to the G at position 2 with those of the mismatches to the G at position 4 of let-7a), or between miRNAs (e.g. when comparing the effects of mismatches to the G at position 3 of miR-1 with those to the G at position 6 of miR-155) (Figure 5F). However, in cases in which the same nucleotide occurred at the same position for two different miRNAs, some correspondence was observed (positions 2 and 6 of let-7a and miR-1, position 3 of let-7a and miR-155, position 4 of miR-1 and miR-155) (Figure 5F). Notably, the miRNA–target U:G mismatch at position 6, which was the most favored mismatch for both let-7 and miR-1, occurs within one of the two compensatory sites within the 3′ UTR of C. elegans lin-41 (Figure 3A; Pasquinelli et al., 2000; Reinhart et al., 2000), further helping to explain the activity of this site in C. elegans development.

Pairing preferences of miRNA 3′-end nucleotides are independent of the seed sequence and maintained at adjacent positions

Having found that the 3′-pairing affinities of each of the three miRNAs were largely a function of the miRNA pairing, offset, and seed-mismatch preferences, we investigated the extent to which different regions of each miRNA contributed to these preferences. To do so, we performed AGO-RBNS with synthetic miRNA variants. Two of these variants were chimeric miRNAs in which nucleotides 1–8 of let-7a and miR-155 were swapped (Figure 6A). The other two were let-7a variants in which the 3′ sequence was shifted by one nucleotide in either direction (Figure 6B). Results for these variants showed that both the pairing and offset preferences tracked with the 3′ sequence (Figure 6—figure supplement 1A–G), with the quantitative contribution of each 3′ nucleotide to pairing largely maintained when shifted to an adjacent position (Figure 6—figure supplement 2A–E). By contrast, the seed-mismatch preferences tracked with the seed sequence, with little pairwise difference in these preferences observed for miRNA variants sharing nucleotides 1–8 but possessing distinct 3′ sequences (Figure 6—figure supplement 1H, I and Figure 6—figure supplement 2F).

Figure 6 with 2 supplements see all

Download asset Open asset

Variant miRNAs designed to query the contributions of the seed and 3′ regions to binding, and the positional dependence of pairing preferences of the 3′ region.

(A) Sequences of native let-7a, native miR-155, a chimeric miRNA containing the seed region of let-7a appended to nucleotides 9–23 of miR-155 (let-7a–miR-155), and a chimeric miRNA containing the seed region of miR-155 appended to nucleotides 9–21 of let-7a (miR-155–let-7a). (B) Sequences of let-7a(−1), which has a 3′ region permuted one nucleotide toward the 5′ end, native let-7a, and let-7a(+1), which has a 3′ region permuted one nucleotide toward the 3′ end. The 3′ sequence shared between all three miRNAs is shaded in blue, and the A and U nucleotides that were rearranged to generate the permuted variants are in blue and purple, respectively.

Effects of mismatches within 3′ sites are consistent across miRNAs but explained poorly by the nearest-neighbor model

Having systematically analyzed the effects of seed-mismatch identity and of the length, position, and offset of perfect 3′ pairing, we next sought to measure the effects of any imperfections—that is, mismatches, wobbles, or bulged nucleotides—within this 3′ pairing. Accordingly, we measured the relative affinities of variants of each site considered thus far, looking at each possible variant that had one of the eight possible single-nucleotide imperfections at one position within the site. These eight imperfections considered at each position of interest included three possible mismatched nucleotides (including G:U wobbles), four possible single-nucleotide bulges (occurring opposite the linkage of two miRNA positions and assigned to the more 3′ miRNA position), and one single-nucleotide deletion (i.e., a bulged nucleotide in the miRNA). Consideration of these variants together with the original sites with perfect contiguous pairing resulted in the measurement of K_D values for 38,108 let-7a sites, 44,190 miR-1 sites, and 52,166 miR-155 sites. Analysis of these variants in the context of the best sites at each length (Figure 7A–C and Figure 7—figure supplement 1) revealed no imperfections that increased 3′-site affinity, which indicated that there were no positions at which the altered helical geometry of a mismatch was favored over Watson–Crick pairing. When comparing effects of internal mismatches to those of mismatches occurring at the end of the pairing, no striking differences were observed. Nonetheless, effects at some positions were more striking than others, with larger effects observed for mismatches involving any of nucleotides 11–15 of let-7a (Figure 7A), 12–15 of miR-1 (Figure 7B), and 15–22 of miR-155 (Figure 7C), which concurred with the importance of extending pairing to G11-G12, G12, and G19-G20-G21-G22 of the respective miRNAs.

Figure 7 with 2 supplements see all

Download asset Open asset

The impact of mismatched, bulged, and deleted target nucleotides on 3′-compensatory pairing.

(A) The effect of mismatched, bulged, and deleted target nucleotides on 3′-compensatory pairing to let-7a. At the top is a schematic depicting the position of highest-affinity 3′-pairing for 3′ sites of lengths 8–11 nt, redrawn from Figure 2B. Below, at the left are heat maps corresponding to each of the pairing positions shown above, indicating the affinities with each of the four possible nucleotides at each position of the site. Cells corresponding to the Watson–Crick match are outlined in blue. Cells for affinities of mismatches that could not be calculated due to sequence similarity to another site type (e.g., the mismatched U across from position 14, which was indistinguishable from a 6mer-m8 seed site) are in gray. To the right are heat maps that correspond to the same pairing ranges but indicate the effects of a bulged or a deleted (del.) 3′-target nucleotide. A bulged nucleotide at position n corresponded to an extra target nucleotide inserted between the nucleotides pairing to miRNA positions n – 1 and n. (**B and C**) The effects of mismatched, bulged, and deleted target nucleotides on 3′-compensatory pairing to miR-1 (B) and miR-155 (C). Otherwise, these panels are as in (A). (D) Profiles of 3′-pairing mismatch tolerances. Each bar represents the ∆∆G value when averaging over the three possible mismatches at that position, for let-7a (top), miR-1 (middle), and miR-155 (bottom). Each of the mismatch ∆∆G values was an average of the values observed in the context of each 10-nt 3′ site that included the position. The color indicates whether the miRNA nucleotide was an A (blue), U (green), C (red), or G (yellow). (E) Profiles of tolerances to bulged and deleted nucleotides. Each colored bar represents the ∆∆G value when deleting the target nucleotide complementary to the miRNA at that position, and each dark gray bar represents the ∆∆G value when averaging all four bulged nucleotide possibilities occurring at that inter-nucleotide position, for let-7a (top), miR-1 (middle), and miR-155 (bottom). Each of the ∆∆G values represents the average of the values observed in the context of each 10-nt 3′ site that included the position. The color of each bar corresponding to a deletion indicates whether the resulting bulged miRNA nucleotide was an A (blue), U (green), C (red), or G (yellow). (F) The tolerance of bulged nucleotides near the ends of 3′ sites. Plotted are ratios of K_D fold-changes comparing a site that has a bulged nucleotide between the penultimate and terminal base pairs with a site that does not have the terminal base pair (in which case, the bulged nucleotide in the former pairing architecture becomes a terminal mismatch). The box plots indicate the minimum, lower-quartile, median, upper-quartile, and maximum values. For each of the three miRNAs, comparisons are made for bulges occurring at the 5′ end of the 3′ pairing (5p), and at the 3′ end of the 3′ pairing (3p). The vertical gray line indicates a K_D fold-change ratio of 1.0. At the top is an example of a 3p comparison. (G) Comparison of the measured mismatch ∆∆G values in 3′ sites with values predicted by nearest-neighbor rules. Left, comparison of the average measured ∆∆G value with the average predicted value for each of the 12 possible miRNA–target mismatch combinations. Right, comparison of measured and predicted average fractional reduction in ∆G attributed to each mismatch. The fractional reduction was given by (∆G_WC − ∆G_mm)/∆G_WC, where ∆G_WC corresponds to the ∆G of the site with full Watson–Crick pairing, and ∆G_mm corresponds to the ∆G of a site containing the mismatch. These average values were calculated using K_D fold-change values determined for 10-nt sites, first averaging results for the same position over all 10-nt sites that included the position, then averaging results for that mismatch across all positions of the miRNA that had that mismatch, and then averaging the results across all three miRNAs. Colors and symbols indicate miRNA and target nucleotide identities, respectively (key). (H) Comparison of the measured seed mismatch ∆∆G values with values predicted by nearest-neighbor rules. For each mismatch type, both the measured and predicted ∆∆G values were the average over all occurrences within positions 2–7 for let-7a, miR-1, miR-155, miR-124, lsy-6, and miR-7, using relative K_D values from analyses of random-sequence AGO-RBNS results. Otherwise, this panel is as in (G).

To investigate mismatch tolerance across the range of miRNA 3′-end positions, we calculated the geometric mean of the K_D fold change for a mismatch at each position for all three miRNAs, averaging both over the three mismatches at each position and over each of the 10-nt sites that contained the position (Figure 7D). As expected, reduced binding affinity tracked with the importance of the positions for 3′ pairing, with the greatest effects observed at G11 and G12 of let-7a, the G12–G15 of miR-1, and G13 and G15–G21 of miR-155 (Figure 7D). The greater importance of pairing to G13 compared to pairing to C12 of miR-155 further supported the idea that pairing to G had a greater impact over pairing to C in the miRNA 3′ region. Nonetheless, extending the analyses of mismatches, wobbles, and bulges to the random-sequence RBNS datasets previously acquired for six miRNAs (Figure 7—figure supplement 2) indicated that disrupting pairing to either C13 or C15 of the C13-G14-C15 trinucleotide of lsy-6 greatly reduced affinity. Thus, in some nearest-neighbor and positional contexts, pairing to a miRNA C nucleotide can be as important as pairing to a miRNA G nucleotide. More generally, these results showed that the effect of a mismatch to a particular nucleotide was informed primarily by the overall importance of that miRNA nucleotide for pairing (as determined by its nucleotide identity and position within the miRNA 3′ end), irrespective of whether the target nucleotide fell within the middle or terminus of the 3′ site.

To summarize the positional tolerance of bulges, we averaged the effects of the four bulges at each inter-nucleotide position and over each of the 10-nt sites containing that position. Likewise for the deletions, we averaged each single possibility over the 10-nt sites containing that position (Figure 7E). At each position, the severity of both types of lesions tracked with that observed for the mismatches, with the effects of deletions generally similar to those of their corresponding mismatches, and effects of bulged nucleotides marginally less severe.

To examine if the benefit of bulged nucleotides over mismatched nucleotides applied to the very 5′ and 3′ ends of 3′ sites, we considered all possible 10-nt sites for all three miRNAs with programmed libraries, and calculated the fold difference in relative K_D observed when comparing a site with a terminal mismatch to that of the site with a corresponding terminal bulged nucleotide (i.e. the site variant in which the target nucleotide following the mismatch can pair to the mismatched miRNA nucleotide). For each miRNA, a small but significant benefit to terminal bulges was observed (Figure 7F; p = 2.4 × 10⁻⁵, 1.4 × 10⁻⁶, and 4.5 × 10⁻⁴ for let-7a, miR-1, and miR-155, respectively; one-tailed Wilcoxon signed rank test). Thus, an isolated complementary target nucleotide separated from a longer contiguous stretch of pairing can contribute modestly to site affinity.

To enable comparison of the observed effects of mismatches with those predicted by the nearest-neighbor model of RNA duplex stability, we calculated the ∆∆G of each mismatch in the context of all 10-nt 3′ sites of the three miRNAs. We first averaged these values over all the contiguous sites, and then over all positions with the same miRNA nucleotide, and then over the three miRNAs, resulting in one global average ∆∆G value for each of the 12 possible miRNA–target mismatch possibilities. Comparison of these values with those predicted using the nearest-neighbor parameters revealed that the effects of the mismatches were typically much lower than expected for RNA in solution, with no strong relationship between the observed and predicted ∆∆G values (Figure 7G, left; r² = 0.02). The outlier in this analysis was the miRNA–target U:G wobble, which was as disruptive as the typical mismatch but predicted to be much less so (Figure 7G, left, green +). Next, to account directly for the reduced binding energy of the fully complementary sites in comparison to their predicted ∆G values, we compared the average observed and predicted fractional reduction in ∆G of each site caused by each of the 12 mismatch values (Figure 7G, right). For eight of 12 mismatches, the fractional reduction in ∆G was within 10% of its prediction, but the miRNA–target A:G, G:G, G:U, and U:G mismatches respectively caused 31%, 42%, 21%, and 48% more reduction in binding energy than predicted. These results indicated that the nearest-neighbor parameters were not suited for predicting the contribution of miRNA 3′ pairing in three respects: (1) the overall contribution to binding energy was far less than that predicted, (2) mismatched target G nucleotides were relatively more deleterious than predicted, and (3) wobble pairing was relatively less favorable than predicted. Indeed, the U:G possibility, which both contained a target G nucleotide and was a wobble, was the mismatch with the greatest deviation from expectation.

For comparison, we repeated these analyses for mismatches to the miRNA seed (i.e., miRNA positions 2–7) within the context of canonical 8mer pairing, calculating the average ∆∆G and the fractional reduction in ∆G for each type of mismatch for each of the six miRNAs for which there was random-sequence RBNS data (McGeary et al., 2019; Figure 7H). These analyses indicated that the effects of mismatches within seed pairing also did not agree with predicted pairing energetics, albeit differently than the effects of mismatches within 3′ pairing. First, a mismatch within the seed pairing had a much larger influence on ∆∆G than did a mismatch within the 3′ pairing. Moreover, the reductions in binding affinities for mismatches within the seed pairing were even more regular than those for mismatches within the 3′ pairing, with a ~3 kcal/mol detriment for each of the 12 mismatch/wobble possibilities (Figure 7H, left). The fractional reduction in ∆G had a similarly large and uniform effect size, with no subset of the mismatch possibilities showing a relationship with that predicted (Figure 7H, right). Thus, the binding preferences at both the seed and 3′ regions of the miRNA were not well explained by nearest-neighbor rules, although the nature of the deviations differed in these two regions.

Discussion

An AGO-loaded miRNA can be divided into three regions: the seed region (nucleotides 2–8), the central region (nucleotides 9–10 or 9–11), and the 3′ region (Figure 1A; Bartel, 2018). Because the most effective 3′ pairing is reported to center on nucleotides 13–16 (Grimson et al., 2007), some subdivide the 3′ region into the 3′-supplementary region (nucleotides 13–16), and the tail (nucleotides 17 to the terminus), while expanding the central region to include nucleotide 12 (Wee et al., 2012; Schirle et al., 2014; Salomon et al., 2015; Sheu-Gruttadauria et al., 2019b). The structure of AGO2–miR-122 bound to a 3′-supplementary site, which shows that miRNA nucleotides 9–11 are not available for pairing due to both helical distortion and inaccessibility caused by residues of the PIWI and L2 loop, seems to support the notion of a 3′-supplementary region at nucleotides 13–16 (Sheu-Gruttadauria et al., 2019b). However, greater affinities are observed with more extended 3′ pairing (Becker et al., 2019; Sheu-Gruttadauria et al., 2019a), and we found that 3′-site affinities nearly always increased as potential for pairing expanded to include most of the 3′ region—and in the positive-offset binding mode, some of the central region. Thus, productive 3′ pairing can encompass the entire miRNA 3′ region and should not be thought of as limited to a short 3′-supplementary region. Indeed, the study reporting that pairing to nucleotides 13–16 is most effective for supplementing seed pairing uses a model for predicting the efficacy of 3′ pairing that rewards extension of that pairing into the remainder of the 3′ region (Grimson et al., 2007).

Also problematic for the notion of a short 3′-supplementary region common to all miRNAs was our observation that the positions most important for 3′ pairing differed between different miRNAs. For example, at their optimal offsets, both let-7a and miR-124 preferred pairing to nucleotides 11–14 over pairing to nucleotides 13–16 (Figures 2B and 5A, and Figure 5—figure supplement 3A,D), and the synthetic let-7a(−1) preferred pairing to nucleotides 10–13 over pairing to nucleotides 13–16 (Figure 6—figure supplement 2B). Moreover, although miR-155 preferred pairing to nucleotides 13–16 over other 4-nt possibilities, when examining 7-nt 3′ sites, it preferred pairing to nucleotides 15–21 over pairing that included nucleotides 13–16 (Figure 4B and Figure 4—figure supplement 1B). These observations showing that the preferred positions of 3′ pairing can vary so widely between miRNAs, to include virtually any nucleotide downstream of the seed, argued strongly against assigning the same short 3′-supplementary region to all miRNAs.

Our observations that pairing to nucleotides 11–14 of let-7a imparted greater affinity than did pairing to nucleotides 13–16 (Figure 2B and C) and that pairing to nucleotides 11–19 imparted greater repression than did pairing to nucleotides 13–21 (Figure 3), concurred with recent analyses of the relative importance of these nucleotides in Caenorhabditis elegans. C. elegans requires let-7 repression of lin-41 for viability, and this repression occurs through two 3′-compensatory sites that each have pairing to nucleotides 11–19 of the miRNA (Figure 3A; Pasquinelli et al., 2000; Reinhart et al., 2000; Aeschimann et al., 2019). Mutagenesis of individual nucleotides of the let-7 miRNA indicates that nucleotides 11, 12, and 13 are each critical for viability, whereas nucleotides 14, 15, and 16 each have intermediate importance, and nucleotides 17, 18, and 19 each have no detectable importance (Duan et al., 2021). Inspection of our data for let-7a, examining the effects of mismatches within the 3′ site that has the same architecture as that of the two sites within lin-41 (i.e., 9 bp of pairing beginning at position 11 with an offset of +1 nt) revealed a similar polarity, with mismatches near position 11 tending to be most consequential and those near position 19 tending to be least consequential (Figure 7—figure supplement 1D).

Although our results showed that preferred pairing often did not correspond precisely to positions 13–16, preferred pairing did always at least partially overlap this segment. Moreover, as pairing lengths increased from 4 to 6 bp, overlap between preferred pairing and this segment increased, such that the preferred 6-nt sites for let-7a, miR-1, miR-155, miR-124, miR-7 and lsy-6 each included pairing to miRNA nucleotides 13–16. The only exception we observed was the preferred 6-nt site for synthetic let-7a(−1), which paired to nucleotides 10–15. Thus, our results explain why an overall preference for pairing to nucleotides 13–16 was detected in meta-analyses of both functional data for 11 miRNAs and evolutionary conservation of sites for 73 miRNA families (Grimson et al., 2007). Our key added insight is that sequence identity in the 3′ region—particularly the placement of stretches of G residues—imparts additional preferences that supplement the positional preferences to specify different optimal regions of 3′ pairing for different miRNAs.

Another key insight is evidence of two distinct 3′-binding modes, observed as different offset preferences of let-7a, miR-124, lsy-6, and miR-7 with and without pairing to nucleotide 11 (Figure 2B and C and Figure 5—figure supplement 4). In the zero-offset binding mode, an offset of 0 or +1 nt is optimal for 3′ pairing starting at position 12, whereas in the positive-offset binding mode, additional nucleotides are required to bridge pairing to positions 10 or 11, resulting in optimal offsets that exceed +1 nt. In a crystal structure of AGO2–miR-122 bound to a 3′-supplementary target that pairs to nucleotides 13–16 with an offset of 0 nt, nucleotide 12 is the first nucleotide available for pairing, whereas pairing to nucleotide 11 is occluded by the central gate (Sheu-Gruttadauria et al., 2019b). We suggest that this structure reflects the conformation of the zero-offset binding mode, as it provides a physical model for why extension of potential pairing from nucleotide 12 to nucleotide 11 results in almost no increased binding affinity (Figure 5—figure supplement 4) for sites with an offset of 0 nt. However, another structure will be required to visualize the positive-offset binding mode that enables optimal pairing to let-7a and miR-124, as well as strong pairing to lsy-6 and miR-7. Genetically identified sites inferred to be utilizing this second binding mode include the two let-7a sites within the 3′ UTR of C. elegans lin-41, which both include pairing to nucleotide 11 and an offset of +1 nt, as well as the first lsy-6 site within the 3′ UTR of C. elegans cog-1, which includes pairing to nucleotide 11 and an offset of +2 nt. The discovery of these two binding modes required knowledge of the interplay between preferred pairing position and preferred pairing offset, which underscored the utility of obtaining affinity measurements for a large diversity of 3′ sites.

Early attempts to either explain targeting efficacy or predict target sites used scores incorporating, among other things, the predicted binding energy between the miRNAs and their proposed targets (Enright et al., 2003; Lewis et al., 2003; Doench and Sharp, 2004; Rajewsky and Socci, 2004; Krek et al., 2005). That these metrics were less useful in identifying consequential 3′ pairing than simpler rubrics scoring only the length and position of complementarity (Grimson et al., 2007) suggests that the parameters derived from interactions of purified RNAs in solution are not directly relevant to miRNAs associated with AGO. The breadth of our affinity measurements provided the ability to assess why such parameters are not as useful. Although high correspondence was observed between the predicted ∆G and measured 3′-pairing affinities (Figure 5—figure supplement 2A), for miR-1 this relationship nearly disappeared when normalizing for pairing length (Figure 5E). For let-7a and miR-155, a relationship was retained after normalizing for length, but four factors limit the utility of using this relationship for ranking target predictions. The first is the strong effect of position, with complementarity to the seed much more consequential than complementarity to the 3′ region, and complementarity at some positions in the 3′ region more consequential than complementarity to others, and much more consequential than complementarity to positions 1, 9, and often, 10. The second is the effect of primary sequence, as illustrated by the outsized benefit pairing to the G11, G12, and G20 nucleotides of let-7a, miR-1, and miR-155, respectively (Figure 5—figure supplement 2B, C). The third is the poor relationship between the predicted and measured effects of some internal mismatches and wobbles (Figure 7G), and the fourth is a lack of a consistent relationship between predicted ∆G and measured binding affinities between miRNAs (Figure 5—figure supplement 2A, comparing the slope for miR-1 with that for either let-7a or miR-155).

Comparison of the 3′ regions of the four miRNAs that were more effective at 3′ pairing with those of the two that were not suggested a feature that might have conferred higher 3′-pairing affinity: the presence of two or more adjacent G nucleotides (e.g. the G11-G12 of both let-7a and miR-124, and the G19-G20-G21-G22 of miR-155). Although lsy-6 did not have an oligo(G) stretch, it did have a well-positioned C13-G14-C15 trinucleotide, which together with G11 was critical for pairing affinity. When considering all four miRNAs together, as well as the lack of any GG, CG, or GC dinucleotides within the 3′ regions of miR-1 or miR-7, we suggest that miRNAs with GG, CG, or GC dinucleotides within positions 13–16 are the ones most likely to participate in productive 3′ pairing, and that pairing that extends to an oligo(G) sequence outside of positions 13–16 preferentially enhances affinity.

The importance of pairing to miRNA G nucleotides, and not C nucleotides (other than the C13-G14-C15 of lsy-6), suggested that a miRNA–target G:C base pair is read out differently than a C:G base pair. Perhaps G nucleotides participate in base-stacking interactions that position or pre-organize the guide strand to favor nucleation of 3′ pairing. Alternatively, the explanation might involve target-site accessibility. Pairing to a C in the miRNA 3′ region would require a G in the vicinity of the seed match, which compared to a C would cause poorer target-site accessibility (McGeary et al., 2019), thereby reducing the net contribution to binding.

Our results also revealed a functional difference between 3′-supplementary and 3′-compensatory pairing. The added affinity of a 3′ site was relatively constant when it supplemented different sites that had seed matches (Figure 5H and Figure 5—figure supplement 8), whereas it varied in the context of different 3′-compensatory sites that had different seed mismatches (Figure 5F and Figure 5—figure supplements 5–7). The effects of seed mismatches were miRNA-specific and unrelated to their binding affinities (Figure 5G). Additionally, our experiments using chimeric miRNAs demonstrated the separability of the mismatch effects from the length, position, offset, and nucleotide-identity preferences of the 3′ region (Figure 6—figure supplement 1).

Pairing to the miRNA 3′ region not only increases site affinity and target repression, but it can also influence the stability of the miRNA itself, in a process called target-directed miRNA degradation (TDMD) (Ameres et al., 2010; Cazalla et al., 2010; de la Mata et al., 2015; Bitetti et al., 2018; Kleaveland et al., 2018). The handful of target sites known to trigger TDMD have diverse 3′-pairing architectures. For example, degradation of miR-7 triggered by the cellular Cyrano transcript occurs through a canonical 8mer site supplemented with 14 contiguous pairs to the 3′ end of the miRNA (Kleaveland et al., 2018), whereas degradation of miR-27a triggered by the m169 RNA from murine cytomegalovirus occurs through a canonical 7mer-A1 site supplemented with only six contiguous pairs to the 3′ end of the miRNA (Marcinowski et al., 2012). Our finding that miR-7 has the weakest 3′ pairing among the six miRNAs we studied provides a potential explanation as to why its TDMD trigger Cyrano has such a long 3′ site.

The crystal structures of several known TDMD substrates bound to their corresponding TDMD-inducing target sites reveal a distinct conformation for these AGO–miRNA–target RNA ternary complexes in comparison to ternary complexes that have supplementary pairing involving only nucleotides 13–16 (Sheu-Gruttadauria et al., 2019a; Sheu-Gruttadauria et al., 2019b). During TDMD, this distinct conformation is thought to be recognized by the ZSWIM8 E3 ubiquitin ligase, causing AGO proteolysis through the ubiquitin–proteasome system, which exposes the miRNA to degradation by cellular nucleases (Han et al., 2020; Shi et al., 2020). Our discovery of the two 3′ binding modes raises the question of whether one of them might be more compatible with TDMD, perhaps due to a preference of the ZSWIM8 E3 ligase. Although the TDMD ternary complexes of the published structures all have 3′ pairing beginning at nucleotide 12 or later and offsets of 0 or −1 nt (Sheu-Gruttadauria et al., 2019a) and thereby represent the zero-offset binding mode, the 3′ pairing between miR-7 and Cyrano begins at G11 and has a +2-nt offset, which represents the positive-offset binding mode. Thus, the two 3′-binding modes both appear to be compatible with either of the two gene-regulatory processes that involve 3′ pairing—TDMD and miRNA-mediated repression.

Share this article

Cite this article

Features of miRNA 3′-compensatory sites characterized using AGO-RBNS.

Pairing to nucleotide 11 and a positive offset promote high-affinity binding to let-7a.in P.

Interplay between the effects of length, position, and offset of 3′ pairing, as measured for let-7a by comparing efficacy of repression in cells.

Relative affinity measurements of 3′-compensatory sites of miR-1 and miR-155.

Distinct pairing-range, offset, and seed-mismatch preferences of different miRNAs.

Variant miRNAs designed to query the contributions of the seed and 3′ regions to binding, and the positional dependence of pairing preferences of the 3′ region.

The impact of mismatched, bulged, and deleted target nucleotides on 3′-compensatory pairing.

Author details

Sean E McGeary

Contribution

Contributed equally with

Competing interests

Namita Bisaria

Contribution

Contributed equally with

Competing interests

Thy M Pham

Contribution

Competing interests

Peter Y Wang

Contribution

Competing interests

David P Bartel

Contribution

For correspondence

Competing interests

Downloads (link to download the article as PDF)

Open citations (links to open the citations from this article in various online reference manager services)

Cite this article (links to download the citations from this article in formats compatible with various reference manager tools)

Categories and tags

Research organisms

Further reading