Data availability
Deep sequencing data that support the findings of this study have been deposited in the GEO database under accession code GSE145046. Source data are provided with this paper.
Code availability
All procedures except sequencing read cleaning were performed using custom Perl and R scripts, which are available upon reasonable request. Source data are provided with this paper.
References
Bicknell, A. A. & Ricci, E. P. When mRNA translation meets decay. Biochem. Soc. Trans. 45, 339–351 (2017).
Schwartz, D. C. & Parker, R. Mutations in translation initiation factors lead to increased rates of deadenylation and decapping of mRNAs in Saccharomyces cerevisiae. Mol. Cell Biol. 19, 5247–5256 (1999).
Kurosaki, T., Popp, M. W. & Maquat, L. E. Quality and quantity control of gene expression by nonsense-mediated mRNA decay. Nat. Rev. Mol. Cell Biol. 20, 406–420 (2019).
Presnyak, V. et al. Codon optimality is a major determinant of mRNA stability. Cell 160, 1111–1124 (2015).
Chan, L. Y., Mugler, C. F., Heinrich, S., Vallotton, P. & Weis, K. Non-invasive measurement of mRNA decay reveals translation initiation as the major determinant of mRNA stability. eLife 7, e32536 (2018).
Hinnebusch, A. G., Ivanov, I. P. & Sonenberg, N. Translational control by 5′-untranslated regions of eukaryotic mRNAs. Science 352, 1413–1416 (2016).
Dvir, S. et al. Deciphering the rules by which 5′-UTR sequences affect protein expression in yeast. Proc. Natl Acad. Sci. USA 110, E2792–E2801 (2013).
Noderer, W. L. et al. Quantitative analysis of mammalian translation initiation sites by FACS-seq. Mol. Syst. Biol. 10, 748 (2014).
Cuperus, J. T. et al. Deep learning of the regulatory grammar of yeast 5′ untranslated regions from 500,000 random sequences. Genome Res. 27, 2015–2024 (2017).
Sample, P. J. et al. Human 5′ UTR design and variant effect prediction from a massively parallel translation assay. Nat. Biotechnol. 37, 803–809 (2019).
Starck, S. R. et al. Translation from the 5′ untranslated region shapes the integrated stress response. Science 351, aad3867 (2016).
Dersh, D., Yewdell, J. W. & Wei, J. A SIINFEKL-based system to measure MHC class I antigen presentation efficiency and kinetics. Methods Mol. Biol. 1988, 109–122 (2019).
Ingolia, N. T., Lareau, L. F. & Weissman, J. S. Ribosome profiling of mouse embryonic stem cells reveals the complexity and dynamics of mammalian proteomes. Cell 147, 789–802 (2011).
Lee, S., Liu, B., Huang, S. X., Shen, B. & Qian, S. B. Global mapping of translation initiation sites in mammalian cells at single-nucleotide resolution. Proc. Natl Acad. Sci. USA 109, E2424–E2432 (2012).
Kearse, M. G. & Wilusz, J. E. Non-AUG translation: a new start for protein synthesis in eukaryotes. Genes Dev. 31, 1717–1731 (2017).
Fritz, D. T., Ford, L. P. & Wilusz, J. An in vitro assay to study regulated mRNA stability. Sci. STKE 2000, pl1 (2000).
Kim, Y. K. & Maquat, L. E. UPFront and center in RNA decay: UPF1 in nonsense-mediated mRNA decay and beyond. RNA 25, 407–422 (2019).
Hogg, J. R. & Goff, S. P. Upf1 senses 3′UTR length to potentiate mRNA decay. Cell 143, 379–389 (2010).
Kwok, C. K., Marsico, G. & Balasubramanian, S. Detecting RNA G-quadruplexes (rG4s) in the transcriptome. Cold Spring Harb. Perspect. Biol. 10, a032284 (2018).
Fay, M. M., Lyons, S. M. & Ivanov, P. RNA G-quadruplexes in biology: principles and molecular mechanisms. J. Mol. Biol. 429, 2127–2147 (2017).
Chen, X. C. et al. Tracking the dynamic folding and unfolding of RNA G-quadruplexes in live cells. Angew. Chem. Int. Ed. Engl. 57, 4702–4706 (2018).
Kumari, S., Bugaut, A., Huppert, J. L. & Balasubramanian, S. An RNA G-quadruplex in the 5′ UTR of the NRAS proto-oncogene modulates translation. Nat. Chem. Biol. 3, 218–221 (2007).
Herdy, B. et al. Analysis of NRAS RNA G-quadruplex binding proteins reveals DDX3X as a novel interactor of cellular G-quadruplex containing transcripts. Nucleic Acids Res. 46, 11592–11604 (2018).
Weingarten-Gabbay, S. et al. Systematic discovery of cap-independent translation sequences in human and viral genomes. Science 351, aad4939 (2016).
Wahle, E. & Winkler, G. S. RNA decay machines: deadenylation by the Ccr4–Not and Pan2–Pan3 complexes. Biochim. Biophys. Acta 1829, 561–570 (2013).
Gilbert, W. V., Zhou, K., Butler, T. K. & Doudna, J. A. Cap-independent translation is required for starvation-induced differentiation in yeast. Science 317, 1224–1227 (2007).
Mayr, C. Regulation by 3′-untranslated regions. Annu. Rev. Genet. 51, 171–194 (2017).
Martin, M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet J. 17, 10–12 (2011).
Acknowledgements
We thank the Grimson lab for providing us with several shRNAs targeting mammalian decay factors. We are grateful to the Cornell University Life Sciences Core Laboratory Center for sequencing, FACS, and confocal microscope support. This work was supported by the US National Institutes of Health (R01GM1222814 and R21CA227917) and an HHMI Faculty Scholar award (55108556) to S.-B.Q.
Author information
Author notes
These authors contributed equally: Longfei Jia, Yuanhui Mao.
Authors and Affiliations
Division of Nutritional Sciences, Cornell University, Ithaca, NY, USA
Longfei Jia, Yuanhui Mao, Quanquan Ji & Shu-Bing Qian
Cellular Biology Section, Laboratory of Viral Diseases, NIAID, Bethesda, MD, USA
Devin Dersh & Jonathan W. Yewdell
Authors
- Longfei Jia
- Yuanhui Mao
- Quanquan Ji
- Devin Dersh
- Jonathan W. Yewdell
- Shu-Bing Qian
Contributions
S.-B.Q. conceived the project and designed the experiments. L.J. performed the majority of experiments and Y.M. conducted the majority of data analysis. Q.J. contributed to the PABP1 knockdown experiments. D.D. and J.W.Y. helped with 25D1 reagents and HEK293-Kb cells. S.-B.Q. wrote the manuscript with comments from L.J. and Y.M. All authors discussed the results and edited the manuscript.
Corresponding author
Correspondence to Shu-Bing Qian.
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Peer review information Peer reviewer reports are available. Anke Sparmann was the primary editor on this article and managed its editorial process and peer review in collaboration with the rest of the editorial team.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Extended data
Extended Data Fig. 1 Characterization of uORF reporters.
a, Basic design of the uORF reporter with the SIINFEKL sequence highlighted. Five additional amino acids (LEQLE) are present, which permits processing of SIINFEKL from the same flanking amino acids regardless of the TIS sequence. b, Bar graphs show the 25D1 or GFP mean fluorescence of HEK293-Kb cells transfected with plasmids of uORF reporters. n = 2 biological replicates. Error bars indicate SEM. c, Representative flow cytometry scatterplots of HEK293-Kb cells transfected with synthetic mRNA reporters with optimal or suboptimal AUG codons. d, A bar graph shows the ratio of 25D1/GFP in HEK293-Kb cells transfected with synthetic mRNA or plasmid DNA. Error bars, mean ± s.e.m.; n = 3 biological replicates.
Source data
Extended Data Fig. 2 Establishing massively parallel uORF reporters.
a, Schematic of generating a library of RNA-based uORF reporters by PCR amplification using primers composed of random 10-nt sequences upstream of the uORF. Pooled PCR products were utilized as templates for in vitro RNA synthesis followed by 5′ capping and 3′ polyadenylation. b, Comparison of sequence randomness for nucleotide oligos synthesized by different vendors. c, A histogram shows the distribution of read counts on individual unique random sequences. d, A scatter plot shows the correlation of read counts between two biological replicates of original oligo sequences (Rho = 0.79, P < 2.2 × 10⁻¹⁶).
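The replicate comparison in panel d reports a rank correlation (Rho). As a minimal sketch of how such a value can be computed from two vectors of per-sequence read counts, the following assumes Spearman's rho with average ranks for ties; the function names `average_ranks` and `spearman_rho` are hypothetical, and this is not the authors' Perl or R code.

```python
from typing import List, Sequence


def average_ranks(values: Sequence[float]) -> List[float]:
    """Rank values from 1..n; tied values share the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j across the run of values tied with the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1  # positions are 0-based, ranks are 1-based
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def spearman_rho(x: Sequence[float], y: Sequence[float]) -> float:
    """Spearman's rho: the Pearson correlation of the two rank vectors."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mean_x, mean_y = sum(rx) / n, sum(ry) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(rx, ry))
    var_x = sum((a - mean_x) ** 2 for a in rx)
    var_y = sum((b - mean_y) ** 2 for b in ry)
    if var_x == 0 or var_y == 0:
        raise ValueError("rho is undefined when one input is constant")
    return cov / (var_x * var_y) ** 0.5
```

Here `x` and `y` would hold the read counts of the same unique random sequences in the two replicates, in the same order. The P value quoted in the legend is not computed by this sketch.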
Extended Data Fig. 3 Characterizing massively parallel uORF reporters using flow cytometry.
a, Representative flow cytometry scatterplots of HEK293-Kb cells transfected with the GFP mRNA reporter or pooled uORF reporters. Relative GFP and 25D1 fluorescence intensity between GFP and uORF-GFP reporters are shown in histograms as well as bar graphs. Error bars, mean ± s.e.m.; n = 3 biological replicates. b, HEK293-Kb cells were transfected with mRNA reporters followed by FACS sorting into GFPH and 25D1H populations. A bar plot (top) shows the ratio of triplet frequency within the random sequences enriched in the 25D1H population over the GFPH population. Only the top 10% of sequence variants ranked in the 25D1H and GFPH populations are used. The original frequency of triplets in different populations is shown as a heat map (bottom). c, Correlation of triplet frequencies within the sequence variants enriched in 25D1H or GFPH populations. All points are color-encoded based on the similarity to ATG.
Source data
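The triplet frequency ratio described in panel b of Extended Data Fig. 3 can be illustrated with a short sketch. It assumes that triplets are counted as overlapping 3-mers within each random sequence and that only triplets observed in both populations receive a ratio; both choices, and the function names, are assumptions made for illustration rather than details taken from the paper.

```python
from collections import Counter
from typing import Dict, Iterable


def triplet_frequencies(sequences: Iterable[str]) -> Dict[str, float]:
    """Relative frequency of every overlapping 3-mer across the sequences."""
    counts: Counter = Counter()
    for seq in sequences:
        seq = seq.upper()
        for i in range(len(seq) - 2):
            counts[seq[i:i + 3]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {triplet: n / total for triplet, n in counts.items()}


def triplet_enrichment(
    numerator_seqs: Iterable[str], denominator_seqs: Iterable[str]
) -> Dict[str, float]:
    """Ratio of triplet frequencies, numerator population over denominator.

    Triplets missing from either population are skipped (an assumption;
    a pseudocount would be another option).
    """
    num = triplet_frequencies(numerator_seqs)
    den = triplet_frequencies(denominator_seqs)
    return {t: num[t] / den[t] for t in num.keys() & den.keys()}
```

In the setting of the legend, the numerator would be the top-ranked variants from the 25D1H population and the denominator those from the GFPH population.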
Extended Data Fig. 4 Identification of TIS sequence features in uORF reporters enriched in ribosome fractions.
a, A total of 14,381 sequences with all permutations of NNNNAUGNNN are ranked based on the ratio of frequency between monosome and polysome. Both the top and bottom hits are highlighted. b, Sequence logo of 10 nt random sequences with high (top) or low (bottom) M/P ratio. Note that the high M/P sequence is consistent with the Kozak consensus sequence. c, A violin plot shows the ratio of relative frequencies of inserted sequences in monosome over polysome, when ATG triplet is followed by different nucleotides. d, Representative flow cytometry scatterplots of HEK 293-Kb cells transfected with mRNA reporters with sequence variants chosen from monosome (M1 - M4) or polysome (P1 - P4).
Source data
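The ranking in panel a of Extended Data Fig. 4 rests on a per-sequence ratio of frequencies between the monosome and polysome fractions. A minimal sketch of that calculation follows, assuming that frequencies are normalized to the total read count of each fraction and that only sequences detected in both fractions are ranked; the function name `rank_by_mp_ratio` and the input dictionaries are hypothetical.

```python
import re
from typing import Dict, List, Tuple

# Ten-nucleotide inserts of the form NNNNAUGNNN (AUG at positions 5 to 7).
AUG_INSERT = re.compile(r"^[ACGU]{4}AUG[ACGU]{3}$")


def to_rna(seq: str) -> str:
    """Upper-case the sequence and write it in the RNA alphabet."""
    return seq.upper().replace("T", "U")


def rank_by_mp_ratio(
    mono_counts: Dict[str, int], poly_counts: Dict[str, int]
) -> List[Tuple[str, float]]:
    """Rank NNNNAUGNNN inserts by monosome/polysome frequency ratio, highest first."""
    mono = {to_rna(s): c for s, c in mono_counts.items()}
    poly = {to_rna(s): c for s, c in poly_counts.items()}
    # Assumption: normalize by all reads in the fraction, not only AUG inserts.
    mono_total = sum(mono.values())
    poly_total = sum(poly.values())
    ratios = {}
    for seq in mono.keys() & poly.keys():
        if not AUG_INSERT.match(seq) or poly[seq] == 0:
            continue
        ratios[seq] = (mono[seq] / mono_total) / (poly[seq] / poly_total)
    return sorted(ratios.items(), key=lambda item: item[1], reverse=True)
```

The same ratios, grouped by the nucleotide that follows the AUG, would give the kind of comparison shown in panel c.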
Extended Data Fig. 5 uORF translation triggers mRNA decay in a ribosome-dependent manner.
a, A scatter plot shows the correlation of triplet frequencies enriched in polysome and GFPH population from HEK293-Kb cells transfected with mRNA reporters. All points are color-encoded based on the similarity to ATG. b, A scatter plot shows the correlation of in vivo and in vitro half-lives of mRNA reporters. Half-life values were centered to medians. c, A boxplot shows positive correlation between GFP intensities and half-lives of mRNA reporters. All random sequences were divided into five groups based on GFP intensity measured by flow cytometry. d, HEK293-Kb cells were transfected with DNA plasmids with or without optimal ATG codons (left) or representative hits from M and P fractions (right), followed by RT-qPCR at indicated time points (n = 3 biological replicates; t test). Error bars indicate SEM. ** P < 0.01; * P < 0.05. e, Representative flow cytometry scatterplots of HEK293-Kb cells transfected with mRNA reporters capped with ApppG with sequence variants chosen from monosome (M1) or polysome (P1).
Source data
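The legends above refer to mRNA half-lives and to RT-qPCR measurements at successive time points, but this section does not state how half-lives were derived. As a hedged illustration only, the sketch below assumes first-order (exponential) decay and estimates the half-life from a least-squares fit of log-transformed abundance against time; the function name `estimate_half_life` is hypothetical and the approach may differ from the authors' scripts.

```python
import math
from typing import Sequence


def estimate_half_life(times: Sequence[float], levels: Sequence[float]) -> float:
    """Half-life from a log-linear fit, assuming levels ~ exp(-k * t).

    times  : time points (any unit; the result is in the same unit)
    levels : relative mRNA abundance at each time point (must be > 0)
    """
    if len(times) != len(levels) or len(times) < 2:
        raise ValueError("need at least two matched (time, level) pairs")
    if any(level <= 0 for level in levels):
        raise ValueError("levels must be positive to take logarithms")
    logs = [math.log(level) for level in levels]
    n = len(times)
    mean_t = sum(times) / n
    mean_y = sum(logs) / n
    sxx = sum((t - mean_t) ** 2 for t in times)
    if sxx == 0:
        raise ValueError("time points must not all be identical")
    # Ordinary least-squares slope of ln(level) against time equals -k.
    slope = sum((t - mean_t) * (y - mean_y) for t, y in zip(times, logs)) / sxx
    if slope >= 0:
        raise ValueError("no decay detected (slope is not negative)")
    return math.log(2) / -slope
```

For example, abundances that halve every hour (1.0, 0.5, 0.25 at 0, 1, 2 h) give an estimated half-life of about 1 h.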
Extended Data Fig. 6 Identification of inhibitory elements in 5’UTR that impair translation.
a, Flow cytometry scatterplots of HEK 293-Kb cells transfected with mRNA reporters enriched in the ribosome-free fractions (N1 – N8). b, HEK293-Kb cells were transfected with mRNA reporters or plasmids with sequence variants chosen from the ribosome-free fractions (N1 – N8), followed by RT-qPCR at indicated time points (n = 3 biological replicates; t test). Error bars indicate SEM. ** P < 0.01. c, Flow cytometry scatterplots of HEK 293-Kb cells transfected with mRNA reporters enriched in the ribosome-free fractions (N1 – N8) and stained with QUMA-1.
Source data
Extended Data Fig. 7 RG4 in 5’UTR triggers mRNA decay in the P-body.
a, HEK293-Kb cells with specific decay factor knockdown were transfected with the N1 mRNA reporter, followed by RT-qPCR at indicated time points. (n = 3 biological replicates; t test). Error bars indicate SEM. * P < 0.05. b, Representative confocal images of HEK 293-Kb cells transfected with P1 or N1 mRNA reporters. The mRNA reporters were synthesized in the presence of Alexa Fluor-UTP and the transfected cells were co-stained with a DCP2 antibody. DNA was counter-stained with Hoechst 33342. Arrowheads indicate typical mRNA foci. Bar, 10 μm. Images are representative of at least 50 cells.
Source data
Extended Data Fig. 8 RG4 in 5’UTR derived from NRAS triggers mRNA decay.
a, The left panel shows the schematic of mRNA reporter with 5’UTR derived from NRAS with (WT) or without (MT) RG4. The right panel shows the decay of mRNA reporters in transfected HEK293-Kb cells. (n = 3 biological replicates; t test). Error bars indicate SEM. ** P < 0.01. b, The stability of endogenous NRAS was measured in HEK293-Kb cells with or without DHX36 knockdown. (n = 3 biological replicates; t test). Error bars indicate SEM. * P < 0.05.
Source data
Extended Data Fig. 9 An A-rich element in 5’UTR enables cap-independent translation.
a, A scatter plot shows the distribution of quadruplets between ribosome-free and ribosome-bound fractions. Each point represents a 4-mer sequence, and all points are color-encoded based on the ratio of A/C. b, Representative flow cytometry of HEK293-Kb cells transfected with 10C, 10A, 5A, or 4A mRNA reporters capped with non-functional ApppG. c, Representative flow cytometry of HEK293-Kb cells transfected with mRNA reporters containing AAAAAUGAAA capped with m7G or ApppG.
Source data
Extended Data Fig. 10 An A-rich element in 5’UTR promotes translation-independent decay.
a, HEK293-Kb cells were transfected with 10A or 10C mRNA reporters capped with m7G (right) or ApppG (left), followed by RT-qPCR at indicated time points. (n = 3 biological replicates; t test). Error bars indicate SEM. b, The left panel shows the distribution of in vitro half-lives of mRNA reporters. The most stable (top 10%) sequences are highlighted in red, and the most unstable sequences are highlighted in light blue. The right panel shows the heat map of base frequency at different positions of random sequences. c, A violin plot shows the half-lives of mRNA groups with different numbers of consecutive As in random sequences. d, A heat map shows the effect of A-cluster length and position on the in vitro half-life of mRNA reporters. e, The in vitro decay of mRNA reporters (10A, 5A, and 4A) in the lysates of HEK293-Kb cells was determined by RT-qPCR at indicated time points. (n = 3 biological replicates; t test). Error bars indicate SEM. f, The in vitro decay of 10A mRNA reporters in the lysates of HEK293-Kb cells with or without UPF1 knockdown was determined by RT-qPCR at indicated time points (left). For the in vivo stability, HEK293-Kb cells with or without UPF1 knockdown were transfected with 10A mRNA reporters followed by RT-qPCR at indicated time points (right). (n = 3 biological replicates; t test). Error bars indicate SEM. g, The in vitro stability of mRNA reporters (10A, M1, and P1) in the lysates of HEK293-Kb cells with CNOT1, PARN, or PAN3 knockdown was determined by RT-qPCR at indicated time points. (n = 3 biological replicates; t test). Error bars indicate SEM. ** P < 0.01.
Source data
Supplementary information
Supplementary Table 1
List of all oligonucleotide sequences used
Source data
Source Data Fig. 1
Statistical Source Data
Source Data Fig. 2
Statistical Source Data
Source Data Fig. 3
Statistical Source Data
Source Data Fig. 4
Statistical Source Data
Source Data Fig. 5
Statistical Source Data
Source Data Fig. 6
Statistical Source Data
Source Data Extended Data Fig. 1
Statistical Source Data
Source Data Extended Data Fig. 3
Statistical Source Data
Source Data Extended Data Fig. 4
Statistical Source Data
Source Data Extended Data Fig. 5
Statistical Source Data
Source Data Extended Data Fig. 6
Statistical Source Data
Source Data Extended Data Fig. 7
Statistical Source Data
Source Data Extended Data Fig. 8
Statistical Source Data
Source Data Extended Data Fig. 9
Statistical Source Data
Source Data Extended Data Fig. 10
Statistical Source Data
About this article
Cite this article
Jia, L., Mao, Y., Ji, Q. et al. Decoding mRNA translatability and stability from the 5′ UTR. Nat Struct Mol Biol 27, 814–821 (2020). https://doi.org/10.1038/s41594-020-0465-x
DOI: https://doi.org/10.1038/s41594-020-0465-x