Shared posts

18 May 07:30

Synthesis and applications of RNAs with position-selective labelling and mosaic composition

by Yu Liu

Nature advance online publication 04 May 2015. doi:10.1038/nature14352

Authors: Yu Liu, Erik Holmstrom, Jinwei Zhang, Ping Yu, Jinbu Wang, Marzena A. Dyba, De Chen, Jinfa Ying, Stephen Lockett, David J. Nesbitt, Adrian R. Ferré-D’Amaré, Rui Sousa, Jason R. Stagno & Yun-Xing Wang

Knowledge of the structure and dynamics of RNA molecules is critical to understanding their many biological functions. Furthermore, synthetic RNAs have applications as therapeutics and molecular sensors. Both research and technological applications of RNA would be dramatically enhanced by methods that enable incorporation of modified or labelled nucleotides into specifically designated positions or regions of RNA. However, the synthesis of tens of milligrams of such RNAs using existing methods has been impossible. Here we develop a hybrid solid–liquid phase transcription method and automated robotic platform for the synthesis of RNAs with position-selective labelling. We demonstrate its use by successfully preparing various isotope- or fluorescently labelled versions of the 71-nucleotide aptamer domain of an adenine riboswitch for nuclear magnetic resonance spectroscopy or single-molecule Förster resonance energy transfer, respectively. Those RNAs include molecules that were selectively isotope-labelled in specific loops, linkers, a helix, several discrete positions, or a single internal position, as well as RNA molecules that were fluorescently labelled in and near kissing loops. These selectively labelled RNAs have the same fold as those transcribed using conventional methods, but they greatly simplify the interpretation of NMR spectra. The single-position isotope- and fluorescently labelled RNA samples reveal multiple conformational states of the adenine riboswitch. Lastly, we describe a robotic platform and the operation that automates this technology. Our selective labelling method may be useful for studying RNA structure and dynamics and for making RNA sensors for a variety of applications including cell-biological studies, substance detection, and disease diagnostics.

15 May 02:40

Apoptosis Triggers Specific, Rapid, and Global mRNA Decay with 3′ Uridylated Intermediates Degraded by DIS3L2

Publication date: 19 May 2015
Source:Cell Reports, Volume 11, Issue 7
Author(s): Marshall P. Thomas , Xing Liu , Jennifer Whangbo , Geoffrey McCrossan , Keri B. Sanborn , Emre Basar , Michael Walch , Judy Lieberman
Apoptosis is a tightly coordinated cell death program that damages mitochondria, DNA, proteins, and membrane lipids. Little is known about the fate of RNA as cells die. Here, we show that mRNAs, but not noncoding RNAs, are rapidly and globally degraded during apoptosis. mRNA decay is triggered early in apoptosis, preceding membrane lipid scrambling, genomic DNA fragmentation, and apoptotic changes to translation initiation factors. mRNA decay depends on mitochondrial outer membrane permeabilization and is amplified by caspase activation. 3′ truncated mRNA decay intermediates with nontemplated uridylate-rich tails are generated during apoptosis. These tails are added by the terminal uridylyl transferases (TUTases) ZCCHC6 and ZCCHC11, and the uridylated transcript intermediates are degraded by the 3′ to 5′ exonuclease DIS3L2. Knockdown of DIS3L2 or the TUTases inhibits apoptotic mRNA decay, translation arrest, and cell death, whereas DIS3L2 overexpression enhances cell death. Our results suggest that global mRNA decay is an overlooked hallmark of apoptosis.

Graphical abstract

image

Teaser

Thomas et al. show that mRNAs, but not noncoding RNAs, are rapidly and globally degraded early in apoptosis. mRNA decay is triggered by mitochondrial outer membrane permeabilization and involves 3′ uridylation by terminal uridylyl transferases (TUTases) and exonuclease digestion by the uridylate-specific exonuclease DIS3L2.
04 May 07:46

RBM7 subunit of the NEXT complex binds U-rich sequences and targets 3'-end extended forms of snRNAs

by Hrossova, D., Sikorsky, T., Potesil, D., Bartosovic, M., Pasulka, J., Zdrahal, Z., Stefl, R., Vanacova, S.

The Nuclear Exosome Targeting (NEXT) complex is a key cofactor of the mammalian nuclear exosome in the removal of Promoter Upstream Transcripts (PROMPTs) and potentially aberrant forms of other noncoding RNAs, such as snRNAs. NEXT is composed of three subunits SKIV2L2, ZCCHC8 and RBM7. We have recently identified the NEXT complex in our screen for oligo(U) RNA-binding factors. Here, we demonstrate that NEXT displays preference for U-rich pyrimidine sequences and this RNA binding is mediated by the RNA recognition motif (RRM) of the RBM7 subunit. We solved the structure of RBM7 RRM and identified two phenylalanine residues that are critical for interaction with RNA. Furthermore, we showed that these residues are required for the NEXT interaction with snRNAs in vivo. Finally, we show that depletion of components of the NEXT complex alone or together with exosome nucleases resulted in the accumulation of mature as well as extended forms of snRNAs. Thus, our data suggest a new scenario in which the NEXT complex is involved in the surveillance of snRNAs and/or biogenesis of snRNPs.

02 May 01:37

The ribonuclease DIS3 promotes let-7 miRNA maturation by degrading the pluripotency factor LIN28B mRNA.

by Segalla S, Pivetti S, Todoerti K, Chudzik MA, Giuliani EC, Lazzaro F, Volta V, Lazarevic D, Musco G, Muzi-Falconi M, Neri A, Biffo S, Tonon G

The ribonuclease DIS3 promotes let-7 miRNA maturation by degrading the pluripotency factor LIN28B mRNA.

Nucleic Acids Res. 2015 Apr 29;

Authors: Segalla S, Pivetti S, Todoerti K, Chudzik MA, Giuliani EC, Lazzaro F, Volta V, Lazarevic D, Musco G, Muzi-Falconi M, Neri A, Biffo S, Tonon G

Abstract
Multiple myeloma, the second most frequent hematologic tumor after lymphomas, is an incurable cancer. Recent sequencing efforts have identified the ribonuclease DIS3 as one of the most frequently mutated genes in this disease. DIS3 represents the catalytic subunit of the exosome, a macromolecular complex central to the processing, maturation and surveillance of various RNAs. miRNAs are an evolutionarily conserved class of small noncoding RNAs, regulating gene expression at post-transcriptional level. Ribonucleases, including Drosha, Dicer and XRN2, are involved in the processing and stability of miRNAs. However, the role of DIS3 on the regulation of miRNAs remains largely unknown. Here we found that DIS3 regulates the levels of the tumor suppressor let-7 miRNAs without affecting other miRNA families. DIS3 facilitates the maturation of let-7 miRNAs by reducing in the cytoplasm the RNA stability of the pluripotency factor LIN28B, a inhibitor of let-7 processing. DIS3 inactivation, through the increase of LIN28B and the reduction of mature let-7, enhances the translation of let-7 targets such as MYC and RAS leading to enhanced tumorigenesis. Our study establishes that the ribonuclease DIS3, targeting LIN28B, sustains the maturation of let-7 miRNAs and suggests the increased translation of critical oncogenes as one of the biological outcomes of DIS3 inactivation.

PMID: 25925570 [PubMed - as supplied by publisher]

01 May 05:42

Circular RNAs in the Mammalian Brain Are Highly Abundant, Conserved, and Dynamically Expressed.

by Rybak-Wolf A, Stottmeister C, Glažar P, Jens M, Pino N, Giusti S, Hanan M, Behm M, Bartok O, Ashwal-Fluss R, Herzog M, Schreyer L, Papavasileiou P, Ivanov A, Öhman M, Refojo D, Kadener S, Rajewsky N

Circular RNAs in the Mammalian Brain Are Highly Abundant, Conserved, and Dynamically Expressed.

Mol Cell. 2015 Apr 22;

Authors: Rybak-Wolf A, Stottmeister C, Glažar P, Jens M, Pino N, Giusti S, Hanan M, Behm M, Bartok O, Ashwal-Fluss R, Herzog M, Schreyer L, Papavasileiou P, Ivanov A, Öhman M, Refojo D, Kadener S, Rajewsky N

Abstract
Circular RNAs (circRNAs) are an endogenous class of animal RNAs. Despite their abundance, their function and expression in the nervous system are unknown. Therefore, we sequenced RNA from different brain regions, primary neurons, isolated synapses, as well as during neuronal differentiation. Using these and other available data, we discovered and analyzed thousands of neuronal human and mouse circRNAs. circRNAs were extraordinarily enriched in the mammalian brain, well conserved in sequence, often expressed as circRNAs in both human and mouse, and sometimes even detected in Drosophila brains. circRNAs were overall upregulated during neuronal differentiation, highly enriched in synapses, and often differentially expressed compared to their mRNA isoforms. circRNA expression correlated negatively with expression of the RNA-editing enzyme ADAR1. Knockdown of ADAR1 induced elevated circRNA expression. Together, we provide a circRNA brain expression atlas and evidence for important circRNA functions and values as biomarkers.

PMID: 25921068 [PubMed - as supplied by publisher]

01 May 05:19

Synergistic and Independent Actions of Multiple Terminal Nucleotidyl Transferases in the 3’ Tailing of Small RNAs in Arabidopsis

by Xiaoyan Wang et al.

by Xiaoyan Wang, Shuxin Zhang, Yongchao Dou, Chi Zhang, Xuemei Chen, Bin Yu, Guodong Ren

All types of small RNAs in plants, piwi-interacting RNAs (piRNAs) in animals and a subset of siRNAs in Drosophila and C. elegans are subject to HEN1 mediated 3’ terminal 2’-O-methylation. This modification plays a pivotal role in protecting small RNAs from 3’ uridylation, trimming and degradation. In Arabidopsis, HESO1 is a major enzyme that uridylates small RNAs to trigger their degradation. However, U-tail is still present in null hen1 heso1 mutants, suggesting the existence of (an) enzymatic activities redundant with HESO1. Here, we report that UTP: RNA uridylyltransferase (URT1) is a functional paralog of HESO1. URT1 interacts with AGO1 and plays a predominant role in miRNA uridylation when HESO1 is absent. Uridylation of miRNA is globally abolished in a hen1 heso1 urt1 triple mutant, accompanied by an extensive increase of 3’-to-5’ trimming. In contrast, disruption of URT1 appears not to affect the heterochromatic siRNA uridylation. This indicates the involvement of additional nucleotidyl transferases in the siRNA pathway. Analysis of miRNA tailings in the hen1 heso1 urt1 triple mutant also reveals the existence of previously unknown enzymatic activities that can add non-uridine nucleotides. Importantly, we show HESO1 may also act redundantly with URT1 in miRNA uridylation when HEN1 is fully competent. Taken together, our data not only reveal a synergistic action of HESO1 and URT1 in the 3’ uridylation of miRNAs, but also independent activities of multiple terminal nucleotidyl transferases in the 3’ tailing of small RNAs and an antagonistic relationship between uridylation and trimming. Our results may provide further insight into the mechanisms of small RNA 3’ end modification and stability control.
01 May 05:17

Distinct and Cooperative Activities of HESO1 and URT1 Nucleotidyl Transferases in MicroRNA Turnover in Arabidopsis

by Bin Tu et al.

by Bin Tu, Li Liu, Chi Xu, Jixian Zhai, Shengben Li, Miguel A. Lopez, Yuanyuan Zhao, Yu Yu, Vanitharani Ramachandran, Guodong Ren, Bin Yu, Shigui Li, Blake C. Meyers, Beixin Mo, Xuemei Chen

3’ uridylation is increasingly recognized as a conserved RNA modification process associated with RNA turnover in eukaryotes. 2’-O-methylation on the 3’ terminal ribose protects micro(mi)RNAs from 3’ truncation and 3’ uridylation in Arabidopsis. Previously, we identified HESO1 as the nucleotidyl transferase that uridylates most unmethylated miRNAs in vivo, but substantial 3’ tailing of miRNAs still remains in heso1 loss-of-function mutants. In this study, we found that among nine other potential nucleotidyl transferases, UTP:RNA URIDYLYLTRANSFERASE 1 (URT1) is the single most predominant nucleotidyl transferase that tails miRNAs. URT1 and HESO1 prefer substrates with different 3’ end nucleotides in vitro and act cooperatively to tail different forms of the same miRNAs in vivo. Moreover, both HESO1 and URT1 exhibit nucleotidyl transferase activity on AGO1-bound miRNAs. Although these enzymes are able to add long tails to AGO1-bound miRNAs, the tailed miRNAs remain associated with AGO1. Moreover, tailing of AGO1-bound miRNA165/6 drastically reduces the slicing activity of AGO1-miR165/6, suggesting that tailing reduces miRNA activity. However, monouridylation of miR171a by URT1 endows the miRNA the ability to trigger the biogenesis of secondary siRNAs. Therefore, 3’ tailing could affect the activities of miRNAs in addition to leading to miRNA degradation.
30 Apr 01:53

Argonaute 2 binds directly to tRNA genes and promotes gene repression in cis.

by Woolnough JL, Atwood BL, Giles KE

Argonaute 2 binds directly to tRNA genes and promotes gene repression in cis.

Mol Cell Biol. 2015 Apr 27;

Authors: Woolnough JL, Atwood BL, Giles KE

Abstract
To further our understanding of the RNAi machinery within the human nucleus, we analyzed the chromatin and RNA binding of AGO2 within human cancer cell lines. Our data indicated that AGO2 binds directly to nascent tRNA and 5S rRNA, and to the genomic loci from which these RNAs are transcribed, in a siRNA- and DICER-independent manner. AGO2 chromatin binding was not observed at non-TFIIIC-dependent POL III genes or at extra-TFIIIC (ETC) sites, indicating that the interaction is specific for TFIIIC-dependent POL III genes. A genome-wide analysis indicated that loss of AGO2 caused a global increase in the FPKM among genes that flank AGO2-bound tRNA genes. This effect was shown to be distinct from that of the disruption of DICER, DROSHA, or CTCF. We propose that AGO2 binding to tRNA genes has a novel and important regulatory role in human cells.

PMID: 25918241 [PubMed - as supplied by publisher]

29 Apr 02:19

Identification of a large set of rare complete human knockouts

by Patrick Sulem

Nature Genetics 47, 448 (2015). doi:10.1038/ng.3243

Authors: Patrick Sulem, Hannes Helgason, Asmundur Oddson, Hreinn Stefansson, Sigurjon A Gudjonsson, Florian Zink, Eirikur Hjartarson, Gunnar Th Sigurdsson, Adalbjorg Jonasdottir, Aslaug Jonasdottir, Asgeir Sigurdsson, Olafur Th Magnusson, Augustine Kong, Agnar Helgason, Hilma Holm, Unnur Thorsteinsdottir, Gisli Masson, Daniel F Gudbjartsson & Kari Stefansson

Loss-of-function mutations cause many mendelian diseases. Here we aimed to create a catalog of autosomal genes that are completely knocked out in humans by rare loss-of-function mutations. We sequenced the whole genomes of 2,636 Icelanders and imputed the sequence variants identified in this set into 101,584 additional chip-genotyped and phased Icelanders. We found a total of 6,795 autosomal loss-of-function SNPs and indels in 4,924 genes. Of the genotyped Icelanders, 7.7% are homozygotes or compound heterozygotes for loss-of-function mutations with a minor allele frequency (MAF) below 2% in 1,171 genes (complete knockouts). Genes that are highly expressed in the brain are less often completely knocked out than other genes. Homozygous loss-of-function offspring of two heterozygous parents occurred less frequently than expected (deficit of 136 per 10,000 transmissions for variants with MAF <2%, 95% confidence interval (CI) = 10–261).

29 Apr 02:19

Exome sequencing links mutations in PARN and RTEL1 with familial pulmonary fibrosis and telomere shortening

by Bridget D Stuart

Nature Genetics 47, 512 (2015). doi:10.1038/ng.3278

Authors: Bridget D Stuart, Jungmin Choi, Samir Zaidi, Chao Xing, Brody Holohan, Rui Chen, Mihwa Choi, Pooja Dharwadkar, Fernando Torres, Carlos E Girod, Jonathan Weissler, John Fitzgerald, Corey Kershaw, Julia Klesney-Tait, Yolanda Mageto, Jerry W Shay, Weizhen Ji, Kaya Bilguvar, Shrikant Mane, Richard P Lifton & Christine Kim Garcia

Idiopathic pulmonary fibrosis (IPF) is an age-related disease featuring progressive lung scarring. To elucidate the molecular basis of IPF, we performed exome sequencing of familial kindreds with pulmonary fibrosis. Gene burden analysis comparing 78 European cases and 2,816 controls implicated PARN, an exoribonuclease with no previous connection to telomere biology or disease, with five new heterozygous damaging mutations in unrelated cases and none in controls (P = 1.3 × 10−8); mutations were shared by all affected relatives (odds in favor of linkage = 4,096:1). RTEL1, an established locus for dyskeratosis congenita, harbored significantly more new damaging and missense variants at conserved residues in cases than in controls (P = 1.6 × 10−6). PARN and RTEL1 mutation carriers had shortened leukocyte telomere lengths, and we observed epigenetic inheritance of short telomeres in family members. Together, these genes explain ∼7% of familial pulmonary fibrosis and strengthen the link between lung fibrosis and telomere dysfunction.

27 Apr 05:33

The ribosome prophecy

by Maria Barna

Nature Reviews Molecular Cell Biology 16, 268 (2015). doi:10.1038/nrm3993

Author: Maria Barna

A prophetic theoretical article, which hypothesized that ribosomes have regulatory roles in controlling gene expression, inspired Maria Barna to rethink the value of purely theoretical scientific publications.

27 Apr 05:19

Mammalian NET-Seq Reveals Genome-wide Nascent Transcription Coupled to RNA Processing.

by Nojima T, Gomes T, Grosso AR, Kimura H, Dye MJ, Dhir S, Carmo-Fonseca M, Proudfoot NJ
Related Articles

Mammalian NET-Seq Reveals Genome-wide Nascent Transcription Coupled to RNA Processing.

Cell. 2015 Apr 23;161(3):526-540

Authors: Nojima T, Gomes T, Grosso AR, Kimura H, Dye MJ, Dhir S, Carmo-Fonseca M, Proudfoot NJ

Abstract
Transcription is a highly dynamic process. Consequently, we have developed native elongating transcript sequencing technology for mammalian chromatin (mNET-seq), which generates single-nucleotide resolution, nascent transcription profiles. Nascent RNA was detected in the active site of RNA polymerase II (Pol II) along with associated RNA processing intermediates. In particular, we detected 5'splice site cleavage by the spliceosome, showing that cleaved upstream exon transcripts are associated with Pol II CTD phosphorylated on the serine 5 position (S5P), which is accumulated over downstream exons. Also, depletion of termination factors substantially reduces Pol II pausing at gene ends, leading to termination defects. Notably, termination factors play an additional promoter role by restricting non-productive RNA synthesis in a Pol II CTD S2P-specific manner. Our results suggest that CTD phosphorylation patterns established for yeast transcription are significantly different in mammals. Taken together, mNET-seq provides dynamic and detailed snapshots of the complex events underlying transcription in mammals.

PMID: 25910207 [PubMed - as supplied by publisher]

24 Apr 02:12

Systematic Profiling of Poly(A)+ Transcripts Modulated by Core 3’ End Processing and Splicing Factors Reveals Regulatory Rules of Alternative Cleavage and Polyadenylation

by Wencheng Li et al.

by Wencheng Li, Bei You, Mainul Hoque, Dinghai Zheng, Wenting Luo, Zhe Ji, Ji Yeon Park, Samuel I. Gunderson, Auinash Kalsotra, James L. Manley, Bin Tian

Alternative cleavage and polyadenylation (APA) results in mRNA isoforms containing different 3’ untranslated regions (3’UTRs) and/or coding sequences. How core cleavage/polyadenylation (C/P) factors regulate APA is not well understood. Using siRNA knockdown coupled with deep sequencing, we found that several C/P factors can play significant roles in 3’UTR-APA. Whereas Pcf11 and Fip1 enhance usage of proximal poly(A) sites (pAs), CFI-25/68, PABPN1 and PABPC1 promote usage of distal pAs. Strong cis element biases were found for pAs regulated by CFI-25/68 or Fip1, and the distance between pAs plays an important role in APA regulation. In addition, intronic pAs are substantially regulated by splicing factors, with U1 mostly inhibiting C/P events in introns near the 5’ end of gene and U2 suppressing those in introns with features for efficient splicing. Furthermore, PABPN1 inhibits expression of transcripts with pAs near the transcription start site (TSS), a property possibly related to its role in RNA degradation. Finally, we found that groups of APA events regulated by C/P factors are also modulated in cell differentiation and development with distinct trends. Together, our results support an APA code where an APA event in a given cellular context is regulated by a number of parameters, including relative location to the TSS, splicing context, distance between competing pAs, surrounding cis elements and concentrations of core C/P factors.
24 Apr 02:10

[Report] Translational tuning optimizes nascent protein folding in cells

by Soo Jung Kim
In cells, biosynthetic machinery coordinates protein synthesis and folding to optimize efficiency and minimize off-pathway outcomes. However, it has been difficult to delineate experimentally the mechanisms responsible. Using fluorescence resonance energy transfer, we studied cotranslational folding of the first nucleotide-binding domain from the cystic fibrosis transmembrane conductance regulator. During synthesis, folding occurred discretely via sequential compaction of N-terminal, α-helical, and α/β-core subdomains. Moreover, the timing of these events was critical; premature α-subdomain folding prevented subsequent core formation. This process was facilitated by modulating intrinsic folding propensity in three distinct ways: delaying α-subdomain compaction, facilitating β-strand intercalation, and optimizing translation kinetics via codon usage. Thus, de novo folding is translationally tuned by an integrated cellular response that shapes the cotranslational folding landscape at critical stages of synthesis. Authors: Soo Jung Kim, Jae Seok Yoon, Hideki Shishido, Zhongying Yang, LeeAnn A. Rooney, Jose M. Barral, William R. Skach
24 Apr 02:09

[Perspective] The delicate dance of translation and folding

by Joseph D. Puglisi
The past decade has seen rapid growth in our knowledge of how proteins are synthesized in cells. This includes the basic step of how transfer RNAs (tRNAs) decode messenger RNAs (mRNAs) with high fidelity and speed, how the ribosome moves along mRNA from codon to codon during translation, and how synthesis of the corresponding polypeptide chain is initiated and terminated at specific points on the mRNA (1). Structures of the ribosomal particles—megadalton RNA-protein assemblies—have provided detailed molecular views of the active sites for mRNA decoding and peptide bond formation, and suggested pathways for movement of ligands, factors, and the ribosomal subunits themselves. Obscured in this rich cache of knowledge is the fate of the protein product. How does it fold during mRNA translation and how might protein folding affect translation itself? These questions are addressed by elegant biophysical and biochemical approaches reported by Goldman et al. (2) and Kim et al. (3) on pages 457 and 444, respectively, of this issue, adding to a growing appreciation of cotranslational protein folding (4–6). Author: Joseph D. Puglisi
23 Apr 02:21

Alternative 3′ UTRs act as scaffolds to regulate membrane protein localization

by Binyamin D. Berkovits

Nature advance online publication 20 April 2015. doi:10.1038/nature14321

Authors: Binyamin D. Berkovits & Christine Mayr

About half of human genes use alternative cleavage and polyadenylation (ApA) to generate messenger RNA transcripts that differ in the length of their 3′ untranslated regions (3′ UTRs) while producing the same protein. Here we show in human cell lines that alternative 3′ UTRs differentially regulate the localization of membrane proteins. The long 3′ UTR of CD47 enables efficient cell surface expression of CD47 protein, whereas the short 3′ UTR primarily localizes CD47 protein to the endoplasmic reticulum. CD47 protein localization occurs post-translationally and independently of RNA localization. In our model of 3′ UTR-dependent protein localization, the long 3′ UTR of CD47 acts as a scaffold to recruit a protein complex containing the RNA-binding protein HuR (also known as ELAVL1) and SET to the site of translation. This facilitates interaction of SET with the newly translated cytoplasmic domains of CD47 and results in subsequent translocation of CD47 to the plasma membrane via activated RAC1 (ref. 5). We also show that CD47 protein has different functions depending on whether it was generated by the short or long 3′ UTR isoforms. Thus, ApA contributes to the functional diversity of the proteome without changing the amino acid sequence. 3′ UTR-dependent protein localization has the potential to be a widespread trafficking mechanism for membrane proteins because HuR binds to thousands of mRNAs, and we show that the long 3′ UTRs of CD44, ITGA1 and TNFRSF13C, which are bound by HuR, increase surface protein expression compared to their corresponding short 3′ UTRs. We propose that during translation the scaffold function of 3′ UTRs facilitates binding of proteins to nascent proteins to direct their transport or function—and this role of 3′ UTRs can be regulated by ApA.

21 Apr 01:33

Computational and analytical challenges in single-cell transcriptomics.

by Stegle O, Teichmann SA, Marioni JC
Related Articles

Computational and analytical challenges in single-cell transcriptomics.

Nat Rev Genet. 2015 Mar;16(3):133-45

Authors: Stegle O, Teichmann SA, Marioni JC

Abstract
The development of high-throughput RNA sequencing (RNA-seq) at the single-cell level has already led to profound new discoveries in biology, ranging from the identification of novel cell types to the study of global patterns of stochastic gene expression. Alongside the technological breakthroughs that have facilitated the large-scale generation of single-cell transcriptomic data, it is important to consider the specific computational and analytical challenges that still have to be overcome. Although some tools for analysing RNA-seq data from bulk cell populations can be readily applied to single-cell RNA-seq data, many new computational strategies are required to fully exploit this data type and to enable a comprehensive yet detailed study of gene expression at the single-cell level.

PMID: 25628217 [PubMed - indexed for MEDLINE]

21 Apr 01:30

Dual 3’Seq using deepSuperSAGE uncovers transcriptomes of interacting Salmonella enterica Typhimurium and human host cells

by Fabian Afonso-Grunz
Background: The interaction of eukaryotic host and prokaryotic pathogen cells is linked to specific changes in the cellular proteome, and consequently to infection-related gene expression patterns of the involved cells. To simultaneously assess the transcriptomes of both organisms during their interaction we developed dual 3’Seq, a tag-based sequencing protocol that allows for exact quantification of differentially expressed transcripts in interacting pro- and eukaryotic cells without prior fixation or physical disruption of the interaction. Results: Human epithelial cells were infected with Salmonella enterica Typhimurium as a model system for invasion of the intestinal epithelium, and the transcriptional response of the infected host cells together with the differential expression of invading and intracellular pathogen cells was determined by dual 3’Seq coupled with the next-generation sequencing-based transcriptome profiling technique deepSuperSAGE (deep Serial Analysis of Gene Expression). Annotation to reference transcriptomes comprising the operon structure of the employed S. enterica Typhimurium strain allowed for in silico separation of the interacting cells including quantification of polycistronic RNAs. Eighty-nine percent of the known loci are found to be transcribed in prokaryotic cells prior or subsequent to infection of the host, while 75% of all protein-coding loci are represented in the polyadenylated transcriptomes of human host cells. Conclusions: Dual 3’Seq was alternatively coupled to MACE (Massive Analysis of cDNA ends) to assess the advantages and drawbacks of a library preparation procedure that allows for sequencing of longer fragments. Additionally, the identified expression patterns of both organisms were validated by qRT-PCR using three independent biological replicates, which confirmed that RELB along with NFKB1 and NFKB2 are involved in the initial immune response of epithelial cells after infection with S. enterica Typhimurium.
20 Apr 09:30

Analysis of RNA decay factor mediated RNA stability contributions on RNA abundance.

by Maekawa S, Imamachi N, Irie T, Tani H, Matsumoto K, Mizutani R, Imamura K, Kakeda M, Yada T, Sugano S, Suzuki Y, Akimitsu N

Analysis of RNA decay factor mediated RNA stability contributions on RNA abundance.

BMC Genomics. 2015;16(1):154

Authors: Maekawa S, Imamachi N, Irie T, Tani H, Matsumoto K, Mizutani R, Imamura K, Kakeda M, Yada T, Sugano S, Suzuki Y, Akimitsu N

Abstract
BACKGROUND: Histone epigenome data determined by chromatin immunoprecipitation sequencing (ChIP-seq) is used in identifying transcript regions and estimating expression levels. However, this estimation does not always correlate with eventual RNA expression levels measured by RNA sequencing (RNA-seq). Part of the inconsistency may arise from the variance in RNA stability, where the transcripts that are more or less abundant than predicted RNA expression from histone epigenome data are inferred to be more or less stable. However, there is little systematic analysis to validate this assumption. Here, we used stability data of whole transcriptome measured by 5'-bromouridine immunoprecipitation chase sequencing (BRIC-seq), which enabled us to determine the half-lives of whole transcripts including lincRNAs, and we integrated BRIC-seq with ChIP-seq to achieve better estimation of the eventual transcript levels and to understand the importance of post-transcriptional regulation that determine the eventual transcript levels.
RESULTS: We identified discrepancies between the RNA abundance estimated by ChIP-seq and measured RNA expression from RNA-seq; for number of genes and estimated that the expression level of 865 genes was controlled at the level of RNA stability in HeLa cells. ENCODE data analysis supported the idea that RNA stability control aids to determine transcript levels in multiple cell types. We identified UPF1, EXOSC5 and STAU1, well-studied RNA degradation factors, as controlling factors for 8% of cases. Computational simulations reasonably explained the changes of eventual mRNA levels attributable to the changes in the rates of mRNA half-lives. In addition, we propose a feedback circuit that includes the regulated degradation of mRNAs encoding transcription factors to maintain the steady state level of RNA abundance. Intriguingly, these regulatory mechanisms were distinct between mRNAs and lincRNAs.
CONCLUSIONS: Integrative analysis of ChIP-seq, RNA-seq and our BRIC-seq showed that transcriptional regulation and RNA degradation are independently regulated. In addition, RNA stability is an important determinant of eventual transcript levels. RNA binding proteins, such as UPF1, STAU1 and EXOSC5 may play active roles in such controls.

PMID: 25879614 [PubMed - as supplied by publisher]

20 Apr 08:35

Ribosome A and P sites revealed by length analysis of ribosome profiling data

by Martens, A. T., Taylor, J., Hilser, V. J.

The high-throughput sequencing of nuclease-protected mRNA fragments bound to ribosomes, a technique known as ribosome profiling, quantifies the relative frequencies with which different regions of transcripts are translated. This technique has revealed novel translation initiation sites with unprecedented scope and has furthered investigations into the connections between codon biases and translation rates. Yet the location of the codon being decoded in ribosome footprints is still unknown, and has been complicated by the recent observation of footprints with non-canonical lengths. Here we show how taking into account the variations in ribosome footprint lengths can reveal the ribosome aminoacyl (A) and peptidyl (P) site locations. These location assignments are in agreement with the proposed mechanisms for various ribosome pauses and further enhance the resolution of the profiling data. We also show that GC-rich motifs at the 5' ends of footprints are found in yeast, calling into question the anti-Shine-Dalgarno effect's role in ribosome pausing.

20 Apr 08:34

Pathway analysis from lists of microRNAs: common pitfalls and alternative strategy

by Godard, P., van Eyll, J.

MicroRNAs (miRNAs) are involved in the regulation of gene expression at a post-transcriptional level. As such, monitoring miRNA expression has been increasingly used to assess their role in regulatory mechanisms of biological processes. In large scale studies, once miRNAs of interest have been identified, the target genes they regulate are often inferred using algorithms or databases. A pathway analysis is then often performed in order to generate hypotheses about the relevant biological functions controlled by the miRNA signature. Here we show that the method widely used in scientific literature to identify these pathways is biased and leads to inaccurate results. In addition to describing the bias and its origin we present an alternative strategy to identify potential biological functions specifically impacted by a miRNA signature. More generally, our study exemplifies the crucial need of relevant negative controls when developing, and using, bioinformatics methods.

20 Apr 01:31

StructureFold: Genome-wide RNA secondary structure mapping and reconstruction in vivo.

by Tang Y, Bouvier E, Kwok CK, Ding Y, Nekrutenko A, Bevilacqua PC, Assmann SM

StructureFold: Genome-wide RNA secondary structure mapping and reconstruction in vivo.

Bioinformatics. 2015 Apr 16;

Authors: Tang Y, Bouvier E, Kwok CK, Ding Y, Nekrutenko A, Bevilacqua PC, Assmann SM

Abstract
MOTIVATION: RNAs fold into complex structures that are integral to the diverse mechanisms underlying RNA regulation of gene expression. Recent development of transcriptome-wide RNA structure profiling through the application of structure-probing enzymes or chemicals combined with high-throughput sequencing has opened a new field that greatly expands the amount of in vitro and in vivo RNA structural information available. The resultant datasets provide the opportunity to investigate RNA structural information on a global scale. However, the analysis of high-throughput RNA structure profiling data requires considerable computational effort and expertise.
RESULTS: We present a new platform, StructureFold, that provides an integrated computational solution designed specifically for large-scale RNA structure mapping and reconstruction across any transcriptome. StructureFold automates the processing and analysis of raw high-throughput RNA structure profiling data, allowing the seamless incorporation of wet-bench structural information from chemical probes and/or ribonucleases to restrain RNA secondary structure prediction via the RNAstructure and ViennaRNA package algorithms. StructureFold performs reads mapping and alignment, normalization and reactivity derivation, and RNA structure prediction in a single user-friendly web interface or via local installation. The variation in transcript abundance and length that prevails in living cells and consequently causes variation in the counts of structure-probing events between transcripts is accounted for. Accordingly, StructureFold is applicable to RNA structural profiling data obtained in vivo as well as to in vitro or in silico datasets. StructureFold is deployed via the Galaxy platform.
AVAILABILITY AND IMPLEMENTATION: StructureFold is freely available as a component of Galaxy available at: http://test.galaxyproject.org CONTACT: Y.T. (yxt148@psu.edu) or S.M.A. (sma3@psu.edu).
SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online, including a test dataset for StructureFold.

PMID: 25886980 [PubMed - as supplied by publisher]

16 Apr 03:37

The Landscape of long noncoding RNA classification.

by St Laurent G, Wahlestedt C, Kapranov P
Related Articles

The Landscape of long noncoding RNA classification.

Trends Genet. 2015 Apr 10;

Authors: St Laurent G, Wahlestedt C, Kapranov P

Abstract
Advances in the depth and quality of transcriptome sequencing have revealed many new classes of long noncoding RNAs (lncRNAs). lncRNA classification has mushroomed to accommodate these new findings, even though the real dimensions and complexity of the noncoding transcriptome remain unknown. Although evidence of functionality of specific lncRNAs continues to accumulate, conflicting, confusing, and overlapping terminology has fostered ambiguity and lack of clarity in the field in general. The lack of fundamental conceptual unambiguous classification framework results in a number of challenges in the annotation and interpretation of noncoding transcriptome data. It also might undermine integration of the new genomic methods and datasets in an effort to unravel the function of lncRNA. Here, we review existing lncRNA classifications, nomenclature, and terminology. Then, we describe the conceptual guidelines that have emerged for their classification and functional annotation based on expanding and more comprehensive use of large systems biology-based datasets.

PMID: 25869999 [PubMed - as supplied by publisher]

16 Apr 03:37

RNA Sequencing and Analysis.

by Kukurba KR, Montgomery SB
Related Articles

RNA Sequencing and Analysis.

Cold Spring Harb Protoc. 2015 Apr 13;

Authors: Kukurba KR, Montgomery SB

Abstract
RNA sequencing (RNA-Seq) uses the capabilities of high-throughput sequencing methods to provide insight into the transcriptome of a cell. Compared to previous Sanger sequencing- and microarray-based methods, RNA-Seq provides far higher coverage and greater resolution of the dynamic nature of the transcriptome. Beyond quantifying gene expression, the data generated by RNA-Seq facilitate the discovery of novel transcripts, identification of alternatively spliced genes, and detection of allele-specific expression. Recent advances in the RNA-Seq workflow, from sample preparation to library construction to data analysis, have enabled researchers to further elucidate the functional complexity of the transcription. In addition to polyadenylated messenger RNA (mRNA) transcripts, RNA-Seq can be applied to investigate different populations of RNA, including total RNA, pre-mRNA, and noncoding RNA, such as microRNA and long ncRNA. This article provides an introduction to RNA-Seq methods, including applications, experimental design, and technical challenges.

PMID: 25870306 [PubMed - as supplied by publisher]

16 Apr 03:36

Improving small RNA-seq by using a synthetic spike-in set for size-range quality control together with a set for data normalization.

by Locati MD, Terpstra I, de Leeuw WC, Kuzak M, Rauwerda H, Ensink WA, van Leeuwen S, Nehrdich U, Spaink HP, Jonker MJ, Breit TM, Dekker RJ
Related Articles

Improving small RNA-seq by using a synthetic spike-in set for size-range quality control together with a set for data normalization.

Nucleic Acids Res. 2015 Apr 13;

Authors: Locati MD, Terpstra I, de Leeuw WC, Kuzak M, Rauwerda H, Ensink WA, van Leeuwen S, Nehrdich U, Spaink HP, Jonker MJ, Breit TM, Dekker RJ

Abstract
There is an increasing interest in complementing RNA-seq experiments with small-RNA (sRNA) expression data to obtain a comprehensive view of a transcriptome. Currently, two main experimental challenges concerning sRNA-seq exist: how to check the size distribution of isolated sRNAs, given the sensitive size-selection steps in the protocol; and how to normalize data between samples, given the low complexity of sRNA types. We here present two separate sets of synthetic RNA spike-ins for monitoring size-selection and for performing data normalization in sRNA-seq. The size-range quality control (SRQC) spike-in set, consisting of 11 oligoribonucleotides (10-70 nucleotides), was tested by intentionally altering the size-selection protocol and verified via several comparative experiments. We demonstrate that the SRQC set is useful to reproducibly track down biases in the size-selection in sRNA-seq. The external reference for data-normalization (ERDN) spike-in set, consisting of 19 oligoribonucleotides, was developed for sample-to-sample normalization in differential-expression analysis of sRNA-seq data. Testing and applying the ERDN set showed that it can reproducibly detect differential expression over a dynamic range of 2(18). Hence, biological variation in sRNA composition and content between samples is preserved while technical variation is effectively minimized. Together, both spike-in sets can significantly improve the technical reproducibility of sRNA-seq.

PMID: 25870415 [PubMed - as supplied by publisher]

16 Apr 01:47

Systematic Dissection of the Sequence Determinants of Gene 3’ End Mediated Expression Control

by Ophir Shalem et al.

by Ophir Shalem, Eilon Sharon, Shai Lubliner, Ifat Regev, Maya Lotan-Pompan, Zohar Yakhini, Eran Segal

The 3’end genomic region encodes a wide range of regulatory process including mRNA stability, 3’ end processing and translation. Here, we systematically investigate the sequence determinants of 3’ end mediated expression control by measuring the effect of 13,000 designed 3’ end sequence variants on constitutive expression levels in yeast. By including a high resolution scanning mutagenesis of more than 200 native 3’ end sequences in this designed set, we found that most mutations had only a mild effect on expression, and that the vast majority (~90%) of strongly effecting mutations localized to a single positive TA-rich element, similar to a previously described 3’ end processing efficiency element, and resulted in up to ten-fold decrease in expression. Measurements of 3’ UTR lengths revealed that these mutations result in mRNAs with aberrantly long 3’UTRs, confirming the role for this element in 3’ end processing. Interestingly, we found that other sequence elements that were previously described in the literature to be part of the polyadenylation signal had a minor effect on expression. We further characterize the sequence specificities of the TA-rich element using additional synthetic 3’ end sequences and show that its activity is sensitive to single base pair mutations and strongly depends on the A/T content of the surrounding sequences. Finally, using a computational model, we show that the strength of this element in native 3’ end sequences can explain some of their measured expression variability (R = 0.41). Together, our results emphasize the importance of efficient 3’ end processing for endogenous protein levels and contribute to an improved understanding of the sequence elements involved in this process.
16 Apr 01:47

A protein interaction map of the LSU processome [Resource/Methodology]

by McCann, K. L., Charette, J. M., Vincent, N. G., Baserga, S. J.

Maturation of the large ribosomal subunit (LSU) in eukaryotes is a complex and highly coordinated process that requires the concerted action of a large, dynamic, ribonucleoprotein complex, the LSU processome. While we know that >80 ribosome biogenesis factors are required throughout the course of LSU assembly, little is known about how these factors interact with each other within the LSU processome. To interrogate its organization and architecture, we took a systems biology approach and performed a semi-high-throughput, array-based, directed yeast two-hybrid assay. Assaying 4800 protein–protein interactions, we identified 232 high-confidence, binary-interacting protein pairs, representing a fourfold increase from current knowledge. The resulting LSU processome interactome map has enhanced our understanding of the organization and function of the biogenesis factors within the LSU processome, revealing both novel and previously identified subcomplexes and hub proteins, including Nop4.

15 Apr 09:58

Sequence Features of Drosha and Dicer Cleavage Sites Affect the Complexity of IsomiRs.

by Starega-Roslan J, Witkos TM, Galka-Marciniak P, Krzyzosiak WJ

Sequence Features of Drosha and Dicer Cleavage Sites Affect the Complexity of IsomiRs.

Int J Mol Sci. 2015;16(4):8110-8127

Authors: Starega-Roslan J, Witkos TM, Galka-Marciniak P, Krzyzosiak WJ

Abstract
The deep-sequencing of small RNAs has revealed that different numbers and proportions of miRNA variants called isomiRs are formed from single miRNA genes and that this effect is attributable mainly to imprecise cleavage by Drosha and Dicer. Factors that influence the degree of cleavage precision of Drosha and Dicer are under investigation, and their identification may improve our understanding of the mechanisms by which cells modulate the regulatory potential of miRNAs. In this study, we focused on the sequences and structural determinants of Drosha and Dicer cleavage sites, which may explain the generation of homogeneous miRNAs (in which a single isomiR strongly predominates) as well as the generation of heterogeneous miRNAs. Using deep-sequencing data for small RNAs, we demonstrate that the generation of homogeneous miRNAs requires more sequence constraints at the cleavage sites than the formation of heterogeneous miRNAs. Additionally, our results indicate that specific Drosha cleavage sites have more sequence determinants in miRNA precursors than specific cleavage sites for Dicer and that secondary structural motifs in the miRNA precursors influence the precision of Dicer cleavage. Together, we present the sequence and structural features of Drosha and Dicer cleavage sites that influence the heterogeneity of the released miRNAs.

PMID: 25867481 [PubMed - as supplied by publisher]

15 Apr 09:57

Global analysis of the RNA-protein interaction and RNA secondary structure landscapes of the Arabidopsis nucleus.

by Gosai SJ, Foley SW, Wang D, Silverman IM, Selamoglu N, Nelson AD, Beilstein MA, Daldal F, Deal RB, Gregory BD
Related Articles

Global analysis of the RNA-protein interaction and RNA secondary structure landscapes of the Arabidopsis nucleus.

Mol Cell. 2015 Jan 22;57(2):376-88

Authors: Gosai SJ, Foley SW, Wang D, Silverman IM, Selamoglu N, Nelson AD, Beilstein MA, Daldal F, Deal RB, Gregory BD

Abstract
Posttranscriptional regulation in eukaryotes requires cis- and trans-acting features and factors including RNA secondary structure and RNA-binding proteins (RBPs). However, a comprehensive view of the structural and RBP interaction landscape of nuclear RNAs has yet to be compiled for any organism. Here, we use our ribonuclease-mediated structure and RBP-binding site mapping approaches to globally profile these features in Arabidopsis seedling nuclei in vivo. We reveal anticorrelated patterns of secondary structure and RBP binding throughout nuclear mRNAs that demarcate sites of alternative splicing and polyadenylation. We also uncover a collection of protein-bound sequence motifs, and identify their structural contexts, co-occurrences in transcripts encoding functionally related proteins, and interactions with putative RBPs. Finally, using these motifs, we find that the chloroplast RBP CP29A also interacts with nuclear mRNAs. In total, we provide a simultaneous view of the RNA secondary structure and RBP interaction landscapes in a eukaryotic nucleus.

PMID: 25557549 [PubMed - indexed for MEDLINE]

15 Apr 09:52

High-throughput spatial mapping of single-cell RNA-seq data to tissue of origin.

by Achim K, Pettit JB, Saraiva LR, Gavriouchkina D, Larsson T, Arendt D, Marioni JC
Related Articles

High-throughput spatial mapping of single-cell RNA-seq data to tissue of origin.

Nat Biotechnol. 2015 Apr 13;

Authors: Achim K, Pettit JB, Saraiva LR, Gavriouchkina D, Larsson T, Arendt D, Marioni JC

Abstract
Understanding cell type identity in a multicellular organism requires the integration of gene expression profiles from individual cells with their spatial location in a particular tissue. Current technologies allow whole-transcriptome sequencing of spatially identified cells but lack the throughput needed to characterize complex tissues. Here we present a high-throughput method to identify the spatial origin of cells assayed by single-cell RNA-sequencing within a tissue of interest. Our approach is based on comparing complete, specificity-weighted mRNA profiles of a cell with positional gene expression profiles derived from a gene expression atlas. We show that this method allocates cells to precise locations in the brain of the marine annelid Platynereis dumerilii with a success rate of 81%. Our method is applicable to any system that has a reference gene expression database of sufficiently high resolution.

PMID: 25867922 [PubMed - as supplied by publisher]