Moving beyond RNA sequence: uncovering the functional role of RNA structure

Vincent Corbin0, Phillip Tomezko1, Paromita Gupta1, Margalit Glasgow1, Sitara Persad1, Lachlan McIntosh2, Anthony Papenfuss3, Silvia Rouskin1
(0) Peter MacCallum Cancer Center
(1) Whitehead Institute
(2) WEHI
(3) The Walter and Eliza Hall Institute of Medical Research

Find me on Tues Nov 24th, 1:40-3pm AEDT in Remo, table 125

Abstract
RNA is unique among biomolecules in its possession of both information storing and enzymatic capacities. In addition to the genetic code itself, the conformation of RNA secondary structure can regulate processes as varied as splicing, localisation, translation efficiency and protein binding.

Existing assays based on RNA chemical probing (DMS, SHAPE) have attempted to explore RNA structure in the cell, however to date they have assumed structural homogeneity and derived a single average estimated structure, leading to a potentially false structural interpretation and an inability to untangle the multiple conformations of RNA inside a single cell.

We developed a method capable of detecting alternative RNA structures which form from the same underlying sequence, both in vitro and ex vivo. We applied this to the RNA retrovirus human immunodeficiency virus-1 (HIV-1), and successfully revealed genomic structure heterogeneity with novel functional implications.

HIV-1 has a 10-kb single-stranded RNA genome. It expresses all gene products from the same primary transcript, which undergoes alternative splicing to produce diverse protein products which include structural proteins and regulatory factors. Despite the critical role of alternative splicing, the mechanisms driving splice-site choice are poorly understood, as HIV-1 does not encode any of its own splicing factors.

We used standard DMS-MaPseq to probe the structure of HIV-1 RNA in cells, and developed a novel algorithm called Detection of RNA folding Ensembles using Expectation-Maximization (DREEM), which successfully revealed alternative conformations assumed by the same RNA sequence [1]. DMS-MaPseq highlights nucleotides within unpaired sections of RNA structures by mutating them. DREEM models the mutational profile of the reads using a Mixture of Multivariate Bernoulli distributions, and uses an Expectation-Maximization algorithm to cluster the reads into groups corresponding to distinct RNA conformations. Contrary to previous models which analyzed population averages, our results revealed the widespread heterogeneous nature of HIV-1 RNA structure.

In addition to confirming in vitro characterized alternative structures for HIV-1 exists in cells, we discovered alternative conformations at critical splice sites which influence the ratio of transcript isoforms.

Our simultaneous measurement of splicing and intracellular RNA structure provides the first evidence for the long-standing hypothesis that RNA folding regulates splice site usage, and indicates a major role for RNA conformation heterogeneity in regulating RNA gene expression.

[1] Corbin VDA*, Tomezsko P*, Gupta P et al. Determination of RNA structural diversity and its role in HIV-1 RNA splicing. Nature 582, 438-442 (2020)