Reference-free reconstruction and quantification of transcriptomes from Nanopore long-read sequencing
Ivan de la Rubia0, Joel Indi1, Silvia Carbonell-Sala2, Julien Lagarde2, Mar Alba3, Eduardo Eyras4
(0) Pompeu Fabra University
(1) University of Lisboa
(2) Centre for Genomic Regulation
(3) Hospital del Mar Medical Research Institute (IMIM)
(4) AUSTRALIAN NATIONAL UNIVERSITY
Find me on Wed Nov 25th, 1:30-2:50pm AEDT in Remo, table 60
Abstract
Single-molecule long-read sequencing with Nanopore provides an unprecedented opportunity to measure transcriptomes from any sample. However, current analysis methods rely on the comparison with a reference genome or transcriptome, or the use of multiple sequencing technologies, thereby precluding cost-effective studies in species with no genome assembly available, in individuals underrepresented in the existing reference, and for the discovery of disease-specific transcripts not readily identifiable from a reference genome. Methods for DNA assembly cannot be directly transferred to transcriptomes since their consensus sequences lack the required interpretability for genes with multiple transcript isoforms. To address these challenges, we have developed RATTLE (https://github.com/comprna/RATTLE), the first tool to perform reference-free reconstruction and quantification of transcripts from Nanopore long reads. Using simulated data, isoform spike-ins, and sequencing data from tissues and cell lines, we demonstrate that RATTLE accurately determines transcript sequence and abundance, is comparable to reference-based methods, and shows saturation in the number of predicted transcripts with increasing number of input reads.
Comments