Exploring Neural Network models for CNV detection from Exome Data

Simon Sadedin0
(0) Murdoch Childrens Research Institute

Find me on Tues Nov 24th, 1:40-3pm AEDT in Remo, table 43

Abstract
In recent years many methods have been developed to detect copy number variants (CNVs) from exome and targeted sequencing data. These methods typically rely exclusively on read depth to ascertain CNVs as other signals are diluted or missing due to the sparsity of targeted regions in a typical exome capture. Despite some significant success, accurate CNV detection remains challenging, especially for small events that span few exons or target regions of the exome capture. A striking feature of this problem is that humans can often easily recognise CNV false positives through visual inspection of the read depth signal based on the precise shape and local features of the surrounding signal variation. This phenomenon raises the question of whether approaches used in other domains for image classification could be applicable to the problem of CNV detection from exome data. In this presentation, I explore applicability of various neural network models to detect CNVs, including combinations of convolutional neural networks (CNNs), Recurrent Neural Networks (RNNs) and Long-Short-Term-Memory networks (LSTM). Through the use of small scale models and simulated data I will show that there is strong potential for these models to improve on accuracy of existing CNV detection methods.