Tampa: interpretable analysis and visualization of metagenomics-based taxon abundance profiles
Jaqueline Brito0, Varuni Sarwal1, Serghei Mangul2, David Koslicki3
(0) University of Southern California
(1) University of California Los Angeles
(2) university of Southern California
(3) The Pennsylvania State University
Find me on Tues Nov 24th, 1:40-3pm AEDT in Remo, table 137
Abstract
Taxonomic metagenome profiling aims to predict the identity and relative abundances of taxa in a given whole genome shotgun (WGS) metagenomic sample. A recent surge in computational methods that aim to accomplish this, called taxonomic profilers, has motivated community-driven efforts to create standardized benchmarking datasets, standardized taxonomic profile formats, as well as a benchmarking platform to assess tool performance. However, existing tools are either integrated into a single taxonomic profiling method or lack the flexibility and interpretability to analyze and visualize multiple taxonomic profiles. We address this lack of a metagenomic taxonomic profile analysis and visualization platform by proposing a software package Tampa (Taxonomic metagenome profiling evaluation). Tampa allows any user to effectively analyze one or more taxonomic profiles produced by numerous taxonomic profiling methods. Additionally, Tampa can leverage a wide range of formats, including the widely utilized and community developed BIOM and CAMI profiling formats. We demonstrate Tampa’s ability by illuminating the critical biological differences between samples and conditions otherwise missed by commonly utilized metrics. Additionally, we show that Tampa can enable biologists to effectively choose the most appropriate profiling method to use on their real data when a “ground truth” taxonomic profile does not exist; the most frequently encountered roadblock when a biologist asks, “Which tool should I use to analyze my data?” When ground truth taxonomic profiles are available (such as with simulated or mock metagenomic communities), we show how Tampa can augment existing benchmarking platforms such as OPAL. Tampa will be provided in a platform-independent fashion (utilizing Bioconda) and integrated into the Galaxy Toolshed for easy “point and click” analysis for less computationally inclined users. Tampa will allow scientists to quickly contextualize, assess, and extract insight from taxonomic profiles instead of relying primarily on statistical summaries or manual manipulation.
Comments