An Evolutionary Approach to Network Analysis of Cancer Transcriptomes Reveals Common Indicators of Enhanced Malignancy Across a Range of Solid Tumours

David Goode0, Anna Trigos0, Rick Pearson0, Tony Papenfuss1
(0) Peter MacCallum Cancer Centre
(1) The Walter and Eliza Hall Institute of Medical Research

Find me on Tues Nov 24th, 1:40-3pm AEDT in Remo, table 101

Abstract
Despite diverse origins and significant genomic heterogeneity, all types of cancers display common molecular features, including uncontrolled proliferation, altered metabolism and dedifferentiation.

These features resemble properties of unicellular organisms such as bacteria, yeast and protozoa, suggesting cancer is driven by the breakdown of gene regulatory networks (GRNs) that evolved in the earliest metazoans (multicellular animals) to control basic processes such as cell division and differentiation.

To study this hypothesis, we developed Evolutionary Network Analysis (ENA). Our approach combines phylogenetics with network biology and cancer transcriptome data to investigate how GRNs are disrupted and rewired in cancer and how this relates to the evolutionary origins of the genes involved.

Applying ENA to data from The Cancer Genome Atlas showed strong enrichment of somatic mutations in master regulators that control communication between unicellular and multicellular genes in cancer, resulting in increased expression of unicellular genes across multiple tumour types.

To better understand how disrupted communication between unicellular and multicellular genes drives tumour progression, we identified gene co-expression modules in over 30 type of cancers, and matched normal tissues where available. Using protein sequence conservation, modules were classified as having predominantly unicellular genes, predominantly multicellular genes or a mix of unicellular and multicellular genes (mixed UC-MC modules).

Mixed UC-MC module showed the greatest degree of difference in their topology and expression between tumours and normal tissue. Thus, changes in co-expression of unicellular and multicellular genes are common features of tumours. These mixed UC-MC module were frequently associated with copy-number alterations, particularly amplifications.

Comparing high-grade to low-grade tumours showed progression to higher grades involved further disruption and rewiring of the links between genes in mixed UC-MC modules, suggesting continued alterations to connections between unicellular and multicellular genes enhance tumour malignancy.

To examine the potential prognostic implications of this finding, we built Random Forest classifiers based on the expression of unicellular, multicellular or mixed UC-MC modules and tested their ability to distinguish low-grade from high-grade brain tumours. Classification based on mixed UC-MC modules performed the best for discriminating low-grade glioma tumours from high-grade glioblastoma tumours. Furthermore, survival in the low-grade glioma cohort was linked to activity of selected mixed UC-MC modules.

Evolutionary Network Analysis demonstrates the power and utility of incorporating the evolutionary origins of genes into the study of cancer transcriptomes. This opens the door to development of new classes of evolutionary-informed prognostic gene expression signatures potentially applicable to any type of cancer.