Deep-learning approach to identifying cancer subtypes using high-dimensional genomic data

Chen, Yang, Goodison, Sun (2020) Deep-learning approach to identifying cancer subtypes using high-dimensional genomic data Bioinformatics (IF: 5.8) 36(5) 1476-1483
Full Text
Full text

Click the PDF icon to view the full text of the paper

Abstract

Cancer subtype classification has the potential to significantly improve disease prognosis and develop individualized patient management. Existing methods are limited by their ability to handle extremely high-dimensional data and by the influence of misleading, irrelevant factors, resulting in ambiguous and overlapping subtypes.To address the above issues, we proposed a novel approach to disentangling and eliminating irrelevant factors by leveraging the power of deep learning. Specifically, we designed a deep-learning framework, referred to as DeepType, that performs joint supervised classification, unsupervised clustering and dimensionality reduction to learn cancer-relevant data representation with cluster structure. We applied DeepType to the METABRIC breast cancer dataset and compared its performance to state-of-the-art methods. DeepType significantly outperformed the existing methods, identifying more robust subtypes while using fewer genes. The new approach provides a framework for the derivation of more accurate and robust molecular cancer subtypes by using increasingly complex, multi-source data.An open-source software package for the proposed method is freely available at http://www.acsu.buffalo.edu/~yijunsun/lab/DeepType.html.Supplementary data are available at Bioinformatics online.© The Author(s) 2019. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Links

http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8215925
http://www.ncbi.nlm.nih.gov/pubmed/31603461
http://dx.doi.org/10.1093/bioinformatics/btz769

Similar articles

Tools