Transfer learning across cancers on DNA copy number variation analysis

Huanan Zhang, Ze Tian, Rui Kuang

Research output: Contribution to journalConference articlepeer-review

5 Scopus citations

Abstract

DNA copy number variations (CNVs) are prevalent in all types of tumors. It is still a challenge to study how CNVs play a role in driving tumorgenic mechanisms that are either universal or specific in different cancer types. To address the problem, we introduce a transfer learning framework to discover common CNVs shared across different tumor types as well as CNVs specific to each tumor type from genome-wide CNV data measured by array CGH and SNP genotyping array. The proposed model, namely Transfer Learning with Fused LASSO (TLFL), detects latent CNV components from multiple CNV datasets of different tumor types to distinguish the CNVs that are common across the datasets and those that are specific in each dataset. Both the common and type-specific CNVs are detected as latent components in matrix factorization coupled with fused LASSO on adjacent CNV probe features. TLFL considers the common latent components underlying the multiple datasets to transfer knowledge across different tumor types. In simulations and experiments on real cancer CNV datasets, TLFL detected better latent components that can be used as features to improve classification of patient samples in each individual dataset compared with the model without the knowledge transfer. In cross-dataset analysis on bladder cancer and cross-domain analysis on breast cancer and ovarian cancer, TLFL also learned latent CNV components that are both predictive of tumor stages and correlate with known cancer genes.

Original languageEnglish (US)
Article number6729635
Pages (from-to)1283-1288
Number of pages6
JournalProceedings - IEEE International Conference on Data Mining, ICDM
DOIs
StatePublished - 2013
Event13th IEEE International Conference on Data Mining, ICDM 2013 - Dallas, TX, United States
Duration: Dec 7 2013Dec 10 2013

Keywords

  • Cancer Genomics
  • DNA Copy Number
  • Fused LASSO Components
  • Transfer Learning

Fingerprint

Dive into the research topics of 'Transfer learning across cancers on DNA copy number variation analysis'. Together they form a unique fingerprint.

Cite this