TY - GEN
T1 - Partially-independent component analysis for tissue heterogeneity correction in microarray gene expression analysis
AU - Wang, Yue
AU - Zhang, Junying
AU - Khan, Javed
AU - Clarke, Robert
AU - Gu, Zhiping
N1 - Publisher Copyright:
© 2003 IEEE.
Copyright:
Copyright 2015 Elsevier B.V., All rights reserved.
PY - 2003
Y1 - 2003
N2 - Gene microarray technologies provide powerful tools for the large scale analysis of gene expression in cancer research. Clinical applications often aim to facilitate a molecular classification of cancers based on discriminatory genes associated with different clinical stages or outcomes. However, gene expression profiles often represent a composite of more than one distinct source due to tissue heterogeneity, and could result in extracting signatures reflecting the proportion of stromal contamination in the sample, rather than underlying tumor biology. We therefore wish to introduce a computational approach which allows for a blind decomposition of gene expression profiles from mixed cell populations. The algorithm is based on a linear latent variable model, whose parameters are estimated using partially-independent component analysis, supported by a subset of differentially-expressed genes. We demonstrate the principle of the approach on the data sets derived from mixed cell lines of small round blue cell tumors. Because accurate source separation can be achieved blindly and numerically, we anticipate that computational correction of tissue heterogeneity will be useful in a wide variety of gene microarray studies.
AB - Gene microarray technologies provide powerful tools for the large scale analysis of gene expression in cancer research. Clinical applications often aim to facilitate a molecular classification of cancers based on discriminatory genes associated with different clinical stages or outcomes. However, gene expression profiles often represent a composite of more than one distinct source due to tissue heterogeneity, and could result in extracting signatures reflecting the proportion of stromal contamination in the sample, rather than underlying tumor biology. We therefore wish to introduce a computational approach which allows for a blind decomposition of gene expression profiles from mixed cell populations. The algorithm is based on a linear latent variable model, whose parameters are estimated using partially-independent component analysis, supported by a subset of differentially-expressed genes. We demonstrate the principle of the approach on the data sets derived from mixed cell lines of small round blue cell tumors. Because accurate source separation can be achieved blindly and numerically, we anticipate that computational correction of tissue heterogeneity will be useful in a wide variety of gene microarray studies.
UR - http://www.scopus.com/inward/record.url?scp=33751110185&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=33751110185&partnerID=8YFLogxK
U2 - 10.1109/NNSP.2003.1318001
DO - 10.1109/NNSP.2003.1318001
M3 - Conference contribution
AN - SCOPUS:33751110185
T3 - Neural Networks for Signal Processing - Proceedings of the IEEE Workshop
SP - 23
EP - 32
BT - 2003 IEEE 13th Workshop on Neural Networks for Signal Processing, NNSP 2003
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 13th IEEE Workshop on Neural Networks for Signal Processing, NNSP 2003
Y2 - 17 September 2003 through 19 September 2003
ER -