Abstract
The reliability and reproducibility of gene biomarkers for classification of cancer patients has been challenged due to measurement noise and biological heterogeneity among patients. In this paper, we propose a novel module-based feature selection framework, which integrates biological network information and gene expression data to identify biomarkers not as individual genes but as functional modules. Results from four breast cancer studies demonstrate that the identified module biomarkers • achieve higher classification accuracy in independent validation datasets • are more reproducible than individual gene markers • improve the biological interpretability of results • are enriched in cancer 'disease drivers'.
Original language | English (US) |
---|---|
Pages (from-to) | 284-302 |
Number of pages | 19 |
Journal | International Journal of Data Mining and Bioinformatics |
Volume | 7 |
Issue number | 3 |
DOIs | |
State | Published - 2013 |
Externally published | Yes |
Keywords
- Cancer biomarkers
- Disease classification
- Feature selection
- Systems biology