Causality-guided feature selection

Alzheimer’s Disease Neuroimaging Initiative

Research output: Chapter in Book/Report/Conference proceedingConference contribution

4 Scopus citations

Abstract

Identifying meaningful features that drive a phenomenon (response) of interest in complex systems of interconnected factors is a challenging problem. Causal discovery methods have been previously applied to estimate bounds on causal strengths of factors on a response or to identify meaningful interactions between factors in complex systems, but these approaches have been used only for inferential purposes. In contrast, we posit that interactions between factors with a potential causal association on a given response could be viable candidates not only for hypothesis generation but also for predictive modeling. In this work, we propose a causality-guided feature selection methodology that identifies factors having a potential cause-effect relationship in complex systems, and selects features by clustering them based on their causal strength with respect to the response. To this end, we estimate statistically significant causal effects on the response of factors taking part in potential causal relationships, while addressing associated technical challenges, such as multicollinearity in the data. We validate the proposed methodology for predicting response in five real-world datasets from the domain of climate science and biology. The selected features show predictive skill and consistent performance across different domains.

Original languageEnglish (US)
Title of host publicationAdvanced Data Mining and Applications - 12th International Conference, ADMA 2016, Proceedings
EditorsJinyan Li, Xue Li, Shuliang Wang, Jianxin Li, Quan Z. Sheng
PublisherSpringer
Pages391-405
Number of pages15
ISBN (Print)9783319495859
DOIs
StatePublished - 2016
Event12th International Conference on Advanced Data Mining and Applications, ADMA 2016 - Gold Coast, Australia
Duration: Dec 12 2016Dec 15 2016

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume10086 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Other

Other12th International Conference on Advanced Data Mining and Applications, ADMA 2016
Country/TerritoryAustralia
CityGold Coast
Period12/12/1612/15/16

Bibliographical note

Publisher Copyright:
© Springer International Publishing AG 2016.

Fingerprint

Dive into the research topics of 'Causality-guided feature selection'. Together they form a unique fingerprint.

Cite this