Unsupervised Ensemble Classification with Sequential and Networked Data

Panagiotis A. Traganitis, Georgios B. Giannakis

Research output: Contribution to journal › Article › peer-review

Abstract

Ensemble learning, the machine learning paradigm in which multiple models are combined, has exhibited promising performance in a variety of tasks. The present work focuses on unsupervised ensemble classification. The term unsupervised refers to the ensemble combiner, which has no knowledge of the ground-truth labels that each classifier has been trained on. While most prior works on unsupervised ensemble classification are designed for independent and identically distributed (i.i.d.) data, the present work introduces an unsupervised scheme for learning from ensembles of classifiers in the presence of data dependencies. Two types of data dependencies are considered: sequential data and networked data whose dependencies are captured by a graph. For both, novel moment matching and Expectation-Maximization algorithms are developed. The performance of these algorithms is evaluated on synthetic and real datasets, and the results indicate that knowledge of data dependencies in the meta-learner is beneficial for the unsupervised ensemble classification task.
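
The i.i.d. setting that the abstract contrasts with can be illustrated by a minimal sketch. This is not the algorithm of the present work: it is the classic Dawid-Skene Expectation-Maximization combiner, a well-known example of unsupervised ensemble classification for independent data, which ignores sequential and graph dependencies entirely. All names here (`simulate_votes`, `majority_vote`, `dawid_skene_em`, `accuracy`) and the parameters `iters` and `smooth` are assumptions made for illustration, and the synthetic classifiers are hypothetical.

```python
import math
import random


def simulate_votes(n_items, accuracies, seed=0):
    """Hypothetical helper: binary ground truth plus noisy classifier votes."""
    rng = random.Random(seed)
    truth = [1 if rng.random() < 0.5 else 0 for _ in range(n_items)]
    votes = [[t if rng.random() < a else 1 - t for a in accuracies] for t in truth]
    return truth, votes


def majority_vote(votes, num_classes):
    """Baseline combiner: most frequent label per item (ties go to the lowest class)."""
    labels = []
    for row in votes:
        counts = [0] * num_classes
        for v in row:
            counts[v] += 1
        labels.append(counts.index(max(counts)))
    return labels


def accuracy(pred, truth):
    return sum(p == t for p, t in zip(pred, truth)) / len(truth)


def dawid_skene_em(votes, num_classes, iters=50, smooth=1e-2):
    """Dawid-Skene EM for i.i.d. data; votes[n][m] is classifier m's label for item n.

    Returns (labels, prior, conf), where conf[m][k][v] estimates the probability
    that classifier m outputs v when the true class is k. No ground truth is used.
    """
    n_items, n_clf = len(votes), len(votes[0])
    # Initialize label posteriors from smoothed vote fractions (soft majority vote).
    post = []
    for row in votes:
        counts = [smooth] * num_classes
        for v in row:
            counts[v] += 1.0
        total = sum(counts)
        post.append([c / total for c in counts])

    for _ in range(iters):
        # M-step: class prior and per-classifier confusion matrices.
        prior = [
            (sum(p[k] for p in post) + smooth) / (n_items + num_classes * smooth)
            for k in range(num_classes)
        ]
        conf = [[[smooth] * num_classes for _ in range(num_classes)]
                for _ in range(n_clf)]
        for n, row in enumerate(votes):
            for m, v in enumerate(row):
                for k in range(num_classes):
                    conf[m][k][v] += post[n][k]
        for m in range(n_clf):
            for k in range(num_classes):
                total = sum(conf[m][k])
                conf[m][k] = [c / total for c in conf[m][k]]

        # E-step: posterior over each item's label, computed in log space.
        for n, row in enumerate(votes):
            logp = [
                math.log(prior[k])
                + sum(math.log(conf[m][k][v]) for m, v in enumerate(row))
                for k in range(num_classes)
            ]
            peak = max(logp)
            weights = [math.exp(lp - peak) for lp in logp]
            total = sum(weights)
            post[n] = [w / total for w in weights]

    labels = [p.index(max(p)) for p in post]
    return labels, prior, conf


if __name__ == "__main__":
    truth, votes = simulate_votes(400, [0.85, 0.8, 0.75, 0.7, 0.65])
    em_labels, _, _ = dawid_skene_em(votes, num_classes=2)
    print("majority vote accuracy:", accuracy(majority_vote(votes, 2), truth))
    print("Dawid-Skene EM accuracy:", accuracy(em_labels, truth))
```

The E-step above treats every item independently. That independence assumption is the one the present work relaxes for sequential and networked data.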

Original language: English (US)
Journal: IEEE Transactions on Knowledge and Data Engineering
State: Accepted/In press - 2020

Keywords

  • crowdsourcing
  • Data models
  • dependent data
  • Distributed databases
  • Ensemble learning
  • Inference algorithms
  • sequential classification
  • Signal processing algorithms
  • Tagging
  • Task analysis
  • Tensors
  • unsupervised
