Stacked inverse probability of censoring weighted bagging: A case study in the InfCareHIV Register

Pablo Gonzalez Ginestet, Ales Kotalik, David M. Vock, Julian Wolfson, Erin E. Gabriel

Research output: Contribution to journalArticlepeer-review

Abstract

We propose an inverse probability of censoring weighted (IPCW) bagging (bootstrap aggregation) pre-processing that enables the application of any machine learning procedure for classification to be used to predict the cause-specific cumulative incidence, properly accounting for right-censored observations and competing risks. We consider the IPCW area under the time-dependent ROC curve (IPCW-AUC) as a performance evaluation metric. We also suggest a procedure to optimally stack predictions from any set of IPCW bagged methods. We illustrate our proposed method in the Swedish InfCareHIV register by predicting individuals for whom treatment will not maintain an undetectable viral load for at least 2 years following initial suppression. The R package stackBagg that implements our proposed method is available on Github.

Original languageEnglish (US)
Pages (from-to)51-65
Number of pages15
JournalJournal of the Royal Statistical Society. Series C: Applied Statistics
Volume70
Issue number1
DOIs
StatePublished - Jan 2021

Bibliographical note

Publisher Copyright:
© 2020 The Authors. Journal of the Royal Statistical Society: Series C (Applied Statistics) published by John Wiley & Sons Ltd

Keywords

  • HIV treatment
  • competing risk
  • cumulative incidence
  • ensemble learning
  • inverse probability of censoring weighting
  • machine learning
  • stacking

Fingerprint Dive into the research topics of 'Stacked inverse probability of censoring weighted bagging: A case study in the InfCareHIV Register'. Together they form a unique fingerprint.

Cite this