Latent Dirichlet conditional naive-Bayes models

Arindam Banerjee; Hanhuai Shan

doi:10.1109/ICDM.2007.55

Latent Dirichlet conditional naive-Bayes models

Arindam Banerjee, Hanhuai Shan

Computer Science and Engineering

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

11 Scopus citations

Abstract

In spite of the popularity of probabilistic mixture models for latent structure discovery from data, mixture models do not have a natural mechanism for handling sparsity, where each data point only has a few non-zero observations. In this paper, we introduce conditional naive-Bayes (CNB) models, which generalize naive-Bayes mixture models to naturally handle sparsity by conditioning the model on observed features. Further, we present latent Dirichlet conditional naive-Bayes (LD-CNB) models, which constitute a family of powerful hierarchical Bayesian models for latent structure discovery from sparse data. The proposed family of models are quite general and can work with arbitrary regular exponential family conditional distributions. We present a variational inference based EM algorithm for learning along with special case analyses for Gaussian and discrete distributions. The efficacy of the proposed models are demonstrated by extensive experiments on a wide variety of different datasets.

Original language	English (US)
Title of host publication	Proceedings of the 7th IEEE International Conference on Data Mining, ICDM 2007
Pages	421-426
Number of pages	6
DOIs	https://doi.org/10.1109/ICDM.2007.55
State	Published - 2007
Event	7th IEEE International Conference on Data Mining, ICDM 2007 - Omaha, NE, United States Duration: Oct 28 2007 → Oct 31 2007

Publication series

Name	Proceedings - IEEE International Conference on Data Mining, ICDM
ISSN (Print)	1550-4786

Other

Other	7th IEEE International Conference on Data Mining, ICDM 2007
Country/Territory	United States
City	Omaha, NE
Period	10/28/07 → 10/31/07

Access

10.1109/ICDM.2007.55

OpenUrl availability

Full text

Cite this

Banerjee, A & Shan, H 2007, Latent Dirichlet conditional naive-Bayes models. in Proceedings of the 7th IEEE International Conference on Data Mining, ICDM 2007., 4470267, Proceedings - IEEE International Conference on Data Mining, ICDM, pp. 421-426, 7th IEEE International Conference on Data Mining, ICDM 2007, Omaha, NE, United States, 10/28/07. https://doi.org/10.1109/ICDM.2007.55

@inproceedings{c64b9724896549e998b79d9be9d18570,

title = "Latent Dirichlet conditional naive-Bayes models",

abstract = "In spite of the popularity of probabilistic mixture models for latent structure discovery from data, mixture models do not have a natural mechanism for handling sparsity, where each data point only has a few non-zero observations. In this paper, we introduce conditional naive-Bayes (CNB) models, which generalize naive-Bayes mixture models to naturally handle sparsity by conditioning the model on observed features. Further, we present latent Dirichlet conditional naive-Bayes (LD-CNB) models, which constitute a family of powerful hierarchical Bayesian models for latent structure discovery from sparse data. The proposed family of models are quite general and can work with arbitrary regular exponential family conditional distributions. We present a variational inference based EM algorithm for learning along with special case analyses for Gaussian and discrete distributions. The efficacy of the proposed models are demonstrated by extensive experiments on a wide variety of different datasets.",

author = "Arindam Banerjee and Hanhuai Shan",

year = "2007",

doi = "10.1109/ICDM.2007.55",

language = "English (US)",

isbn = "0769530184",

series = "Proceedings - IEEE International Conference on Data Mining, ICDM",

pages = "421--426",

booktitle = "Proceedings of the 7th IEEE International Conference on Data Mining, ICDM 2007",

note = "7th IEEE International Conference on Data Mining, ICDM 2007 ; Conference date: 28-10-2007 Through 31-10-2007",

}

TY - GEN

T1 - Latent Dirichlet conditional naive-Bayes models

AU - Banerjee, Arindam

AU - Shan, Hanhuai

PY - 2007

Y1 - 2007

N2 - In spite of the popularity of probabilistic mixture models for latent structure discovery from data, mixture models do not have a natural mechanism for handling sparsity, where each data point only has a few non-zero observations. In this paper, we introduce conditional naive-Bayes (CNB) models, which generalize naive-Bayes mixture models to naturally handle sparsity by conditioning the model on observed features. Further, we present latent Dirichlet conditional naive-Bayes (LD-CNB) models, which constitute a family of powerful hierarchical Bayesian models for latent structure discovery from sparse data. The proposed family of models are quite general and can work with arbitrary regular exponential family conditional distributions. We present a variational inference based EM algorithm for learning along with special case analyses for Gaussian and discrete distributions. The efficacy of the proposed models are demonstrated by extensive experiments on a wide variety of different datasets.

AB - In spite of the popularity of probabilistic mixture models for latent structure discovery from data, mixture models do not have a natural mechanism for handling sparsity, where each data point only has a few non-zero observations. In this paper, we introduce conditional naive-Bayes (CNB) models, which generalize naive-Bayes mixture models to naturally handle sparsity by conditioning the model on observed features. Further, we present latent Dirichlet conditional naive-Bayes (LD-CNB) models, which constitute a family of powerful hierarchical Bayesian models for latent structure discovery from sparse data. The proposed family of models are quite general and can work with arbitrary regular exponential family conditional distributions. We present a variational inference based EM algorithm for learning along with special case analyses for Gaussian and discrete distributions. The efficacy of the proposed models are demonstrated by extensive experiments on a wide variety of different datasets.

UR - http://www.scopus.com/inward/record.url?scp=49749117072&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=49749117072&partnerID=8YFLogxK

U2 - 10.1109/ICDM.2007.55

DO - 10.1109/ICDM.2007.55

M3 - Conference contribution

AN - SCOPUS:49749117072

SN - 0769530184

SN - 9780769530185

T3 - Proceedings - IEEE International Conference on Data Mining, ICDM

SP - 421

EP - 426

BT - Proceedings of the 7th IEEE International Conference on Data Mining, ICDM 2007

T2 - 7th IEEE International Conference on Data Mining, ICDM 2007

Y2 - 28 October 2007 through 31 October 2007

ER -

Latent Dirichlet conditional naive-Bayes models

Abstract

Publication series

Other

Access

OpenUrl availability

Other files and links

Fingerprint

Cite this