Generalizing the notion of confidence

Research output: Contribution to journalArticlepeer-review

21 Scopus citations

Abstract

In this paper, we explore extending association analysis to non-traditional types of patterns and non-binary data by generalizing the notion of confidence. We begin by describing a general framework that measures the strength of the connection between two association patterns by the extent to which the strength of one association pattern provides information about the strength of another. Although this framework can serve as the basis for designing or analyzing measures of association, the focus in this paper is to use the framework as the basis for extending the traditional concept of confidence to error-tolerant itemsets (ETIs) and continuous data. To that end, we provide two examples. First, we (1) describe an approach to defining confidence for ETIs that preserves the interpretation of confidence as an estimate of a conditional probability, and (2) show how association rules based on ETIs can have better coverage (at an equivalent confidence level) than rules based on traditional itemsets. Next, we derive a confidence measure for continuous data that agrees with the standard confidence measure when applied to binary transaction data. Further analysis of this result exposes some of the important issues involved in constructing a confidence measure for continuous data.

Original languageEnglish (US)
Pages (from-to)279-299
Number of pages21
JournalKnowledge and Information Systems
Volume12
Issue number3
DOIs
StatePublished - Aug 2007

Keywords

  • Association rules
  • Confidence
  • Data mining
  • Error-tolerant itemsets
  • Support

Fingerprint

Dive into the research topics of 'Generalizing the notion of confidence'. Together they form a unique fingerprint.

Cite this