Precision-time tradeoffs: A paradigm for processing statistical queries on databases

Jaideep Srivastava, Doron Rotem

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

Abstract

Conventional query processing techniques are aimed at queries which access small amounts of data, and require each data item for the answer. In case the database is used for statistical analysis as well as operational purposes, for some types of queries a large part of the database may be required to compute the answer. This may lead to a data access bottleneck, caused by the excessive number of disk accesses needed to get the data into primary memory. An example is computation of statistical parameters, such as count, average, median, and standard deviation, which are useful for statistical analysis of the database. Yet another example that faces this bottleneck is the verification of the truth of a set of predicates (goals), based on the current database state, for the purposes of intelligent decision making. A solution to this problem is to maintain a set of precomputed information about the database in a view or a snapshot. Statistical queries can be processed using the view rather than the real database. A crucial issue is that the precision of the precomputed information in the view deteriorates with time, because of the dynamic nature of the underlying database. Thus the answer provided is approximate, which is acceptable under many circumstances, especially when the error is bounded. The tradeoff is that the processing of queries is made faster at the expense of the precision in the answer. The concept of precision in the context of database queries is formalized, and a data model to incorporate it is developed. Algorithms are designed to maintain materialized views of data to specified degrees of precision.
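
The tradeoff described in the abstract can be illustrated with a minimal sketch. It is not the algorithm from the paper, whose details are not given here. It assumes a single SUM aggregate, an in-memory list standing in for the base table, and a hypothetical class name `ApproximateSumView` with a caller-chosen `tolerance`. The view answers queries from a cached value, tracks an upper bound on how far that value may have drifted, and rescans the data only when the bound exceeds the tolerance.

```python
class ApproximateSumView:
    """Cached SUM over a column, rescanned only when its error bound exceeds `tolerance`.

    Hypothetical illustration; all names here are assumptions, not from the paper.
    """

    def __init__(self, values, tolerance):
        self.values = list(values)   # stand-in for the base table
        self.tolerance = tolerance   # largest absolute error allowed in an answer
        self.refreshes = 0           # number of full scans performed so far
        self._refresh()

    def _refresh(self):
        # Full scan: the expensive step the view is meant to avoid.
        self.cached_sum = sum(self.values)
        self.drift_bound = 0         # upper bound on |true sum - cached sum|
        self.refreshes += 1

    def _record_change(self, magnitude):
        # Each change can move the true sum by at most its magnitude.
        self.drift_bound += magnitude
        if self.drift_bound > self.tolerance:
            self._refresh()

    def insert(self, value):
        self.values.append(value)
        self._record_change(abs(value))

    def update(self, index, new_value):
        magnitude = abs(new_value - self.values[index])
        self.values[index] = new_value
        self._record_change(magnitude)

    def query(self):
        # Answer from the view: an approximate value plus its error bound.
        return self.cached_sum, self.drift_bound


view = ApproximateSumView([10, 20, 30], tolerance=5)
view.insert(3)        # true sum is 63; the view still answers 60 with bound 3
print(view.query())   # (60, 3)
view.update(0, 14)    # bound would reach 7 > 5, so the view rescans
print(view.query())   # (67, 0)
```

A sum could also be kept exact incrementally at low cost; it is used here only because its drift bound is easy to state. The same pattern is more relevant for aggregates such as the median, where exact incremental maintenance is harder.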

Original language: English (US)
Title of host publication: Statistical and Scientific Database Management - 4th International Working Conference, SSDBM, Proceedings
Editors: Per Svensson, Maurizio Rafanelli, John C. Klensin
Publisher: Springer-Verlag
Pages: 226-245
Number of pages: 20
ISBN (Print): 9783540505754
State: Published - Jan 1 1989
Event: 4th International Working Conference on Statistical and Scientific Database Management, SSDBM 1988 - Rome, Italy
Duration: Jun 21 1988 to Jun 23 1988

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 339 LNCS
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Other

Other: 4th International Working Conference on Statistical and Scientific Database Management, SSDBM 1988
Country: Italy
City: Rome
Period: 6/21/88 to 6/23/88
