About This Item

Ask the MINDS@UW Librarian

End-Biased Samples for Join Cardinality Estimation

Show full item record

File(s):

Author(s)
Estan, Cristian; Naughton, Jeffrey F.
Publisher
University of Wisconsin-Madison Department of Computer Sciences
Date
Mar 15, 2012
Abstract
We present a new technique for using samples to estimate join cardinalities. This technique, which we term ìend-biased samples,î is inspired by recent work in network traffic measurement. It improves on random samples by using coordinated pseudo-random samples and retaining the sampled values in proportion to their frequency. We show that end-biased samples always provide more accurate estimates than random samples with the same sample size. The comparison with histograms is more interesting ó while end-biased histograms are somewhat better than end-biased samples for uncorrelated data sets, end-biased samples dominate by a large margin when the data is correlated. Finally, we compare end-biased samples to the recently proposed ìskimmed sketchesî and show that neither dominates the other, that each has different and compelling strengths and weaknesses. These results suggest that endbiased samples may be a useful addition to the repertoire of techniques used for data summarization.
Permanent link
http://digital.library.wisc.edu/1793/60464 
Export
Export to RefWorks 

Part of

Show full item record

Search and browse




About MINDS@UW

Deposit materials

  1. Register to deposit in MINDS@UW
  2. Need deposit privileges? Contact us.
  3. Already registered? Have deposit privileges? Deposit materials.