About This Item

Ask the MINDS@UW Librarian

Badger: An Entropy-Based Web Search Clustering System with Randomization and Voting

Show full item record

File(s):

Author(s)
Wang, Lidan; Schulze, Chloe Whyte
Publisher
University of Wisconsin-Madison Department of Computer Sciences
Citation
TR1537
Date
2005
Abstract
We have implemented and improved an entropy-based clustering algorithm. In addition to utilizing entropy as a clustering mechanism, our algorithm, Badger, uses randomization and a voting scheme to improve the quality of the resulting clusters. Using parsed web search result snippets, we have tested our algorithm and compared it against EigenCluster, a clustering meta-search engine developed by a research group at MIT. Our algorithm performs comparably to EigenCluster, but with slightly more overhead due to the extra work of the randomization step. We have found entropy to be a valid and interesting measure of document similarity and additionally we find it produces cohesive clusters.
Permanent link
http://digital.library.wisc.edu/1793/60458 
Export
Export to RefWorks 
‚Äč

Part of

Show full item record

Search and browse




About MINDS@UW

Deposit materials

  1. Register to deposit in MINDS@UW
  2. Need deposit privileges? Contact us.
  3. Already registered? Have deposit privileges? Deposit materials.