| |
| | [No title] |
 | | This paper presents a very nice approach to the problem of finding nearest neighbors in high-D, with an application to a current hot topic, web clustering. |
 | | The authors' explanation of the background material - word bagging, tfidf, and document similarity - is great. |
 | | The authors, though, never make it entirely clear how the family of hashing functions they choose, described in the second and third paragraphs of section 4, reasonably captures the relevant information of the high-D space. |
| www.lans.ece.utexas.edu /~krump/review6.html (900 words) |
|