Low discrepancy sets yield approximate min-wise independent permutation families
Title | Low discrepancy sets yield approximate min-wise independent permutation families |
Publication Type | Journal Articles |
Year of Publication | 2000 |
Authors | Saks M, Srinivasan A, Zhou S, Zuckerman D |
Journal | Information Processing Letters |
Volume | 73 |
Issue | 1–2 |
Pagination | 29 - 32 |
Date Published | 2000/01/31/ |
ISBN Number | 0020-0190 |
Keywords | combinatorial problems, Document filtering, explicit constructions, Information retrieval, Min-wise independent permutations, Pseudorandom permutations |
Abstract | Motivated by a problem of filtering near-duplicate Web documents, Broder, Charikar, Frieze and Mitzenmacher defined the following notion of ϵ-approximate min-wise independent permutation families. A multiset F of permutations of {0,1,…,n−1} is such a family if for all K⫅{0,1,…,n−1} and any x∈K, a permutation π chosen uniformly at random from F satisfies |Pr[min{π(K)}=π(x)]−1|K||≤ϵ|K|. We show connections of such families with low discrepancy sets for geometric rectangles, and give explicit constructions of such families F of size nO(logn) for ϵ=1/nΘ(1), improving upon the previously best-known bound of Indyk. We also present polynomial-size constructions when the min-wise condition is required only for |K|≤2O(log2/3n), with ϵ≥2−O(log2/3n). |
URL | http://www.sciencedirect.com/science/article/pii/S0020019099001635 |
DOI | 10.1016/S0020-0190(99)00163-5 |