Name: Competitive Distribution Estimation: Why is Good-Turing Good
Start: 2015-12-08T16:30:00-0500
End: 2015-12-08T17:30:00-0500

Back To Schedule

Competitive Distribution Estimation: Why is Good-Turing Good

Estimating distributions over large alphabets is a fundamental machine-learning tenet. Yet no method is known to estimate all distributions well. For example, add-constant estimators are nearly min-max optimal but often perform poorly in practice, and practical estimators such as absolute discounting, Jelinek-Mercer, and Good-Turing are not known to be near optimal for essentially any distribution.We describe the first universally near-optimal probability estimators. For every discrete distribution, they are provably nearly the best in the following two competitive ways. First they estimate every distribution nearly as well as the best estimator designed with prior knowledge of the distribution up to a permutation. Second, they estimate every distribution nearly as well as the best estimator designed with prior knowledge of the exact distribution, but as all natural estimators, restricted to assign the same probability to all symbols appearing the same number of times.Specifically, for distributions over $k$ symbols and $n$ samples, we show that for both comparisons, a simple variant of Good-Turing estimator is always within KL divergence of $(3+o(1))/n^{1/3}$ from the best estimator, and that a more involved estimator is within $\tilde{\mathcal{O}}(\min(k/n,1/\sqrt n))$. Conversely, we show that any estimator must have a KL divergence $\ge\tilde\Omega(\min(k/n,1/ n^{2/3}))$ over the best estimator for the first comparison, and $\ge\tilde\Omega(\min(k/n,1/\sqrt{n}))$ for the second.

Speakers

Ananda Theertha Suresh

Tuesday December 8, 2015 16:30 - 17:30 EST
Room 210 A

Orals

NIPS 2015

Ananda Theertha Suresh

Attendees (0)

NIPS 2015

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Ananda Theertha Suresh

Attendees (0)