Name: Learning with a Wasserstein Loss
Start: 2015-12-07T19:00:00-0500
End: 2015-12-07T23:59:00-0500

Learning with a Wasserstein Loss

Learning to predict multi-label outputs is challenging, but in many problems there is a natural metric on the outputs that can be used to improve predictions. In this paper we develop a loss function for multi-label learning, based on the Wasserstein distance. The Wasserstein distance provides a natural notion of dissimilarity for probability measures. Although optimizing with respect to the exact Wasserstein distance is costly, recent work has described a regularized approximation that is efficiently computed. We describe an efficient learning algorithm based on this regularization, as well as a novel extension of the Wasserstein distance from probability measures to unnormalized measures. We also describe a statistical learning bound for the loss. The Wasserstein loss can encourage smoothness of the predictions with respect to a chosen metric on the output space. We demonstrate this property on a real-data tag prediction problem, using the Yahoo Flickr Creative Commons dataset, outperforming a baseline that doesn't use the metric.

Speakers

NIPS 2015

Charlie Frogner

Hossein Mobahi

Chiyuan Zhang

NIPS 2015

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Charlie Frogner

Hossein Mobahi

Chiyuan Zhang