Name: Expressing an Image Stream with a Sequence of Natural Sentences
Start: 2015-12-08T19:00:00-0500
End: 2015-12-08T23:59:00-0500

Back To Schedule

Expressing an Image Stream with a Sequence of Natural Sentences

We propose an approach for generating a sequence of natural sentences for an image stream. Since general users usually take a series of pictures on their special moments, much online visual information exists in the form of image streams, for which it would better take into consideration of the whole set to generate natural language descriptions. While almost all previous studies have dealt with the relation between a single image and a single natural sentence, our work extends both input and output dimension to a sequence of images and a sequence of sentences. To this end, we design a novel architecture called coherent recurrent convolutional network (CRCN), which consists of convolutional networks, bidirectional recurrent networks, and entity-based local coherence model. Our approach directly learns from vast user-generated resource of blog posts as text-image parallel training data. We demonstrate that our approach outperforms other state-of-the-art candidate methods, using both quantitative measures (e.g. BLEU and top-K recall) and user studies via Amazon Mechanical Turk.

Speakers

Gunhee Kim

Cesc Park

Tuesday December 8, 2015 19:00 - 23:59 EST
210 C #4

Posters

NIPS 2015

Gunhee Kim

Cesc Park

Attendees (0)

NIPS 2015

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Gunhee Kim

Cesc Park

Attendees (0)