1. Im2Text: Describing Images Using 1 Million Captioned Photographs
Vicente Ordonez (presenter), Girish Kulkarni, Tamara L. Berg
Stony Brook University
[Image: an example photo labeled with typical computer vision outputs: sky, trees, water, building, bridge]
Our Goal
An old bridge over dirty green water.
One of the many stone bridges in town that carry the gravel carriage roads.
A stone bridge over a peaceful river.
2. Harness the Web!
SBU Captioned Photo Dataset: 1 million captioned images!
Matching using Global Image Features (GIST + Color)
Smallest house in paris between red (on right) and beige (on left).
Bridge to temple in Hoan Kiem lake.
A walk around the lake near our house with Abby.
Hangzhou bridge in West lake.
The daintree river by boat.
The water is clear enough to see fish swimming around in it.
...
Transfer Caption(s), e.g. "The water is clear enough to see fish swimming around in it."
3. Use High Level Content to Rerank
(Objects, Stuff, People, Scenes, Captions)
The bridge over the lake on Suzhou Street.
Iron bridge over the Duck river.
The Daintree river by boat.
Bridge over Cacapon river.
...
Transfer Caption(s), e.g. "The bridge over the lake on Suzhou Street."
4. Results
Good:
A female Mallard duck in the lake at Luukki Espoo.
Amazing colours in the sky at sunset with the orange of the cloud and the blue of the sky behind.
The boat ended up a kilometre from the water in the middle of the airstrip.
Cat in sink.

Bad:
The cat in the window.
Fresh fruit and vegetables at the market in Port Louis Mauritius.
Editor's Notes
Most computer vision methods identify individual pieces of information, but they do not produce the kind of description you would expect from a human. For this picture, a good computer vision system would identify sky, trees, water, building, perhaps even bridge, but a person would say something like "a stone bridge over a peaceful river". So our goal in this paper is to generate image descriptions, as opposed to the individual pieces of information that computer vision methods usually output.
We approach this task in a data-driven manner by first building a dataset of 1 million images with visually relevant captions. We construct this dataset by collecting an enormous number of captions assigned to images by web users and filtering them so that we keep those most likely to refer to visual content. We then use standard global image descriptors, GIST and TinyImage color, to retrieve similar images from which we can directly transfer captions.
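A minimal sketch of this retrieve-and-transfer baseline, assuming NumPy and a precomputed array of dataset descriptors. The tiny-image color descriptor and plain L2 matching below are illustrative stand-ins, not the paper's exact features or distance; in the paper GIST is combined with the color representation.

```python
import numpy as np

def tiny_color_descriptor(image, size=32):
    """Downsample an HxWx3 uint8 image to size x size and flatten
    (a TinyImage-style color descriptor; GIST would be concatenated)."""
    h, w, _ = image.shape
    ys = np.arange(size) * h // size
    xs = np.arange(size) * w // size
    small = image[ys][:, xs].astype(np.float32) / 255.0
    return small.ravel()

def transfer_captions(query_image, dataset_descriptors, dataset_captions, k=4):
    """Return the captions of the k nearest dataset images under L2 distance.
    dataset_descriptors: (N, D) array of tiny_color_descriptor outputs."""
    q = tiny_color_descriptor(query_image)
    dists = np.linalg.norm(dataset_descriptors - q, axis=1)
    nearest = np.argsort(dists)[:k]
    return [dataset_captions[i] for i in nearest]
```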
Additionally, we incorporate high-level information to rerank the images retrieved by the baseline method: we run object detectors, scene classification, stuff detection, and people and action detection, and we compute text statistics. In this example we have bridge and water detections, which we match against similar detections in the retrieved set of images. As you can see here, we run object detectors on a retrieved image only if a relevant keyword is mentioned in its caption. Text statistics are also relevant: if many images in the retrieved set agree that there is a bridge, those images are rewarded in the final ranking as well. Then, again, we can transfer captions from this reranked set of images.
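A hedged sketch of this reranking step, continuing the code above. The candidate dictionaries, the caption-keyword gate, and the uniform weighting are all assumptions for illustration; the paper's actual scoring function combines the detector, scene, stuff, people, and text terms differently.

```python
def rerank(candidates, query_detections):
    """Rescore retrieved candidates using high-level content.
    candidates: list of dicts with 'caption' (str) and 'detections' (set of labels).
    query_detections: labels detected in the query image, e.g. {'bridge', 'water'}."""
    # Text statistics: how often each query label is mentioned across the
    # retrieved captions; labels the retrieved set agrees on count more.
    mention_freq = {
        label: sum(label in c['caption'].lower() for c in candidates) / len(candidates)
        for label in query_detections
    }

    def score(c):
        s = 0.0
        for label in query_detections:
            # Only count a candidate's detection if its caption mentions the
            # relevant keyword (mirroring the keyword gate described above).
            if label in c['caption'].lower() and label in c['detections']:
                s += 1.0 + mention_freq[label]  # detector match, boosted by consensus
        return s

    return sorted(candidates, key=score, reverse=True)
```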
Finally, here are some good and bad results obtained using our full approach. The first picture says "Amazing colours in the sky at sunset with the orange of the cloud and the blue of the sky behind." The captions are very human-like because they were written by actual humans, and the approach works surprisingly well for some types of images. On the other hand, even with 1 million images we cannot generalize to all possible observable images, and our image matching can also fail, leading to bad results. If you would like to see our quantitative results in more detail, please come to our poster. Thanks.