Google Research Blog

The latest news from Research at Google

Updating Google Maps with Deep Learning and Street View

Wednesday, May 03, 2017

Posted by Julian Ibarz, Staff Software Engineer, Google Brain Team and Sujoy Banerjee, Product Manager, Ground Truth TeamAttention-based Extraction of Structured Information from Street View ImageryFrench Street Name Signspublicly available

Example of street name from the FSNS dataset correctly transcribed by our system. Up to four views of the same sign are provided.

Optical Character Recognition neural networks to blur faces and license platesreading street numbersStreet View House NumbersIan Goodfellowreleased French Street Name Signs

These are examples of challenging signs that are properly transcribed by our system by selecting or combining understanding across images. The second example is extremely challenging by itself, but the model learned a language model prior that enables it to remove ambiguity and correctly read the street name. Note that in the FSNS dataset, random noise is used in the case where less than four independent views are available of the same physical sign.

Example of text normalization learned from data in Brazil. Here it changes “AV.” into “Avenida” and “Pres.” into “Presidente” which is what we desire.

In this example, the model is not confused from the fact that there is two street names, properly normalizes “Av” into “Avenue” as well as correctly ignores the number “1600”.

our paperLarge Scale Business Discovery from Street View Imagery

The system is correctly able to predict the business name ‘Zelina Pneus’, despite not receiving any data about the true location of the name in the image. Model is not confused by the tire brands that the sign indicates are available at the store.