Secrets from Google's labs

The company uses multiple approaches to detection: comparison to the reference, analysis of the image itself and comparison against other search results. A 'classifier' gathers all statistics describing an image, such as skin tones, shadows, facial hair and other attributes. These classifiers are fed into what Baluja calls 'visual rank', which determines the accuracy of the search. They decide which images form the basis for what the person normally looks like. That's why, in the Lindsay Lohan search, a cartoon of her might appear much lower in the search results.

Eventually, Google will apply its computer vision research to more than just faces. For example, Baluja explained that they plan to use a visual search system for products. When you type the search term 'Apple iPhone', you might want to see a computer vision search that shows the device in use in the field, the iconic image from Apple or the cartoon images where people make fun of the 'Jesus phone'. Baluja says they are agnostic about the kind of images they are searching, but the main goal is to provide results that they think its customers want, which could in turn raise advertising revenues.

TOPICS