Searching with Images instead of Words 207
johnsee writes "A computer vision researcher by the name of Hartmut Neven is developing ingenious new technology that allows the searching of a database by submitting an image, for example, off a mobile phone camera. Imagine taking a photo of a street corner to find out where you are, or the photo of a city building to see its history"
Is A Picture Worth A Thousand Words? (Score:4, Insightful)
The article also mentioned this thing should start small, like a movie guide, so is it easier to upload a 2K "I,Robot" billboard photo, or just enter "I,Robot" in Google on your cell phone?
As long as human input is still required (i.e. you need to submit something), I don't think this is going to be popular. However, if you have a Oakley that automatically takes photos of what you see and feeds you the location details, that'll be something.
Re:Is A Picture Worth A Thousand Words? (Score:2, Funny)
Re:Is A Picture Worth A Thousand Words? (Score:3, Informative)
Of course, for some reason I think it would be difficult to make Visual Google function that well... the only way I would get results for my Thing would be if someone already knew what it was, and defined it for the search engine.
Re:Is A Picture Worth A Thousand Words? (Score:2)
Of course, it could just return the picture with a big red arrow saying "you are here" (or should it be "U R HERE"?)
Seems to me that just offering a mapping function via cell locators would be more popular.
Re:Is A Picture Worth A Thousand Words? (Score:3, Funny)
Imgseek. (Score:5, Informative)
Imagine you're a photographer. Professional or hobbyist, I don't care. You have made thousands of pictures; they all are on your hard drive.
Imagine you're lazy. (Maybe you don't have to imagine that.) You don't want to describe your photos, you don't want to label them. The only metadata associated to your photos is date and time.
Imagine you're looking for a particular photo. You know where you'va taken it, you know what is on it, you can remember the subject, the color shades, etc. You just can't remember exactly when you took that picture. How do you search for it?
Well, you quickly make a drawing in which you try to (sort of) replicate colors and shapes. And you let your computer search for "similar" graphics.
Such software exists already (for quite some time). There's a beta Free software project (GNU licenced) called imgseek [sourceforge.net]. Current version: 0.8.4. I haven't tried it, I don't know how good it is. But this screenshot [sourceforge.net] looks impressive.
Re:Imgseek. (Score:2)
All imgseek does is compare the wavelets - not even remotely applicable to identifying a 3d scene with a non-predetermined 2d image.
Still, a neat program.
Re:Imgseek. (Score:2)
Re:Is A Picture Worth A Thousand Words? (Score:5, Interesting)
He can claim he's developing whatever he wants, but I'll believe it when I see it. It reminds me too much of how many AI researchers in the 60s were convinced that by the 90s computers would regularly converse with humans and be able to reason like them.
Re:Is A Picture Worth A Thousand Words? (Score:2)
Re:Is A Picture Worth A Thousand Words? (Score:2, Insightful)
Re:Is A Picture Worth A Thousand Words? (Score:2)
Read this post [slashdot.org].
Re:Is A Picture Worth A Thousand Words? (Score:2)
Re:Is A Picture Worth A Thousand Words? (Score:2)
Yeah, but this doesn't have to be any more accurate that say.....Google. And the
Re:Is A Picture Worth A Thousand Words? (Score:2)
My method really only encompasses searching for images with images. You could add text by searching through ALT tags, and processing the text on a page which contains a given image.
A lot of pages with a picture of the eiffel tower would have the word "Paris" on them, for example.
Re:Is A Picture Worth A Thousand Words? (Score:2)
Distinguishing one street corner from the next, regardless of angle, requires full understanding of the 3d geometry underlying t
Re:Is A Picture Worth A Thousand Words? (Score:2)
J.
Re:Is A Picture Worth A Thousand Words? (Score:2)
Re:Is A Picture Worth A Thousand Words? (Score:2)
I meant in comparison to things like faces, which the parent poster was commenting (and I agree) are very hard.
Here's how you do it: Look for slightly curved lines. Examine curvature (due to lens) to infer distance. Convert to 3d model of lines. Look for lines which are parallel. Infer planes.
[I'm a physicist and mathematician by the way. Your terminology may vary.]
Justin.
Re:Is A Picture Worth A Thousand Words? (Score:3, Interesting)
Re:Is A Picture Worth A Thousand Words? (Score:2)
Alternator arm from a Ford or a Chevy? 1/4" difference in length would negate its use on one or the other.
If I know the context (VCR model 110FC, or 2001 Ford Focus), then it would be much easier to just go directly to an illustrated parts breakdown or CAD repository for that model. If I don't know the context,
snap pix (Score:2)
Re:Is A Picture Worth A Thousand Words? (Score:2)
The human mind is already exactly this - a visual google. We have a DB of contextual knowledge that is accessed and triggered through a few sensory inputs - smell, touch, sight etc...
here we could have a robot that could 'see' be able to get contextual information based on what it is seeing.
Say you have it walk into a room - and the room is filled with a bunch of objects, the AI could scan the room, then de
Re:Is A Picture Worth A Thousand Words? (Score:2)
my office is on queen street. what google query will tell you what city i am in?
Better yet... (Score:5, Funny)
Re:Better yet... (Score:2)
Fleck's image recognition (Score:5, Funny)
The possibilities are endless!
Re:Fleck's image recognition (Score:2)
"No technology is sufficiently advanced until it is used for pr0n."
Re:Fleck's image recognition (Score:2, Insightful)
So it's not "identifying a person by a nude picture", it's identifying pictures which contain nude people...
Re:Fleck's image recognition (Score:2)
Why not use GPS Technology? (Score:5, Interesting)
Re:Why not use GPS Technology? (Score:3, Insightful)
Re:Why not use GPS Technology? (Score:4, Funny)
Re:Why not use GPS Technology? (Score:2, Funny)
Re:Why not use GPS Technology? (Score:2)
or the photo of a city building to see its history"
Substitute "person" for "building". (Is that a cameraphone, officer?)
Re:Why not use GPS Technology? (Score:2)
alek
Re:Why not use GPS Technology? (Score:3, Insightful)
The database to work will have to understand what 3D objects are (at least in the specific domain) and have an idea of what features of the object are important (like signs for exam
Re:Why not use GPS Technology? (Score:3, Insightful)
GPS would be useful in some situations (if you want to know about a general area), but for the example of taking a "photo of a city building to see its history", GPS itself would not be sufficient.
GPS can provide a location, but it can't pinpoint what you are looking at. This is the case even with compass data indicating which direction you are pointing your device--what if there are two things in your line of site from that perspective? (Do you want information about the building, or do you want inform
Of course... (Score:2)
Re:Of course... (Score:2)
Man on man (Score:3, Insightful)
Re:Man on man (Score:5, Funny)
Heh. (Score:2)
Stalkers... (Score:3, Insightful)
Re:Stalkers... (Score:3, Insightful)
Straw-man.
Stalkers already use Google. It's a lot easier to stalk someone with text than with pictures. What are the chances your image search would actually turn up anything for your average Jane Q Public?
Re:Stalkers... (Score:3, Insightful)
Re:Stalkers... (Score:2)
Give me your face! (Score:2)
Remember: At some point when exabyte storage is commonplace, this is be all too easy. And I don't think it will be the gov't who will do it. It will be a result of the same kind of marketing genius that makes you use discount cards at the supermarket to get a deal. "Just give us a picture of your face - you'll get a deal..." You watch.
One thing builds upon another - don't believe this sort of thing doesn't have org
Re:Man OH man (Score:2)
Being a geek, locked away at your desk staring at a computer, what exactly are you going to be taking pictures of to search for?!
Re:Man on man (Score:2)
Already been done for porn (Score:2)
Or a photo of... (Score:2, Funny)
or a photo of your wife to see...
oh never mind.
New Query Language (Score:5, Funny)
Scary Results (Score:3, Funny)
difficult for slashdotters to find p0rn (Score:2)
Good idea but.... (Score:2, Insightful)
If I have a mobile phone.. (Score:2)
Seriously it's a good thing but the uses are somewhat limited. What if you don't have a digital image or a photograph to be scanned? How would you translate the image in your minds-eye into something searchable by the PC? (yah, mindlink... I know...)
Re:If I have a mobile phone.. (Score:2)
It's going to be boobs (Score:4, Funny)
1: Take picture of current date's frontside archtecture.
2: Submit to search.
3: Reply: You can do better than that. Try her older sister.
i can't wait for this! (Score:2, Flamebait)
i mean, really. isn't this one of the main roadblocks to having a robot that can operate independently -- object recognition? i mean, if spam bots (the world's most advanced robots) get tripped up trying to read the word 'cat' behind some wavy lines while they're signing up for a hotmail account, you really think that i'll be able to photograph a car in 5 years and get info about it?
Re:i can't wait for this! (Score:3, Insightful)
When your brain 'recognises' what it is looking at, it is doing a lot more than just comparing two images (as in the street-corner example from the article). Your brain simply doesnt operate in terms of bitmaps.
The fact that he is basing his hyper-vaporous product on facial-recognition software should
Um... (Score:2)
Take picture of street corner with camera phone. Connect cameraphone to Internet connection. Upload to wherami.cooldatabases.com. Wait 30 seconds for processing. Get location.
OPTION 2:
Pull out $400 GPS with map software.
OPTION 3:
Read street signs. Check index of road atlas.
Yeah. Option 1 sounds awesome...
And we have a winner... (Score:3, Funny)
Visual Google? (Score:2)
Also, I think the building and street corner thing would work a lot better with GPS than a camera.
The most interesting thing I saw in this article is that he plans to roll out a first version in about a year. Besides that, it's interesting research, but stuff we've already heard about.
I would definitely like to experiment with this sort of system.
It's also possible to search with music. (Score:2)
Even though satellite/digital radio will reduce the market for this kind of thing, because each displays the artist name and track title, there are still plenty of opportunities to get a song stuck in your head that you don't recognize. A surprising number of people find out about music when it's used as the background tune in TV commercials, for example.
umm (Score:2)
iDating (Score:5, Insightful)
click
"Whoa Dude!, she's been on 4 amature Pr0n sites!"
Perhaps a better use... (Score:2)
I can't imagine how well this would work, since orientation fools things pretty easily. But I imagine that if it were available, I'd find a lot of unidentified objects to look up. (A cooking magazine I read has a
Yes, Imagine taking a picture (Score:5, Insightful)
Yes, imagine that.
1: Take picture with ultra-modern all-features camera phone of building while lost in city.
2: Submit to search system.
3: Search system queries phone's built-in GPS for position information.
4: Search system sends back retrieved GPS location.
5: Customer is absolutely blown away and immediately sends back picture of self signing virtual 10-year contract at Early Adopter prices.
6: Profit!
On today's search engines... (Score:2)
Robot potential? (Score:2, Interesting)
Finding places (Score:2)
If I'm completely lost, the only way object recognition would work is if I'm in an area with a lot of recognizable features...like a city...in which case I'd just ask somebody. I doubt taking a picture of a bush when I'm lost in the middle of nowhere will be helpful (see: car navigation system).
Hartmut Neven and hundres of other researchers... (Score:3, Insightful)
Re:Hartmut Neven and hundres of other researchers. (Score:2)
What I want to know? (Score:2)
Totaly Handy (Score:2)
IBM has a related technology (Score:2)
Duh (Score:3, Funny)
Imagine reading that street sign you just took a photo of to find out where you are.
I'm just waiting for object recognition (Score:2)
I show an image of a car, and the computer knows, the make, model.
I show an screenshot of a TV show where they remove the product name/brand from the product... it can ID the product.
Facial recognition is not to bad at this point (though it seems lots of the pioneers are going under). Nobody seems to have successfully applied it to objects.
I think that has much more use... think about it:
1. Indexing and searching images/video
2. Explaining TV to the bli
Re:I'm just waiting for object recognition (Score:2)
This image searching would make computing far more intuitive. In the far future, it may also lead to computers able to learn/recall in the same way as humans, (via visual memory). I believe that when th
Um (Score:2)
oh, something new (!?) (Score:2)
Sorry, please somebody explain me what is so ingeniously novel in this technology. In the digital image compression, pattern recognition and indexing research area answering example-based queries on an image or video database is anything but novel. And that includes color-, histogram-, texture-, object-, depth-, even motion-based indexing and query/retrieval (no secrets here, many conferences every year).
Don't get me wrong, I have nothing again
of course, the first use of this would be... (Score:2)
What about the context? (Score:2)
I take a photograph of a building that is under construction here and submit it to the "search engine". So what might I be wondering?
- what street conrner am I at?
- who owns/is building/will occupy?
- what materials is it made of?
- how is it being constructed?
Same goes for anything. A flower, a person, an object. Without context, search results will be across the board.
Huh, now that you mention it, same problem with Google today. Just do a seach for "building
Oh yeah.... sure.... (Score:2)
All right, poo-pooers... (Score:2)
Maybe it could never work, but in principle, this isn't a bad idea.
News for Nerds, indeed (Score:2)
I was hoping to find out how, exactly, images could be normalized, and the normalize images indexed in a way to avoid O(N) searches.
Differences in perspective seem to me to be an enormous hurdle, not to mention shadows that change from hour to hour. Nothing about these challenges was even mentioned in the article
Looks to me like this guy is trolling patent sharks.
Not really new (Score:2)
could be dangerous (Score:2)
Better suggestions for examples (Score:2)
Lame. Better yet:
1. Identify edible as opposed to poisonous plants when hiking.
2. Identify a part you took off your car during a doomed DIY tune-up session.
3. Identify a part that was lying on the ground after you closed up the computer case that you just *swear* wasn't there when you started.
4. Identify the odd substance in the bowl from the back of the fridge...
5. ID your blind d
And also (Score:2)
Its been done before - Informix had this (Score:2)
One of the big goals of this kind of technology w
product manuals (Score:2)
No more hunting for model numbers (which I've found are not often included somewhere on the actual product).
take THAT intellectual-property! (Score:2)
muwahahahaaa
writing is speech
pictures are speech (this)
code is tshirt is music is speech (decss)
insert numerous other examples (my laziness)
good luck continuing to legislate speech as property based upon the medium, the inevitability is that they're all communication and increasingly interchangable. i'd start looking for another way to designate what can be proprietary and not NOW.
Online demo (Score:2)
It's called Visual Google - You give it a picture, and it returns all the frames in a movie that contain the picture. The clever bit is that it uses a kind of text-retrieval inspired method to do it, so the processing time is essentially zero (just an inverted index lookup, with "visual words")
Work for Google (Score:2)
Princeton has something similar (sorta) (Score:2)
IconSurf (Score:2)
Wavelet transforms (Score:2)
Re:The possibilities! (Score:2)