Teaching Computers to See with Games 57
An anonymous reader writes "The Pittsburgh Post-Gazette has a story on Peekaboom, a two-player on-line game in which one player tries to get the other player to guess a word associated with an image, by revealing parts of the image one click at a time. From the article, "The process of revealing objects, or highlighting images within the larger context of the photo, is the sort of thing that researchers in computer vision must do to teach computers to see.""
Frist? (Score:2)
Nothing for you to see here. Please move along.
Re:Frist? (Score:2)
Well, I got registered but there is something in some sites that crashes my Mozilla and they have it.
It seems like worthwhile research. We might learn to recognize things by a process as mundane as actually seeing them from different angles but I suspect there are processes of object abstraction at work as well.
Re:Frist? (Score:2, Informative)
Aka? (Score:1)
Pictionary: Boring game!! (Score:2)
Oh please don't... (Score:1)
Think of the possibilities! (Score:3, Funny)
Re:Think of the possibilities! (Score:1, Offtopic)
Re:Think of the possibilities! (Score:3, Funny)
Re:Think of the possibilities! (Score:2)
Great name. (Score:1)
Re:Great name. (Score:1)
And to answer your question, I would have called it "Game." I'm bad at naming things. But if I was forced to name it, even just Peekaboo is better than Peekaboom.
What about Bayesian analysis? (Score:1)
Why not try that with vision? Ditch the spam filter and use high-end bayesian analysis, feed the bayesian learning-program all of the data about different objects in a video game who edges are already defined, usually by colors and texture borders, see what you get.
I'm no expert on this-- can anyone offer ways it could o
Re:What about Bayesian analysis? (Score:3, Informative)
I'm no expert on this-- can anyone offer ways it could or couldn't work?
The human eye works in a similar way. The first layer of optic nerve after the retina recognise dots. The next layers recognise contrast and patterns in the previous layers, i.e. lines, edge recognition, etc. By the time it gets to the brain it's already broken down into basic shapes, at which point there are nerves that have been taught to look for certain combinations of shape and colour are triggered, causing the sensation of re
Re:What about Bayesian analysis? (Score:2)
Re:What about Bayesian analysis? (Score:1, Interesting)
Tips (Score:2, Interesting)
That adds 25 points to your score.
During the bonus round you get points for clicking the same spot as your teammate. Once numbers start appearing, keep clicking right there for maximum points.
Pass if the word looks difficult. Don't hesitate.
Pass if your partner passes, too. He probably has a good reason to.
More Tips (Score:1)
Seconding the use of labels. They're worth 300 points over the course of a full game. If none of them apply, because, for example, the word is an adjective, select "text". It's the least likely to be misleading to your partner.
Common
The real story today... (Score:3, Interesting)
Re:Can we program some sense into the robot? (Score:2)
Things aren't that simple ofcourse. Most likely we'll have to train robots to handle the world in the same way we educate children. But once we have a working brain, we can copy it. But that'll need s
end result... (Score:2, Interesting)
Re:end result... (Score:2)
Re:end result... (Score:2, Interesting)
With Peekaboom, you give them the job of guessing "dog", and the parts of the image that are revealed are likely the parts of the image that contain the dog.
That said, the relationship to CAPTCHAs is still there. Simple image distortion CAPTCHAs don't really hold up, and the
Isn't actually being used (Score:3, Insightful)
Quothal:
The process of revealing objects, or highlighting images within the larger context of the photo, is the sort of thing that researchers in computer vision must do to teach computers to see.
While the ESP Game was designed to generate descriptive labels for photographs and other images, Peekaboom is intended to help teach computers to see.
Re:Isn't actually being used (Score:2)
But do they learn what we think they learn? (Score:3, Interesting)
But when they tried the system on more images, it failed miserably. Further investigation revealed that, by accident, all of the "tank" pictures had been taken on cloudy days and all of the non-tank pictures had been taken on sunny days. The system had learned, and learned beautifully, how to recognize cloudy vs. sunny days.
The point is that the software was good enough to learn to recognize the difference between the two populations of images but that that difference wasn't the one intended by the people working on the system. In the same vein, I'm sure that Peekaboom will learn to distinguish between objects in images but whether it learns the actual object or just some incidental characteristic of that pocture of the object will require a very very good diversity of training pictures to avoid accidental, non-meaningful patterns in the image data.
I do wish them luck. Perhaps Peekaboom could create a distributed version of the training process in which others can both submit and help train on new objects/images. Letting others submit images and train the system would help diversify the training & testing data sets. Because some people will, no doubt, submit porn, I'm sure the system might become quite adept at recognizing the nether regions of the human body.
Re:But do they learn what we think they learn? (Score:1)
[OT] I say it without fearing to be modded down. (Score:1, Offtopic)
No posts at 4+ for a long long time.
I am not gonna read
If you want to vote for it - reply below.
No moderation leads to lack of comments (Score:1, Interesting)
Some posters like giving lots of information and opinion and getting lots of replies in return. They typically have a little background in what they are discussing, or have a very strong opinion on the subject. When they post and get modded up and have lots of replies, they have achieved a personal victo
link to the powerpoint presentation (Score:1, Informative)
the one in google's index now seems to be broken
Somewhat relevant... (Score:1)
A little insight (Score:1)
Whenever one wants to train an algorithm to detect or recognize an object in a data set, one needs both the data set and the ground truth. The data set is usually a large set of images and the ground truth is some semantic information associated with each image, such as the locations of people and cars, or perhaps a representative word or category. The data set is usually ea
I was there. (Score:1)
The premise was that no algorithm existed for computers to be able to find where within a picture a certain object existed. But who is good at doing these things? People. Normally, a group of people would be paid to sort through hundreds of thousands of different images and find where a certain object was. But this was slow, and consumed unnecessary resources (like money). However, the in
Guess the nude celebrity's identity? (Score:1)