The Science of Word Recognition 430
neile writes "I stumbled across a fascinating paper over at the Microsoft Typography site today that provides a really nice overview of the different theories on how humans read. If you thought we read by recognizing word shapes, think again! With the assistance of fancy eye-tracking cameras researchers have been able to devise several clever experiments to give us new insight into how reading works." We've linked to some of Larson's work previously.
AAAAAARRGGHHH, I'm going blind! (Score:5, Funny)
Re:AAAAAARRGGHHH, I'm going blind! (Score:2, Funny)
For all those wanting this post in an eye-shattering colour, Click here [slashdot.org]
Honest!!! (Score:5, Funny)
Re:Honest!!! (Score:5, Funny)
(in big letters) If you can read this,
(in slightly smaller letters)you obviously must have
(in still smaller letters)very good eyesight.
(in smaller letters)While you're down here, why don't you give me a blow job?
Oh no! (Score:3, Funny)
"If you are reading this then you owe Microsoft royalies"
I'm not sure I buy it. (Score:2, Interesting)
To make it more obvious, stick a tall letter in a word that only has short letters and you'll come away thinking word shape does matter.
(or did he explain it... there were way to many words and way too few glossy pictures in that article for me to co
Re:I'm not sure I buy it. (Score:2, Interesting)
Cheers & God bless
Sam "SammyTheSnake" Penny
Re:I'm not sure I buy it. (Score:5, Interesting)
Re:I'm not sure I buy it. (Score:5, Insightful)
Re:I'm not sure I buy it. (Score:5, Insightful)
I imagine the brain's focus as little perception boxes, scanning up and down the concatenated sentence until enough symbols are aligned to fire a recognition signal... As I read your post above, I find my eyes darting about a little more, actually darting to the center of the "word" once recognition is made.
runonsentencewithlowercase -- here's your letter by letter scan "mode"
runonsentencewithcoloring -- slightly easier to define word boundaries by color
runonSENTENCEwithuppercase -- it's easier to locate the word SENTENCE because we perceive a boundary beween small letters and upper letters.
runo nsente ncewit hbads pacing -- pain in the ass, but we still comprehend
run on sentence with lowercase -- whitespace speeds compehension.
Interesting observation (Score:5, Interesting)
Example:
'uesdnatnrd' wasn't to hard to recognize beacuase 'uesd' and 'tnrd' aren't letter patterns that exist in real words. So the mind works quicker to rearrange the letters to find a real word.
'aulaclty' was much harder because it's almost pronouncable. 'lac' and 'lty' are common patterns from real words, and 'aul' might not be common but it's pronouncable.
Just an observation.
In related news... (Score:3, Funny)
The USSGN (Union of Slashdot Spelling and Grammar Nazis) is expected to stage protests against the new product in the interest of keeping their jobs.
You can take a horse to water... (Score:2)
No biggy (Score:2)
(Actually, I don't know HOW anyone can be content with the misspellings. But then, I don't see how Americanese holds water either.)
Eye movements? (Score:5, Interesting)
Oh they must have been using EyeQ [infmind.com]....
I can read at 44692 words per minute! Thanks for posting that long article for me to read, I needed the exercise.
And thank you EyeQ! Your the greatest!
Really though, they say that the more letters/words mean faster reading times [microsoft.com]. It's true. Think about a book or article you've read. When the words are together on the page it's easier to read because your eyes can jump around letting your brain fill in the blanks.
Ever read something that made sense but you couldn't quote it word for word? It's likely because you read in this same way.
Quotation (Score:5, Funny)
Man, I'm so glad they finally figured this out...
Re:Quotation (Score:2, Insightful)
The question pondered is whether _experienced_ reader reads by, in the first place, recognising the word shape, or by recognising the letters.
P.S. yes I know that psychologists are great for stating the obvious, but not here...
P.P.S. to parent: read the article properly, I'm sure you'll find a nice funny case of stating the obvious.
Re:Quotation (Score:2)
Re:Quotation (Score:3, Interesting)
For example: "sadhoslt nwes for nrdes. Sfutf taht mrttaes". (I think ive got that correct, someone will obviously correct me if not
Your brain didnt need the middle of the word to understand the word
Re:Quotation (Score:3, Interesting)
-Lars
Re:Quotation (Score:3, Informative)
Re:Quotation (Score:3, Insightful)
Very strange because if y_u r____r we d__'t n__d a_l those l_____s.
I love how (Score:5, Insightful)
I would love to see a study comparing how english is read to how chinese is read by native speakers. Very interesting i would gather.
Re:I love how (Score:3, Interesting)
I just searched around on google and these documents come up
Word Order in German [about.com]
Kathol's analysis of German Word Order [let.rug.nl]
Re:I love how (Score:3, Interesting)
This actually happens in English, too, but we've been trained not to think of it that way.
* I'd like to hang up that picture.
* I don't know where to hang that picture up.
* His friends are going to move out at the end of the month.
* Is he going to help move his friends out?
Most of the time when you are "ending a sentence with a preposition," you are actually doing no such thing--yo
Though comes before language (Score:5, Informative)
Research shows that
Re:Thought comes before language (Score:4, Interesting)
Infants of English-speaking parents easily grasp the Korean distinction between a cylinder fitting loosely or tightly into a container. In other words, children come into the world with the ability to describe what's on their young minds in English, Korean, or any other language. But differences in niceties of thought not reflected in a language go unspoken when they get older.
Absolutely. And adults can "relearn" those distinctions, too; I found that as my Japanese studies progressed (started at 19, pretty close to native now) the range of things I was able to think about expanded considerably--so much so that now I sometimes have trouble speaking to people in English because English doesn't have a word for the concept I'm thinking about.
Re:Thought comes before language (Score:3, Interesting)
The grandparent also didn't say "couldn't be expressed", but "has no word". Given enough ve
Re:Thought comes before language (Score:5, Interesting)
Re:Though comes before language (Score:3, Insightful)
For instance, it is clear that many non-verbal animals are able to think, in at least some limited fashion. Larger rodents, for instance, are able to build models of their world and solve simple problems (not limited to learning by trial and error). It is exactly this kind of modelling that concepts like one object being inside another stem from -- spatial reas
Re:I love how (Score:5, Interesting)
Maybe we can learn even more about our way of reading, like: Is it the most efficient?
Is right to left, or left to right the best way to go.
Interesting side note (don't know why I'm bringing this up...) President #20, James A. Garfield could write in both Latin and Greek at the same time?
Re:I love how (Score:2)
Isn't that more a consequence of the fact that most people write with their right hand?
Re:I love how (Score:2)
Re:I love how (Score:2)
Puzzling.... anyway, it's good to know that at least some presidents have some skills
Re:I love how (Score:2)
Read up on this man... very cool.
I learned the first fact, about his writing, from "Incredible But True" a great old book.
Re:I love how (Score:4, Interesting)
I remember reading about an interesting study into this. Apparently, there are a small number of people who have a particular form of brain damage which effectively reverses their perception. These people, if they were originally educated to read/write left to right, would afterwards naturally read/write right to left, or vice versa.
Apparently, once they get used to using their right hand with a style similar to that a left-hander would use (or vice-versa) they can read & write in the opposite direction at roughly the same rate a normal person can in the usual direction. The conclusion: the difference is not noticeable; neither left to right nor right to left is substantially more efficient (or any difference is also negated by the brain damage these people have suffered).
No, I can't cite references. I just came across it about 10 years ago, I don't even remember what I was studying at the time.
Re:I love how (Score:3, Informative)
If you're right-handed, you'll smudge the text with your hand if you write right-to-left.
Re:I love how (Score:5, Insightful)
Also, Chinese is character-based, not letter-based, so the research would be completely different. Kind of like asking someone who's studying jet aircraft to study cars as more people have them.
Re:I love how (Score:2)
Also, Chinese is character-based, not letter-based, so the research would be completely different.
Yes, but could there be a similarity in that reading Chinese involves recognition of strokes the same way that reading in English involves letters? FTA...
Perhaps reading Chinese involves focusing on the strokes
Re:I love how (Score:5, Interesting)
Re:I love how (Score:2)
"The researchers, led by Dr Li-Hai Tan believe that this region is implicated because reading Chinese is a different mental task compared with reading an alphabetic language.
With an alphabetic language, reading is done sequentially - the letters are recognised and broken up into blocks of sound which are then matched to a known meaning.
But with Chinese, the reading is more like parallel processi
Re:I love how (Score:3, Interesting)
no. (Score:2)
no, not really. It seems very reasonable considering that english is most likely the native language of the researchers. Research is hard enough without introducing extra complexity through using a foreign language and then having to find subjects that are fluent in that language.
You can't study everything at the same time. Quit complaining for the sake of complaining... sheeesh.
--
Simon
Reading about how we read (Score:5, Interesting)
Don't let the Microsoft name scare you off - the article makes for a fascinating look (pun intended) into how we read. I wonder, though, if these findings are duplicated with written Oriental languages.
What about other writing systems? (Score:5, Interesting)
Re:What about other writing systems? (Score:2)
Re:What about other writing systems? (Score:2)
all the others then they will certainly outnumber those who use the latin alphabet. Of course, the latin alphabet is probably the widest in use but not necessarily used by the majority.
Re:What about other writing systems? (Score:5, Interesting)
Want my theory? I think the brain uses multiple techniques in parallel, then releases resources from the ones found to be going nowhere. So at any one time you may be trying to read a word letter-by-letter, recognising the word from the Bouma shape, and picking likely words from context. The different techniques will have different successes depending on various factors (clean type vs. messy handwriting, familiar vs unfamiliar words, &c). So my theory is that the brain is trying various methods at the same time, each narrowing down the possibilities, and just goes with whatever produces a result first. As soon as that happens, any half-finished tests in progress are scrapped and their resources deallocated. The eye movements may well have something to do with this
If all this sounds inefficient, you have to remember that human beings are optimised for non-optimum conditions
Re:What about other writing systems? (Score:3, Informative)
Reduced Redudancy (Score:4, Informative)
The final conclusions are similar to what I learned in my college linguistics classes 15 years ago. Language contains a lot of redundancy. The reason is that we often encounter situations of so-called "reduced redundancy". For example, someone might have sloppy handwriting so you can't make out all of the letters. Or you might be talking to someone while they brush their teeth. If language were highly optimized, we wouldn't understand a thing in these situations, but because of redundancy we can usually communicate very effectively.
The same applies to reading. The conclusions of the paper seem trivial to me. Of course, reading exploits "visual" and "contextual" information. How else would be understand a sentence like "The boy ate a ham___er" (with a few letters obscured)?
The fact that the brain's neural net adds up the weighted lexicographic, syntactic, semantic (and even pragmatic) information available to it in order to interpret language should be familiar to anyone who's read Goedel, Escher, Bach. And that was published in 1979...
Re:Reduced Redudancy (Score:2)
No automatic recognition here.
Hamster?
Hammer?
Re:Reduced Redudancy (Score:2)
How else would be understand?
Case in point.
Re:Reduced Redudancy (Score:2)
My point was probably clear but perhaps a better example would have been "hamb___er". Didn't know there were so many hammer/hamster eaters out there...
Re:Reduced Redudancy (Score:5, Interesting)
What a way to prove your point. I kept thinking "hamster", "hammer" and then eventually realised that I didn't spot your miss-spelling of 'we' and that I read right over it and filled in the blank.
Re:Reduced Redudancy (Score:3, Interesting)
What a way to prove your point. I kept thinking "hamster", "hammer" and then eventually realised that I didn't spot your miss-spelling of 'we' and that I read right over it and filled in the blank.
Wow. Not only did I do what you did, but not-even-reading your post, I picked out "ham___er", "hamster", "hammer", and "we", and tried to figure out if you were suggesting that "we" fit in the missing sp
Re:Reduced Redudancy (Score:2, Interesting)
Re:Reduced Redudancy (Score:2)
Re:Reduced Redudancy (Score:3, Informative)
This got slashdotted!? The idea of recognizing words by "word shape" seems so silly to me that I almost feel as if the author is attacking a straw man rather than a widely accepted linguistic theory.
The author is aiming the article at typographers, not linguists and psychologists. It seems that while everyone who does scientific research into the way that we read has known for a long time that the word shape theory is full of crap, the theory persists as a kind of urban myth among typographers. So the
Re:Reduced Redudancy (Score:2)
That would be something the computer tracking would not be able to figure out and was somewhat hinted at in the article.
Extremely inter
Re:Reduced Redudancy (Score:2)
No way, man! Those things are, like, full of cholesterol!
How we read... (Score:2, Interesting)
Hree is an epamxle of jsut taht, it's qitue esay to raed, ins't it? Agulohth it can get plluartraicy hrad wtih the lgnoer wdros.
Re:How we read... (Score:5, Informative)
Aoccdrnig to a rscheearch at Cmabrigde Uinervtisy, it deosn't mttaer in waht oredr the ltteers in a wrod are, the olny iprmoetnt tihng is taht the frist and lsat ltteer be at the rghit pclae. The rset can be a toatl mses and you can sitll raed it wouthit porbelm. Tihs is bcuseae the huamn mnid deos not raed ervey lteter by istlef, but the wrod as a wlohe.
But soon enough there was a counter example:
Anidroccg to crad cniyrrag lcitsiugnis planoissefors at an uemannd, utisreviny in Bsitirh Cibmuloa, and crartnoy to the duoibus cmials of the ueticnd rcraeseh, a slpmie, macinahcel ioisrevnn of ianretnl cretcarahs araepps sneiciffut to csufnoe the eadyrevy oekoolnr.
In the counter example, the letters are not randomly scrabled, the letters are in reverse order, except the first and last letters.
Re:How we read... (Score:2, Funny)
This would be a lot easier to read without that misplaced comma.
Re:How we read... (Score:3, Interesting)
So ... (Score:5, Insightful)
For what i know abaout japanese, they don't use spaces between 'words'. A single kanji represents the whole word and their outline is always more or less square. So the whole bouma theory fails here, as he finds out.
I'm sure they could leard more interesting things in other writing sysmtems
Re:So ... (Score:2)
That would probably be Chinese. Written Japanese seems to be a mix-and-match job involving two native phonetic alpabets (one all spiky and angular, and one with a lot of letters that look like pretzels), one imported phonetic alphabet, and lots of Chinese pictograms for good measure...
Re:So ... (Score:2)
English = character-based
It's like comparing apples and oranges - two completely different ways a written language is interpreted.
Re:So ... (Score:4, Informative)
English = character-based
It's like comparing apples and oranges - two completely different ways a written language is interpreted.
I think they're not quite as different as many people seem to think though.
Most kanji are composed of more primitive components. From observing myself reading Japanese, I've noticed that I make many of the same mistakes in recognition, and use similar tricks in recognizing unknown kanji, as I do when reading english. For instance, I frequently confuse two kanji because they have mostly the same primitive components, but differ in one (often the radical -- even though it's arguably the most important part of a kanji, I find I tend to ignore it when reading!).
In my opinion it's not unreasonable to think of the parts of a kanji as being like letters and the whole thing as being like a word.
I think Chinese is similar (Score:3, Interesting)
First, Chinese characters are often composed of several smaller characters, 500 or so, instead of the 70'ish letters and numerals (including capitals) in English. We say such a character may have a "moon" sub-character on the left, a "white" on the right and so on. The sub-characters can be partial clues to meaning and pronunciation (e.g. a
This was a very interesting paper. (Score:4, Interesting)
Also, more work needs to be done to consider the visual cues outside the focus of attention. It is here that, I believe, shape and form cue the reader, more than letter shapes do, as to the potential content of the text to come. (Exactly how is for the geniuses.)
Focuses on 1 script, 1 language (Score:5, Insightful)
While some of the results here are interesting (but old), the fact that the entire study focuses on exactly 1 script and 1 language basically renders the conclusions worthless (as conclusions about cognition in general... I suppose they still have value as conclusions about English and the Latin script).
What has happened here is:
1 -- Observe people reading a given language/script
2 -- See how they make use of features of that particular language/script, such as tall letters, case, and the occurrence of 'skippable' words such as articles
3 -- Describe the way they use these local features, and call that a theory of reading in general.
I don't really understand how to apply a theory of reading based on word and letter shapes when there are so many people reading text in which:
--There are no letter boundaries, and/or
--There are no word boundaries, and/or
--Letters all have the same form factor
The experiments described would probably generalize very well to arabic and greek scripts, pretty well to cyrillic (no tall/short letters to speak of), badly to devanagari-type scripts, very badly to Chinese and Japanese, and not at all to hieroglyphics (though I agree that there may never have been a reader of hieroglyphics who was fluent by modern standards).
To pretend that these experiments apply to humanity in general rather than the author's own language/script choice is silly. It's an interesting article and I'm glad the research was done but unfortunately a certain failure to 'get' the multilingual nature of humanity, which I don't really expect to find in MS work, is in evidence here.
Re:Focuses on 1 script, 1 language (Score:5, Insightful)
Now someone else can work on a PhD Thesis by taking his work and seeing if it applies in other languages.
Isn't this how science works? You do research, try to make some conclusions, and publish the results. If you wait to publish until you've found the Grand Unified Theory of Everything, then nobody publishes anything and science doesn't advance at all.
I'm not sure that he missed anything. He has started with what he knows and has resources to study.
Re:Focuses on 1 script, 1 language (Score:5, Insightful)
You know what is also silly? To pretend that this was the conclusion, although clearly the paper nowhere stated that it had found the grand unified theory of how people read. Here's a hint: when the paper talks about reading, it is obviously talking about reading English.
Yes, the paper would be even more interesting if it included studies of other scripts, and the failure to acknowledge the existence of other scripts should be criticised. But the rest of your criticism is unfounded.
Please (Score:2, Informative)
Re:Please (Score:2)
or maybe it's both? (Score:5, Interesting)
Obviously it can't all be shape, there are plenty of words with identical shapes and yet these are distinguishable.
But it could certainly be true that we use shape and parallel letter recognition at the same time. Shape narrows the field of possibilities from millions to a small handful, and then parallel recognition chooses one of the options.
Whatever happens, you can be sure it's terribly complicated, extremely robust and very efficient.
Don't shout! (Score:5, Interesting)
Re:Don't shout! (Score:5, Informative)
People can easily be trained to read text in caps as fast as lowercase text - or mirrored text.
What I fail to understand is how randomizing the middle letters of a word doesnt affect reading much. I had hoped he would use that as an example.
Tihs is a emxpale of the efecft.
FTA... (Score:3, Interesting)
I'm surprised this guy is actually working with ClearType. That is just a simple way of making characters appear better by using sub-pixels to increase character resolution. I would think this type of work would be better applied in optical character recognition, maybe even with cursive handwriting.
Microsoft Research Web Site (Score:5, Informative)
Aplogise for the tangent, on the back of this article seemed an apt place to point to the MS research site for those that might not of been aware of it.
Phonemic information mandatory? (Score:3, Insightful)
I can not believe this is in a serious paper. Mandatory? Please. What about people born deaf? Are they all unable to read?
read cache as a prior art (Score:2)
Eye movement studies that I will discuss shortly indicate that there are three zones of visual identification. Readers collect information from all three zones during the span of a fixation. Closest to the fixation point is where word recognition takes place. This zone is usually large enough to capture the word being fixated, and often includes smaller function words directly to the right of the fixated word. The next zone extends a few letters past the word recognition zone, and readers gather
amusing test... (Score:5, Interesting)
Source code? (Score:3, Interesting)
why is MicroSoft research so disconnected? (Score:3, Insightful)
This suggests an interesting contradiction in MS product strategy. MS has a long history of "clone and conquer", e.g. Excel copies off VisiCalc and Lotus 123. Just this week MS cloned Apple iTunes. Yet MS Research is conducting some very interesting basic research. Go figure!
we disagree (Score:3, Insightful)
Reading words silently, I sometimes notice an inner chorus pronouncing the words, with one or two discordant notes, even from poorly organized structure or unparseable punctuation. Deciding how people recognize words must also account for how people's minds are organized. The myth of the "undivided self" gets in the way of understanding not only how complex we are "under the hood", where media is digested, but denies credit to our grand integrator, which juggles these partial selves into one face with which to confront the world. As machine intelligence benefits from multiple simultaneous processing, why should they have all the fun? As we mimic our own minds in computer simulations, why should we have all the fun?
Fantasy and Science Fiction writers take notice! (Score:3, Interesting)
"N'kalogh leapt onto his mighty huyloch and rode across the plains of V'looth'u". Next please.
This paper gives a convincing pyschological model about why this occurs and it is pretty much what I had surmised on my own.
So, from now on, please name all of your aliens Bob, Larry, Bubba, or Charles.
Comment removed (Score:3, Funny)
Whine:aaah!! eyes hurt! (Score:2, Flamebait)
Re:aaah!! eyes hurt! (Score:5, Funny)
re-nerding! ha ha. Best... typo... ever...
Re:aaah!! eyes hurt! (Score:5, Informative)
dunno, firefox / moz has one of my favourite features
tools
great for annoying "web site designers" who can't design for shit
Read it... (Score:3, Insightful)
Article in short... (Score:5, Informative)
Re:Comments (Score:2, Informative)
Well, it does in a way. (Score:4, Interesting)
The FArticle does, in fact, address this, though not directly - it puts forth a theory that all letters in a word are absorbed simultaneously, and the brain re-orders them. This is given as theory #3, admittedly a ways down.
This gets me thinking, though, about the importance of context. If you drew the letters PLEORBM in a Scrabble game, it might take a while to see the word staring at you. But in the context of a (mangled) sentence: "you can sitll raed tish wouthit a pleorbm," it much more easily jumps out. Interesting.
Re:Interesting (Score:2)
Re:REKANYZE! (Score:5, Insightful)
I cdnuolt blveiee taht I cluod aulaclty uesdnatnrd waht I was rdgnieg The phaonmneal pweor of the hmuan mnid Aoccdrnig to a rscheearch at Cmabrigde Uinervtisy, it deosn't mttaer inwaht oredr the ltteers in a wrod are, the olny iprmoatnt tihng is taht the frist and lsat ltteer be in the rghit pclae. The rset can be a taotl mses and you can sitll raed it wouthit a porbelm. Tihs is bcuseae the huamn mnid deos not raed ervey lteter by istlef, but the wrod as a wlohe. Amzanig huh? yaeh and I awlyas thought slpeling was ipmorantt!
Re:REKANYZE! (Score:5, Interesting)
Re:Eye tracker experiments (Score:3, Interesting)
It seems if you don't blink whilst reading it's like trying to eat food in big chunks... At least for some people. Then again it may be the stress of keeping your eyes open distracting you from reading?
My postulate is that the brain takes the blink time to dedicate more resources to processin
Re:Eye tracker experiments (Score:3, Interesting)
As for why people blink at a specific rate, and whether that changes based on level of concentration, that's been studied.
"Studies have measured the blink rate and tearing on computer workers and noted that the blink rate dropped very significantly during work at a computer compared to befo