The Evolution of Language 528
TaeKwonDood writes "We all know language has evolved but mathematicians are trying to take how it has changed in the past to predict what it will be like in the future." From the article: "Mathematical analysis of this linguistic evolution reveals that irregular verb conjugations behave in an extremely regular way -- one that can yield predictions and insights into the future stages of a verb's evolutionary trajectory," says Lieberman, a graduate student in applied mathematics in Harvard's School of Engineering and Applied Sciences and in the Harvard-MIT Division of Health Sciences and Technology, and an affiliate of Harvard's Program for Evolutionary Dynamics. "We measured something no one really thought could be measured, and got a striking and beautiful result.""
Of course it's all about the verbs (Score:4, Funny)
Verbs, verbs, verbs, that's all anyone thinks about.
Re:Of course it's all about the verbs (Score:5, Funny)
Re: (Score:2)
Re:Of course it's all about the verbs (Score:4, Funny)
Re:Of course it's all about the verbs (Score:4, Funny)
Re: (Score:3, Funny)
Queue Ocatavio Paz... (Score:2)
Re:Of course it's all about the verbs (Score:5, Interesting)
In fact, Fuck, as in "Fuck you," isn't even properly a verb: English sentences without overt grammatical subjects [rr.com]. To summarize: "Fuck you or I'll take away your teddy bear" is not grammatically correct; neither is "Describe and fuck communism."
And, of course, XKCD has something to say about computational linguists [xkcd.com].
Re: (Score:3, Insightful)
Hmm... But take the phrase "to fuck up". It doesn't have anything to do with being displeased, it just means that someone has just failed, and in a particularly spectacular way at that. So, "fuck communism" could also be interpreted "fuck up communism", or "make communism fail in a spectacular manner". So, the original sentence of "describe an
Some love for prepositions (Score:2)
Quite a few are probably thinking about German group sex according to the words 'an', 'auf', 'hinter', 'in', 'neben', 'über', 'unter', 'vor' and 'zwichen'.
Sorry, o(oooo)ld joke, but it just seemed to fit in.
Re: Of course it's all about the verbs (Score:2)
The Best One Recently (Score:4, Funny)
We get cussed out on a regular basis.
Sometimes the kids get restrained by trained staff and they will say something like, "I can't fucking breathe." This they know is a magic phrase. We had a teacher recently go in and tell a student:
You cannot use a gerund with an intransitive verb. You should say I can't fucking. Or I can't breathe. You cannot use I can't fucking breathe. Make up your mind you are either not fucking or not breathing!
Well this is what happens when english teachers have way to much caffeine.
Re:Of course it's all about the verbs (Score:5, Funny)
Re: (Score:3, Funny)
Bawstan Habah? (Score:2, Interesting)
Re: (Score:2)
OT, this article is pretty cool, but doesn't take into account the evolution of symbolic representation in language, and the r00tshell.com effect. STFU n00b.
HAND =)
Re:Bawstan Habah? (Score:5, Informative)
As a Massachusetts resident, I have no idear what happens to the ahs.
What really cracked me up is the day they decided to rename, "Great Woods Performing Arts Center", to the, "Tweeter Center for the Performing Arts". It's like they tried to purposely add more ahs!
"Hey Boston Guy, where's the concert?"
"It's at the Tweetah Centah for the Performin' Ahts!"
It depends on the speaker. Sometimes its more like Wista. Either way it's usually followed most times by, "Spag's", as in, "If we're going to bother to go to that wretched hive of scum and villainy, Wista, we might as well stop at Spag's".
To the point where sometimes people don't understand the normal pronunciation!
True story:
One day I went to a, "Boston Market", with my coworker for lunch. On this particular day, we were unfortunate enough to be waited upon by a guy with a Southy accent so thick you'd swear he was an extra from, "Good Will Hunting".
In case you're lucky enough to be from another country and have never encountered one of these abominations of cuisine, some explanation is in order. Boston Market is a fast food restaurant that sells mainly rotisserie-cooked poultry dishes with your choice of side. At Boston Market you can get a chicken dish that consists of a leg and thigh, which is called a, "Quarter Dark". This is the item that I was prepared to order.
I am not originally from Massashusetts, and so my pronunciation of these two words are almost identical to anyone in the civilized world (not entirely, or that would be, "civilised world"). I approached the register and ordered:
Me: "I would like a quarter dark, please."
Him: "Excuse me?"
Me (loudly): "A quarter dark, please."
Him: "What?"
Me: "QWAHTAH DAHK!!!"
Him: "Oh, a qwahtah dahk..."
At least, "job", isn't pronounced like, "jaerb".
Yet.
Re: (Score:3, Funny)
Re: (Score:3, Informative)
Hari Seldon... (Score:3, Interesting)
I, for one... (Score:5, Funny)
Easy- a lot of it will go (Score:5, Funny)
Re: (Score:2)
Why would the words fall apart? I guess you already lost "lose"...
Re: (Score:3, Funny)
Johnny said, "Bill went to the store".
whereas the American style is:
Johnny said, "Bill went to the store."
Obviously the former makes more sense because it nests properly: (sentence begins) (quote begins) (quote ends) (sentence ends).
That said
Re: (Score:2)
Re: (Score:3, Insightful)
System.out.println("Hello, world.");
Because in that case it makes perfect sense.
(code begins) (open paren) (String begins) (sentence begins) (sentence ends) (String ends) (close paren) (code ends)
I have no problem with a sentence like:
Bill said, "Go to the store."
Because in that case, it's logical. Well, almost. You could argue that it should read:
Bill said, "Go to the store.".
Because there's really two sentences there (the narrator's sentence as well as Bill's) but actually putting two peri
Re: (Score:3, Interesting)
It may "make sense" but as is common in programming it does not fit the original simple requirement, in other words: where has the quote gone?
Re:Programming does that to you (Score:4, Funny)
Because there's really two sentences there (the narrator's sentence as well as Bill's) but actually putting two periods is redundant and I have no problem with the internal period in that case.
I wouldn't say it's redundant, since as you said, there are two sentences. However, language often sacrifices logical consistency for fluency and clarity. Having lots of punctuation marks is typographically ugly, and distracts from fluent reading. Frankly, .". looks like an anime character.
Re: (Score:3, Insightful)
Oh wait, no I don't, it's a useless extra comma that isn't necessary.
Re:Easy- a lot of it will go (Score:5, Funny)
"Why?" asks the confused, surviving waiter amidst the carnage, as the panda makes towards the exit. The panda produces a badly punctuated wildlife manual and tosses it over his shoulder.
"Well, I'm a panda," he says at the door. "Look it up."
The waiter turns to the relevant entry in the manual and, sure enough, finds an explanation. "Panda. Large black-and-white bear-like mammal, native to China. Eats, shoots and leaves."
Re:Easy- a lot of it will go (Score:4, Informative)
It's taken from a book written by Lynne Truss published in the UK roughly 3 years ago.
Amazon Link [amazon.com]
Re: (Score:3)
As suggested by Mark Twain (Score:5, Funny)
Generally, then, the improvement would kontinue iear bai iear with iear 5 doing awai with useless double konsonants, and iears 6-12 or so modifaiing vowlz and the rimeiniing voist and unvoist konsonants. Bai iear 15 or sou, it wud fainali bi posibl tu meik ius ov thi ridandant letez "c", "y" and "x" bai now jast a memori in the maindz ov ould doderez tu riplais "ch", "sh", and "th" rispektivili.
Fainali, xen, aafte sam 20 iers ov orxogrefkl riform, wi wud hev alojikl, kohirnt speling in ius xrewawt xe Ingliy-spiking werld.
Re: (Score:3, Insightful)
Re:As suggested by Mark Twain (Score:5, Informative)
8th century - Beowulf, which is unreadable for modern English speakers.
1066 - Norman conquest - Old French would have a massive influence on English. Introduction of lots of Latin roots into English.
14th century - Chaucer, somewhat readable for modern English speakers with modernized spellings.
16th century - Shakespeare, more or less readable for modern English speakers without much editing.
Pronunciation of course also changed drastically, and this was reflected in orthography as well.
Re: (Score:2)
Re: (Score:3, Informative)
No it wasn't [wikipedia.org].
Again, this is a bit nonsensical. Do you really think the complexity of letterforms caused printers to modify their shapes? If so, how do you account for "a" or "g" or--even worse--the ampersand?
Re:As suggested by Mark Twain (Score:5, Interesting)
The printing press was a major incentive to standardise spelling, but also let to one of the few problems translating/transcribing Shakespeare.
Early fonts put a curl to the left on the bottom of the lower case "f" making it look a bit like a letter "s". Because s is much more common than "f", early printers would run out of esses before effs and would substitute an eff for an ess when neceffary.
My dad has a reproduction of early prints of Shakspeare's plays and the Midsummer Nights Dream song "Where the bees suck, there suck I" is on one such page. This caused a bit of a stir backstage and had to be explained, apparently.
Re: (Score:3, Interesting)
Re: (Score:3, Interesting)
I specifically referred to a page on a reproduction of a genuine contempory (to ol' Bill) print of a Shakespearian play. The word "suck" doesn't and didn't have a long s. On the page it is definitely spealt with "f" and there is a foreword which explains the frequent use of "f" instead of "s".
Re: (Score:3, Interesting)
English is actually a good example why the mathematical approach is inappropriate. Your step between Beowulf and Chaucer is the crucial. In this period the linguistic situation in Britain became rather complex, while the vast majority of people continued to speak Anglo-Saxon (a West Germanic language of the Anglo-Frisian branch), the Norman nobility spoke Anglo-Norman, while the clergy used Latin. (Not to forget the different celtic tongues used by the people in Cornwall, Wales and Scotland.) All this produ
Re: (Score:3, Insightful)
Re: (Score:3, Informative)
I'm going to have to disagree with you here. While later French did have an impact on English as a result of fashion, Anglo-Normand massively influenced Old English. From Histoire de la Langue: du Latin à l'Ancien Français, Peter
Re: (Score:2)
Re: (Score:2, Interesting)
Just look at them damned Chinese characters and the reform they underwent last century -- compare the characters used in Taiwan or Hong Kong, those in Japan (that were adopted after the Chinese simplified them once) and those that are used in China now (which were simplified gradually even more). The more them characters evolve, the more they look the same.
Pro
Re: (Score:2)
http://en.wikipedia.org/wiki/Katakana [wikipedia.org]
Re: (Score:2)
The whole point is that the less commonly used irregular verbs are more likely to become regularized. It might be different with spelling, but I doubt we'll see much of it in print given the way we have standardized spelling over the past hundred or two hundred years.
Re:As suggested by Mark Twain (Score:5, Informative)
Re:As suggested by Mark Twain (Score:5, Informative)
The verb is indeed irregular in many languages, but nonetheless completely regular in others. One of the problems people have in deciding whether a feature of language is universal is the very small subset of languages they've been exposed to.
Most of the languages you can name off hand are all part of the Indoeuropean family of languages, which has a very large number of speakers, but does not constitute a large number of languages. Thus a lot of features common to Indo-european languages are taken to be linguistic universals when really they are common only to a very small subset of human languages.
Re:As suggested by Mark Twain (Score:5, Informative)
Compare I bike, you bike, we all bike.... the distinction by person is useless the way it is in english, I wonder if it'll disappear completely outside of "to be". (for other words you have the "he bikes" thing)
Thing is, this actually -did- make sence at some point (or atleast it served a purpose) in many languages that universally have different forms for different persons, you can remove the personal pronoun, since it's clear from the verb alone which person is meant.
"I am a boy" is superfluous; "am boy" conveys the same meaning since "am" can only be used for "I". Works that way in finnish, for example:
"puhun Suomi" (I speak finnish) "puhut Soumi" (you speak finnish), with enough grammar you can do away with many small words, and you can make the sequence of words more freely choosable. In english you make questions by reordering words. "you can have a cookie." "can I have a cookie?", with grammar that can also be done away with; In finnish you use -ko to symbolise question, so no need to reorder words (or add "do you" or similar antics)
"puhutko Saksa?" ("Do you speak German?")
In general though, it seems that the trend is that -less- grammar and -more small-word and word-sequence is used. English sure is losing grammar at a noticeable rate, same for Norwegian and German.
Re: (Score:2)
OH NOES NOT GERMAN...
Re: (Score:2)
Re:As suggested by Mark Twain (Score:5, Informative)
Actually, apparently this is widely misattrbuted [alt-usage-english.org] to Mark Twain; it's actually from a letter by a guy named M. J. Shields.
Re: (Score:2)
Re: (Score:3, Interesting)
Re: (Score:2)
Can you give an example where globally replacing "y" with "i" would be ambiguous?
Werd Up (Score:5, Funny)
Re: (Score:2)
Efforts to stamp out irregularity (Score:2, Interesting)
(Zonk has, of course, given up hope on regularizing "to be".)
this isn't really news (Score:5, Informative)
This isn't really news. We linguists have known this for a long time, as the article mentions, and we've known why: a child learning a language tends to regularize irregular forms. If he or she then hears the irregular form enough, the child reverts to the irregular form. This is why you'll hear children learning English go through a stage in which their knowledge of verb forms is skimpy but they have irregular forms like "brought", because they are memorizing individual forms, then through a stage in which they produce incorrect but regular forms, which they could not have learned from adults, like "bringed", because they have learned the rule, and through a third stage in which they learn the exceptions to the rule and the irregular forms like "brought" return. Irregular forms will only be learnable if they are sufficiently frequent. The only novelty of this research is the computational ability to carry out an accurate simulation.
As for predicting the future of the language, that's silly. There is a lot more to language change than what happens to irregular verbs.
Re:this isn't really news (Score:4, Interesting)
So are these usages converging the same way as verb irregularity?
Re:this isn't really news (Score:4, Informative)
One can also expand their English vocabulary by working with Indians. Took me a while to figure out WTF 'prepone' meant. As in (say with your best Apu imitation), "We need to prepone the meeting an hour or so." Prepone being the opposite of postpone.
Re: (Score:2)
We've been preponing meetings this way since long back.
Re: (Score:3, Interesting)
Re: this isn't really news (Score:5, Informative)
Another example is that Modern English has a "weird" class of adjectives beginning in 'a' that don't be have like other adjectives: asleep awry alive, etc. -- there's a pile of them. I talked to a professor of linguistics, who had published a fairly well known textbook on syntax, and he seemed genuinely puzzled by them. But a basic familiarity with language change reveals that they are actually fossilized prepositional phrases. Cf. the line in A Clockwork Orange, "While you are on life" = "While you are alive". So what looks like an unmotivated class of irregular adjectives is actually just the evolutionary reflex of a very normal, regular syntactic structure.
To add to the confusion, we're now getting a similar class of irregular adverbs with the derivation from the article 'a' rather than an old preposition, "alot", "awhile", etc., which while denegrated as ignorant spelling are actually a clue to the writer's understanding of the language. In a hundred years (or is that "ahundred"?), people without knowledge of English's history will think we have a class of irregular adjectives *and* adverbs, blissfully unaware that they are just evolved forms of very regular structures.
Oh, and the properties of Chinese have nothing to do with writing or a lack thereof.
Re: (Score:2)
I think you need to reread my comment, which you haven't understood. Yes, it has long been known that only common verbs remain irregular. The description I gave of the stages in child language acquisition was part of the explanation of why this is the case. So, as I said, the basic observation is not news. The virtue of the study is in providing a more precise quantitative model.
Psychohistory? (Score:4, Interesting)
Brief history of psychohistory for those who haven't read The Foundation Trilogy by Asimov:
Psychohistory is the name of a fictional science, which combined history, sociology, and mathematical statistics, in Isaac Asimov's Foundation universe, to create a (nearly) exact science of the actions of very large groups of people, such as the Galactic Empire.
From Wikipedia, obviously:
http://en.wikipedia.org/wiki/Psychohistory/ [wikipedia.org]
Re: (Score:2)
And as predicted 31 years ago... (Score:2)
http://snltranscripts.jt.org/75/75rdecabet.phtml [jt.org]
Predicting the future using language (Score:5, Interesting)
Stanislaw Lem wrote a book -- I think it was _The Futurological Congress_ -- which included people who predicted future inventions by predicting possible words. The theory being: things won't be popular unless they have a good name, so by thinking of good names, and then considering what might have those names, you can predict future developments.
Re:Predicting the future using language (Score:4, Funny)
Squatting (Score:2)
Great (Score:2)
Re: (Score:2)
Bellyfeel! (Score:2)
Keep the 'mitten' in 'smitten' (Score:3, Interesting)
I'm not sure what fancy-pants sources these guys are using, but 'shirve' and 'smite' are definitely not low frequency verbs in my crowd. I say keep the 'mote' in smote. They will rue the day when 'smitted' crosses my lips!
Re:Keep the 'mittens' in The Kittens (Score:2)
Grocken Zie Greek?
Seriously folks, now all they need is a study to predict which comes first - the "regularization" of irregular verbs (you'd think they'd just eat-all bran) versus their seriously overdue death.
Re: (Score:3, Informative)
That is a grievous insult to the English language - shrive yourself or I will smite your ass!
(ok, so I don't have occasion to use "shrive" too often, but "smite" is a very useful word)
Lolcats (Score:2, Interesting)
im in ur internetz, evolving ur languages [icanhascheezburger.com]
Question (Score:2)
I occasionally see algorithms used to predict future outcomes of a system where the algorithm appears to have been manipulated to fit the data rather than actually attempt to model the system in question. A prime example is one where the "novelty" of the universe is plotted over time and spikes appear in correlation with historic events. My question: Is there a specific term to describe this type
Re: Question (Score:2)
I occasionally see algorithms used to predict future outcomes of a system where the algorithm appears to have been manipulated to fit the data rather than actually attempt to model the system in question. A prime example is one where the "novelty" of the universe is plotted over time and spikes appear in correlation with historic events. My question: Is there a specific term to describe this type of shenanigans?
In the general case it should probably be considered a form of "overfitting", in the sense of what happens when you use a high-order polynomial to pass your plot through all your data points, rather than using a straight line or simple curve and allowing some of the data to scatter around it.
Of course, if you deliberately do it to misrepresent something, it can be called "lying" rather than "overfitting".
Too late for "wed" (Score:2)
Death of COBOL (Score:2, Funny)
The Future? (Score:2)
Internet might change these results (Score:3, Insightful)
To google (Score:5, Funny)
I google
I gaigle
I have googlen
The Future of Linguistics... (Score:4, Funny)
Stewardess: "Oh, good. Please tell him that I'll be right back with some medicine."
Barbara: (to man) "Jus hang loose blood, she gonna catch you on the rebound with some medicide..."
Man: "Whatchu talkin' bout momma, my momma didn't raise no dummies, I dug her rap!"
Barbara: "Cut me some slack jack! (arguing in Jive) Jive-ass fool ain't got no brains... anyhow."
(Forgive me if I missed a part, trying to do it from memory here....)
The past as a guide (Score:3, Interesting)
Re: (Score:2)
Or the Slashdot Effect. Or Fark. Or lollercoaster/roflcopter/lmaonade/whatever the latest
Re: (Score:2)
Re: (Score:2)
With
Re: (Score:2)
Re: (Score:2)
Re: (Score:2)
Re: Love! (Score:3, Funny)
Re: (Score:2)
Re:Yes, well...however...there are other methods. (Score:5, Interesting)
Sorry, but this is absolutely false. Korean has dialects that differ significantly from each other - there is no single unified language. Nor did the king ever standardize the language. Korean is no more artificial than any other human language. This appears to be a garbled version of the development of the Hangul alphabet by king Sejong and his advisors. This was a great development, but it was just a writing system, not a standardization of the language itself.
Furthermore, it is not true that someone who speaks Chinese or Japanese can quickly pick up Korean. Chinese and Korean are not only unrelated but of radically different types. Chinese speakers find Korean quite difficult. Japanese speakers find Korean somewhat easier because the two languages are very similar in grammatical type, but even so most of the vocabulary is quite unfamiliar and the morphology, though similar in a general typological way, is quite different in detail.
MOD PARENT UP -- GP is talking out ass (Score:5, Informative)
As a fluent Japanese speaker and part-time studier of Korean, I can vouch for the grammatical similarities -- most intriguing. And also as a part-time studier of Chinese, I can vouch that Chinese and Korean are about as similar as English and Korean -- Korean has borrowed words from both languages, but structurally resembles neither. Okay, so Chinese influenced Korean (and Japanese too) in terms of how counters are used (words like "brace" in "a brace of ducks", or "murder" in "a murder of crows", or "loaf" in "three loaves of bread"), but otherwise Chinese and Korean have pitifully little to do with each other. For that matter, Chinese is closer to English structurally speaking than it is to Korean, so there. ;)
Cheers,
Re: (Score:3, Interesting)
Linguistics 101 lesson: a language is not a bag of words. Any generalization about language that treats it as if it is some bag of words (e.g., in this case, that language change consists of new words entering the bag, while other words fall out of it) shows a profound ignorance of the fundamental ideas of linguistics. A language is a gram
Re: (Score:3, Insightful)