Faculty of Language: Some reading for the curious

Thursday, February 19, 2015

Here are some recent things that I found interesting that may interest you as well.

On MOOCish matters: http://www.voxeu.org/article/disruptive-potential-online-learning. The big finding is that employers don’t like MOOCs that much and treat them as inferior degrees. This would change if, for example, places like Harvard and MIT and Stanford replaced the 4-year college experience they currently offer to elites with a MOOCish experience. When the well-to-do vote with their kids’ feet and buy into MOOC-based degrees, then everyone will. Till then, it will largely be a way of bending some cost curves (and you know whose) and not others.

Dan Everett (DE) still doesn’t understand what a universal is: http://fivebooks.com/interviews/dan-everett-on-language-and-thought.

This little interview is filled with exciting tidbits. Here are four:

(i) Sapir’s hypothesis concerning the interaction of language with thought is far more modest than many have assumed. On DE’s interpretation, the Sapir-Whorf hypothesis is not the rather exciting (but clearly wrong) view that “the language we speak determines the way we can think” but the rather modest claim that “the language we speak affects in some way some of the ways we think when we need to think quickly.” Note the Kahnemanian tinge here. IMO, this is hardly an exciting thesis, and it is little wonder that the strong version of the thesis is what aroused interest. The weak version seems to me close to a truism.

(ii) But a truism that Everett is impressed with. He claims that Sapir discovered that “culture can influence language” and that though language “clearly has some computational aspects that cannot be reduced to culture…there are a number of broad characteristics that reflect the culture they emerge from…”(3). I confess that this strikes me as obvious and is the first thing a neophyte learning a second language focuses on. So, though Sapir is deserving of honors, it is not because of this “insight.” Curiously, Sapir’s first observation (i.e. that language is a kind of computational system) does not seem to impress Everett. Maybe that’s why Everett has problems understanding claims that people make about such systems. In particular,

(iii) DE still confuses Chomsky Universals with Greenberg Universals. It comes across in DE’s discussion of recursion, where he once again asserts that the existence of a finite language would undermine the Chomsky claim that language is recursive (see answer to question 2). This is not the claim. The claim is that UG produces Gs that are recursive. So the fact that FL endows humans with the capacity to acquire Gs that are recursive does not imply either that every language has a recursive grammar or that every speaker uses this capacity to produce endlessly large sentences. So, even were Piraha a “finite language” as DE claims (and which, truth be told, I still do not believe), it implies nothing whatsoever for Chomsky’s claim that it is a fact about FL/UG that language is recursive. This is simply a non-sequitur based on DE’s misunderstanding of what GGers take a universal to be (note his claim would be valid were he understanding ‘universal’ in Greenbergian terms). However, do not expect DE to ever lose this misunderstanding. As Upton Sinclair once noted: “It is difficult to get a man to understand something when his salary depends on his not understanding it.” What do you think the odds are that DE would be getting interviewed here or featured in the New Yorker or the Chronicle of Higher Education were he not peddling the claim that his work on Piraha showed that Chomsky’s work in linguistics was incorrect? Do I hear 0?

(iv) DE does not appear to understand that Gs can be recursive even if utterances have an upper bound. I am not saying that this is what is the case for Piraha. I am saying that recursion is a property of Gs not of utterances. A mark of recursion (i.e. evidence for recursive mechanisms) can be gleaned by looking to see if the products of this mechanism are unbounded in length and depth. But the converse does not obtain: Gs might be recursive even if utterances (their products) are bounded in size. DE seems to think that during language acquisition, kids scale the Chomsky hierarchy, first treating languages as finite lists and then as generated by regular grammars and then by context free and then…all the way to mildly context sensitive. Where he got this conception I cannot fathom. But there is no reason to think that this is so. And if it is not, then given that Piraha speakers can learn what even DE considers recursive languages (a bad locution, by the way, given that ‘recursive’ is properly speaking a predicate of grammars, and only secondarily their products) like Portuguese, it is clear that they have the same UGs we all do. And if this is right, then it is quite unlikely that they would not acquire a recursive G even for Piraha. But this is a discussion for another time. Right now it suffices for you to know that DE, it appears, cannot be taught and that there is still a large and lucrative market for “Chomsky is wrong” material. Big surprise.
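
The grammar/utterance distinction can be made concrete with a toy sketch. The Python below is purely illustrative: the lexicon, the names and the `max_depth` cap are all assumptions invented for the example, not an analysis of Piraha or of any real grammar. The rule is recursive (S appears inside its own expansion), yet because an external cap limits how often it applies, the set of sentences produced is finite.

```python
from itertools import product

# Hypothetical toy lexicon, invented for this illustration only.
NAMES = ["Sam", "Sasha"]
ATTITUDE_VERBS = ["thinks"]
SIMPLE_CLAUSES = ["Sandy sleeps"]


def sentences(max_depth):
    """Yield every sentence with at most max_depth clausal embeddings.

    The grammar is recursive because S occurs on its own right-hand side:
        S -> SIMPLE_CLAUSE
        S -> NAME ATTITUDE_VERB S
    The max_depth cap is external to the rules; it stands in for whatever
    might limit utterance size and is not part of the grammar itself.
    """
    yield from SIMPLE_CLAUSES
    if max_depth > 0:
        for name, verb in product(NAMES, ATTITUDE_VERBS):
            for embedded in sentences(max_depth - 1):
                yield f"{name} {verb} {embedded}"


if __name__ == "__main__":
    bounded = list(sentences(2))
    print(len(bounded))
    for sentence in bounded:
        print(sentence)
```

With the cap at 2 the rule yields 7 sentences; raise the cap and the same unchanged rule yields more. In this sketch the finiteness lives in the cap, not in the grammar.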

Genes and languages: The Atlantic has a little piece showing that some “languages and genes do in fact share similar geographical fault lines.” Apparently, whether this was so was a question of interest to linguists. As the paper puts it: “Using new dataset and statistical techniques, the researchers were able to scratch an itch linguists and demographers have struggled to reach.” I confess to never having had this itch so I am not sure why this observation is of particular interest to linguists.

It is quite clear that whatever genetic change occurred did not affect the basic structure of FL. How do I know this? Because, so far as we can tell, any kid can still learn any language in roughly the same way any other kid can. And, from what we can tell, all Gs obey effectively the same kinds of general structure dependent constraints. So, whatever the genetic changes, they did not affect those genes undergirding FL/UG. Nor, so far as I can tell, is there any reason to think that the phoneme properties and genetic features that are tracked are in a causal relation (i.e. neither is the root cause of the other’s change). It just seems that they swing together. But is this really surprising? Don’t people who have similar phonemes tend to live near each other? And as these kinds of genetic changes are subject to environmental influence, is this really a surprise?

Maybe this is interesting for some other reason. If so, please post a comment and let me know what that interest is. I would love to know. Really. Here’s the link:

http://www.theatlantic.com/technology/archive/2015/02/how-languages-and-genes-evolve-together/385116/

Some philo/history of science: I enjoyed this little piece mainly for the discussion of the relationship between realism and mathematics in the physical sciences historically. It suggests one way of understanding Newton’s famous line about not feigning hypotheses. His theory gave a precise mathematical understanding of gravity. He thought that this was enough and that metaphysical speculations concerning its “reality” were not required from a scientific theory. At any rate, there has been lots of intellectual pulling and pushing about how to understand one’s theoretical claims (e.g. realistically, instrumentally) and it is interesting to see a little history.

http://www.bostonreview.net/steven-shapin-scientism-virtue?utm_source=Newsletter%3A+January+27%2C+2015&utm_campaign=Jan+27%2C+2015+Newsletter&utm_medium=email

Replication/Reproducibility and stats in science: Here’s yet another paper on replicability in the sciences (https://www.sciencenews.org/article/redoing-scientific-research-best-way-find-truth). Many factors are cited as creating problems, but the one that I thought most provocative is at the end:

Much of the controversy has centered on the types of statistical analyses used in most scientific studies, and hardly anyone disputes that the math is a major tripping point…

There is a case to be made that though statistics is in principle useful, applying it correctly is very, very hard. It’s one of these things that are better in theory than they are in practice. And maybe any paper dressed up in statistical garb should ipso facto be treated cautiously. Right now we do the opposite: stats lend credence to results. Might it be that they should be treated with suspicion until proven innocent? (For some useful discussion of how even the best-intentioned can go statistically astray see this recent piece by Gelman and Loken.)

One great scientist who was very suspicious of statistical results, it seems, was Ernest Rutherford. He was working at a time when physical theory was far more advanced than anything we see in our part of the sciences. Here’s what he said: “If your experiment needs statistics, you ought to have done a better experiment.” The problems with replication seem to lend his one-liner some weight, as does the apparent difficulty inherent in doing one’s stats correctly.
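
A back-of-the-envelope calculation shows one textbook way the stats can go astray. The sketch below assumes the tests are independent, that every null hypothesis is true, and that the usual 0.05 threshold is used; the function name is made up for the illustration. It is the standard multiple-comparisons arithmetic, not Gelman and Loken's own analysis, which concerns subtler cases where the many comparisons are only implicit.

```python
def chance_of_false_positive(num_tests, alpha=0.05):
    """Probability that at least one of num_tests comes out 'significant'.

    Assumes the tests are independent and every null hypothesis is true,
    so each test has probability alpha of a false alarm.
    """
    return 1 - (1 - alpha) ** num_tests


if __name__ == "__main__":
    for n in (1, 5, 20):
        print(n, round(chance_of_false_positive(n), 3))
```

Under those assumptions, twenty looks at pure noise give roughly a 64% chance of at least one "significant" result.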

48 comments:

Marc van Oostendorp, February 19, 2015 at 8:17 AM
Let me use this space to advertise an Introduction to Linguistics MOOC we are making for the University of Leiden, and Coursera. I don't think anybody involved in creating this particular MOOC thinks that courses such as this are going to replace real courses, although they may replace books and other study *material*.

Callum Hackett, February 19, 2015 at 12:15 PM
I think it's a mis-step for MOOCs to sell themselves as alternatives to college education (and they DO sell themselves now; paid versions of courses are granting fancy 'certificates'), and also for people to expect that employers should ever look favourably on them as such an alternative.

It seems to me that MOOCs can have one of two benefits. The first is to give people practical, employable skills like programming. Want to learn? Take a MOOC. But then when you go to an employer, you hardly need to tell them you've taken a MOOC in it, you just need to show you can do it.

The second benefit is to free up time in the college environment in order to reshape education there. Need to learn some fundamentals like calculus, probability, syntax or the like? Take a MOOC in that and then degrees will become about the more interesting applications and explorations of these that can't be replicated with distance learning.

That said, I'm rather partial to William Deresiewicz's NR piece, which was widely criticised (http://www.newrepublic.com/article/118747/ivy-league-schools-are-overrated-send-your-kids-elsewhere). Much of top-end higher education functions like elitist branding to allure financial consultancy scum.

My only concern with using MOOCs like this is that people underestimate how much it's the case that *time*, as well as money, is largely a preserve of the rich, and if this means there will be greater competition amongst applicants to have much more preliminary experience through taking MOOCs alongside high school, then the fact that MOOCs are free isn't going to mean the benefits are shared equally.

karthik durvasula, February 19, 2015 at 1:14 PM
Norbert, my reading of the Gelman and Loken article is that post-hoc justification (statistical or otherwise) for the data is close to useless, since there are simply too many analytical/theoretical possibilities. In some sense, the criticism applies to standard linguistic methodology too, where, to my eye, a lot of post-hoc analyses are presented as predicting the data that has been analysed. The only reason that linguistic theories (primarily, syntactic/semantic analyses) might be less susceptible to the problems assessed by G&L is that the data that has been dealt with is relatively clean - not much variance (“low-hanging fruit”). The moment the variance becomes a problem (with subtle judgments), post-hoc justification becomes a bigger problem. Phonological discussions have already started suffering from this problem because they deal with a lot of gradient phonotactic/fuzzy data these days.

So, I see Gelman & Loken as warning researchers to separate true predictions from fishing expeditions, and true predictions from “post-dictions” (the latter of which are extremely common in standard linguistic arguments too). Their commentary, to me, is not about statistics per se; it is about methodology that might seem convincing, but really isn’t. All this is not to say there is no place for post-hoc justification (surely, science needs it for new ideas). But that is not to be confused with predictions and true testing.

On a related note: This might be interesting reading, to say the least. It’s a short editorial banning the use of NHST and most inferential statistics from their articles. Not an opinion I agree with - the call in my opinion should be for better and more careful stats. But, to each their own. Maybe this is the only thing that can be done to stop the obsession with p<0.05 rampant now, even in linguistic papers.

http://www.tandfonline.com/doi/full/10.1080/01973533.2015.1012991#abstract

Norbert, February 19, 2015 at 7:30 PM
This is posted for Mark Johnson who, no doubt for all the right reasons (hear that, NSA!), is having endless trouble posting on this blog. At the risk of being an accomplice to I know not what, I post here for him. Here's Mark:

I also second the thanks for maintaining this blog!

I realise what I'm about to say is likely to annoy everyone in the Piraha debate, but here goes anyway.

I suspect that the only reason why we see recursion in syntax is because our Language of Thought (or whatever you want to call it) provides us with recursive thoughts. But there are ways of expressing recursive thoughts that don't require recursive syntax, and maybe that's what's going on in Piraha.

For example, sentential anaphora permits us to express a single complex thought using several simple sentences. "Sam suspects Sasha thinks Sandy hates Alex" can also be expressed as "Sam suspects something. It is that Sasha thinks something else. It is that Sandy hates Alex". So we can express an arbitrarily deeply embedded thought via a sequence of sentences with only depth 2 clausal embedding by using sentential anaphora.

It could even be a cultural issue as to whether you prefer to express your recursive thoughts using syntactic recursion or other devices such as anaphora. (It's not a property of a language -- English lets you use both syntactic embedding and sentential anaphora).

I'd expect all languages to be able to express recursive thoughts somehow, e.g., using sentential anaphora. But of course there are perfectly reasonable thoughts that seem to be ineffable in English, and it's possible that the set of ineffable thoughts varies from language to language. I'd be surprised if the Piraha couldn't conceive of thoughts with arbitrary depth of recursion, though, as recursive thoughts seem central to e.g., the theory of mind.

Unknown, February 19, 2015 at 10:16 PM
A fun XKCD comic about p-values.

Unknown, February 20, 2015 at 3:40 AM
Long-term readers of Norbert's blog have no doubt noticed an interesting trend: a couple of years ago the main targets of criticism were articles published in top journals. By now much of the focus is on popular books [Evans' The Language Myth], on-line comments or informal interviews [Everett above]. Presumably this indicates that critics of Chomsky are now also justified in drawing conclusions about what he does or does not understand based on interview volumes - what's right for the goose...

Another question arises. Norbert writes:

"However, do not expect DE to ever lose this misunderstanding. As Upton Sinclair once noted: “It is difficult to get a man to understand something when his salary depends on his not understanding it.” What do you think the odds are that DE would be getting interviewed here or featured in the New Yorker or the Chronicle of Higher Education were he not peddling the claim that his work on Piraha showed that Chomsky’s work in linguistics was incorrect? Do I hear 0?"

If criticizing Chomsky is so profitable, one imagines defending him generates even greater rewards. If Norbert does not wish his readers to start thinking along those lines, he may want to remove the insulting remark on Everett's motive...

ewan, February 22, 2015 at 3:50 PM
Re the genes and language piece, I don't think you were the target audience. This kind of research has been around since the 80s if I recall, showing that basically languages hew pretty closely to one genetic line and human breeding hews pretty closely to people who speak the same language. It always stuck with me as pretty surprising and one of those incredibly useful things to know when you're studying the history of human migration. It's of no immediate use unless you're asking questions about that as far as I can see.

Less immediately, though, historical relationships between languages _are_ relevant when you're studying Greenberg universals, as a nuisance factor that you need to get rid of (and you therefore should have a good model of). This kind of result means in principle you should be able to (carefully) plug in genetic data to improve your model when you're on the search for surfacey universals.

And the presupposition that something like this was true was what lent credence to this week's _other_ historical linguistics story, about Indo-European origins (http://news.sciencemag.org/archaeology/2015/02/indo-european-languages-tied-herders).

Callum Hackett, February 25, 2015 at 2:09 PM
As a less sincere point of rhetoric, I find it amusing that DE's blog post cited by CB in the comments was published the day after this post and in it he throws back the exact same Upton Sinclair quotation that you used above. Perhaps he is a regular reader and is at least influenced to take some of your ideas. I would still take you both to task on it, however! Jibes at other people's intelligence can be fun, but when does a discussion ever profit from imputing nefarious motives? And if such accusations about money are true, or even just sincerely believed, then you're surely both making fools of yourselves by spending time constructing arguments where only bribes have power. In any case, as far as speculative psychoanalysis goes, it looks to me that linguistics is fought on the battle-ground of ego and self-pity much more than financial gain...