Faculty of Language: The Generative Death March, part 2.

Monday, September 12, 2016

The Generative Death March, part 2.

I’m sitting here in my rocking chair, half dozing (it’s hard for me to stay awake these days) and I come across this passage from the Scientific American piece by Ibbotson and Tomasello (henceforth IT):

“And so the linking problem—which should be the central problem in applying universal grammar to language learning—has never been solved or even seriously confronted.”

Now I’m awake. To their credit, IT correctly identifies the central problem for generative approaches to language acquisition. The problem is this: if the innate structures that shape the ways languages can and cannot vary are highly abstract, then it stands to reason that it is hard to identify them in the sentences that serve as the input to language learners. Sentences are merely the products of the abstract recursive function that defines them, so how can one use the products to identify the function? As Steve Pinker noted in 1989 “syntactic representations are odorless, colorless and tasteless.” Abstractness comes with a cost and so we are obliged to say how the concrete relates to the abstract in a way that is transparent to learners.

And IT correctly notes that Pinker, in his beautifully argued 1984 book Language Learnability and Language Development, proposed one kind of solution to this problem. Pinker’s idea was based on the idea that there are systematic correspondences between syntactic representations and semantic representations. So, if learners could identify the meaning of an expression from the context of its use, then they could use these correspondences to infer the syntactic reprentations. But, of course, such inferences would only be possible if the syntax-semantics correspondences were antecedently known. So, for example, if a learner knew innately that objects were labeled by Noun Phrases, then hearing an expression (e.g., “the cat”) used to label an object (CAT) would license the inference that that expression was a Noun Phrase. The learner could then try to determine which part of that expression was the determiner and which part the noun. Moreover, having identified the formal properties of NPs, certain other inferences would be licensed for free. For example, it is possible to extract a wh-phrase out of the sentential complement of a verb, but not out of the sentential complement of a noun:

(1) a. Who did you [VP claim [S that Bill saw __]]?

b. * Who did you make [NP the claim [S that Bill saw __]]?

Again, if human children knew this property of extraction rules innately, then there would be no need to “figure out” (i.e., by general rules of categorization, analogy, etc) that such extractions were impossible. Instead, it would follow simply from identifying the formal properties that identified an expression as an NP, which would be possible given the innate correspondences between semantics and syntax. This is what I would call a very good idea.

Now, IT seems to think that Pinker’s project is widely considered to have failed [1]. I’m not sure that is the case. It certainly took some bruises when Lila Gleitman and colleagues showed that in many cases, even adults can’t tell from a context what other people are likely to be talking about. And without that semantic seed, even a learner armed with Pinker’s innate correspondence rules wouldn’t be able to grow a grammar. But then again, maybe there are a few “epiphany contexts” where learners do know what the sentence is about and they use these to break into the grammar, as Lila Gleitman and John Trueswell have suggested in more recent work. But the correctness of Pinker’s proposals is not my main concern here. Rather, what concerns me is the 2nd part of the quotation above, the part that says the linking problem has not been seriously confronted since Pinker’s alleged failure [2]. That’s just plain false.

Indeed, the problem has been addressed quite widely and with a variety of experimental and computational tools and across diverse languages. For example, Anne Christophe and her colleagues have demonstrated that infants are sensitive to the regular correlations between prosodic structure and syntactic structure and can use those correlations to build an initial parse that supports word recognition and syntactic categorization. Jean-Remy Hochmann, Ansgar Endress and Jacques Mehler demonstrated that infants use relative frequency as a cue to whether a novel word is likely to be a function word or a content word. William Snyder has demonstrated that children can use frequent constructions like verb-particle constructions as a cue to setting an abstract parameter that controls the syntax of a wide range of complex predicate constructions that may be harder to detect in the environment. Charles Yang has demonstrated that the frequency of unambiguous evidence in favor of a particular grammatical analysis predicts the age of acquisition of constructions exhibiting that analysis; and he built a computational model that predicts that effect. Elisa Sneed showed that children can use information structural cues to identify a novel determiner as definite or indefinite and in turn use that information to unlock the grammar of genericity. Misha Becker has argued that the relative frequency of animate and inanimate subjects provides a cue to whether a novel verb taking an infinitival complement is treated as a raising or control predicate, despite their identical surface word orders. In my work with Josh Viau, I showed that the relative frequency of animate and inanimate indirect objects provides a cue to whether a given ditransitive construction treats the goal as asymmetrically c-commanding the theme or vice versa, overcoming highly variable surface cues both within and across languages. Janet Fodor and William Sakas have built a large scale computational simulation of the parameter setting problem in order to illustrate how parameters could be set, making important predictions for how they are set. I could go on [3].

None of this work establishes the innateness of any piece of the correspondences. Rather it shows that it is possible to use the correlations across domains of grammar in order to make inferences on the basis of observable phenomena in one domain to the abstract representations of another. The Linking Problem is not solved, but there is a large number of very smart people working hard to chip away at it.

The work I am referring to is all easily accessible to all members of the field, having been published in the major journals of linguistics and cognitive science. I have sometimes been told, by exponents of the Usage Based approach and their empiricist cousins, that this literature is too technical, that, “you have to know so much to understand it.” But abbreviation and argot are inevitable in any science, and a responsible critic will simply have to tackle it. What we have in IT is an irresponsible cop out from those too lazy to get out of their armchairs.

I think it’s time for my nap. Wake me up when something interesting happens.

____________________________________

[1] IT also thinks that something about the phenomenon of ergativity sank Pinker’s ship, but since Pinker spent considerable time in both his 1984 and 1989 books discussing that phenomenon, I think these concerns may be overstated.

[2] You can sign me up to fail like Pinker in a heartbeat.

[3] A reasonable review of some of this literature, if I do say so myself, can be found in Lidz and Gagliardi (2015) How Nature Meets Nurture: Statistical Learning and Universal Grammar. Annual Reviews of Linguistics 1. Also, the new Oxford Handbook of Developmental Linguistics (edited by Lidz, Snyder and Pater) is also full of interesting probes into the linking problem and other important concerns.

78 comments:

Nina KazaninaSeptember 12, 2016 at 1:33 PM
Thanks, Jeff, especially for the concrete examples of existing linking attempts. To me the process of employing probabilistic information on the animacy of the subject/object in order to infer the underlying syntactic structure seems unclear, even in broad strokes. A clarification would be most welcome.
ReplyDelete
Replies
AveryAndrewsSeptember 12, 2016 at 5:56 PM
There are some interesting typological universals that really are observational connected to some of these things ... for example there are afaik zero languages where words referring to kinds of things and living things (or 'Spelke objects', if you prefer that kind of terminology; there are many different possibilities that work pretty well for the purpose here) are split into multiple 'part of speech categories' (that is, ones that have an effect on word order principles). This is in striking contrast to the behavior of grammatical features, such as grammatical gender, which often do split up this conceptual category crazy ways. Since there always is a single part of speech into which words for kinds of things and living things will fall, we can label this category 'noun'; then a bit of X-bar gives you NPs, and Pinker's program is basically back in the air (the basic idea is due to John Lyons, in his 1968 book _Theoretical Linguistics_, although I've modified the formulation)

Note carefully that this says nothing about what else may or may not be dumped into the 'noun' category, for example languages often include words indicating kinds of actions ('give it a kick'). A bit oddly, I find it a lot harder to do the same kind of a job for verbs.
ReplyDelete
Replies
Richard MooreSeptember 14, 2016 at 2:44 AM
Of all of the criticisms that one could aim at Mike Tomasello, surely failure to get out of the armchair is not one of them.
ReplyDelete
Replies
Jeff LidzSeptember 14, 2016 at 7:00 AM
When it comes to doing linguistics, this is evidently the correct assessment. Of course, Tomasello has done a mountain of really important work outside of language and he has been involved in many papers on language acquisition. So obviously he is working hard. But when it comes to generative linguistics, he has for the 20 years I've been engaging with his work, I have seen no demonstration that he has even tried to understand the perspective against which he has positioned himself. So, that's the armchair I am referring to.
ReplyDelete
Replies
William MatchinSeptember 14, 2016 at 8:41 AM
I entirely agree that this sort of scorched-earth criticism of Generative Grammar from this quarter is completely unfounded and lazy and deserves the response Jeff offers. However, and I say this as one of the most steadfast Chomskians you will ever find, there is something substantive in what Tomasello and others have to say. So, in the spirit of charity, I’d like to zero in on the fundamentals of what they have (had) to say that we should take into account. From Diogo’s comment above, I looked up Tomasello’s 2000 paper (I found one in Cognitive Linguistics), and I found this very interesting passage from the Conclusion:

“The general picture that emerges from my application of the usage-based view to problems of child language acquisition is this: When young children have something they want to say, they sometimes have a set expression readily available and so they simply retrieve that expression from their stored linguistic experience. When they have no set expression readily available, they retrieve linguistic schemas and items that they have previously mastered (either in their own production or in their comprehension of other speakers) and then ``cut and paste'' them together as necessary for the communicative situation at hand-what I have called ``usage-based syntactic operations'' … It is also important that the linguistic structures being cut and pasted in these acts of linguistic communication are a variegated lot, including everything from single words to abstract categories to partially abstract utterance or phrasal schemas.”

I’d like to point out that this is remarkably similar to the picture I painted in the recent post. If you replace “schemas” with “treelets”, it is pretty much the same thing that I outlined (note that Tomasello later uses the word “linguistic structures” instead of schemas). I find it quite interesting that Tomasello arrives at this picture based on evidence from language acquisition, and I arrived at this picture from psycholinguistics, neuroscience, and even data from the traditional domain of linguistics (e.g., idioms). I think this is a very important generalization for people working in acquisition, sentence processing, and neuroscience of sentences. What Tomasello is missing is what Jeff and the Generative community have been banging on about for a long time – there is a gap between the data and the knowledge. I would like to fill that gap with a Minimalist UG that generates these stored structures, and I think this provides an excellent bridge that incorporates the insights of both camps.

The Generative community has not dealt substantively with Tomasello’s generalization: the heavy use of stored linguistic structures. I find it interesting that I haven’t yet met a syntactician (with the possible exception of Alec Marantz) that denied the existence/use of stored structures. So why don’t we respond to Tomasello by accepting this empirical generalization and developing a model of language acquisition and online processing that makes use of stored structures /constructions / treelets, and arguing that a Minimalist grammar gives us a theory of how children acquire these stored structures?
ReplyDelete
Replies
UnknownSeptember 14, 2016 at 2:21 PM
@William

Well, one could do worse than update Lewis & Vasishth's proposal with a more realistic MG instead of the CFG they use.

Elsewhere, though, John Hale and others (including me) have advanced precise proposals about how MGs can be incorporated into a model of incremental comprehension. These models can furnish quantitative predictions for behavioral and neurophysiological data. I'd be interested in known what's missing from this line of work that you see as necessary for your own interests.

Some recent examples:

[hale 2016](http://dx.doi.org/10.1111/lnc3.12196)
[brennan 2016](http://dx.doi.org/10.1111/lnc3.12198)
ReplyDelete
Replies

Add comment

Faculty of Language

Comments

Monday, September 12, 2016

The Generative Death March, part 2.

78 comments:

Contributors