Faculty of Language: Some recent Chomsky lectures on Minimalism

Thursday, June 5, 2014

Some recent Chomsky lectures on Minimalism

Bill Idsardi sent me this link to 4 recent lectures by Chomsky. I have not looked at them myself, but I suspect that some of you may find them interesting.

53 comments:

Marc van OostendorpJune 5, 2014 at 12:05 PM
They are great. I hope there will be a discussion about them; there are several technical aspects of them which I do not understand completely.

To give an example, Chomsky exists that it is misleading to draw trees, and he sticks to bracketed structures instead. The reason seems to be that some nodes in the trees can remain unlabeled, but it is completely unclear to me why that cannot be represented in a tree structure. So I think I might be missing something. As a matter of fact, there are various other things which I do not get about the labeling function (such as: why is it there at all?) The unfortunate thing about these videos is that there are very few questions asked by the students, and there is very little discussion. But the main message is brilliant, in my view.
ReplyDelete
Replies
William MatchinJune 6, 2014 at 9:57 PM
So, I am 32 minutes into lecture 2, and there is discussion of the SMT, so here come my comments on this again. Chomsky discussed the SMT as guiding the choice of the simplest combinatorial operation, i.e., Merge, over some alternative (say, a set of phrase-structure rules). The set is more complicated because there are more rules - more rules being more complex than fewer rules. So this is how I interpret the SMT - more rules is less optimal, longer distance is less optimal than shorter distance, positive number of uninterpretable features is less optimal than zero uninterpretable features, etc.

However, a grammar with phrase structure rules could very well be more efficient for parsing and production than merge, right? I suppose I don't know off the top of my head what would be better for communication, but it's far from clear. But the SMT provides a clear guide in the sense of the some other notion of "optimal" or "minimal" - fewer rules are better than more rules.

So this is just my discomfort in couching the SMT in externalization terms - it would belie the goals of the Minimalist Program, IMO.
ReplyDelete
Replies
OmerJune 7, 2014 at 7:14 AM
I hesitate to make my first chime-in on the blog be tinged with self-promotion, but: I have argued (here, here) that narrow syntax may indeed *not* have anything resembling uninterpretable features (at least, not in the sense of "features that will lead to a crash if not checked"). Certainly, such features don't seem to be involved in what we pre-theoretically refer to as agreement.

What is the relevance of this? Well, Norbert says above that a view of syntax as an "optimal realization of interface conditions" would entail a syntax without uninterpretable features. I think there is good reason to think that this is exactly the syntax we observe. Now: both agreement and movement (or, if Norbert is right, the single operation that underlies them both) are certainly feature *driven* – i.e., they occur only when certain features are present in the derivation – and what I have said here sheds no particular light on why those features would exist in narrow syntax. But I do think it is interesting to note that, if I am right, there are no features in narrow syntax that the interfaces cannot deal with (i.e., the kind that would cause a *crash* were they to arrive at the interfaces untouched). For those who wonder how syntax could be "optimal" while also containing crash-inducing features, this would perhaps be a modest step forward.

ReplyDelete
Replies
Dennis O.June 8, 2014 at 8:16 AM
Norbert wrote: However, as you note all current stories rely on features to drive operations. Is this conceptually better than operations free applying and being filtered out by Bare Output Conditions alone? Not obviously. Is this better than theories where there are no such features at all and so no movement to speak of? Not clear, at least to me.

It's not true that "all current stories" rely on formal features as triggers of syntactic computation; see e.g. work by Reinhart, Fox, Moro and others. It's quite amazing, in my view, that the featural-triggers hypothesis is still so popular, given that (again, in my view) it has had virtually no explanatory success in any domain of syntax. As Chomsky, Fanselow and others have pointed out repeatedly, in almost all cases featural triggers are totally ad hoc, without independent motivation: saying that XP undergoes some operation O because XP has an O-feature is nothing but a restatement of the fact. And even in the one case where there is some initial plausibility for a featural trigger for displacement (wh-movement), there is interesting work suggesting that features aren't needed.

It's particularly surprising to me that so many "Chomskyans," like Norbert, are still adhering to the feature-driven world view, given that Chomsky himself has clearly been moving away from this at least since the 2004 paper, advocating a free-Merge model instead (although I admit he's been a bit ambivalent about this). The motivation is clear: what's done at the interfaces doesn't need to be redundantly replicated in syntax, and so interface-based explanations, at least in principle, shift the explanatory burden away from UG. And while most people have been happy to continue working with featural descriptions rather than more principled notions, there is some very intersting work, as I indicated above (and also Omer's work), that shows that this alternative can be fruitfully pursued.

Note also that the featural-trigger hypothesis has serious conceptual problems: the principle of Full Interpretation has, to my knowledge, never really been justified conceptually; to mark valued uFs as "deleted" you need to introduce a diacritic (masked by its "strike-through" notation, but still a diacritic), hence violate Inclusiveness; you need to somehow assign non-intrinsic features to XPs in the course of the derivation (or in the numeration, but that too is without independent motivation); etc.
ReplyDelete
Replies
Dennis O.June 8, 2014 at 8:19 AM
Something that also strikes me as relevant to this whole issue was mentioned by Omer: The Bare Output Conditions model requires (potentially massive) overgeneration of possible derivations, followed by filtration of those derivations whose outcomes do not meet the relevant conditions.

If you follow Chomsky and drop the idea that there is a significant notion of "well-formed formula" for natural language, then the term "overgeneration" has no real meaning. There's nothing incoherent about the idea that the grammar itself licenses all kinds of expressions along all dimensions of "acceptability," deviance, usefulness, etc. In fact, we know that acceptability and grammaticality cannot (and should not) be directly correlated, although many people seem to be assuming essentially this -- a misunderstanding, I think, rooted in analogies to formal-language theory in early GG (note that in FLT, there are no interface systems/conditions).

So once you drop that, "overgeneration" and conversely "crash-proof (grammar)" all become pretty much meaningless notions, or at least I don't see what they would mean, unless you either mistakenly equate acceptability (or something like that) with grammaticality or else give up on GG's fundamental tenet that competence can/must be meaningfully studied in total abstraction from real-time processes. Otherwise there's no problem whatsoever with having a grammar that yields all kinds of expressons, only a subset of which is usable by interfacing systems -- in fact, I think, it's the most desirable outcome, since it would allow you to get away with a maximally simple core-computation system, in the extreme.

If "sticking close to the SMT" means anything for actual linguistic research, this seems to me to be the guideline: try to show that Merge applies freely, and ascribe as much as the complexity we find beyond this (i.e., basically all the complexity) to the independently given interface systems. And again, I think the only reason this perspective is often frowned upon is the mistaken idea that acceptability = grammaticality, and therefore an efficient grammar should be "crash-proof." As I said before, Chomsky has made this point repeatedly, but it seems that it hasn't really had an impact.
ReplyDelete
Replies
Dennis O.June 8, 2014 at 9:37 AM
This comment has been removed by the author.
ReplyDelete
Replies
UnknownJune 8, 2014 at 1:25 PM
Dennis said that in [Formal Language Theory], there are no interface systems/conditions. Well, there are.

There's an entire subfield of formal language theory that's concerned with the generative capacity of logically defined constraints. For instance, a string language is definable via constraints stated in monadic second-order logic iff it is recognized by a finite-state string automaton iff it is generated by a regular string grammar iff its Myhill-Nerode partition has finite index.

What FLT shows is that we can switch between all these perspectives as we see fit: features or constraints, grammars or recognizers, unbounded dependencies or local subcategorization. Some are more succinct, some are easier to implement, some highlight connections that are easy to miss otherwise (the logic perspective is very useful for comparing phonology and syntax). But they all have their own unique advantages.

That's why I'm having a hard-time making sense of discussions like this (this = features VS constraints, not grammaticality VS acceptability). It doesn't seem to be a discussion of how one's research interests should inform what kind of technical devices one uses. Instead the sentiment is apparently that feature-based perspectives are fundamentally broken and a switch to constraints would easily solve that.
ReplyDelete
Replies

Add comment

Faculty of Language

Comments

Thursday, June 5, 2014

Some recent Chomsky lectures on Minimalism

53 comments:

Contributors