Faculty of Language: The GG game: Plato, Darwin and the POS

Sunday, May 25, 2014

The GG game: Plato, Darwin and the POS

Alex Clark has made the following two comments (abstracted) in his comments to this post.

I find it quite frustrating that you challenge me to "pony up a story" but when pressed, you start saying the MP is just a conjecture and a program and not a theory.

So I read the Hauser et al paper where the only language specific bits are recursion and maps to the interfaces -- so where's the learning story that goes with that version of UG/FLN? Nobody gives me a straight answer. They change the subject or start waffling about 3rd factor principles.

I believe that these two questions betray a misunderstanding, one that Alex shares with many others concerning the objectives of the Minimalist Program (MP) and how they relate to those of earlier theory. We can address the issue by asking: how does going beyond explanatory adequacy relate to explanatory adequacy? Talk on the Rialto is that the former cancels the latter. Nothing could be further from the truth. MP does not cancel the problems that pre-MP theory aimed to address. Aspiring to go beyond explanatory adequacy does not amnesty a theory from explanatory adequacy. Let me explain.

Before continuing, however, let me state that what follows is not Chomsky exegesis. I am a partisan of Chomsky haruspication (well not him, but his writings), but right now my concern is not to scavenge around his literary entrails trying to find some obscure passage that might, when read standing on one’s head, confuse. I am presenting an understanding of MP that addresses the indicated question above. The two quoted paragraphs were addressed to (at?) me. So here is my answer. And yes, I have said this countless times before.

There are two puzzles, Plato’s Problem (PP) and Darwin’s Problem (DP). They are interesting because of the light they potentially shed on the structure of FL, FL being whatever it is that allows humans to be as linguistically facile as we are. The work in the last 60 years of generative grammar (GG) has revealed a lot about the structure of FL in that it has discovered a series of “effects” that characterize the properties of human Gs (I like to pretentiously refer to these as “laws of grammar” and will do so henceforth to irritate the congenitally irritated). Examples of the kinds of properties these Gs display/have include the following: Island effects, binding effects, ECP effects, obviation of Island effects under ellipsis, parasitic gap effects, Weak and Strong Crossover effects etc. (I provided about 30 of these effects/laws in the comments to the above mentioned post, Greg K, Avery and others added a few more). To repeat again and loudly: THESE EFFECTS ARE EMPIRICALLY VERY WELL GROUNDED AND I TAKE THEM TO BE ROUGHLY ACCURATE DESCRIPTIONS OF THE KIND OF REGULARITIES THAT Gs DISPLAY AND I ASSUME THAT THEY ARE MORE OR LESS EMPIRICALLY CORRECT. They define an empirical domain of inquiry. Those who don’t agree I consign to the first circle of scientific hell, the domicile of global warming skeptics, flat earthers and evo deniers. They are entitled to their views, but we are not required (in fact, it is a waste of time) to take their views seriously. So I won’t.

Ok, let’s assume that these facts have been established. What then? Well, we can ask what they can tell us about FL. IMO, they potentially tell us a lot. How so? Via the POS argument. You all know the drill: propose a theory that derives the laws, take a look at the details of the theory, see what it would take to acquire knowledge of this theory which explains the laws, see if the PLD provides sufficient relevant information to acquire this theory. If so, assume that the available data is causally responsible.[1] If not assume that the structure of FL is causally responsible. Thus, knowledge of the effects is explained by either pointing to the available data that it is assumed the LAD tracks or by adverting to the structure of LAD’s FL. Note, it is critical to this argument to distinguish between PLD and LD as the LAD has potential use of the former while only the linguist has access to the latter. The child is definitely not a little linguist.[2]

All of this is old hat, a hat that I’ve worn in public on this blog countless times before and so I will not preen before you so hatted again. What I will bother saying again is that this can tell us something about FL. The laws themselves can strongly suggest whether FL is causally responsible for this or that effect we find in Gs. They alone do not tell us what exactly about FL is responsible for this or that effect. In other words, they can tell us where to look, but they don’t tell us what lives there.

So, how does one go from the laws+POS to a conjecture/claim about the structure of FL? Well, one makes a particular proposal that were it correct would derive the effects. In other words, one proposes a hypothesis, just as one does in any other area of the sciences. P,V,T relate to one another via the gas laws. Why? Well maybe it’s because gases are made up of small atoms banging against the walls of the container etc. etc. etc. Swap gas laws for laws of grammar and atomic theory for innately structured FL and off we go.

So, what kinds of conjectures have people made? Well, here’s one: the principles of GB specify the innate structure of FL.[3] Here’s why this is a hypothesis worth entertaining: Were this true then it would explain why it is that native speakers judge movement out of islands to be lousy and why they like reflexivization where they dislike pronominalization and vice versa. How does it explain these laws? As follows: if the principles of GB correctly characterize FL, then in virtue of this FL will yield Gs that obey the laws of grammar. So, again, were the hypothesis correct, it would explain why natural languages adhere to the generalizations GG has discovered over the last 60 years.[4]

Now, you may not like this answer. That’s your prerogative. The right response is to then provide another answer that derives the attested effects. If you do, we can consider this answer and see how it compares with the one provided. Also, you might like the one provided and want to test it further. People (e.g. Crain, Lidz, Wexler, a.o.) have done just that by looking at real time acquisition in actual kids. At any rate, all of this seems perfectly coherent to me, and pretty much standard scientific practice. Look for laws, try to explain them.

Ok, as you’ve no doubt noticed, the story told assumes that what’s in FL are principles of GB.[5] Doesn’t MP deny this? Yes and No. Yes, it denies that FL codes for exactly these principles as stated in GB. No, it assumes that some feature of FL exists from which the effects of these principles follow. In other words, MP assumes that PP is correct and that it sheds light on the structure of FL. It assumes that a successful POS argument implies that there is something about the structure of the LAD that explains the relevant effect. It even takes the GB description of the effects to be extensionally accurate. So how does it go beyond PP?

Well, MP assumes that what’s in FL does not have the linguistic specificity that GB answers to PP have. Why?

Well, MP argues that the more linguistically specific the contents of FL, the more difficult it will be to address DP. So, MP accepts that GB accurately derive the laws of grammar but assumes that the principles of GB themselves follow from yet more general principles many of which are domain general so as to be able to accommodate DP in addition to PP.[6] That, at least, is the conjecture. The program is to make good on this hunch. So, MP assumes that the PP problem has been largely correctly described (viz. that the goal is to deduce the laws of grammar from the structure of FL) but that the fine structure of FL is not as linguistically specific as GB has assumed. In other words, that FL shares many of its operations and computational principles with those in other cognitive domains. Of course, it need not share all of them. There may be some linguistically specific features of FL, but not many. In fact, very very few. In fact, we hope, maybe (just maybe, cross my fingers) just ONE.

We all know the current favorite candidate: Merge. That’s Chomsky’s derby entry. And even this, Chomsky suggests may not be entirely proprietary to FL. I have another, Label. But really, for the purposes of this discussion, it doesn’t really matter what the right answer is (though, of course I am right and Chomsky is wrong!!).

So, how does MP go beyond explanatory adequacy? Well, it assumes the need to answer both PP and DP. In other words, it wants the properties of FL that answer PP to also be properties that can answer DP. This doesn’t reject PP. It doesn’t assume that the need to show how the facts/laws we have discovered over 60 years follow from FL has all of a sudden gone away. No. It accepts PP as real and as described and aims to find principles that do the job of explaining the laws that PP aims to explain but hopes to find principles/operations that are not so linguistic specific as to trouble DP.

Ok, how might we go about trying to realize this MP ambition (i.e. a theory that answers both PP and DP)? Here’s a thought: let’s see if we can derive the principles of GB from more domain general operations/principles. Why would this be a very good strategy? Well because, to repeat, we know that were the principles of GB innate features of FL then they would explain why the Gs we find obey the laws of grammar we have discovered (see note 6 for philo of science nostrums). So were we able to derive GB from more general principles then these more general principles would also generate Gs that obeyed the laws of grammar. Here I am assuming the following extravagant rule of inference: if AàB and BàC then AàC. Tricky, eh? So that’s the strategy. Derive GB principles from more domain general assumptions.

How well has MP done in realizing this strategy. Here we need to look not at the aims of the program, but at actual minimalist theories (MT). So how good are our current MT accounts in realizing MP objectives? The answer is necessarily complicated. Why? Because many minimalist theories are compatible with MP (and this relation between theory and program holds everywhere, not just in linguistics). So MP spawns many reasonable MTs. The name of the game if you like MP is to construct MTs that realize the goals of MP and see whether you can get them to derive the principles of GB (or the laws of grammar that GB describes). So, to repeat, how well have we done?

Different people will give different answers. Sadly, evaluations like these require judgment and reasonable people will differ here. I believe that given how hard the problems are, we have done not bad/pretty well for 20 years of work. I think that we have pretty good unifications of many parts of GB in terms of simpler operations and plausibly domain general computational principles. I have tried my own hand at this game (see here). Others have pursued this differently (e.g. Chomsky). But, and listen closely here, MP will have succeeded only if whatever MT it settles on addresses PP in the traditional way. As far as MP is concerned, all the stuff we thought was innate before is still innate, just not quite in the particular form envisaged. What is unchanged is the requirement to derive the laws of grammar (as roughly described by GB). The only open question for DP is whether this can be done using domain general operations/principles with (at most) a very small sprinkling of domain specific linguistic properties. In other words, the open question is whether these laws are derived directly from principles of GB or indirectly from them (think GB as axioms vs GB as theorems of FL).

I should add that no MT that I know of is just millimeters away from realizing this MP vision. This is not a big surprise, IMO. What is a surprise, at least to me, is that we’ve made serious progress towards a good MPish account. Still, there are lots of domain specific things we have not been able to banish from FL (ECP effects, all those pesky linguistic features (e.g. case), the universal base (and if Cinque is right, it’s a hell of a monster) and more). If we cannot get rid of them, then MP will only be partly realized. That’s ok, programs are, to repeat, not true or false, but fecund or not. MP has been very fertile and we (I?) have reason to be happy with the results so far, and hopeful that progress will continue (yes, I have a relentlessly sunny and optimistic disposition).

With this as prologue, let’s get back to Alex C. On this view, the learning story is more or less the one we had before. MP has changed little.[7] The claim that the principles of GB are innate is one that MP can endorse (and does, given the POS arguments). The question is not whether this is so, but whether the principles themselves are innate or do they derive from other more general innate principles. MP bets on the second. However, MP does not eschew the conclusion that GB (or some equivalent formulation) correctly characterizes the innate structure of FL. The only question is how direct these principles are instantiated, as axioms or as theorems. Regardless of the answer, the PP project as envisioned since the mid 60s is unchanged and the earlier answers provided still quite viable (but see caveat in note 7).

In sum, we have laws of grammar and GB explanations of them that, via the POS, argue that FL has GBish structure. MP, by adding DP to the mix, suggests that the principles of GB are derived features of FL, not primitive. This, however, barely changes the earlier conclusions based on POS regarding PP. It certainly does not absolve anyone of having to explain the laws of grammar. It moreover implies that any theory that abstracts away from explaining these laws is a non-starter so-far as GG is concerned (Alex C provides a link to one such theory here).[8]

Let me end: here’s the entrance fee for playing the GG game:

1. Acceptance that GG work over the last 60 years has identified significant laws of grammar.

2. Acceptance that a reasonable aim of research is to explain these laws of grammar. This entails developing theories (like GB) which would derive these laws were these theories true (PP).

3. More ambitiously, you can add DP to the mix by looking for theories using more domain general principles/operations from which the principles of GB (or something like them) follow as “theorems,” (adopting DP as another boundary condition on successful theory).

That’s the game. You can play or not. Note that they all start with (1) above. Denial that the laws of grammar exist puts you outside the domain of the serious. In other words, deny this and don’t expect to be taken seriously. Second, GG takes it to be a reasonable project to explain the laws of grammar and their relation to FL by developing theories like GB. Third, DP makes step 2 harder, but it does not change the requirement that any theory must address PP. Too many people, IMO, just can’t wrap their heads around this simple trio of goals. Of course, nobody has to play this game. But don’t be fooled by the skeptics into thinking that it is too ill defined to play. It’s not. People are successfully playing it. It’s just when these goals and ambitions are made clear many find that they have nothing to add and so want to convince you to stop playing. Don’t. It’s really fun. Ignore their nahnahbooboos.

[1] Note that this does not follow. There can be relevant data in the input and it may still be true that the etiology of the relevant knowledge traces to FL. However, as there is so much that fits POS reasoning, we can put these effects to the side for now

[2] One simple theory is that the laws themselves are innate. So, for example, one might think that the CNPC is innate. This is one way of reading Ross’s thesis. I personally doubt that this is right as the islands seem to more or less swing together, though there is some variation. So, I suspect that island effects themselves are not innate though their properties derive from structural properties of FL that are, something like what Subjacency theory provides.

[3] As many will no doubt jump our of their skins when they encounter this, let me be a tad careful. Saying that GB is innate does not specify how it is thus. Aspects noted two ways that that this could be true: GB restricts the set of admissible hypotheses or it weights the possible alternative grammars/rules by some evaluation measure (markedness). For current purposes, either or both are adequate. GB tended to emphasize the restrictive hypothesis space, Ross, for example, was closer to a theory of markedness.

[4] Observe: FL is not itself a theory of how the LAD acquires a G in real time. Rather it specifies, if descriptively adequate, which Gs are acquirable (relative to some PLD) and what properties these Gs will have. It is reasonable to suppose that what can be acquired will be part of any algorithm specifying how Gs get acquired, but they are not the same thing. Nonetheless, the sentence that this note is appended to is correct even in the absence of a detailed “learning theory.”

[5] None of the above or the following relies on it being GB that we use to explain the laws. I happen to find GB a pretty good theory. But if you want something else, fine. Just plug your favorite theory in everywhere I put in ‘GB’ and keep reading.

[6] Again this is standard scientific practice: Einstein’s laws derive Newton’s. Does this mean that Newton’s laws are not real? Yes and No. They are not fundamental, but they are accurate descriptions. Indeed, one indication that Einstein’s laws are correct is that they derive Newton’s as limit cases. So too with statistical mechanics and thermodynamics or quantum mechanics and classical mechanics. That’s the way it works. Earlier results (theory/laws) being the target of explanation/derivation of later more fundamental theory.

[7] The one thing it has changed is resurrect the idea that learning might not be parameter setting. As noted in various posts, FL internal parameters are a bit of a bother given MP aims. So, it is worth considering earlier approaches that were not cast in these terms, e.g. the approach in Berwick’s thesis.

[8] It’s oracular understanding of the acquisition problem simply abstracts away from PP, as Alex D noted. Thus, it is without interest for the problems discussed above.

60 comments:

benjamin.boerschingerMay 26, 2014 at 2:07 AM
I think part of the issue is the tension between your footnote 7

"The one thing it has changed is resurrect the idea that learning might not be parameter setting."

and the fact that the only work that is really "visible" at the moment is Yang's variational parameter setting and Fodor/Sakas-triggering. So it at least looks (at least to the "outsiders") as if the only work that considers the acquisition problem at the moment is somewhat at odds with the current state-of-the-art, and I think it's fair game pointing this out.

Your footnote mentions Berwick's thesis, so perhaps you (or Bob?) could elaborate just a bit how this fits into the bigger picture?
ReplyDelete
Replies
NorbertMay 28, 2014 at 2:17 PM
@Thomas
I guess I fail to understand. You are interested in a different set of questions for which the laws as I understand them do not matter. Far be it from me to stop you from looking at whatever you want. However, I do not see that the problems really overlap or that because you are interested in what you are it renders illegitimate the questions I am asking. This is not the sense I get from Alex. He thinks that my questions are ill formed or silly. He thinks that what he is doing renders my questions moot. I frankly don't see this, nor would I understand your results as bearing on my questions, to the degree I understand them. I am interested in why there are hierarchical and locality conditions on binding. You clearly are not as you are happy to hand code these in. That's fine with me, we are doing very different things. That's all that I want conceded. Maybe they will touch one another in the future, but right now, they are, or appear to me to be, miles apart.
ReplyDelete
Replies
Alex ClarkMay 29, 2014 at 12:39 AM
I really don't think your goal is silly; it differs only very slightly from mine.
There are goals, the methodology we use to approach that goal, and the theories we produce using that methodology. My goal is, IMO, ucontroversial, I want to figure out the structure of the LAD. Your goal is slightly different, I take it. " I am interested in why there are hierarchical and locality conditions on binding.".
You want to figure out why the attested languages have the properties they have. Or is this only as a step towards figuring out UG?

Our disagreement is about methodology, and as a result about the reliability of the results produced. But you have backed off from any *theoretical* claims. And you have put no theories of the LAD on the table, so there is nothing to disagree with there.

I still don't understand why you object so strongly to the methodology I use, other that perhaps the fact that I have not yet arrived at a theory that explains the complex syntactic phenomena you are interested in. Or rather I understand the sociological reasons, but not the scientific ones.
ReplyDelete
Replies
NorbertMay 29, 2014 at 7:50 AM
@Alex:
I think that we only appear to have the same goals. My goal is to figure out the structure of the LAD given that we know a lot about it already. What do we know? Well the 30 or so effects I mentioned characterize some laws of grammar. Moreover, I think that the theories developed in the mid 80s (GB being the one I practice, but I find GPSG, LFG, RG, etc to be notational variants over a large part of the domain of interest).

Given this view, I am open to any account of why these laws of grammar exist and why the LAD’s Gs adheres to them. I am very open minded here. I have no problem with theories that are domain specific (ascribing linguistically proprietary properties to FL), nor do I eschew domain general explanations which exploit general cognitive mechanisms. In fact, given my MPish inclinations I prefer the latter. However, what I want out of them is that they address the facts as we know them to be. I have little interest in approaches that fail to address these issues.

As you know, because I’ve said it repeatedly, I think that one of the strengths of GBish like accounts is that they try to explain these laws of grammar. Binding theory aims to explain binding effects, subjacency tries to explain island effects, case theory tries to explain A-movement effects etc. This is a plus for these theories and though I think that they are not quite right, I do think that they are on the right track precisely because they do a fairly decent job of explaining the laws. This belief leads to a proposed approach if one has MPish inclinations: try to derive the principles of GB from more domain general assumptions. Were you to do this, it would derive the laws of grammar (more or less) as a by-product.

But that’s all detail. What is our methodological difference? It does not love in a rarified atmosphere. It’s quite basic: we disagree about what the basic things we should explain are. We disagree at very basic levels: e.g. you seem enamored of string properties, I insist the basic data is judgments concerning constrained homophony (sound-meaning pairings). Your work ignores the generalizations I mentioned above, mine directly addresses them. You find only domain general theories admissible, I find this to be an open empirical question and tolerate domain specificity if that is what explains the established data. My wishes are to reduce these to a minimum. My methodology disallows me from ignoring them when I cannot. You cannot explain everything at once. Granted. But when you cannot explain something, you say so and remain appropriately sheepish until you can. You do not disparage conclusions that get the basic facts in service of some hope that one day you might. That’s where our methodologies differ.

Alex, I have nothing against the following statement: “theory T were it correct would explain facts F. But theory T has features I personally don’t like and would like to explain in more general terms. Right now I cannot do that nor do I know how one could do that. But that is my hope.” It’s always nice to know what one’s hopes are and how far one has gone in realizing them. But it’s a pretty weak argument and until realized does not serve as an actual contender. Frome where I sit, you take your hopes to constitute a methodological reason for disparaging stories that work pretty well. That is, IMO, a very bad way to proceed. That’s my main objection to your methodology.
ReplyDelete
Replies
NorbertJune 1, 2014 at 7:45 PM
@Alex D:

Amen.
ReplyDelete
Replies
Alex ClarkJune 2, 2014 at 5:16 AM
@AlexD ; let me answer here so as not to get mixed up with CB's biolinguistics.

"@Alex C. You seem to be saying that although the ECP is probably innate, we shouldn't say that it is because that would stop us looking for a deeper explanation. However, almost all of the people who actually are looking for deeper explanations of ECP effects think that they are innately determined! "

I don't think ECP is innate. I don't think Norbert does either -- at least he has backed off from that, I think, to the claim that there are ECP effects and the ECP effects are caused by some innate bias.
This is not just an annoying bit of hair-splitting. So let us distinguish two claims.

1) The ECP is a true intensional statement about the psychologically real grammars.

2) The ECP is a good descriptive generalisation about the sorts of phenomena we see in the sound/meaning pairings in natural language.

So the consensus here (I think ??) is that 1) is false and 2) is true. I am sure someone will correct me if this is wrong.

Let's assume that for the moment.
Now my argument is this.

Any theory of linguistics has to have a number of components that need to fit together.

Two of those components are
(UG) a specification of the class of possible grammars
and
(LAD) a specification of a learning algorithm that selects one of these on the basis of information available to the child.

UG and LAD need to fit together quite closely. N argues that we can specify UG without thinking about learning algorithms and someone else will come along later and sort out the uninteresting technical details of the learning algorithm. I think this is a mistake for various technical reasons.
I think that positing innate stuff that may be false as stated, will make the problem of finding a learning algorithm much harder, perhaps even impossible.
Whereas perhaps some GGers think that this makes the problem easier, perhaps even trivial (if e.g. the class of grammars becomes finite as a result).

As a side issue, as you know, I don't think that just saying "P is innate" constitutes an explanation of P.
But if you want to say that it is an explanation, but a shallow one, then I am not going to argue the point,
and in any event I am not interested in explaining things like the ECP, but in solving Plato's problem.
ReplyDelete
Replies
Alex DrummondJune 2, 2014 at 10:25 AM
@Alex C. I don't want to repeat what N/CA are saying, so let me just comment on your (1) and (2). When I say that the ECP is innate (or that ECP effects are innately determined, or something like that), I don't mean either of (1) or (2). I mean that ECP effects derive primarily from innately-determined properties of the grammar (or LAD, processing systems, and whatever else is involved[*]). This does not entail (1); it entails (2) but is a stronger claim.

On this understanding it is just a fact that the ECP is innate, just as it's a fact that barking is an innately-determined dog behavior. Whether or not that fact explains anything is largely irrelevant. The point is just that things could have been otherwise but aren't. So, there's a parallel universe where dogs learn to bark by observing other dogs, but we're not in that universe. Knowing this places useful constraints on further investigations of barking.

It bears repeating that everyone who's ever had anything interesting to say about ECP effects (and I wish I belonged to this small but illustrious group!) has been a generative syntactician of some stripe who starts from the assumption that they're innately determined.
ReplyDelete
Replies
gopipatelMay 27, 2020 at 5:20 AM
Awesome article, Thanks for sharing!
Help With Choosing the POS System for Your Business
ReplyDelete
Replies
AnonymousJune 12, 2022 at 11:52 PM
Great Helpful piece of content. Precise and accurate information. Really Appreciate it. You might be looking for hotel management software New York
ReplyDelete
Replies
algosoftMay 19, 2024 at 9:41 PM
Thanks for sharing an article like this. The information which you have provided is better than another blog. Great work.

Android apps development company in Noida
ReplyDelete
Replies
VivekvermaMay 19, 2024 at 10:21 PM
Thanks for sharing an article like this. The information which you have provided is better than another blog.

Freelance mobile app developer
ReplyDelete
Replies
TheCodeWorkMay 20, 2024 at 1:28 AM
Thanks for sharing The Awesome content. I will also share with my friends. Great Content thanks a lot.

Web and app development services
ReplyDelete
Replies
Print Online AEMay 20, 2024 at 3:42 AM
Thank you for sharing such a nice article with us I had visited your website first time and got myself fall in love with your content, i have bookmarked your website to visit again ahead

Personalized gifts dubai
ReplyDelete
Replies
AI Text GeneratorsMay 20, 2024 at 5:13 AM
You guys are writing some Amazing tips. This article really helped me a lot. Thanks for sharing this

AI Voice Generator
ReplyDelete
Replies

Add comment