Comments on Faculty of Language: Simplicity and Ockham

Thanks all for this discussion here. Lots of issue...

2016-05-12T18:08:31.433-07:00

Thanks all for this discussion here. Lots of issues in urgent need of clarification. I'm planning to work on some of them, so this has been very helpful!

I agree with David's main points, particularly...

2016-05-12T06:43:54.589-07:00

I agree with David's main points, particularly that there is no current way to squeeze a prohibition against SWM simply from the properties of Merge. I also agree that one way to do this is to go for some property of the memory system and argue that a simple memory structure gets the prohibition. This is what David did in his Baggett lecture 3. Where David and I probably part ways is on how much independent evidence (conceptual or empirical) we have for the particular memory structures needed to get the prohibition. IMO, the conceptual argument is weak at best and the main motivation is to eliminate SWM. If this is correct (and I am pretty sure that David would disagree) then the question resolves down to the empirical arguments for or against SWM. Not surprisingly, my main argument in its favor resolves around the issues of adjunct control and parasitic gaps. I very much like Nunes PG account (IMO, it is the only extant theory that derives most of the PG effects without too much stipulation) and I also believe that the parallels between adjunct control and complement control requires that they be treated in a parallel manner. So, if you like the MTC for complements you will like SWM for adjunct control. Of course, someone's modus pinene can be someone else's modus tollens so even if you accept the conditional there are (to my mind, very unsavory but logically coherent) ways out.

Last brief for SWM: it really does seem to be a novelty made available by BPS and merge style thinking. One of the things we should be open to is considering such theoretical novelties. This is what happens in real sciences when one gets a change in theoretical perspective. IMO, we have been much too quick to discard the possibility of SWM. Even it is turns out to be something that w want to dump, we should not do too before we examine its virtues and vices more carefully. Right now, IMO, most of the rejection has been knee-jerk, more a reflection of GBish prejudices than careful consideration. There have been some good points made (e.g. David's reconstruction arguments) but it would be good to take it seriously enough to compile a convincing list of problems so that we fully understand its downside.

Interesting discussion. Like Dennis, I'd alway...

2016-05-12T05:01:48.263-07:00

Interesting discussion. Like Dennis, I'd always taken Chomsky to mean, by ternarity in this case, that Merge would have to have access to 3 objects. But I think if you work this out in any formal way, you end up being unable to make the distinction without, ultimately, reintroducing a disjunction that gives the Merge/Move distinction. You can sort of see it in his latest specifications of what Merge is: (i) Select A from the workspace; (ii) select B (from either the workspace or from A); Merge(A, B). True, here the disjunction essentially comes for free, if there is only the workspace and objects constructed, but, if constructed objects are available for selection, as they have to be to allow internal Merge, then why, in clause (i) here can't you select C in A, already constructed. To rule that out, you need to say that only top-level elements in the workspace are accessible. In terms of simplicity, we'd want to make clauses (i) and (ii) the same: (i) select A; (ii) select B; (iii) Merge(A,B), but then we allow sideways move. This is basically what I said in that 3rd Baggett lecture, and is, I think, of a one with Norbert's view that there doesn't seem to be a simplicity argument here. I don't think the copies/repetition story that Dennis told really helps in the absence of a good theory of that, and I agree with everyone that we don't have such a theory. I tried, in that third Baggett lecture, to say that Merge was maximally simply (as in the version just above) but the data structure it operated over had a cache-register structure, so that Merge can only see inside the register, and that register isn't big enough to allow sideways movement. So that's taking some of the complexity out of Chomsky's definition of move, but placing that complexity in the memory architecture. I guess the conceptual argument for that is not simplicity of structure, but rather efficiency of operations (operations are more optimal if they work over smaller structures). So that's a second kind of economy: not just simplicity of definition, but reduction in required resource.

Thx for this. It was very helpful. Note, that indi...

2016-05-11T12:13:45.294-07:00

Thx for this. It was very helpful. Note, that indices are just the device to distinguish two different tokens of the same type. that's what they are designed to do. Chomsky has claimed that indices violate inclusiveness. Maybe. But we actually have no current account of what it is to "take" a lexical atom and use it in a syntactic derivation. Presumably, when we access an atom, the lexicon does not shrink by one. Selecting an atom for the syntax does not reduce the size of the lexicon. So, accessing is more akin to tokening. However, whatever the story, I agree that it is important to get it straight.

I am on board as well with he observation that numerations are an unneeded encumbrance. I thought that phases were defined at some point by the subnumerations associated with them. I see that this is may not be required anymore. I am not as moved as others are by the idea that phase heads drive all syntax and don't even find the valued/unvalued distinction that useful. But that is a discussion for another time over beer (I'll buy). So thx again.

Norbert: I don't think anybody knows how copie...

2016-05-11T11:36:48.713-07:00

Norbert: I don't think anybody knows how copies are distinguished from repetitions in a Merge-based system (although there are proposals). I think it's a huge gap in this (otherwise very impressive and revolutionary) idea that PSG and TG can be collapsed into Merge. It needs to be resolved before this can be considered a coherent system.

Yes, Chomsky at some point had the idea that you can do it by tracking EM vs. IM within the phase, but what does that really mean? EM vs. IM is a descriptive, not an ontological distinction; it's the same operation. At some point he argued that you could do it by having EM apply before and IM at the phase level, but that just re-introduces a Merge vs. Move distinction, and it's also not clear to me how this could be implemented (if Transfer 'sees' IM applying at the phase level, that doesn't in and of itself change anything about the representation shipped off to the interfaces). You can make either Merge or Transfer more complicated by making them add indices (as Collins & Stabler do, if I remember correctly), but it will always remain a complication.

In short, I don't think there's any standard wisdom, or if there is, I'm not aware of it. I think it's a huge and very important open issue (but note that outside of venues like this one, hardly anyone in the field seems to care).

I don't know what the phase-related reasons might be to assume numerations/arrays. I don't see why they would be necessary. If phases (assuming we need them at all) can be delimited in any other way (uFs, interface legibility, whatever) they're just redundant if not vacuous.

Thanks for the pointer to Marcus' work in this context, I should take another look at it. But I think in this case the problem is really introduced by set theory, where there is (as far as I know) no meaning to the statement that two "x"s in a given representation are different objects, whereas two "John"s in NL can be.

I agree that we don't strongly disagree. I was...

2016-05-11T10:47:03.692-07:00

I agree that we don't strongly disagree. I was trying to sharpen matters to see what the issues are. Thx for helping do this.

I agree that numerations are likely not useful anymore given that merge over move economy accounts are out. I though, however, that we still had phase reasons to consider numerations, but I might be out of date here. That said, how exactly does the current grammar track the distinction between copies and originals? I thought that Chomsky allowed this to be tracked within a given phase. In other words, it is a fact about phasal memory that it can distinguish copies from originals (oviducts of E vs I merge) within a phase. If this is not how things are done, how is it done?

I should add, that IMO, contrary to Chomsky here, indexing selections from the lexicon is not a very computationally onerous task. It is a technical way of distinguishing types from tokens (objects from occurrences) which is something that almost any computational system needs to be able to do (See Marcus discussion in The Algebraic Mind). thus, it is likely something NOT proprietary to language. At any rate, we all agree that this needs doing and so the question becomes how does the G do it. I am asking here because I really am not sure what the standard wisdom is. Help is appreciated.

@Norbert: I don't think we have any strong dis...

2016-05-11T09:59:32.167-07:00

@Norbert: I don't think we have any strong disagreement. I just think it's an interesting (open) question whether or not SWM is fully formally compatible with binary Merge (because I do think there is a good conceptual argument for keeping Merge strictly binary). The copies/repetitions problem is related but somewhat orthogonal, and I certainly didn't mean to say that this problem is specific to SWM. I was merely trying to reconstruct Chomsky's claim that SWM and such cases are non-binary.

As for your second paragraph: I don't think tracking selection from the lexicon is a viable option. First of all, this is backtracking, which we want to avoid, whether you do it via numerations or not. Numerations shouldn't be a part of the theory unless we need transderivational comparisons; at least I know of no other principled motivation for having them. Secondly, this becomes somewhat more complex withn it comes to deciding whether some complex object (say, "the tall man" rather than just "John") has been merged externally or internally. In this case we need to know if DP = {the,{tall,man}} is a copy or a repetition, but it's not something that's been drawn from the lexicon (although its terms are, of course).

2016-05-11T09:59:12.706-07:00

This comment has been removed by the author.

On Sober Not picking up any pretty much any of th...

2016-05-11T08:18:34.570-07:00

On Sober

Not picking up any pretty much any of the comments, but...

Sober has been on this topic for MANY years, going back at least to his 1975 book "Simplicity". That one is of some particular interest to linguists because it devotes one chapter (#3) to . . . "The sound pattern of English"! He does so because, as he notes, it was (and probably still is) just about the only instance ever of actual working scientists trying to make explicit (and formal) just what they meant by 'simplicity', not merely invoking it.

Subsequent works, especially "Reconstructing the past" (1988, a chapter on "The philosophical problem of simplicity"), offer further reflections and revisions on the topic. No doubt the current work references the earlier stuff, but the SPE focus of "Simplicity" is not something that comes up all that often in my experience.

--RC

@Dennis I would go Alex one step further: to the d...

2016-05-10T07:32:13.341-07:00

@Dennis
I would go Alex one step further: to the degree that Minimality plus LBE plus phases might work to distinguish copies from repetitions then to that degree SWM can. The same restrictions hold in the SWM case as in the simple I-merge case (or so I argued in the 2009 book).

I should add two more points. All of this is moot if we allow the G to track selections from the lexicon. This is already done in any theory that has numerations. A numeration with 2 selections of an LI is different from a numeration with one. The question then becomes whether we should allow this tracking in the syntax. So two selections of 'John' are indicially distinguished. The claim is that this is a gross violation of inclusiveness. So, that violation is ok in the numeration but not in the G. I confess that this is too subtle for me. Ok, say that there is no numeration. One still needs to "remember" which are copies and which not. It is claimed that this is all done within the phase. But then why assume that SWM is less capable than I merge of retaining this info? This only follows if phases can only be defined for single rooted structures. But why assume this? So far as I can tell, this is not so. Note, btw, that long distance binding will need something like indices anyhow. So they must exist in either the G or the interfaces. If they exist in the latter then indices are generally cognitively available so why shouldn't the G use them? But this is an issue for another time.

Very last point. I think that it is critical to understand that whatever our conclusions here, Chomsky's recursive definition of Merge allows SWM. What I mean is that nothing in the definition prohibits it. What MIGHT prohibit it is the algorithm that APPLIES the definition/procedure. It seems to be a matter of how we define SELECT, the operation that chooses the objects to be merged. If one can only select SOs within SOs by first selecting the container SO then for SWM select must be 3-place. If we can select an SO just in case it is an SO (i.e. SOs are transparent regardless of where they sit) then Select for SWM can be binary. The conceptual question concerns the transparency assumption. In particular: why assume that SOs contained in other SOs can only be selected if the SO they are contained in is also selected? Can there really be a conceptual argument for this assumption?In fact, doesn't the other assumption, viz. that all SOs are qua SOs potential arguments of Merge seem "simpler" (i.e. less encumbered)? Isn't adding the restriction on select complicating the merge operation? And if so, doesn't one need empirical arguments FOR doing this? And if so don't the objections to SWM just reduce to the standard empirical ones? That's what I think. I also happen to think that the empirical arguments FOR SWM are pretty good and that the putative problems are not that severe, but that's my view. What I don't see is that there is ANY good conceptual argument against it, if one does not encumber Merge with further do-dads, a move that Chomsky finds unfortunate in other venues.

@Alex: I think this is at least in part an orthogo...

2016-05-10T07:13:58.371-07:00

@Alex: I think this is at least in part an orthogonal issue (although an important one). My assumption above was that you *minimally* need to be able to say whether or not you're relating A to something that's contained in it or not. But then it leads to the question like the one you raise (which, if I understand correctly, is identical to the one I mentioned above).

Incidentally, I don't think that English topicalization is a good example here, since it's most likely not simple movement but a kind of dislocation.

I don't see that the binary version of Merge w...

2016-05-10T02:02:35.548-07:00

I don't see that the binary version of Merge would be able to distinguish copies from repetitions in the general case. If the root contains two independent instances of 'John', then setting A=John and B=[...John...John...] doesn't tell you which 'John' has moved. I guess the idea is that Minimality plus some kind of ban on left branch extraction would always disambiguate? But we have e.g.:

John, I gave a picture of t to John.
John, I gave a picture of John to t.

So I'm not sure if that strategy would work.

Not sure that's at issue here, since my C = B ...

2016-05-09T13:50:38.868-07:00

Not sure that's at issue here, since my C = B was meant to mean "intensionally equivalent" throughout. We need someone from the math ling camp to enlighten us.

"If Merge relates A, B, and C, and C = B, the...

2016-05-09T13:15:24.924-07:00

"If Merge relates A, B, and C, and C = B, then it's simply not ternary in the first place; it's just binary."

It seems like there's an extensional/intensional difference at play here and that only on the extensional view is it binary. I can't see immediately whether this is important or not, but that seems to be the issue

Yeah, I think that's what Chomsky has in mind,...

2016-05-09T12:42:09.159-07:00

Yeah, I think that's what Chomsky has in mind, but I should stress again that this is my interpretation (as far as I know, he's never spelled it out).

Not sure I understand what the identity question is supposed to be. If Merge relates A, B, and C, and C = B, then it's simply not ternary in the first place; it's just binary.

A different question might be what happens in case we merge A and B, and B = the root contains repetitions (rather than copies) of A. Perhaps this would derive/predict some sort of superiority.

Ah! now I see it! I had apparently been misinterpr...

2016-05-09T12:36:53.400-07:00

Ah! now I see it! I had apparently been misinterpreting the 'if C contains B' bit to be about what dominates B irrespective of whether C was a root or not (which I think is a fair reading of this). But if it's job is to tell us specifically which root B is under, I can totally see the distinction now.

I guess the question now is how identity can render ternary operations binary, but that's obviously a different question

2016-05-09T12:35:32.188-07:00

This comment has been removed by the author.

The root need not know. Perhaps we can think of it...

2016-05-09T12:22:54.215-07:00

The root need not know. Perhaps we can think of it in this way: the Merge operation always relates A, B, and C. If B = C, that's equivalent to it relating A and B, i.e. it's binary -- this allows Internal Merge. If B ≠ C, it's ternary.

See this is where I feel like I'm being blind ...

2016-05-09T11:59:02.480-07:00

See this is where I feel like I'm being blind to something. How does the root 'know' where the mover is coming from in this case. Sure the root dominates the mover, but it's not like the root wears on its sleeve all the things that it dominates. It seems like you would need to specify the location of the mover as root internal somehow.

No, because in regular upward movement you only ne...

2016-05-09T11:52:53.953-07:00

No, because in regular upward movement you only need two terms: A and B, the root, which contains A.

2016-05-09T11:52:37.952-07:00

This comment has been removed by the author.

@Dennis I get that intuition about why we might ne...

2016-05-09T11:50:48.337-07:00

@Dennis I get that intuition about why we might need to specify where the B is being merged from so we can distinguish copies from repetitions. Assume that works, why doesn't that also hold of upward movement? I imagine we wanna distinguish copies from repetitions for upward movement too, wouldn't that make both upward and sideward movement (weakly) ternary?

@Brooke, Norbert: I wasn't trying to argue for...

2016-05-09T11:37:18.276-07:00

@Brooke, Norbert: I wasn't trying to argue for Chomsky's view here; I'm not sure it's formally compelling. But I do think there is a reason why we have to identify B via C when the latter contains the former. Think of this as the copy/repetitions problem. If you merge A and B, how do you know that B is the B that's in C, rather than some independent B? If you want it to be Internal Merge of (a copy of) B rather than External Merge of (a repetition of) B, then you need to identify B as *the B contained in C*. Whether or not this really and necessarily means that the operation ternary (perhaps it's weakly ternary ;-)), I don't know; but I can see where the intuition comes from.

@ David: Matthew is right. Yes, sans Copy you need...

2016-05-09T10:42:40.855-07:00

@ David: Matthew is right. Yes, sans Copy you need Chomsky's conception of merge. If you allow Copy then you can do without looking into build structure FOR merge. My only question was whether there was a simplicity argument capable of motivating Chomsky's version of merge. I believe he thinks that there is.

@Dennis: I am with Brooke here, and David Adger too in his (non-posted) third Baggett lecture. SWM does not require a 3-place merge operation. It may require 3-place operation if one assumes that one cannot see SOs embedded within other SOs unless one selects the container in the select operation. But, why assume that one needs to? Anger gives some mechanisms for accomplishing this, but, IMO, that does not address the central issue: do we WANT to rule out SWM. I have no problem thinking that there are various ways of doing so should it be undesirable. Note too, and this is a jab at Chomsky, if one needs to ADD something to rule out SWM, then given his views about simplicity, the one that allows for SWM is simpler and so is the default preferred version. Of course, maybe it should be disallowed, but the recursive definition of SO Chomsky assumes allows for SWM if not modified. nd as you know, modification implies complication and hence less simple operation. So, do we want SWM or not? Conceptually, yes. The big issue is the empirical one. I am ready to defend the empirical virtues of SWM, but on the conceptual issue, I think that Chomsky is just wrong to think that the standard definition rules out SWM.

@Dennis. Maybe I'm being dumb, but I never rea...

2016-05-09T08:21:02.225-07:00

@Dennis. Maybe I'm being dumb, but I never really understood that way to rule out sideward movement. So, Merging A with B when B is contained in C, this involves three entities and that's a ternary operation. But I don't see why we are forced to specify the location of B for sideward movement but not upward movement. That is, why do we need to say what B is contained in for some operations and not others?