Comments

Thursday, February 23, 2017

Optimal Design

In a recent book (here), Chomsky wants to run an argument to explain why Merge, the Basic Operation, is so simple. Note the ‘explain’ here. And note how ambitious the aim is. It goes beyond explaining the “Basic Property” of language (i.e. that natural language Gs (NLGs) generate an unbounded number of hierarchically structured objects that are both articulable and meaningful) by postulating the existence of an operation like Merge. It goes beyond explaining why NLGs contain both structure-building and displacement operations, why displacement is necessarily to c-commanding positions, why reconstruction is an option, and why rules are structure dependent. These latter properties are explained by postulating that NLGs must contain a Merge operation and arguing that the simplest possible Merge operation will necessarily have these properties. Thus, the best Merge operation will have a bunch of very nice properties.

This latter argument is interesting enough. But in the book Chomsky goes further and aims to explain “[w]hy language should be optimally designed…” (25). Or to put this in Merge terms, why should the simplest possible Merge operation be the one that we find in NLGs? And the answer Chomsky is looking for is metaphysical, not epistemological.

What’s the difference? It’s roughly this: even granted that Chomsky’s version of Merge is the simplest, and granted that on methodological grounds simple explanations trump more complex ones, the question remains: given all of this, why should the conceptually simplest operation be the one that we in fact have? Why should methodological superiority imply truth in this case? That’s the question Chomsky is asking and, IMO, it is a real doozy, and so worth considering in some detail.

Before starting, a word about the epistemological argument. We all agree that simpler accounts trump more complex ones. Thus, if some account A involves fewer assumptions than some alternative account A’, and both are equal in their empirical coverage (btw, none of these ‘if’s ever hold in practice, but were they to hold then…), then we all agree that A is to be preferred to A’. Why? Well, because in an obvious sense there is more independent evidence in favor of A than there is for A’, and we all prefer theories whose premises have the best empirical support. To get a feel for why this is so, let’s analogize hypotheses to stools. Say A is a three-legged stool and A’ a four-legged one. Say that evidence is weight that these stools support. Given a constant weight, each leg of the A stool carries a larger share of it than each leg of the A’ stool, roughly 8 percentage points more. So each of A’s assumptions is better empirically supported than each of those made by A’. Given that we prefer theories whose assumptions are better supported to those whose assumptions are less well supported, A wins out.[1]
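For concreteness, the arithmetic behind that figure (just spelling out the analogy): with total weight W, each leg of the three-legged stool bears W/3 and each leg of the four-legged one W/4, so the difference per leg is W/3 - W/4 = W/12, roughly 8 percentage points (about 8.3%) of the total weight.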

None of this is suspect. However, none of this implies that the simpler theory is the true one. The epistemological privilege carries metaphysical consequences only if buttressed by the assumption that empirically better supported accounts are more likely to be true and, so far as I know, there is actually no obvious story as to why this should be the case short of asking Descartes’ God to guarantee that our clear and distinct ideas carry ontological and metaphysical weight. A good and just God would not deceive us, would she?

Chomsky knows all of this and indeed often argues in the conventional scientific way from epistemological superiority to truth. So, he often argues that Merge is the simplest operation that yields unbounded hierarchy with many other nice properties and so Merge is the true Basic Operation. But this is not what Chomsky is attempting here. He wants more! Hence the argument is interesting.[2]

Ok, Chomsky’s argument. It is brief and not well fleshed out, but again it is interesting. Here it is, my emphasis throughout (25).

Why should language be optimally designed, insofar as the SMT [Strong Minimalist Thesis, NH] holds? This question leads us to consider the origins of language. The SMT hypothesis fits well with the very limited evidence we have about the emergence of language, apparently quite recently and suddenly in the evolutionary time scale…A fair guess today…is that some slight rewiring of the brain yielded Merge, naturally in its simplest form, providing the basis for unbounded and creative thought, the “great leap forward” revealed in the archeological record, and the remarkable difference separating modern humans from their predecessors and the rest of the animal kingdom. Insofar as the surmise is sustainable, we would have an answer to questions about apparent optimal design of language: that is what would be expected under the postulated circumstances, with no selectional or other pressures operating, so the emerging system should just follow laws of nature, in this case the principles of Minimal Computation – rather the way a snowflake forms.

So, the argument is that the evolutionary scenario for the emergence of FL (in particular its recent vintage and sudden emergence) implies that whatever emerged had to be “simple” and to the degree we have the evo scenario right then we have an account for why Merge has the properties it has (i.e. recency and suddenness implicate a simple change).[3] Note again, that this goes beyond any methodological arguments for Merge. It aims to derive Merge’s simple features from the nature of selection and the particulars of the evolution of language. Here Darwin’s Problem plays a very big role.

So how good is the argument? Let me unpack it a bit more (and here I will be putting words into Chomsky’s mouth, always a fraught endeavor (think lions and tamers)). The argument appears to make a four way identification: conceptual simplicity = computational simplicity = physical simplicity = biological simplicity. Let me elaborate.

The argument is that Merge in its “simplest form” is an operation that combines expressions into sets of those expressions. Thus, for any A, B: Merge (A, B) yields {A, B}. Why sets? Well, the argument is that sets are the simplest kinds of complex objects there are. They are simpler than ordered pairs in that the things combined are not ordered, just combined. Also, the operation of combining things into sets does not change the expressions so combined (no tampering). So the operation is arguably as simple a combination operation as one can imagine. The assumption is that the rewiring that occurred triggered the emergence of the conceptually simplest operation. Why?
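To make the set talk concrete, here is a minimal illustrative sketch (Python; the lexical items and names are mere stand-ins, not anything from the book): Merge simply forms the unordered set of its two arguments, leaves them untouched, and can reapply to its own output, which is all that unbounded hierarchy requires.

    # Illustrative sketch: Merge as unordered set formation.
    # frozenset is used so that the output of Merge can itself be Merged.
    def merge(a, b):
        return frozenset({a, b})

    vp = merge("saw", "bagels")        # {saw, bagels}
    clause = merge("Peter", vp)        # {Peter, {saw, bagels}}: hierarchy, but no order
    assert merge("saw", "bagels") == merge("bagels", "saw")   # no order is encoded
    assert "saw" in vp and vp in clause                       # inputs unchanged (no tampering)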

Step two: say that conceptually simple operations are also computationally simple. In particular, assume that it is computationally less costly to combine expressions into simple sets than to combine them as ordered elements (e.g. ordered pairs). If so, the conceptually simpler an operation is, the less computational effort is required to execute it. So, simple concepts imply minimal computations. And physics favors the computationally minimal. Why?

Step three: identify computational with physical simplicity. This puts some physical oomph into “least effort,” it’s what makes minimal computation minimal. Now, as it happens, there are physical theories that tie issues in information theory with physical operations (e.g. erasure of information plays a central role in explaining why Maxwell’s demon cannot compute its way to entropy reversal (see here on the Landauer Limit)).[4] The argument above seems to be assuming something similar here, something tying computational simplicity with minimizing some physical magnitude. In other words, say computationally efficient systems are also physically efficient so that minimizing computation affords physical advantage (minimizes some physical variable). The snowflake analogy plays a role here, I suspect, the idea being that just as snowflakes arrange themselves in a physically “efficient” manner, simple computations are also more physically efficient in some sense to be determined.[5] And physical simplicity has biological implications. Why?

The last step: biological complexity is a function of natural selection, thus if no selection, no complexity. So, one expects biological simplicity in the absence of selection, the simplicity being the direct reflection of simply “follow[ing] the laws of nature,” which just are the laws of minimal computation, which just reflect conceptual simplicity.

So, why is Merge simple? Because it had to be! It’s what physics delivers in biological systems in the absence of selection, informational simplicity tied to conceptual simplicity and physical efficiency. And there could be no significant selection pressure because the whole damn thing happened so recently and suddenly.

How good is this argument? Well, let’s just say that it is somewhat incomplete, even given the motivating starting points (i.e. the great leap forward).

Before some caveats, let me make a point about something I liked. The argument relies on a widely held assumption, namely that complexity is a product of selection and that this requires long stretches of time.  This suggests that if a given property is relatively simple then it was not selected for but reflects some evolutionary forces other than selection. One aim of the Minimalist Program (MP), one that I think has been reasonably well established, is that many of the fundamental features of FL and the Gs it generates are in fact products of rather simple operations and principles. If this impression is correct (and given the slippery nature of the notion “simple” it is hard to make this impression precise) then we should not be looking to selection as the evolutionary source for these operations and principles.

Furthermore, this conclusion makes independent sense. Recursion is not a multi-step process, as Dawkins among others has rightly insisted (see here for discussion) and so it is the kind of thing that plausibly arose (or could have arisen) from a single mutation. This means that properties of FL that follow from the Basic Operation will not themselves be explained as products of selection. This is an important point for, if correct, it argues that much of what passes for contemporary work on the evolution of language is misdirected. To the degree that the property is “simple” Darwinian selection mechanisms are beside the point. Of course, what features are simple is an empirical issue, one that lots of ink has been dedicated to addressing. But the more mid-level features of FL a “simple” FL explains the less reason there is for thinking that the fine structure of FL evolved via natural selection. And this goes completely against current research in the evo of language. So hooray.

Now for some caveats: First, it is not clear to me what links conceptual simplicity with computational simplicity. A question: versions of the propositional calculus based on negation and conjunction or on negation and disjunction are expressively equivalent. Indeed, one can get away with just one primitive Boolean operation, the Sheffer Stroke (see here). Is this last system more computationally efficient than one with two primitive operations (negation plus conjunction or disjunction)? Is one with three (negation, disjunction and conjunction) worse? I have no idea. The more primitives we have, the shorter proofs can be. Does this save computational power? How about sets versus ordered pairs? Is having both computationally profligate? Is there reason to think that a “small rewiring” can bring forth a NAND gate but not a negation gate plus a conjunction gate? Is there reason to think that a small rewiring naturally begets a merge operation that forms sets but not one that would form, say, ordered pairs? I have no idea, but the step from conceptually simple to computationally more efficient does not seem to me to be straightforward.
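For what it is worth, the expressive-equivalence point is easy to check mechanically. An illustrative Python snippet (not from the text): NAND alone defines negation, conjunction and disjunction, but only via longer formulas, which is exactly why fewer primitives need not mean less computation.

    # Illustrative check: the Sheffer stroke (NAND) defines the other connectives,
    # though each definition is a longer formula than the connective it replaces.
    from itertools import product

    def nand(p, q):
        return not (p and q)

    def neg(p):                 # not-p   ==  p NAND p
        return nand(p, p)

    def conj(p, q):             # p and q ==  (p NAND q) NAND (p NAND q)
        return nand(nand(p, q), nand(p, q))

    def disj(p, q):             # p or q  ==  (p NAND p) NAND (q NAND q)
        return nand(nand(p, p), nand(q, q))

    for p, q in product([True, False], repeat=2):
        assert neg(p) == (not p)
        assert conj(p, q) == (p and q)
        assert disj(p, q) == (p or q)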

Second, why think that the simplest biological change did not build on pre-existing wiring? It is not hard to imagine that non-linguistic animals have something akin to a concatenation operation. Say they do. Then one might imagine that it is just as "simple" to modify this operation so that it delivers unbounded hierarchy as it is to add an entirely different operation that does so. So even if a set-forming operation were simpler than concatenation tout court (which I am not sure is so), it is not clear that it is biologically simpler to ignore the available operation and introduce an entirely new one (Merge) than it is to derive hierarchical recursion from a modified version of concatenation, given that concatenation already obtains in the organism. If it isn't (and how would we tell, really?), then the emergence of Merge is surprising, given that there might be a simpler evolutionary route to the same functional end (unbounded hierarchical objects via descent with modification, in this case modification of concatenation).[6]

Third, the relation between complexity of computation and physical simplicity is not crystal clear for the case at hand. What physical magnitude is being minimized when computations are more efficient? There is a branch of complexity theory where real physical magnitudes (time, space) are considered, but this is not the kind of consideration that Chomsky has generally thought relevant. Thus, there is a gap that needs more than rhetorical filling: what links the computational intuitions with physical magnitudes?

Fourth, how good are the motivating assumptions provided by the great leap forward? The argument assumes that Merge is what got the great leap forward leaping. In other words, the cultural artifacts are taken as a proxy for the time when the "slight rewiring" occurred that afforded Merge and so allowed for FL and NLGs. Thus the recent, sudden dating of the great leap forward is the main evidence for dating the slight change. But why assume that the proximate cause of the leap is a rewiring relevant to Merge, rather than, say, the rewiring that licenses externalization of the Mergish thoughts so that they can be communicated?

Let me put this another way. I have no problem believing that the small rewiring can stand independent of externalization and be of biological benefit. But even if one believes this, it may be that large-scale cultural artifacts are the product not just of the rewiring but of the capacity to culturally "evolve," and models of cultural evolution generally take communicative language to be the necessary medium for cultural evolution. So, the great leap forward might be less a proxy for Merge than for whatever allowed for the externalization of FL-formed thoughts. If this is so, then it is not clear that the sudden emergence of cultural artifacts shows that Merge is relatively recent. It shows, rather, that whatever drove rapid cultural change is relatively recent, and this might not be Merge per se but the processes that allowed for the externalization of Merge-generated structures.

So how good is the whole argument? Well let’s say that I am not that convinced. However, I admire it for it tries to do something really interesting. It tries to explain why Merge is simple in a perfectly natural sense of the word.  So let me end with this.

Chomsky has made a decent case that Merge is simple: it involves no tampering, it is a very simple "conjoining" operation resulting in hierarchical sets of unbounded size, and it has other nice properties (e.g. displacement, structure dependence). I think that Chomsky's case for such a Merge operation is pretty nice (not perfect, but not at all bad). What I am far less sure of is that it is possible to take the next step fruitfully: explain why Merge has these properties and not others. This is the aim of Chomsky's very ambitious argument here. Does it work? I don't see it (yet). Is it interesting? Yup! Vintage Chomsky.



[1] All of this can be given a Bayesian justification as well (which is what lies behind derivations of the subset principle in Bayes accounts) but I like my little analogy so I leave it to the sophisticates to court the stately Reverend.
[2] Before proceeding it is worth noting that Chomsky’s argument is not just a matter of axiom counting as in the simple analogy above. It involves more recondite conceptions of the “simplicity” of one’s assumptions. Thus even if the number of assumptions is the same it can still be that some assumptions are simpler than others (e.g. the assumption that a relation is linear is “simpler” than that a relation is quadratic). Making these arguments precise is not trivial. I will return to them below.
[3] So does the fact that FL has been basically stable in the species ever since it emerged (or at least since humans separated). Note, the fact that FL did not continue to evolve after the trek out of Africa also suggests that the “simple” change delivered more or less all of what we think of as FL today. So, it’s not like FLs differ wrt Binding Principles or Control theory but are similar as regards displacement and movement locality. FL comes as a bundle and this bundle is available to any kid learning any language.
[4] Let me fess up: this is WAY beyond my understanding.
[5] What do snowflakes optimize? The following (see here, my emphasis [NH]):

The growth of snowflakes (or of any substance changing from a liquid to a solid state) is known as crystallization. During this process, the molecules (in this case, water molecules) align themselves to maximize attractive forces and minimize repulsive ones. As a result, the water molecules arrange themselves in predetermined spaces and in a specific arrangement. This process is much like tiling a floor in accordance with a specific pattern: once the pattern is chosen and the first tiles are placed, then all the other tiles must go in predetermined spaces in order to maintain the pattern of symmetry. Water molecules simply arrange themselves to fit the spaces and maintain symmetry; in this way, the different arms of the snowflake are formed.

[6] Shameless plug: this is what I try to do here, though strictly speaking concatenation here is not among objects in a 2-space but in a 3-space (hence it results in "concatenated" objects with no linear implications).

94 comments:

  1. Are there any models of computation where sets are simpler than strings? Where, say, taking the union of two sets (of natural numbers, say) is simpler than concatenating two strings?

    And what are these "principles of Minimal Computation"?

    Replies
    1. Excellent questions. I think that conceptually any theory of grammar will have to have atoms and some mode of combining them. I take this to be, as Chomsky might say, virtually conceptually necessary. So, given this, why also postulate strings? What do they bring to the table? Well, one might say linear order information. But Chomsky would say, right, the "wrong" thing. The question then is how far one can get with atoms and a simple mode of combination. What is the simplest? Well, what does it mean to say that two atoms form a set? It means that they are a linguistic unit as far as Gs go. Now, anyone will have to say at least this. So, let's say AT MOST this and see where we get to. So, the "minimal" analysis will say that we can combine atoms into units, the "minimal" unit being a set ({a, b} "saying" nothing more than that a and b form a unit).

      Why is this better than concatenating them? Well, Chomsky's claim is that concatenation adds to the unit some information about linear order and so is more complex. If we identify conceptual and computational complexity then the inference follows. As you might have noted, I did not quite see what motivates this identification.

      What are principles of minimal computation? Again, there is one strand of argument that illuminates this via conceptual issues like those above. Another strand relies on some intuitive sense of complexity: e.g. long dependencies are more costly than short ones, local computation is better than global. These ideas strike me as natural, and even have interpretations when it comes to asking how computing some dependencies might be effected (think the hold hypothesis for filler-gap dependencies). So, if one thinks that competence theories "peek" at plausible performance consequences, these kinds of notions strike me as unobjectionable. We know that certain computations might well require more memory than others (center embedding) and this is an extension of that mode of thinking.

    2. I now think of 'strings' as 'sets' with extra conditions or assumptions. A string is a set of objects with assumptions or conditions related to precedence, succession, (ir)reflexivity, transitivity, (a)symmetry, etc. A set in merge terms would have a subset of the conditions that a string would have. Maybe only asymmetry. Most of this is likely due to me misremembering training in modal logics as an undergrad. YMMV.

    3. @Alex: Principles of minimal computation, in my view, are understood as evaluation metrics. They can favor smaller grammars from the perspective of space (e.g., MDL) and/or faster grammars from the perspective of time (e.g., the underlying motivation for what I call the Tolerance Principle). So standard complexity considerations apply.

    4. @Raimy: I think you can run the argument exactly the other way too. Suppose you have a binary operation and no axioms: that gives you a groupoid or magma. That implicitly has a linear precedence because it isn't commutative. Making it commutative requires an additional axiom.

      Strings just need associativity to get a monoid/semigroup.

      There isn't a reasonable way of adjudicating between these without asking the questions that Norbert is asking here. But answering them needs more detail. Why should we favour a low number of axioms as opposed to anything else?

  2. Two quick thoughts on the Merge-culture relation:

    (1) There's no "general solution" to the externalisation problem, so that we may take Chomsky's account to imply that the problem of externalisation is something that is solved anew every time in every individual, which is (one of the reasons) why we have variation and different modalities. Variation arises from the properties of the S-M system already in place when Merge emerged and the fact that there's a multitude of solutions to this problem.

    (2) It's reasonable to assume that some kind of communication system was in place before Merge emerged, so another response would be to say that the externalisation problem had already been solved when Merge appeared. This seems to me to be implied by saying that Merge provided a novel means for structuring thought. After all, a lot had to "be there already". If not, we'd have to assume that prior to Merge there was no (social) interaction of any kind, etc., and that's certainly not an assumption that we want to make. I've always taken this to be the reason why Chomsky argues for a qualitative difference and for the importance of the emergence of Merge for providing a novel means of thinking.

    Now, I have no idea whether this really is what Chomsky thinks, but that's how I've interpreted his argument. I agree with you that it's not a perfect argument, but it nevertheless kind of makes sense (or at least I fail to see the error).

    Replies
    1. Why believe that there is a general solution for the externalization of linguistic structures? I assume that there is some biological retrofitting required (thus we process linguistic sounds in a parallel system to non-ling sounds as the Haskins people showed long ago). Now, given that any child when it acquires any language will sound just like any other child (it's not like Jews who learn English invariably speak it with a heavy Yiddish accent) then we must assume that the biological bases for externalization are common across the species. This, if current assumptions are retained, includes a common phonology, phonetics, and more. So, the fact that all people do THIS the same implies that it all got fixed in the same way before human paths diverged AND that whatever happened has not evolved much since (actually, not at all). Sure sounds like the same logic we've applied to Merge. If this is so, then the logic Chomsky has deployed, which I like btw, implies that all of FL including externalization has remained stable. If that implies it is simple, then the dichotomy between a messy set of externalization operations and pretty syntactic ones becomes less compelling. It seems that more than just the syntax is "simple" if this is correct and that more than just Merge is "recent."

    2. I'm not convinced. Clearly, no reason to doubt the uniformity of language capacities across the species, as well as that they have not been subject to selection since they emerged. We can say the same about externalisation. But, from what I understand, all these systems are exaptations on Chomsky's account. And we're actually not all doing it (i.e. externalisation) the same, and some (including Chomsky?) would say that S-M is the only locus of variation. What is recent, according to Chomsky, is a change in the nature of the computational system. This does not imply that there was no such system in place before that; and we similarly have no reason to assume that there was no system for communication in place prior to the emergence of FL(N). If so, "pre-Merge humans" certainly had a rich system of thought (probably all of non-linguistic thought?) and some way of externalising it. A fundamental change in the nature of the computational system then still can have the far-reaching consequences discussed by Chomsky as it supposedly provided a novel means for structuring thought (as he has speculated). On this account, the fact that we have variation results from the fact that there is no "general solution" to the externalisation problem (as I've said in the previous post).--That was what I meant when I said it is being solved anew every time in every individual during language acquisition. Then, in my understanding, the reason why Chomsky assumes that the syntax is "simple" and externalisation is messy is that, contrary to S-M, Merge presumably is the result of a random mutation and has not (yet) been subjected to selectional pressures (due to the purported recency of all this happening).

    3. "And we're actually not all doing it (i.e. externalisation) the same, and some (including Chomsky?) would say that S-M is the only locus of variation."

      There is variation in the externalization but this is at the G level, not the FL/UG level. From what we can tell our externalization CAPACITIES are all of a piece and so whatever this is has evolved once and has remained stable since.

      "On this account, the fact that we have variation results from the fact that there is no "general solution" to the externalisation problem (as I've said in the previous post).--That was what I meant when I said it is being solved anew every time in every individual during language acquisition."

      Yes, this is how I understood you. However, again, the variation is not at the faculty level, only at the individual G level. The capacity is uniform and this is the problem. If there are many roads to externalization then we would expect variation wrt externalization CAPACITIES. But we don't find these, so far as I know. We find differences in the externalization of particular Gs, but any kid will externalize any G the same as any other kid.

      "the reason why Chomsky assumes that the syntax is "simple" and externalisation is messy is that, contrary to S-M, Merge presumably is the result of a random mutation and has not (yet) been subjected to selectional pressures (due to the purported recency of all this happening)"

      I agree with the recap, and that is the problem. The uniformity of the externalization capacity is on a par with the uniformity of the recursive procedure. Why should that be, if externalization is messy and there is no general solution? Why didn't some of us resolve this one way and develop externalization mechanisms different in kind from others'? Or, if we did all develop the same ones, why have they not been subject to any selection pressure? Why don't we see a world where, for example, people with Danish heritage are simply incapable of acquiring a click-based phonology? Or where a resident of east Asia could never learn an agglutinative language? Just like the recursive system, the system for externalizing G products is uniform across the species and has been stable for a long period of time. Why? There is surely enough time for selection to have worked its magic. If there was enough time to differentiate blonds with blue eyes from African ancestors...

    4. I'm still not sure I follow. I think we must be careful not to mix up phylogeny and ontogeny here. Different externalisation mechanisms were "developed" in the sense that they develop routinely during ontogeny, which is why we have variation. Yes, this is G variation, not FL/UG variation. But this G variation is never transmitted to offspring and has not become genetically fixed, which is why we don't see what you described with regard to Danish people being incapable of acquiring a click-based phonology, etc. The capacity for externalisation thus is always the same in principle, but this is akin to saying that we all (initially) had the capacity to become bodybuilders. (Yes, that's certainly an incredibly bad example but I couldn't think of anything better off the top of my head, sorry.) Chomsky's claim is that FL(N) is actually modality-independent and can be externalised in many ways: sound, sign, even touch. This externalisation is what develops during ontogeny, and we all have the same capacity for developing externalisation in any of these modalities due to our phylogenetic history (read: because we have UG). In other words, it is this developmental potential which has become fixed (i.e. is part of UG). If externalisation capacities are exaptations--as Chomsky suggests--this is what we should expect to see. Everything else would be a major surprise and leave variation unexplained. Thus, because this "externalisation problem" is being solved during ontogeny yet has never become “stabilised” phylogenetically since Merge emerged, we are--and I'm guessing here, of course--still left with an externalisation system that is capable of externalising (non-linguistic) thought which is now used to externalise G structures. It wasn't built for this task and this task can be solved in many ways, which is why we get variation (on the ontogenetic timescale). In sum, it thus seems to me that within Chomsky's narrative, we have to assume that the system for externalisation is stable very much in your sense (i.e. in terms of capacity) because it became fixed a long time ago, at least prior to the emergence of Merge/FL(N). But it is not a "good fit" for externalising G products, whereas that seems to be the case when we look at C-I. At least that's how I understood Chomsky's argument; I might be totally wrong of course ...

    5. I agree we should not mix things up. Here is what I have been thinking.
      The scenario we are asked to consider is Merge happens. It is simple and recent and unmolded by the demands of natural selection. Its simplicity accounts for its stability across humans of all kinds. So Merge first, THEN the CAPACITY to externalize emerges. This retrofits a largely in place system to a new generative capacity. But this retrofitting requires some biological tinkering as well. How many times did this happen? My claim: once, just like merge. Why? Because it too is the same across the species so far as we can tell. Kids don't externalize differently regardless of their biological heritage. It's not like Piraha speakers brought up in English will speak it with a funny accent any more than they will be incapable of acquiring long distance movement. So, the idea that externalization was a late add on seems suspect.

      Now you seem to agree. You take it that all that was needed for externalization was in place before Merge. No retrofitting, no biological tinkering to adapt the old system to the new use. I find this unlikely. Is all of phonology really a-linguistic? Did we not develop special cognitive-neural systems just for language? I think we did (we even have evidence that we did). If so, why are these retrofits uniform in the species? The only answer seems to be that like Merge the retrofits were also "simple". Or more accurately, the invidious distinction drawn between our CI capacities and our AP capacities leaves a problem as to why we are all the same regarding both our mappings to CI and our mappings to AP.

      At any rate, the argument for Merge has revolved around features of Merge that so far as I can tell extend to the capacity for externalization as well. If the latter has its properties because it is ancient, then why not the former? If the former has these properties because it is recent, then why not the latter? I just don't see any reason to treat them invidiously. The facts regarding the two kinds of capacities seem the same and hence, all things being equal, should be treated on a par.

      Thx for the push back.

    6. Thanks for the reply. Okay, I see. Well, I myself am not sure whether there's really no phylogenetic retrofitting required, so I remain agnostic re this. But I think that is what Chomsky's scenario entails, plus I think this is what we have to assume when we take the Strong Minimalist Thesis seriously. I'm no expert re phonology, but from what I understand there's little reason to assume that it is something unique to our species or something that specifically evolved "for language" (I remember reading some papers on this by Bridget Samuels, but there's certainly a broader literature). Of course, I agree that we have (a) neural system(s) specialised for language, but this neither necessarily implies nor requires that they evolved "for language." If we are to take SMT seriously then very little or potentially even nothing evolved "for language"--an interpretation that fits Chomsky's exaptation scenario. According to this point of view, what we got when Merge emerged is a remarkably stable developmental complex that yields the same end-product (a fully developed FL) in normally developing specimens. Thus, even Merge can be part of FLB here, as the only thing "needed" is a change that gives rise to this developmental complex. Almost all of FL, possibly even including Merge, then supposedly is FLB and is only being "reused" for language. Crucially, this is not an argument against specialised neural circuitry. Lastly, the retrofitting of an old non-linguistic externalisation system to FL's novel mode of computation could well be what happens during language acquisition.

      Re your last paragraph: I agree that this is a valid criticism: both systems are evolutionarily ancient, so the question really is why FL(N) should cater to the needs of C-I so well as opposed to S-M? Maybe the implicit assumption here is that the nature of a cognitive mechanism/system might change more easily than that of the S-M system? If so, I must admit that I cannot say why that should be the case. Personally, I think that Chomsky's answer that language is "for thought" is interesting, but you're right that this doesn't explain why Merge should fit C-I better than S-M, given their similar evolutionary histories. Maybe I'm missing something here (actually, it seems to me that I am), but this seems like something that could potentially be interesting to look into more closely.

    7. Just found this recent paper by Huijbregts, which is highly relevant here: http://dx.doi.org/10.1016/j.neubiorev.2017.01.041

  3. Some brief thoughts on the first two caveats:

    (1) I agree that we hardly have a clear general conception of simplicity; perforce, we cannot adjudicate issues of computational simplicity on the basis of general conceptual reflection. Still, I’m not sure about your examples. Reducing PC's constants to stroke or dagger, say, makes one’s formulae very long (e.g., ‘not-A’ becomes ‘A|A’, etc.), and in a system of natural deduction, one pretty much gets to define what rules of inference one wants, and so proofs can be shortened accordingly. It might well be, then, that there just isn’t an answer to ‘Is this system simpler than that?’ for many cases without some stipulations. As for sets and pairs, I take it that sets are basic in the sense that we can define n-tuples set-theoretically (classically, <a, b> equals {{a}, {a, b}}, and so on for triples, etc.), but we can’t go the other way. If you have sets, ordered sets are just a trick, i.e., no additional primitives are required (see the small sketch below).

    (2) If we have concatenation primitively, I don’t quite see how to get Merge by building upon it. One would need to make concatenation non-associative, remove order, and add some recursive principle, if concatenation is finite, in order to get Merge. I don’t see how this would be building on concatenation – it looks more as if one is stripping properties from concatenation and then potentially introducing a new recursive principle. Basically, if concatenation stands to Merge in the ways indicated, then the former shouldn’t be the evolutionary foundation for the latter. So, I don't think the issue is so much whether one can define a magma or not in terms of strings or sets, but that one needs a non-associative operation, which concatenation is not.

    I've got the kids today, so perhaps I missed something:)
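      On the set/pair point in (1), a quick illustrative check (Python frozensets; example only): Kuratowski's trick recovers order from sets alone, so ordered pairs add no new primitive beyond sets.

      # Kuratowski encoding of ordered pairs from sets alone: <a, b> = {{a}, {a, b}}.
      def kpair(a, b):
          return frozenset({frozenset({a}), frozenset({a, b})})

      assert kpair(1, 2) != kpair(2, 1)                 # order is recoverable from sets
      assert frozenset({1, 2}) == frozenset({2, 1})     # the bare two-membered set is not ordered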

    Replies
    1. @John: one can easily make concatenation non-associative by adding two symbols "<" and ">" and defining
      Merge(A,B) = "<" + A + B + ">"
      This is not associative.

      My problem with this whole discussion is that while we have a perfectly good example of a computational system where concatenation is a very simple operation (i.e. Turing Machines) I am not aware of examples of systems where set union is a very simple operation. So from one perspective, sets may be very primitive, but it's not clear that that is the relevant perspective.
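      A tiny illustration of Alex's construction (Python, purely illustrative; the bracket symbols are just stand-ins): adding the brackets destroys associativity, which is what lets the string record hierarchy, while bare concatenation stays associative.

      # Alex's bracketed concatenation: Merge(A, B) = "<" + A + B + ">".
      def bmerge(a, b):
          return "<" + a + b + ">"

      # Plain concatenation is associative...
      assert ("a" + "b") + "c" == "a" + ("b" + "c")
      # ...but the bracketed version is not: "<<ab>c>" vs "<a<bc>>".
      assert bmerge(bmerge("a", "b"), "c") != bmerge("a", bmerge("b", "c"))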

    2. Alex: Right, you can introduce some bracketing, but still haven't got Merge, for Merge is unordered, even in the 2-element case. So, I still don't see how to take concatenation as basic. Also, Merge isn't set Union.

  4. A few random, but perhaps connected, comments about Merge.

    a. Merge is the computational operation that interfaces with the Lexicon to produce "linguistic expressions". The objects it produces are lexical items in a hierarchical structure, not strings. Strings are derived from a linearization operation that applies elsewhere in a derivation.

    b. Merge is definitely not set union because the set union operation does not create hierarchical structure, which is crucial for an account of structural ambiguities (e.g. exceptional students and teachers, every politician who cheats repeatedly lies).

    c. So the optimality of Merge depends on whether it is the simplest computational operation that creates hierarchical structure. Are there other candidates?

    d. Binary Merge constrained by No Tampering yields both structure-preservation (cf. Emonds MIT dissertation 1970 and elsewhere) and strict cyclicity. See Freidin (2016) "Chomsky's Linguistics: the goals of the generative enterprise" in Language (September) and also a forthcoming article on cyclicity in syntax in the Oxford Encyclopedia of Linguistic Research. Further evidence for the optimality of Merge?

    e. Consider Norbert's question: why does Merge have these properties (unbounded hierarchical structure, displacement, structure dependence) and not others? Adapting the late Irwin Corey's answer to the question "why do you wear tennis shoes?" (obituary, New York Times, February 7, 2017), we can answer:

    “Actually, that is two questions. The first is ‘Why?’ This is a question that philosophers have been pondering for centuries. As for the second question, [‘Does Merge have these properties?,’] the answer is yes.”

    One straightforward answer to the why-question is that given our current understanding of the properties of human language, these are the properties that a computational operation that interfaces with the lexicon requires, and lacking empirical evidence for other properties, there is no reason to postulate a computational operation that has them.

    Maybe ultimately this is a question that neuroscience can answer when the unification with linguistics happens, if ever. But even if there is a unification in some probably distant future, the question that will have to be addressed is the connection between knowledge and behavior. Merge is an element in a theory about knowledge of language, not linguistic behavior. As far as I know, there are no theories about how knowledge is converted into behavior in any domain.

    Replies
    1. Comments to comments:
      b. Merge is not set union. Well, it is not JUST set union. Say Merge were set union PLUS select, the latter being an operation that maps an expression to its unit set. Then set union and select could generate hierarchical structures (I've shown this in some talks recently). So saying that Merge is not set union is true, but not necessarily relevant. The question is whether Merge is primitive or composed of other operations, one of which is set union (a genetic relation if there ever was one. And simple, boy is it ever!).
      c. Maybe. This is what I've been working on. Others have too. So Jeff Heinz and Bill Idsardi and Thomas Graff have been thinking of ways of making phono operations and syntactic ones very similar modulo the basic objects they compose. So, there are other ideas out there, though IMO, Merge as Chomsky does it is a very good one.
      d. I don't see how it derives structure preservation. But it does get a bunch of very nice stuff: strict cycle, no lowering rules, c-command requirement on movement, syntactic bases for reconstruction etc. I discuss this in a forthcoming chapter in McGilvray's second Chomsky handbook.
      d. What I found interesting about the "creatures" book is that it attempts an answer to the why question that does not await philosophy or neuroscience. It is basically an evo account. I don't think that it works, but that is the ambition. As you rightly note, this is different from arguing for Merge on the basis of standard arguments. And that is what makes Chomsky's argument interesting: it really does try to answer the metaphysical why question exploiting the idea that only a simple innovation is plausible and Merge is it.

    2. c. So the optimality of Merge depends on whether it is the simplest computational operation that creates hierarchical structure. Are there other candidates?

      Depending on how you set up your criteria, the answer is either yes or no. That's the fundamental problem with these debates, there's constant recourse to computational considerations without a clear commitment as to what computational factors are assumed to matter and why.

      In general, sets are a simple object from a mathematical perspective, not a computational one. And that holds on many levels. Alex already mentioned Turing machines as a very simple computational system built on strings, not sets. For a more applied example, consider that there's not a single programming language that has sets as a primitive data type from which arrays and lists are derived. In fact, few programming languages have sets as a data type precisely because they are *not* easy to deal with at the implementation level. For instance, an object without internal order is a nightmare for any search operation. That's actually why sets are often implemented as an optimized form of hash tables, a much more complex object than a list. I'm also not aware of any algorithms that have been improved by moving from ordered objects to unordered ones. So whether you're looking at it from the vantage point of formal language theory, algorithms, or programming, sets are not simple.

      In the other direction, a much more general point can be made: the fact that we see effects that seem hierarchical in nature does not imply that the structures themselves are hierarchical. Consider a finite-state automaton that generates some regular string language. For any sufficiently complex FSA, you will be able to give a more compact representation via transition networks (with specific restrictions on cycles). A transition network is basically a collection of FSAs: an automaton may have, say, an NP-transition from state p to state q, which means that at state p we switch to the NP automaton and if we can make our way through that FSA, we reemerge in state q. You can then map out how the recognition of a specific string moves you through the transition network, and this computational trace will have a tree structure --- the hierarchical structure of the string. That's pretty much the same idea as Steedman's adage that syntactic structure is the trace of computing sound-meaning mappings.

      Since even bigram models have FSA representations, hierarchical information can be taken to exist even for very weak string languages. It only starts to really pay off once you move to more complicated string languages, but that's not crucial here. What matters is that you get hierarchical, Merge-like structures from any non-trivial computational mechanism, including those based on concatenation.
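      One toy way to picture the "hierarchy as the trace of computation" point (an illustrative sketch only, not Thomas's own example): a recognizer for the regular language (ab)+ only ever scans the string left to right, yet the record of its own runs is a nested, tree-like object.

      # Toy recognizer for the regular language (ab)+ whose computational trace is a tree.
      def recognize(s, i=0):
          """Return a nested trace if s[i:] is in (ab)+, else None."""
          if s[i:i+2] != "ab":
              return None
          if i + 2 == len(s):
              return ("a", "b")                       # final pass through the loop
          rest = recognize(s, i + 2)
          return ("a", "b", rest) if rest else None   # this pass dominates the rest of the trace

      print(recognize("ababab"))   # ('a', 'b', ('a', 'b', ('a', 'b'))) -- a right-branching trace
      print(recognize("abba"))     # None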

    3. Thomas: I find this very helpful. Would you be happy for your general moral to be expressed as follows: one can take Merge to be primitive in the sense of its not being formally definable via other mechanisms/relations, but all one really wants from Merge is a cluster of properties, which can be delivered by basic operations not definable over sets? If so, I'd be happy with that. I've never thought of Merge, qua set-theoretical, as pertaining to computational implementation, but more as an abstract condition any relevant computation would meet that explains the 'basic property'/'virtual conceptual necessities'.

    4. Yes, that's pretty much it. The underlying sentiment that we should focus on properties of objects/operations rather than the specific encoding of those properties is actually the core message of 90% of my FoL posts (e.g. in the sets VS strings debate one or two weeks ago).

      The other 10% are idle ramblings on board games and scientific publishing.

    5. Thanks, Thomas. An attractive position, I think.

    6. This comment has been removed by the author.

    7. Re Merge and Set Union, Norbert suggests that Merge = Set Union + Select. Set Union clearly doesn't create hierarchical structure. Why is it needed at all? Suppose it is irrelevant for the computational system for human language. Then Merge = Select. According to Norbert, Select "maps an expression to its unit set". However, the computational system maps the Lexicon onto "linguistic expressions"--i.e. linearized hierarchical structures composed of lexical items. The notion "string" is derivative. Does it add anything significant to the characterization of linguistic expressions? If “expression” in the definition of Select is “string of lexical items”, then Select ≠ Merge because Merge does not map an expression onto a “unit set” (a synonym for linearized hierarchical structure?).

      Re binary Merge and structure preservation: No Tampering prohibits altering the structure of syntactic objects created by Merge. That's structure preservation, from which it follows that there can be no "lowering" operations. Emonds in his 1970 dissertation proposed that all cyclic transformations are "structure preserving" in the sense that their output could be filtered by the phrase structure rules of the base component. This insight now follows from binary Merge + No Tampering, where every structure-building operation is structure-preserving (which wasn't the case in Emonds 1970).

    8. I wasn't thinking in terms of strings. Here is the idea. Select is an operation defined over SOs. It maps an expression to its unit set. They can then be combined via 'U'. So saw --> {saw}, bagels --> {bagels}, U them and you get {saw, bagels}. Peter --> {Peter}, {saw, bagels} --> {{saw, bagels}}, U them and you get {Peter, {saw, bagels}}. As you can see, we can get hierarchical structures as deep as you want. So, Merge is actually two operations: SOs to unit sets and then combination via U. This is partly in service of another question: Is there anything that "unifies" the class of SOs? What's in the domain of the select function? Well, clearly, for U to play any role in combining elements, we need select to apply to LIs. My thought was that the big trick came when select came to apply to the products of U. I proposed that this relates to the possibility that select gets defined over labels, with LIs being their own label and derived categories acquiring a label endocentrically. Labeling, on this view, is a way of creating equivalence classes of items with lexical moduli. That was the idea. I wanted (and want) labeling to be something that takes place during the derivation (not at the interface as Chomsky does) because it seems a nice way to derive the fact that constituents count in the syntactic computation. Why? Well, there is a minimality story one can tell, that I do tell in the 2009 book, and others have told it as well. So, with labels we get constituency facts like XP movement (no X' movement) pretty much for free.

      So, can we generate unbounded hierarchical structures with select and U? Yes. Should we? Maybe. I think that there are other virtues to doing this, but I won't go into them here.
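      To make the Select-plus-U bookkeeping explicit, here is a minimal illustrative sketch (Python frozensets standing in for SOs): Select wraps a syntactic object in its unit set and U then unions the two unit sets, reproducing the {Peter, {saw, bagels}} derivation above.

      # Sketch of Merge decomposed as Select (SO -> its unit set) followed by set union.
      def select(x):
          return frozenset({x})

      def u(a, b):
          return a | b                       # set union

      def merge(a, b):                       # Merge(a, b) = U(Select(a), Select(b))
          return u(select(a), select(b))

      vp = merge("saw", "bagels")            # {saw, bagels}
      clause = merge("Peter", vp)            # {Peter, {saw, bagels}}
      assert clause == frozenset({"Peter", frozenset({"saw", "bagels"})})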

    9. I think this method works to give one the effects of merge (i.e., binary hierarchical structures), but I can't see how it is an analysis or reduction of merge in the sense of explaining the target notion in terms more simple or less stipulative than the target itself. Firstly, the effects of select are immediately undone, as it were, by U, so what is the independent rationale for select? Secondly, why is U restricted to the binary case? Set union is defined for any number of sets, including just one set, so while a restriction to the binary case issues in the effects of merge, it appears stipulative.

    10. The binarity issue is neutral wrt Merge or U as the basic combination operation. There is no reason why Merge is binary either. As for why select acts as it does, the idea is that U was a non-ling operation that was pre-available. To use it required select; select enables the use of U. So GIVEN U, select allows things to get going. That's the miracle. Note that if we assume this then all the basic properties of the outputs of merge hold in virtue of properties of U. No tampering, sets as products, the cycle, c-command, etc.: if U is the relevant operation these features follow. I also believe that one can tell a story in which labels matter in the syntax and so big constituency facts follow. If this is right, it's a nice effect. Right now there are no explanations for the basic constituency facts we know and love.

      So, that's the reasoning, opaquely delivered.

    11. My grumble about 'sets' would go as follows: sets originally have nothing to do with computation, and seem to have been developed as a common framework for defining things in various branches of mathematics (especially unifying arithmetic and geometry, and sorting out the foundations of calculus), and have various properties that are probably irrelevant to syntactic theory, such as that there is a set of Greek islands owned by Avery Andrews and likewise a set of hi-end Italian sports cars owned by the same person, and that these two sets are identical because they are empty. And maybe other properties that are not so irrelevant, such as whether we assume the Axiom of Foundation or not, but this means that we need to focus on the properties of whatever is created by the Merge operation.

      The useful content of the 'set' idea seems to me to be that the identity of the object created by Merge is determined only by the identity of the two things merged, and nothing else (Axiom of Extensionality), so no relative order, and no extra tags attached by multiple Merge-like operations.

      If this is what is meant, I think it would be better to say it that way, stating the properties that are taken to be relevant, and to stop talking about sets as such, which raise all kinds of irrelevant issues and difficulties, up to the ontological problems that some people find with the idea that brain operations can create mathematical objects ... the idea that a mental operation can create a mental object whose identity is determined only by the identities of the things combined to produce it, otoh, seems like it should be unproblematic, albeit a bit far from operationalizable empirical testing, and it can clearly be implemented in many different ways.

    12. Thanks. I can see the reasoning. Still, if I may... I remain unclear on the putative advantage the two-fold proposal has over Merge being treated as a primitive. Firstly, Merge is non-ling just as U is. Secondly, restricting U as a set theoretic operation to the binary case looks purely stipulative from a non-ling perspective, whereas Merge is posited as a gizmo magma that is most simple at meeting interface conditions, which doesn't look implausible (i.e., given some combinatorial closure principle is required, the binary case appears to be both sufficient and necessary at delivering interpretable units, whereas n-ary operations don't, for n distinct from 2). Thirdly, my qualm over select wasn't so much that it plays a triggering role, as it were, for U, and so is subservient to U, but that its effects are immediately undone by U, as if it never applied, not showing up in the nature of constituent structure, interpretation, agreement, checking, etc. If one could show evidence for U, then one would have indirect evidence for select, but I suppose I find it hard to think what the evidence for U would be that wouldn't equally be evidence for Merge.

    13. I like Avery's way of putting things and agree we should not place too much pressure on sets. Sets stand proxy for the idea of a combination operation where the identity of the unit is entirely determined by the identity of the things that produced it.

      Re U and why deconstruct merge in its terms: several possibilities. First, it allows for ways of non-hierarchically combining atoms. It's the old beads-on-a-string idea, and what differentiates human lang is that it is not this, but hierarchy. My thinking was: what would we need to add to an earlier non-hierarchical system to get hierarchy? Well, that would depend on what earlier system we had. Say it was being able to combine atoms into sets, but flat ones. Then the hierarchy comes from being able to "select" the sets for further combination, and this would be a reflex of labeling units. Labeling renders a complex of the same type as the atom. In other words, it creates equivalence classes of expressions with atomic moduli. So why can complexes merge like atoms? Because they are, in some sense, mapped back to atoms. This takes endocentricity very seriously, which may be a problem.

      Second, we can ask why we have no tampering. The current "explanation" is that it's simplest to assume this. Ok, maybe. But here is another answer: because our mode of combination has no tampering (i.e. Inclusiveness and Extension) as an intrinsic property. These are not, as it were, extrinsic to the combination operation but built-in features (as they would be if merge combines select and U).

      So this is how I have been thinking about matters. U seems like a very simple primitive operation. I can imagine that it had a nice long prior existence and served to combine thoughts in a primitive way. If so, then the big change was applying an existing operation to a novel kind of object, the products of combination; we extended select from simple LIs to sets of LIs, and this is a reflection of labeling, an operation that brought expressions previously outside the domain of select into its domain.

      Note, btw, when Chomsky says that Merge applies to constructed objects we have not yet specified a domain for the operation. 'Constructed', in fact, covers two separate kinds of things: LIs and expressions constructed from LIs. It's not hard to imagine the possibility of being able to do the first but not the second. This is what I am imagining, and I am asking what could have licensed the induction step.

      Ok, enough rambling. Thx for pressing.

      Delete
    14. Thanks for all of that, Norbert - really helpful. So, it is a kinda version of your 2009 position, at least as regards motivation. I agree entirely with the Avery point - obviously sets aren't in the head; we just want various properties of sets to be reflected in whatever procedures the head realises. I think this basic point, though, holds across the piece, whether one goes for Merge or S+U, insofar as both trade in sets as primitives.

      Delete
    15. If we can agree that we don't need sets to characterize SOs, then Select is both unnecessary and irrelevant. Therefore U (as in "Union"?) is just another name for Merge and Merge is an elementary operation--i.e., it cannot be decomposed. Derivations would be simplified if Merge accesses the Lexicon directly.

      Delete
    16. I think that I was too concessive. The more I think about it, the less I want to say that we don't need sets. Of course we do. Just like we need tensor geometries to describe space and matrices to describe quantum properties. So, yes, we need sets, or, more accurately, the phenomena look like they are well modeled if we assume that phrase markers are sets. Does this mean sets are in the head or cognitively real? Well, you tell me. Interested in methodological dualism? If it's OK for physicists to ascribe geometrical properties to space(time), why can't we ascribe set-theoretic properties to minds/brains? So, I guess that I think we do need sets, in that without them we don't explain why phrase markers have the properties that they appear to have.

      On this theme, let's get to U. The relevant evolutionary question is whether U might have pre-existed whatever it is that allowed human linguistic facility to emerge. I am assuming that it could have. In other words, animals had/have the capacity to treat bunches of objects as (unordered) units and to combine such units to make bigger ones consisting of the two previous ones. I am NOT assuming that these units have unbounded hierarchical structure. So, what I take to be the interesting property of FL is that it serves up Gs with unbounded HIERARCHY. And if this is the basic defining feature, then there is a question one might ask: can one get this feature using prior available operations? Or, more pointedly: what must you add to a system that can build bigger and bigger FLAT units so that you get bigger and deeper hierarchical units? Or, how does one go from beads on a string to phrase markers with unbounded depth? My suggestion is that one allows mapping of SOs to unit sets. IF I am right, then the capacity to map atoms to unit sets already existed, so the big innovation was extending this to sets themselves. I even suggested what licensed this step: the move from atoms to labels. This endorses an endocentric view of labels (and yes, I know that this is not fashionable nowadays). So the "real" innovation, I suggested, was not the operation that combined items, but the operation that allowed this combination to extend to combined items.
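
      To make the decomposition concrete, here is a minimal sketch in Python, using frozensets as stand-ins for the set-like objects (the names select, u and merge are mine, purely for illustration): Merge(x, y) falls out as U(Select(x), Select(y)), and, since union never alters its inputs, Inclusiveness and Extension come along as intrinsic features rather than extra stipulations.

      # A sketch of the Select + U decomposition floated above.
      # frozensets stand in for the set-like objects; atoms are plain strings.

      def select(x):
          # map an SO (atom or set) to its unit set: x -> {x}
          return frozenset([x])

      def u(a, b):
          # combine two (unit) sets by union: the putatively pre-existing operation.
          # Union never alters its arguments, so no tampering is built in.
          return a | b

      def merge(x, y):
          # the derived operation: Merge(x, y) = U(Select(x), Select(y)) = {x, y}
          return u(select(x), select(y))

      the_cat = merge('the', 'cat')    # frozenset({'the', 'cat'})
      saw_vp  = merge('saw', the_cat)  # frozenset({'saw', frozenset({'the', 'cat'})})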

      Next point: does Merge access the lexicon directly? I frankly don't see the relevance of this point. Maybe yes, maybe no. It depends on whether you think Numerations play any role. As you surely know, some famous people have argued (poorly, IMO) that accessing the lexicon repeatedly is computationally inefficient (something about silk refineries) and so that a one-time SELECTION from the numeration precedes the application of Merge. Even if one eschews Numerations, what does "access" mean? Merge accesses specific items, not the lexicon as a whole. It maps specific items to phrase markers, not the lexicon to phrase markers. So, I am not sure what you mean.

      Last point: the nice properties of Merge (inclusiveness, extension, no order) only follow from the conceptually simplest conception. What this notion of simplicity has to do with the one we want for MP purposes is what I was trying to focus on. It is worth noting that IF Merge (or the effects of Merge) is U plus Select, then these properties follow as consequences of the INTRINSIC features of these two operations. We are not talking conceptual simplicity anymore. So, if phrase markers are products of U plus Select as outlined, then generative procedures will obey inclusiveness and extension and generate PMs that have set-like features. Given that PMs appear to have such features, I take that to be a good thing.

      So, maybe sets ARE in the head and PMs really ARE sets. I am a hard core realist when it comes to properties of generative procedures and phrase markers. If I weren't I would be a methodological dualist and though dualism has its charms, methodological dualism is pernicious.
      So, no more concessions.

      Delete
    17. Just a point about sets... I take sets to be mathematical objects, some of whose properties place them outside of space and time, e.g., every set has the empty set as a subset (we can do set theory with no ur-elements, of course: start with nothing and off you go!). Still, just like any indispensable bit of mathematics in science, sets have empirical import in terms of their properties. That is, a theory articulated in group theory or tensor calculus or set theory (etc) will be explanatory precisely because of such a particular articulation. So, I think we can have our cake and eat it hereabouts without committing to sets in the head or falling foul of methodological dualism. We just don't do the metaphysics beyond what explanation calls for, no more than the physicist has to do the philosophy of mathematics. The deep problem, I think, is the 'unreasonable effectiveness of mathematics' (Wigner), which holds across the piece.

      Delete
    18. I like forever-eatable cakes. Very cost efficient. However, what I don't want is that we start treating our abstracta as if they were different in kind from those in any other domain of inquiry. If we explain properties like structure dependence in terms of Merge generating set-like objects, then we must be assuming set-like objects. Are they sets? Well, yes, in the respects that we need them to be. Are they "in the head"? Yes, otherwise we could not explain structure dependence in these terms. So, I am a hard-core realist. No concessions. No backing down. Of course this might be wrong. But not because it is too extreme to assume sets in the head; rather, because we have the wrong formal description. But does anyone think that trees in the head or vectors in the head or tensors in the head are better? So, let's keep eating that cake in the knowledge that we have no problem assuming that, IF the products of Merge are sets and Merge is right, then we have sets in the head.

      Delete
    19. Yeah, I think we perhaps don't disagree here. It is enough if linguistics is in the same boat as physics as regards the indispensability of certain mathematical objects to deliver the right explanations. I just think one can have that happy concord without thinking that sets are in the head rubbing shoulders with the cells. Modelling the mental processes in set terms might be essential to delivering the desired explanations. Suppose that is so. That should suffice. Being a hard-core realist is OK, but that is a philosophical position on which the explanations of the relevant theories appear not to hinge.

      Delete
    20. I'll leave you with the last word.

      Delete
    21. I very much respect the desire to discern the minimal sort of structure needed to compute syntax, and I see that set formation (from A and B form {A,B}) is a very minimal way to do merge. However, this is just a notation for unlabeled, unordered, binary branching trees. When it comes to movement, though, set formation seems singularly ill-suited, which has led to very complicated proposals about chain formation etc. The intent seems very clear, however, and can easily be expressed using a more tree-like perspective, in terms of multiple dominance. Merge and Move continue to be the same operation (just create a new node, and two edges), and everything works as desired, without any hassle.

      This perspective is much simpler and more elegant. Sticking with the set-formation picture raises brutal (purely notationally motivated) complications, and I have never understood why it hasn't been viewed as a reductio ad absurdum of itself. Hans-Martin Gärtner's dissertation (which later turned into a book) appeared in the late nineties and noted exactly this.
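
      A rough sketch of the graph alternative, assuming a toy node-and-edge encoding (the class and names are mine, purely illustrative): Merge and Move are literally the same operation, and multidominance is just what you get when the second argument already has a parent elsewhere.

      # Toy graph: nodes are integers; internal nodes record their two daughters.
      # merge is one operation: add a new node and two edges. Because a daughter
      # may be ANY existing node (not just a root), "movement" is simply
      # multidominance -- no copies or chain-formation machinery required.

      class Graph:
          def __init__(self):
              self.next_id = 0
              self.label = {}      # node -> lexical item (leaves only)
              self.daughters = {}  # node -> (left, right)

          def leaf(self, item):
              n = self.next_id; self.next_id += 1
              self.label[n] = item
              return n

          def merge(self, x, y):
              # new node with edges to x and y; if y is already dominated
              # somewhere else, it is now multidominated
              n = self.next_id; self.next_id += 1
              self.daughters[n] = (x, y)
              return n

      g = Graph()
      what, ate, john = g.leaf('what'), g.leaf('ate'), g.leaf('John')
      vp = g.merge(ate, what)   # external merge
      tp = g.merge(john, vp)
      cp = g.merge(what, tp)    # "internal merge": 'what' now has two parents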

      Delete
    22. Greg: For what it's worth (others will disagree), I take the Merge proposal to be that all syntactic objects are to be modelled as being in the transitive closure of binary set formation defined over lexical items, where no other operations apply: no union, intersection, subsets, etc., still less higher-order operations/properties (e.g., the ancestral, etc.). From this restriction, you get just the relevant properties to explain the syntactic phenomena (or so the claim goes). If that's right, Merge doesn't really deal in sets, if sets are what ZFC says they are, but rather a narrow class of properties (unordered pairwise combinations of combinations of... objects) realized by a tiny fragment of the set universe, hence the choice of model. Anything can be in the head, as it were, so long as its organisation realises the relevant properties. I don't see any danger of use/mention confusion on this construal, for the account is indifferent between graph representations or any other notation, although, of course, such confusions are easily made if one erroneously takes sets to be essentially notated in a specific way. Substantive issues can arise with particular proposals, such as multidominance, which I don't understand from a Merge perspective - notation aside, an individual element cannot be a member of two different members of a set and yet be counted only once.
      Tristan: Do you mean where else other than the head? I'd say that sets are not anywhere. That is to say, it's not that the head is too small or wet or whatever to contain sets, but that sets are just not denizens of space-time.

      Delete
    23. John: This:

      all syntactic objects are to be modelled as being in the transitive closure of binary set formation defined over lexical items

      is a notational variant (expressed in the vocabulary of sets) of this:

      all syntactic objects are to be modelled as being built up from lexical items by means of tree formation (taking two SOs and making them the immediate daughters of a new root node)

      The problem with these proposals is that they do not give us the structures we want. Transformational linguists have never wanted trees, they want structures that encode movement relations. We can think of these in any number of ways, but simple trees do not cut it. (It is a surprising fact, revealed by Ed Stabler's pioneering work, that trees can indeed be made to cut it, if there are strong enough restrictions on movement. This is an instance of more general work on encoding (sets of) graphs with bounded tree-width as (sets of) trees.)

      There are two obvious options as to how to encode movement relations. The first is not to do so. This is the approach taken by the literal copy theory of movement (inspired by too close a cleaving to the set notation), which requires supplementation with a magical operation of chain-formation. The second obvious option is to encode movement relations with multiple dominance. The set perspective on trees is not able to represent this, which is a consequence of the fact that trees are not the kind of data structure that linguists want to be talking about. But the tree-formation operation (add a new root, and two new branches from the root) exactly captures the things linguists want to do.

      My puzzlement is simply this: why have people clung to a notation which doesn't get them what they want, when they have one that does?

      Delete
    24. Thanks, Greg, and apologies if I misconstrued your intent. I also fully endorse the notational point - it is simply easy to state the generalisation in set terms. I'm still not sure, though, of the force of your quandary.

      (i) I don't take GG to be essentially concerned with finding an appropriate format for movement (as opposed to trees). There is a class of phenomena (call it displacement) which GG theories target (inter alia), along with related approaches. Some of these theories employ movement, others don't. Perhaps displacement is not univocal. Who knows what the truth of the matter is, but the basic phenomena are agreed upon. So, let's suppose there is an ideal format for movement; it doesn't follow that such a format furnishes the right account of the displacement phenomena. As you say, copy theory might not be best thought of as a movement theory, any more than early GT was, but its other virtues might be such as to make that irrelevant.

      (ii) It might not be a nice consequence of the simple tree/set approach that it precludes multidominance, but it is unclear that multidominance holds the key to all displacement phenomena. It has obvious charms with respect to RNR, ATB, etc., but less so with topicalisation, verb movement, normal wh-behaviour, etc. Or so it seems.

      Delete
    25. There are certainly many different approaches to displacement phenomena. However, the one in the common ground on this blog involves treating displacement as such. In that context, trees do not offer a way of formally representing something being associated with two positions. One approach involves co-indexation, another multiple dominance. These represent the same information, but multiple dominance is more perspicuous. These are not different theories, the 'copy theory' and the 'multidominance theory'; they are just different notations for expressing the kinds of structured objects minimalists want to be describing. I do not care how one talks about the structures one wants, but I am puzzled why one embroils oneself in a world of pain to encode multiple parentage using a notation which can only describe trees, when one could simply, and with the same unification of merge and move, use a more appropriate notation.

      Delete
    26. I think we might be on different pages as regards notation vs. theory (notational variants are perforce empirically equivalent). Merge, as I characterised it, precludes multidominance - this has nothing whatsoever to do with notation. Introducing indexes or some other mechanism on top of Merge is an addition to both notation and theory. Suppose some multidominance account captures the same displacement phenomena as Merge+indexes (or whatever). It just doesn't follow that they are notational variants. For starters, there could be all kinds of other theoretical or empirical considerations in favour of one over the other.

      Delete
    27. We're on the same page wrt notation vs theory. We're on different pages re: the intended structures of minimalist linguists. Merge qua set formation and merge qua new root with edges to the roots of the arguments are notational variants. They describe exactly the unlabeled, unordered, binary branching trees. These structures are not in fact what minimalists want. They really want structures which can be described in terms of the trees with traces familiar from GB. The simplest way to describe these things which meets the familiar minimalist desiderata (unifying merge and move, no wonky stuff) is using multiple dominance, and generalizing the way you put structured objects together by adding a new node, and edges to (possibly) non-root nodes of the inputs. Instead, they have added kludges atop the set notation.

      Suppose some multidominance account captures the same displacement phenomena as Merge+indexes (or whatever). It just doesn't follow that they are notational variants.
      This is true. This is not what I am talking about. As Kracht shows in the article linked to above, multidominance accounts are equivalent to merge+copies in the sense that any analysis in one can be reformulated in terms of the other.

      Merge qua set formation+indices is in fact more powerful than Merge qua set formation+copies (or the equivalent multidominance). This is because there is no requirement that the things bearing the same index be identical.

      For starters, there could be all kinds of other theoretical or empirical considerations in favour of one over the other.
      I have been saying that there appear to be zero theoretical considerations in favor of merge qua set formation+copy over multidominance.

      Delete
    28. Thanks, Greg - this is very helpful. I must read the Kracht. I understand the reasoning, but, as mentioned previously, I've always seen multidominance as suited to cater for two positions that are interpreted alike, as in RNR or ATB, but displacement in general doesn't have that feature, a point Citko stresses in her book, for example. So, I remain unclear why multidominance is the way to go, notwithstanding the kludgy requirement of adding things to Merge+copies. Perhaps I need to think out of the box (or should that be 'set'?). On the last point, I was thinking of interface issues, or how the syntax interacts with interpretive systems. More particularly, there is an issue of how labelling ought to work.

      Delete
    29. Just to jump in a bit: though multidominance does indeed seem well suited to capture constructions like RNR and ATB, I don't think it actually works out all that well. I've argued that, to the degree that multidominance accounts of those constructions make predictions regarding structural symmetry, they make the wrong predictions. Sure, the shared element in those constructions is thematically linked to two positions, but by most any other diagnostic you look at, asymmetries abound, undermining the utility of multidominance approaches. I talk about this in my dissertation, and also in Studia Linguistica in a paper called "Right node raising and nongrammaticality".

      Additionally, I've also tried my hand at empirically distinguishing the copy theory from multidominance. I think there might be an argument or two that distinguishes them. I put something out in Glossa recently about this, called "The representation of syntactic action at a distance: multidominance versus the copy theory".

      Delete
    30. Brooke: Multidominance is just the copy theory with an explicit representation for chains (as multi-dominated objects). There are no predictions regarding structural symmetry made by either. If you want structural symmetry to be 'predicted', you must assume it, both in the multidominant representation as well as in the copy/set representation. If not, then don't.

      Your Glossa paper simply argues that we can't state restrictions on movement in the same way in the two notations. (Or that we should be sneakier about distributed interpretation in chains than a completely naive person might at first think.) This isn't too surprising; they are different notations. It is of course useful and important to characterize what sorts of structures we want to exclude!

      Delete
    31. @Greg: is your worry that when we have a structure like {a, {a, b}} you need to `know’ that a is the same thing where it is a member of {a, b} and a member of {a, X} where X={a,b}? I don’t think there’s any need for extra bells and whistles beyond the notion of phase, right? If two tokens in a phase are type identical, then we can just say that they are interpreted by the external systems as one thing (both semantically and phonologically). So a (suitably reduced, a la John Collins) set system is usable for displacement relations. Since we need something like phases to explain locality of domains, and since we need to interpret syntactic objects externally anyway, this simple set system is sufficient. Or have I misunderstood?
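
      A bare-bones sketch of that interpretive convention, assuming a phase is handed over as a bag of (token, type) pairs at Transfer (the function and names are hypothetical, just for illustration): type-identical tokens within the phase are grouped and treated as one interpreted unit.

      # Sketch: at Transfer, group the terminal tokens of a phase by type;
      # type-identical tokens within the phase are handed to the external
      # systems as a single object (interpreted/pronounced once).

      def interpret_phase(phase_tokens):
          # phase_tokens: list of (token_id, lexical_type) pairs
          units = {}
          for token_id, lex_type in phase_tokens:
              units.setdefault(lex_type, []).append(token_id)
          # each value is a list of positions treated as one interpreted unit
          return units

      # two type-identical tokens of 'what' in one phase -> one unit, two positions
      print(interpret_phase([(1, 'what'), (2, 'John'), (3, 'ate'), (4, 'what')]))
      # {'what': [1, 4], 'John': [2], 'ate': [3]}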

      Delete
    32. Hi David,
      The issue I'm having is that there is an intended structure for Chomskyans, which crucially includes the notion of a chain. Everyone wants to say that sometimes two (or more) distinct positions in a tree are related in this chain-like way. The set representation, being a notational variant of a binary branching (unordered, unlabeled) tree, does not allow this information to be represented naturally; a multidominance structure does. The usual theoretical desiderata (merge and move should be represented in a unified way, no-tampering, inclusiveness, etc.) are satisfied by a multidominance structure just as well as (if not better than) by a set/tree structure. (I say better, because we need to somehow allow the set/tree structure to represent chain information, and a popular way of doing this is to add indices, which many think of as violating inclusiveness.) I think that we should take as a general default principle the following:

      Use the most perspicuous representation possible

      This means that we should choose a representation which makes the things we want to do and talk about easy to do and talk about.

      A set/tree based representation, whether augmented with indices or not, does not provide a particularly perspicuous representation of the information syntacticians want to associate with sentences. (Using indices allows you to do some other stuff naturally, like say that node x and node y are coindexed, regardless of the nature of the substructures dominated by each.) Think about how you would tell a computer to identify a chain! ('Look through the big structure for all nodes with the same index and put their addresses into a list.')
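
      To make the point concrete, here is roughly what that index-scanning procedure looks like over a set/tree-style structure (the encoding and names are mine, purely for illustration): the entire structure has to be walked, and the occurrences grouped by index, before a single chain can be read off.

      # Sketch of 'identify the chains' over an indexed tree: walk the whole
      # structure, record the address (path) of every indexed node, and group
      # addresses by index. Chains are only recoverable via this global scan.

      def collect_chains(node, path=(), chains=None):
          if chains is None:
              chains = {}
          label, index, daughters = node   # node = (label, index or None, [daughters])
          if index is not None:
              chains.setdefault(index, []).append(path)
          for i, d in enumerate(daughters):
              collect_chains(d, path + (i,), chains)
          return chains

      # [CP what_1 [TP John [VP ate what_1]]]
      tree = ('CP', None, [('what', 1, []),
                           ('TP', None, [('John', None, []),
                                         ('VP', None, [('ate', None, []),
                                                       ('what', 1, [])])])])
      print(collect_chains(tree))   # {1: [(0,), (1, 1, 1)]}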

      Your particular example is important (although I disagree with you about the role of phases, and I prefer a compositional approach to externalization - these are perhaps not unrelated), but I am concerned about sentences like 'a bishop met a bishop', about seemingly equidistant things (like Swahili double object passives), and about chain links which cross phase boundaries. Assuming you are right (and I think the above examples suggest that you are not), what this is telling us is that chain formation is restricted in a very strong way, and that we can get by without explicitly representing chains at all. The amount of computation involved in determining the chains from the set/tree based representation given your constraints above is non-trivial, however, so we'd better hope that it doesn't need to be done. One way to think about this is as saying that syntax doesn't actually care about chains at all -- that we have been wrong all this time; the only reference to chains is made by the externalization systems. It's something of a magical coincidence that both systems make reference to the same chains, however, which should give us pause at the outset. It is also hard work to identify which pairs of nodes dominate isomorphic subtrees, even if we are guaranteed that there are a small number of candidate nodes.

      Note that it is also the case in Ed's minimalist grammars with the SMC constraint on movement, that chains do not need to be explicitly represented. In the case of MGs, however, this means that we can move to a simpler representation that does not require all copies of a chain to be present in the structure. It turns out to be easy to reconstruct chains, but it also turns out that it doesn't need to be done. Thus, this makes for a net win; we can move to a simpler representation (trees, not trees marked up with chains) which actually happens to be the most perspicuous possible.

      Delete
    33. Greg: Re: RNR, yeah, I agree. But as far as I see it, if we represent chains as the same sort of thing as whatever's built by structure formation generally, it seems we ought to expect them to act alike. But in RNR that doesn't seem to be the case. We seem to have thematic relations that don't show the effects of traditional structural relations. I guess an D account would have to say something extra to account for this.

      Yeah, the Glossa paper, if anything, shows we can't state the restrictions that seem to hold in the same manner for both MD and the copy theory. But I'd like to think that, based on those differences, we could make conceptual arguments in favor of one over the other, in a way analogous to the arguments between traces and, say, the copy theory.

      (Also, I'm in transit and haven't slept for way too many hours, so if the above is gibberish, I'll try again later!)

      Delete
    34. In the above comment I mean "an MD account" not "an D account" ...sleepy

      Delete
    35. @Greg: but if we don't need chains in the syntax (which is what I was suggesting: you can identify them in phases when you interpret), then the set notation is perfectly fine, right? Examples like `a bishop met a bishop' aren't an issue, since the phase story would treat them as distinct; they are, after all, in different phases! (You can even do this compositionally using Elbourne's proposals.) Cross-phasal `chains' are just cases where you've already computed type/token distinctions at the previous phase, and then you recompute at the next one. I don't see that "the amount of computation involved in determining the chains from the set/tree based representation given your constraints above is non-trivial". You just look at token identity in the phase (in fact, the way this works, which I think is just Chomsky's position, is abstractly analogous to the SMC in MGs).
      In terms of representation vs what is represented, I agree with John here. Syntactic objects are probably not really sets, but a very reduced set theory is a fairly good initial stab at modelling whatever they are.

      Delete
    36. @David: Yes, if you have a mechanism for encoding chains and you do not need headedness, then sets are just fine. That's why MG derivation trees can indeed be represented with sets, e.g. Merge(c,Move(Merge(a,b))) = {c,{{a,b}}}.
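
      Just to spell the encoding out (a quick sketch; the helper names are mine, and it assumes the two arguments of Merge are always distinct): binary Merge becomes a two-membered set and unary Move a singleton, so the derivation tree is recoverable from the nesting alone.

      # Encoding MG derivation trees as nested (frozen)sets:
      # Merge (binary) -> a two-membered set, Move (unary) -> a singleton set.

      def mg_merge(x, y):
          return frozenset([x, y])

      def mg_move(x):
          return frozenset([x])

      d = mg_merge('c', mg_move(mg_merge('a', 'b')))
      # frozenset({'c', frozenset({frozenset({'a', 'b'})})}), i.e. {c, {{a, b}}}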

      But the phase-based version is very fragile and, to be frank, ugly as sin:

      Let's contrast two sentences:

      (1) John saw John.
      (2) John arrived.

      I believe the standard analysis would be as follows:

      (1') [TP John [vP John saw John]]
      (2') [TP John [v*P arrived John]]

      So the idea must be that the presence of John in the specifier of the vP phase is the reason why we get the distinct subject and object in (1) but one and the same John in (2). But now suppose that we have a variant of English that is SOV, so the object has to move to Spec,vP:

      (3) John John saw.
      (3') [TP John [vP John John saw John]].

      Why exactly does that not come out as John saw? And going back to English, why is (4) not John saw?

      (4) John, John saw
      (4') [CP John [TP John [vP John John saw John]]]

      There are solutions, of course. But whatever solution you propose, we then have to check that it works for DPs and PPs (and commit to whether those are phases), and that it works with sidewards movement and head movement, and that it still allows for ATB and RNR, and so on, and so on.

      I know that many linguists like the idea that if we impoverish the structural representations just enough all the properties of movement will fall out correctly. But that's putting the cart before the horse: first you figure out the properties, and then you show that they all can be reduced to a specific derivational configuration, and then you give a proof(!) that this can be reduced to a lack of explicitly encoded chains plus a list of phases.

      And even then this approach has the basic problem of any proposal that constructs theories as a fragile house of cards where one minor change makes everything come crashing down: if we got everything perfectly right, it will work beautifully, but in science we never have everything right, so this will inevitably crash with little to be salvaged from the debris. Modularity and robustness trump the beauty of butterfly effects --- and in practice, the modular accounts end up more elegant most of the time anyways.

      So, taking this back to the starting point, why bother with a phase-based notion? If the motivation is to look at the interaction of phases and movement, you can do that anyways, no reason to artificially handicap your representation of movement.

      Delete
    37. I think that's not the standard (or any) analysis (which is that the unaccusative doesn't have a phase boundary at vP, so there's only one John in the unaccusative). And I don't buy that sidewards movement or head movement even exist - I don't have heads in my system (it's Brodyesque) and sideways movement, roll-up movement, and late Merge are unstatable.

      But maybe the big difference is that I quite like big theories with lots of ramifications coming out of a few ideas, as opposed to figuring out the properties and then implementing them. If such theories break, fine, one of the ideas was wrong, but we've learned something. If they actually stay robust, then we may be on to something. I guess this may just be a methodological difference in ways of tackling linguistic problems, but that's OK. It's a good thing to have people trying out different approaches, as you're never quite sure where a new insight might come from.

      Delete
  5. People, can we inject some much-needed facts into this discussion?

    No Tampering is empirically incorrect (Richards 1997, 2001).

    cf. Bulgarian: [which journalist][i] [which book][k] t[i] spread the rumor that the senator wanted to ban t[k]?

    (There certainly is something like cyclicity, but obviously No Tampering is too strong a condition. As Richards and others have suggested, something like "tend only to the needs of the head that is currently projecting" seems to work much better.)

    The Strong Minimalist Thesis (understood as the idea that there are interface conditions, and general principles of efficient computation, but nothing language-specific beyond Merge and maybe Agree) is demonstrably false (see, e.g., my 2014 monograph).

    From this perspective, you are all having a very involved discussion about "why x holds," or "how x is to be derived," regarding a collection of x's many of which are just not true of natural language. This might still be an interesting philosophical exercise, but from where I'm sitting, the connection to the human language faculty seems to have been lost in the shuffle.

    ReplyDelete
    Replies
    1. We disagree here. There is value in showing how a conception of cyclicality, c-command, hierarchy, etc. follows from simpler assumptions, even if they turn out to be incorrect. This is done in the real sciences all the time, for the demonstration that such a thing is possible in a non-trivial case often feeds doing the same in a more realistic setting.

      Second, showing that this is doable in the non-realistic case boosts the possibility that the empirical evidence against it should be reconsidered. This includes tucking-in analyses. The latter rely on an implicit mapping between hierarchy and linear order. There is nothing that we know that allows for a more indirect mapping in Bulgarian, for example, than the standard one assumed. Even an ad hoc adjustment might be apposite as the marked case. This will all depend on details. FWIW, I have provided one such counter-analysis for the Bulgarian facts, having to do with a further movement to the edge. It is not quite true that the subject MUST be first. Rather, if the object WH precedes, there is a kind of topic effect. Given that subjects are default topics... Well, you can figure out the rest.

      But this is not the important point. The important point is that we should demand that the empirically superior stories (if they are indeed so) also meet higher theoretical standards. If they can, that's great. If they cannot, then this should count as a strike against them and as evidence that they too might not be describing the "real" FL.

      Delete
    2. The story you told for Bulgarian won't work, I think, because the same facts hold among multiple wh-phrases all of which are internal arguments of their respective predicates –

      (pseudo-Bulgarian:)

      who[i] what[k] did you tell t[i] that I ate t[k]?
      * what[k] who[i] did you tell t[i] that I ate t[k]?

      Also, not so sure how, even for subjects, we can maintain the idea that wh-phrases are "topics" – aren't they the absolute opposite of given information?

      Finally, what you consider the "simpler case" is still based on an accumulation of (close to 50 years of) evidence. The desiderata that your simpler version of Merge accounts for – hierarchical structure, c-command, etc. – were also hard-won linguistic insights of the usual kind. It seems odd to me to have an a priori separation whereby this is "core" data and Bulgarian is a "special case." That might turn out to be true, but you'll have to show me that it is before I accept it as a premise for anything. That is to say, it might turn out that this is exactly the correct idealization (like developing a theory of classical mechanics on the premise that objects are in a vacuum), but there is no a priori argument to be made for one idealization over another. I'm betting that any idealization that ignores Bulgarian, Kaqchikel, etc., is a wild goose chase.

      Delete
    3. There are no a priori arguments for anything. On that we agree. And I don't discount Bulgarian because it is not English. What I am saying is that failing to cover Bulgarian is a problem if the Tucking In analysis is correct. What I am saying is that having a theory that makes Tucking In a problem implies one of two things: Tucking In is true and so the theory that bans it is false, or the theory is true and Tucking In is false. To my mind, neither conclusion is clearly FAR superior to the other. But whether you buy this or not, whichever side is right, there is a problem: either find a way to loosen NT so that it allows something like Tucking In, OR find a theory from which Tucking In follows in a principled manner and that has all the other nice properties that NT theories do. Both decisions carry further obligations. What I object to is the supposition that NT accounts carry no implications for Tucking In theories. IMO, they provide evidence that they might be incorrect.

      Delete
  6. I very much agree with Greg's point above that sets are simply ill-suited to encoding the structures we typically use to describe "movement". They just don't seem to fit the bill empirically, whatever other attractive "minimal" properties they might have.

    In addition to that point, I think they also fail to fit the bill empirically in an entirely independent respect, namely they don't give us a natural way to encode headedness. This point seems to get lost, I think, because there's a tendency to reason from (a) the generalization that syntactic operations don't refer to order of pronunciation (only hierarchy), to (b) the conclusion that if merge applies to X and Y the result should be {X,Y} and not <X,Y> or <Y,X>. Of course, if syntactic operations did refer to order of pronunciation, then it would follow that it makes sense to distinguish between <X,Y> and <Y,X> -- but nothing in particular follows from the fact that syntactic operations do not refer to order of pronunciation, because there might be other things we want the distinction between <X,Y> and <Y,X> to encode.

    And indeed, one of the "big facts" about language seems to be that when two things combine to form a larger constituent, one of them provides the head of the newly-formed constituent and the other does not. We don't have anywhere to encode this distinction if the result of merge applying to X and Y is simply {X,Y}, and this leads to all sorts of complicated questions about labeling and so on. Another option is just to suppose that merge creates ordered pairs, and that <X,Y> and <Y,X> are two distinct syntactic objects, both of which have X and Y as their (only) immediate subconstituents, one of which has (the head of) X as its head, and the other of which has (the head of) Y as its head. (Note that it doesn't even matter which one of these is which.)

    The aversion to using ordered pairs as syntactic objects seems to stem from conflating the "order" in "ordered pair" with linear/pronunciation order. But the "order" in "ordered pair" just refers to the fact that <X,Y> is distinct from <Y,X>.

    For example, think of ordered pairs in high school coordinate geometry. Why do we use ordered pairs like <3,4> to describe points in the plane rather than sets like {3,4}? Because as well as the point <3,4>, there's a different point that we would like to represent by <4,3>. It's not because the point we represent by <3,4> "has 3 to the left of 4" whereas the point we represent by <4,3> "has 3 to the right of 4" -- it's because they're different in some other way that needs to be tracked somehow. Similarly, I think it makes sense to distinguish between two different imaginable syntactic objects formed out of the words 'eat' and 'cake', not because one has 'eat' to the left of 'cake' and the other has 'cake' to the left of 'eat', but rather because one has 'eat' as its head (projecting over 'cake') and one has 'cake' as its head (projecting over 'eat'). If your syntactic objects are things like {eat,cake}, then I don't know which of those you mean, and whichever one it is I don't know how to refer to the other one. A better solution seems to be to use <eat,cake> and <cake,eat> to represent these two syntactic objects (again, which one is which doesn't matter).
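
    A tiny sketch of the contrast (the representation is mine and purely illustrative): with bare sets, the two imaginable headed objects collapse into one, while with pairs they do not.

    # With bare sets, the two imaginable combinations of 'eat' and 'cake'
    # collapse into one object; with ordered pairs (head written first, say)
    # they stay distinct, so headedness is at least representable.

    set_version_1 = frozenset(['eat', 'cake'])
    set_version_2 = frozenset(['cake', 'eat'])
    print(set_version_1 == set_version_2)    # True: no place to record the head

    pair_version_1 = ('eat', 'cake')         # 'eat' projects (a VP, roughly)
    pair_version_2 = ('cake', 'eat')         # 'cake' projects (an NP, roughly)
    print(pair_version_1 == pair_version_2)  # False: the two objects differ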

    ReplyDelete
    Replies
    1. So let me add this to Greg's argument: in addition to sets being ill-suited to representing movement, they are ill-suited to representing headedness. Some would probably say (and I'm pretty sure Norbert taught me!) that two of the biggest discoveries of generative grammar are the fact that syntactic structures are not "simple trees" of the sort you find in any basic discrete maths textbook, but rather are trees with these two distinctive bells and whistles, namely "one thing in two places" (in some form or another) and headedness. So I'd be tempted to say that if there's one thing we know about syntactic objects it's that they are not sets.

      Delete
    2. (1) Right enough, Merge doesn't provide for headedness, but I know of no syntactic framework that explains headedness without stipulations. So, perhaps, Merge+labelling is not so bad a framework. The idea of using pairs to encode headedness seems to take one back to sets. I take an ordered pair to be {{x}, {x, y}}, per the standard mathematical treatment. Treating the ordered pair as a primitive doesn't make much sense unless one is thinking of the pair as ordered in the intuitive sense. Similarly for movement...

      (2) Merge trivially gives rise to displacement insofar as it allows for internal merge (copies). The problem arises with how to tell structurally what is a copy and what is a new element. That's a problem for everyone, insofar as we don't want to stipulate our way to an answer. I think Greg is right that multidominance offers a natural answer here that involves giving up on the set approach. But it remains unclear to me why multidominance should be the preferred option across the piece (see previous comments) and why the standard Merge approach cannot come up with something more or less natural, i.e., the kind of phase approach David Adger mentions above.

      So, I still don't see what is so bad about sets.

      Delete
    3. you write: ...giving up on the set approach

      But this is what gets me: There was never any reason to hold to the set approach. It doesn't work, because it can't; the structures you want to describe are not trees, and the set approach is only able to represent trees. It seems to me that this is yet another example of a notation acquiring some sort of mystical significance.

      Your 'standard mathematical treatment' is a way of encoding structures as sets. This was done back when certain people wanted to use set theory as a foundation for mathematics. There was a big push to reduce all of math to sets. An ordered pair is, however, not a set, just like a piece of DNA is not a sequence of letters. It is a mistake to get weepy eyed and mystical about sets. They are not 'the truth', they are just one way of encoding arbitrary objects. We don't care about encodings, we care about the objects being encoded.

      Delete
    4. Whatever reason there is to favour Merge is a reason to favour sets (in the stripped-down sense I commend). People might be mistaken about the virtues of Merge, and might not recognise the virtues of alternatives, but I don't get your intimation of mass delusion.

      I am a bit of a mystic about sets (sans the tears), but that is irrelevant. If someone asks me what an ordered pair is, I'd give them the definition I offered (or some set equivalent). That is just what it means. If one wants to say the notion is rather a primitive, well and good, but then I'll just substitute my set notion without loss or change of anything anyone wants to say. If ordered pair means something different, then I'm at a loss as to what it might mean.

      Besides, as I said way up above, I'm happy for sets to be out of the picture save for the kind of sets delivered by Merge as I characterised it, and the reason for this is that I care about the properties such sets appear to model, not about the sets themselves - syntax isn't set theory.

      Delete
    5. John Collins writes: Right enough, Merge doesn't provide for headedness, but I know of no syntactic framework that explains headedness without stipulations.

      Right, none of the options under discussion here explain headedness, but it seems to me that if applying merge to x and y produces {x,y} then you can't even describe headedness, whereas with ordered pairs you can at least say the right thing. Or put differently: if we stipulate that the result of merging x and y is {x,y} then we are incorrectly stipulating that structures are not headed, whereas if we stipulate that the result is <x,y> we are at least stipulating the right thing.

      The idea of using pairs to encode headedness seems to take one back to sets. I take an ordered pair to be {{x}, {x, y}} per the standard mathematical treatment.

      If some people would rather write {{x}, {x, y}} instead of <x,y>, that's fine. The important point is that merge produces this thing, however written, and not {x,y}. So this is only "taking one back to sets" in a very weak sense, since if these are both consistent with "the set hypothesis" then it's a very weak hypothesis. The more interesting hypothesis that merging x and y produces {x,y} doesn't seem tenable. I'm all for pursuing the simplest hypothesis until we're forced to something else, but when people say that "sets are the simplest hypothesis" I take this to be saying that {x,y} is simpler than <x,y>; this is quite different from saying that {{x},{x,y}} is simpler than <x,y>.

      Pointing out that you can encode (say) headedness using sets by taking <x,y> to be a sort of abbreviation for {{x}, {x, y}}, seems a bit like pointing out that you can encode hierarchical structure using linear strings by putting '[' and ']' markers in those strings. Linear objects are simpler than hierarchical objects, in roughly the same way that sets are simpler than ordered pairs; but this is quite different from saying that using a system with linear primitives to describe objects that exhibit hierarchical behaviour, is simpler than using a system with hierarchical primitives to describe objects that exhibit hierarchical behaviour.

      Treating the ordered pair as a primitive doesn't make much sense unless one is thinking of the pair as ordered in the intuitive sense.

      By "the intuitive sense" do you mean pronunciation order? If so, I don't understand the reasoning. Why should facts about pronunciation have a privileged status for bearing on the question of whether syntactic objects are symmetric or asymmetric? When we write <3,4> for the Cartesian coordinates of a point, should we take this to be a non-primitive representation of what is really {{3},{3,4}} because there are no facts to be found about whether 3 is pronounced before or after 4? And if externalization did not happen to force a total order on things, would we be left in a situation where it was impossible in principle to decide that ordered pairs should be primitive?

      Delete
    6. @Tim but it's maybe not a bad idea to have headedness be an interpretation of syntactic objects, rather than an intrinsic feature of them. One could, of course, remove heads from the system entirely, and determine headedness via the specification of extended projections, which we need to state somehow anyway. This would leave the syntax itself perfectly symmetrical, with the asymmetries imposed by the interfaces, as in, say, I dunno, my 2013 system ;-).

      Delete
    7. @David: I haven't read your book yet, so forgive me the naive question: Seeing as we can freely shift the workload between syntax and the interfaces, what do we gain from that?

      Delete
    8. David Adger writes: but it's maybe not a bad idea to have headedness be an interpretation of syntactic objects, rather than being an intrinsic feature of them

      Yes, fair enough, if we can figure out the head of a phrase on the basis of the identity of its constituents (i.e. for any two elements X and Y, whenever they combine the same one provides the head every time), then we don't need to encode it in the structure. As Thomas says above, this is what is happening in MG derivation trees. I still find the persistence of the {X,Y} idea odd given that the common assumption (rightly or wrongly) seems to be that the trees constructed by merge do eventually end up encoding headedness somehow, and this mismatch was the main point I was trying to get across above. But yes, I completely agree, if we get rid of that assumption this particular problem with the {X,Y} idea goes away.

      Delete
    9. @Thomas well, say you think there are no good empirical cases where head movement feeds semantics. Then a good way to ensure that that generalization is captured in your theory is to not have a head-movement operation in the bit of the grammar that feeds semantics. So that means you want it on the `PF' branch, as some kind of direct linearization of the extended projection information you are using to give yourself `headedness'. But how do you stop it happening in the actual computation? An easy way is to simply not have heads to move. So you have put the work at an interface, but crucially you've also got a system that captures why head movement has no semantic effects (there are no heads to move, and only E/I-Merge operations feed semantics). So we have an explanation for a general property of human language. I think that's a gain (if the empirical generalization is right!)

      Delete
  7. This comment has been removed by the author.

    ReplyDelete
  8. Tim: Just a few things:
    (i) The pair doesn't by itself take you any distance towards describing headedness (nor does the set def., for that matter), for the pair is not intrinsically asymmetrical. Symmetry holds where x = y. Moreover, why should one element be the head rather than the other? You only get headedness via the information that the elements are distinct plus some other information about the elements (as supposedly in the case of adjunction). So, I really don't see what the pair gives you without stipulations.

    (ii) My thought about primitiveness and order was merely that if you take the pair as primitive then you still need some information (an axiom or whatever) to the effect that it encodes the characteristic order property. The set df. allows you to derive that property plus other nice stuff. I meant, therefore, that one could take '<...>' to mean, say, '>', but that takes you back to the intuitive idea, which is not what one wants (for example, to keep to your Cartesian example, it misses anything on the diagonal). I wasn't suggesting that the set df. has anything to do with order. Order in set theory is a kind of trick, and was designed to cater for transfinite issues, where intuition goes out of the window.

    ReplyDelete
    Replies
    1. I'm not sure I follow all of this comment, I'm afraid, but I think I probably led us astray a bit by introducing "symmetry". I was just looking for another way to avoid the term "order", because of the way it draws attention to questions of pronunciation. A better way of saying it is probably in terms of commutativity: given two elements X and Y, are we talking about an operation which can make two different things out of those elements, like the way division can make both X/Y and Y/X, or about an operation which can only make one thing out of those elements, like the way multiplication can only make X*Y.

      If we assume that the objects constructed encode headedness (which seems to be common although not necessary, as David Adger pointed out), then merge seems to be like division, not like multiplication: it needs to be possible to construct both <X,Y> and <Y,X>, as opposed to only being able to construct {X,Y}. The fact that we can encode <X,Y> as {{X},{X,Y}} doesn't lend any support back to the idea that we can make do with only being able to construct {X,Y}.

      So, to try to sum things up, there seem to be two different versions of "the set idea" floating around:
      (1) the idea that merging X and Y produces {X,Y}, and
      (2) the idea that although merging X and Y produces <X,Y> we are better off thinking of this as underlyingly {{X},{X,Y}}.
      Which of these are you arguing for?

      Delete
    2. This comment has been removed by the author.

      Delete
  9. This comment has been removed by the author.

    ReplyDelete
  10. This comment has been removed by the author.

    ReplyDelete
  11. Tim: No problem. Let me say the following by way of clarification.

    (i) Putting two distinct elements together via our heavily restricted set-theoretic gizmo (aka Merge) gives us {x, y}, which is the same as {y, x}. This gives us no insight into headedness, because neither element is marked out as special or different in any way. Now let's introduce something else that, by stipulation/definition, also puts two distinct elements together, but which is non-commutative (to go with your analogy): (x, y). Does this new kind of entity describe or otherwise capture headedness? The thought is 'no, it doesn't', because while the elements are marked out as different insofar as they are non-substitutable, that doesn't follow from the operation itself, but merely from the elements being different independently; besides, why should the head be one element as opposed to the other, even when the elements are different? So, here I agree with the Adger/Chomsky line of shipping headedness out of the syntax proper, at least if syntax is Merge-y.

    (ii) My point about {{x}, {x, y}} is that you can't escape the lack of required information in sets by moving to (x, y), because that just is {{x}, {x, y}}, which no more tells you what the head is than {x, y} does. The set df. of (x, y) tells you that (x, y) is not (y, x) only if x is not y, and that (x, y) is (w, z) iff x = w and y = z. So expressed, it is clear that the definition does not distinguish the elements as such, but only provides a structure in which different pairs can be defined as the same or different. For example, 'being first' (x) on this view simply means 'being a member of each member'.
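
    For what it's worth, here is a small sketch checking those two facts about the Kuratowski definition (the helper names are mine, purely illustrative): distinct pairs come out distinct just when x is not y, and 'being first' is recoverable as 'being a member of every member'.

    # Kuratowski: (x, y) := {{x}, {x, y}}. Two quick checks: (i) the pair
    # collapses when x = y, and (ii) 'first' = 'member of every member'.

    def kpair(x, y):
        return frozenset([frozenset([x]), frozenset([x, y])])

    print(kpair('a', 'b') == kpair('b', 'a'))  # False: distinct when a != b
    print(kpair('a', 'a'))                     # frozenset({frozenset({'a'})})

    def first(pair):
        members = list(pair)
        common = set(members[0])
        for m in members[1:]:
            common &= set(m)
        return common                          # the singleton {x}

    print(first(kpair('a', 'b')))              # {'a'}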

    ReplyDelete
    Replies
    1. P.S. So, as regards your (1) and (2), I endorse (1), but reject the presupposition of (2), for Merge does not give you the ordered pair as I have been using the notions.

      Delete
    2. Getting into this late, but here is my .02.

      First, I have never understood the problem with indexing lexical items. Accessing the lexicon is an operation. The question is whether this operation can track how many times one has "grabbed" a given lexical item (e.g., the difference between Numerations and plain sets is precisely this kind of tracking). In other words, can the system distinguish tokening an expression once from tokening it twice? Now, keeping track of tokens in this way does not seem to me to be part of the linguistically proprietary features of FL. Indeed, I suspect that it is non-linguistic. If so, then the fact that the human FL might index selections (i.e., keep track of this info) would not be surprising, given that it is a general cognitive capacity (see Marcus's The Algebraic Mind for discussion). If this is so, then it is easy to distinguish different lexical tokens via their indices, and phases are beside the point.

      This is all useful given that we know that there are languages (copy-reflexive languages like SLQZ) that do reflexivization and control with copies (John saw John = John saw himself). These Ss are ambiguous. This ambiguity is trivial to represent given indices, not so trivial otherwise. So, can we allow indices? Sure. Do they violate Inclusiveness? Nope. They come for free from general principles of cognitive computation. They are not special to FL and have nothing to do with the syntactic computation. They precede it, as it were.
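
      A toy sketch of the kind of token-tracking being imagined, assuming a simple counter on lexical access (all names are mine, purely illustrative): two grabs of 'John' come back as distinct tokens, while re-using one and the same token gives the copy-reflexive structure, so the two readings stay distinct downstream.

      # A counter on lexical access: each Select of an item returns a token
      # stamped with a fresh index, so two grabs of 'John' are distinct tokens,
      # while re-using one and the same token (copy reflexives) is also possible.

      from itertools import count

      _indices = count(1)

      def select_token(item):
          return (item, next(_indices))   # e.g. ('John', 1)

      john1 = select_token('John')        # ('John', 1)
      saw   = select_token('saw')         # ('saw', 2)
      john2 = select_token('John')        # ('John', 3)

      transitive = frozenset([john1, frozenset([saw, john2])])  # John_1 saw John_3
      reflexive  = frozenset([john1, frozenset([saw, john1])])  # John_1 saw John_1
      print(transitive == reflexive)      # False: the indices keep the readings apart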

      Now headedness. Two points. First, I am loath to trace headedness to interface conditions, for it seems pretty clear to me that constituency matters in syntax. The fact is that it is not only (maybe not even) heads that move. But without labels we have no idea why not. Why do NPs move and not just Ns? Why VPs and not just Vs? And don't talk to me about pied-piping, a promissory note of over 25 years' standing. So, unless we allow labeling in the syntax, I don't think that we have anything to say about standard diagnostics of constituency. But if it is in the syntax, then Chomsky is wrong that it is induced at Spell-Out merely for interpretability at the interface.

      Second, so far as I can tell, we don't need labeling info for semantic interpretation. We need notions like predicate, argument, modifier etc but it is not clear we need anything like noun phrase, verb phrase, adjective phrase, agreement phrase (what semantics does agreement induce?) etc. Edwin Williams made this point about 35 years ago and I don't see any reason to think that semantic composition requires the kind of labels we seem to need in syntax to describe even the most trivial constituency facts.

      This leaves us with a problem: what to make of labels. I know why Chomsky wants labels to originate at the interfaces. This removes it as a primitive syntactic fact that needs explanation. But if we reject this view, as I think we should, then what we want is a system that starts from labels and tries to explain unbounded hierarchical recursion using labels as crucial. Labeling creates new objects. Maybe the miracle is not that we can combine things endlessly, but the kinds of objects that we can combine. I like this idea. I think that labels play a role. But I won't go into this again. What I want to emphasize is that if we want constituents IN the syntax then labels are going to be the way to go, and endocentricity becomes the deep mystery.

      That's my .02.

      Delete
    3. Norbert:

      First, I have never understood the problem with indexing lexical items.

      There are a lot of things that have been discussed under this rubric, and so let me distinguish them (what you intend I will save for last).
      1. indexing as representing chains
      I think most are in tentative agreement that syntax should manipulate chains. We want, for example, features checked in one occurrence of an expression to count as being checked elsewhere. (Yes, I know Nunes relies on this not happening for his chain pronunciation procedure.) In this sense, indexing lexical items is just one way to represent chain information.
      2. indexing vs some other method for representing chains
      This is where the disagreement has been located in this discussion thread. I have claimed that using indices to represent chains is inferior to using multiple dominance in that using indices requires extra work to be done to extract the information that certain objects are part of the same chain. There needs to be some proposal about how the indexing is achieved.
      3. indexing lexical items during select
      This is your intent. It amounts to a particular proposal about how to do the indexing, which makes syntax actually blind to chains altogether. I don't like it for a number of reasons.
      a. what is an index? There need to be arbitrarily many of them, so they must have some inductive structure.
      b. workspaces/numerations/lexical (sub)arrays are huge data structures, and are required in order to implement this indexing-via-selection idea, but have zero motivation. They represent a complete departure from an algebraic perspective on syntax, as suddenly your syntactic objects are numerations, and your syntactic operations are operations inside of numerations. This point doesn't seem to be appreciated, but on this view syntax is manipulating numerations, not SOs. One way of thinking about this is that you are claiming that syntax is operating over functorial types, and that merge is set formation, lifted into this functor.
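      To make the 'lifting' point concrete, here is a minimal Python sketch (my own toy encoding, not anything proposed in this thread; the names merge and merge_in_workspace are invented for illustration): plain Merge is bare set formation over two SOs, while on the select-and-index picture the operation doing the real work acts on the whole numeration/workspace.

        # Toy sketch only: SOs are strings (lexical items) or frozensets of SOs.
        def merge(a, b):
            """Plain Merge: bare set formation over two syntactic objects."""
            return frozenset({a, b})

        def merge_in_workspace(ws, a, b):
            """Merge 'lifted' to the workspace: the operation's real domain is
            the numeration/workspace, which loses a and b and gains {a, b}."""
            assert a in ws and b in ws
            return (ws - {a, b}) | {merge(a, b)}

        ws0 = frozenset({"the", "dog", "barked"})
        ws1 = merge_in_workspace(ws0, "the", "dog")
        ws2 = merge_in_workspace(ws1, frozenset({"the", "dog"}), "barked")
        print(ws2)  # a workspace whose sole member is the SO {{the, dog}, barked}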

    4. A quick comment regarding reflexive copy constructions. As Felicia Lee notes in her 2003 paper, neither SLQZ nor Thai permits quantified noun phrases (nor conjunctions) to be copied in such constructions. So we see that any 'copying' present is extremely limited. As she does not give data on multi-word copies of individual-denoting expressions, like 'big hairy gorilla suit', I do not even know if the copying is unbounded in nature (i.e., true copying).

      We can indeed represent the link between the two copies with indices; lots of things can be represented with indices. If you wanted to establish the link via movement, you could also represent it via multidominance.
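      For what it's worth, a toy Python contrast (my own encoding, not anyone's official formalism) of the two options: with multi-dominance the chain just is the shared object, readable off node identity, whereas with indices there are two separate occurrences and the pairing still has to be reconstructed (a sketch of that reconstruction appears further down the thread).

        # Multi-dominance-style sharing: one object, reachable from two mothers.
        john = {"label": "John"}
        vp = {"label": "VP", "daughters": [{"label": "saw"}, john]}
        tp = {"label": "TP", "daughters": [john, vp]}     # same object, two parents
        assert tp["daughters"][0] is vp["daughters"][1]   # chainhood = object identity

        # Index-style encoding: two distinct occurrences that merely share an index;
        # something else must pair them up.
        tp_indexed = {"label": "TP", "daughters": [
            {"label": "John", "index": 1},
            {"label": "VP", "daughters": [{"label": "saw"},
                                          {"label": "John", "index": 1}]},
        ]}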

    5. The copies can be big; what they cannot contain is functional material. So quantified expressions are barred, but adjectivally modified ones are not. So, yes, there are limits to what copies can appear, but copies they are and this suffices for the argument, I believe. That we don't understand the limits does not mean that there is no such phenomenon.

      Second, if expressions enter derivations with indices then we can easily reconstruct chains if they are needed. What we need is the idea that a given expression can have multiple properties licensed by other, non-adjacent expressions in a phrase marker. So we need to be able to say that some X1 is the object of this V and the subject of this clause and the binder of this reflexive and ... This is a list of properties that an expression can have. These properties are tied to the index. If LI tokens are disambiguated (and indices will do this) then identity is sufficient to establish the relevant chains. All I am saying is that I don't see the big deal in allowing indices.
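      A minimal Python sketch of this point (my own toy encoding; the token and position format is invented purely for illustration): once each selection from the lexicon stamps the token with a fresh index, chains fall out by grouping occurrences of the identical indexed token.

        from collections import defaultdict

        # occurrences of indexed tokens in a toy phrase marker: (token, position)
        occurrences = [
            (("John", 1), "spec-TP"),   # one token of 'John', selected once...
            (("John", 1), "compl-V"),   # ...occurring twice: a movement chain
            (("John", 2), "compl-P"),   # a second token of the same type
        ]

        def chains(occs):
            """Group positions by disambiguated token: identity of the indexed
            token suffices to recover chain membership."""
            ch = defaultdict(list)
            for token, position in occs:
                ch[token].append(position)
            return dict(ch)

        print(chains(occurrences))
        # {('John', 1): ['spec-TP', 'compl-V'], ('John', 2): ['compl-P']}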

      As for a recursive procedure: didn't Tarski already give us one? Is this really a problem?
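      And if all that is wanted is a recursive procedure supplying arbitrarily many indices with the requisite inductive structure, something as humble as the naturals under successor will do. A throwaway Python sketch (mine, purely illustrative):

        def fresh_indices():
            """Yield 0, 1, 2, ... (i.e. 0, S(0), S(S(0)), ...): arbitrarily many
            indices, each built inductively from the previous one."""
            i = 0
            while True:
                yield i
                i += 1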

      My point re numerations was not that we need them. My point was that we allowed them to count tokens from the lexicon and nobody screamed. So why is this ok while placing an index on the selected item is something terrible? I don't see it.

      Last conceptual argument against getting hot and bothered about indexing: assume the following G. There is an overriding prohibition against ever using the same LI twice in a derivation. Do we think that such Gs will have very different properties than one that allows the use of an expression more than once? And if not, then how big a deal can distinguishing tokens be? My view now is that Gs can distinguish two uses of the same token vs uses of two different tokens of the same type. This capacity is not particularly grammatical and FL exploits it. Thus, I have no problem with indexing tokens and thereby completely disambiguating them for the purposes of the syntax. As this suffices to give me back chains (if I want them), that's great. The question is really whether we want chains (I think we do), not whether we can allow Gs to use indices to recover them.

    6. Two remarks that are somewhat tangential to the discussion:

      there are limits to what copies can appear, but copies they are and this suffices for the argument
      Not all copies are the same. One can at least distinguish three types of copies:

      1) finitely bounded in size
      2) unbounded size
      3) copies that contain copies

      Each one puts successively higher requirements on what your formalism has to be capable of. In particular, 1 and 2 do not require an actual copying operation.

      I don't see the big deal in allowing indices.
      I am very, very worried whenever indices get added to a system. Jim Rogers proved in his 1998 book "A Descriptive Approach to Language-Theoretic Complexity" that adding indices to a formalism that's at least context-free gives you the power to encode the halting problem, which means you have a completely unrestricted formalism. Now you could put some constraints on your indexation system so that the proof no longer goes through, and probably this restricted indexation system would then turn out to be equivalent or very close to what multi-dominance allows you to do. But that is all extra work you have to put in just to ensure that your indices don't run amok. Why go through all the effort if you have safer representational devices at your disposal?

    7. @Thomas. Could you elaborate a bit on the sense in which multi-dominance is safe? There's a bunch of literature on the properties of MSO-definable graph languages, but it wasn't obvious to me which results you're alluding to.

    8. Just googling around, there are results like this: https://pdfs.semanticscholar.org/267f/47bc7d6e7c478e626d1d9d63ba0a0dfed961.pdf

      I've only skimmed the paper just now, and may well have the wrong end of the stick entirely, but it looks as if multi-dominance is also not a technical device that can be freely used with a clear conscience?

    9. It's been a long time since I read Stephan's paper, but if I remember correctly (and his remarks in the conclusion seem to support my vague recollections), you only get undecidability if the multi-dominance graphs have unbounded tree width (anything else would be really surprising to me). That's not a real risk considering how linguists use multi-dominance: they don't allow just any single-rooted, directed graph where every node has at most 2 daughters. Rogers's index proof, on the other hand, works with binary trees, which are graphs with bounded tree width. So indices are problematic even under those tighter restrictions, and it's less clear that linguists aren't too permissive in their use of indices, e.g. because indices are also used for binding, and then you can get i-within-i configurations, i-within-j and j-within-i, and so on.

    10. I hadn't noticed the implications of the remarks about tree width. Thanks for explaining that.

      It's unlikely that multi-dominance could be used to do everything that indices do for the binding theory, so you'd need some alternative account of the binding facts in any case. Restricting attention to chains, the way linguists use indices to encode chains these days isn't really very risky. Of course, things were very different when Rogers wrote the book, since significant chunks of GB theory relied on freely-chosen indexations filtered by various conditions.

      I guess from my point of view, you and Greg have already got 90% of what you want. Syntacticians used to use the full power of indices in ways that really did threaten all sorts of nasty consequences. But these days they only use them in ways such that they could easily be replaced by other devices which are known to be well-behaved. Perhaps it would be nice if they took the final step and simply used those devices, but this seems almost just an aesthetic issue.

    11. I want to distinguish two tokens of the same type from one token used twice. This corresponds to two selections from the lexicon vs one selection plus movement. That's it. I am not interested in indices for binding, or control, or agreement, or much else. Just to identify chains. They might be useful for quantification as well, but not necessarily in the syntax. So, my interest in them is very limited.

      As for copies, I was just responding to the idea that we can use phases to distinguish two tokens from one token plus movement. I doubt that this is true given copy reflexive languages. So, I agree with you that, as far as my concerns go, your first observation is tangential to them.

    12. @Norbert: In that case you're probably safe because movement is sufficiently restricted. But there's still a minor niggle: a multi-dominance encoding is exponentially more memory efficient than indices because you do not need to actually create copies. Not a biggie in my book, because I regard these things as matters of implementation, not specification. I find it strange that computational efficiency is invoked even when there is no real quantifiable benefit, e.g. phases, but apparently plays no role in the copies vs. multi-dominance debate.
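      To put a rough number on the 'exponentially more memory efficient' point, here is a toy Python calculation (mine, under the deliberately worst-case simplifying assumption that every derivational step re-merges a full copy of everything built so far): explicit copies then double the structure at each step, while a shared-node encoding adds only a constant amount per step.

        def size_with_copies(steps):
            size = 1                  # start from one lexical item
            for _ in range(steps):
                size = 2 * size + 1   # re-merge with an explicit full copy
            return size

        def size_with_sharing(steps):
            size = 1
            for _ in range(steps):
                size += 1             # one new node, pointing twice at the shared object
            return size

        for n in (5, 10, 20):
            print(n, size_with_copies(n), size_with_sharing(n))
        # 5 63 6
        # 10 2047 11
        # 20 2097151 21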

    13. I have no problem with MD but for one thing: the damn diagrams are impossible to read. So, interpret indices as you wish. The standard MP view is that they signal multiple occurrences of the SAME expression, i.e. what MD says. So it's fine with me, but readable.

  12. Norbert: Just a quick point about headedness... Right, the SEM interface appears not to care about syntactic labels, but only about some categorisation that will feed into composition. Still, it must care about headedness, and so all one really wants from the labelling algorithm is for headedness to be decidable at each merge. Whether you call it VP, AP, DP, etc. really doesn't matter so long as V, A, and D turn out to be different in their semantic effects.

    Replies
    1. Why must it care about headedness? I keep getting told that it is crucial for semantic interpretation, but I don't see it. V, A, and D might be different in their effects, but this does not imply that VP, AP, and DP are. Why do we need to identify the head of a phrase? I have seen few (no) semantic interpretation rules written in terms of headed XPs. But if this info is not needed for CI (and not obviously needed for AP) then what's driving the labeling?

      Second, we do need to distinguish VP from AP within a given G. So we have verbs that take AP complements but not VP complements (e.g. 'seem'). Or there are rules that move DPs but not APs. And this takes place in overt syntax. So without labels in the syntax, how do we code for this? This is where the chorus mumbles something about pied-piping, as it has for the last 25 years.

      We have tons of evidence that constituency matters in syntax and that constituents are labeled and behave differently within a given G. If labels are just assigned when an SO is shipped to the interface, why should that be?

      So there is little or no evidence that headed XPs matter in the semantics, and quite a lot that they matter in the syntax. Conclusion? We have them at CI but not in the syntax. Convinced?

    2. Norbert: Interesting. I don't like being in the chorus :) Still... Yes, I see that getting rid of labels in the syntax comes at some cost. I've always liked c-selection, as it were. Mind, for your 'labels in the syntax' claim to be compelling, wouldn't you need clear generalisations stated in terms of labels? I mean, idiosyncratic lexical phenomena look like a thin reed to support labels. I won't mumble about pied-piping.

      On heads. I take the basic thought to be that if I put two objects together, then the interpretation is fixed in terms of one or the other, even if some ambiguity remains. So, if I merge 'see' and a direct object, then I know I have an event, not a thing. If I merge 'French' with 'teacher', then I know I have a teacher (either of French or from France), but I don't have a kind of French that teachers speak. Likewise with compound nominals. Heaven knows what or why they mean what they do, but headedness goes to the right in English. You even find this with privative adjectives ('toy gun': is that gun real or a toy?). And so on. Perhaps I've missed something.

    3. @John: given how my last attempt to inject facts into the comments on this post went, I hesitate to chime in – but here goes...

      Your generalization regarding compounds is only a tendency. There are plenty of compounds whose category matches neither of their subparts, and certainly not that of the righthand one ("look-alike": a noun formed by compounding a verb and an adverb; "white-out": a noun formed by compounding an adjective and a preposition; etc.).

      As for the larger question of non-idiosyncratic syntactic generalizations that make reference to labels, the list is long. English has verb-phrase ellipsis, but not noun phrase ellipsis (*Mary bought the new textbook, and John bought the <ELLIPSIS>, too.). Spec,TP can be occupied by a DP, a CP, and perhaps (depending on your analysis of locative inversion) a PP – but not a verb phrase (*[Surprising that John is late] is t.). And so on and so forth.

    4. Omer: I'm a philosopher. As the joke goes: 'Sure it's a fact, but I want to know if it's possible!'. On the facts:

      (i) Right, but I said nominal compounds, not compounds in general. I'm sure there might be one or two exceptions to right-headedness in nominal compounds, but they virtually don't exist. Those examples are nice, though, and do count against headedness going with compounding.

      (ii) Of course, yes, lots of phenomena appear to be based on labels. I was only responding to Norbert's appeal to c-selection.
