Faculty of Language: (Just One) More on Interpreting the SMT

Monday, April 15, 2013


One issue I left dangling (relegated to a footnote here actually) is whether to understand the SMT as a methodological or a metaphysical thesis.[1] The difference lies in evaluating the SMT wrt its fecundity or wrt its truth. I am partial to the first reading. In fact, I find it hard to see how the metaphysical version of the SMT could be true. Let me elaborate.

Minimalists like to say that Minimalism is a program, not a theory. I actually have some reservations about this claim, for programs are vindicated to the degree that they generate interesting and true theories, if not right away, then over a reasonable time span. The Minimalist Program (MP) is now approximately 20 years old, so we should by now be evaluating it in terms of its theories.[2] As I believe that there are a lot of pretty good and empirically interesting (even true) proposals that the program has generated, I think that the “program” line can sometimes be a dodge. However, whether this is true or not, there is something important about the ‘program’ vs ‘theory’ distinction that is relevant to what I want to say here. The main distinction between programs and theories is that theories are things that can be true or false whereas programs are things that are fertile or sterile. As such, programs generate research methodologies, ways of approaching questions that can lead to theories that are truth evaluable. And one important feature of a good program is that the methodologies do actually function as productive hypothesis generators. One of the features of minimalism from the get-go has been its ability to suggest interesting avenues of linguistic investigation. It has done so in several ways.[3]

First, as Chomsky has liked to stress, it keeps us honest. Often our explanations are of the same order of complexity as the phenomena they aim to explain. This may or may not be useful (re-describing something can be a crucial step to explaining it); however, it is unlikely to be explanatory. It’s never a good idea to explain N data points with a theory that has N degrees of freedom. At any rate, early minimalism stressed the methodological virtues of simplicity and elegance and tried to motivate how they might be operationalized in the context of late GB syntax. Chomsky’s 1993 paper[4], the one that launched the enterprise, was a very good guide to how to do this, deploying Ockham’s razor to great effect in cutting away some GB underbrush. Chomsky’s basic point was methodological, viz. if we want our explanations to explain then they cannot be as convoluted as the data they care about. And he observed that firmly keeping this in focus can lead to a significant explanatory boost, i.e. less can be more, a whole lot more.

Second, MP started people (like me) thinking about the virtues of reduction/unification. Though never absent even in earlier work, this kind of project makes deep sense in the minimalist context. Why? Precisely because it urges that we go “beyond explanatory adequacy,” to use Chomsky’s terms. GB was mainly concerned with Plato’s problem. This problem is “solved” once something is put into UG, for by assumption things in UG need not be learned. This often has the effect (I am confessing here) of removing the incentive for developing svelte, streamlined, elegant accounts. So long as the principles can be shoved into UG their ungainliness fails to generate much empirical friction. To put it crudely (something many of you might think I do all too well) GB theories were to elegance what the lunar module was to aerodynamics (Did you ever see the damn thing? It looks like a pile of mechanical garbage wrapped in tin foil, see here) and roughly for the same reason. The module didn’t need to be nicely shaped to move efficiently through space as, way out there, space is a frictionless medium. In a sense, that was also true of GB; once inside UG, the shape of the principles didn’t much matter for Plato’s Problem.

Now I don’t want to overstate this. Linguists have always cared about elegance, simplicity, redundancy, etc. However, MP greatly raised the status of these virtues. These virtues fueled the impulse to unify the principles of UG and unification became empirically important when one worried not only about learnability issues but also about how FL/UG itself might have arisen. I’ve talked about this (no doubt too much, but as you can see I am obsessed by this) elsewhere (here) so I will drop the issue now. But I bring it up because it bears on the correct interpretation of the SMT.

One way of thinking about the SMT is along the lines of these more general desiderata. In other words, the SMT is an injunction to look for examples where interface properties reveal representational structure. The PLHH work shows that the ANS+visual system can tell us quite a bit about the nature of semantic representations (aka linguistic meaning) and work on parsing and acquisition can do so as well wrt syntactic representations. When such things are found, they can be revealing and the SMT, viewed as a methodological precept to look for such, can be, and has been, quite fecund, especially in forcing different kinds of linguists (syntacticians, phonologists, psycho types, and even neuro types) to ask how their projects and assumptions fit together. In short, as a guiding methodological principle, the SMT is a winner: fecund? Check ✔.

What about a metaphysical thesis? Here, things get a whole lot murkier. Recall that the SMT is supposed to be the thesis that the grammar is the optimal solution to interface conditions. One way of reading this is that the interfaces cause linguistic representations to have the properties they do. But what would it mean for this to be true? I really don’t know.

There is one possibility, the standard Darwinian one in which over long periods of time the interfaces chisel away at the rough edges of FL/UG (and vice versa) till they fit snugly together (interface requirements accommodating themselves to features of FL/UG and properties of FL/UG accommodating themselves to features of the interfaces). Maybe, but recall a good deal of Darwin’s Problem in MP rests on the premise that FL/UG popped up pretty quickly and so there was no time for Darwinian selectionist mutual accommodation to operate effectively. Without this Darwinian solvent, any fit that exists between interfaces and FL/UG will be quite adventitious. In fact, I would expect such perfect fit to be the exception rather than the rule and I expect that there will be/are many, many interfaces where the resources of FL don’t integrate at all well with the systems that (try to) use them. I can personally attest to the fact that my “dance module” is almost completely inured to verbal instruction. So, as a metaphysical thesis, I see no reason to believe that the SMT is even roughly correct. Or if it is correct it is a total mystery why it is or even could be. It would be too damn amazing were FL/UG to be just what every interface ordered. This would be super-intelligent design! This is why I find the SMT to be a pretty poor metaphysical thesis: from where I sit, it has all the hallmarks of being obviously false (indeed, incredible).

Is there anything paradoxical about a principle being methodologically fecund though metaphysically false? Nope. Fecundity and truth are related but distinct evaluative dimensions. To repeat: programs/methodologies fecund, theories/proposals true/false. So qua methodological precept (viz. look for this!) the SMT is a powerful injunction, but qua metaphysical thesis, not so much.

Let me put this another way by considering an analogy between the SMT and the Anthropic Principle (AP) (here). The AP can be used to deduce the values of attested physical constants. How? Well, the values must lie within a certain range in order for (conscious) life to be possible. As the universe clearly contains (conscious) life (i.e. us, well on some days at least), this fact can be used to specify a narrow range of values for the attested physical constants (e.g. the fine structure constant). As a methodological principle, AP seems unexceptionable. Given that we are here, of course the universe must be hospitable to us and this means that the physical constants must have hospitable-for-us values. However, as a metaphysical principle AP has a decidedly mystical air (e.g. the universe is “compelled, in some sense, for conscious life to emerge” (Wikipedia). Note the “in some sense,” always a sign that things are getting weird) that has a distinct theistic odor suggesting intelligent design. The SMT is similar. If FL’s products fit an interface transparently there is a lot to learn about the fine structure of the representation. However, this is not because the interface causes linguistic representations to have the features they do but because, in the domains where the SMT holds, features of the interface and features of the representations are very closely correlated. Thus, knowing the properties of one can tell you a lot about the properties of the other. In other words, where the SMT holds, features of the interface can be used to probe features of the linguistic representation. And just as our existence has implications for the values of the physical constants (at least in our universe) per AP, so too do properties of SMT-compliant interfaces have implications for the properties of linguistic representations, even if metaphysically speaking both the AP and the SMT are false.[5]

In sum, even if the general metaphysical version of the SMT is false, there is reason to hope that some interfaces will fit with FL/UG tightly. The properties of these can then be used to plumb the internal details of FL/UG (and, of course, vice versa). These domains of investigation will then be closely integrated, allowing for the development of richer theories of both FL/UG and the relevant interface.

Methodologically, one can go a little further and elevate the SMT to a methodological ideal. In particular, we can take as a default assumption that, for any given interface, the SMT (viz. the Transparency Thesis) holds. It should be easyish to disconfirm this if false (and I suspect that it will often be false), so it is a good 0-th level assumption to make. In the meantime, whether the SMT holds or not for a particular interface, we will find something interesting, and that’s what makes it an ideal methodological principle.

No doubt, there are other interpretations of the SMT that are more metaphysically charged (see Introduction of this for example). There are times when Chomsky’s allusions to third factors and snowflakes can carry this kind of tinge (there are also times when he resiles from this interpretation and explicitly adopts a methodological stance wrt MP and its precepts). For me, it is comforting to be able to interpret the various programmatic precepts in methodological terms. Why? I understand these and can see how to use them to generate research hypotheses. Seen from this perspective, the SMT is a very good way of framing linguistic questions, even if it is metaphysically very far-fetched.[6]

[1] This post developed from conversations that I had with Paul (the ‘P’ in PLHH) about the Interface Transparency Thesis and the SMT. It goes without saying that he is completely responsible for any dumb ass thing that I say here. Don’t like it, complain to him.

[2] A possible counter is that it’s too early to engage minimalist themes. Perhaps. But if so, then it’s not really a program either, more like a vision or dream.

[3] I’ve discussed this here for those interested.

[4] Epstein’s paper on c-command (here) was also very good at making these points.

[5] Now for a mea culpa (footnotes are good for this): (here) I said that the features of the ANS+visual system explain the features of L. This strongly suggests that they are the cause of those features in L. If the above is right, this is very misleading and I accept full responsibility for misleading you. I am so contrite that I am sure you will all forgive me. Thx for your indulgence. What we can say is that given the ITT we can deduce some features of L by noting features of ANS+visual, but in this case deducing X does not amount to explaining L (think heights of flagpoles and the shadows they cast).

[6] Curiously, this is the converse of the most vociferous versions of Linguistic Platonism: whatever its metaphysical virtues (none in my view) the methodological consequences of adopting it are confusing at best and baneful at worst (see here).

29 comments:

Unknown, April 15, 2013 at 4:31 PM
I also believe that the best way to understand the SMT (and several aspects of minimalism) is as a methodological assumption. If our job as grammarians consists in comparing theories (i.e. analyses in competence, typically), then we should have a systematic way to choose between two possible analyses A and B that make exactly the same predictions. Usually, we prefer the simplest alternative (Occam’s razor). But if we don’t have any stronger and deeper ontological assumption about the simplicity of language, then the systematic preference for the simplest theory doesn’t follow from anything. Thus, the application of Occam’s razor requires an ontological assumption such as the SMT (e.g. “language is an optimal device in order to connect sounds and meanings”, or the definition you prefer). Therefore, it “doesn’t matter” if the SMT is false, since its postulation is a means and not an end.

Very interesting blog, by the way.

Unknown, April 16, 2013 at 3:20 AM
This is a very interesting blog indeed. I want to focus on two important points:

1. In conceding that "the general metaphysical version of the SMT is [likely] false" Norbert admits that Minimalists have no rebuttal to Postal's ontological challenge that Minimalism is internally incoherent [NOT that Platonism is true which is a different issue]. Considering that Chomsky was unable to either understand or admit this for decades I think this candid statement demonstrates substantial progress for Minimalism and want to congratulate Norbert on it.

2. It is probably unproblematic to use SMT 'as a means' and argue: "If FL’s products fit an interface transparently there is a lot to learn about the fine structure of the representation. However, this is not because the interface causes linguistic representations to have the features they do but because in the domains where the SMT holds features of the interface and features of the representations are very closely correlated."

However, in admitting that there is good reason to question the existence of the innate structure postulated by SMT [SMT is false as ontological thesis, we have no causation just correlation] Minimalism loses its edge over empiricist views [e.g., Tomasello, MacWhinney, Christiansen & Chater, to name a few]. If FL is merely a convenient theoretical construct that does not actually exist [and the inability of giving any biological evidence for its existence gestures in this direction], then the only reason to prefer Minimalism over empiricism would be if the former can better account for facts of acquisition or performance [no typo, I mean performance] than the latter.

In case it is not obvious: Norbert was very clever when using the analogy between SMT and AP:

"The AP can be used to deduce the values of attested physical constants. How? Well, the values must lie within a certain range in order for (conscious) life to be possible. As the universe clearly contains (conscious) life".

Indeed, the values must lie in a certain range because conscious life exists. But whether FL exists or the fact that only Chomsky's granddaughter but not her kitten can acquire language can be explained by mechanisms suggested by competing views is still an open question. Admitting that SMT is likely false as ontological thesis amounts to admitting that this question remains open - which again is considerable progress and I tip my hat to Norbert once more.

davidadger, April 16, 2013 at 10:00 AM
I'm a bit perplexed. Isn't the SMT meant to be a biological or physical thesis, not a metaphysical one? If we take it to be true, that would mean that the structure of the faculty of language is an optimal structure as far as the various systems that need to use it are concerned as a biological fact. If it were a metaphysical thesis, say about ontic commitments, I'm not sure what import it would have, but given that Chomsky takes metaphysical questions to be beside the point (perhaps that's why he declines to discuss Platonism, Christina?), and I must admit I'm in agreement with him, I doubt that when he proposed the SMT he intended it metaphysically.

Norbert, April 16, 2013 at 10:41 AM
One question: what do you mean by "need" in "need to use it"? I assume you intend that some systems use it and those that use it do so optimally. This does not imply that every interface system does use it, only that if they use it they do so optimally. Is this right?

Alex Drummond, April 16, 2013 at 12:21 PM
I think I share some of David's confusion, so let me see if I can figure out what Norbert's getting at when he's talking about SMT as a metaphysical thesis.

The thesis that the computational system is optimal in some specified sense is a robustly empirical thesis. However, the claim that the truth of this thesis would be sufficient to explain why language has the properties it does is metaphysically doubtful. By analogy, if it could be shown that the optimal number of eyes is three, that would not explain why I have three eyes (I don't!) Or to take a linguistic example, if it can be shown that phases permit optimally efficient computations in some sense, that does not explain why phases exist, since (1) is — to put it mildly — not a plausible metaphysical principle:

(1) Everything works in the most optimal way possible.

Without something like (1) (or some weaker principle which can do the job in the case at hand), we have no explanatory bridge from “X would be the optimal way for the computational system to work” to “X is how the computational system works.” The only obvious candidate for something like (1) in the case of the language faculty is evolution by natural selection. But Chomsky has always taken a dim view of the hypothesis that the language faculty is optimal in some adaptationist sense.

davidadger, April 16, 2013 at 1:04 PM
I think I see, but I think it's a mistake to take SMT metaphysically in this way in the first place (contra Martin and Uriagereka, which I was never convinced by). Investigating SMT, if SMT is anywhere near correct, would itself provide a particular take on what optimal would mean in this domain (since the way it's stated is as 'an' optimal solution, presumably there are other possibilities), and that result may be further understandable from the perspective of theories of physical or computational optimality in general, outside of the linguistic or biological subdomains of science. But all of this is just normal science, attempting to understand particular findings in certain empirical domains in terms of more general theories. It's a bit like saying that evaluation measures (our old kind of simplicity!) need to be determined empirically, which was always the only way to do it since we don't know in advance what counts as optimal (e.g. number of features, determinism of mapping, underspecification of order, ...). Same for optimality in SMT. Still no metaphysics.

Norbert, April 16, 2013 at 1:12 PM
I like Alex's gloss and I agree, I hope that was clear, that I am no fan of the metaphysical reading of the SMT. I also think that David and I are on the same page here: assume that any system that uses FL uses it optimally. I take PLHH to provide a gloss on what this could mean: the representations are used transparently, i.e. there is no need for a translation into a covering grammar, the actual grammar suffices for interface use. This is a nice methodological assumption to make for it allows for ready falsification. Moreover, where accurate, it gives one another empirical window on the structure of the representations, something we all desire. An interesting consequence of this, however, is that to so function (as an empirical window on the structure of FL) it need not be that all interfaces actually/in fact optimally engage with FL. It suffices for this purpose that some do. PLHH provide evidence that this possibility is actual. Good. So, SMT as regulative ideal and with no required metaphysical consequences. Great.

VilemKodytek, April 16, 2013 at 2:52 PM
I always have to translate Norbert's (or other bloggers') idea into something simple to get it. In this case Norbert did it for me (perhaps in an earlier blog): It's an excellent idea to model (not too dense) gases as composed of non-interacting molecules, but no philosopher should deduce from it that molecules do not interact.

Unknown, April 17, 2013 at 3:01 AM
I find it noteworthy that minimalists constantly need to translate ideas into something that has absolutely nothing to do with human language: 'gases composed of non-interacting molecules' here; neuron wiring in nematodes or comet trajectories in Chomsky's writings etc. etc. This raises the [admittedly naive] question: why do you so rarely use analogies from human brains to explain your ideas to each other and to the world? And why do even the analogies from human brains virtually always concern other systems like the visual system? No one denies there are some similarities but what is INTERESTING about language is DIFFERENT from the visual system. The great Noam Chomsky said [rightly IMHO] about connectionists: "They’re abstracting radically from the physical reality, and who knows if the abstractions are going in the right direction?" [Chomsky, 2012, p. 67] So let me ask you: how do you know your abstractions ARE going in the right direction?

I also find it interesting that in Norbert's post we comment on right now not a single linguistic example supports the bold assumption "that any system that uses FL uses it optimally". David says quite rightly: "since the way [SMT is] stated is as 'an' optimal solution, presumably there are other possibilities" - so where can I find a comparison between SMT based analysis of specific linguistic examples [say 100 sentences that normal speakers of English would use] and a Platonist analysis? Since the analysis of the latter is not informed by the kind of optimality considerations discussed here there ought to be differences. And then we can judge whether the SMT based analysis is superior.

Finally, I am fairly sure that the people who have e-mailed me with comments on and questions about my postings here will be taken aback reading that a philosopher is being ridiculed by minimalists for insisting that the entities posed by the [program/thesis/theory...] need to be at least in principle capable of existing in human brains. That apparently Norbert finds it hysterically comical that someone would expect such is an important insight that will be news for many who consider minimalism as part of the natural sciences.