
Sunday, January 26, 2014

Three kinds of syntactic research

Robert Chametzky has some useful things to say about the absence of theoretical work within syntax, a point I labored to make clear in an earlier post (here). He distinguishes metatheoretical, theoretical and analytic work. All three are valuable, but, as a matter of fact, the third is by far the predominant type of research in syntax and is what is generally, inaptly, called “theoretical.” Here is Rob’s tripartite typology, from pp. xvii-xix of his sadly underread book A Theory of Phrase Markers and the Extended Base, available here. I transcribe, with some indicated omissions, the relevant two pages immediately below.

There are three sorts of work that can generally be distinguished in empirical inquiry. One is metatheoretical, a second is theoretical and the third is analytic. As is often the case, the boundaries are not sharp and so the types shade off into one another, but the distinctions are real enough for the core cases. I take up each in turn.

Metatheoretical work is theory of theory, and divides into two sorts: general and (domain) specific. General metatheoretical work is concerned with developing and investigating adequacy conditions for any theory in any domain. So, for example, it is generally agreed that theories should be (1) consistent and coherent, both internally and with other well-established theories; (2) explicit; and (3) simple…Specific metatheoretical work is concerned with adequacy conditions for theory in a particular domain. So, for example, in linguistics we have Chomsky’s (1964, 1965) familiar distinctions between observational, descriptive and explanatory adequacy. Whether such work is “philosophy” or, in this case, “linguistics” seems to me a pointless question.

Theoretical work is concerned with developing and investigating primitives, derived concepts and architecture within a particular domain of inquiry. This work will also deploy and test concepts developed in metatheoretical work against the results of actual theory construction in a domain, allowing for both evaluation of the domain theory and sharpening of the metatheoretical concepts. Note this well: deployment of metatheoretical concepts is not metatheoretical work; it is theoretical work.

Analytic work is concerned with investigating the (phenomena of the) domain in question. It deploys and tests concepts and architecture developed in theoretical work, allowing for both understanding of the domain and sharpening of the theoretical concepts. Note this well: deployment of theoretical concepts is not theoretical work; it is analytic work. Analytic work is what overwhelmingly most linguists do overwhelmingly most of the time…

Linguists tend to confuse analytic work with theoretical work and theoretical work with metatheoretical work…


Linguists typically distinguish not among the three types of work described above, but rather between “theoretical” and “descriptive” work, where both of these are better understood as analytic work, with, respectively, more or less reliance on or reference to a specific framework and its concepts and architecture…This distinction between “theoretical” and “descriptive” is not only ill-conceived, but also, for reasons I do not fully understand, invidious. The tripartite distinction discussed above involves no evaluative component for or ranking of the three sorts of work other…

28 comments:

  1. Do you think that the balance between types of research is somehow off in linguistics? As a person with self-ascribed physics envy, do you think that the balance of research in physics is radically different?

  2. Interesting question. Personally, I think that there is so little real theoretical work in Chametzky's sense that I would like to see more. There is lots of excellent analytic work. But as for theory, I find that as a field we have largely offloaded the responsibility for this to Chomsky. So, yes, there is not enough in my view. Moreover, except for Chomsky's musings, most other theory is barely tolerated. We don't have ways of evaluating proposals except analytically.

    How's this compare with other fields? Well in the ones I envy, I think that theory is accorded a place. It's not the ONLY game but it's a legit one. In linguistics, at least in syntax, not so much. At least that's my opinion. You?

  3. I like this division: I think it draws the lines in the right place.

    Isn't the whole discussion of MCS languages, the MCFG hierarchy, etc. a very solid bit of theoretical linguistics in this sense?
    I agree that this work is maybe not that familiar to most linguists of the non-computational persuasion.

    Replies
    1. Part 1:
      Glad you asked this. I've been trying to figure this out myself. I've gone back and re-read Greg's thesis and have been trying to figure out what being MCS or not tells us about FL. In Greg's thesis, if I understood the last chapter, he argues that MCS is a diagnostic for copying operations as part of the FL rule inventory. The argument seems to be that allowing Copy means that Gs of FL are not MCS. If this is correct, what follows? I'm not sure. The reason is that I am not quite sure why being MCS is a good or bad thing to be.

      The way Greg and Ed (Stabler) put things, discussing these abstract string properties has two virtues: (1) it allows theory comparison between formally apparently different kinds of theories (e.g. TAGs vs. MGs). One can compare these approaches wrt these abstract string properties and note where they differ and where they are the same. (2) It can constrain formalisms, e.g. if all Gs are MCS then a theory with Copy is out (I know that this is too easy, but it's good enough for here). As Greg uses this in the thesis, (1) dominates. What this gives us is the conclusion that most of the formalisms we are currently discussing are effectively equivalent using these metrics. But, for my interests, they clearly aren't. An MP with minimality but no phases is different from one with phases but no minimality. Which do we have, and why, is my question of interest, for it tells me something about FL/UG. So the methods available using this form of investigation seem too blunt to address the questions I am interested in, i.e. the structure of FL/UG. As I take the only important theoretical questions in syntax to concern the structure of FL/UG, these methods, though they might have been insightful, appear not to be (at least currently).

      This is a little like what has taken place, I am told, in phonology. Jeff Heinz and Bill Idsardi have argued that phonology sits in the domain of regular languages. However, what's interesting is that it is a very particular trio of such languages, not just any one. Ok, if what one wants to know is why the phonology of FL is the way it is, then hearing that it is regular does not do much. So, is this enterprise theoretical? Well, it's abstract and hard, but right now it seems, from what I can tell, not clearly to be advancing the central theoretical question: what's the structure of FL and why?
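      Since the Copy/MCS connection drives much of what follows, here is the shape of the formal point as I understand it (a sketch using one standard definition of constant growth, not Greg's exact formulation):

```latex
% Constant growth (one standard formulation): L is constant-growth iff
% there is a constant c such that every sufficiently long string of L is
% at most c symbols longer than some strictly shorter string of L.
\exists c \,\forall w \in L \;\bigl( |w| > c \;\rightarrow\;
  \exists w' \in L \;.\; |w'| < |w| \le |w'| + c \bigr)
% A Copy rule allowed to apply unboundedly, schematically
%   S(xx) \leftarrow S(x)  with base case  S(a),
% generates \{ a^{2^n} : n \ge 0 \}. The gap between consecutive string
% lengths is 2^n, which grows without bound, so constant growth fails and
% the generated language falls outside the MCS class.
```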

    2. Part 2:
      There is a second point to make. For Greg, the argument is from the structure of what grammars actually look like (e.g. do they motivate copy operations) to conclusions about the MCS/MCFG questions. In other words, the linguistics has implications for the formal questions rather than the formal questions having implications for the FL questions. This is less so for Ed, who believes that it is a well attested FACT that languages are MCS and that this does eliminate many kinds of grammars as options, e.g. MGs with COPY if Greg is right. That means that we need to reanalyze the phenomena that appear to motivate COPY, and this motivation is clearly theoretically driven IF languages are actually MCS.

      My problem with this is that it does not make MCS etc. different from any other fact that we believe to be true. So, say we believe that the evidence for islands is overwhelming (e.g. me); then any theory that cannot accommodate islands is ipso facto out. What to do if there are apparent counter-examples? Well, reanalyze them. This seems to me quite a bit different from the kinds of large regulative facts that I see prompting most theory: e.g. any kid can learn any natural language in more or less the same time on the basis of sparse degenerate data (Plato's problem), language evolved in humans (Darwin's problem), language is understood more or less as it is heard, etc. These seem to me like BIG facts that have theoretically interesting consequences for FL. I personally do not yet see what people have conjectured about the complexity hierarchy as being of the same size.

      This said, I have an open mind here at the moment, which is why I welcome the discussions on the blog led by Thomas. Maybe these methods/problems will prompt more interesting theoretical developments. I hope they eventually say something about FL. If they do, and I am hopeful, that would be great. Yet another source for theoretical innovation.

    3. @Norbert: There are many points in those two posts that I feel differently about, but it's nice to hear how our work is perceived by other linguists.

      I don't have much time to reply as I'm on the road right now, but I think there is one big difference in what you are aiming for and what I am aiming for. I get the impression that you want your theory to nail down exactly what FL looks like, even if this increases the risk of your theory being wrong. Whereas I prefer an account that carves out a class of possible FLs --- a weaker result --- but is more likely to "survive" new data or changes in how we think about existing data. The MCS result is one such result: you look at all the grammar formalisms compatible with it and conclude that the result is too weak to be interesting. I look at it and see all the grammar formalisms it rules out and conclude that it is a very profound, robust result that really narrows down the space where we should look for FL.

    4. Norbert wrote: In Greg's thesis, if I understood the last chapter, he argues that MCS is a diagnostic for copying operations as part of the FL rule inventory. The argument seems to be that allowing Copy means that Gs of FL are not MCS. If this is correct, what follows? I'm not sure.

      Yes, I think your understanding is basically right ... so wouldn't finding out that some natural languages are not MCS, and therefore that some states of the language faculty must correspond to grammars with that sort of copying operation, be telling us "something about FL/UG"? I get the feeling you're looking for it to "tell us" a different kind of thing, but I'm not sure what other kind of thing it is you're looking for. (Perhaps the answer is along the lines of what Thomas just added above.)

      I think the equivalence between MGs and (kinds of) TAGs is a bit of a red herring, and not really centrally related to Greg's thesis. The important point there is precisely that there are two formalisms that are not equivalent: PMCFGs are strictly more powerful than MCFGs, so it's possible to find a language whose set of strings can be described with a PMCFG but not with an MCFG. The basic claim is that Yoruba is such a language. The equivalence between MGs and TAGs (and MCFGs) means that you can't make that kind of argument (based on a language's set of strings) for MGs over TAGs or vice-versa, but there might be all sorts of other ways to make such an argument.

      One point that seems to me to be somehow overlooked in a lot of discussions about weak generative capacity is that making an argument based on a set of strings doesn't commit one to the belief that only arguments based on sets of strings are admissible, or anything deep and far-reaching like that. It's just one kind of argument, and it gets you as far as it gets you.
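      To make the MCFG/PMCFG contrast concrete, here is a toy sketch (an invented example, nothing like the actual Yoruba analysis): an MCFG rule must use each daughter variable exactly once, while a PMCFG rule may use a variable twice, i.e. copy it.

```python
# Toy illustration of the MCFG/PMCFG contrast (an invented example, not
# Greg's Yoruba analysis). MCFG rules are linear: each variable on the
# right-hand side is used exactly once. PMCFG rules may copy a variable.

def mcfg_step(x):
    # Linear rule, schematically A(xa) <- A(x): 'x' occurs once.
    return x + "a"

def pmcfg_step(x):
    # Copying rule, schematically S(xx) <- S(x): 'x' occurs twice.
    return x + x

def generate(step, start="a", n=6):
    """Iterate a unary rule n times from the base case."""
    out, s = [], start
    for _ in range(n):
        out.append(s)
        s = step(s)
    return out

if __name__ == "__main__":
    # Linear rule: lengths 1, 2, 3, 4, 5, 6 -- constant growth holds.
    print([len(w) for w in generate(mcfg_step)])
    # Copy rule: lengths 1, 2, 4, 8, 16, 32 -- constant growth fails.
    print([len(w) for w in generate(pmcfg_step)])
```

      Iterating the linear rule adds a bounded amount of material per step; iterating the copy rule doubles the string, which is exactly the constant-growth failure sketched further up the thread.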

    5. @Thomas: I see that our research personalities differ along predictable lines; the impetuosity of old age versus the caution of youth! Let me add, however, one observation. I am not sure that the weaker conclusions are any more immune to the shifting tides of empirical insight. Take Greg's argument for example. It seems that the logic runs from the linguistic data to the more general computational conclusions. Moreover, the argument is very indirect empirically, relying on quite a bit of "theory" if you will. The argument is that IF there is Copy then there is no MCS. The argument for having Copy, moreover, is not that we can discern actual strings that native speakers reliably recognize but that the mechanisms required to *elegantly* analyze RCs in Yoruba would generate strings that do not obey constant growth. A reply to this, in fact the one that Joshi and colleagues regularly make, is that copying is never allowed to apply unboundedly and so the premise fails. They point out (probably correctly) that native speakers probably would have quite a hard (impossible?) time accepting the "longer" empirically relevant cases and use this to argue that there is no MCS threat here at all. They did this for the other cases Greg reviews that challenged MCS status and would, I suspect, be happy to run the same argument against his. So, his conclusion rests on the same standard linguistic bases that ordinary arguments in my part of the world rest on: viz. that the best analysis of particular data implies the need for a copy mechanism. So the empirical foundations of his conclusions, and most others, are no different from those I rely on to make my, no doubt, extravagant claims. In other words, the belief that these weaker arguments are empirically more secure is, from what I can tell, an illusion.

      This said, I am happy with the conclusion that the work you provide sets a lower bound on the adequacy of grammars. However, this has never been disputed, has it? The point has been that this is setting one's sights way too low. The aim is to find out the detailed structure of FL, and it seems that we agree that the methods you are advocating are too dull to cut to the bone here. Is this correct?

      So, in a moment of weak ecumenism: I am very interested in seeing how far your methods get us. What I am still somewhat unsure of (but eager to learn about) is how far these methods will go. IMO, we know a lot about FL already. It would be a shame to pretend we don't. That said, I am always open to new methods for investigating my favorite issues. Cannot wait.

    6. @Tim: As always your comments are spot on. I agree that looking at string properties does not prohibit one from making other arguments. However, two things are worth noting. First, those who look at these, e.g. Thomas, seem skeptical concerning the other things we have discovered. After all, he thinks that this is the conservative position to take. Second, as I noted in my comment to him, the conservatism is, it seems to me, illusory. Why? Because it is not actual string properties that are being used to argue for one position or another, but the properties of potential strings, those generable were the grammar with Copy, say, allowed to run unboundedly. In other words, the argument is purely theoretical, and the empirical/theoretical evidence for the conclusion is no more or less sturdy than the evidence that people like me use all the time for what Thomas takes to be less secure conclusions. But, from where I sit, they are not less secure, for they rest on the exact same empirical foundations, viz. the best analyses of this or that linguistic phenomenon.

      Last point: In LSLT Chomsky observed that we idealize to unbounded/infinitely sized sentences because we are interested in generative capacity. But the real important distinction is not between infinite and finite but between small and large. Say the actual grammatical set of sentences were in the 20 billion range, but finite. Would that really change the problem for me? Nope. The projection problem of how to form a grammar from PLD would be as acute. Would it change the weak generative capacity problem? Well finite is finite, right?

    7. Yes, good points.

      I think there is still sometimes a certain concreteness that comes from basing the arguments on sets of strings, but this is just a matter of practical convenience (some combination of factors about the nature of the observations and the well-understood mathematical properties of sets of strings), not because sets of strings have any particularly special status. I agree that there are still a number of idealizations of the usual sorts and it's still "indirect empirically", so I think the difference in approaches is just a matter of degree, about how much indirectness one tolerates versus how much concreteness one requires, rather than any difference in kind.

      There may be some who disagree and see weak generative capacity arguments as different in kind (fully concrete and not at all idealized or indirect), but I don't think Ed/Greg/Thomas/etc. are in that camp.

    8. I'd like to take the opportunity to publicly thank Norbert for including the material from my '96 book here. And I'm grateful, too, for Alex Clark's comment. If my work is, as Norbert suggests, sadly underread, it makes up for it by being exuberantly overwritten, if I might be somewhat self-Earnest (if not Important).

      As for the current discussion, which took off from Alex's question

      Isn't the whole discussion of MCS languages, the MCFG hierarchy, etc. a very solid bit of theoretical linguistics in this sense?

      I guess I'd say that there are specific metatheoretical concepts being deployed and tested ("Specific metatheoretical work is concerned with adequacy conditions for theory in a particular domain." "the work you provide sets a lower bound on the adequacy of grammars."--Norbert), so, yeah, it would count as theoretical work as far as my bit of taxonomic analysis (analytic taxonomizing?) goes. And as the ongoing productive discussion reestablishes, answering such a merely taxonomic question isn't what inquiring minds want to know. But if providing such terms helps spur such discussions, I'm more than happy, and my work here is done.

      --RC

    9. @Norbert: I actually had a pretty long post ready, about the status of the unboundedness assumption, a comparison of the evidence against context-freeness versus the argument in Greg's thesis for copying, basically all kinds of things that make my previous robustness claim more clear. However, I realized that this wouldn't actually address what you're really interested in, it would only serve to make my own position a little bit more secure. So I came up with another super-long post instead ;)

      As far as I can tell, what you really want to see is some juicy results, something where the formal perspective i) directly addresses the questions syntacticians struggle with every day, and ii) tells us something we didn't already know through independent means. The original goal for the upcoming posts about derivation trees was exactly that. But having read your two posts above, I'm not sure if they will actually give you what you desire. The reason being that several such results have already been discussed on this blog before, but have apparently failed to fully convince you:

      1) Last week I explained why remnant movement is indispensable if you only have phrasal movement. Moreover, I pointed out that roll-up movement and smuggling do not increase strong generative capacity over having just Merge. Another way of putting this is that PBC-obeying movement alone does not cut it, no matter how many coding tricks you employ.

      Doesn't this tell us something about FL? The last time I mentioned the result to a group of syntacticians (sprinkled with one or two semanticists), they found it rather surprising. In particular that it holds no matter what your preferred analysis for phenomenon X is or whether movement is chain-formation, multidominance, etc. None of those contentious points factor into the claim.

      2) The Merge-MSO connection shows that Merge is far from the simple operation we usually view it as; it packs quite a punch. I know that you had some qualms about the validity of the result if the set of categories is fixed, but as I explained back then, this may reduce the problem somewhat, but it does not make it disappear. Whatever the solution to this problem turns out to be, it will tell us something about the status of constraints and category features in FL.

      3) I don't remember if I mentioned my work on island constraints on this blog or if this is something we discussed in private. It is firmly grounded in a mathematical approach to language, but it uses this grounding to show how the adjunct island constraint follows from surface properties of adjuncts. A nice result about FL: the AIC follows from the machinery that creates adjuncts, whatever that machinery may look like.

      So my question is, do these results meet your criteria for interesting claims about FL? If not, where do they fall short?

    10. Hmm, I am surprised that I am asked to arbitrate what's theoretical and what isn't. But as you ask I will try. A distinctive mark of analytic work is that it argues from data to conclusion. This amounts to curve fitting in more or less elaborate ways. So, Kepler's first law fits an ellipse to the data sets for Mars, and Galileo's acceleration law fits the inclined-plane data. We defend the formula on the basis of its fit to the observables. This is important work, but it is more formal than theoretical.

      In linguistics one finds the same thing in GB: the binding theory looks the way it does because the data are the way they are. Were the data different, the principles would change, and there would be no cost to changing them, for there is nothing that hangs on principles A/B being stated as they are except the observed facts.

      One gets to theoretical work when we not only explain the primary data but also explain the explanations of the primary data: why do the laws look the way they do? In the 17th century, this is what made Newton such a honcho: his theory explained both Kepler's and Galileo's laws, and did it in a unified manner. One derives Kepler from Newton and derives Galileo from Newton, and gets the tides in as a bonus. Nice work. The reason their laws work is that they are special cases of his. He not only explains their facts but explains their laws. Nice.

      Anything analogous in linguistics? Less clear here. I would say that this is what MP should be aiming for: to derive the laws of GB from something more general. We get a taste of this by unifying case, control, agreement, binding etc. via movement and unifying movement and phrase structure via a generalized version of Merge. Does this work? Well, in part, but there is, ahem, controversy. At any rate, I think that this is theory, for it explains the shape of the explanations. Theory has laws as explananda; this, I believe, is a good rule of thumb.

      Ok, now for your stuff, how's it fit? You show that remnant movement adds new expressive power and that roll-up and smuggling do not. Interesting, but does this tell us why the generalizations we observe are there? I am not sure, and probably am not the person to ask. However, even if true, this does not require that there be remnant movement (that is justified empirically), and it does not explain the properties of remnant movement should it exist, does it?

      Similarly for the two other projects you mention. The way I see it, theory constrains the fundamental laws and/or operations in some way. It adds another desideratum to our evaluations. It is not the application of formal tools to clarify difficult issues or the computing of consequences of analyzed data. Theoretical work aims to constrain lower-level generalizations; absent these, theoretical work is not possible.

      Note that, on this ground, assuming that languages are necessarily MCS does constrain UG, as Greg shows; it is inconsistent with Copy (or at least unbounded Copy). The question is what reasons we have for assuming that NLs are MCS. It seems that the answer is that this is a good empirical generalization. It is like claiming that planets move in ellipses. Important if true, but not quite what I have in mind.

      There is little doubt that this conception of what is and isn't theoretical is idiosyncratic, in part (though not entirely, as many physicists think that theoretical physics really started with Einstein). But you asked.

      Last point, none of this should be taken to demean non-theoretical work. Here I agree with Rob C completely. But as you asked...

    11. @Norbert: No worries, I'm not miffed at all, just trying to figure out what kind of result you're looking for and if I know anything of this sort that I could write about. Now that I see where you draw the lines, I can start thinking about this more clearly.

      One quick and dirty attempt. How about the following, more speculative claim (I'm simplifying things here for the sake of argument; there are some pesky empirical issues that make the picture less appealing):

      1) Phonology is some kind of finite-state device over strings.
      2) Syntax is a combination of two finite-state devices, one computing derivation trees, the other the mapping from derivation trees to output structures (maybe multidominance trees, maybe LF and PF representations, maybe something completely different).

      The second point implies through various theorems that language is MCS, and the limitation to finite-state computability links it to phonology. Now if the limitation to finite-state computability could somehow be derived from general cognitive restrictions, that would tie together quite a few things into a neat package.
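      As a minimal illustration of what "finite-state device over strings" in 1) amounts to, here is a toy acceptor (the harmony-style constraint it enforces is invented purely for the example):

```python
# Minimal sketch of a finite-state device over strings. The constraint is
# invented for illustration: a crude harmony pattern where front vowels
# ('i', 'e') and back vowels ('u', 'o') may not co-occur within a word.
FRONT, BACK = set("ie"), set("uo")

def accepts(word):
    # Three states: None (no vowel seen), "F" (front seen), "B" (back seen).
    # Finite memory and a single left-to-right pass are all that a
    # finite-state device is allowed.
    state = None
    for ch in word:
        if ch in FRONT:
            if state == "B":
                return False
            state = "F"
        elif ch in BACK:
            if state == "F":
                return False
            state = "B"
    return True

if __name__ == "__main__":
    print(accepts("kitep"))  # True: front vowels only
    print(accepts("kitop"))  # False: front 'i' followed by back 'o'
```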

    12. Bill Idsardi has recently mumbled similar things in my ear. I confess that this is above my pay grade. BUT, it sure sounds like a big idea, so I'm inclined to agree that this would fit even my idiosyncratic considerations.

      By the way, neither you nor I are or have been miffed. Just talking vigorously, and having fun and shedding lightness and joy in the process. Right?

    13. neither you nor I are or have been miffed.
      Well, there was that one time when... nah, just kidding. ;)

  4. The discussion of what these computational properties have to say about FL (and whether it's theory in RC's sense or not), Norbert's reference to research I conduct with Bill Idsardi and Jim Rogers, plus the latest post on "being Edgy", are all compelling me to chime in here. While I don't mind the fact that reading this blog has become a habit, I am concerned that writing posts to it could take away time that I don't have... but here goes.

    I would answer Alex's question above more generally: investigating the computational properties of the weak and strong generative capacities of natural language is absolutely solid theory in RC's sense. Norbert describes "the central theoretical question" as "What's the structure of FL and why?" In the "being Edgy" post, Norbert also makes the point that the properties of UG are going to be abstract. I completely agree with Norbert on these two points (w.r.t. the last, the Keenan and Stabler response to Levinson et al. 2009 is a good read; dunno how to do the linking yet). And I would come out swinging that computational properties of natural language patterns can in fact provide strong, abstract, restrictive hypotheses about the nature of FL *and* they can help explain why.

    One example I can give for this comes from my own work in phonology. It appears that the attested phonological patterns (both phonotactic patterns modeled as formal languages --- i.e. sets of strings --- and phonological processes modeled as mappings from strings to strings) belong to very specific subregular regions defined by particular computational properties. The linguistic hypothesis is that these properties reflect the character of the FL. They are strong properties because they eliminate most logically possible patterns. They make strong, falsifiable empirical predictions about the kinds of patterns we expect to find cross-linguistically and the kinds of patterns learnable/processable in psycholinguistic experimentation. They are abstract properties because they are not about surfacey things at all like "all languages have vowels". As for why natural language exhibits these properties, I think one answer comes from learning: if the FL/UG generalizes in certain ways from its linguistic experience, then we expect the empirical data to look the way it does.
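    To give a toy example of the kind of property I have in mind (the specific constraints are invented, not drawn from any actual language): a strictly 2-local phonotactic pattern is fully determined by a finite set of forbidden adjacent pairs, a far more restrictive hypothesis than "some regular pattern".

```python
# Toy strictly 2-local (SL-2) phonotactic grammar: a pattern defined
# entirely by a finite set of forbidden adjacent pairs (bigrams). The
# particular forbidden pairs below are invented for illustration.
# '#' marks word edges, as is standard in the subregular literature.
FORBIDDEN = {("m", "k"), ("n", "p"), ("#", "r")}

def well_formed(word):
    # Scan adjacent pairs of the edge-padded word; the word is in the
    # language iff no forbidden bigram occurs.
    padded = "#" + word + "#"
    return all((a, b) not in FORBIDDEN for a, b in zip(padded, padded[1:]))

if __name__ == "__main__":
    print(well_formed("tanka"))  # True: no forbidden bigram
    print(well_formed("tamka"))  # False: contains the banned 'mk'
    print(well_formed("ranka"))  # False: word-initial 'r' is banned
```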

    I'm less familiar with all the syntactic generalizations, and it is true that the claim that all syntactic patterns are MCS is much weaker than the claims I am making about phonology. But no one is saying the story ends there. The observations that Thomas and Greg and others are making about the nature of derivation trees in MG could very well lead to similarly restrictive hypotheses. This line of research will inevitably lead IMO to interesting subclasses of the MCS languages or other restrictive classes that crosscut the Chomsky Hierarchy (and which may include copying). The work that Alex is doing on learning is a complementary approach focusing on computational properties of grammars and languages that are learnable in various senses and display natural-language-like behavior.

    I'm very excited and optimistic about the future for two reasons. First, I think in my lifetime the theoretical/analytical linguistic work, the mathematical linguistic work, and the work in grammatical inference, all of which are ongoing, are going to make some very exciting connections. The fact that people from all those important corners are talking on this blog is the second reason.

    Replies
    1. Jeff, great to have you join the discussion. Here's hoping you'll give in to the temptation once in a while, time constraints notwithstanding.

      I agree with everything you say, even down to the two reasons for optimism. That's a good opportunity to thank Norbert again for welcoming such technical discussions on his blog. Also a tip of the hat to Alex C, who --- if I'm not mistaken --- was the first of us computational guys to comment on this blog and thus helped get the ball rolling.

      PS: Linking is the standard HTML markup fare: <a href="URL">link text</a>.

    2. Thanks. Here is a link to the Keenan and Stabler paper in Lingua (and a preprint version from Stabler's website since ScienceDirect appears down at the moment.)

  5. @Norbert: "Last point: In LSLT Chomsky observed that we idealize to unbounded/infinitely sized sentences because we are interested in generative capacity. But the real important distinction is not between infinite and finite but between small and large. Say the actual grammatical set of sentences were in the 20 billion range, but finite. Would that really change the problem for me? Nope. The projection problem of how to form a grammar from PLD would be as acute. Would it change the weak generative capacity problem? Well finite is finite, right?"

    This is a really good point. I think the standard way of presenting this property misses this point -- the size of the representation is crucially important. This is also important when we say things like "MGs are the same as MCFGs" -- well, they aren't, because the MCFGs may be massively bigger than the corresponding MGs. So there are important intensional differences between MCFGs and MGs that the weak (and strong) generative capacity arguments miss. But that is just an argument saying that we need a slightly better theory -- not an argument against having mathematically precise theories.

    Replies
    1. Just in case any of you are not familiar with this paper, which speaks to this issue: Savitch's 1993 paper defines a finite language L as "essentially infinite" with respect to a class of representations if (1) there is a rep R1 for L, (2) there is a rep R2 for an infinite language L' which is coextensive with L up to the length of L's longest string, and (3) R2 is smaller than R1. He goes on to prove that whether finite languages are essentially infinite depends on the class of representations.
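      In symbols, my paraphrase of that definition (writing |R| for the size of a representation R, L(R) for the language it denotes, and n for the length of L's longest string):

```latex
% My paraphrase of Savitch's (1993) definition as summarized above; |R| is
% the size of representation R, L(R) the language it denotes, and n the
% length of the longest string in the finite language L.
L \text{ is essentially infinite w.r.t. } \mathcal{R} \;\iff\;
\exists R_1, R_2 \in \mathcal{R} \,\bigl[
  L(R_1) = L \;\wedge\;
  L(R_2) \text{ is infinite} \;\wedge\;
  \{\, w \in L(R_2) : |w| \le n \,\} = L \;\wedge\;
  |R_2| < |R_1|
\bigr]
% For the definition to bite, R_1 should be read as a smallest
% representation of L; I take the prose version above to intend that.
```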

  6. I agree with Rob's distinctions. It seems like an insightful way to divide things up. The way I understand what he is saying is that analytical work takes some system of concepts/mechanisms/principles and applies them to some language data (the "phenomena"), to try to make sense out of that data, and then, on the basis of that analysis, to modify the concepts/mechanisms/principles. Almost all cartographic work falls into this category, I believe. Most of my books, including "Imposters" and "Classical NEG-Raising" (with Paul Postal), also fall into this category. "Analytical" work is highly addictive.

    I had originally (in 1993) thought that the Minimalist Program would stimulate much more theoretical work (in Rob's sense). An early clear example of this was Epstein's well-known paper on c-command. The whole point of the SMT (Strong Minimalist Thesis) is to try to look into the relations of various concepts, to see what their status is. And the SMT gives a concrete way of performing this task. One can ask, for each concept/mechanism/principle: "How does this fit into the SMT?" That is an exciting research program that really has not reached its potential. But this kind of work definitely takes a back seat to analytical work. The natural instinct of a syntactician is to approach such questions by immediately thinking of relevant data. We cannot even help ourselves. But a different path would be to approach them "theoretically", as Rob suggests.

    Discussing these issues goes right to the heart of our field, and even minor clarifications could lead to huge advances on the "analytical" level. In addition to c-command and labels, which have been discussed from the "theoretical" point of view on occasion, I would like to add that there is need of much more discussion of basic issues like occurrence, Merge, multi-dominance, chains, copies, NTC, inclusiveness, derivations, workspaces, the primacy of the C-I interface, interpretability, "late insertion", the format of lexical items, the nature of grammatical categories, semantic interpretation without indices, what semantic interpretation actually is, etc.

  7. I have always found Rob's distinctions useful, but I think there are some practical reasons why most current work in linguistics is analytical. One is that it is easier to evaluate analytical work: it adopts some premises, applies them within a domain, and analyzes the outcome. Theoretical work involves examination of premises, and it's much more difficult to convince people that their premises are wrong than to convince them that such-and-such data can be analyzed within their (perhaps slightly amended) premises. The other reason is the one referred to by Norbert as "offloading" to Chomsky. In fact, if someone who isn't Chomsky does try to do theoretical work and comes up with a proposal that isn't Chomsky's, the proposal tends to get sucked into a black hole. If Chomsky comes up with something similar, the proposal becomes Chomsky's, and if he comes up with something else, the proposal becomes tragically misguided. I don't think this phenomenon is malevolent: in many ways it's helpful to a field to have forces that cause theoretical developments to be generally shared across the field. This discussion is very interesting. Presumably there are other ways to foster shared assumptions while encouraging theoretical work in Rob's sense.

    Replies
    1. I agree with Peggy's observations, but I think that there is something more going on. I believe that theoretical work makes sense when one tries to keep one's eyes on the big problems that animate the field: Plato's problem, now Darwin's, etc. These are all too often lost in the nitty gritty of research. What partly makes Chomsky such a font of theoretical ideas is that he never forgets what the whole thing is for, what the big questions are, and why the technical stuff is interesting. We often fail to ask ourselves why anyone should care about our results, why they are important, what they tell us, if anything. This has the effect of shifting all of our attention to areas where we think the value is obvious. So it is not only ease, though I agree there is some of that, but what we value as well and what we take the object of inquiry to be. P's and D's problems focus attention on the structure of FL/UG. Chomsky keeps asking himself, I believe, what does this tell me about FL/UG? If the answer is "not much" then he moves on. If the answer is "possibly a lot" then he moves forward. This strikes me as a pretty good mantra, and it will foster some useful speculative thinking of a theoretical nature.

    2. I think it is right to keep focused on the central problems, Plato's problem and Darwin's problem: these are empirical constraints on the theories. We know that languages are learned (sorry, acquired) and we know that UG evolved. But Chomsky has essentially given up on Plato's problem, and instead seems more interested in non-empirical constraints like the SMT. That is a bad development, and I don't think theoretical linguistics should follow him down those paths.

      Other theoretical concepts that RC mentions above also lack any empirical motivation -- NTC and inclusiveness for example.

    3. "Given up" is inaccurate on reflection: "declared victory and left the field" is better I think.

    4. @Alex Clark "Other theoretical concepts that RC mentions above also lack any empirical motivation -- NTC and inclusiveness for example."

      Actually, this was Chris Collins (and hi, Chris . . . been a while), not me.

      --RC

    5. I think that Alex puts his finger on the central point. I have a post on this that I am in the middle of and that I hope to put up in a day or two. Where I might disagree with Alex is the status of Plato's problem in current theory. It's not that it's been abandoned or solved, but that another problem has been added that once again reconfigures the problem, in ways, actually, that I thought Alex would be sympathetic to, viz. there may be less linguistically dedicated acquisition machinery than earlier contemplated. At any rate, Alex has, IMO, made the critical observation: in linguistics, theory inevitably confronts the great meta-theoretical concerns and is evaluated in these terms. More soon, I hope.
