Faculty of Language: Universals; structural and substantive

Tuesday, February 27, 2018

Universals; structural and substantive

Linguistic theory has a curious asymmetry, at least in syntax. Let me explain.

Aspects distinguished two kinds of universals, structural vs substantive. Examples of the former are commonplace: the Subjacency Principle, Principles of Binding, Cross Over effects, X’ theory with its heads, complements and specifiers; these are all structural notions that describe (and delimit) how Gs function. We have discovered a whole bunch of structural universals (and their attendant “effects”) over the last 60 years, and they form part of the very rich legacy of the GG research program.

In contrast to all that we have learned about the structural requirements of G dependencies, we have, IMO, learned a lot less about the syntactic substances: What is a possible feature? What is a possible category? In the early days of GG it was taken for granted that syntax, like phonology, would choose its primitives (atomic elements) from a finite set of options. Binary feature theories based on the V/N distinction allowed for the familiar four basic substantive primitive categories A, N, V, and P. Functional categories were more recalcitrant to systematization, but if asked, I think it is fair to say that many a GGer could be found assuming that functional categories form a compact set from which different languages choose different options. Moreover, if one buys into the Borer-Chomsky thesis (viz. that variation lives in differences in the (functional) lexicon) and one adds a dash of GB thinking (where it is assumed that there is only a finite range of possible variation) one arrives at the conclusion that there are a finite number of functional categories that Gs choose from and that determine the (finite) range of possible variation witnessed across Gs. This, if I understand things (which I probably don’t (recall I got into syntax from philosophy not linguistics and so never took a phonology or morphology course)), is a pretty standard assumption within phonology tracing back (at least) to Sound Patterns. And it is also a pretty conventional assumption within syntax, though the number of substantive universals we find pale in comparison to the structural universals we have discovered. Indeed, were I incline to be provocative (not something I am inclined to be as you all know), I would say, that we have very few echt substantive universals (theories of possible/impossible categories/features) when compared to the many many plausible structural universals we have discovered.

Actually one could go further, so I will. One of the major ambitions (IMO, achievements) of theoretical syntax has been the elimination of constructions as fundamental primitives. This, not surprisingly, has devalued the UG relevance of particular features (e.g. A’ features like topic, WH, or focus), the idea being that dependencies have the properties they do not in virtue of the expressions that head the constructions but because of the dependencies that they instantiate. Criterial agreement is useful descriptively but pretty idle in explanatory terms. Structure rather than substance is grammatically key. In other words, the general picture that emerged from GB and more recent minimalist theory is that G dependencies have the properties they have because of the dependencies they realize rather than the elements that enter into these dependencies.[1]

Why do I mention this? Because of a recent blog post by Martin Haspelmath (here, henceforth MH) that Terje Lohndal sent me. The post argues that to date linguists have failed to provide a convincing set of atomic “building blocks” on the basis of which Gs work their magic. MH disputes the following claim: “categories and features are natural kinds, i.e. aspects of the innate language faculty” and they form “a “toolbox” of categories that languages may use” (2-3). MH claims that there are few substantive proposals in syntax (as opposed to phonology) for such a comprehensive inventory of primitives. Moreover, MH suggests that this is not the main problem with the idea. What is? Here is MP (3-4):

To my mind, a more serious problem than the lack of comprehensive proposals is that linguistics has no clear criteria for assessing whether a feature should be assumed to be a natural kind (=part of the innate language faculty).

The typical linguistics paper considers a narrow range of phenomena from a small number of languages (often just a single language) and provides an elegant account of the phenomena, making use of some previously proposed general architectures, mechanisms and categories. It could be hoped that this method will eventually lead to convergent results…but I do not see much evidence for this over the last 50 years.

And this failure is principled MH argues relying that it does on claims “that cannot be falsified.”

Despite the invocation of that bugbear “falsification,”[2] I found the whole discussion to be disconcertingly convincing and believe me when I tell you that I did not expect this. MH and I do not share a common vision of what linguistics is all about. I am a big fan of the idea that FL is richly structured and contains at least some linguistically proprietary information. MP leans towards the idea that there is no FL and that whatever generalizations there might be across Gs are of the Greenberg variety.

Need I also add that whereas I love and prize Chomsky Universals, MH has little time for them and considers the cataloguing and explanation of Greenberg Universals to be the major problem on the linguist’s research agenda, universals that are best seen as tendencies and contrasts explicable “though functional adaptation.” For MH these can be traced to cognitively general biases of the Greenberg/Zipf variety. In sum, MH denies that natural languages have joints that a theory is supposed to cut or that there are “innate “natural kinds”” that give us “language-particular categories” (8-9).

So you can see my dilemma. Or maybe you don’t so let me elaborate.

I think that MH is entirely incorrect in his view of universals, but the arguments that I would present would rely on examples that are best bundled under the heading “structural universals.” The arguments that I generally present for something like a domain specific UG involve structural conditions on well-formedness like those found in the theories of Subjacency, the ECP, Binding theory, etc. The arguments I favor (which I think are strongest) involve PoS reasoning and insist that the only way to bridge the gap between PLD and the competence attained by speakers of a given G that examples in these domains illustrate requires domain specific knowledge of a certain kind.[3]

And all of these forms of argument loose traction when the issue involves features, categories and their innate status. How so?

First, unlike with the standard structural universals, I find it hard to identify the gap between impoverished input and expansive competence that is characteristic of arguments illustrated by standard structural universals. PLD is not chock full of “corrected” subjacency violations (aka, island effects) to guide the LAD in distinguishing long kosher movements from trayf ones. Thus the fact that native speakers respect islands cannot be traced to the informative nature of the PLD but rather to the structure of FL. As noted in the previous post (here), this kind of gap is where PoS reasoning lives and it is what licenses (IMO, the strongest) claims to innate knowledge. However, so far as I can tell, this gap does not obviously exist (or is not as easy to demonstrate) when it comes to supposing that such and such a feature or category is part of the basic atomic inventory of a G. Features are (often) too specific and variable combining various properties under a common logo that seem to have little to do with one another. This is most obvious for phi-features like gender and number, but it even extends to categories like V and A and N where what belongs where is often both squishy within a G and especially so across them. This is not to suggest that within a given G the categories might not make useful distinctions. However, it is not clear how well these distinctions travel among Gs. What makes for a V or N in one G might not be very useful in identifying these categories in another. Like I said at the outset, I am not expert in these matters, but the impression I have come away with after hearing these matters discussed is that the criteria for identifying features within and across languages is not particularly sharp and there is quite a bit of cross G variation. If this is so, then the particular properties that coagulate around a given feature within a given G must be acquired via experience with that that particular feature in that particular G. And if this is so, then these features differ quite a bit in their epistemological status from the structural universals that PoS arguments most effectively deploy. Thus, not only does the learner have to learn which features his G exploits, but s/he even has to learn which particular properties these features make reference to, and this makes them poor fodder for the PoS mill.

Second, our theoretical understanding of features and categories is much poorer than our understanding of structural universals. So for example, islands are no longer basic “things” in modern theory. They are the visible byproducts of deeper principles (e.g. Subjacency). From the little I can tell, this is less so for features/categories. I mentioned the feature theory underlying the substantive N,V,A,P categories (though I believe that this theory is not that well regarded anymore). However, this theory, even if correct, is very marginal nowadays within syntax. The atoms that do the syntactic heavy lifting are the functional ones, and for this we have no good theoretical unification (at least so far as I am aware). Currently, we have the functional features we have, and there is no obvious theoretical restraint to postulating more whenever the urge arises. Indeed, so far as I can tell, there is no theoretical (and often, practical) upper bound on the number of possible primitive features and from where I sit many are postulated in an ad hoc fashion to grab a recalcitrant data point. In other words, unlike what we find with the standard bevy of structural universals, there is no obvious explanatory cost to expanding the descriptive range of the primitives, and this is too bad for it bleaches featural accounts of their potential explanatory oomph.

This, I take it, is largely what MH is criticizing, and if it is, I think I am in agreement (or more precisely, his survey of things matches my own). Where we part company is what this means. For me this means that these issues will tell us relatively little about FL and so fall outside the main object of linguistic study. For MH, this means that linguistics will shed little light on FL as there is nothing FLish about what linguistics studies. Given what I said above, we can, of course, both be right given that we are largely agreeing: if MH’s description of the study of substantive universals is correct, then the best we might be able to do is Greenberg, and Greenberg will tell us relatively little about the structure of FL. If that is the argument, I can tag along quite a long way towards MH’s conclusion. Of course, this leaves me secure in my conclusion that what we know about structural universals argues the opposite (viz. a need for linguistically specific innate structures able to bridge the easily detectable PoS gaps).

That said, let me add three caveats.

First, there is at least one apparent substantive universal that I think creates serious PoS problems; the Universal Base Hypothesis (UBH). Cinque’s work falls under this rubric as well, but the one I am thinking about is the following. All Gs are organized into three onion like layers, what Kleanthes Grohmann has elegantly dubbed “prolific domains” (see his thesis). Thus we find a thematic layer embedded into an agreement/case layer embedded into an A’/left periphery layer. I know of no decent argument arguing against this kind of G organization. And if this is true, it raises the question of why it is true. I do not see that the class of dependencies that we find would significantly change if the onion were inversely layered (see here for some discussion). So why is it layered as it is? Note that this is a more abstract than your typical Greenberg universal as it is not a fact about the surface form of the string but the underlying hierarchical structure of the “base” phrase marker. In modern parlance, it is a fact about the selection features of the relevant functional heads (i.e. about the features (aka substance) of the primitive atoms). It does not correspond to any fact about surface order, yet it seems to be true. If it is, and I have described it correctly, then we have an interesting PoS puzzle on our hands, one that deals with the organization of Gs which likely traces back to the structure of FL/UG. I mention this because unlike many of the Greenberg universals, there is no obvious way of establishing this fact about Gs from their surface properties and hence explaining why this onion like structure exists is likely to tell us a lot about FL.

Second, it is quite possible that many Greenberg universals rest on innate foundations. This is the message I take away from the work by Culbertson & Adger (see here for some discussion). They show how some order within nominals relating Demonstratvies, Adjectives, Numerals and head Nouns are very hard to acquire within an artificial G setting. They use this to argue that their absence as Greenberg options has a basis in how such structures are learned. It is not entirely clear that this learning bias is FL internal (it regards relating linear and hierarchical order) but it might be. At any rate, I don’t want anything I said above to preclude the possibility that some surface universals might reflect features of FL (i.e. be based on Chomsky Universals), and if they do it suggests that explaining (some) Greenberg universals might shed some light on the structure of FL.

Third, though we don’t have many good theories of features or functional heads, a lazy perusal of the facts suggest that not just anything can be a G feature or a G head. We find phi features all over the place. Among the phi features we find that person, number and gender are ubiquitous. But if anything goes why don’t we find more obviously communicatively and biologically useful features (e.g. the +/- edible feature, or the +/- predator feature, or the +/- ready for sex feature or…). We could imagine all sorts of biologically or communicatively useful features that it would be nice for language to express structurally that we just do not find. And the ones that we do find, seem from a communicative or biological point of view to often be idle (gender (and, IMO, case) being the poster child for this). This suggests that whatever underlies the selection of features we tend to see (again and again) and those that we never see is more principled than anything goes. And if that is correct, then what basis could there be for this other than some linguistically innate proclivity to press these features as opposed to those into linguistic service. Confession: I do not take this argument to be very strong, but it seems obvious that the range of features we find in Gs that do grammatical service is pretty small, and it is fair to ask why this is so and why many other conceivable features that we could imagine would be useful are nonetheless absent.

Let me reiterate a point about my shortcomings I made at the outset. I really don’t know much about features/categories and their uniform and variable properties. It is entirely possible that I have underestimated what GG currently knows about these matters. If so, I trust the comments section will set things straight. Until that happens, however, from where I sit I think that MH has a point concerning how features and categories operate theoretically and that this is worrisome. That we draw opposite conclusions from these observations is of less moment than that we evaluate the current state of play in roughly the same way.

[1] This is the main theme of On Wh Movement and I believe what drives the unification behind Merge based accounts of FL.

[2] Falsification is not a particularly good criterion of scientific adequacy, as I’ve argued many times before. It is usually used to cudgel positions one dislikes rather than push understanding forward. That said, in MH, invoking the F word does not really play much more than an ornamental role. There are serious criticisms that come into play.

[3] I abstract here from minimalist considerations which tries to delimit the domain specificity of the requisite assumptions. As you all know, I tend to think that we can reduce much of GB to minimalist principles. The degree to which this hope is not in vain, to that degree the domain specificity can be circumscribed to whatever it is that minimalism needs to unify the apparently very different principles of GB and the generalizations that follow from them.

50 comments:

UnknownFebruary 27, 2018 at 3:34 PM
This is a topic that's very close to my own research interests because

1) the choice of feature system can have a huge effect on computational complexity (which level of the subregular hierarchy does a given dependency belong to?)

2) I showed several years ago that every syntactic theory where subcategorization requires exact matches ("I select a DP, only a DP, and nothing but a DP!") overgenerates on a massive scale because we have no theory of categories, so all kinds of information can be indirectly encoded in category distinctions.

Imho the main problem is that research has focused on arguing for or against specific feature systems/category hierarchies instead of identifying basic properties such a system must have.

For example, is a category system a flat unordered set or are there necessarily implicational relations such that if X selects Z it can also select Y (e.g. everything that can select a numeral can select a noun). Or more radically, could a natural language have a system where every lexical item is its own category, but only this one category? If not, why not?

The same goes for feature systems. Rather than argue about the specifics of certain feature decompositions or feature geometries, we should focus on general properties a feature system must have. A recent example is the work by Bobaljik & Sauerland on deriving the *ABA generalization from content-agnostic feature combinatorics.

A more radical approach is to completely give up on describing anything in terms of features and categories. That's the route I've been taking in my approach to the *ABA generalization and morphosyntax in general. I've also looked a bit at what happens if you remove category features from syntax, and it has some nice effects:

For example, you predict that all categorial ambiguity is resolvable within a local context. Suppose that a head can no longer say "I want a noun" but instead has to say "I can take cat, or dog, or water, or ...". The problem here is that 'water' may also be a verb. But you can tell immediately that it isn't if it has already selected 'the'. In some cases you may have to look a little deeper, but you shouldn't have to look arbitrarily deep to find a disambiguating lexical item. As far as I can tell, this holds for pretty much all natural languages. If category features are real, it's much less clear why this property should hold since the grammar never faces any ambiguity to begin with.

The generalization that heads don't care about the arguments of their arguments can also be explained in this fashion, but is mysterious in a system with category features since we have no criterion to rule out "Possessive determiner selecting an animate DP and an inanimate mass noun" as a possible category.
ReplyDelete
Replies
Peter SvenoniusFebruary 27, 2018 at 3:56 PM
I agree with Norbert that there are good reasons to treat structure and feature content differently, and in fact I would argue further for separating feature content (like past, plural, distal) from the kinds of syntactic instructions features can carry (like move or agree, or whatever distinguishes those two).

But I think there are good candidates for PoS-type arguments even among the contentful features. Take phi features of person, number, and gender, which have some special relationship to DP modification and to argument agreement. One of the phi-features is number, where languages very commonly make sg-pl or sg-du-pl distinctions and occasionally a handful of other distinctions (trial, paucal, some few other, and according to Daniel Harbour, all analyzable in terms of primitives like atomic, minimal, and augmented).

As a phi feature, number marking commonly shows up in nominal morphology, including on determiners and adjectives, agreement morphology on the predicate, and the pronominal system.

A question is, what is it that makes the singular-plural distinction linguistically special in this particular way, as opposed to, say, a three-way distinction between (i) singulars, (ii) collections which are connected or otherwise move together (bunches of grapes, fingers on a hand, wolves in a pack, a swarm of bees, floats or weights on a net, fringe on a garment, ripples on a pond), and (iii) pluralities that don't move together (groups of stones, rabbits, sunflowers, arrows not quivered, etc.)? Or items and collections of items which are manageable enough to carry versus ones that aren’t? Or any number of other ways of categorizing ways of grouping things.

The distal-proximal distinction is so common that it is a good candidate for a universal feature of demonstratives, but it's not a phi-feature. In principle it could be a phi feature and get copied onto the noun and any attributive adjectives, and get coded in the argument agreement on the verb, the way plural commonly does -- but it doesn’t.

So you commonly find "these-du good-du hunters-du caught-du.subj-pl.obj some-pl brown-pl kangaroos-pl" but not "these-proximal good-proximal hunters-proximal caught-proximal.subj-distal-obj that-distal brown-distal kangaroo-distal." There are some cases that look a little like proximal/distal agreement, but they're rare and could possibly be analyzed another way, whereas plural agreement is common and often mechanical.
ReplyDelete
Replies
markFebruary 28, 2018 at 7:10 AM
Thus we find a thematic layer embedded into an agreement/case layer embedded into an A’/left periphery layer. I know of no decent argument arguing against this kind of G organization. And if this is true, it raises the question of why it is true. I do not see that the class of dependencies that we find would significantly change if the onion were inversely layered (see here for some discussion). So why is it layered as it is?

At least part of this seems amenable to an account in functional/processing terms. As a first shot, the periphery (left and right) is a key site for discourse-relevant stuff; left and right periphery positions have long been known to be key sites for things like topicalisation, question word fronting, and discourse markers & particles (cf. Lambrecht on information structure, basically all work in interactional linguistics, or Martina Wiltschko's work for a more formal take on same). Given this, it is actually highly unlikely that a reversely layered onion would be able to work in the same way: given how we use language in interaction, you want to have the interactionally relevant stuff in the periphery where it can do things like draw attention, manage discourse expectations, and so on.

I'm not sure what the account would be for the relative ordering of thematic vs. case/agreement features but it is probably a useful idea to look to functional/processing considerations first — things like salience, reference tracking, given/new etc. are likely to play an important role in explaining the layered structure of the clause (indeed, in using that phrase, I realise that this is one of the slogans of Role and Reference Grammar, which offers another functionally motivated take on this).
ReplyDelete
Replies
William MatchinFebruary 28, 2018 at 1:31 PM
With respect to the "anything does not go" point, I think that examining sign languages can be illuminating. I am not a formal expert on American Sign Language, but space definitely seems to be a phi feature, i.e. you get subject- or object-verb agreement depending on the assigned location of the referent in space. So it seems exactly like the availability of space affords sign languages the option of using space as a phi-feature while preventing its use in spoken languages.
ReplyDelete
Replies
OlivierMarch 4, 2018 at 6:53 AM
"The atoms that do the syntactic heavy lifting are the functional ones, and for this we have no good theoretical unification (at least so far as I am aware). Currently, we have the functional features we have, and there is no obvious theoretical restraint to postulating more whenever the urge arises. Indeed, so far as I can tell, there is no theoretical (and often, practical) upper bound on the number of possible primitive features"

Hm. I wonder if there are two intermingled issues here: how many features do we have (and could we have) and what are they?

The first questions seems relatively tractable to me. Suppose features are all abstract and ±. How about "we have as many as needed to make binary structures suitable (for instance for the LCA or for lexical insertion) but not more?" for an answer? Note that this would ultimately collapse generalizations about features and categories into structural ones (because features exist to deal with structures).

The second (that is the why person but not edible? or why number but not proximal?) is more mysterious to me. In fact, I find it especially mysterious for those supposedly linked to the left-periphery (why focus? hell, if FL evolved independently from communication, why ±wh?).

If I had to offer a speculation, I would say (gun to my head) that all commonly found features ultimately reflect one specific human cognitive ability which is clearly linked with language acquisition (yet is rarely discuss explicitly as such, at least in syntax), namely pointing. But the road is long between that and even the semblance of a theory.
ReplyDelete
Replies
Martin HaspelmathMarch 9, 2018 at 9:44 PM
Thanks for discussing my blog post! As I have said repeatedly elsewhere (e.g. here https://dlc.hypotheses.org/961), I don't deny the language instinct, and I find the PoS for it quite convincing. But being more of a linguist than a philospher, I'd like to see worked out proposals for structural universals that actually work or that can at least be tested. So I don't see that claimed structural universals (ECP, X' theory, Binding theory) fare much better than substantive universals. I know that many of my empirically minded colleagues are more optimistic for the future, but what seems to be clear is that even after five decades of generative grammar, we have many more new ideas than stable results.
ReplyDelete
Replies
NorbertMarch 10, 2018 at 2:39 AM
I am not surprised that you don’t buy the claims, though I am glad you buy the logic. I would not however, contrary to something your last sentence implied, there has been remarkable stability in the basic findings in GG since its inception. So, we have no good evidence that there exist lowering rules, or that “adjuncts”can extract out of islands, or that movement can target non CC positions, or that control into embedded object positions exists or that a head can select the complement of its complement or...I could go on. These principles are firmly established and I take your skepticism about them to be more a reflection of an (no doubt admirable) general skeptical attitude than a conviction based in an appreciation of the material. But that is just my view.
ReplyDelete
Replies
Martin HaspelmathMarch 10, 2018 at 5:06 AM
There may be no good enough evidence for lowering rules, but actually, there is no good enough evidence for most of the tree structures that we see routinely in syntax papers – so the claim that no lowering rules exist is almost impossible to evaluate. And in my perspective, the notions "head", "adjunct", and "embedded object" are substantive categories, just like "verb" and "noun". Note that I'm not generally skeptical – in fact, in my Stuttgart talk a few days ago, I made a large number of empirical claims about universals (of the Greenbergian type). I would say that I am merely asking for more rigour in stating the universal claims.
ReplyDelete
Replies
Marc van OostendorpMarch 11, 2018 at 9:23 AM
I don't think there is a large consensus anymore that it would be possible to find very deep or interesting substantive universals in phonology either. The main piece of evidence for features is that certain sounds behave as a natural class for phonological processes and there is quite a catalogue of sounds behaving in the same way for which there is no obvious 'natural' phonological feature. And even rather superficial looking universals such as 'every language has a [t] sound' require quite some level of abstraction if we want them to be true (for example: is it enough if there is some sound which has [t] as an allophone?) There is even no consensus on what the substance should be (whether it is articulatory or acoustic, for instance).
Although there is still a relatively large group of phonologists who work under the assumption that there is some universal list of phonological features, there is also a rather large group of scholars who believe that there may be features but these are purely abstract labels, which the learner can then link to some set of phonetic events. All 'substantive' universals in this respect are then supposed to follow from mechanics, acoustics, etc., alone; e.g. the fact that all or most languages have a [t] follows from the fact that this sound is easy to make and easy to distinguish. But what makes this into language is that we force it into the frame of features.
ReplyDelete
Replies
Martin HaspelmathMarch 12, 2018 at 4:06 AM
No, the non-convergence of proposals in GG is not primarily an empirical issue – it's not only that cross-linguistic evidence is not taken into account sufficiently when making general claims. The biggest problem is the point that Marc makes: "The main piece of evidence for features is that certain phenomena behave as a natural class" – but there are many different possible classifications, and most claims in GG papers rely on one of many possibilities that happens to be currently widely adopted (e.g. strict binary branching of trees). This means that the proposals are not really testable, even if one had a lot of cross-linguistic data.
ReplyDelete
Replies
Martin HaspelmathMarch 12, 2018 at 1:00 PM
Thanks for continuing the discussion, but the problem is not that any claims are wrong - the problem is in the presuppositions, e.g. that all of syntax works in terms of binary trees. Linguists who grew up in the GG paradigm may simply not think about this, which is why I keep emphasizing the point: binary branching trees are not a discovery, but a notational decision. And transformational rules themselves are notations for which there are alternatives. So I cannot evaluate the claim that no lowering rules exist - I see this as a claim about notational preferences, not one about languages. It is true, however, that wh-fronting is frequent and wh-rightward shifting is hardly attested, and this is indeed a discovery of linguists inspired by GG (though one that involves the substantive notion wh-).
ReplyDelete
Replies
AveryAndrewsMarch 12, 2018 at 10:01 PM
One point here is that there is no need to *assume* that *all* syntax is binary branching in order to collect and contemplate arguments that at least some of it is. So one point that I find interesting is that if we assume binary branching, the standard spec-before, complement after order, and that quirky case requires a kind of tree-adjency, then we get explanations for the fact that in Icelandic, you get to have at most two quirky case-marked NP (because a third at the top would have the one in spec in the way), and of various differences in properties between the first and second quirky NP.

If you also suppose that the positions of the two objects in double double object constructions are not strongly fixed, various other things also seem to get explained, as detailed in work by Anagnostopoulou, Georgala (Euythymia), and others.

Sometimes it relatively easy to restate the results nonbinary terms, and some times it's harder, but it does *not* have to be taken as an assumption.

I'll add that although binarism is sometimes seen as a wierd linguistic thing, perhaps instead it is a more general thing having to do with efficient memory organization. Consider that if the 'now or never' principle of the _Creating Language_ book is accepted, we need to bang events into some kind of structural format extremely rapidly in order to remember anything about them, and binary trees are optimal for storage and search under many circumstances
ReplyDelete
Replies
Martin HaspelmathMarch 13, 2018 at 6:50 AM
"Principle, A, B, C... These theories have empirical backing and theoretical motivation" – I've long wondered about the empirical backing of Chomsky's binding theory, which works fairly well as a description of English, but what does it say about human language? The notions "anaphor", "pronoun" and "r-expression" are categories, and if these cannot be identified across languages (as Norbet seems to tend to agree), then the binding theory is not a theory of human language. I have a 2008 paper (https://zenodo.org/record/1197122) where I discuss a range of universals involving "anaphors", based on a rigorous cross-linguistic definition of this notion. I make no use of the devices of GG.
ReplyDelete
Replies
Martin HaspelmathMarch 16, 2018 at 1:04 PM
I mean languages as conventional (or normative) systems, as the term is used in everday language. I don't think many people mean the output when they say language, and we don't know much about our internal systems for conforming to the conventions.
ReplyDelete
Replies

Add comment