Faculty of Language: A query for my computational colleagues

Friday, January 17, 2014

A query for my computational colleagues

There appears to be a consensus that natural languages are mildly context sensitive (MCS). Indeed, as I understand matters, this is taken to be an important fact about them and one that deserves some explanation. Here's my question: dos this mean that there is consensus that Kobele's thesis is incorrect? As I recall, it argued that NLs do not display constant growth given that the presence of Copy operations. Again, my recollection is that this is discussed extensively in Greg's last chapter of the thesis. I was also led to believe that MCS languages must display constant growth, something incompatible with the sorts of copy mechanisms Greg identified (I cannot remember the language and I don't have the thesis to hand, but I am sure that you all know it). So, was Greg wrong? If so, how? And please use little words for this bear of little brain would love to know the state of play. Thx.

15 comments:

UnknownJanuary 17, 2014 at 10:52 AM
Short answer: yes, people more or less agree that all natural languages are MCS, but that does not entail that Greg's claims about Yoruba are incorrect.

Long answer: The whole notion of MCS isn't exactly well-defined, but you're correct that the constant growth property is one of the original four desiderata. In general, people agree that the MCS family includes the Tree Adjoining languages (TALs) and the multiple context-free languages (MCFLs), and if you ignore constant growth --- which I usually do because there still is no formalization that captures the original intent --- parallel multiple context-free languages (PMCFLs) are included, too. These classes form a proper hierarchy, such that every TAL is an MCFL (but not the other way round) and every MCFL is a PMCFL (but not the other way round): TAL < MCFL < PMCFL. The split between MCFLs and PMCFLs corresponds to whether movement can leave pronounced copies behind.

There is unassailable evidence that some natural languages are at least TALs, and until the late 90s there was little evidence that any natural language is not a TAL. That's where the MCS claim comes from. Since then new evidence has been discovered that some natural languages are PMCFLs but not TALs, for instance Greg's observations on free relatives in Yoruba. But some people remain unconvinced for various reasons (e.g. claiming that the posited structural dependencies are not supported by the data, that non-syntactic factors are involved, etc.). So right now it seems that the claim "all natural languages are PMCFLs" is true, while the status of "all natural languages are TALs" is less certain.

Addendum: If I remember correctly, Greg does not claim that Yoruba as a string language absolutely requires copying, but that the pattern instantiated by a linguistically plausible analysis of Yoruba cannot be captured without copying. So his argument is about strong rather than weak generative capacity.
ReplyDelete
Replies
Tim HunterJanuary 18, 2014 at 9:02 AM
If we leave aside the copying question --- for concreteness, say we adopt the more permissive definition of MCS, and therefore equate it with PMCFGs, such that Greg's analysis of Yoruba still counts as MCS --- then supposing that all natural languages are MCS still imposes another interesting upper bound: in MG terms, it shows up as the bound on the number of unchecked movement-triggering features you can have at any point in a derivation. This is the Shortest Move Constraint issue that we've talked about a couple of times before.

Personally I find this more interesting than the copying question, because it seems to say something about the derivations (and "derived structures", if they matter at all), whereas the copying question seems to be "just" about how those derivations are mapped to strings. But I'm not sure if there's really any good reason to be more interested in one than the other. (Is there? Does anyone else share my gut-feeling?)
ReplyDelete
Replies

Add comment

Faculty of Language

Comments

Friday, January 17, 2014

A query for my computational colleagues

15 comments:

Contributors