Faculty of Language: Pullum on the competence-performance distinction

Friday, July 3, 2015

Pullum on the competence-performance distinction

In an earlier post I discussed a nice little piece by Geoff Pullum (GP) on Aspects after 50 years (here). GP has a second interesting post (here) that discusses a central distinction highlighted in Aspects; the competence-performance distinction (CPD). As he rightly notes, this distinction has caused endless cognitivist’s dissonance. Like GP, I don’t really understand why. It’s a distinction that is rampant, for example, in statistical conceptions of cognition. The trivial distinction between the structure of the hypothesis space and the actual distributions over that space is a version of the CPD. Nobody takes this to be a controversial distinction.[1] If so, we can take a theory of the hypothesis space (what’s its geography?) to be a theory of competence and a theory of how this space gets filled probabilistically to be a theory of performance. This common distinction then provides a first okish pass at the CPD. It is not perfect, but it gets you a lot of the way there.

Another useful explication is that a theory of competence aims to limn the limits of the possible. In syntax we aim to describe the possible sentences of a language in terms of the unbounded products of a given G. We aim to describe the possible Gs in terms of the products of FL/UG. We aim to describe the possible FLs… The sentences/ Gs/FLs we actually encounter are points in a space of sentences/Gs/FLs that could exist. Theories of what could be are theories of competence.

Ontologically, theories of competence ground theories of performance. This is why the CPD is rife with Empiricist/Rationalist (E/R) baggage, which chapter 1 of Aspects elaborates on. This is also why, IMO, it has proven to be so difficult a concept to get across. Empiricists treat competence as a conceptually derivative notion. First there is performance. Competence, on the E view, is smoothed performance, performance with the variance squeezed completely out, non-noisy performance. This, however, is not what the CPD demarcates. It is essentially a Rationalist notion, pointing to hidden structure, which actual linguistic items can be used to reveal.

I mention this because, this seems quite different from the way that GP frames matters in his post.[2] For GP, following Edward Lorenz, competence is “what you expect,” and performance is “what you get.” He then elaborates this in terms of what we will find if we squeeze out all the “sporadic and unintended mistakes” of performance. So competence is cleaned up performance. But it isn’t. For example, many many G products will never be performed, (im)perfectly or otherwise, yet they are explanada of a competence theory of G. Humans may fail to internalize many many Gs, yet these Gs may be possible and Gs that at theory of FL/UG should permit. So, what we see, even in statistically cleaned up form (i.e. with all the nosiy variance squeezed out) is not what competence is about.

Of course, we hope that we can get a window into the possible by investigating what is actually deployed. Thus, actual performances are the source of our linguistic data. What people say, how they judge things we present to them, etc. These data are used to plumb the limits of the possible. And some data is perhaps better than others for this purpose. So, if you are interested in what linguistic knowledge consists in (i.e. knowledge of a G) then some data might be better suited than others for probing this. What kind? Well, those that do not run afoul of factors such as limited memory, slips of the tongue, etc. that we think might confuse matters. So, we clean up the relevant data we use to plumb G competence. These gussied up linguistic objects are not themselves the targets of explanation. Rather they act as probes into the structure of a G or of FL/UG. In other words, the target of a theory of competence in linguistics (i.e. what we want a theory of competence to explain the properties of) is a G or FL/UG. The intuitions we harvest and the utterances we track are what we use to investigate these structures. The point that Chomsky makes in Aspects concerning the ideal speaker-hearers simply adverts to the fact that some data (e.g. those that abstract away from performance factors like memory and attention limitations) are nosier than others and so less useful as probes of the G and FL/UG systems. In other words, Aspects makes the undeniably correct point so some data are plausibly more useful than others if one’s aim is to discern the structure of linguistic knowledge.

GP interprets the discussion in Aspects quite differently. He understands Chomsky to have been proposing that the “subject matter” of linguistics was “speaker intuitions about sentence structure.” These intuitions are purer than linguistic performances (e.g. utterances) in that they abstract away from (and here GP quotes Chomsky) “grammatically irrelevant conditions as memory limitations, distractions, shifts of attention and interest, and errors (random or characteristic) in applying his knowledge of this language in actual performance.” However, even if we grant this (which we should), this does not make these intuitions the subject matter of linguistics. No! These intuitions are simply data, evidence that linguists can use effectively to understand the structure of Gs and FL/UG. Chomsky does claim that such intuitions are rich sources of information about the structure of G knowledge and might suggest that they are superior data to corpora data (i.e. recorded speech of actual utterances). Moreover, he clearly thinks that we should dump the prejudice against speakers’ judgments that behaviorism saddled us with. However, Chomsky does not argue in Aspects (so far as I can tell) that intuitions are epistemologically or ontologically privileged data, just that they don’t suffer from some of the problems we might think would mislead investigation.

This is almost certainly correct, and it is not the same as suggesting that these data are the subject matter of linguistics. The subject matter, (aka aim of inquiry) is, as Chomsky puts it, to describe “the mental reality underlying actual behavior.”

I should add that the idea that intuitions are privileged in some way leads to all sorts of misconceptions that psychologists then spend so much time lecturing linguists about. Such judgments are themselves complex performances, with all the problems that performances entail. The most that can be said for intuitions is that have proven to be excellent probes and that they are very stable, robust, and easy to gather. These are important virtues, but they don’t argue for intuitions as such being privileged, as would be the case were intuitions the subject matter of linguistics.

In sum, if I got GP right here (but see note 2), then I think he gets the CPD wrong. What he presents is the Eish version of the CPD. Aspects takes an irreducibly Rish conception of the aim of inquiry and the CPD is intended to highlight this approach. If this is right, then the main problem with getting others to understand the competence/performance distinction lies in them getting to see how closely it is related to an Rish conception of mind (and science actually (see here)). But Rationalism is largely anathema to many practicing neuro-cog types and so the distinction is hard for them to understand (and accept). This is to be expected. On an Eish conception, the most one can make of the notion lies in the quality of data. It marks a distinction between types of data: Performance data is “messy” “competence” data is not. The latter is privileged for that reason. However, this is not the point that the CPD is intended to highlight. It highlights the difference between the products of an underlying mechanism and the mechanism itself. In other words, it highlights the claim that the subject matter of linguistics is Gs and FL. In other words, the CPD carries within itself the project of modern Generative Grammar, and that’s why understanding it is so very important.

[1] See the Amy Perfors quote here.

[2] I say “might be” because it is possible to read GP as making the same point as I am making here. If so, great. However, there is another reading where he sees competence as non-noisy performance. But, I may be misreading him here and if so my sincere apologies to GP. That said, the two ways of interpreting the CPD are important so I will continue putting my own construal on GP’s elaboration.

27 comments:

Colin PhillipsJuly 4, 2015 at 10:00 AM
I disagree with your diagnosis of the widespread squeamishness about the CPD. The problem is not so much that folks misunderstand it as that they find its deployment frustrating. Chomsky's use of the terms has been consistently clear about what he takes Competence to be, and consistently ambiguous about what Performance refers to, and hence what the CPD is. That is visible already in Ch 1 of Aspects, and it is inherited by most users since that time. The usage encompasses both the notion that you have in mind, and the one that Geoff Pullum has in mind, and more.

You will find lots of people who are happy to say that they study linguistic competence. But you'll find almost nobody that says that they study performance. Instead, they would use a more specific description for what they study. So, 'competence' is mostly used by one group to assert what they are concerned with, 'performance' is almost exclusively used by that same group in the context of saying "don't bother me about X, Y, Z". And it also leads many in that group to assume that X, Y, and Z have something in common. (The term "gentile" has a similar flavor.)

My own main frustration with the CPD is that it invites confusion by conflating a variety of distinctions, some of which are necessary, and some of which are interesting empirical hypotheses. So, while I spend most of my life dealing with issues in the general ballpark of the CPD, I try to avoid the terminology, because it rarely clarifies anything. There is the distinction between a system and what it generates (necessary). There are distinctions between descriptions of a system at varying grains of analysis (indisputable; it's always a question of which grain size yields the most insight). There is the distinction between a task-neutral cognitive system and systems the are designed to deploy the task-neutral knowledge in specific tasks like speaking and understanding (interesting empirical hypothesis). And so on.

If all who are squeamish about the CPD are labeled as empiricists, that probably serves only to exacerbate their frustration. The Cognitive Revolution happened. I think that the dissatisfaction comes from the fact that the CPD is most commonly used to close down discussion.

Side note 1: but yes, I agree with you that intuitions are not the object of study.
Side note 2: and it's true that there are some who think that corpus data are privileged and intuitions are to be treated with skepticism. But they're a small subset of those who are nervous about CPD.
ReplyDelete
Replies
Alex DrummondJuly 5, 2015 at 1:05 PM
The thing I found most puzzling in Pullum's exposition of the CPD was the claim that Chomsky is interested in the intuitions of ideal speaker-hearers. On my understanding, the idealization to an ideal speaker-hearer comes into play when when we consider the problem of language acquisition. That is, we begin by trying to solve the “easy” version of the problem before we try to figure out how kids deal with non-homogenous speech communities, memory limitations, speech errors, etc. Chomsky is not telling us to interpret the intuitions of real speaker-hearers as approximations to the intuitions of imaginary ideal-speaker hearers.
ReplyDelete
Replies
Jeff LidzJuly 5, 2015 at 2:05 PM
I'm baffled by the exchange between Colin and Norbert. Nobody says they study "competence" any more than they say they study "performance" because on both ends, there are a gazillion topics. People who work on "competence" study stuff like binding, control, bounding, agreement, etc. People who work on "performance" study stuff like planning, prediction, attention, encoding for memory, maintenance in memory, retrieval from memory, etc. There is a sense in which those who live on the C-side of the CPD don't care about what happens on the P-side, and sometimes this is a perilous move (because maybe some fact that you thought was about the grammar was really about memory, or whatever), and sometimes it is completely innocuous because the intuitions about acceptability are so robust that they suffice for theory construction. If you work on the P-side of the CPD, then it is always perilous to make claims without paying attention to (at least some of) the contents of the competence theory because that theory at the very least specifies the data structures (or constraints on what the data structures have to encode) that the performance theory engages. This much, I am quite certain everyone in this discussion agrees with. So what's left? Explaining why some people are grumpy about the CPD. This, I think, has two parts. The first part is, I think, the sociological point that Colin is making, which is that people who are interested in psycholinguistics maybe think that the hard core grammatical theorists look down their noses at them because competence is important and performance is just noise. Certainly back when we were in graduate school and syntax was still king of the hill, there may have been some truth to that (and for all I know maybe there still is), but this is just noise that gets in the way of thinking. Those who feel slighted by the CPD should just get over themselves. The second part is what Norbert is talking about, and which is certainly right. Many people trained in psychology (and linguistics) think that psychology is the study of behavior (indeed, many psychology textbooks even begin with a sentence making that assertion), which is essentially an empiricist idea. Those people are confused about the CPD because they think the outputs of the system are the system itself, that there is no hammer without hammering. Those psychologists who think that psychology is the study of the mental structures that give rise to behavior have no problem with the CPD (at least in my experience). Those people learned the lessons of the cognitive revolution. I'm sure Colin agrees with the second part and that Norbert would assent to the sociological point. So, I don't see what you guys are grumbling about.
ReplyDelete
Replies
Colin PhillipsJuly 6, 2015 at 8:15 AM
Typical. Norbert and I try to stage an interesting fight, and then Lidz comes along and goes all conciliatory on us. Party pooper.

But seriously. I'm a bit confused too, as I wasn't looking to pick a fight (yawn) and I'm not feeling bent out of shape or trying to make a sociological point. I was simply making the suggestion that the CPD hasn't proven to be a great source of clarity. It's possible that in the context of the early 60s it did just what was needed, in helping to delimit a new research program. But I'm not sure that it has helped a lot since then. As this thread illustrates, it seems to be one of those things that people are confident that they understand, and are surprised to learn that others think of it differently.

Some differences that get swept together under the heading of the CPD.

1. Mechanism vs. its products. Easy.

2. Degrees of abstraction in characterizing a neurocognitive system (roughly equivalent to Marr levels, except that the notion that there are 3 distinct levels masks the many choices that you make when choosing where you'll find the most insight).

3. Differences in how the same neurocognitive system performs in different task settings, when information is available or withheld, e.g., in comprehension the meaning is withheld.

4. Differences between distinct neurocognitive systems with specific functions, e.g., "grammar" vs. "producer".
ReplyDelete
Replies

Add comment

Faculty of Language

Comments

Friday, July 3, 2015

Pullum on the competence-performance distinction

27 comments:

Contributors