tag:blogger.com,1999:blog-5275657281509261156.post6245495626541847270..comments2024-03-28T04:04:55.806-07:00Comments on Faculty of Language: Pullum on the competence-performance distinctionNorberthttp://www.blogger.com/profile/15701059232144474269noreply@blogger.comBlogger27125tag:blogger.com,1999:blog-5275657281509261156.post-22690069233854183452015-07-14T14:36:05.845-07:002015-07-14T14:36:05.845-07:00@Greg: It’s hard to take E&C’s closing call fo...@Greg: It’s hard to take E&C’s closing call for engagement very seriously, since in their view there is simply nothing for them to engage with. Generative syntax has yielded no “testable hypotheses” and no significant results. If that is really true, then we have nothing useful to offer and we should all just quit. I think that is what E&C would really like to say (which is perhaps why you described the review as diplomatic).Alex Drummondhttps://www.blogger.com/profile/04676457657606185543noreply@blogger.comtag:blogger.com,1999:blog-5275657281509261156.post-2444132521615227532015-07-10T19:11:28.076-07:002015-07-10T19:11:28.076-07:00@Omer: Alright, you've got me there. That ope...@Omer: Alright, you've got me there. That opener is fairly colorful...Greg Kobelehttps://www.blogger.com/profile/08006251459440314496noreply@blogger.comtag:blogger.com,1999:blog-5275657281509261156.post-55256369252383326132015-07-10T18:25:36.262-07:002015-07-10T18:25:36.262-07:00@Greg: I think if you've described the target ...@Greg: I think if you've described the target of your review (or the subject matter that the target discusses; I couldn't quite tell) as "more like a taxi ride gone bad than a free tour," you've pretty much taken 'diplomatic' off the table.Omerhttps://www.blogger.com/profile/06157677977442589563noreply@blogger.comtag:blogger.com,1999:blog-5275657281509261156.post-75289969490832570772015-07-10T18:00:18.862-07:002015-07-10T18:00:18.862-07:00This comment has been removed by the author.Omerhttps://www.blogger.com/profile/06157677977442589563noreply@blogger.comtag:blogger.com,1999:blog-5275657281509261156.post-13697499169302205662015-07-10T17:32:06.387-07:002015-07-10T17:32:06.387-07:00@Omer: Have you read it? My impression is that th...@Omer: Have you read it? My impression is that they don't think too highly of Lasnik's book, or of the transformational generative tradition in general. (They nowhere say this.) Still, their critiques are fair (i.e. about issues), and they close with a positive attempt at a rapprochement. How much more diplomatic could it get?Greg Kobelehttps://www.blogger.com/profile/08006251459440314496noreply@blogger.comtag:blogger.com,1999:blog-5275657281509261156.post-59011806818366447822015-07-10T13:20:07.183-07:002015-07-10T13:20:07.183-07:00@Greg: "I find the E&C review interesting...@Greg: "I find the E&C review interesting for a number of reasons; <i>they seem to be trying to be very diplomatic</i>, [...]"<br /><br />You're being sarcastic, right?Omerhttps://www.blogger.com/profile/06157677977442589563noreply@blogger.comtag:blogger.com,1999:blog-5275657281509261156.post-670647324441638192015-07-10T13:10:16.788-07:002015-07-10T13:10:16.788-07:00Thanks for the clarification, Norbert.
As for refe...Thanks for the clarification, Norbert.<br />As for references, I read many of Postal's complaints as involving excluding data points. Also, I read Edelman & Christiansen's review of Lasnik as complaining that syntactic theory is not re-engaging with (what I think they would call) less idealized data. I take these not to be isolated instances, but to reflect an underlying current of dissatisfaction with the relationship of minimalist syntax to data. I find the E&C review interesting for a number of reasons; they seem to be trying to be very diplomatic, and they suggest a way to engage with them, that is different (I think) from how we have been trying to engage with detractors.<br /><br />I am familiar with some of the literature on statistical inference. I'll remind you that part of pre-registration involves defining in advance your trimming criteria. There are schools of thought (for example, Minimum Description Length) according to which no trimming is allowed. Essentially, when you trim, you are hypothesizing that the data you got was the interaction of multiple factors (and you are guessing that the stuff you didn't trim is more reflective of the factor you care about). It is a hypothesis, and must ultimately be tested.Greg Kobelehttps://www.blogger.com/profile/08006251459440314496noreply@blogger.comtag:blogger.com,1999:blog-5275657281509261156.post-58484783302629574852015-07-09T07:41:12.283-07:002015-07-09T07:41:12.283-07:00@ Greg:
"Suppose someone runs an experiment,...@ Greg:<br /><br />"Suppose someone runs an experiment, plots the data, and<br />notices that, if it weren't for these pesky points, it'd fit<br />reasonably well to this curve. We might regard it as somewhat<br />sketchy if this person built their theory ignoring those points."<br /><br />Really? Did you ever sit in on a psych lab meeting? Did you ever notice that there is ALWAYS trimming in the sense of certain data points being excluded (the baby fussed out, the subject wasn't paying attention as can be seen by missing the question at the end, the subject was a vast outlier, etc). That's what is commonly done. Sometimes this is legit, sometimes not. There is NO general rule to tell ahead of time which is which. One needs to argue on a case by case basis. This is how things work.<br /><br />Amazingly, this is a LESS common problem in our part of the world. Why? Well because the data is generally very robust and for many sorts of cases the judgments are pretty crisp. Island and ECP effects fall into this by and large. So do many binding condition effects. There is some language variation, but there is also a lot of consensus over many data points. <br /><br />Given this, I don't think that our detractors find themselves in the position you note. I at least don't know of any. Maybe a reference would be useful. But say they did make your point. I would respond that you argue the cases one by one as you do in any scientific domain. So, say I think center embedding is a GRAMMATICAL fact about legit representations rather than a fact about memory, and you think the opposite. What does one do? Well what one always does: argue the details. But from my experience the skeptics NEVER want to argue the details. In fact, they refuse to consider the facts that the two of us agree are pertinent. I have friends who would love it if the other side would seriously engage. They almost NEVER do. That's the problem, not the CPD. That's at lest how I see it. They may sometimes complain about the CPD (though again I know of no examples), but IMO this is cover. They just don't want to seriously engage.<br /><br />As for absolute measures: we have evidence that this is better than that (so a 6 rather than a 4 on a rating scale). But we have no quantity that we are measuring (the analogue of mass, or charge, or force). We have no analogue unit like the Newton or measurable constant like the gravitational constant. So there is no Chomsky which is the unit of grammaticality that we can measure. This is too bad. For if there were we would be able to make predictions about the exact Chomsky value of a certain unacceptable sentence, i.e. how much is due to ungrammaticality and how much to other factors. We cannot do this right now, so the discussion is necessarily more imprecise. That's what I meant.Norberthttps://www.blogger.com/profile/15701059232144474269noreply@blogger.comtag:blogger.com,1999:blog-5275657281509261156.post-71277412076752134312015-07-08T14:31:03.660-07:002015-07-08T14:31:03.660-07:00@Norbert: Suppose someone runs an experiment, plot...@Norbert: Suppose someone runs an experiment, plots the data, and<br />notices that, <i>if it weren't for these pesky points, it'd fit<br />reasonably well to this curve</i>. We might regard it as somewhat<br />sketchy if this person built their theory ignoring those points. We<br />might want to know if there were some principled way to ignore points,<br />other than that they don't fit in with the theory. If that person<br />were to admonish us to not let those ignored points distract us from<br />how good their theory was, we might respond that, if they ignore<br />points that <i>don't</i> fit their theory, then by definition their<br />theory will fit the points that they don't ignore pretty well.<br /><br />I am suggesting that this is the position some of our detractors find<br />themselves in. I read your last comment as agreeing with me (?), but<br />then arguing that this is par for the scientific course. (?!)<br /><br />As for your remark about not having an absolute measure of any mental<br />quantity, sure; instead we have absolute measures of behavioural<br />quantities. (I think I must be misunderstanding you here.)Greg Kobelehttps://www.blogger.com/profile/08006251459440314496noreply@blogger.comtag:blogger.com,1999:blog-5275657281509261156.post-67984774887263622692015-07-08T12:33:21.966-07:002015-07-08T12:33:21.966-07:00@Greg
Maybe I am missing things, but data does no...@Greg<br /><br />Maybe I am missing things, but data does not come marked as competence data or performance data or any other kind of data. It's just data. Ahead of explanation there is no partition of the data. All we have are unacceptability judgments. These are used to construct various kinds of theories. One such theory is a competence theory. We claim that some difference in acceptability can/should be traced to some difference in their G status. Some unacceptabilities are ungrammaticalities. Other acceptability data might be explained in terms of memory limits (center embedding). Again the data use is acceptability data (at least in part). We all agree that this is kosher and standard practice. But you seem to suggest that there is something untoward about this as we are not engaging real data. But we are. What we are not doing is pre-specifying which data is relevant to evaluate competence from anything else. That's right. We don't and cannot. But then again, this is true in every domain of inquiry. Ahead of theorizing, we don't know what data is relevant for what, ever. This is no different here in linguistics. So if one is suspicious about this, then the suspicions have nothing to do with linguistic practice. They are more global, and frankly, I don't care to argue these in this blog. Go talk to your philosophy friends.<br /><br />The second point you make is that data is cleaned up; with various irrelevancies pruned. Again, this is not that different from what happens in general everywhere. Did the baby really look left? Was the angle of incidence really 27 degrees? Maybe it was 26.879? Our data is idealized just like this data is, with the BIG exception that nobody in linguistics (or the mental sciences more generally) have absolute measure of any quantity. We have no grammatometer to measure grammaticality or even absolute acceptability (e.g this is 4.6 Chomskys more acceptable than that) . But, nor does anyone else (or most everyone else, there are some psychophysics cardinal measures of things). It would be great to find some real values we could measure, but sadly, we don't have these right now. That said, Colin is right to note that what we do have is pretty good by psych standards.<br /><br />So, I must be obtuse (ok, I know I am) but I just don't get what you are fussing about. By psych standards our data rocks. By psych standards our theories are great. By most standards our methodological and measuring problems are pretty anodyne. This does not mean that every claim regarding competence based on acceptability data is right (think agreement attraction errors). There is guarantee. But who would have thought otherwise?Norberthttps://www.blogger.com/profile/15701059232144474269noreply@blogger.comtag:blogger.com,1999:blog-5275657281509261156.post-85049309106210187862015-07-08T08:20:21.727-07:002015-07-08T08:20:21.727-07:00I also don't want to have a scrap about judgme...I also don't want to have a scrap about judgment data; I think<br />judgments are perfectly fine data, and that informal elicited<br />judgments have been shown to be pretty robustly related to the results<br />of more controlled judgment tasks. (Although my recollection of the<br />Sprouse et al results is that they only tested pairwise judgments<br />(better-worse), and didn't test for consistency across the sample<br />(that stars are always worse than no stars).)<br /><br />I want to respond to Norbert's claim that there is no <i>idealized<br />data</i> (or at least that it is relevant to not what linguists do).<br />First off, it is absolutely right that when we elicit (however<br />informally) speaker judgments about anything that that is just as much<br />`real data' as anything else. So I agree with this 100%.<br /><br />Let me give a simple and concrete example of what I meant by idealized<br />data. Consider a center-embedded structure with 4 levels of embedding<br />(no pronouns, etc). This is judged to be very unacceptable, by<br />everybody. We all believe that the right account of this is to treat<br />it as grammatical, and to have the fact that it is unacceptable be the<br />consequence of memory limitations, or some such. I think this is a<br />perfectly legitimate move in general, and I think it is even the right<br />one in this case. However we are no longer accounting for the real<br />data with our theory of grammar. We could; we could penalize<br />ourselves for each divergence between acceptability and predicted<br />acceptability (i.e.~grammaticality), we could incorporate various<br />theories of memory limitations, and compare how much they reduce the<br />divergence against how complicated the resulting theory is. But we<br />don't. And this is ok, it is a perfectly reasonable way to do things,<br />but we can't evaluate whether it was `right' until we reengage with<br />the `real data' in the manner mentioned just above.<br /><br />This is a straight-up competence performance deal, but I introduced<br />the term `idealized data' because I wanted to include some other stuff<br />as well. When doing fieldwork, you don't report the actual responses<br />you got (usually), but rather some informal statistic thereof. You<br />never see `speaker 1 hemmed and hawwed before accepting it, speaker 2<br />said that maybe I could say that, but he wouldn't, speaker 3 offered a<br />reformulation instead of a judgment, speaker 4 said it was fine, but he says everything is fine', rather you see that the sentence<br />in question is unacceptable. This would be a straightforward and<br />unproblematic instance of dimensionality reduction, except that the<br />procedure is informal.<br /><br />These are two kinds of instances of doing things the relation of which<br />to the actual collected data is not clear. For that reason, I called<br />it `idealized data'. My intent was not to impugn or malign it, but<br />rather to point out that people acting rationally might have qualms<br />about the empirical relevance of our work at these points.Greg Kobelehttps://www.blogger.com/profile/08006251459440314496noreply@blogger.comtag:blogger.com,1999:blog-5275657281509261156.post-87595318129898581792015-07-08T05:09:48.045-07:002015-07-08T05:09:48.045-07:00I'm loathe to contribute to the slide of this ...I'm loathe to contribute to the slide of this thread into yet another discussion of judgment data (it's like a cosmic force), but a couple of clarifications are needed.<br /><br />1. Yes, when people are given a gradient scale, they use it. The causes of the gradience are partly understood, partly not understood.<br /><br />2. Gradience is quite different than inconsistency, of course. For the Scandinavian phenomena that Alex mentions, there's certainly variability across items, contexts, etc., but I'm not sure that we have evidence that the reactions are uncommonly inconsistent. For the English judgments in Sprouse's LI corpus, he has shown that the gradient judgments are rather consistent.<br /><br />3. Although it's true that judgments typically involve acceptability-under-an-interpretation ratings, that's explicitly not what Sprouse et al. did in their large scale studies, for practical reasons. They restricted attention to the ~50% of example sentences where interpretation, prosody, context, etc. were not really at issue. E.g., no judgments involving anaphora.<br /><br />4. Though Sprouse has tested the consistency of the relatively 'interpretation free' judgments in his sample, we know less about the consistency of the phenomena that SA&S excluded. It's likely that there's more variability once interpretation, context, prosody become relevant. But that could be for relatively uninteresting reasons, e.g., linguists are trained at imagining scenarios to support the interpretation/prosody at issue; regular folks are not.<br /><br />5. The dependent measures that we get in acceptability rating tasks are way more consistent than what we get in studies with reading times, eye-movements, ERPs or what-have-you. <br /><br />... but please let's not have a CPD discussion devolve into another scrap about judgment data. Colin Phillipshttps://www.blogger.com/profile/09724709677503698323noreply@blogger.comtag:blogger.com,1999:blog-5275657281509261156.post-21978212809084100472015-07-08T02:02:18.356-07:002015-07-08T02:02:18.356-07:00"There is nothing idealized about it. What do..."There is nothing idealized about it. What does it consist in? Well, the general data linguists use are acceptability-under-an-interpretation judgments.".<br /><br />Acceptability data is quite noisy and gradient. The discussion about the status of islands in Scandinavian here (the comments to http://facultyoflanguage.blogspot.co.uk/2015/06/islands-are-not-parametric.html) is basically a discussion about what is a reasonable level of idealization with regard to acceptability judgmentsAlex Clarkhttps://www.blogger.com/profile/04634767958690153584noreply@blogger.comtag:blogger.com,1999:blog-5275657281509261156.post-46625041606058934242015-07-07T09:14:46.522-07:002015-07-07T09:14:46.522-07:00Part 2:
Other data Colin uses include eye trackin...Part 2:<br /><br />Other data Colin uses include eye tracking measures, TVJT, rep and meg data, self-paced reading measures and some others that I a cannot recall. Why does our work not engage these data as easily as the acceptability data? Because, they are aiming for online measures and so intentionally do not abstract away from the memory, attention, executive control concerns that the acceptability data tries to remove from the picture. Interestingly, some of this data confirms the acceptability data (e.g see his stuff on islands and filled gap effects using self paced reading measures). However, much of his work asks related but DIFFERENT questions than you or I tend to ask: e.g. how is the G used in real time? When is a principle of G invoked in processing? When do kids master principle A, B, C. Note that these all presuppose Gish knowledge and ask more about it. These, of course, require extra assumptions in addition to Gish ones. Our theories do not as they stand address these for they make no obvious commitments to what these extra assumptions are, being compatible with many variants. However, from what I can tell, these too are just data, neither more nor less idealized, just tapping into different cognitive resources and so being more or less direct windows onto G, UG and the performance systems that deploy G and UG.<br /><br />So what would a rational person do? Well, he might once have argued that the data linguists use is not reliable. But we know that this is false. Or she might think that the fit between Gs and UGs and performance systems might require us to change the the Gish and UGish representations we think are right just basing ourselves on acceptability data. This resolves to the question of how transparent G and UGs are to the operations and primitives of the performance systems. This question has been asked, and IMO, to date it looks like the fit between the two is pretty close, though this is clearly something that people will have to keep investigating (think covering grammars here). However, TO DATE, there is little reason from most of what I have seen to think that the conclusions we have drawn on acceptability judgment data are much challenged by data using these other methods.<br /><br />In fact, I would go further: standard practice is to start with the theories we have developed using our data and then combine THEM with assumptions about memory structure (content addressable? RAM? Stack?) and attention and see where this combined theory goes. There are a few cases where we are confident enough in our theories of memory, attention and excutive control to start holding these constant to probe G structure. I personally hope we will see more of this in the future, but IMO, this is still not that compelling. Note, however, this uses data that is neither better nor worse than ours, just different.<br /><br />That's how I see it. It's not a question of data quality or ontological differences in the data (idealized vs objective). I wish in some sense that such distinctions made sense for then we could all concentrate on the right kind. But it doesn't. It's just objective data all the way down! Most of the data is massaged using standard methods, but there is no other real difference except the number of assumptions you are or are not abstracting away from. So, that's how I see things. Norberthttps://www.blogger.com/profile/15701059232144474269noreply@blogger.comtag:blogger.com,1999:blog-5275657281509261156.post-24329185192985011792015-07-07T09:12:32.696-07:002015-07-07T09:12:32.696-07:00Thx for the clarification. The answer is in two p...Thx for the clarification. The answer is in two parts due to space issues. <br /><br />I am having a problem with what you call "idealized data." This is just data from where I sit, no different from other data. There is nothing idealized about it. What does it consist in? Well, the general data linguists use are acceptability-under-an-interpretation judgments. In some cases, when there is no interpretation that could make these sentence acceptable, this defaults to simple acceptability. This is our main probe. <br /><br />Is this idealized? No, we ask real flesh and bone speakers. Is this good data? Yes very reliable as Sprouse, Almeida Schutze among others have shown us. And very easy to use. So, this is the main kind of data we use.<br /><br />Why this? Well, as noted it is easy and reliable and cheap. It is easier than using corpus data, for example, and can be aimed pretty accurately at what you want to look at. Second, it is not an online measure so we can rationally hope it abstracts away from things that we know can distort judgments; things like attention and memory issues. Does this mean that it is unimpeachable? No. Does it mean that it only targets G and UG properties? No. there are still lots of ways of being unacceptable. Sprouse, for example, showed that filled gap effects degrade acceptability.<br /><br />Colin, I know this being the syntax member of many a psycho committee, uses such data all the time, cleaned up with experimental syntax methods given his audience. So, this is some of Colin's data. <br /><br />Continued next entry.Norberthttps://www.blogger.com/profile/15701059232144474269noreply@blogger.comtag:blogger.com,1999:blog-5275657281509261156.post-5806488169209640602015-07-06T09:23:13.352-07:002015-07-06T09:23:13.352-07:00@Norbert: I had intended my last paragraph to add...@Norbert: I had intended my last paragraph to address some of these points. I think that you are using the word data in a different way than was I. My parenthetical `(idealized) data' is I think what you are calling data in your first paragraph. I agree that our theories are all too easy to falsify using idealized data, but I hope you will agree that our theories say next to nothing about the data collected in (say) Colin's latest experiment. I think that idealized data is very important in constructing a simple theory that we can understand (which is a goal that I take seriously). But, from where I sit, the theories we construct are only as good as their utility as part of an account of real data. <br /><br />Don't get me wrong. I don't think that our theories must be vetted against the real data at every step. I'm happy to use idealized data over and over to try to get a handle on something. But when we describe idealized data, we're not describing the world (as determined objectively by experiments etc), but rather our conceptualization of it (ie. a pattern we think we see). In so far as our theories should be about the world, they must accord with it, rather than our projection of structure into it.<br /><br />I don't really expect that you would disagree with this (other than in how coarsely formulated it is). (Do you?) What I'm trying to do is to identify where rational people might look at linguistics and be unhappy. I think that a rational person could be unhappy with the fact that we are very heavily into idealized data, and that we are not yet seriously attempting as a field to make contact with objective data.Greg Kobelehttps://www.blogger.com/profile/08006251459440314496noreply@blogger.comtag:blogger.com,1999:blog-5275657281509261156.post-33334182606678077912015-07-06T08:15:32.617-07:002015-07-06T08:15:32.617-07:00Typical. Norbert and I try to stage an interesting...Typical. Norbert and I try to stage an interesting fight, and then Lidz comes along and goes all conciliatory on us. Party pooper. <br /><br />But seriously. I'm a bit confused too, as I wasn't looking to pick a fight (yawn) and I'm not feeling bent out of shape or trying to make a sociological point. I was simply making the suggestion that the CPD hasn't proven to be a great source of clarity. It's possible that in the context of the early 60s it did just what was needed, in helping to delimit a new research program. But I'm not sure that it has helped a lot since then. As this thread illustrates, it seems to be one of those things that people are confident that they understand, and are surprised to learn that others think of it differently. <br /><br />Some differences that get swept together under the heading of the CPD.<br /><br />1. Mechanism vs. its products. Easy.<br /><br />2. Degrees of abstraction in characterizing a neurocognitive system (roughly equivalent to Marr levels, except that the notion that there are 3 distinct levels masks the many choices that you make when choosing where you'll find the most insight).<br /><br />3. Differences in how the same neurocognitive system performs in different task settings, when information is available or withheld, e.g., in comprehension the meaning is withheld.<br /><br />4. Differences between distinct neurocognitive systems with specific functions, e.g., "grammar" vs. "producer".Colin Phillipshttps://www.blogger.com/profile/09724709677503698323noreply@blogger.comtag:blogger.com,1999:blog-5275657281509261156.post-43187245812713207722015-07-06T08:01:01.720-07:002015-07-06T08:01:01.720-07:00I was hoping you would chime in Greg. If I recall ...I was hoping you would chime in Greg. If I recall correctly you once described yourself as very much an instrumentalist scientifically. That said, I guess I don't see the problems linking theory and data that you like to harp on. I don't see that our data is particularly bad or that we have trouble linking theoretical predictions to data. In fact, we have an all to easy time showing that a particular hypothesis is false. Are there work arounds? Sure, but this is true in *ANY* field (e.g. You may have noticed how hard it has been to pin down the energy range of the Higgs.). It really is hard to eliminate all variants of a hypothesis. So, I really don't buy your skepticism regarding results in linguistics wrt how they fit the data we consider. It's not physics, but then what is? But it's not bad.<br /><br />So, from where I sit, your skepticism is not about the CPD at all, but about whether we have any data that fit our theories. You seem to think that the answer is no and that the right thing is then to model the data, rather than the mechanism. this will eliminate the gap between the two, but in a very uninteresting sort of way. So, here we disagree and I would ascribe your views as very very Eish. This should not bother an instrumentalist, so you might agree. Norberthttps://www.blogger.com/profile/15701059232144474269noreply@blogger.comtag:blogger.com,1999:blog-5275657281509261156.post-90806231874426307862015-07-06T07:48:23.717-07:002015-07-06T07:48:23.717-07:00Yes, a (slightly) longer piece is in the works. Sh...Yes, a (slightly) longer piece is in the works. Shevaun Lewis and I have been working on this sporadically for a while. We'll let you know once it's done. As part of this exercise, we did some digging through the literature on what people had said about CPD over the years. Colin Phillipshttps://www.blogger.com/profile/09724709677503698323noreply@blogger.comtag:blogger.com,1999:blog-5275657281509261156.post-20328492100845322752015-07-06T07:46:03.785-07:002015-07-06T07:46:03.785-07:00I don't think that this is about rationalism v...I don't think that this is about rationalism vs empiricism, at least<br />not as I understand those terms. Clearly, there is a sense in which<br />linguistics /is/ `about data'; if there weren't, then linguistics<br />couldn't possibly be an empirical science. One way to approach<br />complex phenomena (like the behaviour which provides the data of<br />linguistics) is to attempt to describe them as the interaction of<br />multiple causal powers. Generative linguistics is trying to describe<br />one of the powers which conspire to give rise to actual language<br />behaviour. One obvious way to do this would be to provisionally treat<br />the effects of the other factors as a normally distributed random<br />variable, and to see whether different linguistic hypotheses improve<br />the model. That would keep a straightforward connection between our<br />theories and data.<br /><br />That's not what we do.<br /><br />Instead, we have an extremely non-trivial relation between our<br />theories and data. Like Chomksy, and Marr, and Pylyshyn, and many<br />others have recommended, we are conducting our investigation into our<br />postulated causal power at multiple levels. Generative linguistics<br />has typically been working at higher levels, which do not connect<br />directly to data. Following Chomsky, who admonished us to use very<br />clear data (and to let the theory decide the unclear data), we might<br />identify a contrast, and build our model to provide a distinction<br />between the contrasting conditions. We appeal thereby to an often<br />implicit linking hypothesis. (Maybe, for each subjacency violation we<br />expect a linear (?) decrease in acceptability.) And we hardly ever<br />relate our high-level theory of our postulated causal power to the<br />actual language data. On the rare occasions when we do, there is this<br />monstrous edifice working behind the scenes to deliver results that<br />don't need it.<br /><br />I think that *this* is what bothers people, and I think that they are<br />right to be bothered. I don't think this has anything to do with the<br />cognitive revolution leaving some behind, or what not. From my<br />extremely post-cog-rev perspective, the cognitive revolution gave us<br />permission to postulate very abstract entities, not to completely<br />disengage from data.<br /><br />But of course, I think that this is an important way to investigate<br />cognitive phenomena. The theories which are complete overkill for any<br />particular experiment are intended to be significant components of<br />accounts of wildly different experiments on wildly different data.<br />And I think that it's completely reasonable to build models of simple<br />(idealized) data in order to get a handle on the real deal, which is,<br />afaict, what we are in the business of doing.<br />Greg Kobelehttps://www.blogger.com/profile/08006251459440314496noreply@blogger.comtag:blogger.com,1999:blog-5275657281509261156.post-79431811353421167582015-07-06T07:45:43.993-07:002015-07-06T07:45:43.993-07:00@Jeff: Agreed. I don't get it either. However,...@Jeff: Agreed. I don't get it either. However, if what Colin is getting at is the sociology, then I do agree that it the attitude of syntacticians is not not only misplaced but counter-productive. I think I've even been known to say this publicly in syntax venues (E.g. Athens recently). That said, we need to get the logic right and for this the CPD is very useful for it accurately describes the lay of the logical land. There are always subtleties in applying the CPD to actual cases, but this is true of any general distinction/principle. I don't see the CPD more problematic than most. So, I agree with your puzzlement and I second both points you make. Norberthttps://www.blogger.com/profile/15701059232144474269noreply@blogger.comtag:blogger.com,1999:blog-5275657281509261156.post-79512731634308851732015-07-05T14:05:05.390-07:002015-07-05T14:05:05.390-07:00I'm baffled by the exchange between Colin and ...I'm baffled by the exchange between Colin and Norbert. Nobody says they study "competence" any more than they say they study "performance" because on both ends, there are a gazillion topics. People who work on "competence" study stuff like binding, control, bounding, agreement, etc. People who work on "performance" study stuff like planning, prediction, attention, encoding for memory, maintenance in memory, retrieval from memory, etc. There is a sense in which those who live on the C-side of the CPD don't care about what happens on the P-side, and sometimes this is a perilous move (because maybe some fact that you thought was about the grammar was really about memory, or whatever), and sometimes it is completely innocuous because the intuitions about acceptability are so robust that they suffice for theory construction. If you work on the P-side of the CPD, then it is always perilous to make claims without paying attention to (at least some of) the contents of the competence theory because that theory at the very least specifies the data structures (or constraints on what the data structures have to encode) that the performance theory engages. This much, I am quite certain everyone in this discussion agrees with. So what's left? Explaining why some people are grumpy about the CPD. This, I think, has two parts. The first part is, I think, the sociological point that Colin is making, which is that people who are interested in psycholinguistics maybe think that the hard core grammatical theorists look down their noses at them because competence is important and performance is just noise. Certainly back when we were in graduate school and syntax was still king of the hill, there may have been some truth to that (and for all I know maybe there still is), but this is just noise that gets in the way of thinking. Those who feel slighted by the CPD should just get over themselves. The second part is what Norbert is talking about, and which is certainly right. Many people trained in psychology (and linguistics) think that psychology is the study of behavior (indeed, many psychology textbooks even begin with a sentence making that assertion), which is essentially an empiricist idea. Those people are confused about the CPD because they think the outputs of the system are the system itself, that there is no hammer without hammering. Those psychologists who think that psychology is the study of the mental structures that give rise to behavior have no problem with the CPD (at least in my experience). Those people learned the lessons of the cognitive revolution. I'm sure Colin agrees with the second part and that Norbert would assent to the sociological point. So, I don't see what you guys are grumbling about. Jeff Lidzhttps://www.blogger.com/profile/02662307721892528218noreply@blogger.comtag:blogger.com,1999:blog-5275657281509261156.post-57366163488082302102015-07-05T13:05:26.970-07:002015-07-05T13:05:26.970-07:00The thing I found most puzzling in Pullum's ex...The thing I found most puzzling in Pullum's exposition of the CPD was the claim that Chomsky is interested in the intuitions of ideal speaker-hearers. On my understanding, the idealization to an ideal speaker-hearer comes into play when when we consider the problem of language acquisition. That is, we begin by trying to solve the “easy” version of the problem before we try to figure out how kids deal with non-homogenous speech communities, memory limitations, speech errors, etc. Chomsky is not telling us to interpret the intuitions of real speaker-hearers as approximations to the intuitions of imaginary ideal-speaker hearers. Alex Drummondhttps://www.blogger.com/profile/04676457657606185543noreply@blogger.comtag:blogger.com,1999:blog-5275657281509261156.post-51913893709266811702015-07-05T10:44:30.405-07:002015-07-05T10:44:30.405-07:00I think that it is important that people appreciat...I think that it is important that people appreciated the distinction between Gs and their products and between FL/UG and its products. The object of inquiry are these Gs and FL/UG. I think it is important that people understand that intuitions are the data that linguists have found useful in exploring the structure of Gs and FL/UG. I think that it is important to distinguish Gs and UG from how Gs and UG is put to use in using and acquiring a G. I think that these distinctions are important for if they are run together, the chance for confusion rapidly increases. The CPD is useful in that it encapsulates these distinctions and it makes clear that Generative Capacity cannot be identified with our intuitions, our utterances, our parsings etc. So, to me the CPD is just the claim that Gs matter (as does UG) and that it cannot be identified with its uses. From what I can tell you agree. But I sense that you also think that the CPD in some sense denigrates/confuses/ casts aspersions on other kinds of research, research that wants to understand how Gs or UG is actually used. <br /><br />I have never seen this myself. Chomsky's point, and here I agree, is that studying these things will already presuppose some commitments to the structure of Gs and UG for they are PART of any theory of use. Put tersely: you can have G/UG without use of these but not vice versa. And this really is an important point, one that I thought you agreed with. <br /><br />There is a little more, but this is not part of the basic logic of CPD but often assumed; namely that what is specifically linguistic is G and UG, all other aspects of use being cognitively general. Again, I thought that you and yours buy this: there is no special linguistic memory, no special linguistic attention, no special linguistic executive control. Rather these general things PLUS G and UG are what performance amounts to. If this is so, then again, the CPD is useful for it often is used to tacitly make this cut.<br /><br />So why are "non-linguists…dissatisfied"? You say that it is not just Eish attitudes (conceding that it is at least Eish attitudes?). Maybe not. So what is it? In my experience, the non-linguists could care less about G or UG. They even deny that FL exists. So maybe you might like to explain what you see the problem with CPD to be. I would be delighted for you to post on the issue at length should you wish more than a couple of paragraphs. I personally would like to know what the problems are, as truth be told I have never found the distinction tight to grasp (though like all distinctions it can be hard to apply in practice). I would hope that if you decided to do this it would not be just Chomsky exegesis, fun as this is.Norberthttps://www.blogger.com/profile/15701059232144474269noreply@blogger.comtag:blogger.com,1999:blog-5275657281509261156.post-34545150780187522602015-07-04T21:31:15.036-07:002015-07-04T21:31:15.036-07:00You end your post by saying how important it is to...You end your post by saying how important it is to understand the CPD. And you seem to argue that a lot of non-linguists are unhappy with it (or misunderstand it) because they are empiricists. My disagreement is with both of these points. I think that there are many important issues in this general arena, but that the CPD has demonstrably done little to help to clarify them. I think that clarity is most likely to come from avoiding the CPD and talking instead about the important individual issues that it conflates. And I think that there are many reasons aside from empiricism that lead non-linguists to be dissatisfied with the CPD. <br /><br />The contrast between a mechanism and its products is indeed the distinction that is highlighted on p.4 of Aspects. But by p.9 the same terms are being used to refer to a different distinction, one involving either different mechanisms or different degrees of abstraction, or both.Colin Phillipshttps://www.blogger.com/profile/09724709677503698323noreply@blogger.com