Blog Archive

Monday, September 2, 2019

9b. Pullum, G.K. & Scholz, B.C. (2002) Empirical assessment of stimulus poverty arguments

Pullum, G.K. & Scholz, B.C. (2002) Empirical assessment of stimulus poverty arguments. The Linguistic Review 19: 9-50



This article examines a type of argument for linguistic nativism that takes the following form: (i) a fact about some natural language is exhibited that allegedly could not be learned from experience without access to a certain kind of (positive) data; (ii) it is claimed that data of the type in question are not found in normal linguistic experience; hence (iii) it is concluded that people cannot be learning the language from mere exposure to language use. We analyze the components of this sort of argument carefully, and examine four exemplars, none of which hold up. We conclude that linguists have some additional work to do if they wish to sustain their claims about having provided support for linguistic nativism, and we offer some reasons for thinking that the relevant kind of future work on this issue is likely to further undermine the linguistic nativist position. 

42 comments:

  1. This article sets out to show that proponents of the Argument from Poverty of the Stimulus (APS) have yet to produce evidence that backs up their theories. All of the evidence they have produced can be refuted using corpus-analysis techniques that were previously unavailable to, or ignored by, researchers. Notably, Pullum does not aim to disprove this version of nativism with empiricism: rather, he uses empirical evidence to demonstrate that the nativist position isn't as well founded as it purports to be.

    This is the scientific method at its best: while evidence can be used to disprove or bolster theories, it can also point to a gap in our knowledge. Pullum leaves us unsatisfied with the APS, and in doing so, he pushes us and future researchers to investigate the theory further. Crucially, he encourages his peers in generative linguistics to adopt strategies to which they were traditionally antipathetic, like mathematical learning theory and corpus linguistics.

    As Pullum demonstrates, research's value isn't just in proofs and theories; it's in identifying the gaps in our ideas so that we can bridge reasoning and reality.

    ReplyDelete
  2. Don't get too enthusiastic! Pullum does the same thing Pinker did: he ignores the distinction between UG and OG, thereby begging the question.

    All the violations Pullum finds in his big-data text corpus are just violations of OG, not UG. It remains true that no one -- child or adult -- hears or produces UG violations, let alone corrections... except adult Chomskian linguists. But they are doing it to try to reverse-engineer the rules of UG, by trying out potential rules to see if they work. And their corrective feedback for the UG-violating utterances comes from their own innate UGs!

    It turns out Geoffrey Pullum (who is quite clever) was not quite right about the Eskimo snow-term hoax either.

    Pullum, Geoffrey K. The great Eskimo vocabulary hoax and other irreverent essays on the study of language. University of Chicago Press, 1991.

    Lupyan, G., & Zettersten, M. (2020). Does vocabulary help structure the mind?

    ReplyDelete
  3. This paper seems to apply poverty of the stimulus only to the positive information gathered from a child’s environment. The authors supply strong evidence that complex and specific grammatical constructions are heard relatively often by children during language acquisition. So the inference that these principles must be innate because children are never exposed to examples of them is probably not correct. However, the authors’ claim that the lack of positive evidence for acquired grammar is the strongest support for nativism seems to be a matter of opinion. The authors set aside the arguments from incompleteness and positivity by failing to examine the lack of UG-violating examples in speech. As was discussed in categorization, to learn rules and categories it is crucial to know not only what is inside a category but also what is not. Pullum and Scholz show that what is inside the category (in this case correct grammar) is supplied to children, but they don’t address how children know what is outside it. This is not to say that children don’t make errors, but they don’t make ones that violate UG. How would they know to make only OG errors and not UG errors if not for some innate knowledge of UG?

    ReplyDelete
    Replies
    1. What's not innate is what can be and has been learned by the child through trial and error, with feedback from correct and incorrect responses (supervised/reinforcement learning). Trial and error requires both positive and negative data (i.e. both members and nonmembers of the category, otherwise it is the "Laylek" situation). Pullum's survey of the data available to the child does not distinguish OG -- for which there is plenty of data, both positive and negative -- and UG, for which there is only positive data (heard or produced by the child).
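
      A toy sketch (purely illustrative: the "grammars" below are just invented finite string sets, not models of any real language) of why positive data alone cannot decide between a target category and a superset of it:

```python
# Positive-only data cannot separate a target "grammar" from a superset:
# both remain consistent with everything the learner ever hears.

target = {"ab", "ba"}             # the "true" category
superset = target | {"aa", "bb"}  # overgenerates, yet is never contradicted

positive_data = ["ab", "ba", "ab"]  # all the learner is ever exposed to

def consistent(grammar, data):
    """A hypothesis survives if it generates every observed string."""
    return all(s in grammar for s in data)

print(consistent(target, positive_data))    # True
print(consistent(superset, positive_data))  # True

# Only a negative datum (that "aa" is NOT in the language) separates them:
print("aa" in target)    # False: the target correctly excludes it
print("aa" in superset)  # True: the superset wrongly admits it
```

      Without that negative datum, the learner has no basis for preferring the target over the superset: the "Laylek" situation.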

      Delete
  4. In terms of useful takeaways from this paper, I think the biggest one is that children receive a richer and fuller set of inputs than I had previously thought. The authors acknowledge (pages 12-13) that they have collapsed children's language acquisition into a question about the positive stimulus inputs children would be expected to hear. Fine – but after reading some of the other skywritings and replies for this week, I really do not think that a lack of negative examples can be made up for by an abundance of positive examples. It is great to show that children are exposed to unique, and presumed very rare, formulations, but I am really not sure that this pokes as many holes in the nativist theory as the authors suggest.

    In connection to the idea of a purely correlational learner that was brought up in the Pinker paper, I am not sure I think having this large data set as discussed in the Pullum and Scholz paper is a solution to learning language. With no innate filter to parse the input data into these different phrase structures and have a hint as to what features should be remembered because they are important, how would all of these utterances be stored in the child’s brain so that different correlations could be extracted? Our memory is good for learning thousands of vocabulary words, but this seems different to me than storing entire utterances in order to learn grammar. I hope that this is not the equivalent of a granny argument but it is something that I am curious about.

    ReplyDelete
    Replies
    1. A lot of OG can be picked up passively, by unsupervised learning. But this does not work for UG, because of POS, no matter how much the child hears. To do it by supervised learning, they would have to make UG errors and be corrected. They don't, so they aren't. They could be read every sentence in Pullum's book, and go on to hear the entire database, day after day; that still won't provide the negative evidence they need in order to be able to learn to produce and recognize all and only UG-compliant utterances -- unless they can do it already.

      Delete
    2. As a complement to Prof Harnad's reply, I'd like to add that it is (presumably) UG that helps us make sense of the data we hear, therefore bypassing the POS. Pullum & Scholz are not suggesting that having a complete data set of everything a speaker hears throughout their language learning (i.e. what they call a "full input corpus") would help a language-learner acquire, well, language. They are simply stating that having this input corpus might help us (i.e. scientists, researchers) determine, finally, what children do and don't hear in their input.

      Delete
    3. *what children do and don't hear during language acquisition.

      (Of course, this would require a full input corpus from multiple different languages and multiple different OG environments within each of those languages to provide anywhere near sufficient evidence for generalizing what children do or don't hear).

      Delete
    4. UG cannot be learned through unsupervised learning (i.e., passive exposure), no matter how much you are passively exposed to. And this is equally true of the child and the adult (including the adult linguist!).

      Learning UG (if you don't know it already) would require supervised learning: active trial-and-error with corrective feedback; trying to do the right thing.

      The way adult linguists have learned UG has been by pondering UG violations (the starred sentences like *"John is easy to please Mary"), which they have conjured up (I'm not sure how), and trying to guess why they are wrong. (The fact that they are wrong is signalled to linguists by their own brains, their own UG.) But even with feedback that the starred sentences are wrong, linguists can't abstract UG; they can only say that the reason they are wrong is not explained by OG.

      What linguists have to do to abstract the rules of UG is to hypothesize them explicitly, and then apply and test their hypothesized rules, to see whether they do succeed in generating UG-compliant sentences, and no starred sentences.

      In other words, they have to do category learning, with explicit hypotheses about the features (rules) that distinguish the members from the nonmembers of the category "UG-compliant." (This is done by teams of linguists, across time, so the task gets easier as the UG-structure they succeed in extracting (like sculptors) grows.)
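
      This hypothesize-and-test loop can be sketched as toy category learning (the candidate "rules" and sentence lists below are hypothetical illustrations, not real UG rules):

```python
# Sketch of the linguists' loop: propose a candidate rule, test it against
# known UG-compliant sentences (members) and starred ones (nonmembers),
# and revise until it accepts all of the former and none of the latter.

compliant = ["John is easy to please", "It is easy to please John"]
starred = ["John is easy to please Mary"]

def survives(rule):
    """A hypothesis survives if it accepts every member and no nonmember."""
    return all(rule(s) for s in compliant) and not any(rule(s) for s in starred)

def rule1(s):
    # Candidate 1: nothing may follow "please". Too strong.
    words = s.split()
    return words[-1] == "please" if "please" in words else True

def rule2(s):
    # Candidate 2 (made up, for illustration only): "please" may take a
    # following object only when the subject is the expletive "It".
    words = s.split()
    if "please" in words and words.index("please") < len(words) - 1:
        return words[0] == "It"
    return True

print(survives(rule1))  # False: wrongly rejects a grammatical sentence
print(survives(rule2))  # True: survives, pending further counterexamples
```

      Each surviving hypothesis is only provisional; the next starred sentence a linguist conjures up may force another revision.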

      Delete
  5. I’m going to use these quotes from 9a because I feel like they tie in pretty nicely

    “The girl giggled and Don't giggle me… unless their parents slip them some signal in every case that lets them know they are not speaking properly, it is puzzling that they eventually stop”
    “If the world isn't telling children to stop, something in their brains is, and we have to find out who or what is causing the change”

    This sounds familiar. I’ve definitely heard people with English as their second language say “I’m going to learn you” instead of “I’m going to teach you”. The authors say that something seemingly innate eventually stops children from making these mistakes, so are they implying that these are UG mistakes? And if yes, why do my friends who (I think) learned English as a second language continue to make these mistakes? Are we more sensitive to UG during the critical period? My understanding was that nobody, neither children nor adults, would be making UG mistakes (except linguists who are purposefully looking to make those mistakes), so this sort of confused me.
    But back to this reading: I think we all agree that learning OG requires some form of positive and negative feedback. After all, to know what something is, we must also know what it is not, and kids do this by making mistakes and being corrected. However, with UG, kids end up learning the right thing although they have never been exposed to the wrong thing, because UG mistakes aren’t made.

    To make progress on the reverse-engineering problem, this distinction definitely needs to be made, because UG needs to somehow be hard-coded into our T3 robot while OG would probably be learned (I’d imagine through supervised learning). Trying to teach a robot UG through supervised learning, if it works, might give us weak I/O equivalence, because we pretty much know for a fact that we don’t learn UG. The only benefit I’d see in this is that if we can’t figure out how to hard-code UG, then we can teach it to the robot instead. We know that that robot won’t be learning language the way we learn language, but if we manage to teach it language at all, that’s a win. But if we are going to spend time coming up with enough wrong examples of UG to teach the robot, we could just hard-code them in instead and save training time.

    ReplyDelete
    Replies
    1. I wouldn't say that "I'm going to learn you" is a UG mistake. It's an OG mistake in English, but I don't think that there are any UG parameters that make "to learn" ungrammatical here. I'm unsure which dialect of English this is from, but I've definitely come across "I'mma learn you something", which might not be grammatical or acceptable in all English dialects, but does occur without, presumably, violating UG. I also don't think we are "more sensitive to UG during the critical period". We don't know UG (explicitly at least) at any point in our life, but we still respect it.

      L2-speakers (or even L3-, and so forth) of English make OG mistakes like "I'm going to learn you" because, presumably, it's a perfectly 'legal' way (according to their L1's OG) to communicate the intent to teach you something. That also means it does not violate UG, because otherwise that would be 'illegal' in all languages and never spoken by L2+ speakers.

      Delete
    2. Lyla, Pinker was simply failing to distinguish UG from OG in these examples, and that is misleading, and leads to misunderstanding. (That is why he was not a reliable oracle to the psycholinguistic community on what UG and POS were really about.)

      Yes, there is a critical period for 1st-language-learning (L1) and for UG parameter-setting. After the critical period, some kinds of errors will persist in 2nd-language (L2) learners. But not many, because even with wrong parameters, all languages share most of UG.

      (I doubt that L2 errors handicap their users enough to be called “maladaptive”: do you think they do? Do they really reduce survival, reproduction or even career or social success? In our tribal, territorialist past, though, they might have been cues, along with appearance and other features, as to who were from your own tribe and who were potential aliens. That’s probably also why we learned to get so good at overcoming and concealing our speech errors.)

      Yes, “something innate” prevents the child from making UG errors, and what stops them is UG itself, which they already have, rather than learning it as they learn OG.

      If (as it seems, according to the evidence) UG is innate, then it is not relevant (to reverse-engineering human cognition) to teach a robot UG, unless we are trying to model the behavior of adult linguists, for some reason!

      For child L1 learning, according to UG, the child would not be able to learn language at all if it did not already have UG. So it’s not clear why or how to do what you are imagining. It’s not about storing all the errors! It’s about how to design the robot to generate any of them at all.

      No, if UG is right, it is the linguists who have already done the reverse-engineering. The UG algorithm would need to be built into the T3 robot from the beginning, except for parameter-setting.

      Eli, you are right about “learn you”: it’s just an OG error. (This habit of treating OG errors as if they were UG errors — and as if that was what UG was about, instead of studying UG — has been responsible for a lot of the misunderstanding as well as the utter nonsense that has been sputtered by “critics” of UG who have no idea of what they are talking about!)

      As to whether we (whether children or adults) “know” UG: we know it implicitly: our brain knows and is constrained by it; we just don’t know it explicitly — unless we study Chomskian grammar!

      About learn/teach: they're obviously closely related semantically, hence often morphologically too (all OG):
      G: lernen/lehren
      F: apprendre/apprendre à
      R: uchit'/uchit's'a
      H: tanulni/tanitani
      A?: taealam/yaelam

      Delete
    3. @ Eli "learn you" originates from AAVE (African-American Vernacular English) I believe

      Delete
    4. @Professor I'd like to explore a bit on what you stated about T3 and UG: "the UG algorithm would need to be built into the T3 robot from the beginning, except for parameter-setting"

      Correct me if I am wrong, but this proposition that you stated seems to imply that the T3 robot would have UG capacities/algorithms built in it, without a pre-set language. Even if it is the case that we would set the Robot's "innate" language to English, how long would the robot have to be functioning in the real world to determine "indistinguishability from a human being" in terms of its linguistic capacities? Would it have to follow the error-ridden pathway of children figuring out how to use their 1st language, or simply properly acquiring a language with the absence of negative evidence?

      My hunch on this question is that it would be the latter, as that is what has been discussed the most and also, we wouldn't have a T3 robot with a developing "brain" but one that has "adult" brain capacities (including linguistic ones).

      To follow up on that, according to UG principles, this would mean that a T3 robot that has had UG successfully implanted in it would be able to learn any language with the same degree of success/difficulty as any other language. Does this mean that to determine whether or not we have successfully figured out UG, it is not sufficient for a robot to have learned just one language, and we should maybe build a replica and have it learn a different language?

      Delete
    5. Remember that T3 is about capacity. T3 has to have the generic cognitive capacity of a human. This includes the capacity to learn any language as a child -- or if the T3 is an adult and already has a 1st language, then it has to have the generic human capacity to learn a 2nd language. (That capacity varies a lot across people, and the differences don't matter for TT.)

      You can answer all your own questions if you just remember that TT is a test of capacity. The child vs. adult issue is moot because we are too far from anything remotely approaching TT scale to worry about whether not only learning capacity but developmental history is necessary for passing T3. And nothing we've learned about UG is relevant other than that T3 needs to have generic language and language-learning capacity.

      Delete
  6. Pullum takes the following to be the correct version of the POS argument:

    "There could be evidence for a certain grammatical rule in a language that could be found by an adult linguist through intensive search but would never appear frequently enough in conversational data for a child to find it."

    After reading this article, I came to the conclusion that the four arguments in section 4 that supposedly undermine the APS pertain not to UG but to OG (which is confirmed in a comment above by Prof. Harnad). There being obscure grammatical rules in a language that children know or do not know because of lack of exposure is not what the POS argument is about. The POS argument is really about language rules that everyone knows, universals, whatever language they speak. It is that capacity, unique to humans, to learn any language as a child. The POS argument pertains to the fact that we learn and master any language as children much faster than would be expected if we did not have any prior knowledge given to us by innate language learning mechanisms.

    On the other hand, I think that the empirical methodology outlined by Pullum might be useful to define UG more precisely. I don't know a lot about linguistics, but from what we have read so far, I am getting the impression that no one really has clearly defined the rules of UG and its structure. Most of the time, it is mentioned in quite abstract terms. Although defining it as "parameter setting" and potential "constraints on thought" is interesting, I still don't know what the parameters are (or if anyone knows them) or what those constraints might be. Potentially, those constraints prevent us from figuring that out too.

    ReplyDelete
    Replies
    1. "I don't know a lot about linguistics, but from what we have read so far, I am getting the impression that no one really has clearly defined the rules of UG and its structure."

      This is incorrect, and can only be remedied if you actually study UG.

      (Chomsky often pointed out that such statements would never be made about differential topology by someone who had never studied it. But since we all speak language, we imagine we have enough insight to form a judgment about UG without studying it...)

      Delete
    2. Hi Solim, I like your comment!

      I am also having trouble thinking about rules of UG and understanding what they do and what they mean exactly. I am sure that this is due to the fact that I am not studying Chomskian linguistics as professor Harnad answered.

      What we know:
      UG rules do exist; from what I understand, they are just rules that linguists derived from studying UG violations (violations that they came up with/ invented). UG rules are rules for generating UG compliant strings. They would not be learned by children due to POS. So they are innate in some way.

      Now, I think I understand what bothers you. Correct me if I'm wrong, but I may also share this concern with you. What does it mean that the rules are innate? What sort of implementation would that be? Maybe innate rules of UG are just how we think (and thus not "rules", but information-processing or categorization). Rules of grammar are for linguists... They look like abstractions and they are explained in words. In my opinion, those sorts of rules are far from giving insight into cognitive mechanisms underlying them.
      I would like to hear what Professor Harnad has to say about this.

      Delete
    3. There is no reason why grammatical rules could not be innate, but Chomsky has suggested that they may be side-effects of innate constraints on thinking. The problem in the case of UG is to explain how and why UG evolved: what was its adaptive advantage, and how could it evolve gradually?

      Delete
    4. It seems clear to me that UG's adaptive advantage may be the fact that it restricts the "legal" linguistic variations present in language. The benefit is, of course, that every variation of OG still respects some, theoretically, fundamental rules and is therefore not unlearnable to wielders of very different OGs. As to how it could have evolved, it also seems quite clear to me that those who had similar understanding of language could communicate and therefore survive and even thrive in groups. Those who couldn't communicate were potentially eventually separated from the in-group. If we subscribe to "thought drives language", then UG could have evolved gradually when individuals who could communicate well with their group survived, passed on their genes to their children, who were born with a smidgen more predisposition towards a "standard" language, who then used it "better" and passed that down to their children, etc. etc., until it became the innate UG we theorize today. These all are, of course, full to the brim with hypotheticals. But the maybe remains.

      Delete
    5. “What does it mean that the rules are innate? What sort of implementation would that be?”

      Maximilien, I think that I also share your and Solim’s questions regarding what the implementation of UG concretely looks like. Professor Harnad answered that there is no reason why grammatical rules could not be innate, but I still have trouble conceptualizing how exactly this would work.

      I suppose that the element that seems counterintuitive is that there can be constraints on something that must be learned (language) before it is even learned. When I think of other skills that must be learned, such as playing guitar, for example, it seems unimaginable that there would be built-in rules for how I execute this skill.

      I guess that something like walking would be a better parallel, though. Although we must learn to walk, as we learn how to speak a language, it is such a fundamental function that it is inevitable that all children will learn to do it (unlike playing guitar), and there is therefore already an innate structure for what it is we are learning. If we were inventing walking or grammar from scratch, we would see a lot more diversity across cultures in how this is executed.

      Either way, UG is mind-blowing!

      Delete
  7. In "Empirical assessment of stimulus poverty arguments," Pullum understands the Poverty of Stimulus argument (POS) as referring to the fact that children are not exposed to data about the structure of their language which would be necessary to learn that structure; therefore, the structure must already be there -- that is, it must be innate. Pullum argues that empirical evidence sheds doubt on the POS. One example he gives concerns variations in plurals of compound words across linguistic cultures. He notes that in the USA, they would likely say "a drug problem," whereas in the UK, they would say "a drugs problem." Pullum argues that this variation would not be possible if there was an underlying UG which would put a constraint on which compound words could or could not be plurals.

    As has been noted in this thread, the problem with this example -- and Pullum's argument more generally -- is that he addresses variations or incorrect formulations in ordinary grammar (OG) but not UG. UG concerns the internal structure of sentences. In the case of "a drug problem" vs "a drugs problem," the structure of the formulation is the same (it's just that in one case a plural is added to a word and in the other it is not), indicating that this variation is not in fact a variation in UG but in OG. For Pullum to make a better argument, he would have to show that variations exist in the internal structure of sentences which go against the principles of UG -- indeed, he seems to think that the POS concerns a supposed lack of data about the structure of languages -- so it is strange that he uses an example which does not in fact have to do with the structure of the formulation.

    ReplyDelete
    Replies
    1. Any single violation of UG (or OG) can always be adopted as an idiom. That just leaves an infinite number of other violations that would also each need to be adopted as an idiom, one by one...

      Delete
  8. I agree with others when they say that the definition of UG has been fuzzy. I think it is interesting that Pullum’s empirical approach relies heavily on determining if examples can be explained by data-driven learning, which I think is how he ends up discussing OG rather than UG. In fact, can someone explain in kid-sib fashion why the authors believe, “until data-driven learning is investigated in more detail, linguists will remain ill-equipped to do more than fantasize and speculate on the matter” (p. 47)? Is it because if you can prove something can be learned through data-driven learning then it cannot be a part of UG? How then can we determine what UG is if we do not learn it? Also, why can we not simply find common syntactical characteristics of all languages and call those part of UG? Surely linguists have a pretty good understanding of what is shared by all languages at this point.

    ReplyDelete
    Replies
    1. On the fuzziness of UG, see the reply to Solim, above.

      It has already been shown that UG cannot be learned on the basis of the data heard and produced by the child.

      What are the "common syntactical characteristics" that forbid "John is easy to please Mary" (and infinities of other starred pseudo-sentences)?

      UG cannot be abstracted by unsupervised learning.

      Delete
  9. In this article, Pullum questions whether nativists have really done their due diligence to empirically show cases of the APS (or POS, if you prefer). He proposes a structure for providing empirical support for the APS:

    -Acquirendum (essentially what is the rule or principle we are trying to say is innate)
    -Lacuna (a set of sentences that can be thought of as the data one would need to acquire the acquirendum via learning)
    -Indispensability (argument that one could not acquire the acquirendum without the lacuna sentences)
    -Inaccessibility (show that the lacuna sentences are not adequately accessible)
    -Acquisition (show that children do acquire the acquirendum during childhood)

    The inaccessibility evidence proves to be exceedingly difficult to show since the thresholds for inaccessibility are not strictly defined. How often would a child need to encounter a lacuna sentence for them to learn the grammatical principle? How rare do the instances have to be in order to claim that they're so inaccessible that a child could not possibly have acquired the acquirendum with that level of exposure? I suppose it ends up being a moot point, since, as many have pointed out, Pullum does not make the important distinction between UG and OG. And for UG, there is no acceptable level of negative evidence, since the entire APS/POS argument is that we have UG in the complete absence of any negative evidence. We've actually seen quite a few papers in which the author does not make a distinction between OG and UG. Is this because we have only recently separated the two into distinct categories? I also wonder what approach we can take to reverse-engineer UG and to define its principles. It clearly needs to be separate from OG, but this seems to be a very difficult categorization problem to tackle.

    ReplyDelete
    Replies
    1. See the description of how linguists have been abstracting the rules of UG by hypothesizing and testing rules.

      For the rest, yes, it's largely due to (1) failing to distinguish UG and OG and (2) fantasizing instead of mastering UG technically.

      Delete
  10. Seeing the repeated critique of Pullum and Scholz disregarding the distinction between UG and OG, it does make me wonder whether this paper would even have been possible in its analytical form had this distinction been acknowledged. With such a strong emphasis on empirical data, would the paper not have been done in (looking past the fact that Pullum's examples end up only including OG errors) by attempting to distinguish between two things that we cannot differentiate in any empirically satisfying way?

    ReplyDelete
    Replies
    1. Hi Deidre,

      I personally think that Pullum could have used empirical data to discuss the APS (though I don’t think he would have been able to argue against it; if anything, he would have ended up supporting it), but it would have to be done in a very specific way. One thing that bothers me about this paper is that it seems very ‘anglocentric’: all of his examples centre on specific aspects of English’s grammar that are obviously instantiated quite differently in other languages (quite possibly because of discrete parametric changes); e.g., some languages that are geographically close and linguistically related to English don’t invert their auxiliaries to form questions. Of course, the medium of the text is English, so I’m not saying that English can’t supply the main examples, but in order to test the APS, Pullum should have studied grammatical properties that are universal and provided examples both in English and in one or more languages that differ from it.

      Delete
  11. In Pullum and Scholz' article, the authors describe four different cases that they thought provided support for the APS:
    1. Plurals in noun-noun compounding: Children will use an irregular plural as the first element of a noun-noun compound like teeth-eater. (page 24)
    2. Auxiliary sequences: Children know the MHBV sequence. E.g., You should have been attending to the lesson. (page 28)
    3. Anaphoric one: Children know the correct use of the word "one" when it refers to another word. E.g., This box is bigger than the other one. (page 32)
    4. Auxiliary-initial clause: “Children know the auxiliary-initial positioning in polar interrogatives in languages like English and Spanish.”
    E.g., The dog in the corner is hungry. → Is the dog in the corner hungry? (Children move the auxiliary "is" to the front.)
    E.g., The dog that is in the corner is hungry. → Is the dog that is in the corner hungry? (Children front the main-clause "is", leaving the "is" inside the relative clause in place.) (pages 36-37)

    Based on our current understanding, we know that these examples apply to OG, not to UG.

    As well, we know with confidence that:
    - Humans are genetically more motivated to learn language than other species.
    - OG is a set of learned grammatical rules for a specific language, and UG is a set of innate grammatical rules for all languages.
    - There is no negative evidence for UG. We are only exposed to UG-compliant examples.
    - There is negative evidence for OG. Children and adults who make OG mistakes can learn to correct them through feedback from supervised learning.

    To be honest, I have been hesitant to write my skywritings for this week because I still feel a bit lost. I know that word order and recursion are under UG. Are these the only examples of UG we have right now? Can anyone provide some other examples?

    ReplyDelete
    Replies
    1. I believe UG also dictates which kinds of grammatical markers can occur and where in the sentence they can occur, though don't quote me on that.

      (Also, we can't claim that humans are more motivated to learn language than other species - we can't know that this is true, or that motivation was the catalyst for us developing language. As discussed in readings and lectures, propositions seem to be at the root of the development of language, but we don't know much else. Food for thought: might propositions and UG be in some way linked? If we subscribe to "thought drives language", could it be that the way that we are built drives our ability to make propositions, which in turn shaped what is and is not legal according to UG?)

      Delete
    2. @Eli: your food for thought brings to mind the weak Sapir-Whorf hypothesis, which says that language can influence how we view the world. In another comment below, Matt also brings up "unthinkable thoughts" and the idea that maybe UG sets some sort of limit on what can be considered "thinkable". As everyone has already mentioned, it might be this false sense of confidence about UG that makes people like Pullum forget that there is a distinction between UG and OG.

      And @Ting, I would say that UG isn't necessarily a set of innate rules so much as a set of possible parameters that can be set to learn any language, because every language exists somewhere within the bounds of those parameters.

      Delete
  12. Commenting on the youtube video:

    My understanding of UG was that there are some rules underlying all languages which we know innately: we do not need positive stimuli to learn them, since we know them even in the absence of stimuli. However, I am confused about the example you used of Gaddafi arbitrarily making a statement (which we know from UG to be incorrect in English) legal, following which it would become part of UG within a few generations. Does this mean that UG changes with time? If so, if a statement that used to be illegal in one language becomes legal, does that mean that this kind of statement becomes legal in all languages?

    ReplyDelete
    Replies
    1. The issue is not the absence of positive evidence but the absence of negative evidence.

      OG is always changing, so yesterday's error may become today's correct usage.

      UG does not change, but there is an infinity of potential positive (UG-compliant) and negative (UG-violating, starred) sentences. Any individual UG-violating sentence could be adopted as an idiom: we could all agree that *"John is easy to please Mary" just means "It is easy for John to please Mary" -- but that leaves completely unaltered the rest of the infinity of examples of the underlying UG rule (whatever it is: neither of us knows what it is; only those who have studied the technical details of UG know).

      Delete
    2. I have in my notes from this week's lecture that some things are unthinkable because they violate UG. If that is the case, how can we have negative examples of UG if we cannot think them? Maybe I am taking this note too literally. If UG is innate or inborn, then why would negative examples even exist? You write that an individual UG-violating sentence could be adopted as an idiom; would this mean that any UG violation could be interpreted as having meaning? Could this not then turn negative UG examples into positive ones?

      Delete
    3. In response to Matt: UG is supposed to be a fair reflection of what you can think, a constraint on thought. So there are no negative examples because we simply cannot think those sentences/situations. The reason why UG isn't learnable, whereas OG is, is that there is no feedback from negative examples. As Harnad said earlier, these examples of UG-violating sentences were produced by linguists; no human or child naturally thinks "John is easy to please Mary." As for the idioms, they are isolated UG-breaking examples, and I think you can say that they are UG violations that can be interpreted as having meaning.

      Delete
  13. We know from our study of categorization that you cannot create categories without negative evidence. This is essential to the argument from the poverty of the stimulus (APS), which holds that children must have some innate linguistic endowment because, despite a lack of negative evidence, they obey Universal Grammar (UG). It is important to distinguish UG from Ordinary Grammar (OG), because children make OG violations all the time when they are learning a language. These errors are sometimes corrected, and if not, children can eventually pick up OG through unsupervised learning. In contrast, from a very early age, children obey UG without hearing any negative evidence. Where many of the arguments Pullum analyzes in this article fail, and where Pullum himself fails, is in missing this important distinction. The fact that children make OG mistakes is not an argument against UG. Furthermore, Pullum appears to misunderstand the APS: “All we will say here is that for those readers who are disappointed that we will not treat the negative data issue further, and feel that the APS we do discuss is not the one that really deserves the name, we can only suggest that they reinterpret our abbreviation ‘the APS’ to stand for ‘the Argument selected by Pullum and Scholz.’” The authors eschew negative evidence and instead focus on the amount of positive evidence, but that is not the cornerstone of the argument. Universal grammar is a mystery because there is an absence of negative evidence, not of positive evidence. UG must be innate because, without negative evidence, there would be no way to create the categories.

    ReplyDelete
    Replies
    1. I'm not sure whether children could pick up OG from unsupervised learning. As you said, OG learning requires both positive and negative instances, and I'm not sure that by merely hearing negative instances the child (or an adult learning a second language) could tell that they are wrong. But you're right that Pullum (like Pinker in the previous paper) fails to distinguish between OG and UG. While they focus too much on the positive evidence in the environment, it is the lack of negative evidence in the case of UG that is curious, since the child can still learn "correct" sentences and thus must have an innate mechanism for categorization. I wonder if this is facilitated by feeling, and whether lazy evolution has allowed us to "feel" something when we produce a "right" sentence using UG.

      Delete
    2. I think unsupervised OG learning is a very interesting point because it should technically be possible: there are natural language processing models that use unsupervised learning and can parse and respond to speech. But I don't think it could feasibly be the case for us, because it would require far too much exposure to learn OG in a completely unsupervised way, if that is even possible. (I also don't think we could even attempt to learn OG in an entirely unsupervised way, since we are surrounded by people who, even if they don't outright correct us, will react to an OG violation in some way.) Even if, given enough instances, we could eventually recognize negative instances simply by hearing them, because we had developed a certain understanding of what the language should sound like, I don't think this can apply to people, given the short time-frame and limited exposure.

      I think the point about it feeling like something to produce a right sentence using UG is very interesting, especially in relation to Matt's point about the impossibility of thinking a sentence that violates UG.

      It's a very interesting point and might explain why we never encounter UG violations, especially if the feeling of wrongness is innate: we don't say them as children because they feel wrong, and soon we stop thinking them at all because of that feeling of wrongness. Although it might be part of the story, I don't think it could explain all of UG and why we never think UG violations, because, for me at least, OG violations feel wrong and uncomfortable too, and yet I still think them with some frequency (though I don't know whether that is a side effect of English being vastly different, grammatically, from my native tongue).

      Delete
  14. Hey Wendy — I liked your idea that this phenomenon (the observation that children don’t make errors that violate UG) might be facilitated by feeling. The topic of feeling has come up time and time again throughout this course. I wonder whether the fact that children don’t make errors that violate UG somehow fits into the hard problem of how and why we feel a certain way when we do things. Perhaps the “why” of the hard problem is that feeling makes survival and evolutionary processes simpler and more efficient. In terms of language, having this seemingly innate “feeling” that certain sentences are wrong would streamline a child’s process of learning a language, which is a critical element of our species’ evolution and survival.

    ReplyDelete
    Replies
    1. In addition to using Wendy's idea about feeling to explain why kids don't make UG mistakes, I think feeling can also potentially explain why linguists can come up with UG-non-compliant examples, based on their feeling for UG and their explicit knowledge of many linguistic categories and examples of UG compliance, yet cannot generalize those examples into rules. Feeling that something is UG-non-compliant seems in this way a little like Searle's feeling of understanding English and not understanding Chinese. The feeling that something is UG-compliant or not is just a feeling; it does not correspond to or arise from a set of rules that can be broadly taught (otherwise it would be easier to teach).

      Delete
  15. UG is unlearnable because we hear only UG-positive examples, never negative ones. Let’s imagine we were able to find some negative examples. What would a negative example look like? Would it stand out as a negative example, or would we be unable to immediately tell it apart from a positive one? If we found proper negative examples, could we show them to a T2/T3 robot, and would it be able to learn UG? I am not sure how we would ever program UG into a robot (and I therefore don’t have a clue how it is programmed into our own brains). But that is just the easy problem.

    ReplyDelete
