Abstract
This paper describes the theory and implementation of Babel, a system that explores the hypothesis that many of the differences among the world's languages can be characterized by the inventory and properties of the lexical items and functional categories of those languages. The structure of Babel assumes that functional categories are initially absent from a child's syntax and are acquired through a statistical induction process of lexical acquisition. Babel then uses information induced from the structure of the lexicon to create a model of syntax via a deductive, rule-based process. This model makes a number of predictions about the time course of language acquisition. These predictions are tested by running Babel as a simulation of child language acquisition, using large samples of adult speech to children as input. The simulation results are shown to correlate highly with longitudinal studies of child language acquisition in English and Polish. Finally, the approach to handling noisy data with Babel is detailed.
Cite this article
Kazman, R. Simulating the child's acquisition of the lexicon and syntax—Experiences with Babel. Mach Learn 16, 87–120 (1994). https://doi.org/10.1007/BF00993175
DOI: https://doi.org/10.1007/BF00993175