Overregularization

Definition:

Overregularization (also spelled over-regularization) is the incorrect application of a productive grammatical rule to an irregular form that is an exception to that rule. A child learning English says “goed” instead of went, or “mouses” instead of mice, because they have learned the regular past tense and plural rules and are applying them where they don’t apply. Overregularization is one of the most documented phenomena in first language acquisition and also occurs in second language acquisition.

Also known as: over-generalization, overgeneralization error, regularization error


In-Depth Explanation

Overregularization is not a mistake in the ordinary sense — it is evidence that the learner has correctly internalized a productive grammatical rule. A child who says “goed” has already made a significant cognitive leap: they have extracted the rule for past tense formation (-ed) from thousands of regular verbs and are attempting to apply it systematically. The error is the predictable result of rule learning meeting irregular exceptions.

The phenomenon is famously U-shaped in L1 acquisition. Children initially say “went” correctly because they have memorized it as a lexical chunk. When they later extract the -ed rule and apply it productively, they start producing “goed” — apparently a regression. Finally, they learn that “go/went” is an exception to the rule and produce “went” correctly again, but now with genuine rule knowledge rather than rote memory. The U-shape (correct → incorrect → correct again) is a strong argument against purely behaviorist accounts of language learning, which cannot explain the regression phase.

In second language acquisition, overregularization happens whenever a learner has acquired a productive rule but has not yet learned its exceptions. Common overregularization zones across foreign languages:

  • English irregular past tense: goed, runned, buyed, speaked
  • English irregular plurals: mouses, childs, sheeps, foots
  • German verb conjugations: Third-person singular irregular verbs regularized (e.g., ich laufe → er lauft instead of läuft)
  • Japanese verb classes: Godan verbs regularized on the ichidan pattern (or vice versa), producing non-standard forms
  • Spanish stem-changing verbs: Applying the regular paradigm where a stem change is required
  • Spanish ser/estar: Applying “soy” across contexts where “estoy” is required — a different type of regularization

The frequency effect plays a key role: the most irregular forms in a language are typically also the highest-frequency forms (“went,” “was,” “had,” “saw”), and frequent exposure to these forms as lexical chunks generally suppresses overregularization. It is low-frequency irregular forms — ones the learner has encountered in rule form before encountering enough exemplars — that are most vulnerable.

Overregularization also reveals what learners are not doing: they are not merely imitating heard forms (which would prevent overregularization of “went,” never heard in native speech). They are extracting generalizations, building a rule system, and applying those rules — sometimes in places they don’t belong. This makes overregularization a diagnostic window into the learner’s implicit grammatical knowledge at a given stage of development.


History

The systematic study of overregularization began with child language researchers in the 1960s–70s. Roger Brown’s longitudinal studies of L1 acquisition (Hart, Adam, and Sarah — published in A First Language, 1973) documented the developmental sequence of morpheme acquisition and implicitly showed the role of regular rules. The U-shaped learning curve for irregular forms was explicitly described by Steven Pinker and colleagues in the 1980s–90s.

The key theoretical debate overregularization triggered is about the architecture of language: Are irregular forms stored in memory and regular forms computed by rule (dual-route model), or are all forms stored and retrieved with varying activation strengths (connectionist model)? Pinker and Ullman’s dual-route model (Words and Rules, 1999) uses overregularization as key evidence — the temporary suppression of stored irregulars by the rule mechanism explains the U-shape. Connectionist models (Rumelhart & McClelland, 1986) showed that neural networks produce overregularization naturally at intermediate stages of training without explicit rules, arguing the dual-route distinction is unnecessary.

In SLA, overregularization was recognized as a systematic type of interlanguage error — distinct from L1 transfer — in Selinker’s foundational work (1972) and detailed in Corder’s error analysis framework. It is now treated as a normal developmental feature, not a failure.


Common Misconceptions

“Overregularization is a mistake that should be corrected immediately.”

Most SLA researchers and L1 acquisition specialists recommend against correcting overregularization errors in natural speech, particularly in children. The error is evidence of productive rule learning, and correcting it does not help the learner bypass the U-shaped learning process. Over-correction can also increase language anxiety. Recasts that model the correct form without explicit correction are generally preferred.

“Overregularization means the learner got worse.”

The U-shaped regression is commonly misread as decline. It is a developmental stage, not a setback. A child who starts saying “goed” was previously producing “went” as a memorized form and is now producing it by (incorrect) rule application — which actually represents more generative knowledge.

“Only children overregularize.”

Adult L2 learners overregularize systematically, particularly in morphologically complex languages. Japanese language learners regularize verb classes; Spanish learners regularize stem changes; German learners regularize strong verbs. The phenomenon simply looks different from child overregularization because adults’ greater metalinguistic awareness allows them to monitor output in some contexts but not others.

“Overregularization only affects inflectional morphology.”

While past tense and plural are the canonical examples, overregularization also occurs in derivational morphology (using a productive suffix where an established form exists: beautiness for beauty), in syntax (regularizing irregular word order patterns), and in pragmatics (applying a general discourse pattern where a frozen exception applies).


Criticisms

The dual-route vs. connectionist debate around overregularization remains unresolved. Connectionist critics argue that Pinker and Ullman’s model overstates the rule-memory distinction and underspecifies how frequency effects modulate regularization rates. Bybee’s usage-based model reframes overregularization as competition between high-frequency memorized forms and a lower-frequency target irregular, mediated by type frequency of the regular class — no explicit rules required. The architecture debate has profound implications for SLA theory: does L2 learning involve the same implicit rule-extraction as L1, or does adult SLA rely more heavily on explicit rule learning that is then proceduralized?


Social Media Sentiment

Overregularization is a crowd-pleaser for language learning communities because the examples are funny and accessible — “goed,” “mouses,” “sheeps.” It regularly comes up in r/linguistics threads and parenting forums when people share amusing things their children said. In SLA communities, it appears in discussions about error correction philosophy: whether to immediately correct learner errors or allow interlanguage to develop naturally. The prescriptivist framing (“this is wrong”) vs. the developmental framing (“this is a sign of progress”) plays out repeatedly in these discussions.

Last updated: 2026-04


Practical Application

When you encounter your own overregularization errors in an L2, treat them as information: you have acquired the rule, but the irregular exception hasn’t formed a sufficiently strong chunk. The fix is targeted exposure to the irregular form in varied contexts — not more rule study. For Japanese learners, irregular verbs (する, くる, and the occasional verb class mismatch) are the most common overregularization zones. The prescription is high-frequency, varied exposure in comprehensible input, not rote drilling of paradigm tables.


Related Terms


See Also

  • Sakubo – Japanese SRS App — particularly useful for drilling irregular Japanese verb and adjective forms through spaced repetition to suppress overregularization

Research

  1. Pinker, S. (1999). Words and Rules: The Ingredients of Language. Basic Books. [Comprehensive argument for the dual-route model; uses overregularization as central evidence for rule-memory distinction]
  2. Brown, R. (1973). A First Language: The Early Stages. Harvard University Press. [Landmark longitudinal study of L1 acquisition; establishes morpheme acquisition sequence and documents U-shaped learning]
  3. Rumelhart, D. E., & McClelland, J. L. (1986). On learning the past tenses of English verbs. In McClelland & Rumelhart (eds.), Parallel Distributed Processing: Explorations in the Microstructure of Cognition (Vol. 2). MIT Press. [Classic connectionist simulation that produces overregularization without explicit rules; triggered the dual-route debate]
  4. Marcus, G. F., Pinker, S., Ullman, M., Hollander, M., Rosen, T. J., & Xu, F. (1992). Overregularization in language acquisition. Monographs of the Society for Research in Child Development, 57(4). [Systematic analysis of overregularization rates and patterns across child learners; provides frequency data on when and why overregularization occurs]
  5. Selinker, L. (1972). Interlanguage. International Review of Applied Linguistics, 10(3), 209–241. [Foundational SLA paper establishing interlanguage as a systematic variable of L2 learners; overregularization treated as a key interlanguage process]