Attention in SLA

Definition:

Attention in SLA refers to the allocation of limited cognitive processing resources to elements of the linguistic input and output—a selective, capacity-limited mechanism that determines what is available for further processing, storage, and ultimately acquisition. The role of attention has been centrally debated since Schmidt’s (1990) Noticing Hypothesis proposed that conscious attention at the level of noticing is necessary for acquisition (subliminal acquisition is not possible for adults in a meaningful sense). Tomlin and Villa (1994) proposed a finer-grained model distinguishing detection (unconscious) from awareness, and Robinson (1995, 2003) developed an influential taxonomy of attentional processes. Collectively, these frameworks address the question: what must a learner consciously attend to in order to acquire it, and how can instruction direct limited attentional resources toward target forms?


In-Depth Explanation

Why attention matters in SLA:

Adult L2 learners are exposed to vast quantities of input. However, not everything in the input is processed with equal depth. The input contains more potential information than can be fully processed simultaneously. Attentional selectivity determines what gets encoded into working memory and deeper processing — and what the learner can ultimately learn from a given input encounter.

The SLA attention debate centers specifically on:

  1. Is awareness required for acquisition? (Schmidt: yes — noticing required; Reber: no — adults can learn implicitly)
  2. Are form and meaning processed in competition? (VanPatten: yes — attention trade-off; limited resources require form-meaning prioritization)
  3. Can instruction direct attention to forms learners would otherwise miss? (Schmidt, Robinson, Ellis: yes — this is the rationale for focus on form and input enhancement)

Tomlin and Villa’s (1994) framework:

Tomlin and Villa proposed three levels of attentional function:

  • Alertness: General readiness to receive input; a tonic state of vigilance.
  • Orientation: Directing attention toward a particular class of stimuli (e.g., orienting toward a new grammatical form after it is introduced in instruction).
  • Detection: The cognitive registration of a particular stimulus; detection may or may not be conscious. Tomlin and Villa argued detection (without awareness) may be sufficient for acquisition — challenging Schmidt’s noticing requirement.

Schmidt (2001) responded that Tomlin and Villa’s “detection” is either the same as noticing or is empirically impossible to demonstrate — leading to a still-unresolved debate about the minimal conscious awareness required.

Schmidt’s Noticing Hypothesis:

Schmidt (1990, 1993, 2001) proposed:

  • Subliminal learning is either impossible or trivially insignificant for adult L2 acquisition.
  • Noticing — attending to form at the level of registering its existence as a phonological event — is a necessary condition for acquisition.
  • Understanding (metalinguistic awareness, rule induction) is neither necessary nor sufficient — noticing at the surface level is the minimal requirement.

Evidence: Schmidt’s diary study of his own Portuguese acquisition showed a correlation between noticing a form in input and subsequent acquisition; forms he repeatedly failed to notice despite exposure were not acquired.

Robinson’s (1995, 2003) attentional taxonomy:

Robinson extended Schmidt by distinguishing:

  • Noticing: Attention to surface form without rule understanding.
  • Noticing the gap: Attending to a mismatch between learner output and target-like input.
  • Understanding: Metalinguistic awareness; rule induction from noticed exemplars.

Robinson also proposed the Cognition Hypothesis (for Task-Based Language Teaching): cognitively more complex tasks require and appropriately foster more focused attention to input, producing richer acquisition outcomes.

Limited capacity and attention trade-offs:

VanPatten’s (1990) Input Processing framework demonstrated that learners processing input for meaning may not simultaneously attend to grammatical form — particularly when form is redundant or low in communicative value. The attention trade-off has implications for instruction: simply exposing learners to input does not guarantee form attention; instruction must actively direct limited attentional resources.

Attention in Japanese:

Japanese offers particular examples of how attentional resources are deployed:

  • Morphological processing complexity: Japanese verb+morphology sequences (食べさせられたくない [don’t want to be made to eat]) are highly morphologically complex. Attention to individual morphemes in such sequences requires processing resources that may conflict with message-level comprehension.
  • Sentence-final verb processing: Japanese SOV structure means the main verb (and much grammatical information) comes at the sentence end. L2 Japanese learners must maintain attention across long sentences before accessing core propositional content — a working memory attentional load challenge.
  • Particle detection: Japanese particles (は, が, を, に, で, と) carry essential semantic and pragmatic information. Research suggests that early learners’ attention is preferentially allocated to content words (nouns, verbs), and particles receive less attentional processing — resulting in systematic particle errors despite dense particle frequency in input.
  • Prosodic focus and attention: Japanese pitch accent is a phonological cue that native speakers use to direct attention to focused referents and new information. L2 learners of Japanese may not align their attentional allocation with focus-marking, missing prosodic signals about information structure.

History

  • 1967: Reber—artificial grammar learning; implicit (unattended) learning in adults.
  • 1990: Schmidt—Noticing Hypothesis; attention as minimum necessary condition.
  • 1993: Schmidt—elaboration of awareness levels; noticing, understanding, metalinguistic awareness.
  • 1994: Tomlin & Villa—alertness, orientation, detection framework; challenges Schmidt.
  • 1995: Robinson—noticing, noticing the gap, understanding taxonomy.
  • 2001: Schmidt—Attention volume; most comprehensive statement of the Noticing Hypothesis.
  • 2001: N. C. Ellis—statistical learning and attention; attention as trigger for statistical pattern extraction.
  • 2010s: Working memory and attention research connects SLA to cognitive psychology broadly.

Common Misconceptions

“Any exposure to input produces acquisition.” Input must be attended to for acquisition to occur. Unattended background input (TV on in the background in a language you’re studying) produces minimal acquisition because attentional resources are not allocated.

“Paying attention to form means ignoring meaning.” Schmidt’s noticing requires attention to form as form during comprehension — a brief registration of the surface event. This does not necessarily preclude simultaneous meaning processing, though capacity limits may require serialization or priority-setting.

“Metalinguistic understanding is required for acquisition.” Schmidt’s framework explicitly distinguishes noticing (necessary) from understanding (not necessary). A learner can acquire a pattern by noticing it repeatedly without being able to state a rule governing it.


Criticisms

  • The Noticing Hypothesis has been challenged as not falsifiable in strict empirical terms — “noticing” is defined in ways that make it difficult to test whether acquisition occurred without noticing.
  • The detection vs. noticing debate has not been resolved experimentally.
  • Attention is a theoretical construct borrowed from cognitive psychology; its exact operationalization in naturalistic L2 acquisition contexts remains contested.

Social Media Sentiment

Language learners intuitively recognize attentional principles without using the terminology: “When I’m tired, I can watch Japanese TV for hours and feel like I learned nothing.” This is attentional allocation failure — input without attention produces minimal acquisition. High-intensity immersion learners (AJATT community) recommend strategies like active looking-up, sentence mining, and immediate review — all of which direct attention to newly encountered forms and prevent surface-level exposure without attended processing.

Last updated: 2026-04


Practical Application

  • Active, not passive input: When studying Japanese, actively engage with material rather than playing it as background. Attention is the prerequisite for acquisition — passive exposure at low attention produces minimal gains.
  • Focus attention on target structures: When studying a specific Japanese grammar point, choose listening and reading materials that contain that structure and actively watch for it — you are operationalizing Schmidt’s noticing requirement.
  • Manage cognitive load: Sentence-final verb structures in Japanese require sustained attention across long propositions. Practice with short sentences before attending to long complex clauses — build attentional capacity for Japanese graduallythrough massed practice at appropriate complexity levels.
  • Post-input review: Immediately after an input session, review the forms you noticed and look up those you didn’t understand. This post-input metalinguistic processing deepens the noticing that began during input.

Related Terms


See Also


Research

Schmidt, R. W. (1990). The role of consciousness in second language learning. Applied Linguistics, 11(2), 129–158. [Summary: Noticing Hypothesis; awareness at level of noticing is necessary condition for intake; subliminal acquisition minimal for adults; diary methodology + psycholinguistic review; most foundational paper for attention in SLA.]

Schmidt, R. W. (2001). Attention. In P. Robinson (Ed.), Cognition and Second Language Instruction (pp. 3–32). Cambridge University Press. [Summary: Most comprehensive statement of Noticing Hypothesis; distinguishes noticing, understanding, and metalinguistic awareness; reviews Tomlin & Villa challenge; responds to criticisms; essential reference.]

Tomlin, R. S., & Villa, V. (1994). Attention in cognitive science and second language acquisition. Studies in Second Language Acquisition, 16(2), 183–203. [Summary: Proposes alertness/orientation/detection framework; argues detection without awareness may enable acquisition; challenges Schmidt’s noticing requirement; key theoretical challenge to noticing as necessary condition.]

Robinson, P. (1995). Attention, memory, and the noticing hypothesis. Language Learning, 45(2), 283–331. [Summary: Extends Schmidt’s framework; distinguishes noticing, noticing the gap, understanding; reviews experimental evidence; influential taxonomic paper connecting attention to memory and acquisition in SLA.]

VanPatten, B. (1990). Attending to form and content in the input: An experiment in consciousness. Studies in Second Language Acquisition, 12(3), 287–301. [Summary: Demonstrates attention trade-off between form and meaning in L2 input processing; learners who attend to content miss form; motivates form-focused instruction as attentional redirection; foundational attention trade-off paper.]