Lexicon

Lexicon — the total inventory of words and word-like units stored in a speaker’s mind (mental lexicon) or documented in a dictionary — including their forms, meanings, and relationships.

Definition

The total inventory of words and word-like units stored in a speaker’s mind (mental lexicon) or documented in a dictionary — including their forms, meanings, and relationships.

In Depth

The total inventory of words and word-like units stored in a speaker’s mind (mental lexicon) or documented in a dictionary — including their forms, meanings, and relationships.

In-Depth Explanation

Lexicon refers, in linguistics, to the mental repository of a speaker’s word knowledge — the internal vocabulary system that stores phonological, morphological, syntactic, and semantic information about every word a speaker knows. It is distinct from the physical dictionary (lexicography): the mental lexicon is an active, dynamic cognitive system involved in language production and comprehension, not a static alphabetical list.

What the lexicon stores:

Each lexical entry in the mental lexicon is a complex package of multiple types of information:

  1. Phonological form: How the word sounds; syllable structure, stress pattern, segmental content
  2. Orthographic form: How the word looks in writing (partially separate from phonological form)
  3. Morphological structure: Whether the word is simple or complex; what affixes it takes
  4. Syntactic properties: Part of speech; subcategorization frame (transitivity, argument structure)
  5. Semantic content: Meaning; conceptual representation; semantic features
  6. Pragmatic information: Register, connotation, frequency, typical contexts
  7. Collocational information: Which words this word typically occurs with

The L2 lexicon:

In second language acquisition, the lexicon is a central research domain because:

  • Vocabulary knowledge is the single strongest predictor of reading comprehension across L1 and L2 research (Nation 2001)
  • The L2 learner builds a second lexicon in a fundamentally different environment than L1 acquisition — encountering words less frequently, with less embedment in context, often through explicit instruction rather than incidental acquisition
  • The L1 lexicon influences L2 lexical acquisition through cross-linguistic influences: conceptual transfer, form similarity (cognates), and false friend interference

Lexical representation in the L2:

Research on L2 lexical representation addresses questions like:

  • Are L1 and L2 lexical networks stored separately or integrated?
  • Do L2 words link directly to concepts, or via L1 word equivalents? (the “conceptual mediation” debate)
  • Revised Hierarchical Model (Kroll & Stewart 1994): proposes that beginner L2 learners process L2 through L1 lexical links; advanced learners develop direct L2-to-concept links

Lexical access and production:

When producing language, lexical access proceeds approximately as:

  1. Conceptual preparation: The concept to be expressed is activated
  2. Lemma selection: The syntactically appropriate word is selected from lexical candidates
  3. Phonological encoding: The sound form of the word is retrieved and assembled
  4. Articulation: The motor program executes the sound sequence

In L2, each step shows additional processing time compared to L1, explaining slower L2 speech rates and more frequent tip-of-the-tongue states, particularly at lower proficiency levels.

The Japanese lexicon for L2 learners:

Japanese presents specific lexical challenges:

  • Three writing system layers — each word may have kanji, hiragana, and potentially katakana forms
  • Large layers of loanword vocabulary (gairaigo, 外来語) from English and other languages written in katakana
  • Extensive honorific registers (keigo, 敬語) requiring separate lexical representations for polite vs. plain forms
  • Vocabulary size requirements dramatically larger than many L2s: functional literacy requires 2,000+ kanji with associated compound vocabulary

History

The concept of the mental lexicon was developed in psycholinguistics and cognitive linguistics beginning in the 1970s. Morton’s (1969) logogen model and Levelt’s (1989) speaking model established foundational computational architectures for lexical access. Meara (1996) and Nation (2001) drove L2 lexical research specifically; Kroll & Stewart’s (1994) Revised Hierarchical Model became the dominant framework for L2 lexical representation. More recent work (2000s–2020s) has used neuroimaging, eye-tracking, and computational modelling to probe L2 lexical processes.

Common Misconceptions

  • “The lexicon is just a list of words.” The mental lexicon is a complex interconnected network — words are linked to each other by sound similarity, meaning similarity, collocation, syntactic compatibility, and conceptual association. These connections are active during language processing.
  • “Knowing a word means knowing its translation equivalent.” Full lexical knowledge (Nation 2001) is multi-dimensional: size (knowing a word exists), word form, word parts, meaning (conceptual), associations, constraints on use, text-level distribution, and collocational information. Translation knowledge is only one dimension.

Social Media Sentiment

“Lexicon” as a technical term appears rarely in mainstream language learning content — learners say “vocabulary” rather than lexicon. However, the underlying concept drives enormous discussion: vocabulary building strategies (SRS flashcards, sentence mining, extensive reading) are all approaches to L2 lexicon development. The question of how many words you need to know for various Japanese tasks is effectively a lexicon size question; community-produced frequency lists and kanji lists (JLPT tiers, core 2k/6k/10k decks) are practical lexicon-development tools.

Last updated: 2026-04

Practical Application

  • Depth not just breadth: Build lexical depth — collocational knowledge, pragmatic range, morphological family — not just breadth (number of headwords). A word isn’t truly acquired until it’s usable across different contexts.
  • Multiple encounters in context: Research consistently supports multiple contextually varied encounters per word for deep lexical representation. Seeing 楽しむ in a novel, a manga panel, a headline, and an advertisement builds a richer lexical entry than seeing it 20 times on the same SRS flashcard.
  • Productive vs. receptive lexicon: Both receptive (understanding) and productive (using) lexical knowledge matter and develop differently. Productive vocabulary requires additional activation frequency. Writing and speaking in Japanese builds productive lexical access that reading alone does not.

Related Terms

See Also

Sakubo – Japanese SRS App

Sources

  • Nation, I. S. P. (2001). Learning Vocabulary in Another Language. Cambridge University Press. Comprehensive framework for L2 vocabulary knowledge dimensions, assessment, and pedagogy; foundational for understanding the lexicon in L2 acquisition research.
  • Levelt, W. J. M. (1989). Speaking: From Intention to Articulation. MIT Press. Influential cognitive model of lexical access in speech production, establishing the computational architecture for understanding how words are retrieved for production.
  • Kroll, J. F., & Stewart, E. (1994). Category interference in translation and picture naming. Journal of Memory and Language, 33(2), 149–174. Foundational paper introducing the Revised Hierarchical Model of L2 lexical representation, addressing the conceptual mediation debate in bilingual lexical access.