Lexis

Lexis — a collective term for the vocabulary of a language or a person — often used interchangeably with ‘vocabulary’ but sometimes preferred in technical linguistics to include multi-word items.

Definition

A collective term for the vocabulary of a language or a person — often used interchangeably with ‘vocabulary’ but sometimes preferred in technical linguistics to include multi-word items.

In Depth

A collective term for the vocabulary of a language or a person — often used interchangeably with ‘vocabulary’ but sometimes preferred in technical linguistics to include multi-word items.

In-Depth Explanation

Lexis (from Greek lexikon, relating to words) refers to the total vocabulary of a language or of an individual speaker. In technical linguistics, the term is often preferred over vocabulary because it includes not just single words but also multi-word items, collocations, idioms, and formulaic sequences as units of lexical knowledge.

Lexis vs. related terms:

TermScopeUsage context
LexisFull vocabulary system including multi-word unitsTechnical linguistics; lexicology
VocabularySet of words in a language or learner’s knowledgeGeneral; applied linguistics; everyday use
LexiconMental lexicon (internal mental word store) or dictionaryPsycholinguistics; mental grammar models
Lexical itemA single unit of lexical meaning (word or multi-word)Lexicology; vocabulary instruction

Lewis (1993) Lexical Approach: Michael Lewis argued that language is fundamentally grammaticalised lexis, not lexicised grammar — that lexis is primary and grammar secondary, and that the core of language acquisition is building a rich store of multi-word lexical units (chunks, collocations, idioms, sentence frames) rather than first acquiring abstract grammatical rules and then filling in vocabulary. This challenged the structural/grammar-first tradition.

Lexical density: A measure of text complexity — the ratio of content words (lexical items: nouns, main verbs, adjectives, adverbs) to total words. High lexical density characterises academic/written English; lower density characterises spoken/informal language.

Nation’s frequency framework: Nation (2001) proposed that English vocabulary is organised by frequency bands:

  • First 1,000 word families: Cover ~85% of general text
  • First 2,000: Cover ~90%+
  • Academic Word List (Coxhead 2000): Next most important layer for academic reading
  • First 5,000–10,000: Needed for comfortable native-level reading

Japanese lexis structure: Japanese lexis is divided into three traditional strata:

  • Wago (和語): Native Japanese vocabulary (e.g., 山 yama, 海 umi)
  • Kango (漢語): Sino-Japanese vocabulary from Chinese (e.g., 山脈 sanmyaku, 海岸 kaigan)
  • Gairaigo (外来語): Loanwords from other languages (primarily English modern loans; written in katakana)

History

Lexicology as a formal discipline developed through the 19th century’s comparative philology work. Nation’s landmark Learning Vocabulary in Another Language (2001) established frequency-based vocabulary research as the dominant paradigm in applied lexicology. Michael Lewis’s The Lexical Approach (1993) and Implementing the Lexical Approach (1997) proposed a methodological alternative centred on lexis over grammar. Sinclair’s corpus linguistics work (COBUILD project, 1980s–90s) transformed our understanding of how words actually appear in real lexis — revealing collocation patterns invisible to introspection.

Common Misconceptions

  • “Vocabulary = single words with one meaning each.” Lexical knowledge includes multi-word units, collocational behaviour, register appropriateness, grammatical links, connotation, and frequency — not just a word-definition pairing.
  • “You can learn lexis by memorising dictionary definitions.” Rich lexical knowledge (knowing how to use a word, not just its meaning) requires multiple exposures in varied contexts, production practice, and attention to collocational patterns.
  • “Lexis and grammar are separate.” Nation (2009) argues vocabulary and grammar interconnect through the grammar of the lexicon — many lexical items require specific grammatical environments (“it depends on.” not “it depends in.”).
  • “Japanese vocabulary is uniquely difficult.” Japanese lexis has a large-scale Sino-Japanese stratum (kango) cognate with Chinese and partly with Korean. For Japanese learners who know some Chinese, or who study kanji productively, the kango vocabulary layer provides significant positive transfer.

Social Media Sentiment

Lexis as a technical term appears primarily in linguistics academic discourse. In language learning communities, the equivalent discussions concern vocabulary size, frequency-based learning, and the debate between sentence mining (whole-sentence, contextual vocabulary acquisition) and isolated word-list study. The question of “how many words do you need to read/speak Japanese naturally” — i.e., a vocabulary/lexis size question — is a perennial Japanese learning community topic.

Last updated: 2026-04

Practical Application

  • Frequency-based study: Prioritising high-frequency vocabulary first is the single most efficient vocabulary study decision. For Japanese, the JLPT frequency lists, JMDICT core frequency data, and the Core 2,000/Core 6,000 Anki decks implement frequency-based lexis building.
  • Collocational knowledge: Beyond knowing a word’s meaning, learners should study how it collocates. For Japanese, this means learning verbs with their particle partners (品山に 登る vs. 連絡に 取る) as lexical chunks.
  • Wago/Kango balance: Japanese vocabulary study should include both wago and kango vocabulary. Kango kanji-based vocabulary is proportionally higher in formal/academic Japanese; wago is more common in casual speech.
  • Sakubo SRS and lexical depth: Using Sakubo for spaced repetition of Japanese vocabulary builds frequency-based lexical knowledge and, through sentence cards, collocational and contextual lexical knowledge simultaneously.

Related Terms

See Also

Sakubo – Japanese App

Sources

  • Nation, I. S. P. (2001). Learning Vocabulary in Another Language. Cambridge University Press. The standard reference for frequency-based vocabulary learning, word families, and lexical knowledge dimensions.
  • Lewis, M. (1993). The Lexical Approach. Language Teaching Publications. The foundational text for the lexical approach and its argument that grammar emerges from lexis.
  • Sinclair, J. (1991). Corpus, Concordance, Collocation. Oxford University Press. Corpus linguistics foundation demonstrating how real-language lexis behaves — revealing collocation patterns and the idiom principle.