Collocations

Definition:

Collocations are word combinations that native speakers naturally produce together with much greater frequency than chance — pairs and groups of words that belong together not because grammar requires them, but because usage patterns have established them as conventional groupings. “Strong coffee” is a collocation; grammatically, “powerful coffee” is equally correct, but native speakers say “strong coffee” — the adjective “strong” has been established through usage convention as the natural collocate for “coffee” in this context. Non-native speakers who know both “strong” and “coffee” individually will often produce “powerful coffee” or other grammatically acceptable but non-native combinations. Collocational knowledge is what allows you not just to communicate but to sound natural — it is the layer of language knowledge beneath grammar and vocabulary that native-like production requires and that advanced learners notoriously struggle with. It is one of the primary targets of advanced vocabulary learning and one of the clearest indicators of the distinction between high-proficiency L2 speakers and native speakers.


Types of Collocations

Collocations exist at multiple levels of grammatical and semantic strength:

Verb + noun collocations (the most studied):

  • make a decision (not take a decision in English, though “take” is used in other languages/translations)
  • commit a crime (not do a crime)
  • pose a challenge / present a challenge (not give a challenge)

Adjective + noun collocations:

  • heavy rain (not strong rain)
  • strong evidence (not heavy evidence)
  • deep regret (not big regret)

Adverb + adjective collocations:

  • deeply committed (not extremely committed, though both work — but “deeply committed” is far more natural in formal registers)
  • highly unlikely (not very unlikely — both acceptable but “highly” is the strong collocate)

Verb + adverb collocations:

  • strongly recommend (not heavily recommend)
  • deeply affected (not strongly affected — though both may occur, frequency profiles differ)

Noun + noun (compound tendency):

  • traffic jam (not car jam)
  • bank account (not money account)

Why Collocations Matter

Collocations are important for three reasons:

1. Naturalness. Correct grammar and sufficient vocabulary are prerequisites for comprehension; collocational accuracy is what makes production sound native. Two utterances can be grammatically correct and have the same meaning, but “I have a strong doubt about that” is more natural than “I have a powerful doubt about that.” This distinction is collocational.

2. Comprehension speed. Collocations function as processing chunks — native speakers process frequent collocations as single units rather than word-by-word. “Make a decision” is processed as a chunk. Learners who have not acquired collocational knowledge process each word individually, resulting in slower comprehension and less fluent production.

3. Dictionaries don’t capture them. Looking up individual words doesn’t tell you their collocations — a word’s dictionary entry lists meanings and sometimes collocational examples, but the full collocational profile of a word (all its natural partner words, frequency-weighted) is not represented in standard dictionaries.

Collocations and the Lexical Approach

Michael Lewis‘s Lexical Approach (1993) placed collocations at the center of language teaching:

  • Language is not “grammar and vocabulary” but “grammar and lexis” — where “lexis” includes not just words but patterned word combinations (collocations, idioms, formulaic sequences)
  • Learning language means learning lexical chunks, not just words
  • Instruction should focus on noticing and learning collocations from corpus-informed texts

The Lexical Approach remains influential in ELT though debate persists about how explicitly to teach collocations vs. relying on incidental discovery.

Acquiring Collocations

Collocational knowledge is primarily acquired through large volumes of natural input and does not respond well to decontextualized rule study:

  • Exposure frequency: collocates become associated through repetition of the combination in reading and listening
  • Attention-to-form: noticing that a combination recurs helps consolidate it
  • Production feedback: when non-native collocations are corrected, the feedback drives toward native patterns

For deliberate study: collocation dictionaries (Oxford Collocations Dictionary, BBI Dictionary) list frequent collocates by word; SRS sentences that contain natural collocations — the format used in Sakubo — provide contextual exposure that eventually consolidates correct combinations.


History

1957 — Firth and collocation. J. R. Firth coined the term “collocation” in linguistics — “You shall know a word by the company it keeps.” His theoretical work on word meaning as contextually and co-textually defined established the theoretical framework.

1980s–90s — Corpus linguistics. The emergence of large text corpora (the Brown Corpus, BNC) with computational analysis tools enabled empirical measurement of word co-occurrence patterns. Researchers could now identify which words genuinely collocate and at what frequency — moving from intuition to quantified evidence.

1993 — Michael Lewis, The Lexical Approach. Lewis’s book made collocations central to language pedagogy, arguing that lexical chunks (including collocations) are the natural units of language acquisition and should organize instruction. Highly influential in ELT teacher training.

1995 — Oxford Collocations Dictionary. The publication of corpora-informed collocation dictionaries made collocation reference practical for learners and teachers.

2000s–present. Increasing interest in corpus-informed teaching — textbooks informed by collocation frequency data from large corpora. Research on collocational learning processes continues.


Common Misconceptions

“If the grammar is right and the meaning is clear, the collocation doesn’t matter.”

Communication matters most; collocations matter for naturalness. At lower levels, communication priority justifies tolerating non-native collocations. At higher levels — when the goal is native-like use — non-native collocations are the remaining gap between “communicating well” and “sounding natural.”

“You can learn collocations by memorizing combination lists.”

Collocation lists exist but direct memorization has limited effectiveness without contextualized exposure. Collocations are best acquired through high-volume reading and listening that repeatedly exposes natural combinations, with attention-to-form when the same combination recurs.


Criticisms

  1. What counts as a collocation? The boundary between free combinations, collocations, and fixed expressions is fuzzy and variable across languages. Different researchers and dictionaries set different frequency thresholds for “collocation” status.
  1. L1-specific patterns. Collocations are language-specific. What is a natural collocation in English is not necessarily a natural translation equivalent in Japanese or Spanish. This causes cross-linguistic interference — learners transfer L1 collocational patterns that produce non-native combinations in L2.

Social Media Sentiment

Collocations are frequently discussed in advanced learner communities as the “next layer” after vocabulary and grammar. Common experience: “I know all the words but sometimes something sounds wrong and I can’t figure out why.” The answer is usually collocational.

Collocation dictionaries are recommended in TEFL/TESOL communities and among advanced learners of English. For Japanese learners, collocation databases (for common verb-noun pairs, adjective-noun pairs) are discussed in advanced r/LearnJapanese and Refold Discord.

Last updated: 2026-04


Practical Application

  1. Mine collocations during active immersion. When you encounter a word you already know used in a new partner combination, mine the sentence for your SRS. “I know 食べる but I’ve never seen it with ~合う before” — that sentence is a valuable collocational card.
  1. Use a collocation dictionary. When studying a new word in your target language, look it up in a collocation dictionary or corpus tool in addition to a regular dictionary. This expands your knowledge from “what it means” to “who it travels with.”
  1. Review sentences with natural collocations in Sakubo. Sentence cards from authentic native speech naturally contain native-speaker collocations. Reviewing a sentence with 固い決意 (“strong/firm determination”) teaches the adjective-noun collocation implicitly while targeting the vocabulary item.
  1. Output and check. Write or speak in the target language and submit your output for review by a native speaker or teacher. Collocational errors are the most common error type for advanced learners and are specifically what native-speaker feedback catches most reliably.

Related Terms


See Also

  • Vocabulary Learning — The broader domain of which collocational knowledge is the “depth” component; full word knowledge includes collocational mastery
  • Advanced Plateau — The high-proficiency stage where collocational gaps are often the primary remaining acquisition challenge
  • Active Immersion — The primary route to collocational acquisition through naturalistic high-volume exposure to native language use
  • Sentence Cards — The SRS format that naturally captures collocations in context; reviewing authentic sentences builds collocational knowledge alongside vocabulary
  • Sakubo

Research

  • Firth, J. R. (1957). A synopsis of linguistic theory, 1930–55. In J. R. Firth (Ed.), Studies in Linguistic Analysis (pp. 1–32). Blackwell. [Summary: Firth’s foundational contribution — coinage of the term collocation and the theoretical claim that meaning is contextually and co-textually defined; set up the entire collocation research program.]
  • Lewis, M. (1993). The Lexical Approach. Language Teaching Publications. [Summary: The pivotal pedagogical text placing collocations at the center of language teaching — argues that language is lexis + grammar where lexis means chunks and collocations, not isolated words; the main applied-linguistics text on collocation in language teaching.]
  • Sinclair, J. (1991). Corpus, Concordance, Collocation. Oxford University Press. [Summary: Key corpus linguistics text — documents how collocations are identified and measured from large corpora; provides the empirical methodology for studying collocational frequency and distribution.]
  • Nation, I. S. P. (2001). Learning Vocabulary in Another Language. Cambridge University Press. [Summary: Includes treatment of collocational knowledge as a dimension of vocabulary depth; provides the research context for why collocations are difficult to teach explicitly and require naturally contextual acquisition.]
  • Nesselhauf, N. (2005). Collocations in a Learner Corpus. John Benjamins. [Summary: Corpus study of L2 collocational errors — documents the types and frequency of collocational non-nativeness in advanced learner texts; provides empirical evidence for the gap between learner and native-speaker collocational patterns.]
  • Boers, F., & Lindstromberg, S. (2009). Optimizing a Lexical Approach to Instructed Language Acquisition. Palgrave Macmillan. [Summary: Research on ways to make collocation instruction effective — examines what kinds of collocational instruction produce measurable learning gains; relevant for designing targeted collocation study.]