Definition:
Collocations are word combinations that native speakers naturally produce together with much greater frequency than chance — pairs and groups of words that belong together not because grammar requires them, but because usage patterns have established them as conventional groupings. “Strong coffee” is a collocation; grammatically, “powerful coffee” is equally correct, but native speakers say “strong coffee” — the adjective “strong” has been established through usage convention as the natural collocate for “coffee” in this context. Non-native speakers who know both “strong” and “coffee” individually will often produce “powerful coffee” or other grammatically acceptable but non-native combinations. Collocational knowledge is what allows you not just to communicate but to sound natural — it is the layer of language knowledge beneath grammar and vocabulary that native-like production requires and that advanced learners notoriously struggle with. It is one of the primary targets of advanced vocabulary learning and one of the clearest indicators of the distinction between high-proficiency L2 speakers and native speakers.
In-Depth Explanation
Collocational knowledge is the layer of vocabulary depth beneath grammar and word meaning: knowing not just what a word means but which words it travels with. Non-native speakers can be grammatically accurate and lexically broad while still producing non-native word combinations — “strong rain,” “do a mistake,” “powerful coffee” — that mark them as non-native to any fluent listener. The primary acquisition route is extensive authentic input: collocational patterns are abstracted from high-frequency exposure to the same combinations in context, not from memorizing word lists. Corpus tools (COCA, Sketch Engine) and collocation dictionaries make explicit study possible; SRS with authentic sentence examples provides contextualized collocational exposure alongside vocabulary learning.
Types of Collocations
Collocations exist at multiple levels of grammatical and semantic strength:
Verb + noun collocations (the most studied):
- make a decision (not take a decision in English, though “take” is used in other languages/translations)
- commit a crime (not do a crime)
- pose a challenge / present a challenge (not give a challenge)
Adjective + noun collocations:
- heavy rain (not strong rain)
- strong evidence (not heavy evidence)
- deep regret (not big regret)
Adverb + adjective collocations:
- deeply committed (not extremely committed, though both work — but “deeply committed” is far more natural in formal registers)
- highly unlikely (not very unlikely — both acceptable but “highly” is the strong collocate)
Verb + adverb collocations:
- strongly recommend (not heavily recommend)
- deeply affected (not strongly affected — though both may occur, frequency profiles differ)
Noun + noun (compound tendency):
- traffic jam (not car jam)
- bank account (not money account)
Why Collocations Matter
Collocations are important for three reasons:
1. Naturalness. Correct grammar and sufficient vocabulary are prerequisites for comprehension; collocational accuracy is what makes production sound native. Two utterances can be grammatically correct and have the same meaning, but “I have a strong doubt about that” is more natural than “I have a powerful doubt about that.” This distinction is collocational.
2. Comprehension speed. Collocations function as processing chunks — native speakers process frequent collocations as single units rather than word-by-word. “Make a decision” is processed as a chunk. Learners who have not acquired collocational knowledge process each word individually, resulting in slower comprehension and less fluent production.
3. Dictionaries don’t capture them. Looking up individual words doesn’t tell you their collocations — a word’s dictionary entry lists meanings and sometimes collocational examples, but the full collocational profile of a word (all its natural partner words, frequency-weighted) is not represented in standard dictionaries.
Collocations and the Lexical Approach
Michael Lewis‘s Lexical Approach (1993) placed collocations at the center of language teaching:
- Language is not “grammar and vocabulary” but “grammar and lexis” — where “lexis” includes not just words but patterned word combinations (collocations, idioms, formulaic sequences)
- Learning language means learning lexical chunks, not just words
- Instruction should focus on noticing and learning collocations from corpus-informed texts
The Lexical Approach remains influential in ELT though debate persists about how explicitly to teach collocations vs. relying on incidental discovery.
Acquiring Collocations
Collocational knowledge is primarily acquired through large volumes of natural input and does not respond well to decontextualized rule study:
- Exposure frequency: collocates become associated through repetition of the combination in reading and listening
- Attention-to-form: noticing that a combination recurs helps consolidate it
- Production feedback: when non-native collocations are corrected, the feedback drives toward native patterns
For deliberate study: collocation dictionaries (Oxford Collocations Dictionary, BBI Dictionary) list frequent collocates by word; SRS sentences that contain natural collocations provide contextual exposure that eventually consolidates correct combinations.
History
- 1957 — Firth. Coins the term “collocation” in linguistics — “You shall know a word by the company it keeps”; his theoretical work establishes word meaning as contextually and co-textually defined.
- 1980s–90s — Corpus linguistics. Large text corpora (Brown Corpus, BNC) with computational tools enable empirical measurement of word co-occurrence patterns; researchers can now identify collocates and measure their frequency.
- 1993 — Michael Lewis, The Lexical Approach. Places collocations at the center of language pedagogy; argues that lexical chunks (including collocations) are the natural units of language acquisition.
- 1995 — Oxford Collocations Dictionary. Corpus-informed learner reference makes collocation lookup practical for learners and teachers.
- 2000s–present. Corpus-informed textbook design and growing research on collocational learning processes continue.
Common Misconceptions
“If the grammar is right and the meaning is clear, the collocation doesn’t matter.”
Communication matters most; collocations matter for naturalness. At lower levels, communication priority justifies tolerating non-native collocations. At higher levels — when the goal is native-like use — non-native collocations are the remaining gap between “communicating well” and “sounding natural.”
“You can learn collocations by memorizing combination lists.”
Collocation lists exist but direct memorization has limited effectiveness without contextualized exposure. Collocations are best acquired through high-volume reading and listening that repeatedly exposes natural combinations, with attention-to-form when the same combination recurs.
Criticisms
- Boundary fuzziness: The distinction between free combinations, collocations, and fixed expressions is fuzzy; different researchers and dictionaries apply different frequency thresholds for “collocation” status.
- L1-specific patterns: Collocations are language-specific, not universal; L1 collocational patterns cause cross-linguistic interference, producing non-native combinations in L2.
Social Media Sentiment
Collocations are frequently discussed in advanced learner communities as the “next layer” after vocabulary and grammar. Common experience: “I know all the words but sometimes something sounds wrong and I can’t figure out why.” The answer is usually collocational.
Collocation dictionaries are recommended in TEFL/TESOL communities and among advanced learners of English. For Japanese learners, collocation databases (for common verb-noun pairs, adjective-noun pairs) are discussed in advanced r/LearnJapanese and Refold Discord.
Last updated: 2026-04
Practical Application
- Mine collocations during active immersion. When you encounter a word you already know used in a new partner combination, mine the sentence for your SRS. “I know 食べる but I’ve never seen it with ~合う before” — that sentence is a valuable collocational card.
- Use a collocation dictionary. When studying a new word in your target language, look it up in a collocation dictionary or corpus tool in addition to a regular dictionary. This expands your knowledge from “what it means” to “who it travels with.”
- Review sentences with natural collocations. Sentence cards from authentic native speech naturally contain native-speaker collocations. Reviewing a sentence with 固い決意 (“strong/firm determination”) teaches the adjective-noun collocation implicitly while targeting the vocabulary item.
- Output and check. Write or speak in the target language and submit your output for review by a native speaker or teacher. Collocational errors are the most common error type for advanced learners and are specifically what native-speaker feedback catches most reliably.
Related Terms
See Also
Research
- Firth, J. R. (1957). A synopsis of linguistic theory, 1930–55. In J. R. Firth (Ed.), Studies in Linguistic Analysis (pp. 1–32). Blackwell.
Summary: Foundational contribution coining the term collocation and establishing the theoretical claim that word meaning is contextually and co-textually defined; set up the entire collocation research program. - Lewis, M. (1993). The Lexical Approach. Language Teaching Publications.
Summary: Pivotal pedagogical text placing collocations at the center of language teaching; argues language is lexis + grammar where lexis means chunks and collocations, not isolated words. - Nesselhauf, N. (2005). Collocations in a Learner Corpus. John Benjamins.
Summary: Corpus study of L2 collocational errors; documents the types and frequency of collocational non-nativeness in advanced learner texts and identifies L1 transfer as the primary source of errors.