Definition:
A cloze test is a language assessment technique in which words are deleted from a written passage at regular intervals (typically every 5th, 6th, or 7th word — a method called fixed-ratio deletion) and test-takers are required to supply the missing words. The name comes from closure (Gestalt psychology) — the human tendency to complete incomplete patterns. Cloze tests are used to measure overall reading comprehension, contextual language knowledge, and broad language proficiency. They require test-takers to draw on grammatical, semantic, and discourse-level knowledge simultaneously to restore the missing items.
How Cloze Tests Work
Standard (fixed-ratio) cloze:
Every nth word is deleted from a continuous passage:
> “The man ______ (1) to the market ______ (2) buy some groceries. He ______ (3) looking for fresh vegetables ______ (4) good prices.”
Test-takers write (or choose) the missing word. Scoring may be:
- Exact word scoring: only the original word counts as correct
- Acceptable word scoring: any contextually appropriate word counts (more valid but harder to score reliably)
Rational deletion (modified cloze):
Rather than deleting every nth word, the test designer deletes specific target items — often function words, prepositions, or verb forms. This creates more controlled assessment of specific grammatical features.
C-test (a variant):
The second half of every other word is deleted (rather than whole words). Thought to test psycholinguistic processing automaticity more precisely. Widely used in European language testing.
What Cloze Tests Measure
Cloze tests correlate strongly with:
- Overall reading proficiency
- Grammatical knowledge
- Vocabulary breadth
- Discourse coherence awareness (using text-level context to predict missing words)
They have been validated as reasonable proxies for general language proficiency — their simplicity of construction (take a passage, delete every nth word, score) makes them popular for quick assessment.
Advantages and Disadvantages
Advantages:
- Easy to construct
- Low cost
- Measures multiple language dimensions simultaneously
- Strong empirical correlation with broader proficiency tests
Disadvantages:
- Fixed-ratio deletion may delete grammatically trivial words (articles, prepositions) or critically important content words unpredictably
- Scoring controversy: exact vs. acceptable word scoring dramatically affects difficulty and reliability
- Discourse context dependency: performance depends heavily on the topic and register of the passage — less transferable across domains
Cloze Tests in SLA Research
Beyond assessment, cloze-type tasks are used as research instruments:
- Measuring development of implicit (automatic) grammatical knowledge
- Testing sensitivity to specific structures
- C-tests in particular have been used to measure processing speed and automaticity
Cloze vs. Gap Fill
Cloze test = systematic deletion of any word at fixed intervals
Gap fill / sentence completion = targeted deletion of specific items to test specific forms (past tense, prepositions, collocations)
The latter is more commonly seen in language learning textbooks and practice tests; the former is more theoretically motivated as a proficiency measure.
History
The cloze test was developed by Wilson Taylor (1953) at the University of Illinois, building on the Gestalt psychological concept of “closure” (the tendency to complete incomplete patterns). Taylor proposed that if a native speaker can reliably fill in deleted words from a text, their fill-in rate reflects language proficiency — because their internalized knowledge of the language’s lexical and grammatical patterns allows accurate prediction. The test gained rapid adoption in L2 testing research through the 1960s and 1970s, becoming one of the most studied psychometric instruments in applied linguistics. Oller’s (1979) “unitary competence hypothesis” proposed cloze tests as a global measure of language proficiency, though this strong claim was later challenged. The development of rational deletion and fixed-ratio variants led to ongoing debate about what the test actually measures and which deletion procedure is optimal.
Common Misconceptions
“Higher cloze scores always mean higher proficiency.” Cloze scores are influenced by the specific text used, the deletion interval, and the scoring method (exact word vs. acceptable word). A learner may score differently on two cloze tests at the same “difficulty level.” Cloze performance is best interpreted in comparison to population norms on a standardized instrument rather than as an absolute proficiency measure.
“Rational deletion cloze tests are just vocabulary tests.” Although rational deletion specifically targets linguistic items, rational cloze activates reading comprehension, grammatical knowledge, and contextual reasoning — not just vocabulary knowledge. Completing rational deletions correctly requires integrating multiple linguistic knowledge components simultaneously.
Criticisms
The validity of cloze tests as measures of unitary language proficiency has been extensively debated. Research shows that cloze scores partly reflect text-specific knowledge, background knowledge, and reading strategy, not only language proficiency. The “unitary competence” interpretation (Oller, 1979) has been strongly challenged. Performance on fixed-ratio (every nth word) cloze depends substantially on which word happens to fall at the deletion interval, introducing significant measurement error. Modern language testing has largely moved toward performance-based and conversation-analytic assessment approaches, with cloze tests occupying a more limited role in contemporary L2 assessment batteries.
Social Media Sentiment
Cloze tests appear primarily in language exam preparation communities — official standardized tests that include cloze items (Cambridge examinations, IELTS Reading, JLPT) generate discussion and practice sharing. EFL teachers share cloze exercise creation tips on Pinterest, TpT (Teachers Pay Teachers), and Twitter/X. The cloze procedure is also discussed in the Anki/sentence-mining community as a study format (though distinct from the assessment use). The theoretical history of cloze testing is rarely discussed outside academic applied linguistics.
Last updated: 2026-04
Practical Application
Cloze tests are most useful for L2 learners as diagnostic proficiency snapshots and as reading practice materials. For test preparation (Cambridge FCE, B2 First; IELTS Academic Reading), familiarity with rational deletion cloze formats builds the speed and contextual reasoning strategies needed for these exam sections. For self-study, constructing or using cloze exercises from authentic texts at the learner’s target level builds integrative reading-grammar competence. Sakubo offers a cloze-like review format through contextual sentence frames that test vocabulary recall in authentic linguistic contexts.
Related Terms
See Also
Research
Taylor, W. L. (1953). “Cloze Procedure”: A new tool for measuring readability. Journalism Quarterly, 30(4), 415-433.
The original paper introducing the cloze procedure, explaining the Gestalt rationale for using word restoration to measure text readability and native-speaker text difficulty — the founding methodological paper for all subsequent cloze test research.
Oller, J. W. (1979). Language Tests at School: A Pragmatic Approach. Longman.
The comprehensive treatment of cloze testing as a pragmatic proficiency measure proposing the “unitary competence hypothesis” — the controversial claim that language proficiency is best measured by integrative tests like cloze; the most influential but also most challenged framework in cloze testing research.
Alderson, J. C. (1979). The cloze procedure and proficiency in English as a foreign language. TESOL Quarterly, 13(2), 219-226.
An empirical study examining what cloze tests actually measure, investigating the relationship between cloze performance and other measures of L2 proficiency and contributing to the debate about whether cloze scores reflect a unitary factor or multiple components.