Definition:
Construct validity is the degree to which a test or measurement instrument accurately measures the theoretical construct it is designed to assess. A “construct” is an abstract, unobservable characteristic — such as “reading comprehension,” “communicative competence,” or “grammatical knowledge” — that cannot be directly observed but is inferred from test performance. A test has construct validity when evidence supports the interpretation that performance on the test reflects the construct defined in the measurement framework. In contemporary language testing theory, following Messick (1989), construct validity is considered the superordinate validity concept comprising all aspects of test validation — including evidence from test content, relationships with other measures, and consequences of test use. Validity and reliability are the two central psychometric properties of any language assessment.
The Unified View of Validity
Messick (1989)’s influential unitary conception holds that all validity is construct validity — evidence for content validity (does test content represent the domain?) and criterion validity (does the test predict real-world outcomes?) are all forms of construct validity evidence.
Types of Validity Evidence
| Type of evidence | Question answered |
|---|---|
| Content evidence | Does test content represent the construct domain? |
| Criterion evidence | Does test correlate with other measures of the same construct? |
| Convergent evidence | Do scores correlate with theoretically related constructs? |
| Discriminant evidence | Do scores NOT correlate with theoretically unrelated constructs? |
| Consequential evidence | Are the uses and consequences of test scores appropriate? |
Construct Underrepresentation and Construct-Irrelevant Variance
Two major validity threats:
- Construct underrepresentation: the test fails to sample important aspects of the construct (e.g., a reading test that only uses multiple-choice items may fail to measure inferential comprehension)
- Construct-irrelevant variance: performance on the test is affected by factors outside the construct (e.g., a speaking test score reflecting test anxiety rather than speaking ability)
Language Testing Applications
In evaluating a communicative language test, construct validity evidence would include:
- Does the test framework match a theoretically grounded model of communicative competence?
- Do scores predict performance in real-world language use tasks?
- Do higher-proficiency learners score higher? (known-groups analysis)
Bachman & Palmer Framework
Bachman & Palmer (1996) proposed that language test validity requires alignment between test task characteristics and real-world target language use (TLU) situations — the degree of correspondence between what the test requires and authentic language performance.
History
The concept of construct validity was introduced by Cronbach & Meehl (1955) as a third type of validity alongside content and criterion validity. Messick’s (1989) reconceptualization of validity as a unified, construct-centered concept transformed the field and is now the dominant framework in language testing.
Common Misconceptions
- “A test is valid if it looks like it tests the right thing” — what a test “looks like” is face validity (see face validity), not construct validity; construct validity requires empirical evidence
- “Validity is a property of the test” — validity is a property of the interpretations made from test scores; a test may support valid interpretation in one context and invalid interpretation in another
Criticisms
- Messick’s unified validity framework is theoretically comprehensive but difficult to operationalize practically; test developers may struggle to collect the full range of evidence required
Social Media Sentiment
Construct validity is a staple discussion in language testing and applied linguistics graduate coursework; practitioners often discuss the gap between the ideal of full construct validity investigations and real-world testing resources. Last updated: 2026-04
Practical Application
- When designing a language test, define the construct explicitly before writing items
- Collect multiple types of validity evidence rather than relying only on content coverage
- Sakubo — Sakubo‘s vocabulary and reading assessments are grounded in theoretically motivated construct definitions (lexical threshold, comprehension depth), aligning measurement with acquisition constructs
Related Terms
See Also
Research
- Messick, S. (1989). Validity. In R. L. Linn (Ed.), Educational Measurement (3rd ed., pp. 13–103). American Council on Education. — Landmark reconceptualization of validity as a unified, construct-centered concept.
- Cronbach, L. J., & Meehl, P. E. (1955). Construct validity in psychological tests. Psychological Bulletin, 52(4), 281–302. — Introduced the concept of construct validity.
- Bachman, L. F., & Palmer, A. S. (1996). Language Testing in Practice. Oxford University Press. — Applied construct validity framework for language test design.