Construct Validity

Definition:

Construct validity is the degree to which a test or measurement instrument accurately measures the theoretical construct it is designed to assess. A “construct” is an abstract, unobservable characteristic — such as “reading comprehension,” “communicative competence,” or “grammatical knowledge” — that cannot be directly observed but is inferred from test performance. A test has construct validity when evidence supports the interpretation that performance on the test reflects the construct defined in the measurement framework. In contemporary language testing theory, following Messick (1989), construct validity is considered the superordinate validity concept comprising all aspects of test validation — including evidence from test content, relationships with other measures, and consequences of test use. Validity and reliability are the two central psychometric properties of any language assessment.

The Unified View of Validity

Messick (1989)’s influential unitary conception holds that all validity is construct validity — evidence for content validity (does test content represent the domain?) and criterion validity (does the test predict real-world outcomes?) are all forms of construct validity evidence.

Types of Validity Evidence

Type of evidence	Question answered
Content evidence	Does test content represent the construct domain?
Criterion evidence	Does test correlate with other measures of the same construct?
Convergent evidence	Do scores correlate with theoretically related constructs?
Discriminant evidence	Do scores NOT correlate with theoretically unrelated constructs?
Consequential evidence	Are the uses and consequences of test scores appropriate?

Construct Underrepresentation and Construct-Irrelevant Variance

Two major validity threats:

Construct underrepresentation: the test fails to sample important aspects of the construct (e.g., a reading test that only uses multiple-choice items may fail to measure inferential comprehension)
Construct-irrelevant variance: performance on the test is affected by factors outside the construct (e.g., a speaking test score reflecting test anxiety rather than speaking ability)

Language Testing Applications

In evaluating a communicative language test, construct validity evidence would include:

Does the test framework match a theoretically grounded model of communicative competence?
Do scores predict performance in real-world language use tasks?
Do higher-proficiency learners score higher? (known-groups analysis)

Bachman & Palmer Framework

Bachman & Palmer (1996) proposed that language test validity requires alignment between test task characteristics and real-world target language use (TLU) situations — the degree of correspondence between what the test requires and authentic language performance.

History

The concept of construct validity was introduced by Cronbach & Meehl (1955) as a third type of validity alongside content and criterion validity. Messick’s (1989) reconceptualization of validity as a unified, construct-centered concept transformed the field and is now the dominant framework in language testing.

Common Misconceptions

“A test is valid if it looks like it tests the right thing” — what a test “looks like” is face validity (see face validity), not construct validity; construct validity requires empirical evidence
“Validity is a property of the test” — validity is a property of the interpretations made from test scores; a test may support valid interpretation in one context and invalid interpretation in another

Criticisms

Messick’s unified validity framework is theoretically comprehensive but difficult to operationalize practically; test developers may struggle to collect the full range of evidence required

Social Media Sentiment

Construct validity is a staple discussion in language testing and applied linguistics graduate coursework; practitioners often discuss the gap between the ideal of full construct validity investigations and real-world testing resources. Last updated: 2026-04

Practical Application

When designing a language test, define the construct explicitly before writing items
Collect multiple types of validity evidence rather than relying only on content coverage
Sakubo — Sakubo‘s vocabulary and reading assessments are grounded in theoretically motivated construct definitions (lexical threshold, comprehension depth), aligning measurement with acquisition constructs

Related Terms

Research

Messick, S. (1989). Validity. In R. L. Linn (Ed.), Educational Measurement (3rd ed., pp. 13–103). American Council on Education. — Landmark reconceptualization of validity as a unified, construct-centered concept.
Cronbach, L. J., & Meehl, P. E. (1955). Construct validity in psychological tests. Psychological Bulletin, 52(4), 281–302. — Introduced the concept of construct validity.
Bachman, L. F., & Palmer, A. S. (1996). Language Testing in Practice. Oxford University Press. — Applied construct validity framework for language test design.