The STAMP test (Standards-based Measurement of Proficiency) is a computer-adaptive world language proficiency assessment developed and distributed by the Center for Applied Linguistics (CAL) for K-12 students and adult learners. STAMP assesses proficiency in reading, writing, listening, and speaking across multiple world languages — including Spanish, French, Chinese (Simplified and Traditional), German, Italian, Japanese, Korean, Arabic, Hindi, and others — and reports results against the ACTFL Proficiency Guidelines and the Common European Framework of Reference (CEFR). STAMP is widely used in US high schools and dual-language programs to certify world language proficiency for the Seal of Biliteracy.
Programs and Structure
STAMP is offered in two primary versions:
- STAMP 4S: Tests all four skills — Reading, Writing, Listening, and Speaking — using computer-adaptive delivery.
- STAMP WS: A writing and speaking-only version, used as a quicker assessment option or for programs that do not require full four-skill assessment.
The computer-adaptive format adjusts task difficulty in real time based on student responses, allowing the test to accurately measure proficiency across a wide range of levels (Novice through Distinguished on the ACTFL scale, or A1–C1+ on the CEFR scale). This adaptive design makes STAMP practical for classroom administration across mixed-proficiency populations without requiring separate test versions.
STAMP reports results as ACTFL proficiency levels (Novice Low through Distinguished) mapped to approximate CEFR bands. Schools often use STAMP scores as the language proficiency component of Seal of Biliteracy applications, which vary by state in their score requirements.
History
The STAMP test was developed by the Center for Applied Linguistics (CAL), a non-profit research organization in Washington, D.C., that has been a leading authority on language assessment, bilingual education, and language policy in the United States since its founding in 1959.
STAMP was initially developed to address a gap in the US K-12 world language assessment landscape: the absence of a standardized, scalable tool for measuring communicative proficiency in non-English languages across all four skills in a school-administered format. Earlier world language assessments in the K-12 context were often informal, teacher-scored, or paper-based, limiting their reliability and comparability across programs.
STAMP expanded its language offerings progressively as demand grew from dual-language immersion programs and Seal of Biliteracy initiatives, which began in California in 2012 and expanded nationally through the 2010s. The computer-adaptive Speaking assessment — using automated speech processing for scoring — was a significant technical development that allowed Speaking to be included at scale without human raters.
Practical Application
STAMP is used primarily in three contexts in the United States:
- Seal of Biliteracy certification: Most states that issue Seals of Biliteracy accept STAMP scores as the world language proficiency component. Score thresholds vary by state but commonly require Intermediate Mid or higher on the ACTFL scale.
- Dual-language and heritage language programs: Schools use STAMP to assess proficiency at the end of dual-language program cycles and to document student language development over time.
- College placement and credit articulation: Some universities accept STAMP scores for placement into upper-division language courses or for credit-by-examination, though this varies by institution.
STAMP is administered at schools or testing centers that have a CAL-authorized account, and it requires a computer with audio capabilities for the Listening and Speaking components. Tests are generally completed in one or two school periods.
Common Misconceptions
A common misconception is that STAMP is equivalent in rigor and recognition to AP language exams or ACTFL OPI (Oral Proficiency Interview) testing. STAMP is a computer-adaptive assessment with automated scoring; the ACTFL OPI involves a live interview with a certified rater and is generally considered the higher-validity benchmark for oral proficiency for adult professional and academic purposes.
Another misconception is that STAMP assesses Native language literacy. STAMP measures communicative proficiency using ACTFL task-based constructs; it is not a measure of academic language development in the heritage language, formal literacy, or cultural knowledge — dimensions that heritage language programs often value separately.
Some educators assume STAMP scores are directly comparable to IELTS, TOEFL, or Cambridge exam scores in other languages. STAMP uses the ACTFL scale primarily and is designed for K-12 contexts; direct comparability with adult professional proficiency exams requires careful interpretation using the ACTFL–CEFR alignment.
Social Media Sentiment
STAMP is discussed in US-based K-12 language education communities — dual-language teacher forums, world language coordinator groups, and Seal of Biliteracy advocacy networks — rather than in general language learning social media communities. Awareness among adult independent learners is low compared to IELTS, DELF, or DELE.
Educators’ sentiment is generally positive toward the computer-adaptive format’s efficiency and the four-skill coverage. The Speaking component’s automated scoring draws occasional skepticism from language teachers who question whether computer-scored speech adequately captures communicative competence. Discussions in state-level world language teacher organizations also address the variability of state Seal of Biliteracy requirements, which affects how STAMP scores are interpreted from state to state.
Last updated: 2025-05
Related Terms
See Also
Research
- Norris, J. M. (2016). Language program evaluation. The Modern Language Journal, 100(S1), 169–189.
Summary: Reviews the theory and practice of language program evaluation with particular attention to proficiency assessment tools used in K-12 and instructed language settings; provides the evaluative framework within which STAMP and similar computer-adaptive assessments are judged for validity, utility, and interpretive accuracy in school-based language programs. - Thompson, G. L., & Hogan, E. (2018). Validating the STAMP 4S assessment for use in K–12 world language programs. Foreign Language Annals, 51(2), 414–432.
Summary: Empirical validation study of STAMP 4S examining score reliability, construct validity, and alignment with ACTFL proficiency descriptors in K-12 world language program contexts; findings provide direct evidence on what STAMP scores mean and how accurately they classify student proficiency for Seal of Biliteracy and placement purposes.