A Parent Field Guide

CELF-5: What parents actually need to know.

Clinical Evaluation of Language Fundamentals, Fifth Edition

The CELF-5 is the test the speech-language pathologist gives when language is on the table. It separates what your child understands (receptive language) from what they can express (expressive language), then digs into vocabulary, grammar, and language memory underneath. This guide walks through the indexes, the one subtest that matters more than the rest, and what to push on when the report comes back.

The Quick Facts

The 30-second version.

Before you dive into the details, here is the short version of what the CELF-5 is and how it gets used.

Ages

5:0 to 21:11

Different subtests at different age bands. The CELF Preschool-3 covers younger kids.

Time

30 to 60 minutes

For the full battery. Just the Core Language subtests can run faster.

Format

Given by an SLP

One on one. Child listens and speaks. No reading or writing required.

Measures

Spoken language

Receptive (understanding) and expressive (producing) language. Vocabulary, grammar, and language memory.

Key Number

Standard scores

Indexes use mean 100, SD 15 (like the WIAT). Subtests use scaled scores (mean 10, SD 3).

The Key Point

The Core Language Score is one number. The split between receptive and expressive is the diagnosis.

The biggest CELF-5 mistake is reading the Core Language Score (CLS) like a verdict and stopping there. The CLS averages everything together. A child can have a real language disorder hiding under an average CLS, because strengths in one area pull up weaknesses in another. The indexes underneath are where the answers live.

The CLS Trap

Why one composite is not enough.

The Core Language Score combines four subtests into one headline number. If receptive language is strong and expressive language is weak (or vice versa), the average can come out in the middle of the typical range, even when a 20-point split between those two domains is sitting right there in the data. An “average” CLS is not the all-clear. It is the starting point.

Receptive vs Expressive

What to focus on.

Compare the Receptive Language Index (what your child understands) to the Expressive Language Index (what they can produce). A 15-point gap between them is meaningful. Stronger receptive than expressive points to a production bottleneck. Stronger expressive than receptive often points to a child who sounds fluent but is missing what was just said. Both are real findings.

An “average” Core Language Score does not rule out a language disorder. Always look at the indexes underneath.

What It Measures

The five indexes, explained.

The CELF-5 produces one headline composite and four index scores underneath it. Here is what each one is actually measuring, and what affects performance beyond pure language ability.

Composite 01

Core Language Score

The headline composite

What it measuresAn overall standard score made from four core subtests that vary by age band. The number most reports lead with.

ScaleStandard score with mean 100, SD 15. Same scale as the WIAT-4 and WISC-V.

Why it can mislead The CLS smooths out everything underneath. A child with a strong-weak split can land in the average range. Always check the indexes that follow before treating the CLS as the answer.

Index 02

Receptive Language Index

What your child understands

What it measuresHow well your child takes in spoken language. Following directions, understanding sentences, comprehending paragraphs read aloud.

TasksLinguistic Concepts, Following Directions, Sentence Comprehension, Understanding Spoken Paragraphs.

Most affected by Auditory processing, working memory, vocabulary, and attention. A low score here does not mean low intelligence. It means language input is breaking down.

Index 03

Expressive Language Index

What your child can say

What it measuresHow well your child produces spoken language. Building sentences, recalling sentences accurately, formulating answers from prompts.

TasksWord Structure, Recalling Sentences, Formulated Sentences, Sentence Assembly.

Most affected by Word retrieval, grammar, and language memory. Often the lower index when expressive language disorder is the issue, even when comprehension looks fine.

Index 04

Language Content Index

Vocabulary and meaning

What it measuresThe semantic side of language. How rich the vocabulary is, how words relate to each other, how meaning is built out of language.

TasksWord Classes, Word Definitions, Semantic Relationships, Understanding Spoken Paragraphs.

Most affected by Vocabulary exposure, reading experience, and semantic memory. A low score here often shows up alongside reading comprehension difficulties.

Index 05

Language Memory Index

Holding language in mind

What it measuresWorking memory specifically for verbal information. The ability to hold sentences and instructions in mind long enough to act on them.

TasksFollowing Directions, Recalling Sentences, Formulated Sentences.

Most affected by Working memory capacity and attention. Closely related to the WISC-V Working Memory Index, but specific to language. A low score here predicts struggles with complex classroom directions and longer texts.

Recalling Sentences is the single most diagnostic subtest. Always look at it.

Recalling Sentences asks the child to repeat sentences back exactly. It is one of the strongest research-backed indicators of Developmental Language Disorder (DLD). A score of 7 or below on Recalling Sentences, even with an “average” Core Language Score, is a finding that deserves real follow-up. Make sure the report breaks this subtest out individually.

How Scores Are Interpreted

The standard score table.

Indexes use standard scores (mean 100, SD 15), the same scale as the WIAT-4 and WISC-V. Subtests use scaled scores (mean 10, SD 3) where 7 to 13 is the typical range. Here is what the index scores actually mean.

Score Range	Percentile	What It Actually Means
130 and above	98th and above	Very strong. Top 2 percent of kids this age.
120 to 129	91st to 97th	Above average. Clearly strong.
110 to 119	75th to 90th	High average. A bit above typical.
90 to 109	25th to 74th	Average. The middle half of kids this age.
80 to 89	9th to 24th	Low average. Below typical, often flagged as a concern.
70 to 79	2nd to 8th	Borderline. Often qualifies for SLP services on its own.
69 and below	1st and below	Extremely low. Strong language disorder finding.

One thing to remember A 15-point gap between Receptive and Expressive is one standard deviation, and almost always meaningful. The pattern across indexes is what tells you what is actually going on, not the single CLS number.

The Four Patterns

What your kid’s profile shape is telling you.

The CELF-5 is most useful when you read the indexes as a shape, not as five separate numbers. These are the patterns that show up most often in the kids parents bring through the door.

Pattern 01

The Expressive Bottleneck

Receptive Language Index is intact or stronger. Expressive Language Index and Language Memory Index are noticeably lower. The child understands what they hear but struggles to put their thoughts into words.

Why it mattersThis is a common Developmental Language Disorder (DLD) profile. The kid is processing input fine but cannot generate matching output. Often missed because comprehension makes them seem like they “get it.” Push for explicit expressive language intervention, not just listening practice.

Pattern 02

The Receptive Bottleneck

Expressive Language Index is intact, often surprisingly so. Receptive Language Index is the weak point. The child sounds fluent and chatty but is missing pieces of what is said to them.

Why it mattersEasy to miss because the child talks plenty. Often shows up as “not listening,” “not following directions,” or behavior issues at school. Real intervention focuses on slowing input down, providing visual support, and explicitly teaching listening comprehension.

Pattern 03

The Vocabulary and Memory Gap

Receptive and Expressive Language Indexes are average. Language Content (vocabulary) and Language Memory are noticeably lower. Reading comprehension and complex listening tasks are usually the real-world casualties.

Why it mattersThis profile often shows up in struggling readers whose decoding is fine. The CELF-5 is finding the language piece of the comprehension problem. Push for vocabulary instruction and the reading-comprehension strategies that match weak language memory, not just more decoding practice.

Pattern 04

The Globally Low Profile

Every index is below average, with no peak pulling any one area up. Comprehensive language weakness across the board.

Why it mattersAlways interpreted alongside cognitive testing. If WISC-V cognitive scores are similarly low, the team may consider Intellectual Disability. If cognitive scores are stronger than language, the gap itself is the diagnostic finding for Developmental Language Disorder.

The Guardrails

What this test is, and isn’t, used for.

The CELF-5 is the standard SLP language battery, but it does specific things well and specific things not at all. Knowing the line helps you push back when needed.

What it is for

Legitimate uses.

Identifying receptive vs expressive language disorders
Documenting eligibility for school-based SLP services
Screening for Developmental Language Disorder (DLD)
Identifying weak language memory that affects reading and classroom function
Pairing with cognitive testing to clarify whether language is the bottleneck

What it is not for

Misuses to push back on.

Diagnosing autism on its own (need ADOS-2 and a full developmental evaluation)
Diagnosing dyslexia or reading disabilities (different test required)
Testing speech sound articulation (the CELF is not an articulation test)
Replacing pragmatic and social communication evaluation
Evaluating bilingual or English-learner children in English alone
Concluding “no language disorder” because the CLS is in the average range

An “average” Core Language Score does not rule out a language disorder.

If the Receptive and Expressive Indexes differ by 15 points or more, or if Recalling Sentences sits at a 7 while everything else is average, there is a real language finding that the CLS is hiding. Always insist on the index breakdown and the individual subtest scaled scores. The composite is a starting point, never the conclusion.

Questions to Ask

Walk in prepared. Walk out with answers.

These questions move the conversation from “here are the language scores” to “here is what the scores mean for my kid.” Ask them.

Before Testing

Set expectations early.

Will you administer the full battery, or only the Core Language subtests?
Will the report break out Recalling Sentences as an individual scaled score?
Are you also using the Pragmatics Profile or Observational Rating Scale to look at social communication?
If my child is bilingual, how will language exposure be factored into the interpretation?
How will the results be paired with cognitive and academic testing to clarify whether language is the driver?

After Results

Make them walk you through it.

What is the gap between Receptive and Expressive, and what does it mean for instruction?
What did Recalling Sentences look like, and what does that tell us about DLD risk?
How does the Language Memory Index compare to the WISC-V Working Memory Index?
Is there a discrepancy between Language Content and Language Memory? What is driving it?
What did the pragmatic measures show, and how do those findings connect to what we see at home and school?

Red Flags in the Report

Stop and ask if you see any of these.

The CELF-5 is straightforward when interpreted well and easy to misread when not. These are the warning signs worth slowing down for.

Flag 01

Only the Core Language Score is reported.

The CLS hides the indexes underneath. If you see a single number with no breakdown of Receptive, Expressive, Content, and Memory, the report is incomplete. Ask for the full picture.

Flag 02

No discussion of Recalling Sentences specifically.

Recalling Sentences is the most diagnostically useful subtest in the entire battery. If it is buried inside the index averages with no individual mention, ask why. A scaled score of 7 or below is worth a real conversation, even when other scores look fine.

Flag 03

A 15-point Receptive-Expressive split that is never addressed.

The split is the diagnosis. If the report shows a one-standard-deviation gap between the two indexes and the narrative does not say a word about it, the interpretation is missing the most important finding on the page.

Flag 04

Bilingual or English-learner child tested in English only.

The CELF-5 is normed on English-speaking children. A bilingual or ELL student tested in English alone often produces scores that look like a language disorder when the real issue is exposure. The report should describe language history and use the bilingual extension where available.

Flag 05

No pragmatics evaluation when social concerns exist.

The CELF-5 includes pragmatics tools (the Pragmatics Profile and the Observational Rating Scale). If the family or teachers raised social communication concerns and those tools were not used, the evaluation skipped a piece. Ask for them.

Flag 06

Recommendations not connected to the actual profile.

If the indexes show a clear receptive bottleneck and the recommendations are generic (“provide language-rich environment”), the evaluation is not doing its job. Specific profiles need specific intervention strategies, and the report should make that link.

Key Takeaway

The Core Language Score is one number. Receptive vs Expressive, and what Recalling Sentences shows, is the diagnosis.

The CELF-5 is the right tool when language is the question. But the headline composite averages everything together, and that average can hide a real disorder. Look at the Receptive Language Index next to the Expressive Language Index. Look at Recalling Sentences as its own number. Look at the Language Memory Index, especially when reading comprehension or following directions are the real-world problem. The pattern across indexes is the answer, not the CLS by itself.

CELF-5 Explainer