CELF-5: What parents actually need to know.
The CELF-5 is the test the speech-language pathologist gives when language is on the table. It separates what your child understands (receptive language) from what they can express (expressive language), then digs into vocabulary, grammar, and language memory underneath. This guide walks through the indexes, the one subtest that matters more than the rest, and what to push on when the report comes back.
The 30-second version.
Before you dive into the details, here is the short version of what the CELF-5 is and how it gets used.
The Core Language Score is one number. The split between receptive and expressive is the diagnosis.
The biggest CELF-5 mistake is reading the Core Language Score (CLS) like a verdict and stopping there. The CLS averages everything together. A child can have a real language disorder hiding under an average CLS, because strengths in one area pull up weaknesses in another. The indexes underneath are where the answers live.
Why one composite is not enough.
The Core Language Score combines four subtests into one headline number. If receptive language is strong and expressive language is weak (or vice versa), the average can come out in the middle of the typical range, even when a 20-point split between those two domains is sitting right there in the data. An “average” CLS is not the all-clear. It is the starting point.
What to focus on.
Compare the Receptive Language Index (what your child understands) to the Expressive Language Index (what they can produce). A 15-point gap between them is meaningful. Stronger receptive than expressive points to a production bottleneck. Stronger expressive than receptive often points to a child who sounds fluent but is missing what was just said. Both are real findings.
An “average” Core Language Score does not rule out a language disorder. Always look at the indexes underneath.
The five indexes, explained.
The CELF-5 produces one headline composite and four index scores underneath it. Here is what each one is actually measuring, and what affects performance beyond pure language ability.
Recalling Sentences asks the child to repeat sentences back exactly. It is one of the strongest research-backed indicators of Developmental Language Disorder (DLD). A score of 7 or below on Recalling Sentences, even with an “average” Core Language Score, is a finding that deserves real follow-up. Make sure the report breaks this subtest out individually.
The standard score table.
Indexes use standard scores (mean 100, SD 15), the same scale as the WIAT-4 and WISC-V. Subtests use scaled scores (mean 10, SD 3) where 7 to 13 is the typical range. Here is what the index scores actually mean.
| Score Range | Percentile | What It Actually Means |
|---|---|---|
| 130 and above | 98th and above | Very strong. Top 2 percent of kids this age. |
| 120 to 129 | 91st to 97th | Above average. Clearly strong. |
| 110 to 119 | 75th to 90th | High average. A bit above typical. |
| 90 to 109 | 25th to 74th | Average. The middle half of kids this age. |
| 80 to 89 | 9th to 24th | Low average. Below typical, often flagged as a concern. |
| 70 to 79 | 2nd to 8th | Borderline. Often qualifies for SLP services on its own. |
| 69 and below | 1st and below | Extremely low. Strong language disorder finding. |
What your kid’s profile shape is telling you.
The CELF-5 is most useful when you read the indexes as a shape, not as five separate numbers. These are the patterns that show up most often in the kids parents bring through the door.
The Expressive Bottleneck
Receptive Language Index is intact or stronger. Expressive Language Index and Language Memory Index are noticeably lower. The child understands what they hear but struggles to put their thoughts into words.
The Receptive Bottleneck
Expressive Language Index is intact, often surprisingly so. Receptive Language Index is the weak point. The child sounds fluent and chatty but is missing pieces of what is said to them.
The Vocabulary and Memory Gap
Receptive and Expressive Language Indexes are average. Language Content (vocabulary) and Language Memory are noticeably lower. Reading comprehension and complex listening tasks are usually the real-world casualties.
The Globally Low Profile
Every index is below average, with no peak pulling any one area up. Comprehensive language weakness across the board.
What this test is, and isn’t, used for.
The CELF-5 is the standard SLP language battery, but it does specific things well and specific things not at all. Knowing the line helps you push back when needed.
Legitimate uses.
- Identifying receptive vs expressive language disorders
- Documenting eligibility for school-based SLP services
- Screening for Developmental Language Disorder (DLD)
- Identifying weak language memory that affects reading and classroom function
- Pairing with cognitive testing to clarify whether language is the bottleneck
Misuses to push back on.
- Diagnosing autism on its own (need ADOS-2 and a full developmental evaluation)
- Diagnosing dyslexia or reading disabilities (different test required)
- Testing speech sound articulation (the CELF is not an articulation test)
- Replacing pragmatic and social communication evaluation
- Evaluating bilingual or English-learner children in English alone
- Concluding “no language disorder” because the CLS is in the average range
If the Receptive and Expressive Indexes differ by 15 points or more, or if Recalling Sentences sits at a 7 while everything else is average, there is a real language finding that the CLS is hiding. Always insist on the index breakdown and the individual subtest scaled scores. The composite is a starting point, never the conclusion.
Walk in prepared. Walk out with answers.
These questions move the conversation from “here are the language scores” to “here is what the scores mean for my kid.” Ask them.
Set expectations early.
- Will you administer the full battery, or only the Core Language subtests?
- Will the report break out Recalling Sentences as an individual scaled score?
- Are you also using the Pragmatics Profile or Observational Rating Scale to look at social communication?
- If my child is bilingual, how will language exposure be factored into the interpretation?
- How will the results be paired with cognitive and academic testing to clarify whether language is the driver?
Make them walk you through it.
- What is the gap between Receptive and Expressive, and what does it mean for instruction?
- What did Recalling Sentences look like, and what does that tell us about DLD risk?
- How does the Language Memory Index compare to the WISC-V Working Memory Index?
- Is there a discrepancy between Language Content and Language Memory? What is driving it?
- What did the pragmatic measures show, and how do those findings connect to what we see at home and school?
Stop and ask if you see any of these.
The CELF-5 is straightforward when interpreted well and easy to misread when not. These are the warning signs worth slowing down for.
Only the Core Language Score is reported.
The CLS hides the indexes underneath. If you see a single number with no breakdown of Receptive, Expressive, Content, and Memory, the report is incomplete. Ask for the full picture.
No discussion of Recalling Sentences specifically.
Recalling Sentences is the most diagnostically useful subtest in the entire battery. If it is buried inside the index averages with no individual mention, ask why. A scaled score of 7 or below is worth a real conversation, even when other scores look fine.
A 15-point Receptive-Expressive split that is never addressed.
The split is the diagnosis. If the report shows a one-standard-deviation gap between the two indexes and the narrative does not say a word about it, the interpretation is missing the most important finding on the page.
Bilingual or English-learner child tested in English only.
The CELF-5 is normed on English-speaking children. A bilingual or ELL student tested in English alone often produces scores that look like a language disorder when the real issue is exposure. The report should describe language history and use the bilingual extension where available.
No pragmatics evaluation when social concerns exist.
The CELF-5 includes pragmatics tools (the Pragmatics Profile and the Observational Rating Scale). If the family or teachers raised social communication concerns and those tools were not used, the evaluation skipped a piece. Ask for them.
Recommendations not connected to the actual profile.
If the indexes show a clear receptive bottleneck and the recommendations are generic (“provide language-rich environment”), the evaluation is not doing its job. Specific profiles need specific intervention strategies, and the report should make that link.
The Core Language Score is one number. Receptive vs Expressive, and what Recalling Sentences shows, is the diagnosis.
The CELF-5 is the right tool when language is the question. But the headline composite averages everything together, and that average can hide a real disorder. Look at the Receptive Language Index next to the Expressive Language Index. Look at Recalling Sentences as its own number. Look at the Language Memory Index, especially when reading comprehension or following directions are the real-world problem. The pattern across indexes is the answer, not the CLS by itself.