The Definitive Guide to Flesch Reading Ease: Formula, Benchmarks, and Authoritative Use

Rudolf Flesch spent years watching people struggle with government documents, insurance policies, and newspaper prose before he published The Art of Plain Talk in 1946. His frustration was technical as much as philosophical: he wanted a number. Not a vague editorial judgment, but a reproducible score that could tell a writer, with precision, whether their prose was accessible or impenetrable. The result was the Flesch Reading Ease formula — a metric that, nearly eighty years later, still sits inside Microsoft Word, Yoast SEO, and every serious content auditing platform on the market.

Understanding this formula is not optional if you write for the web professionally. It is table stakes.

The Formula Itself

The Flesch Reading Ease score is calculated as follows:

Score = 206.835 − (1.015 × ASL) − (84.6 × ASW)

Where ASL is Average Sentence Length (total words divided by total sentences) and ASW is Average number of Syllables per Word (total syllables divided by total words).

Two things deserve immediate attention here. First, the constants — 206.835, 1.015, and 84.6 — were not invented arbitrarily. Flesch derived them through regression analysis against a set of texts that human readers had already rated for difficulty. The formula is an empirical fit to observed reading behavior, not a theoretical construction. That empirical grounding is exactly why it has survived while dozens of competing readability formulas have been forgotten.

Second, notice the asymmetry in the coefficients. The syllable term (84.6 × ASW) is roughly 83 times heavier than the sentence-length term (1.015 × ASL). This is intentional and important: word-level complexity drives reading difficulty far more than sentence length does. A sentence with twenty short words is dramatically easier to process than a twelve-word sentence crammed with polysyllabic technical jargon. Writers who focus exclusively on breaking up long sentences while ignoring vocabulary complexity are optimizing the wrong variable.

Reading the Output: What the Score Actually Means

The scale runs from 0 to 100, with higher scores indicating easier reading. Flesch's original benchmarks, refined by subsequent researchers, break down like this:

90–100: Very Easy. Average sentence around 11 words, one-syllable words dominant. Think children's picture books and basic consumer product instructions.
80–90: Easy. Conversational English, tabloid journalism. Most web copy targeting broad consumer audiences should live here.
70–80: Fairly Easy. Standard newspaper reporting. Associated Press style is calibrated for this range.
60–70: Standard. Plain English, general-interest magazines, most competent web articles. This is the range Yoast SEO marks green for general content.
50–60: Fairly Difficult. Professional publications, white papers, long-form journalism. Harvard Business Review typically scores in this band.
30–50: Difficult. Academic journals, technical documentation, legal drafting. Readers need domain knowledge to follow along comfortably.
0–30: Very Difficult. Professional legal contracts, dense scientific literature. The U.S. Internal Revenue Code famously scores near the bottom of this range.

For comparison: Ernest Hemingway's prose from The Old Man and the Sea scores around 84. Academic philosophy papers often dip below 20. Most successful blog posts targeting non-specialist readers land between 60 and 75.

Where the Formula Carries Genuine Authority

Readability metrics have critics, and some of their criticisms are valid. But there are specific domains where the Flesch score is not just useful — it is formally required or professionally expected.

Legal and Insurance Documentation

More than forty U.S. states have enacted plain language statutes that either reference the Flesch Reading Ease score directly or cite readability standards traceable to Flesch's work. Florida's Insurance Code, for instance, has long required consumer policies to achieve a score of at least 45. Pennsylvania's plain language statute for consumer contracts is similarly calibrated. The Department of Defense has used Flesch-based standards for technical manuals since the 1970s. These are not soft editorial guidelines — they are compliance requirements with legal teeth.

Educational Publishing

Curriculum publishers use Flesch scores (alongside the related Flesch-Kincaid Grade Level formula) to ensure reading materials are age-appropriate. The relationship between Flesch Reading Ease and Flesch-Kincaid Grade Level is mathematically inverse and direct — a score of 90 corresponds roughly to 5th grade level, a score of 60 to about 9th grade. Educational content teams treat these numbers as hard constraints in their editorial workflows, not post-hoc checkboxes.

Healthcare and Patient Communication

The National Institutes of Health and the American Medical Association both recommend that patient-facing health communications target a 6th to 8th grade reading level — which maps to a Flesch Reading Ease score of approximately 60 to 80. Studies on medication adherence have consistently found that patients who receive instructions written in plainer language follow them more accurately. This is not a trivial finding: readability has measurable public health implications.

SEO and Content Marketing

In the SEO context, the Flesch score's authority is indirect but real. Google has never confirmed that readability scores are a direct ranking factor. What Google has confirmed, through its helpful content guidance and its E-E-A-T framework, is that content should be written for humans first. The correlation between high Flesch scores and positive UX signals — lower bounce rates, higher time-on-page, more social shares — is consistent across content audits. Yoast's integration of Flesch scoring into its plugin (which powers a significant fraction of all WordPress sites) has made the metric a practical checkpoint in millions of content workflows, regardless of its direct algorithmic weight.

What the Formula Does Not Measure

Intellectual honesty requires stating this plainly: the Flesch formula is syntactic, not semantic. It counts syllables and sentences. It does not know whether your argument is coherent, whether your examples are apt, or whether your logic is sound. A text composed entirely of short, simple sentences about a deeply counterintuitive concept can score 90 and still confuse a reader completely.

The formula also does not account for:

Reader prior knowledge. A cardiologist finds "ventricular fibrillation" transparent; a layperson finds it opaque. The syllable count is identical either way.
Sentence structure complexity beyond length. A 12-word sentence with three nested clauses is harder to parse than a 15-word sentence with a clean subject-verb-object structure. Flesch does not see this distinction.
Document-level coherence. Transitions, paragraph flow, and logical sequencing are invisible to the formula.
Typographic and layout factors. Dense blocks of text are harder to read than well-formatted prose with headers and lists, regardless of sentence length.

These limitations are not reasons to discard the metric. They are reasons to use it as one instrument in a broader diagnostic toolkit, not as a sole arbiter of quality.

Practical Application: How to Actually Improve Your Score

If your content is scoring below your target range, the highest-leverage interventions are:

Target polysyllabic words first. Before you touch sentence length, run a search for words with four or more syllables. "Utilization" becomes "use." "Demonstrate" becomes "show." "Approximately" becomes "about." Each substitution moves the needle more than cutting a sentence in half.

Break sentences at natural clause boundaries, not arbitrarily. "The report, which was completed after six months of fieldwork and incorporated feedback from forty-seven stakeholders, recommends restructuring the procurement process" should become two sentences — but cut it where meaning naturally divides, not just at a word count threshold.

Audit your passive constructions. Passive voice consistently adds syllables and complicates sentence structure. "The decision was made by the committee" scores worse than "the committee decided" on both the ASL and ASW dimensions simultaneously.

Use domain jargon deliberately, not habitually. Technical terms are sometimes necessary. When they are, define them once and use them freely. When they are not necessary, cut them. The writer who uses "utilize" when they mean "use" is not adding precision — they are subtracting clarity and lowering their Flesch score for no reason.

The Lasting Value of a 1948 Formula

In an era of AI-generated text, neural readability models, and large-scale NLP analysis, it might seem strange that a formula derived from regression analysis in the 1940s remains a professional standard. But its longevity is not accidental. The Flesch Reading Ease score is fast, deterministic, interpretable, and — critically — legally defensible. You can explain to a regulatory body exactly why a document scores 47 and exactly what changes would bring it to 60. You cannot do that with a black-box neural network.

Flesch built a tool that was precise enough to be useful and simple enough to survive. That combination is rarer than it looks.

Use the score honestly: as a diagnostic signal, a compliance checkpoint, and a discipline enforcer for writers who drift toward unnecessary complexity. Do not use it as a substitute for genuine clarity of thought. The formula can tell you that your sentences are long and your words are polysyllabic. Only you can tell the reader something worth knowing.