What Is Readability and How
to Measure It?
Zhai Yujing (翟玉净)
(The Institute of Online Education, Beijing Foreign
Studies University)
This article
outlines the subject of ‘readability’ and its relevance to reading
materials. The paper starts with some facts to show you the significance of
knowing the concept of ‘readability’. Then, details, including its definition
and historical overview are explicated to further your understanding of ‘readability’.
Next, the author moves on to the focus of this paper-readability formulas, which refer to the methods writers or educators can
use to assess and determine the reading levels of the materials. Finally, this
paper concludes with some suggestions on using readability formulae
scientifically that will help to increase the readability level.
A true story …
“The CEO of a firm that manufactures electronic controls asked me
(Leon Fletcher) to rewrite the instruction manual for his product. At a meeting
of his executives, I asked, “Are you sure your controls work — have you tested
them?”
They
bristled at that question, understandably. They roared, “Of course we test
every one of them!” My reply: “Good. And have you tested your instruction
manual?”
They
had not. The manual was written by a manager, reviewed by engineers, approved
by sales. It required the reading skills of a college senior, according to my
readability study. But the manual was intended for use by unskilled laborers,
most with only fifth grade educations.”
Some other facts …
l More than one out of four cancer patients were not able to understand any of 30 booklets produced by the American Cancer Society, according to a 1995 study by Villanova University.
l
The catalog
of one California community college stated its “General Education Requirements”
— essential information for students just out of high school — in text that
required reading skills beyond those of a Ph.D.
l
Three out of
four adults 60 years and older find it “hard to understand” most information in
newspapers, said a study by the Federal Drug Administration.
l
It was not long ago (i.e., in
1994) that the results from the "Adult Literacy in America" study
(accomplished by the Educational Testing Service for the U.S. Government; which
studied 26,000 adults) were announced a surprising number of college graduates
were found to only read at less than college levels.
The above facts prove that ‘readability’ is
severely neglected by some writers and educators. Some experts recommend that
any written composition/document that is utilized, via reading, to communicate
necessary details to individuals be very carefully examined with respect to its
readability level. To expect an individual to comprehend or otherwise
understand a written communication that requires reading skills beyond that
possessed by the involved individual can be a very major error. Therefore, this
paper focuses on ‘readability formula’ which is a very effective way to
help you to assess the readability level of the texts you write or select for
your readers.
In the broadest
sense, ‘readability’ is the sum total (including interactions) of all
those elements within a given piece of printed material that affect the success
which a group of readers have with it (Dale & Chall, 1948). The success is
the extent to which they understand it, read it at optimum speed, and find it
interesting.
A more reasonable
definition of ‘readability’ that is in keeping with more recent research
and theory is the level of ease or difficulty with which text can be understood
by a particular reader who is reading that text for a specific purpose.
The purpose of readability assessment is to effect a ‘best match’ between intended readers and texts. Readability tests, which are mathematical formulas, were designed to assess the suitability of reading materials for readers at particular grade levels or ages. These readability formulas are the most widely used methods for predicting text difficulty. A readability formula or index ‘is like a yardstick that help us measure certain qualities in the writing so we can make objective judgements about reading level’ (Laubach & Koschnick, 1977, p.12).
According to Timothy D. Giles, Talmudic priests in the 10th century C.E. employed a kind of readability formula, calculating ordinary and abstract meanings using word and idea tallies. These calculations were developed because the priests were concerned that their meanings would be clearly and precisely understood.
Modern research into readability formula began in 1921 when Thorndike published The Teacher’s Word Book in which he tabulated words according to the frequency of their use in general literature. The book is really worth mentioning because it provided a means for measuring the difficulty of words and permitted the development of mathematical formula. Since then, the research on readability formula started to develop rapidly. During 1920’s, research activities concentrated on looking for word factors that could be used to predict readability. Research broadened during the 1930’s and throughout the 1940’s, deriving formulas that could accurately predict readability using the least number of factors.
3.1 Review
By 1973 more than
200 language variables had been tried in almost 200 different readability
formulas, but the vast majority of these formulas are no longer in use. Today there
are by now dozens of readability formulas in existence, but for general use
there are only a handful to choose from. They fall into two groups-those based on sentence length and a word list and
those based on sentence length and word length. Put another way, readability
formulas are usually based on one semantic (the difficulty of words) and one
syntactic factor (the difficulty of sentences). Words are either measured
against a frequency list or are measured according to their length in
characters or syllables. Sentences are measured for the average length in
characters or words. Studies have confirmed that counting more things does not
make the formula any predictive of reading ease but takes a lot more effort.
Readability tests
can be performed manually by counting and doing a mathematical calculation, or
be referring to a chart or graph. They also can be performed by computer. Most
grammar or editing software today can perform several readability tests.
3.2 General principles
Before using a
readability formula, you’d better go through the following principles first:
1)
When counting
syllables for these tests, it helps to say the words aloud.
Some examples of syllables count are: another (3), area (3), passed (1), surface (2), surfaces (3), particle (3), enable (3).
2)
When counting
numbers, symbols, initials, etc, count one syllable for each number or letter.
For example: 1998 = 4 syllables (1-9-9-8), 4.2 = 3 syllables (4-point –2), H2O
= 3 syllables (H-2-O), USA = 3 syllables (U-S-A), Fig. 2 = 2 syllables
(Fig-two)
3)
For
abbreviations (cm, mm, km, kg. eg, ie), the usual rule is to count each as just
one syllable. Since headings and sub-headings are usually not sentences, they
are best ignored. It is not known how to deal with a formula or a numerical
calculation, so for the results below they were just ignored.
3.3 Some widely recognized readability formulas
3.3.1
Fry
Readability Graph
Fry developed a ‘Readability
Graph’ in 1965 for predicting readability. He used the common formula variables
of syllables per 100 words and words per sentence. The user marks
the counts of the variables on a graph and then reads the readability grade
score directly from it.
When you refer
to Fry Readability Graph, follow the steps below:
Step1. Select
samples of 100 words.
Step2. Find y, the average number of sentences per
100-word passage (calculating to the nearest tenth.
Step3. Find x, the average number of syllables per
100-word sample.
Step4. Use the Fry graph to determine the reading
age, in years or grade level.
The curve
represents normal texts. Points below the curve imply longer than average
sentence length. Points above the curve represent text with a more difficult vocabulary
(as in school science texts).
This text is
suitable for all ages, from infant to upper secondary.


3.3.2
The
Flesch Reading Ease Formula
Flesch published
his formula in 1943. In designing it for general adult reading matter, he felt
it gave proper attention to abstract words as well as sentence length. The
formula uses four factors: number of syllables per 100 word, average number of
words per sentence, number of personal words per 100 words, and number of
personal sentences per 100 sentences.
The
output of the Flesch Reading Ease formula is a number from 0 to 100, with a
higher score indicating easier reading. For example, zero means practically
unreadable and 100 means extremely easy. The minimum score for plain English is
60, or about 20 words per sentence and 11/2 syllables per word.
Step 1. Count
the words.
Count the words in your piece of writing. Count as single words contractions, hyphenated words, abbreviations, figures, symbols and their combinations, e.g., wouldn't, full-length, TV, 17, &, $15, 7%.
Step 2. Count
the syllables.
Count the syllables in your piece of writing. Count the
syllables in words as they are pronounced. Count abbreviations, figures,
symbols and their combinations as one-syllable words. If a word has two
accepted pronunciations, use the one with fewer syllables. If in doubt, check a
dictionary.
Step 3. Count
the sentences.
Count the sentences in your piece of writing. Count as a
sentence each full unit of speech marked off by a period, colon, semicolon,
dash, question mark or exclamation point. Disregard paragraph breaks, colons,
semicolons, dashes or initial capitals within a sentence. For instance, count
the following as a single sentence:
You qualify if —
(1) you are at least 58 years old; and
(2) your total household income is under $5,000.
Step 4. Figure
the average number of syllables per word.
Divide the number of syllables by the number of words.
Step 5. Figure
the average number of words per sentence.
Divide the number of words by the number of sentences.
Step 6. Find
your readability score.
Find the average sentence length and word length of your
piece of writing on the chart (below). Take a straightedge or ruler and connect
the two figures. The intersection of the straightedge or ruler with the center
column shows your readability score.

Here are the scores of some popular reading materials. These
are average scores of random samples.
|
Comics |
92 |
|
Consumer ads in magazines |
82 |
|
Movie Screen |
75 |
|
Seventeen |
67 |
|
Reader's Digest |
65 |
|
Sports Illustrated |
63 |
|
New York Daily News |
60 |
|
Atlantic Monthly |
57 |
|
Time |
52 |
|
Newsweek |
50 |
|
Wall Street Journal |
43 |
|
Harvard Business Review |
43 |
|
New York Times |
39 |
|
New York Review of Books |
35 |
|
Harvard Law Review |
32 |
|
Standard auto insurance policy |
10 |
|
Internal Revenue Code |
Minus 6 |
Here's how the scores translate into school grades. Reading
matter with the score shown on the left side will be easy for students on the
level shown on the right.
|
Score |
School Level |
|
90 to 100 |
5th grade |
|
80 to 90 |
6th grade |
|
70 to 80 |
7th grade |
|
60 to 70 |
8th and 9th grade |
|
50 to 60 |
10th to 12th grade (high school) |
|
30 to 50 |
College |
|
0 to 30 |
College graduate |
Besides the
chart, you can also use an algorithm to get Flesch Reading Ease score. It reads
as follows:
206.835 – (1.015 × ASL)
– (84.6 × ASW)
Where:
ASL = average
sentence length (the number of words divided by the number of sentences)
ASW = average number of syllables per word (the number of syllables
divided by the number of words)
3.3.3
Flesch-Kincaid
Readability Ease Formula
Flesch-Kincaid Grade Level, which is like
Flesch Reading Ease, is based on the average number of syllables per word and
words per sentence. The Flesch-Kincaid Grade Level score rates text based on
the U.S. high school grade level system (i.e. a score of 7.0 would mean a 7th
grader should be able to comprehend the text). The Flesch Reading Ease score is
based on a 100 point scale; the higher the score, the easier it is to
comprehend.
The
following is the algorithm to determine the Flesch-Kincaid grade level:
1.
Calculate the
average number of words you use per sentence, ASL.
2.
Calculate the
average number of syllables per word, ASW.
3.
Multiply the
average number of words by 0.39 and add it to the average number of syllables
per word multiplied by 11.8.
4.
Subtract
15.59 from the result.
Algorithm: (0.39 × ASL) + (11.8 ×ASW) - 15.59
A
score of 8.0 means that an eighth grader can understand the document. For most
standard documents should aim for a score of approximately 7.0 to 8.0.
3.3.4
McLaughlin
‘SMOG’ Formula
The SMOG
Readability Formula, adapted from McLaughlin, G. (1969) is a simple method you
can use to determine the reading level of your written materials. To calculate
the SMOG (Simplified Measure of Gobbledygoop) reading grade level, begin with
the entire written work that is being assessed, and follow these four steps:
1.
Count off 10
consecutive sentences near the beginning, in the middle, and near the end of
the text.
2.
From this
sample of 30 sentences, circle all of the words containing three or more
syllables (polysyllabic), including repetitions of the same word, and total
the number of words circled.
3.
Estimate the
square root of the total number of polysyllabic words counted. This is done by
finding the nearest perfect square, and taking its square root.
4.
Finally, add
a constant of three to the square root. This number gives the SMOG grade, or
the reading grade level that a person must have reached if he or she is to
fully understand the text being assessed.
A few
guidelines will help to clarify these directions:
Ø
A sentence is
defined as a string of words punctuated with a period (.), an exclamation point
(!) or a question mark (?).
Ø
Hyphenated
words are considered as one word.
Ø
Numbers which
are written out should also be considered, and if in numeric form in the text,
they should be pronounced to determine if they are polysyllabic.
Ø
Proper nouns,
if polysyllabic, should be counted, too.
Ø
Abbreviations
should be read as unabbreviated to determine if they are polysyllabic.
Not all pamphlets,
fact sheets, or other printed materials contain 30 sentences. To test a text
that has fewer than 30 sentences:
1.
Count all of
the polysyllabic words in the text.
2.
Count the
number of sentences.
3.
Find the
average number of polysyllabic words per sentence as follows:
Average= Total # of polysyllabic
words/Total # of sentences
4.
Multiply that
average by the number of sentences short of 30.
5.
Add that
figure on to the total number of polysyllabic words.
6.
Find the
square root and add the constant (常数, 恒量) of 3.
A quick version
of the SMOG test:
Count the number
of polysyllabic words in the chain of 30 sentences and look up the approximate
grade level on the SMOG conversion table.
|
Total
polysyllabic word counts |
Approximate grade
level (+/- 1.5 grades) |
|
0-2 |
4 |
|
3-6 |
5 |
|
7-12 |
6 |
|
13-20 |
7 |
|
21-30 |
8 |
|
31-42 |
9 |
|
43-56 |
10 |
|
57-72 |
11 |
|
73-90 |
12 |
|
91-110 |
13 |
|
111-132 |
14 |
|
133-156 |
15 |
|
157-182 |
16 |
|
183-210 |
17 |
|
211-240 |
18 |
Sample:
In Controlling Cancer—You Make a Difference
The key is action. You can help
protect yourself against cancer. Act promptly to:
Prevent some cancers through simple
changes in lifestyle.
Find out about early detection tests
in your home.
Gain peace of mind through regular
medical checkups.
Lung Cancer is the number one cancer
among men, both in the number of new cases each year (79,000) and deaths
(70,500). Rapidly increasing rates are due mainly to cigarette smoking. By not smoking,
you can largely prevent lung cancer. The risk is reduced by smoking less, and
by using lower tar and nicotine brands. But quitting altogether is by far the
most effective safeguard. The American Cancer society offers Quit Smoking
Clinics and selfhelp materials.
Colorectal Cancer is second in cancer
deaths (25,100) and third in new cases (49,000). When it is found early,
chances of cure are good. A regular general physical usually includes a digital
examination of the rectum and a guaiac slide test of a stool specimen to check
for invisible blood. Now there are also Do-It-Yourself Guaiac Slides for home use.
Ask your doctor about them. After you reach the age of 40, your regular
check-up may include a “Procto,” in which the rectum and part of the colon are
inspected through a hollow, lighted tube.
Prostate Cancer is second in the
number of new cases each year (57,000), and third in deaths (20,600). It occurs
mainly in men over 60. A regular rectal exam of the prostate by your doctor is the
best protection.
Be sure to have a regular, general physical
including an oral exam. It is your best guarantee of good health.
How Cancer Works
If we know something about how cancer
works, we can act more effectively to protect ourselves against the disease.
Here are the basics:
1.
Cancer
spreads; time counts—Cancer is uncontrolled growth of abnormal cells. It begins
small and if unchecked, spreads. If detected in an early, local stage, the
chances for cure are best.
2.
Risk
increases with age—This is not a reason to worry, but a signal to have more
regular, thorough physical check-ups. Your doctor or clinic can advise you on
what tests to get and how often they should be performed.
3.
What
you can do—don’t smoke and you will sharply reduce you chances of getting lung cancer.
Avoid too much sun, a major cause of skin cancer. Learn cancer’s Seven Warning Signals,
listed on the back of this leaflet, and see your doctor promptly if they
persist. Pain is usually a late symptom of cancer; don’t wait for it.
Know Cancer’s Seven Warning Signals
1.
Change
in bowel or bladder habits.
2.
A sore
that does not heal.
3.
Unusual
bleeding or discharge.
4.
Thickening
or lump in breast or elsewhere
5.
Indigestion
or difficulty in swallowing
6.
Obvious
change in wart or mole
7.
Nagging
cough or hoarseness
If you have a warning signal, see your
doctor.
Beware of unproven cancer remedies.
They may sound appealing, but they are usually worthless. Relying on them can
delay good treatment until it is too late. Check with your doctor or the
American Cancer Society.
More Information
For more information of any kind about
cancer--free of cost--contact your local unit of the American Cancer Society
We have calculated the reading grade level for this
example.
Compare your results to ours, and then
check both with the SMOG conversion table:
Readability Test Calculations