Open main menu
Home
Random
Recent changes
Special pages
Community portal
Preferences
About Wikipedia
Disclaimers
Incubator escapee wiki
Search
User menu
Talk
Dark mode
Contributions
Create account
Log in
Editing
Readability
(section)
Warning:
You are not logged in. Your IP address will be publicly visible if you make any edits. If you
log in
or
create an account
, your edits will be attributed to your username, along with other benefits.
Anti-spam check. Do
not
fill this in!
=== Lexico-semantic === The type-token ratio is one of the features that are often used to captures the lexical richness, which is a measure of vocabulary range and diversity. To measure the lexical difficulty of a word, the relative frequency of the word in a representative corpus like the [[Corpus of Contemporary American English]] (COCA) is often used. Below includes some examples for lexico-semantic features in readability assessment.<ref name="Computational assessment of text re" /> *Average number of syllables per word *Out-of-vocabulary rate, in comparison to the full corpus *Type-token ratio: the ratio of unique terms to total terms observed *Ratio of function words, in comparison to the full corpus *Ratio of pronouns, in comparison to the full corpus *Language model perplexity (comparing the text to generic or genre-specific models)
Edit summary
(Briefly describe your changes)
By publishing changes, you agree to the
Terms of Use
, and you irrevocably agree to release your contribution under the
CC BY-SA 4.0 License
and the
GFDL
. You agree that a hyperlink or URL is sufficient attribution under the Creative Commons license.
Cancel
Editing help
(opens in new window)