Cliché Score
"Cliché" might be a bit strong: this score measures how often common phrases appear in a text.
Whether these are clichés or just frequent building blocks of American English is for the reader to decide.
Try, say, The Great Gatsby, The Four Quartets, Hamlet, or Alice in Wonderland.
[How it works]
Tokenize → form 4-grams → match to a list of common 4-grams → sum a log-scaled score.
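The pipeline above can be sketched roughly as follows. This is only an illustration, not the code behind this page: the tokenizer regex, the tiny `COMMON` table with its made-up counts, and the choice of `log1p` for the log scaling are all assumptions.

```python
import math
import re

# Hypothetical table of common 4-grams and invented frequency counts.
# A real list would come from a large corpus.
COMMON = {
    ("at", "the", "end", "of"): 1000,
    ("the", "end", "of", "the"): 800,
}

def tokenize(text):
    """Lowercase the text and keep runs of letters and apostrophes (an assumed rule)."""
    return re.findall(r"[a-z']+", text.lower())

def four_grams(tokens):
    """Return every run of four consecutive tokens as a tuple."""
    return [tuple(tokens[i:i + 4]) for i in range(len(tokens) - 3)]

def cliche_score(text, common=COMMON):
    """Sum log(1 + count) over the 4-grams of the text that appear in the table."""
    grams = four_grams(tokenize(text))
    return sum((math.log1p(common[g]) for g in grams if g in common), 0.0)

print(cliche_score("At the end of the day"))
```

A raw sum grows with the length of the text, so comparing books of different sizes would probably need some normalization, for example dividing by the number of 4-grams. Whether the score here does that is not stated.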
Try Project Gutenberg for more books.