translatedlabs.com

Readability analyzer

Information about the readability analysis

Introduction

Word length and phrase length influence the ease of reading and understanding of a given text. Short words are usually more common (Zipf's law). Short sentences require less abstraction ability to understand. The readability analysis could be useful to make a text better, augmenting its accessibility.

Why have we developed this?

The readability index tells us how easy a given text is to understand. A well-written text is effective, easy to understand and quick to read. This index helps us understand the text's complexity in order to better schedule the activities of translators and revisers. More than ever, written information, especially in the Internet, must be direct and well structured. This analysis can help achieve both goals.

Technology


Readability

Readability is calculated using the Gulpease Index. This index has been implemented for Italian, English and French. For German and Spanish, only the readability index works. If your language is not yet supported and you are interested in this technology, contact us

Terminology

It uses Poisson statistics, the Maximum Likelihood Estimation and Inverse Document Frequency between the frequency of words in a given document and a generic corpus of 100 million words per language. It uses a probabilistic part-of-speech tagger to take into account the probability that a particular sequence of words could be a term. It creates n-grams of words by minimizing the relative entropy. For more information see Terminology Extraction.

I can do better!

We are constantly looking to hire great engineers with a global mindset.
Get in touch if you think you can improve any of these these applications.

Get in touch

Explore our experiments

Spoken Language Identifier

The Spoken Language Identifier automatically detects the language of a spoken text. You can use it to classify recordings from 1 second to 1 minute. It currently supports 8 languages.

Learn more or Get API
Terminology Extractor

This tool automatically extracts the terminology of a technical topic from a written text. It can help translators identify the difficulties in a document, and simplify the process of creating glossaries.

Learn more or Get API
Readability analyzer

Written information, especially on the Internet, must be easy to read and well structured. This application helps you understand if a text is easily readable, or if it needs improvement.

Learn more or Get API
Language Identifier

The Language Identifier automatically detects the language of a written text. It can also be used to identify the topic of a written text in a language you do not understand.

Learn more
Semantic relationships

What do the words airplane, bird, and helicopter have in common? This application searches for semantic relationships in a text by analyzing the statistical properties of words.

Learn more
Translation Party

What happens when you translate an English sentence into Japanese, and then again into English, as if it was an infinite loop? Well, give it a try! And don't forget to share the funniest results with your friends.

Learn more