translatedlabs.com

Semantic relationships

Information on searching for semantic relationships

Introduction

This application searches for semantic relationships in a text by analyzing the statistical properties of words.

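It is not based on rules. Instead, it relies on the probability that two words appear in the same phrase purely by chance, that is, without being related. The less likely a co-occurrence is to be accidental, the stronger the evidence of a relationship.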
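As a rough illustration of this idea, the sketch below compares how often two words actually share a phrase with how often they would do so by chance if they were unrelated (independent). This is not the application's actual code: the function names `cooccurrence_stats` and `chance_vs_observed`, the whitespace tokenisation, and the independence baseline are all assumptions made for the example.

```python
from collections import Counter
from itertools import combinations


def cooccurrence_stats(phrases):
    """Count, per phrase, which words and which word pairs occur.

    Hypothetical helper: each phrase is lower-cased and split on whitespace,
    and every word is counted at most once per phrase.
    """
    word_counts = Counter()
    pair_counts = Counter()
    for phrase in phrases:
        words = set(phrase.lower().split())
        word_counts.update(words)
        pair_counts.update(frozenset(p) for p in combinations(sorted(words), 2))
    return word_counts, pair_counts, len(phrases)


def chance_vs_observed(a, b, word_counts, pair_counts, n):
    """Return (expected, observed) co-occurrence probabilities for a and b.

    expected: probability of sharing a phrase if the two words were unrelated.
    observed: fraction of phrases in which they really appear together.
    """
    expected = (word_counts[a] / n) * (word_counts[b] / n)
    observed = pair_counts[frozenset((a, b))] / n
    return expected, observed


if __name__ == "__main__":
    # Toy phrases invented for the example, not taken from the demo corpus.
    phrases = [
        "the plane flies",
        "the bird flies",
        "the plane lands",
        "a bird sings",
    ]
    words, pairs, n = cooccurrence_stats(phrases)
    print(chance_vs_observed("the", "plane", words, pairs, n))
```

When the observed value is well above the expected one, the pair co-occurs more often than chance alone would explain. A real system would need a much larger corpus and a proper significance measure.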

Technology

It creates an n-dimensional representation of words (PLSA, probabilistic latent semantic analysis), using the statistical properties of the words that appear next to them as coordinates. This demo uses European Parliament debates as its corpus.
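To make the idea of "neighbouring words as coordinates" concrete, here is a minimal sketch that builds a count vector for each word from the words next to it and compares two vectors with cosine similarity. It is a plain co-occurrence model, not PLSA itself, and the names `context_vectors` and `cosine`, the window size, and the sample sentences are assumptions for illustration only.

```python
import math
from collections import Counter, defaultdict


def context_vectors(sentences, window=1):
    """Map each word to a sparse vector counting its neighbouring words.

    Each distinct neighbour acts as one coordinate; `window` is how many
    positions to the left and right count as "next to" a word.
    """
    vectors = defaultdict(Counter)
    for sentence in sentences:
        tokens = sentence.lower().split()
        for i, word in enumerate(tokens):
            lo = max(0, i - window)
            hi = min(len(tokens), i + window + 1)
            for j in range(lo, hi):
                if j != i:
                    vectors[word][tokens[j]] += 1
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(u[k] * v[k] for k in u.keys() & v.keys())
    norm_u = math.sqrt(sum(x * x for x in u.values()))
    norm_v = math.sqrt(sum(x * x for x in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


if __name__ == "__main__":
    # Invented sentences; the demo itself uses European Parliament debates.
    sentences = [
        "the airplane flies high",
        "the helicopter flies high",
        "the parliament votes today",
    ]
    vecs = context_vectors(sentences)
    print(cosine(vecs["airplane"], vecs["helicopter"]))
    print(cosine(vecs["airplane"], vecs["parliament"]))
```

Words that appear in similar contexts end up with similar vectors, so "airplane" scores closer to "helicopter" than to "parliament" in this toy corpus. PLSA goes further by fitting a probabilistic model with latent topics, which this sketch does not attempt.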

Why have we developed this?

This technology is an integral part of a larger project that extracts translated terminology from the web.

For example, if you want to find the English translation of "Metallizzazione" on the web, it will be difficult to find bilingual sites from which the information can be extracted. But you will find more than 50,000 Italian pages on Google that talk about "Metallizzazione". From these pages, you will discover that "Metallizzazione" has semantic relationships with "vuoto", "impianto", "vernice", "finitura", and "metallo", for which the English translations can be easily found. At this point, you can search for what the words "vacuum", "plant", "paint", "finish", and "metal" have in common, and the answer will be "Metallization", the translation you were looking for!
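The last step of this example, finding what several words have in common, can be sketched as a simple vote: each seed word "votes" for the words it is related to, and the candidate with the most votes wins. The `related` table below is hand-made toy data and `common_relations` is a hypothetical helper. In the real system the relationships would come from the statistical analysis, not from a hard-coded dictionary.

```python
from collections import Counter


def common_relations(seeds, related):
    """Rank candidate words by how many seed words they are related to.

    `related` maps a word to the set of words it has a semantic
    relationship with. Seed words themselves are never returned.
    """
    seed_set = set(seeds)
    votes = Counter()
    for seed in seed_set:
        for candidate in related.get(seed, ()):
            if candidate not in seed_set:
                votes[candidate] += 1
    return votes.most_common()


if __name__ == "__main__":
    # Toy relationships, invented purely for illustration.
    related = {
        "vacuum": {"metallization", "pump", "chamber"},
        "plant": {"metallization", "factory", "tree"},
        "paint": {"metallization", "colour", "brush"},
        "metal": {"metallization", "steel", "plant"},
    }
    ranking = common_relations(["vacuum", "plant", "paint", "metal"], related)
    print(ranking[0])
```

With this toy table, "metallization" is the only candidate related to all four seed words, so it ranks first.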

I want it!

If you are interested in this technology, please read more on Translated Labs and our services for natural language processing.

I can do better!

We are constantly looking to hire great engineers with a global mindset.
Get in touch if you think you can improve any of these applications.

Explore our experiments

Spoken Language Identifier

The Spoken Language Identifier automatically detects the language of a spoken text. You can use it to classify recordings from 1 second to 1 minute. It currently supports 8 languages.

Terminology Extractor

This tool automatically extracts the terminology of a technical topic from a written text. It can help translators identify the difficulties in a document, and simplify the process of creating glossaries.

Readability Analyzer

Written information, especially on the Internet, must be easy to read and well structured. This application helps you understand whether a text is easy to read or needs improvement.

Language Identifier

The Language Identifier automatically detects the language of a written text. It can also be used to identify the topic of a written text in a language you do not understand.

Semantic relationships

What do the words airplane, bird, and helicopter have in common? This application searches for semantic relationships in a text by analyzing the statistical properties of words.

Translation Party

What happens when you translate an English sentence into Japanese, then back into English, and keep repeating the round trip as if in an infinite loop? Well, give it a try! And don't forget to share the funniest results with your friends.
