Spoken Language Identifier

The spoken language identifier is a service that tries to determine the language spoken in an audio recording.
The model currently supports 8 languages: English, Spanish, Italian, French, German, Portuguese, Dutch, and Russian.

You can test the spoken language identifier in several ways: recording your audio, uploading an audio file or using one of our examples:

Information on on the spoken language identifier

Introduction

The spoken language identifier is a service that tries to determine the language spoken in an audio recording.

The model currently supports 8 languages: English, Spanish, Italian, French, German, Portuguese, Dutch, and Russian.

Supported audio formats: WAV, FLAC, OGG.

Technology

The model uses convolutional and recurrent neural networks trained on tens of hours of speech data. This is an end-to-end model that uses a raw waveform as input and makes no assumptions about the phonetics or the grammars of the languages considered. Rather, it tries to infer all the relevant features of the audio from the data. It produces the probability distribution over the languages recognized by the model as the output.

You can use it to classify recordings as short as 1 second and as long as a minute. Note that the longer the recording, the higher the accuracy of the prediction. For 20 second recordings the accuracy is about 95%, while for 5 second samples it is just over 80%

I want it

If this technology interests you, please have a look at our API available on Rapidapi

I can do better!

We are constantly looking to hire great engineers with a global mindset.
Get in touch if you think you can improve any of these these applications.

Get in touch

Explore our experiments

Spoken Language Identifier

The Spoken Language Identifier automatically detects the language of a spoken text. You can use it to classify recordings from 1 second to 1 minute. It currently supports 8 languages.

Learn more or Get API
Terminology Extractor

This tool automatically extracts the terminology of a technical topic from a written text. It can help translators identify the difficulties in a document, and simplify the process of creating glossaries.

Learn more or Get API
Readability analyzer

Written information, especially on the Internet, must be easy to read and well structured. This application helps you understand if a text is easily readable, or if it needs improvement.

Learn more or Get API
Language Identifier

The Language Identifier automatically detects the language of a written text. It can also be used to identify the topic of a written text in a language you do not understand.

Learn more
Semantic relationships

What do the words airplane, bird, and helicopter have in common? This application searches for semantic relationships in a text by analyzing the statistical properties of words.

Learn more
Translation Party

What happens when you translate an English sentence into Japanese, and then again into English, as if it was an infinite loop? Well, give it a try! And don't forget to share the funniest results with your friends.

Learn more