Why does AI need labeled data?

Why does AI need labeled data?

Artificial Intelligence (AI) is transforming industries and businesses worldwide, revolutionizing everything from healthcare facilities to newsrooms. One of the key components of AI that is often overlooked is labeled data. It is the foundation of AI and machine...
On the Path to the Responsible AI

On the Path to the Responsible AI

Based on an interview with Carlos Amaral, Co-Founder and CEO of Priberam Many industries, especially the media, are enthusiastic about using machine learning (ML) to enhance the analysis of large datasets. Annotated data is essential for machine learning and...
Who Spoke When?

Who Spoke When?

Use of voice fingerprints in multi-speaker recordings — Speaker Diarization A journalist’s life is made easier if questions like these can be resolved right away: What was said when? How often does the person speak, and where exactly is the audio or video? SELMA...
Need Computing Resources? Take a Queue Token!

Need Computing Resources? Take a Queue Token!

Training large neural models for speech and language processing (NLP), requires not only a lot of data input (read here on why and here on how SELMA is handling this ) but also a lot of computing resources. Nowadays, a fair share of computing resources are...
How to satisfy data-hungry machine learning

How to satisfy data-hungry machine learning

Machine learning requires large quantities of labeled training data (for more insights, read more in this post). That means, in order to reach acceptable performance, current speech recognition systems training demands thousands of hours of transcribed speech. For...