by kseniaskriptchenko | Mar 6, 2023 | AI
Artificial Intelligence (AI) is transforming industries and businesses worldwide, revolutionizing everything from healthcare facilities to newsrooms. One of the key components of AI that is often overlooked is labeled data. It is the foundation of AI and machine...
by kseniaskriptchenko | Dec 30, 2022 | AI, BIAS, Diversity
Based on an interview with Carlos Amaral, Co-Founder and CEO of Priberam Many industries, especially the media, are enthusiastic about using machine learning (ML) to enhance the analysis of large datasets. Annotated data is essential for machine learning and...
by kaymacquarrie | Dec 15, 2022 | Uncategorized
Where: Portugal (online) When: December, 15th, 2022 What is it? “The Locdoc – Media Localization Masterclass is an event promoted by localization experts and tech language consultants” Why are we there? Peggy van der Kreeft, Innovation Manager at the...
by Tugtekin Turan | Dec 13, 2022 | Speaker Diarization
Use of voice fingerprints in multi-speaker recordings — Speaker Diarization A journalist’s life is made easier if questions like these can be resolved right away: What was said when? How often does the person speak, and where exactly is the audio or video? SELMA...
by kaymacquarrie | Nov 3, 2022 | News, HLT
What’s new in the multilingual newsroom in terms of AI-supported tools? How to produce content which is available in many languages and formats and which is accessible to people with disabilities? How to monitor vast amounts of data and cluster information in...
by kaymacquarrie | Jul 7, 2022 | conferences, Meet us @
Where: Berlin, Germany When: July, 11-13th, 2022 What is it? “LocWorld is the leading conference for international business, translation, localization and global website management. Attendees are the people responsible for communicating across the boundaries of...
by guntisbarzdins | Jun 30, 2022 | NLP
Training large neural models for speech and language processing (NLP), requires not only a lot of data input (read here on why and here on how SELMA is handling this ) but also a lot of computing resources. Nowadays, a fair share of computing resources are...
by kseniaskriptchenko | May 26, 2022 | AI, News, HLT
Machine learning requires large quantities of labeled training data (for more insights, read more in this post). That means, in order to reach acceptable performance, current speech recognition systems training demands thousands of hours of transcribed speech. For...
by kaymacquarrie | Mar 22, 2022 | Meet us @, Uncategorized
When: March 21-23 Where: Brazil (online) What is it? “The 15th edition of the International Conference on the Computational Processing of Portuguese (PROPOR 2022) will be held at the University of Fortaleza, in Fortaleza, Ceará, in Brazil, from March 21st to...
by kseniaskriptchenko | Mar 1, 2022 | AI, Diversity, News, HLT
Diversity is becoming the new norm according to the German Zukunftsinstitut in its megatrend survey. Megatrends are said to last for at least a decade which means – the topic will stick with us into the 2030s. But let’s start with the basics: what is...