Machine Learning

My year with plain X

by Konstantin Klein | Jan 4, 2024 | AI, News, HLT

Artificial intelligence in journalism:  In newsrooms worldwide, there is a heated debate about what it means for the future of the media industry and its employees. First attempts to have texts written by AI-fueled systems are viewed with suspicion, and every mistake...

The SELMA Open-Source Platform

by normundsgruzitis, guntisbarzdins | Oct 18, 2023 | News, HLT, NLP, open-source

We are excited to introduce the SELMA Open-Source Software (OSS) Platform, developed by IMCS at the University of Latvia. SELMA OSS offers effective means to test and compare the performance of various language models used in multilingual media monitoring and content...

Introducing LeBenchmark: A Comprehensive Framework for Evaluating Self-Supervised Learning

by kseniaskriptchenko | Sep 19, 2023 | AI

Self-supervised learning (SSL) has ushered in a new era of advancements across diverse domains, including computer vision, natural language processing, and speech processing. By harnessing the power of vast amounts of unlabeled data, SSL techniques have unlocked...

Why does AI need labeled data?

by kseniaskriptchenko | Mar 6, 2023 | AI

Artificial Intelligence (AI) is transforming industries and businesses worldwide, revolutionizing everything from healthcare facilities to newsrooms. One of the key components of AI that is often overlooked is labeled data. It is the foundation of AI and machine...

Who Spoke When?

by Tugtekin Turan | Dec 13, 2022 | Speaker Diarization

Use of voice fingerprints in multi-speaker recordings — Speaker Diarization A journalist’s life is made easier if questions like these can be resolved right away: What was said when? How often does the person speak, and where exactly is the audio or video? SELMA...