SELMA was (is) on the road
SELMA was a project in the field of Human Language Technologies (HLT). It ran from January 2021 until March 2024. Here you will find an overview of the project’s outcomes, including data, software and components.
From the original text (now outdated):
“It will help media monitors and journalists make sense of huge content streams (big data analysis) – and also enable them to enrich audiovisual (AV) output through transcription, translation, voice-over and subtitling, thus making it more accessible.
Open-source Platform
The SELMA consortium aims to build a multilingual open-source platform that can process (very) large volumes of content and will feature a (self) learning AI system that is able to share information about data streams – and keep the added value of each language through a novel approach.
Shared Space
The idea is to create a crosslingual common space, which means: The system will always collect and analyze data in the original language and subsequently translate it into another language upon request.
To keep up with news, research results and prototypes, make sure to stay connected with SELMA on Twitter.”