The Limits of Speech Systems: Navigating Adversarial and Poisoning Threats with Robust Defenses

Seminare from Thomas Thebaud, researcher at CLSP, JHU   Date: 18/12/2024 Time : 13h30 Place: IC2, Boardroom Speaker: Thomas Thebaud,     The Limits of Speech Systems: Navigating Adversarial and Poisoning Threats with Robust Defense   Summary: The widespread adoption of voice-controlled devices and speech recognition systems underscores the critical need for robust security measures […]

TTS for low resource languages, dialects and accents

TTS for low resource languages, dialects and accents (13/12/2024) Currently, many different neural architectures are available to use a Text-to-Speech (TTS) system on the shelf. However it is not always easy to choose the best network for a given application. Especially the limits and drawbacks of pre-trained models are not well defined. This can be […]

Analyse et quantification des représentations genrées dans les médias audiovisuels : cinq années d’intéractions entre la recherche, les acteurs politiques et le grand public

Seminare from David Doukhan, researcher et Ina   Date: 12/12/2024 Time : 14h00 Place: IC2, Boardroom Speaker: David Doukhan     Analysing and quantifying gendered representations in the audiovisual media: five years of interaction between research, political players and the general public   Summary: The media have been described by the philosopher Michel Foucault as […]

From document to program embeddings: can distributional hypothesis really be used on programming languages?

Seminar from Thibaut Martinet, PhD student at LIFO, Orléans University   Date: 06/12/2024 Time: 10h15 Place: IC2, Boardroom Speaker: Thibaut Martinet     From document to program embeddings: can distributional hypothesis really be used on programming languages?   Programming language processing is a field of increasing interest, as more and more models become available, either […]

Information Extraction and Analysis from News videos

Seminar from Sadok Mansouri, ATER at LIUM   Date: 15/11/2024 Time: 11h00 Place: IC2, Boardroom Speaker: Sadok Mansouri     Information Extraction and Analysis from News videos   Information extraction from videos is an important research topic in content-based video indexing and retrieval. Indeed, the visual text present in news videos typically provides rich semantic […]

Offre stage M2 : Machine Learning for Acoustic-Based Keystroke Recognition: A Study on Security Vulnerabilities

Machine Learning for Acoustic-Based Keystroke Recognition: A Study on Security Vulnerabilities Supervsisors : Kais Hassan (LAUM), Meysam Shamsi (LIUM) Host Laboratory: Laboratoire d’Informatique de l’Université du Mans (LIUM) – Laboratoire d’Acoustique de l’Université du Mans (LAUM). Location : Le Mans Université Beginning of internship: February 2025 Contact : Kais Hassan, Meysam Shamsi (firstname.name@univ-lemans.fr)  

Offre stage M2 : Construction de Sound Zones par apprentissage automatique sur un large jeu de données

Constructing Sound Zones using machine learning on a large dataset Supervsisors : Théo Mariotte (LIUM), Manuel Melon (LAUM), Marie Tahon (LIUM) Host Laboratory: Laboratoire d’Informatique de l’Université du Mans (LIUM) – Laboratoire d’Acoustique de l’Université du Mans (LAUM). Location : Le Mans Université Beginning of internship: Between January and March 2025 Contact : Théo Mariotte, […]

LST-days

LST day   The LST team day is being held on 17 October. On this occasion, the young and more experienced researchers present their research themes. There will also be a presentation on European projects by Hélène Dereszowski from DRIS. This year, 3 workshops will focus on the following themes: 1. Spoof diarization audiovisuel 2. […]

KUTED

Corpus: Kurdish TED (KUTED)Licence: CreativeCommons Attribution NonCommercial-ShareAlike 4.0 International License.URL: https://huggingface.co/datasets/aranemini/kurdishtedAuthor(s): Mohammad MohammadaminiAntoine LaurentDescription Kurdish TED (KUTED) is the first Speech-to-Text-Translation (S2TT) dataset for the Central Kurdish language derived from TED Talks and TEDx. The corpus consists of 91,000 pairs, encompassing 170 hours of English audio, 1.65 million English tokens, and 1.40 million Central Kurdish […]