Corpus: ArSentimentAnalysis (ArSentimentAnalysis). GitHub: Author(s): Amira Barhoumi, Nathalie Camelin, Yannick Estève. The ArSentimentAnalysis package comprises a set of resources for designing and evaluating an opinion analysis system for Arabic. The package contains: pre-trained Arabic-specific embedding sets; the ArSentLex polarized lexicon. 1/ Arabic-specific embedding sets: Existing pre-trained embeddings represent a word […]

Analysis and countermeasures for identity spoofing fraud in behavioral biometric systems

Seminar by Thomas Thebaud, PhD student at Orange and LIUM. Date: 06/01/2020. Time: 11h00. Location: IC2, boardroom. Speaker: Thomas Thebaud. With the increasing use of biometric systems for authentication, the question of the security of these systems arises. The objective of this thesis is to study the possible frauds (spoofing) on several behavioral biometrics […]

ALLIES Evaluation

ALLIES Evaluation for Autonomous Speaker Diarization Systems. The ALLIES project aims to catalyse the development of autonomous lifelong intelligent systems by providing the community with scenarios, evaluation plans and metrics to evaluate those systems. ALLIES focuses on two tasks: speaker segmentation and machine translation. The speaker segmentation evaluation relies on a new corpus of […]


Corpus: AlloSat (AlloSat). Licence: Creative Commons. Author(s): Manon Macary, Marie Tahon, Anthony Rousseau, Yannick Estève. The corpus, named AlloSat, is composed of real-life call center conversations in French and is continuously annotated for frustration and satisfaction. This corpus has been set up to develop new systems able to model the continuous aspect of semantic and paralinguistic information at the conversation level. […]

ExTENSoR Kick-off meeting

The kick-off meeting of the ANR project ExTENSoR will be held on February 12th and 13th at Le Mans University.


Corpus: Multi30k Dataset (Multi30k). Licence: Attribution-NonCommercial-ShareAlike 4.0 International. GitHub: Author(s): Loïc Barrault, Ozan Caglayan, Fethi Bougares. The Flickr30K Dataset contains 31,014 images sourced from online photo-sharing websites (Young et al., 2014). Each image is paired with five English descriptions, which were collected from Amazon Mechanical Turk. The dataset contains 145,000 training, 5,070 development, and 5,000 test descriptions. The Multi30K dataset […]

Privacy in speech processing

Seminar by Brij Srivastava, PhD student at Inria Lille Nord Europe / LIUM. Date: 24/01/2019. Time: 11h30. Location: IC2, room 210. Speaker: Brij Srivastava. Speech signals are a rich source of speaker-related information including sensitive attributes like gender, identity, accent, pathological conditions, etc. With a small amount of found speech data, such […]


Corpus: Tunisian Sentiment Analysis Corpus (TSAC). Licence: GNU Lesser General Public License v3.0. GitHub: Author(s): Fethi Bougares, Salima Mdhaffar, Yannick Estève. About 17k user comments manually annotated with positive and negative polarities. This corpus was collected from Facebook users' comments written on the official pages of Tunisian radio and TV channels, namely Mosaique FM, JawhraFM, Shemes FM, HiwarElttounsi TV and Nessma […]

Learning pretopological spaces for semantic relation extraction

Seminar by Gaëtan Caillaut, junior lecturer at Université d’Orléans. Date: 17/01/2020. Time: 11h00. Location: IC2, Room 210. Speaker: Gaëtan Caillaut. Learning pretopological spaces for semantic relation extraction. Pretopology is a mathematical theory founded in the 1970s with the aim of relaxing certain constraints inherent to the theory of […]

Pierre-Alexandre Broux

PhD defence, Pierre-Alexandre Broux. Date: 10/01/2020. Time: 14h00. Location: Room 210, IC2 building, LIUM, Le Mans Université. Title: Speaker diarization in audiovisual files in interaction with human annotators. Jury members: Reviewers: Jean-François BONASTRE (LIA, Université d’Avignon); Nicholas EVANS (EURECOM). Examiners: Régine ANDRE-OBRECHT (Université Toulouse 3). Supervisor: […]