FrNewsLink

Corpus : Segmentation Thématique (FrNewsLink)URL : https://hal.archives-ouvertes.fr/hal-01741177Le package FrNewsLink comprend un ensemble de ressources permettant d’évaluer des tâches de segmentation thématique et titrage sur un corpus varié de journaux télévisés français. En raison de droits de diffusions, ce package ne contient ni les vidéos, ni les audios des journaux télévisés. Le corpus propose à la […]

RAPACE

Réseaux de neurones profonds pour le traitement de la langue orale et écrite (RAPACE)Date : 10/2017 – 09/2020Financement : RFI AltanStic 2020Appel : Défis scientifiques 2016Partenaires : LINA (France)URL : https://lium.univ-lemans.fr/rapace/LIUM Participant(s) : Antoine CaubrièreAntoine LaurentYannick EstèveLe projet RAPACE se focalise sur la mise en place d’une architecture neuronale profonde, associant des couches cachées spécialisées, […]

Antract

Transdisciplinary Analysis of French Newsreels (1945-1969) (Antract)Date : 10/2017 – 03/2022Financement : ANRAppel : GenericPartenaires : INA (France), EURECOM (France), Voxolab (France), CHS (France)URL : https://lium.univ-lemans.fr/antract/LIUM Participant(s) : Simon PetitrenaudAntoine LaurentSylvain MeignierPierre-Alexandre BrouxThe general objective of the ANTRACT project is the analysis of the images and sounds produced weekly in the framework of an independent […]

TED-LIUM Release 3

Corpus : TED-LIUM Release 3Licences : Creative Commons BY-NC-ND 3.0 (attribution/non-commercial/no-derivatives)Auteur(s) : François FernandezVincent NguyenSahar GhannayNatalia TomashenkoYannick EstèveThis is the TED-LIUM corpus release 3, licensed under Creative Commons BY-NC-ND 3.0 (http://creativecommons.org/licenses/by-nc-nd/3.0/deed.en).   All talks and text are property of TED Conferences LLC.   This new TED-LIUM release was made through a collaboration between the Ubiqus […]

TED-LIUM Release 2

Corpus : TED-LIUM Release 2Licences : Creative Commons BY-NC-ND 3.0 (attribution/non-commercial/no-derivatives)Auteur(s) : Anthony RousseauPaul DelégliseYannick EstèveThis is the TED-LIUM corpus release 2, licensed under Creative Commons BY-NC-ND 3.0 (http://creativecommons.org/licenses/by-nc-nd/3.0/deed.en).   The TED-LIUM corpus was made from audio talks and their transcriptions available on the TED website. We have prepared and filtered these data in order […]

TED-LIUM Release 1

Corpus : TED-LIUM Release 1Licences : Creative Commons BY-NC-ND 3.0 (attribution/non-commercial/no-derivatives)Auteur(s) : Anthony RousseauPaul DelégliseYannick EstèveThis is the TED-LIUM corpus release 1, licensed under Creative Commons BY-NC-ND 3.0 (http://creativecommons.org/licenses/by-nc-nd/3.0/deed.en).   The TED-LIUM corpus is English-language TED talks, with transcriptions, sampled at 16kHz. It contains about 118 hours of speech.   More details are given in […]

NMTPY

Logiciel : NMTPYLicences : MIT LicenseGitHub : https://github.com/lium-lst/nmtpyURL : https://arxiv.org/abs/1706.00457Auteur(s) : Ozan CaglayanMercedes García MartínezAdrien BardetWalid AransaLoïc BarraultFethi Bougaresnmtpy is a suite of Python tools, primarily based on the starter code provided in dl4mt-tutorial for training neural machine translation networks using Theano. The basic motivation behind forking dl4mt-tutorial was to create a framework where it […]

NMTPYTORCH

Logiciel : NMTPYTORCHLicences : MIT LicenseGitHub : https://github.com/lium-lst/nmtpytorch/URL : https://arxiv.org/abs/1706.00457Auteur(s) : Ozan CaglayanMercedes García MartínezAdrien BardetWalid AransaFethi BougaresLoïc BarraultThis is the PyTorch fork of nmtpy, a sequence-to-sequence framework which was originally a fork of dl4mt-tutorial.

LIUM Speaker Diarization

Logiciel : LIUM Speaker DiarizationLicences : GPLURL : https://projets-lium.univ-lemans.fr/spkdiarization/Outil de segmentation et regroupement locuteur (Speaker diarization) en java.

SIDEKIT

Logiciel : SIDEKITLicences : LGPLGitHub : https://git-lium.univ-lemans.fr/Larcher/sidekitURL : https://projets-lium.univ-lemans.fr/sidekit/Auteur(s) : Anthony LarcherKong Aik LeeSylvain Meignier Welcome to SIDEKIT documentation! SIDEKIT is an open source package for Speaker and Language recognition. The aim of SIDEKIT is to provide an educational and efficient toolkit for speaker/language recognition including the whole chain of treatment that goes from the […]