FrNewsLink

Corpus: Topic Segmentation (FrNewsLink)URL: https://hal.archives-ouvertes.fr/hal-01741177FrNewsLink package allows to adress several applicative tasks in the domain of topic and titling segmentation. It is compososed of a set of resources from a varied corpus of French Broadcast News (BN) and press articles. Due to broadcasting rights, this package does not contain videos or audios files. The corpus […]

RAPACE

Deep neural networks for oral and written language processing (RAPACE)Date: 10/2017 – 09/2020Funding: RFI AltanStic 2020Call: Défis scientifiques 2016Partners: LINA (France)URL: https://lium.univ-lemans.fr/en/rapace/LIUM Participant(s): Antoine CaubrièreAntoine LaurentYannick EstèveThe RAPACE project focuses on the implementation of a deep neural architecture, associating specialized hidden layers, to develop a fully neural system of speech processing. The aim is to […]

Antract

Transdisciplinary Analysis of French Newsreels (1945-1969) (Antract)Date: 10/2017 – 03/2022Funding: ANRCall: GenericPartners: INA (France), EURECOM (France), Voxolab (France), CHS (France)URL: https://lium.univ-lemans.fr/en/antract/LIUM Participant(s): Simon PetitrenaudAntoine LaurentSylvain MeignierPierre-Alexandre BrouxThe general objective of the ANTRACT project is the analysis of the images and sounds produced weekly in the framework of an independent company created in 1945, les Actualités […]

TED-LIUM Release 3

Corpus: TED-LIUM Release 3Licence: Creative Commons BY-NC-ND 3.0 (attribution/non-commercial/no-derivatives)Author(s): François FernandezVincent NguyenSahar GhannayNatalia TomashenkoYannick EstèveThis is the TED-LIUM corpus release 3, licensed under Creative Commons BY-NC-ND 3.0 (http://creativecommons.org/licenses/by-nc-nd/3.0/deed.en).   All talks and text are property of TED Conferences LLC.   This new TED-LIUM release was made through a collaboration between the Ubiqus company and the […]

TED-LIUM Release 2

Corpus: TED-LIUM Release 2Licence: Creative Commons BY-NC-ND 3.0 (attribution/non-commercial/no-derivatives)Author(s): Anthony RousseauPaul DelégliseYannick EstèveThis is the TED-LIUM corpus release 2, licensed under Creative Commons BY-NC-ND 3.0 (http://creativecommons.org/licenses/by-nc-nd/3.0/deed.en).   The TED-LIUM corpus was made from audio talks and their transcriptions available on the TED website. We have prepared and filtered these data in order to train acoustic […]

TED-LIUM Release 1

Corpus: TED-LIUM Release 1Licence: Creative Commons BY-NC-ND 3.0 (attribution/non-commercial/no-derivatives)Author(s): Anthony RousseauPaul DelégliseYannick EstèveThis is the TED-LIUM corpus release 1, licensed under Creative Commons BY-NC-ND 3.0 (http://creativecommons.org/licenses/by-nc-nd/3.0/deed.en).   The TED-LIUM corpus is English-language TED talks, with transcriptions, sampled at 16kHz. It contains about 118 hours of speech.   More details are given in this paper: A. […]

NMTPY

Software: NMTPYLicence: MIT LicenseGitHub: https://github.com/lium-lst/nmtpyURL: https://arxiv.org/abs/1706.00457Author(s): Ozan CaglayanMercedes García MartínezAdrien BardetWalid AransaLoïc BarraultFethi Bougaresnmtpy is a suite of Python tools, primarily based on the starter code provided in dl4mt-tutorial for training neural machine translation networks using Theano. The basic motivation behind forking dl4mt-tutorial was to create a framework where it would be easy to implement […]

NMTPYTORCH

Software: NMTPYTORCHLicence: MIT LicenseGitHub: https://github.com/lium-lst/nmtpytorch/URL: https://arxiv.org/abs/1706.00457Author(s): Ozan CaglayanMercedes García MartínezAdrien BardetWalid AransaFethi BougaresLoïc BarraultThis is the PyTorch fork of nmtpy, a sequence-to-sequence framework which was originally a fork of dl4mt-tutorial.

LIUM Speaker Diarization

Software: LIUM Speaker DiarizationLicence: GPLURL: https://projets-lium.univ-lemans.fr/spkdiarization/Outil de segmentation et regroupement locuteur (Speaker diarization) en java.

SIDEKIT

Software: SIDEKITLicence: LGPLGitHub: https://git-lium.univ-lemans.fr/Larcher/sidekitURL: https://projets-lium.univ-lemans.fr/sidekit/Author(s): Anthony LarcherKong Aik LeeSylvain Meignier Welcome to SIDEKIT documentation! SIDEKIT is an open source package for Speaker and Language recognition. The aim of SIDEKIT is to provide an educational and efficient toolkit for speaker/language recognition including the whole chain of treatment that goes from the audio data to the analysis […]