TED-LIUM Release 1

Corpus: TED-LIUM Release 1
Licence: Creative Commons BY-NC-ND 3.0 (attribution/non-commercial/no-derivatives)
Author(s): Anthony Rousseau, Paul Deléglise, Yannick Estève

This is the TED-LIUM corpus release 1, licensed under Creative Commons BY-NC-ND 3.0 (http://creativecommons.org/licenses/by-nc-nd/3.0/deed.en). The TED-LIUM corpus consists of English-language TED talks with transcriptions, sampled at 16 kHz. It contains about 118 hours of speech. More details are given in this paper: A. […]

NMTPY

Software: NMTPY
Licence: MIT License
GitHub: https://github.com/lium-lst/nmtpy
URL: https://arxiv.org/abs/1706.00457
Author(s): Ozan Caglayan, Mercedes García Martínez, Adrien Bardet, Walid Aransa, Loïc Barrault, Fethi Bougares

nmtpy is a suite of Python tools, primarily based on the starter code provided in dl4mt-tutorial, for training neural machine translation networks using Theano. The basic motivation behind forking dl4mt-tutorial was to create a framework where it would be easy to implement […]

NMTPYTORCH

Software: NMTPYTORCH
Licence: MIT License
GitHub: https://github.com/lium-lst/nmtpytorch/
URL: https://arxiv.org/abs/1706.00457
Author(s): Ozan Caglayan, Mercedes García Martínez, Adrien Bardet, Walid Aransa, Fethi Bougares, Loïc Barrault

This is the PyTorch fork of nmtpy, a sequence-to-sequence framework which was originally a fork of dl4mt-tutorial.

LIUM Speaker Diarization

Software: LIUM Speaker Diarization
Licence: GPL
URL: https://projets-lium.univ-lemans.fr/spkdiarization/

A speaker segmentation and clustering (speaker diarization) tool written in Java.

SIDEKIT

Software: SIDEKIT
Licence: LGPL
GitHub: https://git-lium.univ-lemans.fr/Larcher/sidekit
URL: https://projets-lium.univ-lemans.fr/sidekit/
Author(s): Anthony Larcher, Kong Aik Lee, Sylvain Meignier

Welcome to SIDEKIT documentation! SIDEKIT is an open source package for Speaker and Language recognition. The aim of SIDEKIT is to provide an educational and efficient toolkit for speaker/language recognition including the whole chain of treatment that goes from the audio data to the analysis […]

s4d

Software: SIDEKIT for diarization (s4d)
Licence: LGPL
GitHub: https://git-lium.univ-lemans.fr/Meignier/s4d
URL: https://projets-lium.univ-lemans.fr/s4d/
Author(s): Pierre-Alexandre Broux, Florent Desnous, Anthony Larcher, Sylvain Meignier

Welcome to SIDEKIT for diarization documentation! SIDEKIT for diarization (s4d for short) is an open source package extension of SIDEKIT for speaker diarization. The aim of S4D is to provide an educational and efficient toolkit for speaker diarization including the […]

Hop3x

Software: Hop3x
URL: http://hop3x.univ-lemans.fr

Download the eXist database and install it by following the instructions at http://exist.sourceforge.net/download.html. Download the Hop3x.zip file and unzip the Hop3x folder. The instructions for using Hop3x are available in French: Procedure_de_demarrage_d_Hop3x, Procedure_d_installation_d_Hop3x.

CSLM

Software: Continuous Space Language Model toolkit (CSLM)
GitHub: https://git-lium.univ-lemans.fr/barrault/cslm
URL: https://git-lium.univ-lemans.fr/barrault/cslm/-/archive/master/cslm-master.tar.gz
Author(s): Holger Schwenk

CSLM toolkit is open-source software which implements the so-called continuous space language model. The basic idea of this approach is to project the word indices onto a continuous space and to use a probability estimator operating on this space. Since the resulting probability functions are […]
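
The basic idea described above can be illustrated with a minimal Python sketch. It is not taken from the CSLM toolkit: the function names (`make_model`, `next_word_probs`), the toy dimensions, and the use of a single linear layer followed by a softmax as the probability estimator are all assumptions made for illustration.

```python
import math
import random


def make_model(vocab_size, embed_dim, context_size, seed=0):
    """Build randomly initialised toy parameters (hypothetical helper)."""
    rng = random.Random(seed)
    # One continuous vector per word index: the projection table.
    embeddings = [[rng.uniform(-0.1, 0.1) for _ in range(embed_dim)]
                  for _ in range(vocab_size)]
    # One weight row per candidate next word: the probability estimator.
    output = [[rng.uniform(-0.1, 0.1) for _ in range(context_size * embed_dim)]
              for _ in range(vocab_size)]
    return embeddings, output


def next_word_probs(context, embeddings, output):
    """Return a probability distribution over the next word."""
    # Projection: map each context word index to its vector and concatenate.
    x = [value for index in context for value in embeddings[index]]
    # Estimator: linear scores over the vocabulary, normalised by a softmax.
    scores = [sum(w * xi for w, xi in zip(row, x)) for row in output]
    highest = max(scores)  # subtracted for numerical stability
    exps = [math.exp(s - highest) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


embeddings, output = make_model(vocab_size=5, embed_dim=3, context_size=2)
probs = next_word_probs([1, 4], embeddings, output)
```

Here `probs` holds one probability per vocabulary entry for the word following the context `[1, 4]`. The parameters are random and untrained, so the values only show the data flow from word indices to a continuous representation to a normalised distribution.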

MANY

Corpus: MANY
Licence: GNU GPL v3
URL: https://code.google.com/archive/p/many/

MANY is an MT system combination software whose architecture is described in the following picture:

The combination can be decomposed into three steps: 1-Best hypotheses from all M systems are aligned in order to build M confusion networks (one for each system considered as backbone). All cn […]