News.bridge

News.bridgeDate: 01/2018 – 06/2019Funding: GoogleCall: Digital News Innovation FundingPartners: Deutsche Welle, Latvian News Agency, PriberamURL: https://lium.univ-lemans.fr/en/news-bridge/LIUM Participant(s): Sahar GhannayNatalia TomashenkoYannick EstèveThe NEWS-BRIDGE project will build a commercially exploitable integrated tool with a complete workflow using language technologies to facilitate and enhance multilingual news production. This tool provides for a full broadcast news translation system, rendering […]

ALLIES

Autonomous Lifelong Learning IntelligEnt Systems (ALLIES)Date: 12/2017 – 03/2022Funding: EU H2020Call: chist-eraPartners: IDIAP (Switzerland), UPC (Span), LNE (France)URL: https://projets-lium.univ-lemans.fr/alliesLIUM Participant(s): Anthony LarcherLoïc BarraultFethi BougaresSylvain MeignierYevhenii ProkopaloMeysam ShamsiThe goal of the ALLIES project is to encourage and demonstrate the development of autonomous systems, able to sustain performance across time according to a given learning scenario. A […]

FrNewsLink

Corpus: Topic Segmentation (FrNewsLink)URL: https://hal.archives-ouvertes.fr/hal-01741177FrNewsLink package allows to adress several applicative tasks in the domain of topic and titling segmentation. It is compososed of a set of resources from a varied corpus of French Broadcast News (BN) and press articles. Due to broadcasting rights, this package does not contain videos or audios files. The corpus […]

RAPACE

Deep neural networks for oral and written language processing (RAPACE)Date: 10/2017 – 09/2020Funding: RFI AltanStic 2020Call: Défis scientifiques 2016Partners: LINA (France)URL: https://lium.univ-lemans.fr/en/rapace/LIUM Participant(s): Antoine CaubrièreAntoine LaurentYannick EstèveThe RAPACE project focuses on the implementation of a deep neural architecture, associating specialized hidden layers, to develop a fully neural system of speech processing. The aim is to […]

Antract

Transdisciplinary Analysis of French Newsreels (1945-1969) (Antract)Date: 10/2017 – 03/2022Funding: ANRCall: GenericPartners: INA (France), EURECOM (France), Voxolab (France), CHS (France)URL: https://lium.univ-lemans.fr/en/antract/LIUM Participant(s): Simon PetitrenaudAntoine LaurentSylvain MeignierPierre-Alexandre BrouxThe general objective of the ANTRACT project is the analysis of the images and sounds produced weekly in the framework of an independent company created in 1945, les Actualités […]

TED-LIUM Release 3

Corpus: TED-LIUM Release 3Licence: Creative Commons BY-NC-ND 3.0 (attribution/non-commercial/no-derivatives)Author(s): François FernandezVincent NguyenSahar GhannayNatalia TomashenkoYannick EstèveThis is the TED-LIUM corpus release 3, licensed under Creative Commons BY-NC-ND 3.0 (http://creativecommons.org/licenses/by-nc-nd/3.0/deed.en).   All talks and text are property of TED Conferences LLC.   This new TED-LIUM release was made through a collaboration between the Ubiqus company and the […]

TED-LIUM Release 2

Corpus: TED-LIUM Release 2Licence: Creative Commons BY-NC-ND 3.0 (attribution/non-commercial/no-derivatives)Author(s): Anthony RousseauPaul DelégliseYannick EstèveThis is the TED-LIUM corpus release 2, licensed under Creative Commons BY-NC-ND 3.0 (http://creativecommons.org/licenses/by-nc-nd/3.0/deed.en).   The TED-LIUM corpus was made from audio talks and their transcriptions available on the TED website. We have prepared and filtered these data in order to train acoustic […]

TED-LIUM Release 1

Corpus: TED-LIUM Release 1Licence: Creative Commons BY-NC-ND 3.0 (attribution/non-commercial/no-derivatives)Author(s): Anthony RousseauPaul DelégliseYannick EstèveThis is the TED-LIUM corpus release 1, licensed under Creative Commons BY-NC-ND 3.0 (http://creativecommons.org/licenses/by-nc-nd/3.0/deed.en).   The TED-LIUM corpus is English-language TED talks, with transcriptions, sampled at 16kHz. It contains about 118 hours of speech.   More details are given in this paper: A. […]

NMTPY

Software: NMTPYLicence: MIT LicenseGitHub: https://github.com/lium-lst/nmtpyURL: https://arxiv.org/abs/1706.00457Author(s): Ozan CaglayanMercedes García MartínezAdrien BardetWalid AransaLoïc BarraultFethi Bougaresnmtpy is a suite of Python tools, primarily based on the starter code provided in dl4mt-tutorial for training neural machine translation networks using Theano. The basic motivation behind forking dl4mt-tutorial was to create a framework where it would be easy to implement […]

NMTPYTORCH

Software: NMTPYTORCHLicence: MIT LicenseGitHub: https://github.com/lium-lst/nmtpytorch/URL: https://arxiv.org/abs/1706.00457Author(s): Ozan CaglayanMercedes García MartínezAdrien BardetWalid AransaFethi BougaresLoïc BarraultThis is the PyTorch fork of nmtpy, a sequence-to-sequence framework which was originally a fork of dl4mt-tutorial.