{"id":25756,"date":"2022-10-12T14:00:42","date_gmt":"2022-10-12T12:00:42","guid":{"rendered":"https:\/\/lium.univ-lemans.fr\/?p=25756"},"modified":"2022-10-12T14:00:42","modified_gmt":"2022-10-12T12:00:42","slug":"seminaires-valentin-pelloin-et-martin-lebourdais","status":"publish","type":"post","link":"https:\/\/lium.univ-lemans.fr\/en\/seminaires-valentin-pelloin-et-martin-lebourdais\/","title":{"rendered":"S\u00e9minaires Valentin Pelloin et Martin Lebourdais"},"content":{"rendered":"<div class=\"panel-grid\" id=\"pg-25756-0\" ><div class=\"panel-grid-core\"><div class=\"panel-grid-cell\" id=\"pgc-25756-0-0\" ><div class=\"panel-widget-style\" ><h2 style=\"color: #e5442d;\">Seminar from Valentin Pelloin and Martin Lebourdais, PhD students at LIUM <\/h2>\n<p>&nbsp;<\/p>\n<p><strong>Date:<\/strong> 14\/10\/2022<br \/>\n<strong>Time:<\/strong> 11h00<br \/>\n<strong>Localization:<\/strong> IC2 Boardroom,<br \/>\n<strong>Speakers:<\/strong> <a href=\"http:\/\/lium.univ-lemans.fr\/team\/valentin-pelloin\/\">Valentin Pelloin<\/a> et <a href=\"http:\/\/lium.univ-lemans.fr\/team\/martin-lebourdais\/\">Martin Lebourdais<\/a><br \/>\n&nbsp;<br \/>\n&nbsp;<br \/>\n&nbsp;<\/p>\n<p align=\"center\"><strong>ASR-Generated Text for Language Model Pre-training Applied to Speech Tasks<\/strong><\/p>\n<p><strong>Valentin Pelloin<\/strong><\/p>\n<p align=\"justify\">We aim at improving spoken language modeling (LM) using very large amount of automatically transcribed speech. We leverage the INA (French National Audiovisual Institute) collection and obtain 19GB of text after applying ASR on 350,000 hours of diverse TV shows. From this, spoken language models are trained either by fine-tuning an existing LM (FlauBERT) or through training a LM from scratch. New models (FlauBERT-Oral) are shared with the community and evaluated for 3 downstream tasks: spoken language understanding, classification of TV shows and speech syntactic parsing. Results show that FlauBERT-Oral can be beneficial compared to its initial FlauBERT version demonstrating that, despite its inherent noisy nature, ASR-generated text can be used to build spoken language models.<\/p>\n<p>&nbsp;<br \/>\n&nbsp;<\/p>\n<p align=\"center\"><strong>Overlapped speech and gender detection with WavLM pre-trained features<\/strong><\/p>\n<p><strong>Martin Lebourdais<\/strong><\/p>\n<p align=\"justify\">This presentation focuses on overlapped speech and gender detection in order to study interactions between women and men in French audiovisual media (<a href=\"http:\/\/lium.univ-lemans.fr\/gem\/\">Gender Equality Monitoring project<\/a>).In this application context, we need to automatically segment the speech signal according to speakers gender, and to identify when at least two speakers speak at the same time. We propose to use WavLM model which has the advantage of being pre-trained on a huge amount of speech data, to build an overlapped speech detection (OSD) and a gender detection (GD) systems.<\/p>\n<p align=\"justify\">In this study, we use two different corpora. The DIHARD III corpus which is well adapted for the OSD task but lack gender information. The ALLIES corpus fits with the project application context. Our best OSD system is a Temporal Convolutional Network (TCN) with WavLM pre-trained features as input, which reaches a new state-of-the-art F1-score performance on DIHARD. A neural GD is trained with WavLM inputs on a gender balanced subset of the French broadcast news ALLIES data, and obtains an accuracy of 94.9%. This work opens new perspectives for human science researchers regarding the differences of representation between women and men in French media.<\/p><\/div><\/div><\/div><\/div><div class=\"panel-grid\" id=\"pg-25756-1\" ><div class=\"panel-grid-core\"><div class=\"panel-grid-cell\" id=\"pgc-25756-1-0\" ><div class=\"panel-widget-style\" ><p><img src=\"https:\/\/lium.univ-lemans.fr\/wp-content\/uploads\/2022\/10\/IMG_20221006_155924-scaled.jpg\" alt=\"\" \/ ><\/p><\/div><\/div><div class=\"panel-grid-cell\" id=\"pgc-25756-1-1\" ><div class=\"panel-widget-style\" ><p><img src=\"https:\/\/lium.univ-lemans.fr\/wp-content\/uploads\/2022\/10\/IMG_20221006_140342-scaled.jpg\" alt=\"\" \/ ><\/p><\/div><\/div><\/div><\/div>","protected":false},"excerpt":{"rendered":"<p>Seminar from Valentin Pelloin and Martin Lebourdais, PhD students at LIUM &nbsp; Date: 14\/10\/2022 Time: 11h00 Localization: IC2 Boardroom, Speakers: Valentin Pelloin et Martin Lebourdais &nbsp; &nbsp; &nbsp; ASR-Generated Text for Language Model Pre-training Applied to Speech Tasks Valentin Pelloin We aim at improving spoken language modeling (LM) using very large amount of automatically transcribed [&hellip;]<\/p>\n<p class=\"more-link style2\"><a href=\"https:\/\/lium.univ-lemans.fr\/en\/seminaires-valentin-pelloin-et-martin-lebourdais\/\"  class=\"themebutton\"><span class=\"more-text\">READ MORE<\/span><span class=\"more-icon\"><i class=\"fa fa-angle-right fa-lg\"><\/i><\/span><\/a><\/p>\n","protected":false},"author":14,"featured_media":13238,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":[],"categories":[43],"tags":[49],"acf":[],"_links":{"self":[{"href":"https:\/\/lium.univ-lemans.fr\/en\/wp-json\/wp\/v2\/posts\/25756"}],"collection":[{"href":"https:\/\/lium.univ-lemans.fr\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/lium.univ-lemans.fr\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/lium.univ-lemans.fr\/en\/wp-json\/wp\/v2\/users\/14"}],"replies":[{"embeddable":true,"href":"https:\/\/lium.univ-lemans.fr\/en\/wp-json\/wp\/v2\/comments?post=25756"}],"version-history":[{"count":0,"href":"https:\/\/lium.univ-lemans.fr\/en\/wp-json\/wp\/v2\/posts\/25756\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/lium.univ-lemans.fr\/en\/wp-json\/wp\/v2\/media\/13238"}],"wp:attachment":[{"href":"https:\/\/lium.univ-lemans.fr\/en\/wp-json\/wp\/v2\/media?parent=25756"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/lium.univ-lemans.fr\/en\/wp-json\/wp\/v2\/categories?post=25756"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/lium.univ-lemans.fr\/en\/wp-json\/wp\/v2\/tags?post=25756"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}