{"id":26274,"date":"2024-06-24T14:06:44","date_gmt":"2024-06-24T12:06:44","guid":{"rendered":"https:\/\/lium.univ-lemans.fr\/?p=26274"},"modified":"2024-10-24T14:18:42","modified_gmt":"2024-10-24T12:18:42","slug":"soutenance-de-these-thibault-prouteau","status":"publish","type":"post","link":"https:\/\/lium.univ-lemans.fr\/en\/soutenance-de-these-thibault-prouteau\/","title":{"rendered":"Soutenance de th\u00e8se : Thibault Prouteau"},"content":{"rendered":"<div class=\"panel-grid\" id=\"pg-26274-0\" ><div class=\"panel-grid-core\"><div class=\"panel-grid-cell\" id=\"pgc-26274-0-0\" ><div class=\"panel-widget-style\" ><h2 style=\"text-align: center;\"><span style=\"color: #e5442d;\">PhD defence,  Thibault Prouteau <\/span><\/h2>\n<p><b>Date : <\/b>03\/07\/2024<br \/>\n<b>Time : <\/b> 14h00<br \/>\n<b>Location : <\/b> Le Mans Universit\u00e9; IC2 buiding Auditorium<br \/>\n&nbsp;<\/p>\n<p><span style=\"font-size: 18pt;\"><strong style=\"color: #e5442d;\">Title: Graphs, Words, and Communities: Converging Paths to Interpretability with a Frugal Embedding Framework<\/strong><\/span><br \/>\n&nbsp;<\/p>\n<p><span style=\"color: #e5442d;\"><strong><span style=\"font-size: 14pt;\">Jury members :<\/span><br \/>\n<\/strong><\/span><span style=\"font-size: 12pt;\"><\/p>\n<ul>\n<li><strong>Vincent LABATUT<\/strong>, Assistant Professor, Universit\u00e9 d\u2019Avignon, <strong>Reviewer <\/strong><\/li>\n<li><strong>Christine LARGERON<\/strong>, Professor, Universit\u00e9 Jean Monnet, Saint-\u00c9tienne, <strong>Reviewer <\/strong><\/li>\n<li><strong>C\u00e9cile BOTHOREL<\/strong>, Assistant Professor, IMT Atlantique, Brest, <strong>Examiner <\/strong><\/li>\n<li><strong>Jean-Loup GUILLAUME<\/strong>, Professor, Universit\u00e9 de la Rochelle,  <strong>Examiner <\/strong><\/li>\n<li><strong>Ana\u00efs LEFEUVRE-HALFTERMEYER<\/strong>, Assistant Professor, Universit\u00e9 d\u2019Orl\u00e9ans, <strong>Examiner <\/strong><\/li>\n<li><strong>Marie TAHON<\/strong>, Professor, Le Mans Universit\u00e9 LIUM, <strong>Examiner <\/strong><\/li>\n<li><strong>Sylvain MEIGNIER<\/strong>, Professor, Le Mans Universit\u00e9 LIUM, <strong>Director of thesis <\/strong><\/li>\n<li><strong>Nicolas DUGU\u00c9<\/strong>, Assistant Professor, Le Mans Universit\u00e9 LIUM, <strong>Supervisor <\/strong><\/li>\n<li><strong>Nathalie CAMELIN<\/strong>, Assistant Professor, Le Mans Universit\u00e9 LIUM, <strong>Invited jury member <\/strong><\/li>\n<\/ul>\n<p><\/span><\/p>\n<p>&nbsp;<\/p>\n<p><span style=\"color: #e5442d;\"><strong><span style=\"font-size: 14pt;\">Abstract:<br \/>\n<\/span> <\/strong><\/span><\/p>\n<p style=\"text-align: justify;\" align=\"justify\"> Representation learning with word and graph embedding models allows distributed representations of information that can in turn be used in input of machine learning algorithms.<\/p>\n<p style=\"text-align: justify;\" align=\"justify\">Through the last two decades, the tasks of embedding graphs nodes and words have shifted from matrix factorization approaches that could be trained in a matter of minutes to large models requiring ever larger quantities of training data and sometimes weeks on large hardware architectures. However, in a context of global warming where sustainability is a critical concern, we ought to look back to previous approaches and consider their performances with regard to resources consumption. Furthermore, with the growing involvement of embeddings in sensitive machine learning applications (judiciary system, health), the need for more interpretable and explainable representations has manifested. To foster efficient representation learning and interpretability, this thesis introduces Lower Dimension Bipartite Graph Framework (LDBGF), a node embedding framework able to embed with the same pipeline graph data and text from large corpora represented as co-occurrence networks. <\/p>\n<p style=\"text-align: justify;\" align=\"justify\">Within this framework, we introduce two implementations (SINr-NR, SINr-MF) that leverage com- munity detection in networks to uncover a latent embedding space where items (nodes\/- words) are represented according to their links to communities.<\/p>\n<p style=\"text-align: justify;\" align=\"justify\">We show that SINr-NR and SINr-MF can compete with similar embedding approaches on tasks such as predicting missing links in networks (link prediction) or node features (degree centrality, PageRank score). Regarding word embeddings, we show that SINr-NR is a good contender to represent words via word co-occurrence networks. Finally, we demonstrate the interpretability of SINr-NR on multiple aspects. First with a human evaluation that shows that SINr-NR s dimensions are to some extent interpretable. Secondly, by investigating sparsity of vectors, and how having fewer dimensions may allow interpreting how the dimensions combine and allow sense to emerge.<\/p>\n<p>&nbsp;<\/p>\n<p><span style=\"color: #e5442d;\"><strong><span style=\"font-size: 14pt;\">Keywords:<br \/>\n<\/span> <\/strong><\/span><\/p>\n<p style=\"text-align: justify;\" align=\"justify\">  spoken language understanding, automatic speech recognition, neural networks, pre-trained models, self-supervised models, semantic concepts extraction<\/p><\/div><\/div><\/div><\/div><div class=\"panel-grid\" id=\"pg-26274-1\" ><div class=\"panel-grid-core\"><div class=\"panel-grid-cell\" id=\"pgc-26274-1-0\" ><div class=\"panel-widget-style\" ><div class=\"margin10\"><\/div><\/div><\/div><\/div><\/div><div class=\"panel-grid\" id=\"pg-26274-2\" ><div class=\"panel-grid-core\"><div class=\"panel-grid-cell\" id=\"pgc-26274-2-0\" >&nbsp;<\/div><div class=\"panel-grid-cell\" id=\"pgc-26274-2-1\" >&nbsp;<\/div><div class=\"panel-grid-cell\" id=\"pgc-26274-2-2\" >&nbsp;<\/div><\/div><\/div><div class=\"panel-grid\" id=\"pg-26274-3\" ><div class=\"panel-grid-core\"><div class=\"panel-grid-cell\" id=\"pgc-26274-3-0\" ><div class=\"panel-widget-style\" ><div class=\"margin10\"><\/div><\/div><\/div><\/div><\/div><div class=\"panel-grid\" id=\"pg-26274-4\" ><div class=\"panel-grid-core\"><div class=\"panel-grid-cell\" id=\"pgc-26274-4-0\" >&nbsp;<\/div><div class=\"panel-grid-cell\" id=\"pgc-26274-4-1\" ><div class=\"panel-widget-style\" ><p><img src=\"https:\/\/lium.univ-lemans.fr\/wp-content\/uploads\/2020\/12\/logo_LEMANS_UNIVERSITE-01ptFormat.jpg\" alt=\"\" \/ ><\/p><\/div><\/div><div class=\"panel-grid-cell\" id=\"pgc-26274-4-2\" >&nbsp;<\/div><div class=\"panel-grid-cell\" id=\"pgc-26274-4-3\" ><div class=\"panel-widget-style\" ><p><img src=\"https:\/\/lium.univ-lemans.fr\/wp-content\/uploads\/2023\/09\/Mathematiques-STIC-ED.png\" alt=\"\" \/ ><\/p><\/div><\/div><div class=\"panel-grid-cell\" id=\"pgc-26274-4-4\" >&nbsp;<\/div><\/div><\/div>","protected":false},"excerpt":{"rendered":"<p>PhD defence, Thibault Prouteau Date : 03\/07\/2024 Time : 14h00 Location : Le Mans Universit\u00e9; IC2 buiding Auditorium &nbsp; Title: Graphs, Words, and Communities: Converging Paths to Interpretability with a Frugal Embedding Framework &nbsp; Jury members : Vincent LABATUT, Assistant Professor, Universit\u00e9 d\u2019Avignon, Reviewer Christine LARGERON, Professor, Universit\u00e9 Jean Monnet, Saint-\u00c9tienne, Reviewer C\u00e9cile BOTHOREL, Assistant [&hellip;]<\/p>\n<p class=\"more-link style2\"><a href=\"https:\/\/lium.univ-lemans.fr\/en\/soutenance-de-these-thibault-prouteau\/\"  class=\"themebutton\"><span class=\"more-text\">READ MORE<\/span><span class=\"more-icon\"><i class=\"fa fa-angle-right fa-lg\"><\/i><\/span><\/a><\/p>\n","protected":false},"author":14,"featured_media":24068,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":[],"categories":[46,6],"tags":[49],"acf":[],"_links":{"self":[{"href":"https:\/\/lium.univ-lemans.fr\/en\/wp-json\/wp\/v2\/posts\/26274"}],"collection":[{"href":"https:\/\/lium.univ-lemans.fr\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/lium.univ-lemans.fr\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/lium.univ-lemans.fr\/en\/wp-json\/wp\/v2\/users\/14"}],"replies":[{"embeddable":true,"href":"https:\/\/lium.univ-lemans.fr\/en\/wp-json\/wp\/v2\/comments?post=26274"}],"version-history":[{"count":2,"href":"https:\/\/lium.univ-lemans.fr\/en\/wp-json\/wp\/v2\/posts\/26274\/revisions"}],"predecessor-version":[{"id":26519,"href":"https:\/\/lium.univ-lemans.fr\/en\/wp-json\/wp\/v2\/posts\/26274\/revisions\/26519"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/lium.univ-lemans.fr\/en\/wp-json\/wp\/v2\/media\/24068"}],"wp:attachment":[{"href":"https:\/\/lium.univ-lemans.fr\/en\/wp-json\/wp\/v2\/media?parent=26274"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/lium.univ-lemans.fr\/en\/wp-json\/wp\/v2\/categories?post=26274"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/lium.univ-lemans.fr\/en\/wp-json\/wp\/v2\/tags?post=26274"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}