{"id":25592,"date":"2022-04-26T13:55:36","date_gmt":"2022-04-26T11:55:36","guid":{"rendered":"https:\/\/lium.univ-lemans.fr\/?p=25592"},"modified":"2022-04-26T13:54:54","modified_gmt":"2022-04-26T11:54:54","slug":"des-arbres-des-chevaliers-et-des-marionnettes%e2%80%af-apprentissages-par-transferts-pour-le-traitement-des-langues-historiques","status":"publish","type":"post","link":"https:\/\/lium.univ-lemans.fr\/en\/des-arbres-des-chevaliers-et-des-marionnettes%e2%80%af-apprentissages-par-transferts-pour-le-traitement-des-langues-historiques\/","title":{"rendered":"Des arbres, des chevaliers et des marionnettes\u202f: apprentissages par transferts pour le traitement des langues historiques"},"content":{"rendered":"<div class=\"panel-grid\" id=\"pg-25592-0\" ><div class=\"panel-grid-core\"><div class=\"panel-grid-cell\" id=\"pgc-25592-0-0\" ><div class=\"panel-widget-style\" ><h2 style=\"color: #e5442d;\">Seminar from Lo\u00efc Grobol, Assistant Professor at Universit\u00e9 Paris Nanterre<\/h2>\n<p>&nbsp;<\/p>\n<p><strong>Date:<\/strong> 29\/04\/2022<br \/>\n<strong>Time:<\/strong> 11h00<br \/>\n<strong>Localization:<\/strong> IC2 Boardroom, <a href=\"https:\/\/univ-lemans-fr.zoom.us\/j\/92143709904?pwd=Qy9QTXUydExNSlFnS0pWY0ZaNXpzUT09\">online<\/a><br \/>\n<strong>Speaker:<\/strong> Lo\u00efc Grobol<br \/>\n&nbsp;<\/p>\n<p align=\"center\"><strong>Trees, knights and puppets: transfer learning for processing of historical languages <\/strong><\/p>\n<p>&nbsp;<\/p>\n<p align=\"justify\">In recent years, automatic natural language processing (NLP) has evolved extremely rapidly, with NLP systems achieving record performance for many tasks and domains. These developments are largely due to the contributions of deep learning techniques, of which the most recent and impactful are based on the use of semi-supervised pre-training on large amounts of unannotated data complemented by targeted learning (<strong>fine-tuning<\/strong>) for target tasks (Peters et al., 2018 ; Howard et Ruder, 2018 ; Devlin et al., 2019). The main advantage of these techniques is that they allow the exploitation of massive data resulting from the omnipresent digitalization of language in all its forms. However, for many applications, the existence of such data is far from self-evident &#8211; whether in poorly endowed languages (Hedderic et al., 2021) or in poorly documented domains (Ramponi et Plank, 2021).<\/p>\n<p align=\"justify\">Historical languages, and in particular those representing ancient states of still existing and well-documented languages, are a particularly interesting case of this problem. Indeed, while the available data are often scarce, highly heterogeneous, and necessarily finite, their proximity to much better endowed languages makes it tempting to apply so-called <strong>transfer learning<\/strong> techniques to them: use resources (data and systems) developed for their well-endowed descendants, and use the data available for the former state to <strong>adapt<\/strong> those resources to it.<\/p>\n<p align=\"justify\">In this talk, I will present work done and still in progress in the framework of the PROFITEROLE project (PRocessing Old French Instrumented TExts for the Representation Of Language Evolution), which focuses on the use of heterogeneous resources for the syntactic analysis of medieval French. Our experiments show that it is possible to exploit resources for contemporary French (and in particular contextual representations of words) to significantly improve the processing of old French states using transfer learning techniques.<\/p>\n<p>&nbsp;<br \/>\n<strong>References<\/strong><\/p>\n<ul>\n<li>Devlin, Jacob, Ming-Wei Chang, Kenton Lee, et Kristina Toutanova. \u00ab BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding \u00bb. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 4171\u201186. Association for Computational Linguistics, 2019. https:\/\/doi.org\/10.18653\/v1\/N19-1423.<\/li>\n<li>Hedderich, Michael A., Lukas Lange, Heike Adel, Jannik Str\u00f6tgen, et Dietrich Klakow. \u00ab A Survey on Recent Approaches for Natural Language Processing in Low-Resource Scenarios \u00bb. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2545\u201168. Association for Computational Linguistics, 2021. https:\/\/doi.org\/10.18653\/v1\/2021.naacl-main.201.<\/li>\n<li>Howard, Jeremy, et Sebastian Ruder. \u00ab Universal Language Model Fine-tuning for Text Classification \u00bb. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 328\u201139. Association for Computational Linguistics, 2018. https:\/\/doi.org\/10.18653\/v1\/P18-1031.<\/li>\n<li>Peters, Matthew, Mark Neumann, Mohit Iyyer, Matt Gardner, Christopher Clark, Kenton Lee, et Luke Zettlemoyer. \u00ab Deep Contextualized Word Representations \u00bb. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 1:2227\u201137. Association for Computational Linguistics, 2018. https:\/\/doi.org\/10.18653\/v1\/N18-1202.<\/li>\n<li>Ramponi, Alan, et Barbara Plank. \u00ab Neural Unsupervised Domain Adaptation in NLP\u2014A Survey \u00bb. In Proceedings of the 28th International Conference on Computational Linguistics, 6838\u201155. International Committee on Computational Linguistics, 2020. https:\/\/doi.org\/10.18653\/v1\/2020.coling-main.603.<\/li>\n<\/ul><\/div><\/div><\/div><\/div><div class=\"panel-grid\" id=\"pg-25592-1\" ><div class=\"panel-grid-core\"><div class=\"panel-grid-cell\" id=\"pgc-25592-1-0\" >&nbsp;<\/div><div class=\"panel-grid-cell\" id=\"pgc-25592-1-1\" >&nbsp;<\/div><div class=\"panel-grid-cell\" id=\"pgc-25592-1-2\" >&nbsp;<\/div><\/div><\/div>","protected":false},"excerpt":{"rendered":"<p>Seminar from Lo\u00efc Grobol, Assistant Professor at Universit\u00e9 Paris Nanterre &nbsp; Date: 29\/04\/2022 Time: 11h00 Localization: IC2 Boardroom, online Speaker: Lo\u00efc Grobol &nbsp; Trees, knights and puppets: transfer learning for processing of historical languages &nbsp; In recent years, automatic natural language processing (NLP) has evolved extremely rapidly, with NLP systems achieving record performance for many [&hellip;]<\/p>\n<p class=\"more-link style2\"><a href=\"https:\/\/lium.univ-lemans.fr\/en\/des-arbres-des-chevaliers-et-des-marionnettes%e2%80%af-apprentissages-par-transferts-pour-le-traitement-des-langues-historiques\/\"  class=\"themebutton\"><span class=\"more-text\">READ MORE<\/span><span class=\"more-icon\"><i class=\"fa fa-angle-right fa-lg\"><\/i><\/span><\/a><\/p>\n","protected":false},"author":14,"featured_media":13238,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":[],"categories":[43],"tags":[49],"acf":[],"_links":{"self":[{"href":"https:\/\/lium.univ-lemans.fr\/en\/wp-json\/wp\/v2\/posts\/25592"}],"collection":[{"href":"https:\/\/lium.univ-lemans.fr\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/lium.univ-lemans.fr\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/lium.univ-lemans.fr\/en\/wp-json\/wp\/v2\/users\/14"}],"replies":[{"embeddable":true,"href":"https:\/\/lium.univ-lemans.fr\/en\/wp-json\/wp\/v2\/comments?post=25592"}],"version-history":[{"count":0,"href":"https:\/\/lium.univ-lemans.fr\/en\/wp-json\/wp\/v2\/posts\/25592\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/lium.univ-lemans.fr\/en\/wp-json\/wp\/v2\/media\/13238"}],"wp:attachment":[{"href":"https:\/\/lium.univ-lemans.fr\/en\/wp-json\/wp\/v2\/media?parent=25592"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/lium.univ-lemans.fr\/en\/wp-json\/wp\/v2\/categories?post=25592"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/lium.univ-lemans.fr\/en\/wp-json\/wp\/v2\/tags?post=25592"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}