{"id":25327,"date":"2021-04-07T17:31:58","date_gmt":"2021-04-07T15:31:58","guid":{"rendered":"https:\/\/lium.univ-lemans.fr\/?p=25327"},"modified":"2021-04-08T15:31:21","modified_gmt":"2021-04-08T13:31:21","slug":"sinr-fast-computing-of-sparse-interpretable-node-representations-is-not-a-sin","status":"publish","type":"post","link":"https:\/\/lium.univ-lemans.fr\/en\/sinr-fast-computing-of-sparse-interpretable-node-representations-is-not-a-sin\/","title":{"rendered":"SINr: fast computing of Sparse Interpretable Node Representations is not a sin!"},"content":{"rendered":"<div class=\"panel-grid\" id=\"pg-25327-0\" ><div class=\"panel-grid-core\"><div class=\"panel-grid-cell\" id=\"pgc-25327-0-0\" ><div class=\"panel-widget-style\" ><h2 style=\"color: #e5442d;\">Seminar from Thibault Prouteau, PhD student at LIUM <\/h2>\n<p>&nbsp;<\/p>\n<p><strong>Date:<\/strong> 19\/04\/2021<br \/>\n<strong>Time:<\/strong> 11h00<br \/>\n<strong>Localization:<\/strong> <a href=\"https:\/\/univ-lemans-fr.zoom.us\/j\/92384843807?pwd=Rko2SVdoMzVqcTlzRG85Qk1GZ01PUT09\">online<\/a><\/a><br \/>\n<strong>Speaker: <\/strong><a href=\"http:\/\/lium.univ-lemans.fr\/en\/team\/thibault-prouteau-2\/\">Thibault Prouteau<\/a><\/p>\n<p>&nbsp;<\/p>\n<p align=\"center\"><strong>SINr: fast computing of Sparse Interpretable Node Representations is not a sin!<\/strong><\/p>\n<p>&nbsp;<\/p>\n<p align=\"justify\">While  graph  embedding  aims  at  learning  low-dimensional representations of nodes encompassing the graph topology, word embedding focus on learning word vectors that encode semantic properties of the vocabulary. The first finds applications on tasks such as link prediction and node classification while the latter is systematically considered in natural language processing. Most of the time, graph and word embeddings are considered on their own as distinct tasks. However, word co-occurrence matrices, widely used to extract word embeddings, can be seen as graphs. Furthermore, most network embedding techniques rely either on a word embedding methodology (Word2vec) or on matrix factorization, also widely used for word embedding. These methods are usually computationally expensive, parameter dependent and the dimensions of the embedding space are not interpretable.<\/p>\n<p align=\"justify\">To circumvent these issues, we introduce the Lower Dimension Bipartite Graphs Framework (LDBGF) which  takes  advantage  of  the  fact  that  all  graphs  can  be  described  as bipartite  graphs,  even  in  the  case  of  textual  data.  This  underlying  bipartite  structure  may  be  explicit,  like  in  coauthor  networks.  However, with LDBGF, we focus on uncovering latent bipartite structures, lying for instance in social or  word co-occurrence networks, and especially such structures  providing  conciser  and  interpretable  representations  of  the graph at hand. We further propose SINr, an efficient implementation of the LDBGFapproach that extracts Sparse Interpretable Node Representations using community structure to approximate the underlying bipartite structure. In the case of graph embedding, our near-linear time method is  the  fastest  of  our  benchmark,  parameter-free  and  provides  state-of-the-art results on the classical link prediction task. We also show that low-dimensional vectors can be derived from SINr using singular value decomposition. In the case of word embedding, our approach proves to be very efficient considering the classical similarity evaluation.<\/p><\/div><\/div><\/div><\/div><div class=\"panel-grid\" id=\"pg-25327-1\" ><div class=\"panel-grid-core\"><div class=\"panel-grid-cell\" id=\"pgc-25327-1-0\" >&nbsp;<\/div><div class=\"panel-grid-cell\" id=\"pgc-25327-1-1\" >&nbsp;<\/div><div class=\"panel-grid-cell\" id=\"pgc-25327-1-2\" >&nbsp;<\/div><\/div><\/div>","protected":false},"excerpt":{"rendered":"<p>Seminar from Thibault Prouteau, PhD student at LIUM &nbsp; Date: 19\/04\/2021 Time: 11h00 Localization: online Speaker: Thibault Prouteau &nbsp; SINr: fast computing of Sparse Interpretable Node Representations is not a sin! &nbsp; While graph embedding aims at learning low-dimensional representations of nodes encompassing the graph topology, word embedding focus on learning word vectors that encode [&hellip;]<\/p>\n<p class=\"more-link style2\"><a href=\"https:\/\/lium.univ-lemans.fr\/en\/sinr-fast-computing-of-sparse-interpretable-node-representations-is-not-a-sin\/\"  class=\"themebutton\"><span class=\"more-text\">READ MORE<\/span><span class=\"more-icon\"><i class=\"fa fa-angle-right fa-lg\"><\/i><\/span><\/a><\/p>\n","protected":false},"author":14,"featured_media":13238,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":[],"categories":[43],"tags":[49],"acf":[],"_links":{"self":[{"href":"https:\/\/lium.univ-lemans.fr\/en\/wp-json\/wp\/v2\/posts\/25327"}],"collection":[{"href":"https:\/\/lium.univ-lemans.fr\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/lium.univ-lemans.fr\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/lium.univ-lemans.fr\/en\/wp-json\/wp\/v2\/users\/14"}],"replies":[{"embeddable":true,"href":"https:\/\/lium.univ-lemans.fr\/en\/wp-json\/wp\/v2\/comments?post=25327"}],"version-history":[{"count":0,"href":"https:\/\/lium.univ-lemans.fr\/en\/wp-json\/wp\/v2\/posts\/25327\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/lium.univ-lemans.fr\/en\/wp-json\/wp\/v2\/media\/13238"}],"wp:attachment":[{"href":"https:\/\/lium.univ-lemans.fr\/en\/wp-json\/wp\/v2\/media?parent=25327"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/lium.univ-lemans.fr\/en\/wp-json\/wp\/v2\/categories?post=25327"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/lium.univ-lemans.fr\/en\/wp-json\/wp\/v2\/tags?post=25327"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}