{"id":14460,"date":"2023-10-10T20:02:59","date_gmt":"2023-10-10T18:02:59","guid":{"rendered":"https:\/\/sites.uclouvain.be\/orm\/?p=14460"},"modified":"2023-10-10T20:30:55","modified_gmt":"2023-10-10T18:30:55","slug":"des-milliers-darticles-rtbf","status":"publish","type":"post","link":"https:\/\/sites.uclouvain.be\/orm\/2023\/10\/10\/des-milliers-darticles-rtbf\/","title":{"rendered":"Thousands of RTBF articles made available to researchers"},"content":{"rendered":"\n<p>CENTAL (Centre de Traitement Automatique du Langage) and ORM (Observatoire de Recherche sur les M\u00e9dias et le Journalisme) at UCLouvain are pleased to announce the release of the Corpus RTBF, a corpus of more than 750,000 news articles published by the French-language Belgian public service broadcaster between 2008 and 2021.<\/p>\n\n\n\n<p>Through a scientific collaboration with RTBF, UCLouvain obtained permission to make available to the academic community all articles published on the broadcaster&#8217;s website up to the end of 2021.<\/p>\n\n\n\n<p>The corpus contains 214 million words in total. Each article comes with the following metadata: ID, title, publication date, byline, source, category and keyword. 
More details on the corpus are available in <a href=\"https:\/\/dial.uclouvain.be\/pr\/boreal\/object\/boreal:276580\">the article<\/a> that accompanies the data.<\/p>\n\n\n\n<p>RTBF (Radio-t\u00e9l\u00e9vision belge de la Communaut\u00e9 fran\u00e7aise) is the public service broadcaster of the French-speaking community of Belgium. As a public service media organisation, it is funded directly by the Belgian government and has three main missions: to inform, educate and entertain the widest possible audience within the French-speaking Belgian community. In addition to running television channels and radio stations, RTBF has operated a news website since 2008, on which online-only news articles are published daily.<\/p>\n\n\n\n<p>\u27a1\ufe0f Please read and accept the terms of use before downloading the corpus. 
The Corpus RTBF is freely accessible in JSON, CSV and Excel formats via this <a href=\"https:\/\/dataverse.uclouvain.be\/dataset.xhtml?persistentId=doi:10.14428\/DVN\/PEVSSI\">link<\/a>.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>CENTAL (Centre de Traitement Automatique du Langage) and ORM (Observatoire de Recherche sur les M\u00e9dias et le Journalisme) at UCLouvain are pleased to announce the release of the Corpus RTBF, a corpus of more than 750,000 news articles published by the French-language Belgian public service broadcaster between 2008 and [&hellip;]<\/p>\n","protected":false},"author":13,"featured_media":14461,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[203],"tags":[],"class_list":["post-14460","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-actualites"],"featured_image_src":"https:\/\/sites.uclouvain.be\/orm\/wp-content\/uploads\/2023\/10\/Capture-decran-2023-10-10-a-19.53.49.png","author_info":{"display_name":"Antonin 
Descampe","author_link":"https:\/\/sites.uclouvain.be\/orm\/author\/adescampe\/"},"_links":{"self":[{"href":"https:\/\/sites.uclouvain.be\/orm\/wp-json\/wp\/v2\/posts\/14460","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/sites.uclouvain.be\/orm\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/sites.uclouvain.be\/orm\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/sites.uclouvain.be\/orm\/wp-json\/wp\/v2\/users\/13"}],"replies":[{"embeddable":true,"href":"https:\/\/sites.uclouvain.be\/orm\/wp-json\/wp\/v2\/comments?post=14460"}],"version-history":[{"count":2,"href":"https:\/\/sites.uclouvain.be\/orm\/wp-json\/wp\/v2\/posts\/14460\/revisions"}],"predecessor-version":[{"id":14466,"href":"https:\/\/sites.uclouvain.be\/orm\/wp-json\/wp\/v2\/posts\/14460\/revisions\/14466"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/sites.uclouvain.be\/orm\/wp-json\/wp\/v2\/media\/14461"}],"wp:attachment":[{"href":"https:\/\/sites.uclouvain.be\/orm\/wp-json\/wp\/v2\/media?parent=14460"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/sites.uclouvain.be\/orm\/wp-json\/wp\/v2\/categories?post=14460"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/sites.uclouvain.be\/orm\/wp-json\/wp\/v2\/tags?post=14460"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}