{"id":92256,"date":"2018-03-11T10:11:11","date_gmt":"2018-03-11T10:11:11","guid":{"rendered":"https:\/\/www.deberes.net\/tesis\/sin-categoria\/a-communication-perspective-on-automatic-text-categorization\/"},"modified":"2018-03-11T10:11:11","modified_gmt":"2018-03-11T10:11:11","slug":"a-communication-perspective-on-automatic-text-categorization","status":"publish","type":"post","link":"https:\/\/www.deberes.net\/tesis\/tecnologia-de-las-telecomunicaciones\/a-communication-perspective-on-automatic-text-categorization\/","title":{"rendered":"A communication perspective on automatic text categorization"},"content":{"rendered":"<h2>Tesis doctoral de <strong> Marta Capdevila Dalmau <\/strong><\/h2>\n<p>El inter\u00e9s principal de un sistema de comunicaci\u00f3n es el de transferir informaci\u00f3n desde su fuente hasta su destino. Los documentos de texto tambi\u00e9n tratan con la transmisi\u00f3n de informaci\u00f3n. Particularmente, desde el punto de vista de un sistema de categorizaci\u00f3n de texto, la informaci\u00f3n codificada por un documento es el tema o categor\u00eda a la cual pertenece. Siguiendo esta intuici\u00f3n inicial, que a nuestro saber no ha sido explorada anteriormente, esta tesis desarrolla un nuevo marco te\u00f3rico donde se estudia la categorizaci\u00f3n autom\u00e1tica de textos (atc) desde una perspectiva de sistemas de comunicaci\u00f3n.  bajo este enfoque, en lo concerniente a la representaci\u00f3n interna del documento, se ha abordado la problem\u00e1tica reducci\u00f3n del espacio de indexaci\u00f3n con un esquema supervisado de dos niveles, implementado por un filtrado de t\u00e9rminos ruidosos y una posterio compresi\u00f3n de t\u00e9rminos redundantes. Con este objetivo, los t\u00e9rminos han sido caracterizados por una funci\u00f3n de distribuci\u00f3n por categor\u00edas sobre la cual se han podido establecer medidas de dispersi\u00f3n, que eval\u00faan el grado de informaci\u00f3n que conlleva el t\u00e9rmino, y medidas de similitud, que determinan la cantidad de redundancia que hay entre ellos. El tema de la compresi\u00f3n de t\u00e9rminos redundantes se ha tratado bajo un enfoque de agrupaci\u00f3n (clustering) aglomerativa que reagrupa t\u00e9rminos similares que pueden ser tratados como una \u00fanica entidad de indexaci\u00f3n.  en lo que respecta al clasificador, los categorizadores probabil\u00edsticos gausianos, hasta ahora b\u00e1sicamente ignorados, han sido revisados y adaptados a la concomitante dispersi\u00f3n en atc. Al supuesto gausiano se ha a\u00f1adido la hip\u00f3tesis de independencia adoptada por el enfoque naive bayes, lo que ha generado la familia de clasificadores naive bayes gausianos (gnb). Adem\u00e1s, la idea perseguida por nuestra familia de clasificadores adaptados gnb es la de establecer una cota inferior para la varianza gausiana de manera a mitigar los efectos de la dispersi\u00f3n t\u00edpica en la representaci\u00f3n de las colecciones de textos.<\/p>\n<p>&nbsp;<\/p>\n<h3>Datos acad\u00e9micos de la tesis doctoral \u00ab<strong>A communication perspective on automatic text categorization<\/strong>\u00ab<\/h3>\n<ul>\n<li><strong>T\u00edtulo de la tesis:<\/strong>\u00a0 A communication perspective on automatic text categorization <\/li>\n<li><strong>Autor:<\/strong>\u00a0 Marta Capdevila Dalmau <\/li>\n<li><strong>Universidad:<\/strong>\u00a0 Vigo<\/li>\n<li><strong>Fecha de lectura de la tesis:<\/strong>\u00a0 13\/03\/2009<\/li>\n<\/ul>\n<p>&nbsp;<\/p>\n<h3>Direcci\u00f3n y tribunal<\/h3>\n<ul>\n<li><strong>Director de la tesis<\/strong>\n<ul>\n<li>Oscar Willian Marquez Florez<\/li>\n<\/ul>\n<\/li>\n<li><strong>Tribunal<\/strong>\n<ul>\n<li>Presidente del tribunal: fernando P\u00e9rez gonz\u00e1lez <\/li>\n<li>david enrique Losada carril (vocal)<\/li>\n<li>lorenza Carrasco martorell (vocal)<\/li>\n<li>Jes\u00fas Cid sueiro (vocal)<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<p>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Tesis doctoral de Marta Capdevila Dalmau El inter\u00e9s principal de un sistema de comunicaci\u00f3n es el de transferir informaci\u00f3n desde [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"site-sidebar-layout":"default","site-content-layout":"","ast-site-content-layout":"","site-content-style":"default","site-sidebar-style":"default","ast-global-header-display":"","ast-banner-title-visibility":"","ast-main-header-display":"","ast-hfb-above-header-display":"","ast-hfb-below-header-display":"","ast-hfb-mobile-header-display":"","site-post-title":"","ast-breadcrumbs-content":"","ast-featured-img":"","footer-sml-layout":"","theme-transparent-header-meta":"","adv-header-id-meta":"","stick-header-meta":"","header-above-stick-meta":"","header-main-stick-meta":"","header-below-stick-meta":"","astra-migrate-meta-layouts":"default","ast-page-background-enabled":"default","ast-page-background-meta":{"desktop":{"background-color":"var(--ast-global-color-4)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-gradient":""},"tablet":{"background-color":"","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-gradient":""},"mobile":{"background-color":"","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-gradient":""}},"ast-content-background-meta":{"desktop":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-gradient":""},"tablet":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-gradient":""},"mobile":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-gradient":""}},"footnotes":""},"categories":[12500,4149,2528,2489,18657],"tags":[42335,12650,16176,166534,190898,190899],"class_list":["post-92256","post","type-post","status-publish","format-standard","hentry","category-codigo-y-sistemas-de-codificacion","category-diseno-y-componentes-de-sistemas-de-informacion","category-inteligencia-artificial","category-tecnologia-de-las-telecomunicaciones","category-vigo","tag-david-enrique-losada-carril","tag-fernando-perez-gonzalez","tag-jesus-cid-sueiro","tag-lorenza-carrasco-martorell","tag-marta-capdevila-dalmau","tag-oscar-willian-marquez-florez"],"_links":{"self":[{"href":"https:\/\/www.deberes.net\/tesis\/wp-json\/wp\/v2\/posts\/92256","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.deberes.net\/tesis\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.deberes.net\/tesis\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.deberes.net\/tesis\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.deberes.net\/tesis\/wp-json\/wp\/v2\/comments?post=92256"}],"version-history":[{"count":0,"href":"https:\/\/www.deberes.net\/tesis\/wp-json\/wp\/v2\/posts\/92256\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.deberes.net\/tesis\/wp-json\/wp\/v2\/media?parent=92256"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.deberes.net\/tesis\/wp-json\/wp\/v2\/categories?post=92256"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.deberes.net\/tesis\/wp-json\/wp\/v2\/tags?post=92256"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}