{"id":71525,"date":"2018-03-09T23:16:12","date_gmt":"2018-03-09T23:16:12","guid":{"rendered":"https:\/\/www.deberes.net\/tesis\/sin-categoria\/anotacion-semantica-no-supervisada\/"},"modified":"2018-03-09T23:16:12","modified_gmt":"2018-03-09T23:16:12","slug":"anotacion-semantica-no-supervisada","status":"publish","type":"post","link":"https:\/\/www.deberes.net\/tesis\/matematicas\/anotacion-semantica-no-supervisada\/","title":{"rendered":"Anotaci\u00f3n sem\u00e1ntica no supervisada"},"content":{"rendered":"<h2>Doctoral thesis by <strong>David Jose Fernandez Amoros<\/strong><\/h2>\n<p>This thesis addresses the problem of disambiguating the sense of words (i.e., given a dictionary, a word, and a context, deciding in which dictionary sense the word is being used in that context). The sources of information used are: 1. Taxonomic information based on the is-a relation; for example, an eagle is-a bird. 2. Co-occurrence information. Starting from a corpus of almost 300 million words taken from books in electronic format (Project Gutenberg), we study pairs of words whose occurrences in short contexts are statistically dependent. We use several measures to gauge that degree of dependence and apply this information to disambiguation. 3. Information extracted from the WWW. The glosses of the sense inventory are complemented with information extracted from the web. This information was extracted by Celina Santamar\u00eda from a document classification system built by volunteers (Open Directory Project). 4. Information from comparable bilingual corpora. 
Starting from one corpus in English and another in Spanish, shallow syntactic patterns corresponding to noun phrases were searched for in both languages. Building on this work by Anselmo Pe\u00f1as and Fernando L\u00f3pez Ostenero, we study whether the differences between the two languages can be exploited to detect these phrases and disambiguate them using the cross-lingual capabilities of a lexical knowledge base (EuroWordNet). It will be shown that unsupervised semantic annotation can achieve good results, and that there are lines of research with significant potential for improvement that deserve to be explored.<\/p>\n<p>&nbsp;<\/p>\n<h3>Academic details of the doctoral thesis \u00ab<strong>Anotaci\u00f3n sem\u00e1ntica no supervisada<\/strong>\u00bb<\/h3>\n<ul>\n<li><strong>Thesis title:<\/strong>\u00a0 Anotaci\u00f3n sem\u00e1ntica no supervisada <\/li>\n<li><strong>Author:<\/strong>\u00a0 David Jose Fernandez Amoros <\/li>\n<li><strong>University:<\/strong>\u00a0 Nacional de Educaci\u00f3n a Distancia<\/li>\n<li><strong>Thesis defense date:<\/strong>\u00a0 29\/11\/2004<\/li>\n<\/ul>\n<p>&nbsp;<\/p>\n<h3>Supervision and committee<\/h3>\n<ul>\n<li><strong>Thesis supervisor<\/strong>\n<ul>\n<li>Julio Gonzalo Arroyo<\/li>\n<\/ul>\n<\/li>\n<li><strong>Committee<\/strong>\n<ul>\n<li>Committee chair: Llu\u00eds Padr\u00f3 Cirera<\/li>\n<li>Manuel de Buenaga Rodriguez (member)<\/li>\n<li>Raquel Martinez Unanue (member)<\/li>\n<li>Eneko Agirre Bengoa (member)<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<p>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Doctoral thesis by David Jose Fernandez Amoros This thesis addresses the problem of disambiguating the sense of 
[&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"site-sidebar-layout":"default","site-content-layout":"","ast-site-content-layout":"","site-content-style":"default","site-sidebar-style":"default","ast-global-header-display":"","ast-banner-title-visibility":"","ast-main-header-display":"","ast-hfb-above-header-display":"","ast-hfb-below-header-display":"","ast-hfb-mobile-header-display":"","site-post-title":"","ast-breadcrumbs-content":"","ast-featured-img":"","footer-sml-layout":"","theme-transparent-header-meta":"","adv-header-id-meta":"","stick-header-meta":"","header-above-stick-meta":"","header-main-stick-meta":"","header-below-stick-meta":"","astra-migrate-meta-layouts":"default","ast-page-background-enabled":"default","ast-page-background-meta":{"desktop":{"background-color":"var(--ast-global-color-4)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-gradient":""},"tablet":{"background-color":"","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-gradient":""},"mobile":{"background-color":"","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-gradient":""}},"ast-content-background-meta":{"desktop":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center 
center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-gradient":""},"tablet":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-gradient":""},"mobile":{"background-color":"var(--ast-global-color-5)","background-image":"","background-repeat":"repeat","background-position":"center center","background-size":"auto","background-attachment":"scroll","background-type":"","background-media":"","overlay-type":"","overlay-color":"","overlay-gradient":""}},"footnotes":""},"categories":[1890,13880,2528,126,17070],"tags":[155822,84560,55068,52765,55067,54810],"class_list":["post-71525","post","type-post","status-publish","format-standard","hentry","category-ciencia-de-los-ordenadores","category-informatica","category-inteligencia-artificial","category-matematicas","category-nacional-de-educacion-a-distancia","tag-david-jose-fernandez-amoros","tag-de-buenaga-rodriguez-manuel","tag-eneko-agirre-bengoa","tag-julio-gonzalo-arroyo","tag-lluis-padro-cirera","tag-raquel-Martinez-unanue"],"_links":{"self":[{"href":"https:\/\/www.deberes.net\/tesis\/wp-json\/wp\/v2\/posts\/71525","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.deberes.net\/tesis\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.deberes.net\/tesis\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.deberes.net\/tesis\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.deberes.net\/tesis\/wp-json\/wp\/v2\/comments?post=71525"}],"version-history":[{"count":0,"href":"https:\/\/www.deberes.net\/tesis\/wp-json\/wp\/v2\/posts\/71525\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.deberes.net\/tesis\/wp
-json\/wp\/v2\/media?parent=71525"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.deberes.net\/tesis\/wp-json\/wp\/v2\/categories?post=71525"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.deberes.net\/tesis\/wp-json\/wp\/v2\/tags?post=71525"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}