Bibliša: Aligned Collection Search Tool

Extraction and annotation of ‘location names’
INFOtheca, Scientific paper
ID: 1.2019.2.1 Number: 2 Volume: 19 Year: 2019 UDC: 81’322.2
Tita Kyriacopoulou
Institution: University of Paris-Est, Laboratoire d’Informatique Gaspard-Monge, France
Mail: tita@u-pem.fr
Claude Martineau
Institution: University of Paris-Est, Laboratoire d’Informatique Gaspard-Monge, France
Mail: claude.martineau@u-pem.fr
Markarit Vartampetian
Institution: Paris Nanterre University, Paris, France
Mail: markaritvar@gmail.com
Abstract
Introduced as part of the Message Understanding Conferences dedicated to information extraction, Named Entity extraction is a well-studied task in Natural Language Processing (NLP). The recognition and categorisation of person names, location names, organisation names, etc., is regarded as a fundamental process for a wide variety of NLP applications dealing with content analysis, and many research works are devoted to it, achieving very good results. One of our objectives is the identification and automatic (or semi-automatic) annotation of location names in order to apply the most appropriate information extraction methods. The main objective concerns the combination and interoperability of symbolic and statistical NLP methods (symbolic rules, machine learning, and data mining). Our work consisted of recognising named entities, in particular locations, with Unitex, annotating them with Brat, and correcting them manually. The recall and accuracy rates are very encouraging, but the question remains: What is a location name?
Abstract (Serbian version)
The paper presents the recognition of named entities, i.e. locations, using the Unitex system, followed by their annotation with the Brat system and, finally, manual correction. Recall and precision are encouraging, but the question remains: What are location names?
Keywords: location names, locative complement, annotation, information extraction, Unitex
Pages: 7-25