Extraction and annotation of ‘location names’ | Ekstrakcija i anotacija imena lokacija |
INFOtheca, Scientific paper | INFOteka, Naučni rad |
ID: 1.2019.2.1 Number: 2 Volume: 19 Year: 2019 UDC: 81’322.2 |
Tita Kyriacopoulou Institution: University of Paris-Est, Laboratoire d’Informatique Gaspard-Monge, France Mail: tita@u-pem.fr | Tita Kyriacopoulou Institucija: Univerzitet u Parizu-Est, Računarska laboratorija Gaspar Monž, Francuska E-pošta: tita@u-pem.fr |
Claude Martineau Institution: University of Paris-Est, Laboratoire d’Informatique Gaspard-Monge, France Mail: claude.martineau@u-pem.fr | Claude Martineau Institucija: Univerzitet u Parizu-Est, Računarska laboratorija Gaspar Monž, Francuska E-pošta: claude.martineau@u-pem.fr |
Markarit Vartampetian Institution: Paris Nanterre University, Paris, France Mail: markaritvar@gmail.com | Markarit Vartampetian Institucija: Univerzitet u Parizu Nantere, Pariz, Francuska E-pošta: markaritvar@gmail.com |
Abstract Introduced as part of the Message Understanding Conferences dedicated to information extraction, Named Entity extraction is a well-studied task in Natural Language Processing. The recognition and categorisation of person names, location names, organisation names, etc., is regarded as a fundamental process for a wide variety of natural language processing applications dealing with content analysis, and much research is devoted to it, with very good results.
One of our objectives is the identification and automatic (or semi-automatic) annotation of location names, so that the most appropriate information extraction methods can be applied. The main objective is the combination and interoperability of symbolic and statistical Natural Language Processing (NLP) methods (symbolic rules, machine learning, and data mining).
Our work consisted of recognising named entities, in particular locations, with Unitex, annotating them with Brat, and correcting the annotations manually. The recall and accuracy rates are very encouraging, but the question remains: what is a location name? | Apstrakt U radu je prikazano prepoznavanje imenovanih entiteta, tj. lokacija, korišćenjem sistema Unitex, zatim njihova anotacija primenom sistema Brat i na kraju manuelna korekcija. Odziv i preciznost su ohrabrujući, ali i dalje ostaje pitanje: Šta su to imena lokacija? |
Keywords: location names, locative complement, annotation, information extraction, Unitex | Ključne reči: imena lokacija, dopuna lokativa, anotacija, ekstrakcija informacija, Unitex |
Pages: 7-25 | Strane: |