ABSTRACT
This paper presents a multilingual system designed to recognize named entities in a wide variety of languages (currently more than 12 languages are concerned). The system includes original strategies to deal with a wide variety of encoding character sets, analysis strategies and algorithms to process these languages.
- Appelt D. and Israel D. (1999). Introduction to Information Extraction Technology. (IJCAI-99) Tutorial, Stockholm, Sweden (available at: http://www.ai.sri. com/~appelt/ie-tutorial/)Google Scholar
- Asahara M., Matsumoto M. (2000) Extended Models and Tools for High-performance Part-of-Speech Tagger". In Proceedings of Coling'2000, Saarbrücken, Germany, pp. 21--27. Google ScholarDigital Library
- Bechet F., Nasr A., Genet F. (2000) Tagging Unknown Proper Names Using Decision Trees. In Proceedings of the 38th ACL Conference, Hong-Kong, pp. 77--84 Google ScholarDigital Library
- Bikel D., Miller S., Schwartz R. and Weischedel R. (1997) Nymble: a high performance learning name-finder. In Proceeding of the 5th ANLP Conference, Washington, USA. Google ScholarDigital Library
- Borthwick A. (1999) A maximum entropy approach for named entity recognition. PhD Thesis, New York University. Google ScholarDigital Library
- Collins M. and Singer Y. (1999) Unsupervised models for named entity classification. In Proceedings of EMNLP/WVLC, 1999, MA, pp. 189--196.Google Scholar
- Cucchiarelli A. and Velardi P. (1999) Adaptability of linguistic resources to new domains: an experiment with proper noun dictionaries. In Proceedings of the Vextal Conference, Venice, Italy, pp. 25--30.Google Scholar
- Mikheev A., Moens M. and Grover C. (1999) Named Entity recognition without gazetteers. In Proceedings of the Annual Meeting of the European Association for Computational Linguistics EACL '99, Bergen, Norway, pp. 1--8. Google ScholarDigital Library
- Mooney R. (1993) Induction over the unexplained: using overly general domain theories to aid concept learning, Machine Learning, 10:79. Google ScholarDigital Library
- MUC-6 (1995) Proceedings of the Sixth Message Understanding Conference (DARPA), Morgan Kaufmann Publishers, San Francisco.Google Scholar
- Poibeau T and Kosseim L. (2001) Proper-name Extraction from Non-Journalistic Texts. Proceeding of the 11th Conference Computational Linguistics in the Netherlands, Tilburg. Netherlands, Rodopi.Google Scholar
- Sekine S., Eriguchi Y. (2000) Japanese Named Entity Extraction Evaluation - Analysis of Results. In Proceedings of Coling'2000, Saarbrücken, Germany, pp. 1106--1110. Google ScholarDigital Library
- Silberztein M. (1993) Dictionnaires électroniques. Masson, Paris.Google Scholar
- Yarowsky D. (1995) Unsupervised Word Sense Disambiguation rivaling Supervised Methods. In Proceedings of the 33rd ACL Conference, Cambridge, USA. Google ScholarDigital Library
- The multilingual named entity recognition framework
Recommendations
Learning multilingual named entity recognition from Wikipedia
We automatically create enormous, free and multilingual silver-standard training annotations for named entity recognition (ner) by exploiting the text and structure of Wikipedia. Most ner systems rely on statistical models of annotated data to identify ...
NERA: Named Entity Recognition for Arabic
Name identification has been worked on quite intensively for the past few years, and has been incorporated into several products revolving around natural language processing tasks. Many researchers have attacked the name identification problem in a ...
Named entity recognition an aid to improve multilingual entity filling in language-independent approach
IKM4DR '12: Proceedings of the first workshop on Information and knowledge management for developing regionThis paper details the approach to identify Named Entities (NEs) from a large non-English corpus and associate them with appropriate tags, requiring minimal human intervention and no linguistic expertise. The main objective in this paper is to focus on ...
Comments