Multilingual resources for NLP in the lexical markup framework (LMF)

Language Resources and Evaluation

Abstract

Optimizing the production, maintenance and extension of lexical resources is one of the crucial aspects impacting natural language processing (NLP). A second aspect involves optimizing the process leading to their integration in applications. In this respect, we believe that a consensual specification for monolingual, bilingual and multilingual lexicons can be a useful aid for the various NLP actors. Within ISO, one purpose of the Lexical Markup Framework (LMF, ISO-24613) is to define a standard for lexicons that covers multilingual lexical data.

Notes

  1. Two types of diagrams must be distinguished: class diagrams and instance diagrams. A model is specified by a UML class diagram within a UML package: in this case, the class name is not underlined in the diagram. The various examples of word description are represented by UML instance diagrams: in this case, the class name is underlined.

  2. Packages for Morphology, Syntax and Semantics are described in (Francopoulo 2006).

  3. www.papillon-dictionary.org.

  4. To be more precise, Papillon uses the term “axie”. We decided to use the term “axis” in order to respect English orthography.

  5. It should be noted that a W3C schema is not specified as an ISO standard but as a W3C recommendation. For criticisms of W3C schemas and a comparison with Relax NG, see (Van der Vlist 2004).

  6. Please consult www.lexicalmarkupframework.org for updated information.

References

  • Antoni-Lay, M.-H., Francopoulo, G., & Zaysser, L. (1994). A generic model for reusable lexicons: The GENELEX project. Literary and Linguistic Computing, 9(1), 47–54.

  • Atkins, S., Bel, N., Bertagna, F., Bouillon, P., Calzolari, N., Fellbaum, C., et al. (2002). From resources to applications. Designing the multilingual ISLE lexical entry. Proceedings of LREC, Las Palmas, Canary Islands, Spain (pp. 687–693).

  • Bertagna, F., Lenci, A., Monachini, M., & Calzolari, N. (2004). Content interoperability of lexical resources, open issues and MILE perspectives. Proceedings of LREC, Lisbon (pp. 131–134).

  • Calzolari, N., McNaught, J., & Zampolli, A. (1996). EAGLES, editors' introduction, from www.ilc.cnr.it/EAGLES96/edintro.html.

  • Francopoulo, G., George, M., Calzolari, N., Monachini, M., Bel, N., et al. (2006). Lexical markup framework (LMF). Proceedings of LREC, Genoa (pp. 223–236).

  • Ide, N., & Véronis, J. (1994). MULTEXT: Multilingual text tools and corpora. In Proceedings of the 15th International Conference on Computational Linguistics, COLING, Kyoto.

  • ISO-12620. (1999). Computer application in terminology—Data categories. Geneva: ISO.

  • Khemakhem, A., Gargouri, B., Abdelwahed, A., & Francopoulo, G. (2007). Modélisation des paradigmes de flexion des verbes arabes selon la norme LMF-ISO 24613. Toulouse: TALN.

  • Lenci, A., Bel, N., Busa, F., Calzolari, N., Gola, E., Monachini, M., et al. (2000). SIMPLE: A general framework for the development of multilingual lexicons. International Journal of Lexicography, 13(4), 249–263.

  • LMF. (2008). Lexical markup framework ISO-FDIS24613. Geneva: ISO.

  • Monachini, M., Quochi, V., Del Gratta, R., & Calzolari, N. (2007). Using LMF to shape a lexicon for the biomedical domain. LangTech Proceeding, Rome.

  • Rumbaugh, J., Jacobson, I., & Booch, G. (2004). The unified modeling language reference manual (2nd ed.). Boston, MA: Addison Wesley.

  • Sérasset, G., & Mangeot-Lerebours, M. (2001). Papillon Lexical Database project: Monolingual dictionaries & interlingual links. Tokyo: NLPRS.

  • Van der Vlist, E. (2004). Relax NG, O’Reilly. Sebastopol, California: CRCS.

  • Zampolli, A. (1997). The PAROLE project in the general context of the European actions for language resources. In R. Marcinkeviciene & N. Volz (Eds.), TELRI Proceedings of the Second European Seminar: Language Applications for a Multilingual Europe. Mannheim/Kaunas: IDS/VDU.

Acknowledgements

The work presented here was partially funded by the EU eContent-22236 LIRICS project (http://lirics.loria.fr).

Author information

Corresponding author

Correspondence to Gil Francopoulo.

Additional information

This paper is the description of the ongoing work within ISO committees and is not a position paper.

About this article

Cite this article

Francopoulo, G., Bel, N., George, M. et al. Multilingual resources for NLP in the lexical markup framework (LMF). Lang Resources & Evaluation 43, 57–70 (2009). https://doi.org/10.1007/s10579-008-9077-5

  • DOI: https://doi.org/10.1007/s10579-008-9077-5
