Capítulo 8. Diccionarios para máquinas

Tabla de contenidos

Estándares relativos a la terminología
Estandarizando lexicones computacionales: OLIF2

Estándares relativos a la terminología

Interesante lista de estándares relativos a la terminología, en un mensaje de Kara Warburton al foro de discusión sobre «localización» y terminología de LISA, Como no tiene desperdicio lo parafraseo:

Here is a list of Terminology Standards that I am familiar with and which I find useful. This is a starting point for a SIG list. Please post as a reply any additional ones which you find useful.

  1. TBX - TermBase eXchange format. This is the XML terminology markup format proposed by the LISA/OSCAR standards group as an standard for the localization industry. More info here:

  2. OLIF2 - Open Lexicon Interchange Format. This is an interchange format specifically for machine-readable lexicographical data, such as for machine translation systems. OLIF data will be recordable in TBX.

ISO TC 37 Standards

  1. ISO DIS 16642:2002 - TMF - Terminology Markup Framework. This is a high-level standard framework for defining individual TMLs (terminology markup languages). It covers basic structure and architecture of TMLs and terminology databases.

  2. ISO 12620:1999 (under revision) - Terminology Data Categories. This standard is being revised into 2 parts. The first part describes a standard generic method for defining data categories for terminology collections (what standard properties they should have, etc.). The second part is an inventory of all the possible types of data categories in lexicology and terminology (term, part of speech, definition, context, variants, etc., etc.). This part is very useful as a catalog for picking data categories for your own terminology database.

  3. ISO 12200:1999 - MARTIF - If you're still into SGML, this is a good established standard markup format. MARTIF is the basis for TBX and is also supported by a number of terminology tools. MARTIF will be integrated as an appendix in the final version of ISO 16642.

  4. ISO 17241 - GENETER. Another standard SGML format for terminology, used by some databases in Europe. GENETER will be integrated as an appendix in the final version of ISO 16642.

  5. ISO 704:2000 - Terminology Work - Principles and methods. Provides all kinds of useful information about terminology research methods and writing definitions, and other process-oriented tasks.

  6. ISO 12616 - Translation-oriented terminography. Provides useful guidelines for terminology work specifically to support the translation process.

  7. ISO 1087-1 and 1087-2 - Terminology Work - Vocabulary. These basically contain definitions of standard terms in terminology. A useful reference point for the SIG.


¿Cómo conseguir los caros estándares del Comité Técnico 37 de ISO, esenciales en este campo?