Hostname: page-component-77f85d65b8-8v9h9 Total loading time: 0 Render date: 2026-04-17T16:36:18.184Z Has data issue: false hasContentIssue false

ISO standard modeling of a large Arabic dictionary

Published online by Cambridge University Press:  07 September 2015

AIDA KHEMAKHEM
Affiliation:
MIRACL Laboratory, FSEGS, University of Sfax, B.P. 1088, 3018 Sfax, Tunisia e-mail: khemakhem.aida@gmail.com, bilel.gargouri@fsegs.rnu.tn
BILEL GARGOURI
Affiliation:
MIRACL Laboratory, FSEGS, University of Sfax, B.P. 1088, 3018 Sfax, Tunisia e-mail: khemakhem.aida@gmail.com, bilel.gargouri@fsegs.rnu.tn
ABDELMAJID BEN HAMADOU
Affiliation:
MIRACL Laboratory, ISIMS, University of Sfax, B.P. 242, 3021 Sakiet-Ezzit, Sfax, Tunisia e-mail: abdelmajid.benhamadou@isimsf.rnu.tn
GIL FRANCOPOULO
Affiliation:
IMMI-CNRS and Tagmatica, Rue John von Neumann, 91405 Orsay, France e-mail: gil.francopoulo@wanadoo.fr

Abstract

In this paper, we address the problem of the large coverage dictionaries of Arabic language usable both for direct human reading and automatic Natural Language Processing. For these purposes, we propose a normalized and implemented modeling, based on Lexical Markup Framework (LMF-ISO 24613) and Data Registry Category (DCR-ISO 12620), which allows a stable and well-defined interoperability of lexical resources through a unification of the linguistic concepts. Starting from the features of the Arabic language, and due to the fact that a large range of details and refinements need to be described specifically for Arabic, we follow a finely structuring strategy. Besides its richness in morphology, syntax and semantics knowledge, our model includes all the Arabic morphological patterns to generate the inflected forms from a given lemma and highlights the syntactic–semantic relations. In addition, an appropriate codification has been designed for the management of all types of relationships among lexical entries and their related knowledge. According to this model, a dictionary named El Madar1 has been built and is now publicly available on line. The data are managed by a user-friendly Web-based lexicographical workstation. This work has not been done in isolation, but is the result of a collaborative effort by an international team mainly within the ISO network during a period of eight years.

Information

Type
Articles
Copyright
Copyright © Cambridge University Press 2015 

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Article purchase

Temporarily unavailable