Hostname: page-component-89b8bd64d-sd5qd Total loading time: 0 Render date: 2026-05-13T15:16:33.676Z Has data issue: false hasContentIssue false

Morphologically rich Urdu grammar parsing using Earley algorithm

Published online by Cambridge University Press:  16 April 2015

QAISER ABBAS*
Affiliation:
Fachbereich Sprachwissenschaft, Universität Konstanz, 78457 Konstanz, Germany e-mail: qaiser.abbas@uni-konstanz.de

Abstract

This work presents the development and evaluation of an extended Urdu parser. It further focuses on issues related to this parser and describes the changes made in the Earley algorithm to get accurate and relevant results from the Urdu parser. The parser makes use of a morphologically rich context free grammar extracted from a linguistically-rich Urdu treebank. This grammar with sufficient encoded information is comparable with the state-of-the-art parsing requirements for the morphologically rich Urdu language. The extended parsing model and the linguistically rich extracted-grammar both provide us better evaluation results in Urdu/Hindi parsing domain. The parser gives 87% of f-score, which outperforms the existing parsing work of Urdu/Hindi based on the tree-banking approach.

Information

Type
Articles
Copyright
Copyright © Cambridge University Press 2015 

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

Article purchase

Temporarily unavailable