
Lemaza: An Arabic why-question answering system*

  • AQIL M. AZMI and NOUF A. ALSHENAIFI

Question answering systems retrieve information from documents in response to queries. Most questions are who- and what-type questions that deal with named entities. A less common and more challenging type is the why-question. In this paper, we introduce Lemaza (Arabic for why), a system for automatically answering why-questions over Arabic texts. The system is composed of four main components that make use of Rhetorical Structure Theory (RST). To evaluate Lemaza, we prepared a set of why-question–answer pairs whose answers can be found in a corpus that we compiled from the Open Source Arabic Corpora. Lemaza performed best when stop-words were not removed, achieving 72.7% recall, 79.2% precision and a c@1 score of 78.7%.
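The abstract reports three measures without defining them. As a rough illustration only, the sketch below computes recall, precision and c@1 under commonly used definitions for question answering evaluation, where a system may leave a question unanswered. These definitions are an assumption, since the abstract does not state the ones used; the function name `qa_scores` and the counts in the usage line are hypothetical and are not the paper's data.

```python
def qa_scores(n_correct: int, n_wrong: int, n_unanswered: int) -> dict:
    """Compute recall, precision and c@1 from per-question outcome counts.

    Assumed definitions (not taken from the abstract):
      recall    = correct / all questions
      precision = correct / answered questions
      c@1       = (correct + unanswered * correct / n) / n
    """
    n = n_correct + n_wrong + n_unanswered
    if n == 0:
        raise ValueError("at least one question is required")

    answered = n_correct + n_wrong
    recall = n_correct / n
    # With no answered questions, precision is undefined; report 0.0 here.
    precision = n_correct / answered if answered else 0.0
    # c@1 credits each unanswered question with the overall accuracy,
    # so abstaining scores better than answering incorrectly.
    c_at_1 = (n_correct + n_unanswered * n_correct / n) / n

    return {"recall": recall, "precision": precision, "c_at_1": c_at_1}


# Hypothetical counts, for illustration only.
scores = qa_scores(n_correct=70, n_wrong=20, n_unanswered=10)
for name, value in scores.items():
    print(f"{name}: {value:.3f}")
```

Under these assumed definitions, c@1 equals recall when every question is answered and rises above it as wrong answers are replaced by abstentions.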


We would like to thank W. Al-Sanie for sharing his RST implementation, and the language specialist for helping us with the why-question–answer pairs. The first author would like to thank Miss Maryam for her assistance in proofreading the manuscript. Special thanks to all three anonymous reviewers for their constructive comments, which helped to improve the manuscript further. This work was supported by a special fund in the Research Center of the College of Computer & Information Sciences (CCIS) at King Saud University, for which the authors are thankful.


Natural Language Engineering
  • ISSN: 1351-3249
  • EISSN: 1469-8110
  • URL: /core/journals/natural-language-engineering