TY  - JOUR
T1  - A Domain-Based Approach to Extract Arabic Person Names Using N-Grams and Simple Rules
AU  - Alhawarat, Mohammad
JO  - Asian Journal of Information Technology
VL  - 14
IS  - 8
SP  - 287
EP  - 293
PY  - 2015
DA  - 2001/08/19
SN  - 1682-3915
DO  - ajit.2015.287.293
UR  - https://makhillpublications.co/view-article.php?doi=ajit.2015.287.293
KW  - Named entity recognition
KW  - Arabic person names
KW  - Natural language processing (NLP)
KW  - hadith
KW  - text mining
AB  - Named Entity Recognition (NER) is an important task in many human language technologies, including information extraction, Natural Language Processing (NLP) and machine translation. It is considered a challenging task for the Arabic language. Most existing studies deal only with names found in Modern Standard Arabic (MSA) sources such as news. In this study, we aim to build a Classical Arabic name list, or gazetteer, which represents an important part of a lively Arabic literature and culture. To achieve this goal, we propose a new approach for extracting Arabic Person Names (APNs). This approach constitutes a new model for extracting named entities from unstructured Arabic text without the need for Part of Speech (POS) tagging or morphological analysis. The proposed model is built for a specific domain. For this study, we use an authentic text from the literature of Islamic-Arabic studies, namely the "Hadith", which comprises the sayings of the Prophet Mohammad, Peace Be Upon Him (PBUH). We use NLP and text mining techniques to extract and build an accurate standard list of classical APNs. We also built a standard evaluation list of classical names in order to evaluate our approach. Results show a good precision of around 84%.
ER  -