TY  - JOUR
T1  - Two Layer Machine Learning Approach for Mining Referential Entities for a Morphologically Rich Language
AU - Ram, Vijay Sundar AU - Devi, Sobha Lalitha 
JO  - Asian Journal of Information Technology
VL  - 15
IS  - 16
SP  - 2831
EP  - 2838
PY  - 2016
DA  - 2001/08/19
SN  - 1682-3915
DO  - ajit.2016.2831.2838
UR  - https://makhillpublications.co/view-article.php?doi=ajit.2016.2831.2838
KW  - coreference resolution
KW  -Chain of referential entities
KW  -pronominal resolution
KW  -noun phrase coreferencing
KW  -Tree CRFs
KW  -support vector machines
KW  -Tamil
KW  -morphologically rich language
AB  - Business Intelligence (BI), a technology-driven process and presenting actionable information has become important for improvement of organisation, business units etc. BI requires mining information from a huge volume of unstructured text data. This mining task requires sophisticated natural language processing tasks. One of the crucial tasks is identifying the chain of referential entities in the given text which is described as coreference resolution. Coreference is the referent in one expression of the same referent in another expression and the referents must exist in the real world. Coreference chain is formed by connecting entities referring to same entity. We approach this resolution task for a morphologically rich language, Tamil as two subtasks and use two machine learning approaches. The two subtasks, the pronominal resolution and noun phrase coreferencing is done using Tree Conditional Random Fields (Tree CRFs) and Support Vector Machines (SVM), respectively. Coreference chains are evaluated with standard metrics and the results are encouraging.
ER  -