International Journal of Soft Computing

Extraction of Root Words Using Morphological Analyzer for Hindi Text

Anjusha Pimpalshende and A.R. Mahajan
Page: 134-138 | Received 21 Sep 2022, Published online: 21 Sep 2022

Abstract

Stemming is the process of extracting words from text and turning them into index terms in an Information Retrieval (IR) system. Stemmers operate on the written form of a language, not the spoken form. Word stemming is one of the most significant factors affecting the performance of Natural Language Processing (NLP) applications such as IR systems, part-of-speech tagging, machine translation, syntactic parsing and text summarization. A stemmer maps morphologically related words to a common root word without performing a full analysis of the term. However, removing a suffix from a word can leave a string that is not a proper Hindi word. To overcome this problem, a stemming algorithm is proposed that uses a hybrid approach: a combination of the brute force approach, suffix stripping and suffix substitution.
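The hybrid idea described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the lookup table, lexicon, suffix lists and the function name `stem` are hypothetical toy data chosen for illustration, and the order of the three stages (lookup, then stripping, then substitution) is an assumption. The sketch shows how a lexicon check can reject a stripped form that is not a proper Hindi word and fall back to suffix substitution.

```python
import unicodedata

# Hypothetical toy resources (not from the paper); a real system would
# load far larger tables.
LOOKUP = {"गया": "जा"}                       # brute force: inflected form -> root
LEXICON = {"किताब", "कहानी", "जा"}            # known valid root words
STRIP_SUFFIXES = ["ों", "ें"]                 # suffixes that are simply removed
SUBSTITUTIONS = [("ियाँ", "ी"), ("ियों", "ी")]  # (old suffix, replacement)


def stem(word: str) -> str:
    """Return a root for `word`, or `word` unchanged if nothing validates."""
    word = unicodedata.normalize("NFC", word)

    # Stage 1: brute force table lookup (handles irregular forms).
    if word in LOOKUP:
        return LOOKUP[word]
    if word in LEXICON:
        return word

    # Stage 2: suffix stripping, accepted only if the result is a known word.
    for suffix in STRIP_SUFFIXES:
        if word.endswith(suffix):
            candidate = word[: -len(suffix)]
            if candidate in LEXICON:
                return candidate

    # Stage 3: suffix substitution, again validated against the lexicon.
    for old, new in SUBSTITUTIONS:
        if word.endswith(old):
            candidate = word[: -len(old)] + new
            if candidate in LEXICON:
                return candidate

    # No stage produced a valid word; leave the input as it is.
    return word


if __name__ == "__main__":
    for w in ["गया", "किताबें", "कहानियाँ", "कहानियों"]:
        print(w, "->", stem(w))
```

In this toy setup, stripping "ों" from "कहानियों" yields "कहानिय", which is not in the lexicon, so the sketch rejects it and substitutes "ियों" with "ी" to reach "कहानी".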

How to cite this article:

Anjusha Pimpalshende and A.R. Mahajan. Extraction of Root Words Using Morphological Analyzer for Hindi Text.
DOI: https://doi.org/10.36478/ijscomp.2018.134.138
URL: https://www.makhillpublications.co/view-article/1816-9503/ijscomp.2018.134.138