files/journal/2022-09-02_12-20-40-000000_622.png

International Journal of Soft Computing

ISSN: Online
ISSN: Print 1816-9503
111
Views
0
Downloads

Extraction of Root Words Using Morphological Analyzer for Hindi Text

Anjusha Pimpalshende and A.R. Mahajan
Page: 134-138 | Received 21 Sep 2022, Published online: 21 Sep 2022

Full Text Reference XML File PDF File

Abstract

Stemming is a process of extracting words from text and turning them into index terms in an IR system. Stemmers are based upon the written and not the spoken form of the language. Word stemming is one of the most significant factors that affect the performance of a Natural Language Processing (NLP) application such as Information Retrieval (IR) system, part of speech tagging, machine translation system and syntactic parsing, text summarization. A stemmer converts morphologically identical words to root word without performing analysis of that term. Sometimes, if we remove suffix from the word then the word may not be a proper Hindi word. So, to overcome this problem, a stemming algorithm is proposed that uses hybrid approach (combination of Brute force approach, suffix stripping approach and suffix substitution).


How to cite this article:

Anjusha Pimpalshende and A.R. Mahajan. Extraction of Root Words Using Morphological Analyzer for Hindi Text.
DOI: https://doi.org/10.36478/ijscomp.2018.134.138
URL: https://www.makhillpublications.co/view-article/1816-9503/ijscomp.2018.134.138