TY - JOUR T1 - A Novel Approach to Concept Extraction Using Naive Bayesian Classification Technique AU - , M. Sathya AU - , P.Venil AU - , M.S. Saleem Basha JO - International Journal of Soft Computing VL - 2 IS - 4 SP - 488 EP - 493 PY - 2007 DA - 2001/08/19 SN - 1816-9503 DO - ijscomp.2007.488.493 UR - https://makhillpublications.co/view-article.php?doi=ijscomp.2007.488.493 KW - Information extraction KW -naive bayesian KW -classification KW -novel approach KW -technique AB - Most of the information resources are capable to provide concept dependent results for the biased query. But the retrieval mechanism could not provide the relevant documents for the query text due to the size of the information resources is dynamically growing as the new topics being added. This problem can be overcome by automatically generating wrappers for these hidden documents. We are proposing a novel approach for automatically generating wrappers for describing the content of the hidden documents using a co-occurrence based clustering algorithm and Naive Bayesian classification model. The initial stage is the learning stage, which clusters the document based on the distinct concepts present in that. The learning technique makes use of a thesaurus and builds a co-occurrence correlation model. Then the clustered document features are used to generate the concept description using Naive Bayesian classifier. The join and posterior probabilities are calculated using the greedy selection and joining algorithm to represent cluster. Our implementation was tested on the standard data set and shows a better performance. ER -