Concept Based Similarity Measure

2992 Words12 Pages

ENHANCING QUALITY OF SEARCH RESULTS BY CONCEPT BASED MODEL P.Sharmila1 , W.Rubavathy2 1Information Technology, Jeppiaar Engineering College, India Email: sharmila2771994@gmail.com 2Information Technology, Jeppiaar Engineering College, India Email: rupawilliams03@gmail.com Abstract: Text Mining techniques are mostly based on Vector space model(term frequencies). The statistical analysis of a term frequency captures the importance of the term without a document only. But two terms can have the same frequency in …show more content…

The fourth component is the concept-based similarity measure which allows measuring the significance of each concept with respect to the semantics of the sentence, the topic of the document, and the discrimination among documents in a corpus.By combining the factors affecting the weights of concepts on the sentence, document, and corpus levels, a concept-based similarity measure is capable of the accurate calculation of pairwise documents is devised. This allows performing concept matching and concept-based similarity calculations among documents in a very strong and accurate way. The excellence of text clustering achieved by this model considerably surpasses the traditional single term- based approaches.There are a number of possibilities for extending this paper. One chance is to apply this concept to web document clustering. Another chance is to apply the same model to text classification. The intention is to investigate the usage of such model on other corpora and its effect on classification compared to that of traditional

More about Concept Based Similarity Measure

Open Document