Description
• Text mining is the discovery of interesting knowledge in text documents. • Many data mining techniques have been proposed for mining useful patterns in text documents. • It is a challenging issue to find accurate knowledge (or features) in text documents to help users to find what they want. • In existing, Information Retrieval (IR) provided many term-based methods to solve this challenge. • The term-based methods suffer from the problems of polysemy and synonymy. • The polysemy means a word has multiple meanings, and synonymy is multiple words having the same meaning. • In proposed to use pattern (or phrase)-based approaches should perform better than the term-based ones. • The proposed approach can improve the accuracy of evaluating term weights because discovered patterns are more specific than whole documents.