This book addresses all the major and latest techniques of data mining. It deals in detail with the algorithms for discovering association rules for clustering and building decision trees, and techniques such as neural networks, genetic algorithms, rough set theory and support vector machine used in data mining. The algorithmic details of different techniques such as Apriori, Pincer-search, Dynamic Itemset Counting, FP-Tree growth, SLIQ, SPRINT, BOAT, CART, RainForest, BIRCH, CURE, BUBBLE, ROCK, STIRR, PAM, CLARANS, DBSCAN, GSP, SPADE and SPIRIT are covered. The book also discusses the mining of web, spatial, temporal and text data. In the third edition, the chapter on data warehousing concepts was thoroughly revised to include multidimensional data modelling and cube computation. The discussion on genetic algorithms was also expanded as a separate chapter. In the fourth edition, a chapter on ROC curve for visualizing the performance of a binary classifier and the method for computing AUC and its uses has been included.
Students of computer science, mathematical science and management will find this introductory textbook beneficial for a first course on the subject; the exposition of concepts with supporting illustrative examples and exercises makes it suitable for self-study as well.