Read more
Data Mining: Concepts and Techniques, Fourth Edition introduces concepts, principles, and methods for mining patterns, knowledge, and models from various kinds of data for diverse applications. Specifically, it delves into the processes for uncovering patterns and knowledge from massive collections of data, known as
knowledge discovery from data, or
KDD. It focuses on the feasibility, usefulness, effectiveness, and scalability of data mining techniques for large data sets.
After an introduction to the concept of data mining, the authors explain the methods for preprocessing, characterizing, and warehousing data. They then partition the data mining methods into several major tasks, introducing concepts and methods for mining frequent patterns, associations, and correlations for large data sets; data classificcation and model construction; cluster analysis; and outlier detection. Concepts and methods for deep learning are systematically introduced as one chapter. Finally, the book covers the trends, applications, and research frontiers in data mining.
List of contents
1. Introduction
2. Data, measurements, and data processing
3. Data warehousing and online analytical processing
4. Pattern mining: basic concepts and methods
5. Pattern mining: advanced methods
6. Classification: basic concepts and methods
7. Classification: advanced methods
8. Cluster analysis: basic concepts and methods
9. Cluster analysis: advanced methods
10. Deep learning
11. Outlier Detection
12. Data mining trends and research frontiers
Appendix: Mathematical background
About the author
Jiawei Han is Professor in the Department of Computer Science at the University of Illinois at Urbana-Champaign. Well known for his research in the areas of data mining and database systems, he has received many awards for his contributions in the field, including the 2004 ACM SIGKDD Innovations Award. He has served as Editor-in-Chief of ACM Transactions on Knowledge Discovery from Data, and on editorial boards of several journals, including IEEE Transactions on Knowledge and Data Engineering and Data Mining and Knowledge Discovery.