Data Mining and Analysis: Fundamental Concepts and Algorithms Front Cover

Data Mining and Analysis: Fundamental Concepts and Algorithms

Description

The fundamental algorithms in data mining and analysis form the basis for the emerging field of data science, which includes automated methods to analyze patterns and models for all kinds of data, with applications ranging from scientific discovery to business intelligence and analytics. This textbook for senior undergraduate and graduate data mining courses provides a broad yet in-depth overview of data mining, integrating related concepts from machine learning and statistics. The main parts of the book include exploratory data analysis, pattern mining, clustering, and classification. The book lays the basic foundations of these tasks, and also covers cutting-edge topics such as kernel methods, high-dimensional data analysis, and complex graphs and networks. With its comprehensive coverage, algorithmic perspective, and wealth of examples, this book offers solid guidance in data mining for students, researchers, and practitioners alike.

Key features

  • Covers both core methods and cutting-edge research
  • Algorithmic approach with open-source implementations
  • Minimal prerequisites: all key mathematical concepts are presented, as is the intuition behind the formulas
  • Short, self-contained chapters with class-tested examples and exercises allow for flexibility in designing a course and for easy reference
  • Supplementary website with lecture slides, videos, project ideas, and more

Table of Contents

Chapter 1 Data Mining and Analysis

Part I Data Analysis Foundations
Chapter 2 Numeric Attributes
Chapter 3 Categorical Attributes
Chapter 4 Graph Data
Chapter 5 Kernel Methods
Chapter 6 High-Dimensional Data
Chapter 7 Dimensionality Reduction

Part II Frequent Pattern Mining
Chapter 8 Itemset Mining
Chapter 9 Summarizing Itemsets
Chapter 10 Sequence Mining
Chapter 11 Graph Pattern Mining
Chapter 12 Pattern and Rule Assessment

Part III Clustering
Chapter 13 Representative-based Clustering
Chapter 14 Hierarchical Clustering
Chapter 15 Density-based Clustering
Chapter 16 Spectral and Graph Clustering
Chapter 17 Clustering Validation

Part IV Classification
Chapter 18 Probabilistic Classification
Chapter 19 Decision Tree Classifier
Chapter 20 Linear Discriminant Analysis
Chapter 21 Support Vector Machines
Chapter 22 Classification Assessment

To access the link, solve the captcha.