- A practical and concise guide to using Haskell when getting to grips with data analysis
- Recipes for every stage of data analysis, from collection to visualization
- In-depth examples demonstrating various tools, solutions, and techniques
This book takes you through every step involved in data analysis. It pairs Haskell with data modeling through carefully chosen examples that feature some of the most popular machine learning techniques.
You will begin by obtaining and cleaning data from various sources, and then learn how to use data structures such as trees and graphs. The core of the book covers statistical techniques, parallelism, concurrency, and machine learning algorithms, along with examples of visualizing and exporting results. By the end of the book, you will have the techniques you need to use Haskell effectively for data analysis.
What you will learn from this book
- Obtain and analyze raw data from various sources including text files, CSV files, databases, and websites
- Implement practical tree and graph algorithms on various datasets
- Apply statistical methods such as moving average and linear regression to understand patterns
- Write parallel and concurrent code to speed up and simplify time-consuming algorithms
- Find clusters in data using some of the most popular machine learning algorithms
- Manage results by visualizing or exporting data
Step-by-step recipes, filled with practical code samples and engaging examples, first demonstrate Haskell in practice and then explain the concepts behind the code.
Who this book is written for
This book shows functional developers and analysts how to apply their existing knowledge of Haskell to high-quality data analysis. A good understanding of data sets and functional programming is assumed.
Table of Contents
Chapter 1: The Hunt for Data
Chapter 2: Integrity and Inspection
Chapter 3: The Science of Words
Chapter 4: Data Hashing
Chapter 5: The Dance with Trees
Chapter 6: Graph Fundamentals
Chapter 7: Statistics and Analysis
Chapter 8: Clustering and Classification
Chapter 9: Parallel and Concurrent Design
Chapter 10: Real-time Data
Chapter 11: Visualizing Data
Chapter 12: Exporting and Presenting