Cody’s Data Cleaning Techniques Using SAS, 3rd Edition Front Cover

Cody’s Data Cleaning Techniques Using SAS, 3rd Edition

  • Length: 234 pages
  • Edition: 3
  • Publisher:
  • Publication Date: 2017-03-15
  • ISBN-10: 1629607967
  • ISBN-13: 9781629607962
  • Sales Rank: #405592 (See Top 100 Books)
Description

Find errors and clean up data easily using SAS!

Thoroughly updated, Cody’s Data Cleaning Techniques Using SAS, Third Edition, addresses tasks that nearly every data analyst needs to do – that is, make sure that data errors are located and corrected. Written in Ron Cody’s signature informal, tutorial style, this book develops and demonstrates data cleaning programs and macros that you can use as written or modify which will make your job of data cleaning easier, faster, and more efficient.

Building on both the author’s experience gained from teaching a data cleaning course for over 10 years, and advances in SAS, this third edition includes four new chapters, covering topics such as the use of Perl regular expressions for checking the format of character values (such as zip codes or email addresses) and how to standardize company names and addresses.

With this book, you will learn how to:

  • find and correct errors in character and numeric values
  • develop programming techniques related to dates and missing values
  • deal with highly skewed data
  • develop techniques for correcting your data errors
  • use integrity constraints and audit trails to prevent errors from being added to a clean data set

Table of Contents

Chapter 1: Working with Character Data
Chapter 2: Using Perl Regular Expressions to Detect Data Errors
Chapter 3: Standardizing Data
Chapter 4: Data Cleaning Techniques for Numeric Data
Chapter 5: Automatic Outlier Detection for Numeric Data
Chapter 6: More Advanced Techniques for Finding Errors in Numeric Data
Chapter 7: Describing Issues Related to Missing and Special Values (Such as 999)
Chapter 8: Working with SAS Dates
Chapter 9: Looking for Duplicates and Checking Data with Multiple Observations per Subject
Chapter 10: Working with Multiple Files
Chapter 11: Using PROC COMPARE to Perform Data Verification
Chapter 12: Correcting Errors
Chapter 13: Creating Integrity Constraints and Audit Trails
Chapter 14: A Summary of Useful Data Cleaning Macros

To access the link, solve the captcha.