Book

Python Data Cleaning Cookbook

The book shows you how to view data from multiple perspectives, including data frame and column attributes. It covers common and not-so-common challenges that arise when cleaning messy data in complex situations. You will learn to manipulate data into a form that supports sound decision-making.

Offered by Packt

Difficulty Level

Intermediate

Completion Time

14h32m

Language

English

About Book

Who Is This Book For?

This book is for anyone looking for ways to handle messy, duplicate, and poor-quality data using different Python tools and techniques. The book takes a recipe-based approach to help you learn how to clean and manage data. Working knowledge of Python programming is all you need to get the most out of the book.

Book content

chapters 14h32m total length

Anticipating Data Cleaning Issues when Importing Tabular Data into pandas

Anticipating Data Cleaning Issues when Importing HTML and JSON into pandas

Taking the Measure of Your Data

Identifying Issues in Subsets of Data

Using Visualizations for Exploratory Data Analysis

Cleaning and Wrangling Data with pandas Data Series Operations

Fixing Messy Data When Aggregating

Addressing Data Issues When Combining Data Frames

Tidying and Reshaping Data

User-Defined Functions and Classes to Automate Data Cleaning
