Book

Data Cleaning and Exploration with Machine Learning

Data scientists spend 80% of their time cleaning and manipulating data and only 20% of their time analyzing it. Efforts put into cleaning data are crucial, since analyzing dirty data can lead to inaccurate decisions. This is a critically timed book that will help you identify, diagnose, and treat data cleaning problems in Python, with advanced ML techniques.

Offered byPackt Logo

Difficulty Level
Intermediate
Completion Time
18h4m approx.
Language
English
Certification
Not available

About Course

Book Content

chapters 18h4m total length

1. Examining the Distribution of Features and Targets
2. Examining Bivariate and Multivariate Relationships between Features and Targets
3. Identifying and Fixing Missing Values
4. Encoding, Transforming, and Scaling Features
5. Feature Selection
6. Preparing for Model Evaluation
7. Linear Regression Models
8. Support Vector Regression
9. K-Nearest Neighbor, Decision Tree, Random Forest and Gradient Boosted Regression
10. Logistic Regression
11. Decision Trees and Random Forest Classification
12. K-Nearest Neighbors for Classification
13. Support Vector Machine Classification
14. Naive Bayes Classification
15. Principal Component Analysis
16. K-Means and DBSCAN Clustering

On this page

Ready to Train Your Team?

Need training for your whole team? Get bulk pricing, LMS integration, and dedicated support.

Trusted by Leading Organizations Worldwide

Join thousands of companies that trust Calibr to power their learning and development initiatives.

Chalet Hotels logo
Pernod Ricard logo
ProMobi logo
Metrique logo
K Raheja Corp logo
Spyne.AI logo
VuNet Systems logo
Procurement Partners logo
vEngage.AI logo
1218 Global logo
TRADEJINI logo
Oben Electric logo
IIT STartups logo
EdTech Digit logo
MindSkillz logo
NewportMed logo

Request Access For Your Organization

Start training your team in minutes!

No credit card required

Related Resources