Book

Data Engineering with Python

This book is a comprehensive introduction to building data pipelines, that will have you moving and transforming data in no time. You'll learn how to build data pipelines, transform and clean data, and deliver it to provide value to users. You will learn to deploy production data pipelines that include logging, monitoring, and version control.

Offered byPackt Logo

Difficulty Level
Intermediate
Completion Time
11h52m approx.
Language
English
Certification
Not available

About Course

Book Content

chapters 11h52m total length

1. What is Data Engineering?
2. Building Our Data Engineering Infrastructure
3. Reading and Writing Files
4. Working with Databases
5. Cleaning, Transforming, and Enriching Data
6. Building a 311 Data Pipeline
7. Features of a Production Pipeline
8. Version Control Using the NiFi Registry
9. Monitoring and Logging Pipelines
10. Deploying your Pipelines
11. Building a Production Data Pipeline
12. Building a Kafka Cluster
13. Streaming Data with Apache Kafka
14. Data Processing with Apache Spark
15. Real-Time Edge Data with MiNiFi, Kafka, and Spark
16. Appendix

On this page

Ready to Train Your Team?

Need training for your whole team? Get bulk pricing, LMS integration, and dedicated support.

Trusted by Leading Organizations Worldwide

Join thousands of companies that trust Calibr to power their learning and development initiatives.

Chalet Hotels logo
Pernod Ricard logo
ProMobi logo
Metrique logo
K Raheja Corp logo
Spyne.AI logo
VuNet Systems logo
Procurement Partners logo
vEngage.AI logo
1218 Global logo
TRADEJINI logo
Oben Electric logo
IIT STartups logo
EdTech Digit logo
MindSkillz logo
NewportMed logo

Request Access For Your Organization

Start training your team in minutes!

No credit card required

Related Resources