Book

Optimizing Databricks Workloads

This book takes a hands-on approach to speeding up Spark jobs and data processing, covering both the implementation and the methodologies behind it. Developers working with Databricks and Spark can put their knowledge to work with this practical guide to optimizing workloads.

Offered by Packt

Difficulty Level

Intermediate

Completion Time

7h40m

Language

English

About Book

Who Is This Book For?

This book is for data engineers, data scientists, and cloud architects who have a working knowledge of Spark/Databricks and a basic understanding of data engineering principles. Readers need a working knowledge of Python; some experience with SQL in PySpark and Spark SQL is beneficial.

Book content

8 chapters, 7h40m total length

Discovering Databricks

Batch and Real-Time Processing in Databricks

Learning about Machine Learning and Graph Processing in Databricks

Managing Spark Clusters

Big Data Analytics

Databricks Delta Lake

Spark Core

Case Studies
