Book

Distributed Data Systems with Azure Databricks

This book teaches you to extract, transform, and orchestrate massive amounts of data to develop robust data pipelines. You'll perform complex machine learning tasks using advanced Azure Databricks features, and explore model tuning, deployment, and control using Databricks functionality such as AutoML and Delta Lake with TensorFlow.

Offered by Packt

Difficulty Level

Intermediate

Completion Time

13h48m

Language

English

About Book

Who Is This Book For?

This book is for software engineers, machine learning engineers, data scientists, and data engineers who are new to Azure Databricks and want to build high-quality data pipelines without worrying about infrastructure. Knowledge of Azure Databricks basics is required to learn the concepts covered in this book more effectively. A basic understanding of machine learning concepts and beginner-level Python programming knowledge are also recommended.

Book content

12 chapters, 13h48m total length

Introduction to Azure Databricks core concepts

Creating an Azure Databricks workspace

Creating an ETL with Databricks

Delta Lake with Databricks

Introducing Delta Engine

Structured Streaming

Azure Databricks integration with popular Python libraries

Databricks Runtime for Machine Learning

Databricks Runtime for Deep Learning

Model tuning, deployment, and control using Databricks AutoML

MLflow on Azure Databricks

Distributed Deep Learning with Horovod
