Distributed Data Systems with Azure Databricks
This book helps you to learn how to extract, transform, and orchestrate massive amounts of data to develop robust data pipelines. You'll perform complex machine learning tasks using advanced Azure Databricks features, and also explore model tuning, deployment, and control using Databricks functionalities such as AutoML and Delta Lake with TensorFlow.
Offered by
Difficulty Level
Intermediate
Completion Time
13h48m
Language
English
About Book
Who Is This Book For?
This book is for software engineers, machine learning engineers, data scientists, and data engineers who are new to Azure Databricks and want to build high-quality data pipelines without worrying about infrastructure. Knowledge of Azure Databricks basics is required to learn the concepts covered in this book more effectively. A basic understanding of machine learning concepts and beginner-level Python programming knowledge is also recommended.
Distributed Data Systems with Azure Databricks
- About Book
- Who Is This Book For?
- Book Content
Book content
chapters • 13h48m total length
Introduction to Azure Databricks core concepts
Creating an Azure Databricks workspace
Creating an ETL with Databricks
Delta Lake with Databricks
Introducing Delta Engine
Structured Streaming
Azure Databricks integration with Popular Python Libraries
Databricks Runtime for Machine Learning
Databricks Runtime for Deep Learning
Model tuning, deployment and control Using DataBricks AutoML
MLFlow on Azure Databricks
Distributed Deep Learning with Horovod
Related Resources
Access Ready-to-Use Books for Free!
Get instant access to a library of pre-built books—free trial, no credit card required. Start training your team in minutes!