Book

Building Big Data Pipelines with Apache Beam

This book describes both batch processing and real-time processing pipelines. You’ll learn how to implement basic and advanced big data use cases with ease and develop a deep understanding of the Apache Beam model. In addition to this, you’ll discover how the portability layer works and the building blocks of an Apache Beam runner.

Offered byPackt Logo

Difficulty Level

Intermediate

Completion Time

11h24m

Language

English

About Book

Who Is This Book For?

This book is for data engineers, data scientists, and data analysts who want to learn how Apache Beam works. Intermediate-level knowledge of the Java programming language is assumed.

Book content

chapters 11h24m total length

Introduction to Data Processing with Apache Beam

Implementing, Testing, and Deploying Basic Pipelines

Implementing Pipelines Using Stateful Processing

Structuring Code for Reusability

Using SQL for Pipeline Implementation

Using Your Preferred Language with Portability

Extending Apache Beam's I/O Connectors

Understanding How Runners Execute Pipelines

Related Resources

Access Ready-to-Use Books for Free!

Get instant access to a library of pre-built books—free trial, no credit card required. Start training your team in minutes!

No credit card required