Book

Learning Apache Spark 2

Apache Spark is one of the most popular Big Data processing frameworks today, delivering speed, accuracy and real-time results – all in one solution. With this book, you will delve into the world of Apache Spark and learn about the new features introduced in Spark 2, along with its architecture and associated concepts. This comprehensive beginner's guide to Apache Spark 2 covers what you need to get up and running with Big Data processing, machine learning and stream processing, and explains each of these concepts through real-world examples.

Offered by Packt

Difficulty Level

Intermediate

Completion Time

11h52m

Language

English

About Book

Who Is This Book For?

This book is intended for aspiring Big Data professionals and anyone who wants to get started with Apache Spark for Big Data processing and analytics. If you’ve worked with Apache Spark before and want to familiarize yourself with the new features introduced in Spark 2, this book will also help you. A fundamental understanding of Big Data concepts and some knowledge of Scala programming are required to get the best out of this book.

Book content

11 chapters, 11h52m total length

Getting Started – Architecture & Installation

Transformations and Actions with Spark RDDs

ETL with Spark

Spark SQL

Spark Streaming

Machine Learning with Spark

GraphX

Operating in Clustered Mode

Building a Recommendation System

Predicting Customer Churn

There's More with Spark
