Book

Apache Spark Quick Start Guide

Apache Spark is a ?exible in-memory framework that allows processing of both batch and real-time data. Its unified engine has made it quite popular for big data use cases. This book will help you to quickly get started with Apache Spark 2.0 and write efficient big data applications for a variety of use cases.

Offered byPackt Logo

Difficulty Level

Intermediate

Completion Time

5h8m

Language

English

About Book

Who Is This Book For?

If you are a big data enthusiast and love processing huge amount of data, this book is for you. If you are data engineer and looking for the best optimization techniques for your Spark applications, then you will find this book helpful. This book also helps data scientists who want to implement their machine learning algorithms in Spark. You need to have a basic understanding of any one of the programming languages such as Scala, Python or Java.

Book content

chapters 5h8m total length

Introduction to Apache Spark

Apache Spark Installation

Spark RDD

Spark DataFrame and Dataset

Spark Architecture and Application Execution Flow

Spark SQL

Spark Streaming, Machine Learning, and Graph Analysis

Spark Optimizations

Related Resources

Access Ready-to-Use Books for Free!

Get instant access to a library of pre-built books—free trial, no credit card required. Start training your team in minutes!

No credit card required