Book

Frank Kane's Taming Big Data with Apache Spark and Python

Frank Kane’s Taming Big Data with Apache Spark and Python is a hands-on companion for learning Apache Spark. It starts by showing you how to set up Spark on a single system or on a cluster, then moves on to analyzing large data sets with Spark RDDs and to developing and running Spark jobs in Python.

Offered by Packt

Difficulty Level

Intermediate

Completion Time

9h52m

Language

English

About Book

Who Is This Book For?

If you are a data scientist or data analyst who wants to learn Big Data processing using Apache Spark and Python, this book is for you. It will also help if you have some programming experience in Python and want to learn how to process large amounts of data with Apache Spark.

Book content

7 chapters, 9h52m total length

Getting Started with Spark

Spark Basics and Simple Examples

Advanced Examples of Spark Programs

Running Spark on a Cluster

SparkSQL, Dataframes and Datasets

Other Spark Technologies and Libraries

Where to Go From Here? - Learning More About Spark and Data Science
