Frank Kane's Taming Big Data with Apache Spark and Python
Frank Kane’s Taming Big Data with Apache Spark and Python is a hands-on companion for learning Apache Spark. Frank starts by showing you how to set up Spark on a single system or on a cluster. You then move on to analyzing large data sets with Spark RDDs, and to developing and running Spark jobs quickly using Python.
Difficulty Level
Intermediate
Completion Time
9h52m
Language
English
About Book
Who Is This Book For?
If you are a data scientist or data analyst who wants to learn Big Data processing using Apache Spark and Python, this book is for you. It will also help you if you have some programming experience in Python and want to learn how to process large amounts of data using Apache Spark.
Book Content
9h52m total length
Getting Started with Spark
Spark Basics and Simple Examples
Advanced Examples of Spark Programs
Running Spark on a Cluster
SparkSQL, DataFrames and Datasets
Other Spark Technologies and Libraries
Where to Go From Here? - Learning More About Spark and Data Science