Book

Spark for Data Science

Offered byPackt Logo

Difficulty Level

Intermediate

Completion Time

11h28m

Language

English

About Book

Who Is This Book For?

This book is for anyone who wants to leverage Apache Spark for data science and machine learning. If you are a technologist who wants to expand your knowledge to perform data science operations in Spark, or a data scientist who wants to understand how algorithms are implemented in Spark, or a newbie with minimal development experience who wants to learn about Big Data Analytics, this book is for you!

Book content

chapters 11h28m total length

Big Data and Data Science - An introduction

Spark Programming Model

Introduction to DataFrames

Unified Data Access

Data Analysis on Spark

Machine Learning

Extending Spark with SparkR

Analyzing Unstructured Data

Visualizing Big Data

Putting it all together

Building Data Science applications

Related Resources

Access Ready-to-Use Books for Free!

Get instant access to a library of pre-built books—free trial, no credit card required. Start training your team in minutes!

No credit card required