Book Content
chapters • 11h total length
1. Spark installation and configuration
2. Abstracting data with RDDs
3. Abstracting data with DataFrames
4. Preparing data for modeling
5. Machine Learning with MLLib
6. Machine Learning with ML module
7. Structured streaming with PySpark
8. GraphFrames - Graph Theory with PySpark














