Book Content
chapters • 12h44m total length
1. Introduction
2. Storage
3. Processing - MapReduce and Beyond
4. Real Time Computation with Samza
5. Iterative Computation with Spark
6. Data Analysis with Apache Pig
7. Hadoop and SQL
8. Data Lifecycle Management
9. Making Development Easier
10. Running a Hadoop Cluster
11. Where to go Next














