Book

Data Lake for Enterprises

The term 'Data Lake' has recently emerged as a prominent term in the big data industry. Data scientists can make use of it in deriving meaningful insights which can be used by businesses to redefine or transform the way they operate. Lambda architecture is also emerging as one of the very eminent patterns in the big data landscape, as it helps to derive useful information from not only the historical data but also correlates real-time data to enable business for taking critical decisions. This book tries to bring these two important aspects into one, namely data lake and lambda architecture.

Offered byPackt Logo

Difficulty Level

Intermediate

Completion Time

19h52m

Language

English

About Book

Who Is This Book For?

Java developers and architects who would like to implement a data lake for their enterprise will find this book useful. If you want to get hands-on experience with the Lambda Architecture and big data technologies by implementing a practical solution using these technologies, this book will also help you.

Book content

chapters 19h52m total length

Introduction to Data

Comprehensive Data Lake concepts

Lambda Architecture as a Pattern for Data Lake

Applied Lambda for Data Lake

Data Acquisition of Batch Date with Apache Sqoop

Data Acquisition of Stream Data with Apache Flume

Messaging Layer with Apache Kafka

Data Processing with Apache Flink

Data Storage using Apache Hadoop

Indexed Data Store

Data Lake components working together

Use case suggestions

Related Resources

Access Ready-to-Use Books for Free!

Get instant access to a library of pre-built books—free trial, no credit card required. Start training your team in minutes!

No credit card required