Book

Reproducible Data Science with Pachyderm

Pachyderm lets you build collaborative data science workflows and reproduce your experiments at scale. This book shows you how to use Pachyderm's data versioning and lineage features to build scalable end-to-end AI/ML pipelines, deploy Pachyderm on leading cloud platforms, and use its SaaS offering, PachHub.

Difficulty Level

Intermediate

Completion Time

12h8m approx.

Language

English

Certification

Not available

Book Content

11 chapters • 12h8m total length

1. The Problem of Data Reproducibility

2. Pachyderm Basics

3. Pachyderm Pipeline Specification

4. Installing Pachyderm Locally

5. Installing Pachyderm on a Cloud Platform

6. Creating Your First Pipeline

7. Pachyderm Operations

8. Creating an End-to-End Machine Learning Workflow

9. Distributed Hyperparameter Tuning with Pachyderm

10. Pachyderm Language Clients

11. Using Pachyderm Notebooks
