Book

Data Processing with Optimus

Data Processing with Optimus teaches you how to load, clean, and transform data with Optimus. This step-by-step guide covers preparing data for key data science tasks such as machine learning, analytics, feature engineering, and reporting, so that you can build end-to-end, real-world applications.

Offered by Packt

Difficulty Level

Intermediate

Completion Time

10h

Language

English

About Book

Who Is This Book For?

This book is for Python developers who want to explore, transform, and prepare big data for machine learning, analytics, and reporting using Optimus, a unified API for working with Pandas, Dask, cuDF, Dask-cuDF, Vaex, and Spark. Beginner-level knowledge of Python is helpful but not required. Basic knowledge of the CLI is required to install Optimus and its requirements. To use the GPU technologies, you'll need an NVIDIA graphics card supported by NVIDIA's RAPIDS library, which runs on Windows 10 and Linux.

Book content

11 chapters, 10h total length

Hi Optimus!

Data Loading, Saving, and File Formats

Data Wrangling

Combining, Reshaping, and Aggregating Data

Data Visualization and Profiling

String Clustering

Feature Engineering

Machine Learning

Natural Language Processing

Hacking Optimus

Optimus as a Web Service
