Event Details

Please note: All times listed below are in Central Time Zone

<< Go back

The AI Thunderdome: using OpenStack to accelerate AI training with Sahara, Spark, and Swift

HPC / GPU / AI

OpenStack lends itself well to big data problems. With Swift and Ceph, data storage is easier than ever. One of the most consequential problems in the big data space is using AI to make sense of ever-increasing data volumes. OpenStack makes this a solvable problem: Data stored in Swift can be accessed by a Sahara cluster, which can use GPU instances to accelerate parallel AI hyperparameter tuning. This ability allows users to spin up and down huge AI training farms at a fraction of the manual effort, and in the end, isn't that what the cloud is all about?

What can I expect to learn?

Attendees will get an overview of:

The architecture pattern that has emerged of using Spark to accelerate AI training in parallel
Using OpenStack to build a Spark cluster to perform parallel AI training
Using Sahara to access data from Swift to perform the training

Tuesday, November 13, 11:00am-11:40am (10:00am - 10:40am UTC)

CityCube Berlin - Level 1 - Hall A3

Slides: The AI Thunderdome: using OpenStack to accelerate AI training with Sahara, Spark, and Swift

View video

Difficulty Level: Intermediate

Tags: Demo OpenStack TensorFlow Spark Technical Sahara Swift Ceph

Sean Pryor

Cloud Consultant

Sean is a cloud consultant and senior technologist at Red Hat. He has been working for multiple years on one of the largest ongoing telco deploys of openstack and has a passion for sane technology solutions FULL PROFILE