Barcelona, Spain
October 25-28, 2016

Event Details

Please note: All times listed below are in Central Time Zone


Big Data and Machine Learning on OpenStack Backed by Nova-LXD

Hadoop was built with bare metal in mind - get your commodity hardware, stick hadoop on it and let YARN do all the hard work managing resources. However, Big Data software deployments on other substrates such as AWS (ec2 and EMR), AZURE, GCE are gaining popularity. We look at the challenges and a relevant solution related to deploying big data software in an OpenStack cloud.  Perhaps most interestingly, we discuss and demonstrate what it looks like to run a machine learning job with Nova-LXD in that cloud to address data locality issues in virtualized environments, and to demonstrate that hypervisor overhead does not necessarily hinder Big/Fast Data processing.


What can I expect to learn?

How to quickly and easily deploy a big data stack in an openstack cloud.

How we can use Big Data tools to run a machine learning job on OpenStack logs and detect anomalies such as unusual user login location - and scaling to handle increased traffic.

How Nova-LXD mitigates technical concerns about data-locality and hypervisor overhead in a virtualized environment.

Spark anomoly detection.

Tuesday, October 25, 5:55pm-6:35pm (3:55pm - 4:35pm UTC)
Difficulty Level: Intermediate
Big Data Software Engineer
Openstack Dev and Test with Devops and Networking Background FULL PROFILE
OpenStack Engineering Mgr
As the Engineering Manager at Canonical, Ryan leads a global team of open-source software developers who focus on delivering upgradable, supportable distribution methodologies and lifecycle automation tooling surrounding OpenStack, Ceph, and other open-source software infrastructure projects.  He and his team produce and maintain the Ubuntu Cloud Archive, one of the long-standing primary... FULL PROFILE
Comments
2 Reviews
0
Posted: 3087 days ago
Because :)
Posted: 3088 days ago
"Well done presentation. "