Event Details

Please note: All times listed below are in Central Time Zone

<< Go back

Efficient Ways to Run Big Data in OpenStack

Big Data

Customers are eager to put big data services into Cloud. There are many projects like Sahara, Cloudera Director, or Cloudbreak in the market to help the customers to implement Hadoop in the cloud. But performance is always the most critical issue when considering big data in OpenStack. In this presentation, we would like to teach you how to configure Hadoop/Spark in OpenStack. We will use some real customer cases to point out most of the issues that you may concern when running a real big data workload in OpenStack. We did lots of performance testing and would like to show you the results and the gaps between bare metal and virtualization. We also proposed several efficient ways including both OpenStack and Hadoop/Spark configuration to enhance the performance and reduce the gap to integrate big data services into OpenStack easier.

What can I expect to learn?

How to optimize big data workloads in Cloud environment?

Thursday, October 27, 9:50am-10:30am (7:50am - 8:30am UTC)

CCIB - Centre de Convencions Internacional de Barcelona - P1 - Room 120/121

View video

Difficulty Level: Intermediate

Tags: Architect Containers Enterprise User Cinder Manila Sahara UX

Jian Zhang

Software Engineer Manager

Jian Zhang is a senior software engineer manager at Intel, he and his team primarily focused on Open Source Storage development and optimizations on Intel platforms, and build reference solutions for customers. He has 10 years of experiences on performance analysis and optimization for many open source projects like Xen, KVM, Swift and Ceph, HDFS and benchmarking workloads like SPEC-*,... FULL PROFILE

Event Details

Registration Opening Soon