Barcelona, Spain
October 25-28, 2016

Event Details

Please note: All times listed below are in Central Time Zone


OpenStack and Ceph Used in Large Scale Cancer Research Projects

The Collaboratory is a highly available OpenStack environment, scalable up to 3,000 cores and 15 PB of object storage; the talk will discuss the design goals and trade-offs considered in building it. The Collaboratory currently stores 500 TB of genomic data from the International Cancer Genome Consortium, and the dataset is expected to grow to 5 PB by 2018. Software optimized for Ceph storage was designed to authenticate users and grant data access only to those who are authorized. One early user of the Collaboratory is the PanCancer Analysis of Whole Genomes (PCAWG), one of the world's largest cancer data analysis initiatives, which explores the whole genomes of over 2,800 patients across 20 tumor types. This use case further drove the development of Dockstore for sharing workflows as Docker containers. The presentation will discuss PCAWG and other use cases on the Collaboratory, performance results and optimization, and lessons learnt from enabling large-scale cancer genomic research on OpenStack.

Wednesday, October 26, 4:30pm-4:44pm (2:30pm - 2:44pm UTC)
Difficulty Level: Intermediate
OICR
George is a Senior Cloud Architect in the Informatics and Bio-computing Program of the bioinformatics department at the Ontario Institute for Cancer Research (OICR), where he designs, builds and supports a large OpenStack/Ceph environment that enables cancer research. Having started with OpenStack during the Cactus release, he brings his expertise around cloud design,...