Skip to main content

Dr. Xiaobo Zhou

University of Colorado, Colorado Springs

Monday, November 21, 2016
12:30PM – HEC 450

Abstract

MapReduce, a parallel distributed programming model on clusters of commodity hardware, has emerged as the de facto standard for big data analytics. Big data analytics usually require distributed computing at scale, which is often hard to afford for small business and institutes. Large MapReduce clusters also suffer from issues like low cluster efficiency. Cloud computing, unlocked by virtualization technology, allows the creation of dynamic virtual clusters with elastic resource allocation. Moving MapReduce into the cloud seems to be the promising future of efficient and affordable big data analytics. However, the heterogeneity, high dynamics and multi-tenancy of the Cloud, and the semantic gap between the MapReduce runtime and the resource manager of the cloud platform make building elastic and efficient virtual MapReduce clusters very challenging.

This talk focuses on performance improvement, elasticity, and sustainability of BigData parallel and distributed processing in the cloud. In specific, it introduces FlexSlot, a flexible slot management for moving Hadoop into the Cloud. FlexSlot provides a user-transparent task slot management scheme that coordinates the slot-based resource management of Hadoop with underlying cloud infrastructure. Then, it presents Ant, an adaptive task tuning approach that aims to automatically find the optimal MapReduce parameter settings for individual tasks running on heterogeneous clusters. Furthermore, this talk discusses techniques for improving energy efficiency and sustainability of datacenter cloud computing. The talk will conclude with discussions for potential research collaborations in the related fields.

Biography

Dr. Xiaobo Zhou is a Professor and the Chair of the Department of Computer Science, University of Colorado, Colorado Springs. He leads the Distributed, Sustainable, and Cloud Computing Lab. His research lies broadly in computer network systems, more specifically, Cloud computing and Datacenters, BigData parallel and distributed processing, autonomic and sustainable computing, distributed systems and middleware, and scalable Internet services. His research has been continuously supported by the US National Science Foundation. He was a recipient of NSF CAREER Award 2009, and the University Faculty Award for Excellence in Research 2011. His work won the Best Paper Award of USENIX ICAC 2013 and a Best Paper Nominee of IEEE HPCA 2013. Dr. Zhou served as a Program Co-Chair of the IEEE/ACM CCGrid 2015 and IEEE ICCCN 2011, a General Co-Chair of the IEEE ICCCN 2014 and IEEE ICCCN 2012, a Program Vice Chair of IEEE/ACM CCGrid 2014, IEEE GLOBECOM 2010, ICCCN 2009, HPCC 2008, and IEEE/IFIP EUC 2008, and the Workshop General Chair of the IEEE ICCCN 2007 and IFIP EUC 2006. He served numerous international journals including ACM TAAS, JPDC, ComCom as a guest editor or associate editor. He obtained his BS, MS and PhD degrees in Computer Science from Nanjing University, China.