COLA (A Cloud-Based System for Online Aggregation)
Cloud Group, WAMDM, Renmin University of China
 

 
Background
 
    Online aggregation is a promising way to deliver fast, early responses to interactive ad-hoc queries that compute aggregates over massive data. MapReduce has become a popular paradigm for processing large datasets on large-scale computing clusters and is used in many data analysis applications. However, typical MapReduce implementations are geared towards batch processing, which makes them a poor fit for interactive analytic tasks. As ad-hoc analytic queries over enormous datasets grow more common, processing aggregate queries with MapReduce in an online fashion has become an important emerging need. We present COLA, a MapReduce-based online aggregation system that provides progressive approximate aggregate answers for both single tables and multiple joined tables. COLA's online aggregation execution engine uses novel sampling techniques to support incremental, continuous computation of aggregates and to minimize the waiting time before an acceptably precise estimate is available. COLA also supports user-friendly SQL queries. Furthermore, COLA can implicitly convert non-OLA (non-online-aggregation) jobs into online versions, so users do not have to write any special-purpose code to obtain estimates.
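
    To make the idea of a progressive approximate answer concrete, here is a minimal single-machine sketch in Python. It is not COLA's implementation and does not use MapReduce; the function name online_average and its parameters are hypothetical. It assumes the rows can be scanned in random order, so that every prefix of the scan is a uniform random sample, and it reports a running AVG with an approximate 95% confidence interval based on the normal approximation (ignoring the finite-population correction).

```python
import math
import random


def online_average(values, batch_size=1000, z=1.96, seed=0):
    """Yield (rows_seen, estimate, half_width) after each batch of a random scan.

    Illustrative only: the names and the normal-approximation interval are
    assumptions of this sketch, not part of COLA.
    """
    rng = random.Random(seed)
    order = list(values)
    rng.shuffle(order)  # random order makes every prefix a uniform random sample

    n = 0
    mean = 0.0
    m2 = 0.0  # running sum of squared deviations (Welford's method)
    for i, x in enumerate(order, start=1):
        n += 1
        delta = x - mean
        mean += delta / n
        m2 += delta * (x - mean)
        # Report an estimate at the end of every batch and at the end of the scan.
        if i % batch_size == 0 or i == len(order):
            variance = m2 / (n - 1) if n > 1 else 0.0
            half_width = z * math.sqrt(variance / n)
            yield n, mean, half_width


if __name__ == "__main__":
    data = range(1, 10001)
    for rows, estimate, half_width in online_average(data):
        print(f"{rows:>6} rows: AVG ~ {estimate:.1f} +/- {half_width:.1f}")
```

    A user watching this output could stop the query as soon as the interval is narrow enough, which is the waiting time that online aggregation aims to shorten.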

 
System
 
    COLA is a cloud-based system for online aggregation, developed at Renmin University of China. COLA provides progressive approximate answers for both single tables and multiple joined tables. Here is an overview of the architecture of COLA.

 
Features
 

    As a native cloud-based system for online aggregation, COLA has the following features:

  1. Supports progressive approximate aggregate answers for both single tables and multiple joined tables
  2. Implicitly converts non-OLA jobs into online versions
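
    The second feature can be illustrated with a small Python sketch. This is only one way such a conversion could look, not COLA's actual mechanism or API; make_online, map_price, combine, and finalize are hypothetical names. The sketch assumes the job's aggregate can be expressed as a per-record map step plus an associative combine step, and that input blocks arrive in random order so that each partial result is a meaningful estimate.

```python
def make_online(map_fn, combine_fn, finalize_fn, initial):
    """Wrap a batch-style job so it reports an estimate after every block.

    Hypothetical helper for illustration; it is not part of COLA.
    """
    def run(blocks):
        state = initial
        for block in blocks:
            for record in block:
                state = combine_fn(state, map_fn(record))
            # Emit a snapshot of the aggregate over all blocks seen so far.
            yield finalize_fn(state)
    return run


# A plain batch job for AVG(price), written with no online-specific code.
def map_price(record):
    return (record["price"], 1)


def combine(a, b):
    return (a[0] + b[0], a[1] + b[1])


def finalize(state):
    total, count = state
    return total / count if count else None


blocks = [[{"price": 10.0}, {"price": 20.0}], [{"price": 30.0}]]
online_avg = make_online(map_price, combine, finalize, (0.0, 0))
snapshots = list(online_avg(blocks))  # [15.0, 20.0]
```

    The user-written functions are unchanged from what a batch job would need; only the wrapper knows that intermediate results are being reported.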

The CloudDB Team
 

Faculty
Xiaofeng Meng (xfmeng@ruc.edu.cn)
Yunpeng Chai (ypchai@ruc.edu.cn)

Ph.D.
Youzhong Ma (ma_youzhong@163.com)
Xiang Ci (cixiang31415926@126.com)
Chunkai Wang (chunkai_wang@163.com)

Master
Xu Han (hanxumelody@sina.com)
Chunqiu Liu (ae.liushuai@gmail.com)
Yu Zhang (zhang_yu_90@126.com)
Yantao Gan (ganyantao19901018@163.com)
Fengming Wang (wangfengmingqq@163.com)
Hehan Li (li_he_han@126.com)

Undergraduate Students
Wei Zhang (Senior)
Yulei Niu (Junior)

Graduated Students
Yingjie Shi (shiyingjie1983@yahoo.com.cn)
Zhongyuan Wang (zhywangchina@163.com), now at Microsoft Research Asia (MSRA)
Jing Zhao (zmfeiyinggtxy@163.com)
Xiangmei Hu (huxiangmei2004@126.com)
Bingbing Liu (liubingbing@ruc.edu.cn)
Haiping Wang (lulang1022@yahoo.com.cn)
Long Liu (lulong3999@gmail.com)

 
WAMDM, Renmin University of China. All Rights Reserved. CloudDB. Last Updated: 2013/11/05