Sponsors

Preliminary Program

Day 1: Friday, June 22, 2012
08:00 AM Registration
8:30-8:40 AM Welcome by the Chair Xiaofeng Meng
(Renmin University, XLDB Asia 2012 chair)
8:40-8:45 AM History of XLDB Kian-Tat Lim
(SLAC National Laboratory, Stanford University)
REFERENCE CASES FROM SCIENTIFIC COMMUNITIES
SESSION CHAIR: KIAN-TAT LIM
8:45-9:10 AM Extreme Data-Intensive Scientific Computing pdf Alexander Szalay
(John Hopkins University)
9:10-9:35 AM Toward Derivation, Management, and Analysis of Exascale Feature Sets pdf Joel Saltz
(Emory University)
9:35-10:00 AM Coffee Break
10:00-10:25 AM Extremely Large Databases and the Large Synoptic Survey Telescope pdf Kian-Tat Lim
(SLAC National Laboratory, Stanford University)
10:25-10:50 AM Data Intensive Astronomy and Astroinformatics pdf Chenzhou Cui
(National Astronomical Observatories, Chinese Academy of Sciences)
10:50-11:15 AM Data Intensive Study in Geoinformatics pdf Lizhe Wang
(The Center of Earth Observation and Digital Earth, Chinese Academy of Sciences)
11:15-12:00 AM Panel Discussion: The Challenge and Requirements for Handling Extremely Large Scientific Data Moderator: Alexander Szalay
12:10-1:30 PM Lunch
REFERENCE CASES FROM INDUSTRY
SESSION CHAIR: TOMASZ NYKIEL
1:30-1:55 PM The Case of Hadoop and BigData in Rakuten, the Largest e-Commerce Company in Japan Masaya Mori
(Rakuten Institute of Technology, Japan)
1:55-2:20 PM Distributed Online Machine Learning Framework for Big Data Shohei Hido
(Preferred Infrastructure, Inc., Japan)
2:20-2:45 PM Towards Industry Standard Benchmarks for Big Data Milind Bhandarkar
(EMC Greenplum Labs, USA)
2:45-3:10 PM Google Storage Architecture and Challenges Xuemei Gu
(Google Site Beijing)
3:10-3:40 PM Coffee Break
3:40-4:05 PM Scalability Challenges: Hadoop Distributed File System at Facebook Tomasz Nykiel
(Facebook, USA)
4:05-4:30 PM Oceanbase: a Scalable Relational Database Zhenkun Yang
(Taobao, China)
4:30-4:55 PM Extreme Analytics at Ebay Eddy Cai
(eBay, China)
4:55-5:45 PM Panel Discussion: NoSQL: the Cure for Big Data? Moderator: Milind Bhandarkar

Day 2: Saturday, June 23, 2012
08:00 AM Registration
8:30 AM Announcements
Research on Big Data Management
Session Chair: Martin Kersten
8:30-8:55 AM Integrating Extremely Large Data is Extremely Challenging Laura Haas
(IBM Almaden Research Center)
8:55-9:20 AM A scale-out Model for Big Data Software Development in Distributed Systems Xiaodong Zhang
(Ohio State University)
9:20-9:45 AM Managing and Mining Billion-Node Graphs Haixun Wang
(Microsoft Research Asia)
9:45-10:15 AM Coffee Break
10:15-10:40 AM Arrays in Database Systems, the Next Frontier? Martin Kersten
(CWI, Netherlands)
10:40-11:40 AM Panel Discussion: Evolution or Revolution: Database Research for Big Data Moderator: Laura Haas
12:00-1:30 PM Lunch
LIGHTNING TALKS
1:30-2:30 PM Lightning Talks I (Session Chair: Weisong Shi)
Streamlining Processing for Big Data (SPBD): a Case Study on Metagenomics Software
Weisong Shi (Wayne State University, USA)
Discovering Events from Satellite Data
Hideyuki Kawashima (University of Tsukuba, Japan)
SciQL, Bridging the Gap Between Science and Relational DBMS
Jennie Zhang (CWI, Netherlands)
Jacqueline: JSON/JAQL for MonetDB
Fabian Groffen (CWI, Netherlands)
Massive Data Collection for Your BigData with Fluentd (A Ruby Tool)
Abhishek Parolkar (viki.com, Singapore)
2:30-3:30PM Lightning Talks (II) (Session Chair: Jan-Jan Wu)
A Highly Scalable Cloud Database for Massive Multi-User Query Processing
Jan-Jan Wu (Academia Sinica, Taiwan)
TaijiDB: A Titanic and Just-in-Time DB
Long Liu (Remin University, China)
COLA: A Cloud-based On-Line Aggregation System
Yingjie Shi (Remin University, China)
LUMOS: An Extensive Data Cloud Platform for Big Data Analytics
Jidong Chen (EMC, China)
Lustre, Scalable Storage for Exascale
Liang Zhen(Whamcloud, China)
3:30-4:10 PM Coffee Break and Poster Discussion
4:10-4:20 PM Closeout and Future
4:20-5:00 PM Program Committee Meeting