A Configuration Management Study to Fast Massive Writing for Distributed NoSQL System

Xianqiang BAO  Nong XIAO  Yutong LU  Zhiguang CHEN  

IEICE TRANSACTIONS on Information and Systems   Vol.E99-D   No.9   pp.2269-2282
Publication Date: 2016/09/01
Online ISSN: 1745-1361
DOI: 10.1587/transinf.2016EDP7104
Type of Manuscript: PAPER
Category: Data Engineering, Web Information Systems
configuration management,  optimization,  massive writing,  HBase,  NoSQL,  

Full Text: PDF>>
Buy this Article

NoSQL systems have become vital components to deliver big data services due to their high horizontal scalability. However, existing NoSQL systems rely on experienced administrators to configure and tune the wide range of configurable parameters for optimized performance. In this work, we present a configuration management framework for NoSQL systems, called xConfig. With xConfig, its users can first identify performance sensitive parameters and capture the tuned parameters for different workloads as configuration policies. Next, based on tuned policies, xConfig can be implemented as the corresponding configuration optimiaztion system for the specific NoSQL system. Also it can be used to analyze the range of configurable parameters that may impact the runtime performance of NoSQL systems. We implement a prototype called HConfig based on HBase, and the parameter tuning strategies for HConfig can generate tuned policies and enable HBase to run much more efficiently on both individual worker node and entire cluster. The massive writing oriented evaluation results show that HBase under write-intensive policies outperforms both the default configuration and some existing configurations while offering significantly higher throughput.