NoSQL systems have grown in popularity for storing
big data because these systems offer high availability, i.e.,
operations with high throughput and low latency. However,
metadata in these systems is handled today in ad-hoc ways.
We present Wasef, a system that treats metadata in a NoSQL
database system as a first-class citizen. Metadata may include
information such as operational history for portions
of a database table (e.g., columns), placement information
for ranges of keys, and operational logs for data items (key-value
pairs). Wasef allows the NoSQL system to store and
query this metadata efficiently. We integrate Wasef into Apache
Cassandra, one of the most popular key-value stores. We then
implement three important use cases in Cassandra: dropping
columns in a flexible manner, verifying data durability during
migration operations such as node decommissioning, and
maintaining data provenance. Our experimental evaluation
uses AWS EC2 instances and YCSB workloads. Our results
show that Wasef: i) scales well with the size of the data and
the metadata; ii) affects throughput minimally, by only 9%;
and iii) affects operation latencies by only 3%.