Databases store ever more data, which makes scanning every record costly. Users nevertheless want to query their data to gain some understanding of it. In this paper we propose three sampling-based methods to estimate the total value of one attribute over a particular group of records in a data set. Each method first approximates the number of records belonging to the group and then estimates the mean value of the attribute within that group. Multiplying these two estimates yields an estimate of the total amount the group contributes. We also argue the correctness of the proposed algorithms and evaluate them in practice by comparing them on real data.
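The two-step estimate described above (group size times group mean) can be illustrated with a minimal sketch. It assumes plain uniform sampling without replacement and is not one of the three methods proposed in the paper. The function name `estimate_group_total`, the `(group_label, value)` record layout, and the `seed` parameter are hypothetical choices made for illustration only.

```python
import random


def estimate_group_total(records, group, sample_size, seed=None):
    """Estimate (count, mean, total) for one group from a uniform sample.

    Assumption: `records` is a list of (group_label, value) tuples and
    `sample_size` does not exceed len(records).
    """
    rng = random.Random(seed)
    # Uniform sample without replacement (an illustrative choice).
    sample = rng.sample(records, sample_size)
    in_group = [value for label, value in sample if label == group]
    if not in_group:
        # The sample contained no record of the group, so nothing can be estimated.
        return 0.0, 0.0, 0.0
    # Step 1: scale the sampled fraction up to the full data set size.
    est_count = len(records) * len(in_group) / sample_size
    # Step 2: mean of the attribute over the sampled group members.
    est_mean = sum(in_group) / len(in_group)
    # Step 3: the group's estimated total is the product of the two estimates.
    return est_count, est_mean, est_count * est_mean
```

When the sample covers the whole data set, the estimate coincides with the exact count, mean, and total. With smaller samples it is only an approximation whose quality depends on how many group members happen to fall in the sample.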