TY - GEN
T1 - Uncertainty annotated databases - A lightweight approach for approximating certain answers
AU - Feng, Su
AU - Huber, Aaron
AU - Glavic, Boris
AU - Kennedy, Oliver
N1 - Publisher Copyright: © 2019 Copyright held by the owner/author(s). Publication rights licensed to ACM.
PY - 2019/6/25
Y1 - 2019/6/25
N2 - Certain answers are a principled method for coping with uncertainty that arises in many practical data management tasks. Unfortunately, this method is expensive and may exclude useful (if uncertain) answers. Thus, users frequently resort to less principled approaches to resolve uncertainty. In this paper, we propose Uncertainty Annotated Databases (UA-DBs), which combine an under- and over-approximation of certain answers to achieve the reliability of certain answers, with the performance of a classical database system. Furthermore, in contrast to prior work on certain answers, UA-DBs achieve a higher utility by including some (explicitly marked) answers that are not certain. UA-DBs are based on incomplete K-relations, which we introduce to generalize the classical set-based notion of incomplete databases and certain answers to a much larger class of data models. Using an implementation of our approach, we demonstrate experimentally that it efficiently produces tight approximations of certain answers that are of high utility.
AB - Certain answers are a principled method for coping with uncertainty that arises in many practical data management tasks. Unfortunately, this method is expensive and may exclude useful (if uncertain) answers. Thus, users frequently resort to less principled approaches to resolve uncertainty. In this paper, we propose Uncertainty Annotated Databases (UA-DBs), which combine an under- and over-approximation of certain answers to achieve the reliability of certain answers, with the performance of a classical database system. Furthermore, in contrast to prior work on certain answers, UA-DBs achieve a higher utility by including some (explicitly marked) answers that are not certain. UA-DBs are based on incomplete K-relations, which we introduce to generalize the classical set-based notion of incomplete databases and certain answers to a much larger class of data models. Using an implementation of our approach, we demonstrate experimentally that it efficiently produces tight approximations of certain answers that are of high utility.
KW - Annotations
KW - Incomplete data
KW - Uncertain data
UR - https://www.scopus.com/pages/publications/85069442280
U2 - 10.1145/3299869.3319887
DO - 10.1145/3299869.3319887
M3 - Conference contribution
T3 - Proceedings of the ACM SIGMOD International Conference on Management of Data
SP - 1313
EP - 1330
BT - SIGMOD 2019 - Proceedings of the 2019 International Conference on Management of Data
PB - Association for Computing Machinery
T2 - 2019 International Conference on Management of Data, SIGMOD 2019
Y2 - 30 June 2019 through 5 July 2019
ER -