ID: cs/0312043

On A Theory of Probabilistic Deductive Databases

December 18, 2003

View on ArXiv
Laks V. S. Lakshmanan, Fereidoon Sadri
Computer Science
Databases

We propose a framework for modeling uncertainty where both belief and doubt can be given independent, first-class status. We adopt probability theory as the mathematical formalism for manipulating uncertainty. An agent can express the uncertainty in her knowledge about a piece of information in the form of a confidence level, consisting of a pair of intervals of probability, one for each of her belief and doubt. The space of confidence levels naturally leads to the notion of a trilattice, similar in spirit to Fitting's bilattices. Intuitively, thep oints in such a trilattice can be ordered according to truth, information, or precision. We develop a framework for probabilistic deductive databases by associating confidence levels with the facts and rules of a classical deductive database. While the trilattice structure offers a variety of choices for defining the semantics of probabilistic deductive databases, our choice of semantics is based on the truth-ordering, which we find to be closest to the classical framework for deductive databases. In addition to proposing a declarative semantics based on valuations and an equivalent semantics based on fixpoint theory, we also propose a proof procedure and prove it sound and complete. We show that while classical Datalog query programs have a polynomial time data complexity, certain query programs in the probabilistic deductive database framework do not even terminate on some input databases. We identify a large natural class of query programs of practical interest in our framework, and show that programs in this class possess polynomial time data complexity, i.e., not only do they terminate on every input database, they are guaranteed to do so in a number of steps polynomial in the input database size.

Similar papers 1

Empirical Probabilities in Monadic Deductive Databases

March 13, 2013

89% Match
Raymond T. Ng, V. S. Subrahmanian
Artificial Intelligence
Databases

We address the problem of supporting empirical probabilities in monadic logic databases. Though the semantics of multivalued logic programs has been studied extensively, the treatment of probabilities as results of statistical findings has not been studied in logic programming/deductive databases. We develop a model-theoretic characterization of logic databases that facilitates such a treatment. We present an algorithm for checking consistency of such databases and prove its ...

Find SimilarView on arXiv

Probabilistic Reasoning at Scale: Trigger Graphs to the Rescue

April 11, 2023

88% Match
Efthymia Tsamoura, Jaehun Lee, Jacopo Urbani
Databases

The role of uncertainty in data management has become more prominent than ever before, especially because of the growing importance of machine learning-driven applications that produce large uncertain databases. A well-known approach to querying such databases is to blend rule-based reasoning with uncertainty. However, techniques proposed so far struggle with large databases. In this paper, we address this problem by presenting a new technique for probabilistic reasoning that...

Find SimilarView on arXiv

Securing Databases from Probabilistic Inference

June 8, 2017

88% Match
Marco Guarnieri, Srdjan Marinovic, David Basin
Cryptography and Security
Databases

Databases can leak confidential information when users combine query results with probabilistic data dependencies and prior knowledge. Current research offers mechanisms that either handle a limited class of dependencies or lack tractable enforcement algorithms. We propose a foundation for Database Inference Control based on ProbLog, a probabilistic logic programming language. We leverage this foundation to develop Angerona, a provably secure enforcement mechanism that preven...

Find SimilarView on arXiv

A multivalued knowledge-base model

March 8, 2010

88% Match
Agnes Achs
Artificial Intelligence

The basic aim of our study is to give a possible model for handling uncertain information. This model is worked out in the framework of DATALOG. At first the concept of fuzzy Datalog will be summarized, then its extensions for intuitionistic- and interval-valued fuzzy logic is given and the concept of bipolar fuzzy Datalog is introduced. Based on these ideas the concept of multivalued knowledge-base will be defined as a quadruple of any background knowledge; a deduction mecha...

Find SimilarView on arXiv

Challenges for Efficient Query Evaluation on Structured Probabilistic Data

July 19, 2016

87% Match
Antoine Amarilli, Silviu Maniu, Mikaël Monet
Databases

Query answering over probabilistic data is an important task but is generally intractable. However, a new approach for this problem has recently been proposed, based on structural decompositions of input databases, following, e.g., tree decompositions. This paper presents a vision for a database management system for probabilistic data built following this structural approach. We review our existing and ongoing work on this topic and highlight many theoretical and practical c...

Find SimilarView on arXiv

Probabilities on Sentences in an Expressive Logic

September 12, 2012

87% Match
Marcus Hutter, John W. Lloyd, ... , Uther William T. B.
Logic in Computer Science
Artificial Intelligence
Machine Learning
Logic
Probability

Automated reasoning about uncertain knowledge has many applications. One difficulty when developing such systems is the lack of a completely satisfactory integration of logic and probability. We address this problem directly. Expressive languages like higher-order logic are ideally suited for representing and reasoning about structured knowledge. Uncertain knowledge can be modeled by using graded probabilities rather than binary truth-values. The main technical problem studie...

Find SimilarView on arXiv

Reasoning with Probabilistic Logics

May 5, 2014

87% Match
Riccardo Zese
Artificial Intelligence

The interest in the combination of probability with logics for modeling the world has rapidly increased in the last few years. One of the most effective approaches is the Distribution Semantics which was adopted by many logic programming languages and in Descripion Logics. In this paper, we illustrate the work we have done in this research field by presenting a probabilistic semantics for description logics and reasoning and learning algorithms. In particular, we present in d...

Find SimilarView on arXiv

Structurally Tractable Uncertain Data

July 17, 2015

87% Match
Antoine Amarilli
Databases

Many data management applications must deal with data which is uncertain, incomplete, or noisy. However, on existing uncertain data representations, we cannot tractably perform the important query evaluation tasks of determining query possibility, certainty, or probability: these problems are hard on arbitrary uncertain input instances. We thus ask whether we could restrict the structure of uncertain data so as to guarantee the tractability of exact query evaluation. We prese...

Find SimilarView on arXiv

Towards Precision of Probabilistic Bounds Propagation

March 13, 2013

87% Match
Helmut Thone, Ulrich Guntzer, Werner Kiessling
Artificial Intelligence

The DUCK-calculus presented here is a recent approach to cope with probabilistic uncertainty in a sound and efficient way. Uncertain rules with bounds for probabilities and explicit conditional independences can be maintained incrementally. The basic inference mechanism relies on local bounds propagation, implementable by deductive databases with a bottom-up fixpoint evaluation. In situations, where no precise bounds are deducible, it can be combined with simple operations re...

Find SimilarView on arXiv

Anytime Decision Making with Imprecise Probabilities

February 27, 2013

87% Match
Michael Pittarelli
Artificial Intelligence

This paper examines methods of decision making that are able to accommodate limitations on both the form in which uncertainty pertaining to a decision problem can be realistically represented and the amount of computing time available before a decision must be made. The methods are anytime algorithms in the sense of Boddy and Dean 1991. Techniques are presented for use with Frisch and Haddawy's [1992] anytime deduction system, with an anytime adaptation of Nilsson's [1986] pr...

Find SimilarView on arXiv