TY - JOUR
AU - Choi, Jaesik
AU - de Salvo Braz, Rodrigo
AU - Bui, Hung
PY - 2011/08/04
Y2 - 2024/04/16
TI - Efficient Methods for Lifted Inference with Aggregate Factors
JF - Proceedings of the AAAI Conference on Artificial Intelligence
JA - AAAI
VL - 25
IS - 1
SE - Reasoning Under Uncertainty
DO - 10.1609/aaai.v25i1.8025
UR - https://ojs.aaai.org/index.php/AAAI/article/view/8025
SP - 1030-1036
AB - <p> Aggregate factors (that is, those based on aggregate functions such as SUM, AVERAGE, AND etc.) in probabilistic relational models can compactly represent dependencies among a large number of relational random variables. However, propositional inference on a factor aggregating <em>n k</em>-valued random variables into an <em>r</em>-valued result random variable is <em>O</em>(<em>r k</em> 2<em><sup>n</sup></em>). Lifted methods can ameliorate this to <em>O</em>(<em>r n<sup>k</sup></em>) in general and <em>O</em>(<em>r k</em> log <em>n</em>) for commutative associative aggregators. In this paper, we propose (a) an exact solution constant in <em>n</em> when <em>k </em>= 2 for certain aggregate operations such as AND, OR and SUM, and (b) a close approximation for inference with aggregate factors with time complexity constant in <em>n</em>. This approximate inference involves an analytical solution for some operations when <em>k</em> > 2. The approximation is based on the fact that the typically used aggregate functions can be represented by linear constraints in the standard (<em>k </em>–1)-simplex in R<em><sup>k</sup></em> where <em>k</em> is the number of possible values for random variables. This includes even aggregate functions that are commutative but not associative (e.g., the MODE operator that chooses the most frequent value). Our algorithm takes polynomial time in k (which is only 2 for binary variables) regardless of <em>r</em> and<em> n,</em> and the error decreases as n increases. Therefore, for most applications (in which a close approximation suffices) our algorithm is a much more efficient solution than existing algorithms. We present experimental results supporting these claims. We also present a (c) third contribution which further optimizes aggregations over multiple groups of random variables with distinct distributions. </p>
ER -