Probabilistic Circuits (PCs) offer a unified framework for tractable probability distributions, enabling efficient probabilistic inference through structured computation graphs. Researchers are advancing their speed and scalability via GPU parallelization, tensorized designs, and even custom hardware like DAG Processing Units. With applications ranging from explainability and data compression to neuro-symbolic AI and large language model detoxification, PCs are emerging as a powerful foundation for the next wave of efficient, interpretable AI.Probabilistic Circuits (PCs) offer a unified framework for tractable probability distributions, enabling efficient probabilistic inference through structured computation graphs. Researchers are advancing their speed and scalability via GPU parallelization, tensorized designs, and even custom hardware like DAG Processing Units. With applications ranging from explainability and data compression to neuro-symbolic AI and large language model detoxification, PCs are emerging as a powerful foundation for the next wave of efficient, interpretable AI.

Why Researchers Are Betting on PCs to Power the Next Wave of AI

Abstract and 1. Introduction

  1. Preliminaries and Related Work

  2. Key Bottlenecks in PC Parallelization

  3. Harnessing Block-Based PC Parallelization

    4.1. Fully Connected Sum Layers

    4.2. Generalizing To Practical Sum Layers

    4.3. Efficient Implementations by Compiling PC Layers

    4.4. Analysis: IO and Computation Overhead

  4. Optimizing Backpropagation with PC Flows

  5. Experiments

    6.1. Faster Models with PyJuice

    6.2. Better PCs At Scale

    6.3. Benchmarking Existing PCs

  6. Conclusion, Acknowledgements, Impact Statement, and References

A. Algorithm Details

B. Additional Technical Details

C. Experimental Details

D. Additional Experiments

\

2. Preliminaries and Related Work

Many probabilistic inference tasks can be cast into computing sums of products. By viewing them from a computation graph standpoint, PCs provide a unified perspective on many bespoke representations of tractable probability distributions, including Arithmetic Circuits (Darwiche, 2002; 2003), Sum-Product Networks (Poon & Domingos, 2011), Cutset Networks (Rahman et al., 2014), and Hidden Markov Models (Rabiner & Juang, 1986). Specifically, PCs define distributions with computation graphs consisting of sum and product operations, as elaborated below.

\

\ The key to guaranteeing exact and efficient computation of various probabilistic queries is to impose proper structural constraints on the DAG of the PC. As an example, with smoothness and decomposability (Poon & Domingos, 2011), computing any marginal probability amounts to a forward pass (children before parents) following Equation (1), with the only exception that we set the value of input nodes defined on marginalized variables to be 1. Please refer to Choi et al. (2020) for a comprehensive overview of different structural constraints and what queries they enable.

\

\ For example, Peharz et al. (2020a) demonstrate how the above parameter gradients can be used to apply ExpectationMaximization (EM) updates, and Vergari et al. (2021) elaborates how the forward pass can be used to compute various probabilistic and information-theoretic queries when coupled with PC structure transformation algorithms. Therefore, the speed and memory efficiency of these two procedures largely determine the overall efficiency of PCs.

\ Figure 1. Layering a PC by grouping nodes with the same topological depth (as indicated by the colors) into disjoint subsets. Both the forward and the backward computation can be carried out independently on nodes within the same layer.

\ Related work on accelerating PCs. There has been a great amount of effort put into speeding up training and inference for PCs. One of the initial attempts performs nodebased computations on both CPUs (Lowd & Rooshenas, 2015) and GPUs (Pronobis et al., 2017; Molina et al., 2019), i.e., by computing the outputs for a mini-batch of inputs (data) recursively for every node. Despite its simplicity, it fails to fully exploit the parallel computation capability possessed by modern GPUs since it can only parallelize over a batch of samples. This problem is mitigated by also parallelizing topologically independent nodes (Peharz et al., 2020a; Dang et al., 2021). Specifically, a PC is chunked into topological layers, where nodes in the same layer can be computed in parallel. This leads to 1-2 orders of magnitude speedup compared to node-based computation.

\ The regularity of edge connection patterns is another key factor influencing the design choices. Specifically, EiNets (Peharz et al., 2020a) leverage off-the-shelf Einsum operations to parallelize dense PCs where every layer contains groups of densely connected sum and product/input nodes. Mari et al. (2023) generalize the notion of dense PCs to tensorized PCs, which greatly expands the scope of EiNets. Dang et al. (2021) instead focus on speeding up sparse PCs, where different nodes could have drastically different numbers of edges. They use custom CUDA kernels to balance the workload of different GPU threads and achieve decent speedup on both sparse and dense PCs.

\ Another thread of work focuses on designing computation hardware that is more suitable for PCs. Specifically, Shah et al. (2021) propose DAG Processing Units (DPUs) that can efficiently traverse sparse PCs, Dadu et al. (2019) introduce an indirect read reorder-buffer to improve the efficiency of data-dependent memory accesses in PCs, and Yao et al. (2023) use addition-as-int multiplications to significantly improve the energy efficiency of PC inference algorithms.

\ Figure 2. Runtime breakdown of the feedforward pass of a PC with ∼150M edges. Both the IO and the computation overhead of the sum layers are significantly larger than the total runtime of product layers. Detailed configurations of the PC are shown in the table.

\ Applications of PCs. PCs have been applied to many domains such as explainability and causality (Correia et al., 2020; Wang & Kwiatkowska, 2023), graph link prediction (Loconte et al., 2023), lossless data compression (Liu et al., 2022), neuro-symbolic AI (Xu et al., 2018; Manhaeve et al., 2018; Ahmed et al., 2022a;b), gradient estimation (Ahmed et al., 2023b), graph neural networks rewiring (Qian et al., 2023), and even large language model detoxification (Ahmed et al., 2023a).

\

:::info Authors:

(1) Anji Liu, Department of Computer Science, University of California, Los Angeles, USA (liuanji@cs.ucla.edu);

(2) Kareem Ahmed, Department of Computer Science, University of California, Los Angeles, USA;

(3) Guy Van den Broeck, Department of Computer Science, University of California, Los Angeles, USA;

:::


:::info This paper is available on arxiv under CC BY 4.0 DEED license.

:::

\

Market Opportunity
NeuroWeb AI Logo
NeuroWeb AI Price(NEURO)
$0,0099
$0,0099$0,0099
%0,00
USD
NeuroWeb AI (NEURO) Live Price Chart
Disclaimer: The articles reposted on this site are sourced from public platforms and are provided for informational purposes only. They do not necessarily reflect the views of MEXC. All rights remain with the original authors. If you believe any content infringes on third-party rights, please contact service@support.mexc.com for removal. MEXC makes no guarantees regarding the accuracy, completeness, or timeliness of the content and is not responsible for any actions taken based on the information provided. The content does not constitute financial, legal, or other professional advice, nor should it be considered a recommendation or endorsement by MEXC.

You May Also Like

Coinbase’s CEO Armstrong Highlights Support for Crypto Clarity Act

Coinbase’s CEO Armstrong Highlights Support for Crypto Clarity Act

TLDR Coinbase plans to offer Bitcoin-backed credit cards with up to 4% rewards. The Crypto Clarity Act aims to clarify U.S. regulations for stablecoins and crypto. Coinbase is exploring stablecoin yield programs with returns up to 10%. Armstrong highlights the need for clear crypto laws to drive Coinbase’s growth. Coinbase’s CEO, Brian Armstrong, is optimistic [...] The post Coinbase’s CEO Armstrong Highlights Support for Crypto Clarity Act appeared first on CoinCentral.
Share
Coincentral2025/09/20 19:50
Why losing THIS support could drag XRP toward $1

Why losing THIS support could drag XRP toward $1

The post Why losing THIS support could drag XRP toward $1 appeared on BitcoinEthereumNews.com. Rising activity clashes with weakening momentum as XRP price struggles
Share
BitcoinEthereumNews2025/12/31 03:24
How The Washington Nationals Can Pull Themselves Out Of The Basement

How The Washington Nationals Can Pull Themselves Out Of The Basement

The post How The Washington Nationals Can Pull Themselves Out Of The Basement appeared on BitcoinEthereumNews.com. Washington Nationals interim manager Miguel Cairo (22) in action during the first baseball game of a doubleheader against the Atlanta Braves, Tuesday, Sept. 16, 2025, in Washington. (AP Photo/Nick Wass) Copyright 2025 The Associated Press. All rights reserved. Problems on the field can be fixed in a variety of ways. Problems off the field are more complicated, especially at the ownership level. That makes today’s Washington Post report on the Washington Nationals’ messy leadership structure that much more disturbing. The report, published by Barry Svrluga, Andrew Golden, and Chelsea Janes, detailed multiple inside sources criticizing the team’s leadership as “directionless.” It alleges that there are 10 members of the Lerner family making ownership-level decisions, preventing the franchise from having a unified voice. They haven’t employed a team president since 2010. At 62-92, the Nationals have already guaranteed another last-place finish in the National League East, their fourth in five years. Since winning the 2019 World Series, they haven’t won more than 71 games in a season. The Nationals fired president of baseball operations Mike Rizzo and manager Dave Martinez in July. They’ve been run by interim general manager Mike DeBartolo and interim manager Miguel Cairo since then, but they will need to make permanent hires in those critical roles early in the offseason. Their next leadership structure may or may not have significant changes from the current one. Regardless of how that plays out, they need to rethink their rebuild to climb out of the basement. The Nationals have three building blocks who were all acquired from the San Diego Padres in the Juan Soto trade. Shortstop CJ Abrams leads the team with 3.9 WAR (Baseball-Reference version) thanks to his 18 home runs and 31 stolen bases. Left fielder James Wood is hitting .254/.349/.461 with 27 home runs and 3.5…
Share
BitcoinEthereumNews2025/09/21 04:33