Towards a universal unfolder: cDDPMs learn P(x|y) conditioned on detector stats, enabling multidimensional, non-iterative unfolding that reduces truth bias.Towards a universal unfolder: cDDPMs learn P(x|y) conditioned on detector stats, enabling multidimensional, non-iterative unfolding that reduces truth bias.

A Conditional Diffusion Approach to Multidimensional Unfolding Across Physics Processes

:::info Authors:

(1) Camila Pazos, Department of Physics and Astronomy, Tufts University, Medford, Massachusetts;

(2) Shuchin Aeron, Department of Electrical and Computer Engineering, Tufts University, Medford, Massachusetts and The NSF AI Institute for Artificial Intelligence and Fundamental Interactions;

(3) Pierre-Hugues Beauchemin, Department of Physics and Astronomy, Tufts University, Medford, Massachusetts and The NSF AI Institute for Artificial Intelligence and Fundamental Interactions;

(4) Vincent Croft, Leiden Institute for Advanced Computer Science LIACS, Leiden University, The Netherlands;

(5) Martin Klassen, Department of Physics and Astronomy, Tufts University, Medford, Massachusetts;

(6) Taritree Wongjirad, Department of Physics and Astronomy, Tufts University, Medford, Massachusetts and The NSF AI Institute for Artificial Intelligence and Fundamental Interactions.

:::

Abstract and 1. Introduction

  1. Unfolding

    2.1 Posing the Unfolding Problem

    2.2 Our Unfolding Approach

  2. Denoising Diffusion Probabilistic Models

    3.1 Conditional DDPM

  3. Unfolding with cDDPMs

  4. Results

    5.1 Toy models

    5.2 Physics Results

  5. Discussion, Acknowledgments, and References

\ Appendices

A. Conditional DDPM Loss Derivation

B. Physics Simulations

C. Detector Simulation and Jet Matching

D. Toy Model Results

E. Complete Physics Results

Abstract

The unfolding of detector effects in experimental data is critical for enabling precision measurements in high-energy physics. However, traditional unfolding methods face challenges in scalability, flexibility, and dependence on simulations. We introduce a novel unfolding approach using conditional Denoising Diffusion Probabilistic Models (cDDPM). Our method utilizes the cDDPM for a non-iterative, flexible posterior sampling approach, which exhibits a strong inductive bias that allows it to generalize to unseen physics processes without explicitly assuming the underlying distribution. We test our approach by training a single cDDPM to perform multidimensional particle-wise unfolding for a variety of physics processes, including those not seen during training. Our results highlight the potential of this method as a step towards a “universal” unfolding tool that reduces dependence on truth-level assumptions.

\

1 Introduction

Unfolding detector effects in high-energy physics (HEP) events is a critical challenge with significant implications for both theoretical and experimental physics. Experimental data in HEP presents a distorted picture of the true physics processes due to detector effects. Unfolding is an inverse-problem solved through statistical inference that aims to correct the detector distortions of the observed data to recover the true distribution of particle properties. This process is essential for the validation of theories, new discoveries, precision measurements, and comparison of experimental results between different experiments. Since there are flaws in any possible solution to such a problem, the quality of the statistical inference directly impacts the reliability of scientific conclusions, making unfolding a cornerstone in high-energy physics research.

\ Traditional unfolding methods [5] are based on the linearization of the problem, reducing it to the resolution of a set of linear equations. Such approaches often suffer from limitations such as the requirement for data to be binned into histograms, the inability to unfold multiple observables simultaneously, and the lack of utilization of all features that control the detector response. These limitations necessitate a more robust and comprehensive approach to unfolding that would increase the usefulness of a dataset, for example by providing more information about the underlying physics process that led to the observed data.

\ Machine learning methods for unfolding have recently emerged as a powerful tool for this purpose. The ability of machine learning algorithms to learn patterns and relationships from large datasets makes them well-suited to analyzing the vast amounts of data generated by modern particle experiments. The OmniFold method, for instance, mitigates many of the challenges faced by traditional approaches by allowing us to utilize a multidimensional representation of the particles, which can include both the full phase space information and high-dimensional features [1]. Along with OmniFold, a variety of machine learning approaches for unfolding have been presented in recent years, including generative adversarial networks [9], conditional invertible neural networks [2], latent variational diffusion models [16], Schrodinger bridges and diffusion models [11], and others, see [15] for a recent survey. Each new method has made further strides in unfolding and shown the advantages in machine learning based approaches compared to traditional techniques. However, these methods all rely on an explicit description of the expected underlying distribution resulting from the unfolding process.

\ In this work, we introduce a novel approach based on Denoising Diffusion Probabilistic Models (DDPM) to unfold detector effects in HEP data without requiring an explicit assumption about the underlying distribution. We demonstrate how a single conditional DDPM can be trained to perform multidimensional particle-wise unfolding for a variety of physics processes. This flexibility is a step towards a “universal” unfolding tool, providing unfolded estimates while reducing the dependence on truth-level assumptions that could bias the results. This study serves as a benchmark for improving unfolding methods for the LHC and future colliders.

\

2 Unfolding

2.1 Posing the Unfolding Problem

\

\ \ This reveals one of the main challenges in developing a universal unfolder, which can be applied to unfold detector data for any physics process. Instead of developing a method able to learn a posterior P(x|y) to unfold detector data pertaining to a specific true underlying distribution, a universal unfolder aims to remove detector effects from any set of measured data agnostic of the process of interest, ideally with no bias towards any prior distribution.

\

2.2 Our Unfolding Approach

Although we cannot achieve an ideal universal unfolder, we can seek an approach that will enhance the inductive bias of the unfolding method to improve generalization to cover various posteriors pertaining to different physics data distributions. From eq. (2) we can see that the posteriors for two different physics processes i and j are related by a ratio of the probability density functions of each process,

\ \

\ \ Assuming we can learn the posterior for a given physics process, we note that we could extrapolate to unseen posteriors if the priors ftrue(x) and detector distributions fdet(y) can be approximated or written in a closed form. Although these functions have no analytical form, we can approximate key features using the first moments of these distributions. By making use of these moments, we can have a more flexible unfolder that is not strictly tied to a selected prior distribution, and enables it to interpolate and extrapolate to unseen posteriors based on the provided moments. Consequently, this unfolding tool gains the ability to handle a wider range of physics processes and enhances the generalization capabilities, making it a more versatile tool for unfolding in various high energy physics applications.

\ In practice, one can use a training dataset of pairs {x, y} to train a machine learning model to learn a posterior P(x|y). To implement our approach and improve the inductive bias, we define a training dataset consisting of multiple prior distributions and incorporate the moments of these distributions to the data pairs. The moments are therefore included in the conditioning and generative aspects of the machine learning model such that it may be able to model multiple posteriors. As a result, we establish an unfolding tool as a posterior sampler that, when trained with sufficient priors within a family of distributions, is “universal” in the sense that it has a strong inductive bias to allow generalization towards estimating the prior distribution of unseen datasets. Further details and a technical description of this method are provided in section 4.

\ Our proposed approach calls for a flexible generative model, and denoising diffusion probabilistic models (DDPMs) [13] lend themselves naturally to this task. DDPMs learn via a reversible generative process that can be conditioned directly on the moments of the distribution fdet(y) and on the detector values themselves, providing a natural way to model P(x|y) for unfolding. In particular, the various conditioning methods available for DDPMs offer the flexibility to construct a model that can adapt to different detector data distributions and physics processes. Further details on DDPMs are provided in section 3.

\

:::info This paper is available on arxiv under CC BY 4.0 DEED license.

:::

\

Market Opportunity
Brainedge Logo
Brainedge Price(LEARN)
$0.00929
$0.00929$0.00929
+0.21%
USD
Brainedge (LEARN) Live Price Chart
Disclaimer: The articles reposted on this site are sourced from public platforms and are provided for informational purposes only. They do not necessarily reflect the views of MEXC. All rights remain with the original authors. If you believe any content infringes on third-party rights, please contact service@support.mexc.com for removal. MEXC makes no guarantees regarding the accuracy, completeness, or timeliness of the content and is not responsible for any actions taken based on the information provided. The content does not constitute financial, legal, or other professional advice, nor should it be considered a recommendation or endorsement by MEXC.

You May Also Like

Tom Lee, 2026’yı “Ethereum Yılı” İlan Etti: Fiyat Tahminini Paylaştı!

Tom Lee, 2026’yı “Ethereum Yılı” İlan Etti: Fiyat Tahminini Paylaştı!

BitMine Yönetim Kurulu Başkanı ve Fundstrat kurucu ortağı Tom Lee, Ethereum’un 2026 yılında “öne çıkan anını” yaşayabileceğini ve ETH fiyatının 12.000 dolara kadar
Share
Coinstats2026/01/17 22:47
How to earn from cloud mining: IeByte’s upgraded auto-cloud mining platform unlocks genuine passive earnings

How to earn from cloud mining: IeByte’s upgraded auto-cloud mining platform unlocks genuine passive earnings

The post How to earn from cloud mining: IeByte’s upgraded auto-cloud mining platform unlocks genuine passive earnings appeared on BitcoinEthereumNews.com. contributor Posted: September 17, 2025 As digital assets continue to reshape global finance, cloud mining has become one of the most effective ways for investors to generate stable passive income. Addressing the growing demand for simplicity, security, and profitability, IeByte has officially upgraded its fully automated cloud mining platform, empowering both beginners and experienced investors to earn Bitcoin, Dogecoin, and other mainstream cryptocurrencies without the need for hardware or technical expertise. Why cloud mining in 2025? Traditional crypto mining requires expensive hardware, high electricity costs, and constant maintenance. In 2025, with blockchain networks becoming more competitive, these barriers have grown even higher. Cloud mining solves this by allowing users to lease professional mining power remotely, eliminating the upfront costs and complexity. IeByte stands at the forefront of this transformation, offering investors a transparent and seamless path to daily earnings. IeByte’s upgraded auto-cloud mining platform With its latest upgrade, IeByte introduces: Full Automation: Mining contracts can be activated in just one click, with all processes handled by IeByte’s servers. Enhanced Security: Bank-grade encryption, cold wallets, and real-time monitoring protect every transaction. Scalable Options: From starter packages to high-level investment contracts, investors can choose the plan that matches their goals. Global Reach: Already trusted by users in over 100 countries. Mining contracts for 2025 IeByte offers a wide range of contracts tailored for every investor level. From entry-level plans with daily returns to premium high-yield packages, the platform ensures maximum accessibility. Contract Type Duration Price Daily Reward Total Earnings (Principal + Profit) Starter Contract 1 Day $200 $6 $200 + $6 + $10 bonus Bronze Basic Contract 2 Days $500 $13.5 $500 + $27 Bronze Basic Contract 3 Days $1,200 $36 $1,200 + $108 Silver Advanced Contract 1 Day $5,000 $175 $5,000 + $175 Silver Advanced Contract 2 Days $8,000 $320 $8,000 + $640 Silver…
Share
BitcoinEthereumNews2025/09/17 23:48
BetFury is at SBC Summit Lisbon 2025: Affiliate Growth in Focus

BetFury is at SBC Summit Lisbon 2025: Affiliate Growth in Focus

The post BetFury is at SBC Summit Lisbon 2025: Affiliate Growth in Focus appeared on BitcoinEthereumNews.com. Press Releases are sponsored content and not a part of Finbold’s editorial content. For a full disclaimer, please . Crypto assets/products can be highly risky. Never invest unless you’re prepared to lose all the money you invest. Curacao, Curacao, September 17th, 2025, Chainwire BetFury steps onto the stage of SBC Summit Lisbon 2025 — one of the key gatherings in the iGaming calendar. From 16 to 18 September, the platform showcases its brand strength, deepens affiliate connections, and outlines its plans for global expansion. BetFury continues to play a role in the evolving crypto and iGaming partnership landscape. BetFury’s Participation at SBC Summit The SBC Summit gathers over 25,000 delegates, including 6,000+ affiliates — the largest concentration of affiliate professionals in iGaming. For BetFury, this isn’t just visibility, it’s a strategic chance to present its Affiliate Program to the right audience. Face-to-face meetings, dedicated networking zones, and affiliate-focused sessions make Lisbon the ideal ground to build new partnerships and strengthen existing ones. BetFury Meets Affiliate Leaders at its Massive Stand BetFury arrives at the summit with a massive stand placed right in the center of the Affiliate zone. Designed as a true meeting hub, the stand combines large LED screens, a sleek interior, and the best coffee at the event — but its core mission goes far beyond style. Here, BetFury’s team welcomes partners and affiliates to discuss tailored collaborations, explore growth opportunities across multiple GEOs, and expand its global Affiliate Program. To make the experience even more engaging, the stand also hosts: Affiliate Lottery — a branded drum filled with exclusive offers and personalized deals for affiliates. Merch Kits — premium giveaways to boost brand recognition and leave visitors with a lasting conference memory. Besides, at SBC Summit Lisbon, attendees have a chance to meet the BetFury team along…
Share
BitcoinEthereumNews2025/09/18 01:20