SciPost Submission Page
Bayesian Active Search on Parameter Space: a 95 GeV Spin-0 Resonance in the ($B-L$)SSM
by Mauricio A. Diaz, Giorgio Cerro, Srinandan Dasmahapatra, Stefano Moretti
Submission summary
| Authors (as registered SciPost users): | Mauricio A. Diaz |
| Submission information | |
|---|---|
| Preprint Link: | https://arxiv.org/abs/2404.18653v2 (pdf) |
| Code repository: | https://github.com/mjadiaz/blssm-bcastor |
| Date accepted: | Sept. 23, 2025 |
| Date submitted: | Sept. 17, 2025, 12:21 a.m. |
| Submitted by: | Mauricio A. Diaz |
| Submitted to: | SciPost Physics Core |
| Ontological classification | |
|---|---|
| Academic field: | Physics |
| Specialties: |
|
| Approaches: | Computational, Phenomenological |
Abstract
In the attempt to explain possible data anomalies from collider experiments in terms of New Physics (NP) models, computationally expensive scans over their parameter spaces are typically required in order to match theoretical predictions to experimental observations. Under the assumption that anomalies seen at a mass of about 95 GeV by the Large Electron-Positron (LEP) and Large Hadron Collider (LHC) experiments correspond to a NP signal, which we attempt to interpret as a spin-0 resonance in the $(B-L)$ Supersymmetric Standard Model ($(B-L)$SSM), chosen as an illustrative example, we introduce a novel Machine Learning (ML) approach based on a multi-objective active search method, called b-CASTOR, able to achieve high sample efficiency and diversity, due to the use of probabilistic surrogate models and a volume based search policy, outperforming competing algorithms, such as those based on Markov-Chain Monte Carlo (MCMC) methods.
Author comments upon resubmission
Response to Referees
We sincerely thank the Editor and both referees for their careful reading and constructive feedback. We have revised the manuscript accordingly and provide detailed responses below, quoting each comment as bullet points followed by our reply. We are grateful for the insightful suggestions, which have improved the clarity and scope of the paper.
Referee 1
- While the comparison to traditional MCMC methods is appreciated, the work would benefit significantly from benchmarking against more modern scanning techniques such as those implemented in the BSMArt package (refs. [22,23]), which employs active learning for efficient exploration of BSM parameter spaces. A head-to-head comparison in terms of performance and computational efficiency would be particularly valuable for potential users seeking practical guidance on method selection.
Response: We appreciate this suggestion to include a comparison with modern parameter-scanning techniques such as those in the BSMArt package. As noted in our conclusions, benchmarking b-CASTOR against emerging ML/AI-based approaches is indeed a valuable direction for future work. A direct, fair comparison is not entirely straightforward because these algorithms often pursue different objectives: for example, the active-learning strategies in BSMArt aim to identify the boundaries of satisfactory regions using deep learning, whereas our primary objective is to densely populate the satisfactory region in a sample-efficient manner using Gaussian Processes.
We face a similar challenge even in our current comparison with MCMC--MH, which is fundamentally a sampling algorithm rather than a search algorithm. Our comparison to MCMC-MH is therefore intended as a proof of concept, demonstrating how a search-based approach like b-CASTOR can more efficiently characterise the satisfactory region.
Nevertheless, we agree on the importance of broader benchmarking and now emphasise in the conclusions that exploring b-CASTOR's performance relative to ML/AI-driven methods will be an important focus of future work.
- In this context, including detailed and quantitative timing benchmarks with a breakdown of the computational cost per step, is essential. Timing results should also indicate the computing environment used (e.g. CPU model, GPU availability, memory), as such information is crucial for reproducibility and fair comparisons.
Response: We thank the referee for highlighting the importance of reproducibility. We have added a complete description of the hardware and computational setup used for our studies, together with timing information and a breakdown of costs per step.
- Turning to the physics application, the authors choose a supersymmetric model motivated by dark matter, neutrino masses and anomalies pointing to the presence of a new states with a mass of 95 GeV. Given these motivations, it would be appropriate to include constraints relevant to all these phenomena, such as those from the dark matter relic abundance and direct/indirect detection constraints, neutrino observables, electroweak precision tests and LHC searches. At present, the study appears superficial: the scan focuses primarily on a subset of scalar states and the broader phenomenological implications are left unexplored. If the authors opt to present only a limited physics application as currently done, it would arguably be more appropriate to adopt a simplified model framework rather than a complex supersymmetric scenario. Moreover, the treatment of experimental uncertainties in the objective function remains unclear.
Response: We thank the referee for this thorough and constructive analysis. As suggested by the Editor, we are resubmit the work to \emph{SciPost Core}, which is better aligned with the scope of our study. We emphasise that the primary aim of this paper is not to provide an exhaustive phenomenological analysis, but rather to demonstrate the potential of b-CASTOR as a sample-efficient search strategy for complex BSM models. To make this clearer, we have expanded the introduction to explain our choice of the $(B-L)$SSM as a computationally challenging test case and to note explicitly that the algorithm is flexible enough to incorporate additional phenomenological constraints, such as those from dark matter, neutrino observables, and precision tests, in future work. We have also clarified how experimental uncertainties are incorporated into the objective function by explicitly stating the likelihood form and the associated bounds used.
- Another important concern pertains to the method’s coverage of parameter space. In the shown examples, the authors note that viable regions may be missed (as seen in figure 3(c)). Although they mention that the algorithm can be guided to avoid such pitfalls, this feature appears not to be activated by default. It is thus not clear why a mechanism to mitigate this issue is not implemented systematically. This indeed raises concerns regarding the robustness of the method, particularly in the B$-$L supersymmetric scan where unexplored viable regions could remain. Somehow, without such proper safeguards, there is a risk that the algorithm may become trapped in local minima and fail to explore the full viable space.
Response: We thank the referee for raising this point. We have expanded the final paragraph of Section 4.1 to explain the theoretical guarantees of the ECI framework, which our batched algorithm preserves, ensuring asymptotic coverage of the satisfactory region as the number of samples increases. We use a decaying-radius strategy to balance early discovery with late-stage resolution, and we performed a grid search to select optimal $(r_{\mathrm{initial}}, r_{\mathrm{final}})$ values. We clarify when and how these safeguards are activated in practice.
- The choice of references in the introduction is unusual and should be revised. For instance, the discussion of anomalies (refs. [3,4]) cites theoretical papers instead of the original experimental studies. Standard supersymmetry references are omitted (in favour of self-citations). Furthermore, the list of references related to the 95 GeV excess (refs. [9-12]) is incomplete and should be expanded to reflect the broader literature.
Response: We have added standard supersymmetry references and expanded the literature list on the 95 GeV excess. References [3,4] are retained as review-style summaries of anomalies, while original experimental studies are cited for the corresponding measurements where appropriate.
- The manuscript introduces a large number of acronyms and abbreviations, including several that are uncommon. This hampers readability. The authors are encouraged to minimise the use of non-standard abbreviations.
Response: We have reduced the number of acronyms, removing, e.g., BMO (Bayesian Multi-Objective Optimisation) and AL (Active Learning), and we define remaining abbreviations at first use.
- The sentence "The computational cost associated with numerically evaluating a specific configuration of a BSM model using a typical High Energy Physics (HEP) software toolbox is high." would benefit from a more precise and quantitative statement.
Response: We added the following quantitative example: "As an example, the HEP toolbox used in this study requires approximately one minute to evaluate a single parameter-space configuration".
- In the introduction, it would be useful to cite representative works that rely on MCMC methods for BSM parameter space exploration, to better contextualise the need for alternative approaches.
Response: We have added representative examples, including HEPfit (which internally uses Metropolis-Hastings MCMC) and \texttt{emcee} (a parallel and more advanced MCMC sampler) with applications to BSM fits and dark-matter phenomenology.
- Section 2.2 aims to briefly describe the model under consideration, but the current summary is overly concise. Reliance on external references makes the section insufficiently self-contained. At a minimum, the authors should include a table listing the quantum numbers of all superfields, as well as the supersymmetry-breaking Lagrangian. This would greatly aid the reader in understanding the scalar mass matrices discussed later as many parameters appear without being defined beforehand. Finally, note that the $\chi_2$ superfield is never introduced explicitly.
Response: We have defined all terms and parameters in Section 2.2, introduced the $\chi_2$ superfield, and clarified the Higgs mass-matrix parameter definitions. We have not added a full table of quantum numbers, as we believe this is not essential for our algorithm-focused presentation; however, the section is now self-contained for our purposes.
- A reference to the HiggsTools framework should be included, alongside HiggsBounds and HiggsSignals.
Response: A reference to HiggsTools has been added alongside HiggsBounds and HiggsSignals.
- It would be helpful to include information on how to access the code developed for this study (with a link to the associated public repository).}\end{quote}
Response: We have added links to both the \texttt{mjadiaz/blssm-bcastor} repository and the \texttt{mjadiaz/hep-aid} implementation.
Referee 2
In Section 2.3, second paragraph, the authors state that they use HiggsBounds and HiggsSignals to test for experimental constraints. Additional constraints could be obtained from recasting tools like MadAnalysis5. The authors should explain why they expect its contributions to be subleading or---if MadAnalysis5 was excluded because of runtime---state this as a limitation. Response: We thank the referee for this helpful suggestion. We have clarified in Sec. 2.3 that, for the 95 GeV anomaly, the experimental inputs we use are limited to the resonance mass and inclusive rates, which are directly constrained by HiggsBounds and HiggsSignals. We do not employ further event-level information, as such an approach would be necessary only for more fine-grained kinematic analyses; we have stated this explicitly as a limitation.
- The title of Section 3.4
Performance study'' seems unfitting since no study is performed in this section. PerhapsBenchmark Algorithm'' or something similar might be clearer.
Response: The section title has been changed from "Performance study'' to "Benchmark algorithm".
- The public code to reproduce the paper’s results should be linked in Section 3.5. That is both the repo \texttt{mjadiaz/blssm-bcastor} and the implementation of bcastor in \texttt{mjadiaz/hep-aid}.
Response: We have added links to both the mjadiaz/blssm-bcastor repository and the mjadiaz/hep-aid implementation.
- It seems that the algorithm presented in the paper is applicable to a vast range of BSM models. This should be stated in the conclusions. It would be particularly useful if the authors could highlight BSM models that can most benefit from a search with b-CASTOR.
Response: We thank the referee for this comment and fully agree that the algorithm has broader applicability. We have extended the conclusions to state explicitly that b-CASTOR can be applied to a wide range of BSM scenarios. In addition to our detailed study of the $(B-L)$SSM, we now also mention an internal cross-check of its performance in the 2HDM, which showed a comparable level of efficiency.
Typos: We thank the referee for the careful list of corrections. All typos have been fixed.
Once again, we thank the Editor and both referees for their constructive feedback and guidance. We hope the revisions address all concerns satisfactorily and improve the clarity and usefulness of the manuscript.
We therefore ask for the paper to be reconsidered for publication.
Regards, The authors
List of changes
1) Page 2, paragraph 2:
- Expanded 95 GeV excess literature list and added standard SUSY references
2) Page 2, paragraph 4:
- Added representative works using MCMC (HEPfit, emcee, etc.)
3) Page 2, footnote 2:
- Added quantitative timing example
4) Page 3, paragraph 2 and footnote 4:
- Added links to mjadiaz/blssm-bcastor and mjadiaz/hep-aid
5) Page 4, paragraph 1:
- Expanded introduction to justify using the (B-L)SSM as a computationally challenging case
6) Section 2.2:
- Improved model description and defined all terms and parameters
7) Section 2.3, paragraph 2:
- Clarified why only HiggsBounds/HiggsSignals were used
- Stated exclusion of MadAnalysis5 as a limitation
8) Section 4.1, last paragraph on page 15:
- Emphasised that the batched extension preserves ECI’s convergence guarantees
9) Section 3.4 title:
- Changed title from "Performance study" to "Benchmark algorithm"
10) Section 4.3 (Computational Resources):
- Added computing environment details (hardware, CPU/GPU, memory)
- Added quantitative timing results and breakdown of computational costs per step
11) Conclusions:
- Extended to state b-CASTOR applies to a wide range of BSM models, mentioning internal cross-check with 2HDM
- Emphasised that comparisons with ML/AI-based methods are left for future work, noting that differing objectives make direct comparisons non-trivial
Published as SciPost Phys. Core 8, 068 (2025)
