SciPost logo

SciPost Submission Page

Many-gluon tree amplitudes on modern GPUs: A case study for novel event generators

by Enrico Bothmann, Walter Giele, Stefan Hoeche, Joshua Isaacson, Max Knobbe

Submission summary

Authors (as registered SciPost users): Enrico Bothmann · Joshua Isaacson · Max Knobbe
Submission information
Preprint Link: https://arxiv.org/abs/2106.06507v2  (pdf)
Code repository: https://www.gitlab.com/ebothmann/blockgen-archive
Date accepted: 2022-02-28
Date submitted: 2022-01-11 14:45
Submitted by: Bothmann, Enrico
Submitted to: SciPost Physics Codebases
Ontological classification
Academic field: Physics
Specialties:
  • High-Energy Physics - Experiment
  • High-Energy Physics - Phenomenology
Approach: Computational

Abstract

The compute efficiency of Monte-Carlo event generators for the Large Hadron Collider is expected to become a major bottleneck for simulations in the high-luminosity phase. Aiming at the development of a full-fledged generator for modern GPUs, we study the performance of various recursive strategies to compute multi-gluon tree-level amplitudes. We investigate the scaling of the algorithms on both CPU and GPU hardware. Finally, we provide practical recommendations as well as baseline implementations for the development of future simulation programs. The GPU implementations can be found at: https://www.gitlab.com/ebothmann/blockgen-archive.

Author comments upon resubmission

We resubmit a second version to address referee comments. See the list of changes.

List of changes

- Provide additional instructions for compiling and running the code in the linked code repository
- Add minor clarifications and improve some formulations
- Improve layout and figure placement, fix typographical/punctuation issues
- Add two missing references for multi-jet measurements at UA1 and UA2
- Clarify why it is advantageous to use a real-number only algorithm for gluon-only algorithms
- Add technical details on the GPU used for the study
- Improve results on the comparison between sampling and summing with respect to reaching a pre-defined precision goal; related to that, add a comment on the use of an improved helicity sampling algorithm
- Extend the appendix to make our point on a possible hybrid CPU/GPU ansatz clearer; in particular note that we can not compare to a GPU-only ansatz, as a full-fledged phase-space generator implementation is still missing

Published as SciPost Phys. Codebases 3 (2022) , SciPost Phys. Codebases 3-r1.0 (2022)


Reports on this Submission

Report 1 by Peter Skands on 2022-1-28 (Invited Report)

Report

I thank the authors for the elaborations and clarifications they made in response to my report on v1 of this paper. In my opinion all concerns have been addressed, and I am happy to recommend this paper to proceed to publication.

  • validity: top
  • significance: top
  • originality: high
  • clarity: high
  • formatting: excellent
  • grammar: perfect

Login to report or comment