SciPost logo

SciPost Submission Page

Advancing the CMS Level-1 Trigger: Jet Tagging with DeepSets at the HL-LHC

by Stella Schaefer, Christopher Brown, Duc Hoang, Sioni Summers, Sebastian Wuchterl

This is not the latest submitted version.

Submission summary

Authors (as registered SciPost users): Stella Felice Schaefer
Submission information
Preprint Link: https://arxiv.org/abs/2509.24371v1  (pdf)
Code repository: https://github.com/CMS-L1T-Jet-Tagging/TrainTagger
Date submitted: Sept. 30, 2025, 7:42 a.m.
Submitted by: Stella Felice Schaefer
Submitted to: SciPost Physics Proceedings
Proceedings issue: The 2nd European AI for Fundamental Physics Conference (EuCAIFCon2025)
Ontological classification
Academic field: Physics
Specialties:
  • High-Energy Physics - Experiment
Approach: Experimental

Abstract

At the High Luminosity LHC, selecting important physics processes such as (di-) Higgs production will be a high priority. The Phase-2 Upgrade of the CMS Level-1 Trigger will reconstruct particle candidates and use pileup mitigation for the 200 simultaneous proton-proton interactions. A fast cone algorithm will reconstruct jets from these particles, providing access to jet constituents for the first time. We introduce a new multi-class jet tagger with a small, quantized DeepSets neural network. The tagger, trained on a mix of simulated CMS events, predicts various hadronic and leptonic classes. We present the tagger, its performance, and its improvements for triggering on (di-) Higgs events.

Current status:
Has been resubmitted

Reports on this Submission

Report #1 by Anonymous (Referee 1) on 2025-11-10 (Invited Report)

Strengths

Further deployment of processing algorithms on FPGAs, turning these into online
processing, is always appreciated and valuable.

Weaknesses

  • Model design procedure is not detailed.
  • FPGA deployment/synthesis workflow is not provided.
  • The data itself is not described.
  • The above weaknesses are most likely resulting from the page constraints. However, there is enough room to improve the data description.

Report

The topic and content are very relevant. My remarks are given in the "Requested changes" segment.

Requested changes

1- Introduction: Capitalise "particle flow (PF)" -> "Particle Flow (PF)" 2- All sections and subsection: These are not titles, thus only the first word is to be capitalised. 3- Section 2: You have mentioned that 10% of the data is reserved for testing and validation. What are the portions, individually? As testing and validation sets cannot be mixed. If the considered test set is rather small, it will not mitigate the risk of overfitting. 4- Section 2: Regarding the padding, is it constant or variable? 5- Section 2.1: "hardware and latency constraints" What are these constraints? Provide numbers. 6- Section 2.1: "DeepSet" is written as two separate words in the rest of the paper. 7- Section 3: "Fig. 2a" You have abbreviated figure term for sub-figures. Needs to be adjusted in LaTeX packages. 8- Section 3, paragraph 2: A new paragraph has to start indented. 9- Section 3: The class balance statistics of the dataset is not provided. How many classes and what are the proportions? 10- Generally speaking, plots are too small to be legible. 11- Section 5: Provide a reference or a footnote to the FPGA's specification page/document from the manufacturer. 12- Section 5: "models" -> "model's" 13- A code repository is linked to the SciPost submission. This should be added to the paper either as a reference (preferred if the link is permanent), or a footnote.

Recommendation

Ask for minor revision

  • validity: high
  • significance: ok
  • originality: good
  • clarity: ok
  • formatting: good
  • grammar: excellent

Login to report or comment