Identifying D Mesons from Radiative W Decays at the Large Hadron Collider

Bakos, Evelin; de Groot, Nicolo; Vranjes, Nenad

doi:10.3390/sym15101948

Open AccessArticle

Identifying D Mesons from Radiative W Decays at the Large Hadron Collider

by

Evelin Bakos

^1,2,*

,

Nicolo de Groot

²

and

Nenad Vranjes

¹

Institute of Physics, University of Belgrade, Pregrevica 118, 11080 Belgrade, Serbia

²

Institute for Mathematics, Astrophysics and Particle Physics (IMAPP), Radboud University and Nikhef, Heyendaalseweg 135, 6525 AJ Nijmegen, The Netherlands

^*

Author to whom correspondence should be addressed.

Symmetry 2023, 15(10), 1948; https://doi.org/10.3390/sym15101948

Submission received: 15 September 2023 / Revised: 12 October 2023 / Accepted: 17 October 2023 / Published: 20 October 2023

(This article belongs to the Section Physics)

Download

Browse Figures

Versions Notes

Abstract

:

In this paper, we present two machine learning algorithms to identify D mesons produced in a colour singlet state from radiative W boson decays at the LHC. The combined network algorithm is able to identify D mesons via its hadronic decays with an efficiency of 47% while suppressing a background of quark and gluon jets by a factor of 100. Using the developed algorithm, we perform a prospective study for the measurement of

B (W \to D_{s} γ)

.

Keywords:

standard model QCD processes; W-boson rare decays; jet tagging; machine learning

1. Introduction

The large amount of W bosons produced in

p p

collisions at the Large Hadron Collider (LHC) enables searches for exclusive hadronic decays. These decay modes can offer precision studies of QCD factorisation [1] and are sensitive to the coupling of the W boson with the photon. However, the searches for the hadronic decays are still challenging due to the large background dominated by various QCD processes. Of all these decay modes,

W \to D_{s} γ

has the largest branching fraction predicted by the Standard Model with the value of

B = (3.7 \pm 1.5) \times 10^{- 8}

. No such decay has been observed so far and the best upper limit is set by the LHCb collaboration with the value

B (W \to D_{s} γ) < 6.4 \times 10^{- 4}

at a 95% confidence level [2]. The limit is obtained by analysing

K^{+} K^{-} π^{+}

final states, which make up 5.4% of the

D_{s}

decays. This improves on an earlier limit of

B (W \to D_{s} γ) < 1.3 \times 10^{- 3}

set by the CDF collaboration [3], using only

ϕ (K^{+} K^{-}) π^{+}

and

{K *}^{0} K^{+}

final states, which comprise 3.9% of all

D_{s}

decays. The algorithm presented in this paper offers a new approach to identify D mesons specific to radiative W boson decay, using inclusive tagging, and is sensitive to all decays at the possible expense of higher backgrounds. As a proof of principle, we focus on the

D_{s}

meson because of its highest predicted branching ratio.

A recent study [4] demonstrated that jets originating from a radiative decay of a colour-singlet charmonium state can be distinguished from coloured jets. With machine learning algorithms, we can differentiate between jets originating from radiatively produced

D_{s}

mesons and background jets from gluons and quarks. The main characteristic is that they are produced without accompanying fragmentation tracks and produce isolated jets. With retraining, the algorithm offers an opportunity to identify other mesons originating from hadronic decays of colour-singlet states as well. This would improve future searches for these rare decays and could improve the measurement precision using data to be collected during the ongoing LHC Run 3.

In the following section, the simulation setup is described together with the algorithm where a deep neural network (DNN), a convolutional neural network (CNN), and a combined network are used to identify signal mesons. In Section 3, the results are presented and an overview is given of the network performance and stability. In Section 4, prospects for the search for

W \to D_{s} γ

are assessed.

2. Materials and Methods

2.1. Simulated Samples

Proton–proton collisions are simulated at 13.6 TeV to match the Run 3 data taking period of the LHC. The sample of

D_{s}

particles is obtained via the hadronic decay of the

W \to D_{s} γ

. The matrix element for the process

p p \to W

is generated at LO accuracy in QCD using MADGRAPHv5 [5,6]. The NN23LO1 PDF set [7] is used in the generation. The

W \to D_{s} γ

decay as well as parton showering and subsequent hadronisation are performed using Pythia8 [8] with the A14 ATLAS tune [9].

The main background processes (in terms of

D_{s}

identification) are

p p \to g g

and

p p \to q q

where g and q denotes a gluon and a quark, respectively. The background samples are modelled separately using the same setup as for the signal events.

The detector response is simulated via the Delphes [10] package using the ATLAS detector configuration files. Jets are reconstructed as pFlow jets with the anti-

k_{t}

[11] jet clustering algorithm with

Δ R =

0.4, and are required to satisfy

p_{T} >

25 GeV and

| η | <

2.1 selection criteria. Jets are considered as a

D_{s}

meson if the angular distance to the truth

D_{s}

particle is

Δ R < 0.2

. The entire configuration can be found in [12].

2.2. $D_{s}$ Identification Using Machine Learning Algorithm

The full set of

W \to D_{s} γ

signal sample consists of 180k events, the

q q

background sample contains 45k, and the

g g

background sample contains 30k events. In addition,

q q

and

g g

, 30k, 30k and 45k

Z \to Y / (J / ψ) / ϕ + γ

events are considered as background to ensure that the network is able to reject other colour-singlet states. This makes the full background sample with 160k events comparable to the signal. Before the training all the samples were divided into training and testing sets, consisting of 70% and 30% of the full dataset, respectively. To create the machine learning algorithm, TensorFlow [13] and Keras [14] libraries were used. To determine the model performance, we use the receiver operating characteristic (ROC) curve and in particular the area under the ROC curve (AuC). The network hyperparameters were optimised with grid search to make sure that the best performing models were used to obtain the results.

2.2.1. Deep Neural Network

Signal jets originate from the decay of an isolated

D_{s}

(not surrounded by fragmentation tracks) and will be more collimated than background jets. This is particularly true for gluon jets, since gluon-initiated jets have higher particle multiplicity and a softer fragmentation function, due to the large colour factor. Variables

Δ ϕ

and

Δ η

, which measure the width of the jet in the

ϕ

and

η

directions, as well as

R_{e m}

and

R_{t r a c k}

which measure the

Δ R

with respect to the jet axis in case of tracks and electromagnetic clusters can be used to distinguish jets originating from

D_{s}

from the background jets. The multiplicity of charged and neutral particles (

n_{c h}

and

n_{0}

) in jets originating from

D_{s}

is lower compared to jets from quarks and gluons. From the lower constituent multiplicity it can also be deducted that signal jets have lower invariant mass. The

m_{t r}

measures the invariant mass of all charged tracks while

m_{j}

defines the invariant mass of all constituents in the jet. Jets emerging from

D_{s}

mesons are also less surrounded with hadronic activity caused by the fragmentation. The

p_{c o r e}

and

f_{c o r e}

measure the ratio of sum

p_{T}

in a cone and the jet

p_{T}

, and the ratio of sum

E_{T}

in a cone and the jet total

E_{T}

, respectively. The

E_{h a d} / E_{e m}

defines the energy ratio in the hadronic and the electromagnetic calorimeter.

We start with the variables used in [4]. These variables are further extended with the absolute values of the total charge and the jet-charge (

p_{T}

weighted charge sum [15]). The charge is expected to peak at zero for gluon jets, at one for signal jets, and have a higher average value for quark jets. In addition, with the b-jet tagging we gain some discriminating power against b-jets. A particular class of generalised angularities (

λ_{β}^{k}

) [16] are also added to the algorithm, which are efficient in distinguishing quark jets from gluon jets.

Furthermore, the N-Subjettiness [17] is also used, which measures to what degree the jet is composed of N subjets. For our signal jets, the N-subjettiness was expected to be close to zero, since all the radiation is aligned with the direction of the jet, meaning N (or fewer) subjets.

g g

background jets have

τ_{N} > >

0, since a large fraction of their energy is distributed away from the jet direction. All the variables used for the ML algorithm are listed in Table 1 and also shown in Figure 1.

Based on the optimisation results, the final model consists of one input layer and two hidden layers with 35, 20 and 12 nodes, respectively. The activation function for the input layer and both hidden layers is tanh. As is common with classification problems, the output layer is activated with the sigmoid function. The full set of hyperparameters is summarised in Table 2. A feature importance plot for the DNN network is also presented in Figure 2. It can be seen that the most important features are the charge ratio of the hadronic and electromagnetic energy deposit, and the N-Subjettiness, while the

R_{e m}

variable has very little impact on the network performance. This indicates kinematics of the generated sample does not have a major impact on the obtained results.

2.2.2. Convolutional Neural Network

Another approach for developing a

D_{s}

identification algorithm is to use a CNN. In this case the input variables are low level variables: energy deposited in the electromagnetic and the hadronic calorimeter, and track transverse momentum, which are plotted as a 3D image. The advantage of this approach is that one can use relatively raw data instead of carefully constructed variables.

In the context of this analysis, these energy deposits and the track transverse momentum are converted into a 20 × 20 jet image. Since the jet reconstruction parameter is

Δ R = 0.4

, and the segmentation of the ECAL is 0.02 × 0.02, the grid size of the jet image is equal to the smallest possible tower size in the

η

-

ϕ

plane. The variables are introduced in three different channels as is the case of an RGB picture, where the hadronic deposit is noted with blue, the electromagnetic deposit with green and the track transverse momentum with red. The schematic illustration of the jet image is shown in Figure 3.

Our CNN model consists of 5 layers: 3 convolutional and 2 fully connected dense layers. The number of nodes in the convolutional layers are 30, 8 and 8, respectively. The window sizes are [3 × 3] and [5 × 5] in the last layer, while the activation function is tanh in all three cases. A maxpooling layer is added after the second convolutional layer. The number of nodes in the first dense layers is 10 with the ReLU activation function. The output layer is again a dense sigmoid. The parameters of the final CNN model are summarised in Table 2.

2.2.3. Combined Network

To further improve the efficiency of the network, the DNN and the CNN models are merged into a single network. In this case, the output of the DNN and the output of the CNN are the inputs of the next combined layer. The last layer of the model performs the classification and the results depend both on the output of the CNN and the DNN.

The best performing combined network has slightly different number of nodes within the DNN layers: 33, 20 and 14, respectively. Another significant change compared to the previously introduced models is the absence of the dense layers after the convolutional layers. Instead, a combined dense layer is introduced with 8 nodes and ReLu activation function. The classification happens in the last sigmoid layer. The parameters of the combined model are summarised in the last column of Table 2.

3. Results

The ROC curves of the different models are presented in Figure 4, while the output distributions of the models can be seen in Figure 5. Table 3 shows the AuC values of the different networks defined previously. As is expected, the combined model performs the best with 0.956, which corresponds to a signal efficiency of 47% at a background rejection factor of 100 or 15% at a background rejection factor of 1000. Using DNN only one can reach a signal efficiency of 38% for a background rejection factor of 100 or 15% for 1000, while using only CNN the efficiency is 35% at 100 or 9% at 1000 times background rejection. As it can be seen, the performance is significantly better against a single background of gluon jets than against quark jets. This can be further improved if one uses only a gluon sample for training to an AuC of 0.991.

The tagging rate of the network for various samples used and not used during the training is presented in Table 4. Here a cut-off value of 0.75 is used. We find that for charm jets the results are not materially different from the generic quark-jet sample and this indicates that the absence of fragmentation tracks around the jets and a narrow jet with low multiplicity are more important than the exact D-meson decay topology. For hadronic

τ

decays, we find a high tagging rate, which is not surprising, given that

τ

leptons are also produced in a colour-singlet state and more than 5% of the

D_{s}

mesons decay to

τ

s.

We investigate the stability of the network performance under variations of the simulation parameters. To study this, we apply the recommended variations of the Pythia8 framework. These variations cover a range of possible events that differ from the base simulation: variation 1 is related to the underlying event activity, variation 2 covers the jet shapes and substructure, and the three variations 3 cover the effects of initial and final state radiation. The results of the variance in the model performance is presented in Table 5.

The effect of pileup is also taken into account during the analysis. Within the Delphes framework the additional tracking and vertexing information is not available, meaning that our estimate is worse than the real life conditions on the LHC experiments. We simulated samples with a pileup of

〈 μ 〉 =

40 meaning on average 40 pileup interaction, which is the expected amount for LHC Run 3 conditions. The retrained network, without further optimisation, shows a drop of 0.076 in the AuC, meaning that, while pileup has a significant effect, the model is still able to identify

D_{s}

mesons. One can note, however, that pileup mitigation techniques implemented in Delphes are suboptimal; hence, the expected effect with real data is smaller.

4. Discussion

In this section prospects for the measurement of

B (W \to D_{s} γ)

using the method described previously are studied. For the purpose of this exercise it is assumed that low-pileup data corresponding to the integrated luminosity of 1 fb

^{- 1}

are collected during LHC Run 3. Events are required to have one jet tagged as

D_{s}

and an isolated photon with

p_{T} >

30 GeV. Events with invariant mass of jet-photon system ±10 GeV around W boson mass are selected. Triggering efficiency is assumed to be 100%. The optimised network cut-off of 0.75 provides the best sensitivity. Total signal efficiency for

W^{+} \to D_{s} γ (W^{-} \to D_{s} γ)

is estimated to be 15.5% (18.7%), respectively.

In order to estimate background level, large MC samples of

p p \to g g

and

p p \to q q

, as well as

p p \to q γ

,

Z \to e e

, and

Z \to τ τ

are generated with MADGRAPHv5 and Pythia8. The detector response is simulated via the Delphes package using the ATLAS detector configuration files. Backgrounds are normalised according to their generated cross-sections. The total level of background is estimated to be 930,000 events corresponding to the integrated luminosity of 1 fb

^{- 1}

. The background is dominated by the QCD process while less than 1% of the total background arises from Z boson events. Figure 6 shows the distribution of

D_{s}

tagged jet-plus-photon invariant mass for the backgrounds and

W \to D_{s} γ

signal normalised to the integrated luminosity of 1 fb

^{- 1}

. The signal histogram is overlaid and scaled by a factor of 10,000.

The

C L_{s}

method [18,19] is used to calculate the upper limit on the branching fraction of the

W \to D_{s} γ

decay. The expected number of signal plus background events is

N_{e x p} = ϵ σ_{p p \to W} B (W \to D_{s} γ) \int L d t + N_{b g},

(1)

where

ϵ

is event selection efficiency of the signal,

σ_{p p \to W}

is the inclusive production cross-section for the W boson evaluated at the NNLO in QCD,

\int L d t

is the integrated luminosity, and

N_{b g}

is the expected number of background events. Uncertainties on

ϵ

,

\int L d t

, and

N_{b g}

are assumed to be Gaussian, and correlations between these are neglected. In this study total signal uncertainty is assumed to be 10% and has only marginal impact on the calculated limit. The uncertainty on the background level is assumed to be 0.5% as obtained in the ATLAS search for radiative Higgs boson decay [20]. The upper 95% CL (confidence level) limit on the “signal strength”

σ_{p p \to W} B (W \to D_{s} γ)

, with production cross-section fixed, is set using Poisson statistics and the above equation. The limit is obtained with

C L_{s} = C L_{s + b} / C L_{b} \leq 0.05

, where

C L_{s + b}

is the confidence level for signal and background, and

C l_{b}

is the confidence level for the background alone.

The calculated

C L_{s}

exclusion as a function of branching fraction of

W \to D_{s} γ

is shown in Figure 7.

The expected upper limit at the 95% CL is determined to be:

B (W \to D_{s} γ) < (2.87 \pm 0.22) \times 10^{- 4},

(2)

which is by a factor of two compared to the observed upper limit from LHCb.

With the entire Run 3 dataset corresponding to about 300 fb

^{- 1}

, assuming trigger efficiency of 40% and taking into account deterioration of the

D_{s}

tagger due to high pileup, the expected upper limit improves to

B (W \to D_{s} γ) < 1.6 \times 10^{- 4}

. Development of a dedicated trigger is needed to achieve corresponding precision.

5. Conclusions

The algorithm to identify jets originating from

D_{s}

mesons in radiative W decays presented in this paper shows a good efficiency of 47% for signal with a 100 times rejection of jets from quarks and gluons. Against a single background of gluon jets, the algorithm works even better. The algorithm is stable under the variations of the simulation parameters and it also works in the presence of pileup but at a significant loss of performance. The algorithm opens up the possibility of further improving measurements and searches involving D mesons, especially in the case of the rare decays that suffer from low statistics. We find very similar performance for a deep neural network and a convolutional neural network, with a combined network of the two performing best. With a low pileup dataset corresponding to the integrated luminosity of a 1 fb

^{- 1}

upper limit on the branching fraction of

W \to D_{s} γ

decay can be determined at the level of

B (W \to D_{s} γ) < 2.9 \times 10^{- 4}

.

Author Contributions

Conceptualization, N.d.G.; methodology, E.B., N.d.G. and N.V.; software, E.B.; validation, E.B., N.d.G. and N.V.; formal analysis, E.B.; investigation, E.B.; resources, E.B.; data curation, E.B.; writing—original draft preparation, E.B., N.d.G. and N.V.; writing—review and editing, E.B., N.d.G. and N.V.; visualization, E.B.; supervision, N.d.G. and N.V.; project administration, E.B.; funding acquisition, N.d.G. and N.V. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

Configuration files for data generation and analysis software can be found in github at https://github.com/ebakos/DsGammaAnalysis. Data files are available at request.

Acknowledgments

This work was partially supported by the Serbian Ministry of Education, Science and Technological Development and the EU Erasmus plus programme.

Conflicts of Interest

The authors declare no conflict of interest.

References

Grossman, Y.; König, M.; Neubert, M. Exclusive Radiative Decays of W and Z Bosons in QCD Factorization. J. High Energy Phys. 2015, 2015, 101. [Google Scholar] [CrossRef]
Aaij, R.; Abdelmotteleb, A.S.W.; Beteta, C.A.; Abudinén, F.; Ackernley, T.; Adeva, B.; Adinolfi, M.; Adlarson, P.; Afsharnia, H.; Agapopoulou, C.; et al. Search for the rare decays W⁺→ $D_{s}^{+}$ γ and Z→D⁰γ at LHCb. Chin. Phys. C 2023, 47, 093002. [Google Scholar] [CrossRef]
Abe, F.; Akimoto, H.; Akopian, A.; Albrow, M.G.; Amadon, A.; Amendolia, S.R.; Amidei, D.; Antos, J.; Aota, S.; Apollinari, G.; et al. Search for the rare decay W^±→ $D_{s}^{+ -}$ γ in $p \bar{p}$ collisions at $\sqrt{s}$ = 1.8 TeV. Phys. Rev. D 1998, 58, 091101. [Google Scholar] [CrossRef]
de Groot, N.; Castells, S. Identifying hadronic charmonium decays in hadron colliders. SciPost Phys. Core 2020, 2, 8. [Google Scholar] [CrossRef]
Alwall, J.; Demin, P.; de Visscher, S.; Frederix, R.; Herquet, M.; Maltoni, F.; Plehn, T.; Rainwater, D.L.; Stelzer, T. MadGraph/MadEvent v4: The New Web Generation. J. High Energy Phys. 2007, 2007, 28. [Google Scholar] [CrossRef]
Alwall, J.; Frederix, R.; Frixione, S.; Hirschi, V.; Maltoni, F.; Mattelaer, O.; Shao, H.S.; Stelzer, T.; Torrielli, P.; Zaro, M. The automated computation of tree-level and next-to-leading order differential cross sections, and their matching to parton shower simulations. J. High Energy Phys. 2014, 2014, 79. [Google Scholar] [CrossRef]
Ball, R.D.; Bertone, V.; Carrazza, S.; Debbio, L.D.; Forte, S.; Groth-Merrild, P.; Guffanti, A.; Hartl, N.P.; Kassabov, Z.; Latorre, J.I.; et al. Parton distributions from high-precision collider data. Eur. Phys. J. C 2017, 77, 663. [Google Scholar] [CrossRef]
Sjostrand, T.; Mrenna, S.; Skands, P.Z. A Brief Introduction to PYTHIA 8.1. Comput. Phys. Commun. 2008, 178, 852–867. [Google Scholar] [CrossRef]
Buckley, A. ATLAS Pythia 8 Tunes to 7 TeV Data; Technical Report ATL-PHYS-PUB-2014-021; CERN: Geneva, Switzerland, 2014. [Google Scholar]
de Favereau, J.; Delaere, C.; Demin, P.; Giammanco, A.; Lemaître, V.; Mertens, A.; Selvaggi, M. DELPHES 3, A modular framework for fast simulation of a generic collider experiment. J. High Energy Phys. 2014, 2014, 57. [Google Scholar] [CrossRef]
Cacciari, M.; Salam, G.P.; Soyez, G. The anti-k_t jet clustering algorithm. J. High Energy Phys. 2008, 2008, 63. [Google Scholar] [CrossRef]
Bakos, E. DsGammaAnalysis. 2022. Available online: https://github.com/ebakos/DsGammaAnalysis (accessed on 15 September 2023).
Abadi, M.; Barham, P.; Chen, J.; Chen, Z.; Davis, A.; Dean, J.; Devin, M.; Ghemawat, S.; Irving, G.; Isard, M.; et al. TensorFlow: A system for large-scale machine learning. arXiv 2016, arXiv:1605.08695. [Google Scholar]
Keras. 2015. Available online: https://github.com/fchollet/keras (accessed on 14 September 2023).
Field, R.D.; Feynman, R.P. A Parametrization of the Properties of Quark Jets. Nucl. Phys. B 1978, 136, 1. [Google Scholar] [CrossRef]
Larkoski, A.J.; Thaler, J.; Waalewijn, W.J. Gaining (Mutual) Information about Quark/Gluon Discrimination. J. High Energy Phys. 2014, 2014, 129. [Google Scholar] [CrossRef]
Thaler, J.; Van Tilburg, K. Identifying Boosted Objects with N-subjettiness. J. High Energy Phys. 2011, 2011, 15. [Google Scholar] [CrossRef]
Read, A.L. Presentation of search results: The CL_s technique. J. Phys. G 2002, 28, 2693–2704. [Google Scholar] [CrossRef]
Junk, T. Confidence level computation for combining searches with small statistics. Nucl. Instrum. Meth. A 1999, 434, 435–443. [Google Scholar] [CrossRef]
Aaboud, M.; Aaboud, A.; Aad, G.; Abbott, B.; Abdinov, O.; Abeloos, B.; Abidi, S.H.; AbouZeid, O.S.; Abraham, N.L.; Abramowicz, H.; et al. Search for exclusive Higgs and Z boson decays to ϕγ and ργ with the ATLAS detector. J. High Energy Phys. 2018, 2018, 127. [Google Scholar] [CrossRef]

Figure 1. Distributions of the variables used for

D_{s}

identification, using DNN. The signal is presented with a solid blue line, while the

g g

and

q q

backgrounds are drawn with dashed red and dotted green lines, respectively.

Figure 1. Distributions of the variables used for

D_{s}

identification, using DNN. The signal is presented with a solid blue line, while the

g g

and

q q

backgrounds are drawn with dashed red and dotted green lines, respectively.

Figure 2. Feature importance plot of DNN. The blue bars represent the weight of each feature (variable) within the network.

Figure 3. Jet image construction from low level variables, where (a) shows 2 different signal and (b) 2 different background events. The hadronic deposit is noted with blue circle pattern, the electromagnetic deposit with green square grid and the track transverse momentum with red star pattern composing an RGB input picture to the CNN algorithm.

Figure 4. ROC curves for the different network types.

Figure 5. Output of the different networks for background (blue dotted pattern) and signal (red square grid pattern).

Figure 6. Distribution of the invariant mass of

D_{s}

tagged jet-plus-photon system. The signal is scaled with a factor of 10

^{4}

.

Figure 6. Distribution of the invariant mass of

D_{s}

tagged jet-plus-photon system. The signal is scaled with a factor of 10

^{4}

.

Figure 7. Expected upper limit on branching fraction of the

W \to D_{s} γ

decay. The vertical line corresponds to

C L_{s} =

0.05. The branching fractions higher than

2.87 \times 10^{- 4}

are excluded at 95% CL.

Figure 7. Expected upper limit on branching fraction of the

W \to D_{s} γ

decay. The vertical line corresponds to

C L_{s} =

0.05. The branching fractions higher than

2.87 \times 10^{- 4}

are excluded at 95% CL.

Table 1. DNN input variables.

Name	Description
$Δ η$	width of the jet in $η$
$Δ ϕ$	width of the jet in $ϕ$
$m_{t r}$	invariant mass of all charged tracks in the jet
$m_{j}$	invariant mass of all constituents of the jet
$n_{c h}$	charged particle multiplicity
$n_{0}$	neutral particle multiplicity
$\| Q \|$	absolute value of the total charge
$\| q_{j} \|$	jet charge ( $p_{T}$ weighted charge sum, $Σ_{i} q_{i} \cdot p_{T i}^{1 / 2} / Σ_{i} p_{T i}^{1 / 2}$ )
b-tag	output of the b-tagging algorithm
$R_{e m}$	average $Δ R$ with respect to the jet axis weighted by electromagnetic energy
$R_{t r a c k}$	$p_{T}$ weighted average $Δ R$ for tracks
$f_{e m}$	fraction of EM energy over total neutral energy of the jet
$p_{c o r e 1}$	ratio of sum $p_{T}$ in a cone of $Δ R <$ 0.1 and the jet $p_{T}$
$p_{c o r e 2}$	ratio of sum $p_{T}$ in a cone of $Δ R <$ 0.2 and the jet $p_{T}$
$f_{c o r e 1}$	ratio of sum ET in a cone of $Δ R <$ 0.1 and the jet total ET
$f_{c o r e 2}$	ratio of sum ET in a cone of $Δ R <$ 0.2 and the jet total ET
$f_{c o r e 3}$	ratio of sum ET in a cone of $Δ R <$ 0.3 and the jet total ET
${(p_{T}^{D})}^{2}$	$λ_{0}^{2}$
LHA	Les Houches Angularity; $λ_{0.5}^{1}$
Width	$λ_{1}^{1}$
Mass	$λ_{2}^{1}$
$E_{h a d} / E_{e m}$	ratio of the hadronic versus electromagnetic energy deposited in the calorimeter
$τ_{0}$ , $τ_{1}$ , $τ_{2}$	N-Subjettiness

Table 2. Hyperparameters of the different network types.

Parameter	DNN	CNN	Combined
Dense layer nodes	35—20—12—1	–	33—20—14
Dense layer activation	tanh—tanh—tanh—sigmoid	–	tanh—tanh—tanh
Convolutional layer nodes	–	30—8—8	30—8—8
Window size	–	[3 × 3], [3 × 3], [5 × 5]	[3 × 3], [3 × 3], [5 × 5]
Convolutional layer activation	–	tanh—tanh—tanh	tanh—tanh—tanh
Max pooling	–	After the 1st convolutional layer
Dense layers after convolution	–	10(relu)—1(sigmoid)	–
Combined layer nodes	–	–	8—1
Combined layer activation	–	–	relu—sigmoid
Loss function	binary cross-entropy
Optimiser	Adam
Training epochs	40
Batch size	1024

Table 3. Overview of the training results using the combined network. Mixed background test samples contain 50% quark and 50% gluon jets.

Network Type	Test Sample	Training Sample	AuC
DNN	$D_{s}$ vs. mixed	$D_{s}$ vs. mixed	0.939
CNN	$D_{s}$ vs. mixed	$D_{s}$ vs. mixed	0.938
Combined	$D_{s}$ vs mixed	$D_{s}$ vs mixed	0.956
	$D_{s}$ vs. gluon	$D_{s}$ vs. mixed	0.987
	$D_{s}$ vs. quark	$D_{s}$ vs. mixed	0.935
	$D_{s}$ vs. gluon	$D_{s}$ vs. gluon	0.991
	$D_{s}$ vs. quark	$D_{s}$ vs. quark	0.946

Table 4. Jet tagging rate for different samples. For

c \bar{c}

and

b \bar{b}

samples, the tagging rate is separately evaluated for events, where the jet contains a truth

D_{s}

.

Table 4. Jet tagging rate for different samples. For

c \bar{c}

and

b \bar{b}

samples, the tagging rate is separately evaluated for events, where the jet contains a truth

D_{s}

.

Sample	Tagging Rate
$D_{s}$ $γ$	79%
$q q$	9%
$g g$	1%
$τ τ$	62%
$Y γ$	3%
( $J / ψ$ ) $γ$	16%
$ϕ γ$	12%
	Jet with a truth $D_{s}$	Jet without a truth $D_{s}$
$c \bar{c}$	9%	7%
$b \bar{b}$	1%	3%

Table 5. Variations in the AuC for different Pythia8 tunes.

Parameter	+Variation	−Variation
Var1: UE activity	−0.008	0.003
Var2: jet shapes and substructure	−0.001	0.010
Var3a: ISR/FSR $t \bar{t}$ gap	−0.002	0.007
Var3b: ISR/FSR 3/2 jet ratio	−0.011	0.002
Var3c: ISR	−0.007	0.006

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Bakos, E.; de Groot, N.; Vranjes, N. Identifying D Mesons from Radiative W Decays at the Large Hadron Collider. Symmetry 2023, 15, 1948. https://doi.org/10.3390/sym15101948

AMA Style

Bakos E, de Groot N, Vranjes N. Identifying D Mesons from Radiative W Decays at the Large Hadron Collider. Symmetry. 2023; 15(10):1948. https://doi.org/10.3390/sym15101948

Chicago/Turabian Style

Bakos, Evelin, Nicolo de Groot, and Nenad Vranjes. 2023. "Identifying D Mesons from Radiative W Decays at the Large Hadron Collider" Symmetry 15, no. 10: 1948. https://doi.org/10.3390/sym15101948

APA Style

Bakos, E., de Groot, N., & Vranjes, N. (2023). Identifying D Mesons from Radiative W Decays at the Large Hadron Collider. Symmetry, 15(10), 1948. https://doi.org/10.3390/sym15101948

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Identifying D Mesons from Radiative W Decays at the Large Hadron Collider

Abstract

1. Introduction

2. Materials and Methods

2.1. Simulated Samples

2.2. $D_{s}$ Identification Using Machine Learning Algorithm

2.2.1. Deep Neural Network

2.2.2. Convolutional Neural Network

2.2.3. Combined Network

3. Results

4. Discussion

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

Identifying D Mesons from Radiative W Decays at the Large Hadron Collider

Abstract

1. Introduction

2. Materials and Methods

2.1. Simulated Samples

2.2. D s Identification Using Machine Learning Algorithm

2.2.1. Deep Neural Network

2.2.2. Convolutional Neural Network

2.2.3. Combined Network

3. Results

4. Discussion

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

2.2. $D_{s}$ Identification Using Machine Learning Algorithm