1 Introduction

Since the discovery of the Higgs boson in 2012, by the ATLAS [1] and CMS [2] collaborations, measurements of Higgs boson properties have remained consistent with the expectations of the Standard Model (SM). The Higgs boson mass has been determined, measurements of the Higgs coupling to fermions and gauge bosons have been performed (see, e.g. Refs. [3, 4]), and differential cross sections have been measured by the ATLAS and CMS collaborations. Since the differential cross sections of the Higgs boson can be predicted with high precision, these measurements can be used to search for deviations from the Standard Model and probe the effect of higher-order corrections in perturbation theory.

This paper presents measurements of differential cross sections for Higgs boson production via gluon–gluon fusion (ggF) and decay into \(WW^{*} \rightarrow e\nu \mu \nu \). Measuring the decay of the Higgs boson in the WW channel is an important component of fully characterising Higgs measurements in all of its decay states. The measurements are reported in a fiducial phase space, which minimizes extrapolations and therefore the model dependence of the results. Measurements of Higgs boson decays into WW bosons are well motivated because the large branching fraction makes the measurement competitive with the cleaner \(\gamma \gamma \) and ZZ channels. Although the presence of neutrinos in the WW final states makes it impossible to fully reconstruct the final state kinematics, the high branching fraction makes the measurement competitive with fully leptonic final states. In addition, the Higgs boson’s \(WW^{*} \rightarrow e\nu \mu \nu \) decay mode has an advantage when compared with the decay into b-quark pairs, since the Higgs boson produced via ggF is not required to be highly boosted, allowing a detailed study of the full Higgs transverse momentum spectrum. Finally, the \(e\nu \mu \nu \) final state is considered in order to avoid large backgrounds from Drell–Yan production that appear in the same flavour channels.

The analysis is performed using the full Run 2 dataset, corresponding to an integrated luminosity of 139 \({\text {fb}}^{-1}\), collected with the ATLAS detector during 2015–2018 at a centre-of-mass energy \(\sqrt{s}= 13\,\text {TeV} \). Final states with at most one jet are considered. Final states with two jets are not considered in order to avoid large backgrounds from top production processes. The small expected contributions from Higgs production in the vector-boson fusion (VBF) and associated production (VH) processes are fixed to their SM expectations and considered as background. Previous ATLAS measurements of the differential cross section in this channel were performed on 20.3 \({\text {fb}}^{-1}\) of data collected at a centre-of-mass energy of \(\sqrt{s} = 8\,\text {TeV} \) [5], while the inclusive cross-section measurement in this channel was performed on 139 \({\text {fb}}^{-1}\) of \(\sqrt{s} = 13\,\text {TeV} \) data [6]. The CMS Collaboration published a measurement of the differential and inclusive cross sections in this channel using 137 \({\text {fb}}^{-1}\) of data collected at a centre-of-mass energy of \(\sqrt{s} = 13\,\text {TeV} \) [7]. Compared to the previous ATLAS result, the present measurement introduces a refinement in that the signal in each interval of the observable under consideration is extracted by means of a fit to a kinematic variable. Compared to the previous results obtained by subtracting expected background contributions from the data, the current method, which employs a fitting procedure gives an improved sensitivity and allows for a more comprehensive treatment of systematic uncertainties. The ATLAS and CMS collaborations have also performed differential cross-section measurements for the cases where the Higgs boson decays into \(ZZ \rightarrow 4\ell \) and \(\gamma \gamma \) [8, 9].

The measurements of the differential cross sections are corrected for detector effects to a fiducial phase space defined at the particle level, through a procedure referred to as unfolding. Defining the selection criteria at the particle level removes the dependence on the theoretical modelling and facilitates direct comparison with a wide variety of theoretical predictions. The measurements are almost insensitive to the uncertainties on the ggF SM predictions to which they are compared.

The theoretical prediction of the differential cross section as a function of the Higgs boson transverse momentum (\(p_{\text {T}} ^{H} \)) is calculated at next-to-next-to-leading order (NNLO) in quantum chromodynamics (QCD). In ggF production, the transverse momentum of the Higgs boson is balanced by soft quark and gluon emission, enabling the low-momentum region of this distribution to test the non-perturbative effects, the modelling of soft-gluon emissions calculated using resummation, and loop effects sensitive to the Higgs–Charm Yukawa coupling. The low \(p_{\text {T}} ^{H} \) region is also sensitive to the interactions of the charm and bottom quarks with the Higgs boson. The high \(p_{\text {T}} ^{H} \) region is sensitive to perturbative QCD effects, as well as possible contributions from ‘beyond the Standard Model’ (BSM) physics, and higher-dimensional effective operators in BSM Lagrangians [10, 11].

The production kinematics of the Higgs boson produced in gluon fusion in a pp collision can be described by jet kinematic variables that include the number of jets and the rapidity of the leading jet (\(|y_{j0}|\)).Footnote 1 The \(|y_{j0}|\) distribution probes the theoretical modelling of hard gluon and quark emission. Measuring the cross section as a function of the number of jets provides information about the relative contributions of different Higgs production mechanisms. Processes with at most one jet are dominated by ggF and probe the higher-order QCD contributions to this production mechanism. In addition to measuring quantities sensitive to Higgs boson production, observables sensitive to its \(WW^{*} \rightarrow e\nu \mu \nu \) decay are also explored. These include leptonic kinematic observables such as the leading lepton’s transverse momentum (\(p_{\text {T}} ^{\ell 0}\)) and the dilepton system’s transverse momentum (\(p_{\text {T}} ^{\ell \ell }\)), invariant mass (\(m_{\ell \ell }\)), rapidity (\(y_{\ell \ell }\)), azimuthal opening angle (\(\Delta \phi _{\ell \ell }\)), and also \(\cos \theta ^*\), defined as \(\cos \theta ^* = |\tanh (\tfrac{1}{2}\Delta \eta _{\ell \ell })|\) (where \(\Delta \eta _{\ell \ell }\) is the lepton pseudorapidity difference). The observable \(y_{\ell \ell }\) is highly correlated with the rapidity of the reconstructed Higgs boson (\(y_{H}\)), which is known to be sensitive to the parton distribution function (PDF) [12]. The observable \(\cos \theta ^*\) is longitudinally boost-invariant and sensitive to the spin structure of the produced diparticle pairs, as discussed in Ref. [13].

Finally, double-differential measurements are made, in which the Higgs production cross section is explored as functions of two variables simultaneously, with one sensitive to the production kinematics and the other to the decay kinematics. Specifically, the cross section is measured as a function of 0-jet and 1-jet bins versus \(p_{\text {T}} ^{\ell 0}\), \(p_{\text {T}} ^{\ell \ell }\), \(m_{\ell \ell }\), \(y_{\ell \ell }\), \(\Delta \phi _{\ell \ell }\), and \(\cos \theta ^*\).

2 ATLAS detector

The ATLAS detector [14] is a multipurpose detector at the Large Hadron Collider at CERN, and almost fully covers the solid angle around the collision point. The innermost part of the detector covers the pseudorapidity range \(|\eta | < 2.5\) and is used to gather information about the momentum and the sign of the charge of charged particles. This inner detector (ID) consists of a silicon Pixel detector, a silicon microstrip tracker, and a straw-tube transition-radiation tracker. The Pixel detector includes a new innermost layer, the insertable B-layer [15, 16], added before Run 2. A thin superconducting solenoid surrounds the ID, providing it with a 2 T axial magnetic field. The inner detector is surrounded by electromagnetic and hadronic calorimeters. The electromagnetic (EM) calorimeter, consisting of lead absorbers and liquid argon (LAr), provides electromagnetic and hadronic energy measurements. It consists of a barrel component in the centre of the detector, covering \( |\eta |< 1.475\), and two EM endcaps covering \(1.375< |\eta | < 3.2 \). In addition, copper–LAr hadronic endcaps are placed behind the EM endcaps to capture hadronic energy in the \(1.5< |\eta | < 3.2\) region.

The EM calorimeter is surrounded by the steel and scintillator hadronic Tile calorimeter (TileCal), which measures the energy of hadrons. The Tile calorimeter consists of a central barrel covering \(|\eta | < 1.0\), and two extended barrels in the forward regions, covering \( 0.8< |\eta | < 1.7\). The regions closest to the beam pipe are covered by forward calorimeters (FCal), which cover \( 3.1< |\eta | <4.9 \) and consist of three modules. The first module is made of copper–LAr and is used for electromagnetic measurements and the beginning of the hadronic shower. The other two modules consist of tungsten and LAr and complete the hadronic measurements.

The calorimeters are surrounded by a muon spectrometer (MS) with superconducting toroidal magnets made of eight air-core barrel loops and two endcaps. The toroid magnets produce a non-uniform magnetic field, resulting in a bending power ranging from 2.0 to 6.0 T m. The MS has a high-precision tracking chamber system covering \(|\eta | <2.7\) and consisting mainly of monitored drift tubes but complemented by cathode-strip chambers in the forward directions, where the background is highest. The MS uses resistive-plate chambers in the barrel and thin-gap chambers in the endcaps for triggering within \(|\eta | <2.4\) and to provide track coordinates in the non-bending plane.

A two-level trigger system is used to select events of interest. The first-level trigger is implemented in hardware and uses a subset of the detector information to accept events from the 40 MHz proton bunch crossings at a rate below 100 kHz. This is followed by a software-based trigger, which reduces the accepted event rate to 1 kHz on average depending on the data-taking conditions.

An exhaustive software suite [17] is used in data simulation, in the reconstruction and analysis of real and simulated data, in detector operations, and in the trigger and data acquisition systems of the experiment.

3 Signal and background modelling

The signal and background modelling is described using various generators to model the hard-scatter process, parton shower (PS), hadronization and underlying event (UE). The Higgs boson can be produced through ggF processes, vector-boson fusion (VBF), in association with a W or Z boson (VH) or with a top-quark pair (\(t\bar{t}H\)). There are other production modes as well, but those are not included in this analysis as their expected contributions are negligible. These include Higgs bosons produced with a bottom-quark pair (\(b\bar{b}H\)) or a single top quark (tH, with either an additional W boson, tWH, or an additional b-quark and light quark, tHqb). In the combined 0-jet and 1-jet signal regions (see Sect. 4), the non-ggF production modes make up 6.9% of the total expected Higgs boson event yield, with 60% of this contribution coming from VBF production.

The primary ggF signal process was modelled using  NNLOPS [18,19,20,21,22] interfaced with  [23]. The 0-jet, 1-jet and 2-jet cross sections are predicted at NNLO, NLO and LO, respectively. To obtain NNLO accuracy for observable distributions, the Higgs boson rapidity spectrum in Hj-MiNLO  [24,25,26] was reweighted to that of HNNLO [27]. The resulting Higgs boson transverse momentum spectrum is compatible with the fixed-order HNNLO calculation and the calculation [28, 29]. The AZNLO tune [30] of the parameter values was used for the PS, UE and hadronization. The ggF process is normalized to the N\(^{3}\)LO cross section in QCD with NLO electroweak corrections [31,32,33,34,35,36,37,38,39,40,41]. For the results, a comparison is made between the ggF process generated by and an alternative generation performed by  [42] with the  [43] PDF set.

The VBF events were generated with , interfaced with and normalized to cross sections from NLO QCD with EW corrections, with an approximate NNLO QCD correction applied [44,45,46]. The VH production process was generated with MiNLO, and interfaced with . The VH cross section is calculated at NNLO in QCD with NLO electroweak corrections for \(q\bar{q}/qg \rightarrow VH\) and at NLO and next-to-leading-logarithm accuracy in QCD for \(gg \rightarrow ZH\) [47,48,49,50,51,52,53].

For the ggF, VBF and VH predictions, the  [54] PDF set was used. All of the Higgs boson cross sections and branching fractions are normalized in accordance with Hdecay [55,56,57] and  [58,59,60] calculations, which provide NLO EW corrections and account for interference effects. The samples were generated assuming a Higgs boson mass of 125 \(\text {GeV}\). The signal predictions described are not an extensive list of all available Monte Carlo simulations, and were chosen because of their applicability to this phase space.

The main sources of SM background that could resemble the signal process are the production of top quarks, dibosons, Z+jets, W+jets and multijets. The top-quark pair production process was simulated using in the framework using the PDFs, and interfaced with using PDFs for parton showering. The associated production of top quarks with W bosons, Wt, was modelled using the generator at NLO in QCD, using the five-flavour scheme and the set of PDFs. The diagram removal scheme [61] was used to remove interference between Wt and \(t{\bar{t}}\)  production. The \(t\bar{t}\) process is normalized using cross sections calculated at NNLO+NNLL [62], while the Wt process is normalized to NNLO. The alternative top-quark samples used to estimate certain systematic uncertainties were produced using MG5_aMC@NLO [42].

The WW background sample was generated using interfaced with PDFs [63]. The sample was generated with up to one jet at NLO accuracy and with two or three jets at LO accuracy. The loop-induced gg-initiated diboson processes were simulated by with up to one additional jet, interfaced with for the UE and PS, and used the PDF set.

The WZ and ZZ samples were generated with , using the PDF set, at NLO accuracy for up to one jet and at LO accuracy for two or three jets. Events containing three vector bosons, VVV, were simulated with the generator, using the PDF set. The matrix elements are accurate to NLO for the inclusive process and to LO for up to two additional parton emissions.

Drell–Yan production was simulated with using the PDFs with dedicated parton shower tuning developed by the authors.

Production of \(Z\gamma \) and \(W\gamma \) events was modelled using at NLO accuracy for up to one jet, using the PDFs. They were interfaced with for the UE and PS.

The W+jets process modelling is based on a purely data-driven method. However, MC samples are used to validate the estimation as well as to estimate the sample composition uncertainties. The V+jets processes simulated in MiNLO interfaced with , are used for the validation of the W+jets modelling. The PDF set used in was whereas the PDF set used in the parton shower was . The alternative V+jets samples used to estimate certain systematic uncertainties were produced using .

For the background processes, the AZNLO tune [30] is used for the diboson processes, while the A14 tune [64] is used for other processes.

The full list of Monte Carlo generators involved in the estimates used for this analysis is shown in Table 1.

Table 1 Overview of simulation tools used to generate signal and background processes, as well as to model the UE and PS. The PDF sets are also summarized. Alternative event generators or quantities varied to estimate systematic uncertainties as well as alternative predictions used in the final-results comparison are shown in parentheses

All simulated samples include the effect of pile-up from multiple interactions in the same and neighbouring bunch crossings. This was achieved by overlaying each simulated hard-scatter event with minimum-bias events, simulated using with the A3 tune. The Monte Carlo (MC) events were weighted to reproduce the distribution of the average number of interactions per bunch crossing (\(\left<\mu \right>\)) observed in the data. The \(\left<\mu \right>\) value in data was rescaled by a factor of \(1.03\pm 0.04\) to improve agreement between data and simulation in the visible inelastic proton–proton (\(pp\)) cross-section [93].

All samples were processed through the ATLAS detector simulation [94] based on Geant4  [95], and reconstructed with the standard ATLAS reconstruction software. Data are removed if relevant detector components were not fully operational, or if other data integrity issues were identified, as described in Ref. [96].

4 Event selection

Events are selected using a combination of unprescaled single-lepton triggers and one e\(\mu \) dilepton trigger [97, 98]. During the first year of data taking the single-electron (single-muon) trigger threshold was 24 (20) \(\text {GeV}\), and was increased to 26 \(\text {GeV}\) for the remainder of Run 2. The dilepton trigger had \(p_{\text {T}} \) thresholds of 17 \(\text {GeV}\) for electrons and 14 \(\text {GeV}\) for muons. At least one of the leptons reconstructed offline is required to match the online object that was triggered on. If only the dilepton trigger was used, each reconstructed lepton is required to match the triggered object. The \(p_{\text {T}} \) of the reconstructed leptons that are matched to the trigger are required to be at least 1 \(\text {GeV}\) above the trigger-level threshold.

The analysis utilizes both the objects identified after the full detector reconstruction and objects identified at the particle level in simulation. The particle-level and reconstructed objects are selected in two stages. An overlap removal procedure is applied to the objects resulting from the first selection stage, to ensure that no object is counted twice. After the second selection stage, the objects are used to construct fiducial, signal and control regions.

4.1 Reconstructed objects

Reconstruction-level objects are built from the signals left in the detector by traversing particles. The overlap removal procedure performed on reconstructed objects after the first selection stage is complex, and is fully described in Ref. [6]. For reconstructed leptons, lepton identification is used in the signal selection and a separate lepton identification is used, with some selection criteria reversed, for the estimation of backgrounds with misidentified leptons.

Electrons are reconstructed as clusters of energy deposited in the EM calorimeter and associated tracks in the inner detector. At the first selection stage, electrons are required to have a transverse momentum of \(p_{\text {T}} > 10\) \(\text {GeV}\) and \(|\eta | < 2.47\). The longitudinal impact parameters must satisfy, \(|z_0\sin \theta | < 0.5\) mm, while the significance of the transverse impact parameters must satisfy \(|d_0|/\sigma _{d_0} < 5\) [99]. A likelihood-based electron selection at the ‘Very Loose’ operating point is used [99]. At the second selection stage, electrons are required to have \(p_{\text {T}} > 15\) \(\text {GeV}\), and to be outside the transition region between the barrel and endcaps of the EM calorimeter, \(1.37< |\eta | < 1.52\). Electrons with \(15\,< p_{\text {T}} < 25\,\text {GeV} \) are required to satisfy a ‘Tight’ operating point, while electrons with \(p_{\text {T}} > 25\) \(\text {GeV}\) must satisfy a ‘Medium’ operating point. Lower-energy electrons are required to satisfy a tighter operating point since they are more likely to originate from misconstructed objects. To reduce the number of misidentified electrons, these electrons must be well isolated from other objects, using the strictest isolation criteria as described in Ref. [100], which employ both calorimeter and tracking information. The approximate efficiency of identifying and isolating leading and subleading electrons in the phase space of this analysis is is 81% and 58%, respectively

Muons are reconstructed using tracking information from the inner detector and the muon spectrometer. The muon identification is defined by making two sets of requirements on the number of detector hits, the momentum resolution, the track goodness-of-fit, and other parameters [101]. At the first selection stage, muons with \(p_{\text {T}} > 10\) \(\text {GeV}\) and \(|\eta | <2.7\) are considered if they are identified with the ‘Loose’ quality working point [102]. These muons must satisfy \(|z_0\sin \theta | < 1.5\) mm and \(|d_0|/\sigma _{d_0} < 3\) requirements. At the second selection stage, muon track segments in the inner detector and muon spectrometer are required to match, as defined by the ‘Tight’ working point, which maximizes the muon purity. Additionally, muons selected at the second stage are required to have \(p_{\text {T}} > 15\) \(\text {GeV}\) and pseudorapidity \(|\eta | < 2.5\). These muons must also satisfy the strictest isolation requirement [101], which employ both calorimeter and tracking information and entail that the muons be well isolated from other objects in order to differentiate them from muons produced in background processes containing semi-leptonic decays of heavy-flavour hadrons. Finally, the muons selected at the second stage must satisfy \(|z_0\sin \theta | < 0.5\) mm and \(|d_0|/\sigma _{d_0} < 3\) requirements. The approximate efficiency of identifying and isolating leading and subleading muons in the phase space of this particular analysis is 85% and 72%, respectively.

Jets are reconstructed using the anti-\(k_{t}\) algorithm [103] with a radius parameter \(R= 0.4\), and corrections are applied to calibrate the jets to the particle-level energy scale, as described in Ref. [104]. Jets are then calibrated using the particle-flow algorithm as described in Ref. [105]. At the first selection stage, jets are required to have \(p_{\text {T}} > 20\) \(\text {GeV}\), be located within \(|\eta | < 4.5\), and satisfy the ‘jet vertex tagger’ (JVT) requirements described in Ref. [106]. The JVT requirements are applied to jets in the range \(20< p_{\text {T}} < 60\) \(\text {GeV}\) and \(|\eta | < 2.4\) in order to reduce the number of jets from pile-up. At the second selection stage, jets are required to have \(p_{\text {T}} >30\) \(\text {GeV}\). Jets containing b-hadrons are identified using the DL1r b-tagging algorithm and are required to satisfy the 85% efficiency working point; they must also satisfy \(p_{\text {T}} > 20\) \(\text {GeV}\) and be located within \(|\eta | < 2.5\) [107].

The missing transverse momentum, defined as the negative vector sum of the transverse momenta of objects in the detector, measures the sum of the momenta of the neutrinos in the \(WW^{*} \rightarrow e\nu \mu \nu \) decays. Two types of missing transverse momentum are used in the analysis. The first, denoted by \(E_{\text {T}}^{\text {miss}} \), is described in Ref. [108] and accounts for the inclusion of electrically neutral objects by using jets directly in the calculation of \(E_{\text {T}}^{\text {miss}} \). This \(E_{\text {T}}^{\text {miss}}\) definition is used in the calculation of all sensitive variables because it provides superior resolution for objects used to discriminate signal events from background events. The second is a track-based variable, denoted by \(E_{\text {T}}^{\text {miss, track}} \) [108], which uses tracks matched to jets instead of the calorimeter jet energy and is used to suppress backgrounds from misidentified objects because it has better pile-up robustness than \(E_{\text {T}}^{\text {miss}} \). Both types of missing transverse momentum use a ‘Tight’ working point [108], which is equivalent to the condition that forward jets must satisfy the \(p_{\text {T}} > 30\) \(\text {GeV}\) requirement. The estimation of the lower energy components is based on tracks for both \(E_{\text {T}}^{\text {miss}} \) definitions.

4.2 Signal region selection

The signal region (SR) is defined using reconstructed objects at the second selection stage, and subdivided into categories of events that contain either no jets or one jet. The two categories share a common pre-selection and signal selection, but have partially different background rejection selections applied to reflect the variation in background composition between the jet multiplicity bins.

Table 2 Event selection criteria used to define the signal and fiducial region in the analysis. The \(E_{\text {T}}^{\text {miss}} \) definition described in Sect. 4.3 is used for the selection at particle level. The reconstructed electrons are required to have a pseudorapidity \(|\eta | < 2.47\), excluding the transition region between the barrel and endcaps of the EM calorimeter, \(1.37< |\eta | < 1.52\)

The signal region contains events with exactly two opposite-sign different-flavour leptons, with the leading (subleading) lepton required to satisfy \(p_{\text {T}} > 22\) \(\text {GeV}\) (\(p_{\text {T}} > 15\) \(\text {GeV}\)). The invariant mass of the two leptons, \(m_{\ell \ell } \), is required to be greater than 10 \(\text {GeV}\) in order to remove low-mass meson resonances and Drell–Yan events. To ensure orthogonality with the \(VH\) (WH, \(ZH\)) phase space, events with additional reconstructed leptons with \(p_{\text {T}} \) greater than 15 \(\text {GeV}\) are removed. In order to suppress misreconstructed low-energy events, the \(E_{\text {T}}^{\text {miss, track}}\) is required to be greater than 20 \(\text {GeV}\). Events that contain a b-tagged jet with \(p_{\text {T}} > 20\) \(\text {GeV}\) are removed to suppress backgrounds from top-quark processes. Additionally, a selection requiring a minimum value of 80 \(\text {GeV}\) is applied to the transverse mass (\(m_{\text {T}}\)), defined as:

$$\begin{aligned} m_{\textrm{T}} = \sqrt{ {(E_\text {T}^{\ell \ell } + E_{\textrm{T}}^{\text {miss}})}^2 - {|\vec {p_\text {T}^{\ell \ell }} + \vec {E_{\textrm{T}}^{\text {miss}}}|}^2 } \end{aligned}$$

where \(E_\text {T}^{\ell \ell } = \sqrt{|\vec {p_\text {T}^{\ell \ell }}|^2 + m_{\ell \ell }^2 }\). This requirement suppresses non-WW diboson background, as well as background from misidentified objects.

For the signal region category containing zero jets, a \(p_{\text {T}} \) threshold of 30 \(\text {GeV}\) is used to define the jet-counting criterion, meaning that the region has zero jets with \(p_{\text {T}} >30\) \(\text {GeV}\) . The azimuthal angle between the dilepton system and the missing transverse momentum, \(\Delta \phi _{\ell \ell ,E_{\text {T}}^{\text {miss}}} \), is required to be larger than \(\pi /2\). Since the leptons and \(E_{\text {T}}^{\text {miss}} \) are typically back to back for signal events, this requirement removes potential pathological events in which the \(E_{\text {T}}^{\text {miss}} \) points in the direction of the lepton pair because of misidentified objects. The transverse momentum of the dilepton system, \(p_{\text {T}} ^{\ell \ell } \), is required to be at least 30 \(\text {GeV}\) in order to reject events in which a Z boson decays into leptonically decaying \(\tau \)-leptons.

For the signal region category containing exactly one jet that satisfies \(p_{\text {T}} > 30\) \(\text {GeV}\) , the larger of the two lepton transverse masses, each defined as \(m_{\textrm{T}}^{\ell } = \sqrt{ 2 p_{\text {T}} ^\ell E_{\text {T}}^{\text {miss}} (1 - \cos ({\phi }_\ell - {\phi }_{E_{\text {T}}^{\text {miss}}}))} \), is required to exceed 50 \(\text {GeV}\) . Processes with at least one real W boson typically have a large value of \(m_{\text {T}} ^{\ell }\) for at least one of the two leptons, so a lower bound on this variable suppresses \(Z/\gamma ^{*} \rightarrow \tau \tau \) and multi-jet events. The 1-jet region also requires the di-\(\tau \) candidate invariant mass, \(m_{\tau \tau } \), as computed using the collinear approximation [109], to be less than \(m_Z - 25\) \(\text {GeV}\) in order to remove \(Z/\gamma ^{*} \rightarrow \tau \tau \) background events.Footnote 2

The signal region, divided into 0-jet and 1-jet categories, exploits the lepton kinematics to enhance a signal similar to that predicted by the SM. The azimuthal opening angle between the leptons, \(\Delta \phi _{\ell \ell }\), is required to be less than 1.8 radians because the charged leptons in leptonic \(H{\rightarrow \,}WW^{*}{\rightarrow \,}\ell \nu \ell \nu \) tend to be more collimated than those from non-resonant WW background. This is due to the spin-zero initial state of the resonant process and the \(V-A\) structure of the weak interaction. Both the 0-jet and 1-jet categories also require that the dilepton invariant mass, \(m_{\ell \ell } \), be less than 55 \(\text {GeV}\) . Table 2 summarizes all the requirements imposed on the signal region. The overall acceptance of the signal region, as defined by the number of events in the signal region, divided by the number of events passing the pre-selection, is 28\(\%\).

4.3 ‘Truth’ objects

The particle-level objects are simulated before particles traverse the detector, and can be found in the signal samples after the parton shower generation step. Collinear photons, for which the angular separation between the photon and particle-level lepton is \(\Delta R < 0.1\), are added to the four-momentum of the lepton. The electrons and muons with \(|\eta |<2.7\) and transverse momentum higher than 10 \(\text {GeV}\) , and jets with \(|\eta | < 4.5\) and transverse momentum higher than 20 \(\text {GeV}\) , are selected at the first selection stage.

Table 3 Event selection criteria used to define the control regions in the analysis. The pre-selection requirements were applied in all six control regions. The definitions of \(m_{\tau \tau } \), \(m_{\text {T}} ^{\ell }\), and the jet-counting \(p_{\text {T}} \) thresholds are the same as for the signal regions. The selection criteria that are reversed or different in the CR in order to make it orthogonal to the SR are marked in bold

Simulated particle-level jets are built from all stable particles with \(c\tau > 10\) mm, including neutrinos, photons, and leptons from hadron decays or produced in the shower. However, all decay products from the Higgs boson decay and the leptonic decays of associated vector bosons are removed from the inputs to the jet algorithm. The jets are reconstructed using the anti-\(k_{t}\) algorithm [103] with a radius parameter \(R = 0.4\) implemented in the FastJet package [110]. After this first selection stage, an overlap removal procedure is applied to objects that are near each other and mimic the SR selection. A particle-level electron is rejected if it is in the vicinity of a muon with \(\Delta R(e,\mu ) < 0.1\), or a jet with \(0.2< \Delta R(e,j) < 0.4\). Conversely, a jet is rejected if there is an electron within \(\Delta R(e,j) < 0.2\). A muon is rejected if there is a jet within \(\Delta R(\mu ,j) < 0.4\). The MC ‘truth’-level \(E_{\text {T}}^{\text {miss}} \) arises from neutrinos, and is calculated from preselected objects, as well as jets that are not required to satisfy the pre-selection criteria. At the second selection stage, further selection criteria are applied to the particle-level objects. A higher jet-\(p_{\text {T}} \) requirement of 30 \(\text {GeV}\) is applied, while electrons and muons are required to satisfy \(p_{\text {T}} > 15\) \(\text {GeV}\) and be within \(|\eta | <2.5\). The particle-level objects selected at the second stage are used in the definition of the fiducial regions, constructed to mimic the signal region as closely as possible.

4.4 Fiducial region selection

The fiducial region (FR) selection is defined using particle-level objects, before the detector reconstruction is applied. This selection is designed to be as close as possible to the signal region in order to avoid large extrapolation uncertainties. To obtain the fiducial region, all the selections shown in Table 2 are applied at the particle level. Additionally, the absolute rapidity of the dilepton system, \(|y_{\ell \ell } |\), is required to be less than 2.5, in order to match the constraints due to the coverage of the ATLAS detector. As shown in Table 2, the fiducial phase space at particle level includes a veto on jets containing b-quarks. If a jet is matched with a weakly decaying b-hadron, it is labelled as a b-jet. The overall correction factor, defined as the number of reconstructed signal events in the SR divided by the number of signal events in the fiducial region, is 35%. The number of events that pass into both the reconstructed signal region and fiducial region is 2250, the number of events that pass the reconstructed signal region and do not pass into the fiducial region is 352, the number of events that do not pass into the reconstructed signal region and pass into the fiducial region is 5117, and the number of events that fail to reach either region is 144109.

Table 4 Normalization factors for all background contributions constrained by control regions, as determined in the fit (described in Sect. 7) to the \(p_{\text {T}} ^{H} \) observable

5 Background estimation

The contributions from SM processes that mimic the signal production are estimated using a variety of methods. One method involves defining a normalization factor by comparing data with MC events in regions rich in background, known as control regions (CRs). The normalization factor is a free parameter that is obtained when the fitting procedure (described in Sect. 7) is applied simultaneously to all signal and control regions. The background in the signal regions is then scaled by the normalization factor, ensuring the background modelling is normalized to the real data. The control regions are close to the signal regions, in order to minimize the extrapolation uncertainties, but have one or two selection criteria reversed in order to ensure a high purity of background events and orthogonality with respect to the signal regions. Backgrounds estimated from control regions include those containing two W bosons, which closely resemble the signal and include top-quark production (\(t{\bar{t}}\) and \(Wt\)) and WW production. A control region is also used to estimate the backgrounds from the \(Z/\gamma ^{*} \rightarrow \tau \tau \) process, which resembles the signal when the \(\tau \)-leptons decay leptonically. The selections used to define the control regions are shown in Table 3 for the 0-jet and 1-jet categories, and Table 4 shows the corresponding normalization factors.

The presence of jets originating from b-quarks is used to identify processes containing top quarks. The 0-jet top control region contains a requirement that \(\Delta \phi _{\ell \ell } \,{<}\,2.8\) in order to remain alligned with the signal region, while retaining high statistics. The 1-jet top control region removes events with low-\(p_{\text {T}}\) b-jets (\(N_{b\text {-jet}, (20\,\text {GeV}<p_{\text {T}} <30\,\text {GeV})} \)) in order to ensure that the ratio of \(t{\bar{t}}\) to \(Wt\) is similar to that in the signal region and thus minimize the influence of secondary shape differences. The purity of the 0-jet (1-jet) top control region, defined as the ratio of the top-quark component to the total expectation, is 89% (98%).

In the WW and \(Z/\gamma ^{*} \rightarrow \tau \tau \) control regions the requirement that no b-jets with \(p_{\text {T}} > 20\,\text {GeV} \) be present is applied in order to reduce the top-quark contamination in those regions. The defining feature of regions containing WW background is a selection on the dilepton invariant mass (\(m_{\ell \ell } \)), as shown in Table 3. The 0-jet WW control region has a requirement that \(\Delta \phi _{\ell \ell } \,{<}\,2.8\) in order to keep the region closely alligned to the signal region, while maximizing statistics. The WW control region containing one jet also reduces the \(Z/\gamma ^{*} \rightarrow \tau \tau \) contamination by using the same \(m_{\tau \tau } \) and \(\max (m_{\text {T}} ^{\ell })\) selections that are applied in the signal region. The \(Z/\gamma ^{*} \rightarrow \tau \tau \) control regions are defined by requiring leptons from the \(\tau \) decay to be well separated (\(\Delta \phi _{\ell \ell } \,{>}\,2.8\)), and the invariant mass of the leptons to be low (\(m_{\ell \ell } < 80\,\text {GeV} {}\)) to eliminate contamination from \(Z \rightarrow ee/\mu \mu \) events. In addition, the \(Z/\gamma ^{*} \rightarrow \tau \tau \) regions have no \(E_{\text {T}}^{\text {miss, track}} \) requirement. The purity of the 0-jet (1-jet) WW CR is 67% (34%). The lower purity in the 1-jet region is acceptable because the ratio of WW events to top-quark events in the control region is close to that in the SR. The purity of the \(Z/\gamma ^{*} \rightarrow \tau \tau \) region is 94\(\%\) and 76\(\%\) in the 0-jet and 1-jet regions respectively. Table 4 shows the normalization factors extracted from each control region after the fitting procedure is applied to the \(p_{\text {T}} ^{H} \) distribution.

Fig. 1
figure 1

Distributions of \(m_{\ell \ell } \) in the top control region of the 0-jet channel (left), \(\cos \theta ^* \) in the \(Z/\gamma ^{*} \rightarrow \tau \tau \) control regions of the 0-jet channel (centre), and \(m_{\ell \ell } \) in a loose region including both the WW control region and the signal region in the 1-jet channel (right). The MC backgrounds are normalized by normalization factors obtained from signal plus background fits to the observable that is being plotted. Model uncertainties shown are obtained from the fit result using a reweighting technique

Distributions of \(m_{\ell \ell } \) and \(\cos \theta ^* \) in the top and \(Z/\gamma ^{*} \rightarrow \tau \tau \) control regions of the 0-jet channel are shown in Fig. 1, alongside the distribution of \(m_{\ell \ell } \) in a loose region including both the WW control region and the signal region in the 1-jet channel. The normalization factors obtained from the fitting procedure are applied to distributions in the figure.

Backgrounds from processes such as \(W{+}\text {jets} \) and multi-jet production have large cross sections, and can be mistaken for signal events in the rare cases where a jet is misidentified as an isolated lepton. These backgrounds are estimated using a data-driven method known as the extrapolation-factor method, and are labelled as ‘Mis-Id’ in the figures. This method is described in detail in Ref. [6].

The remaining backgrounds include WZ, \(W\gamma \), ZZ and VVV, have small cross sections, and are estimated from simulation. Finally, Higgs boson production via processes such as vector-boson fusion, and in association with vector bosons or top-quark pairs, are fixed to their SM predictions and treated as background. This is because these additional production mechanisms have a negligible impact on the final results when treated as part of the signal model.

6 Measured yields, uncertainties and distributions

To obtain differential fiducial cross sections, the reconstruction-level signal and background expectations are fitted to the data, and the resulting information is corrected for detector effects to the particle level. A signal region is defined for each bin of each observable, from which the number of signal events \(N_\text {S}\) is extracted from a fit to the data. A combined fit is performed to the \(m_{\text {T}} \) distribution in the range 80 to 160 \(\text {GeV}\) in bins of 10 \(\text {GeV}\). This combined fit includes single-bin WW, top, and \(Z/\gamma ^{*} \rightarrow \tau \tau \) control regions. Normalization factors used to normalize the backgrounds in the signal regions are extracted from the combined fit. The fit is performed for each slice of the measured observables. Normalization factors are compatible across all observables.

The fit is performed using the profile likelihood method (see, e.g., Ref. [111]), where the likelihood is constructed as the product of likelihoods from individual signal and control regions, as well as the product of all bins in those regions. All background estimates, both the data-driven estimate and those obtained from MC simulation, are added to the Poisson expectations. The systematic uncertainties are included as nuisance parameters, constrained by a Gaussian term.

The analysis is affected by theoretical uncertainties in the MC modelling of the signal and background, as well as experimental uncertainties related to the detector measurements. The signal and background expectations depend on nuisance parameters, which correspond to systematic uncertainties. The ggF signal uncertainties are a small contribution since the measurement is made in a fiducial region, making it independent of the type of signal model assumed. Uncertainties in the shape of the ggF signal distribution are minimized since the measurement is made in individual bins of kinematic variables. The uncertainties on the ggF cross-section predictions do not impact the measurements. The only remaining signal uncertainties are the uncertainties that affect the detector level corrections. An example of such an uncertainty is the jet shower shape uncertainty, since changes in how jets are reconstructed can affect truth-level and reconstruction-level quantities differently. These residual uncertainties are taken into account. The uncertainties in non-ggF Higgs production processes are treated in the same way as the uncertainties in the other backgrounds, and cross-section uncertainties are accounted for.

Various uncertainties related to detector measurements are considered. Uncertainties associated with identifying, reconstructing and calibrating objects such as electrons, muons, jets and missing transverse momentum, as well as tagging jets as originating from b-quarks are evaluated. The systematic uncertainty associated with the energy/momentum scale and resolution of objects is estimated by performing a scaling or smearing of the energy/momentum, and observing its effect on the yields and shape of the distribution of selected events in the final state. The systematic uncertainty is evaluated by comparing the \(\pm 1\sigma \) variations with the nominal prediction. The missing transverse momentum is constructed from various objects, as described in Sect. 4.1, and has uncertainties associated with those objects. Uncertainties in the measured efficiencies for electron and muon triggers and pile-up are included as systematic uncertainties. The uncertainty in the combined 2015–2018 integrated luminosity is 1.7% [112], obtained using the LUCID-2 detector [113] for the primary luminosity measurements.

The theoretical uncertainties in the MC modelling take into account QCD scale uncertainties, parton shower (PS) and underlying event (UE) modelling, and parton distribution function (PDF) uncertainties. To calculate the QCD scale uncertainties, the renormalization (\(\mu _\textrm{r}\)) and factorization (\(\mu _\textrm{f}\)) scales are varied in order to estimate the impact of missing higher-order corrections in fixed-order perturbative predictions. To estimate the QCD scale uncertainty for the Higgs boson signal, the Stewart–Tackmann method [114] is followed. For the backgrounds originating from WW bosons, top production, and \(Z/\gamma ^{*} \rightarrow \tau \tau \), the scale uncertainty is evaluated using the envelope of seven-point \(\mu _\textrm{r}\), \(\mu _\textrm{f}\) variations, with \(\mu _\textrm{r}\) and \(\mu _\textrm{f}\) each varying by factors of 0.5 and 2, subject to the constraint that \(0.5 \le \mu _\textrm{r}/\mu _\textrm{f} \le 2.0\), and the maximum variation of the event yields is taken as the uncertainty.

The PDF uncertainties are calculated following the standard PDF4LHC prescriptions  [115].

Fig. 2
figure 2

Pre-fit (left) and post-fit (right) distribution of the transverse mass \(m_{\text {T}} \) of the Higgs boson in the first bin of the transverse momentum \(p_{\text {T}} ^{H} \): 0–30 \(\text {GeV}\) for \(N_{\text {jet}} =0,1\). The last bin in \(m_{\text {T}} \) contains the overflow of events with \(m_{\text {T}} >160\) \(\text {GeV}\)

Fig. 3
figure 3

Pre-fit (left) and post-fit (right) distribution of the transverse mass \(m_{\text {T}} \) of the Higgs boson in the second bin of the transverse momentum \(p_{\text {T}} ^{H} \): 30–60 \(\text {GeV}\) for \(N_{\text {jet}} =0,1\). The last bin in \(m_{\text {T}} \) contains the overflow of events with \(m_{\text {T}} >160\) \(\text {GeV}\)

Fig. 4
figure 4

Pre-fit (left) and post-fit (right) distribution of the transverse mass \(m_{\text {T}} \) of the Higgs boson in the third bin of the transverse momentum \(p_{\text {T}} ^{H} \): 60–120 \(\text {GeV}\) for \(N_{\text {jet}} =0,1\). The last bin in \(m_{\text {T}} \) contains the overflow of events with \(m_{\text {T}} >160\) \(\text {GeV}\)

The uncertainties related to the UE and PS, as well as generator uncertainties, are estimated using alternative generators, and taken into account for the WW, top and \(Z/\gamma ^{*} \rightarrow \tau \tau \) backgrounds. In addition to that, for WW and top backgrounds, the uncertainties associated with matching NLO matrix elements to parton showers are estimated. For the WW background, an uncertainty is estimated to account for the fact that 0–1 additional jets are generated at NLO, while 2–4 additional jets are generated at LO. An additional uncertainty applied to single-top processes is estimated by comparing samples using different diagram removal schemes to account for interference between Wt and top-pair production [61].

A pruning procedure is used in order to improve the stability of the likelihood maximization by neglecting sources of small uncertainties. For this, any systematic uncertainty that induces a yield change of less than 0.1% for a given process in an observable bin is neglected for that bin and that process. Furthermore, for any systematic uncertainty which is very likely to be compatible with a pure normalization effect across \(m_{\text {T}} \) bins (with a p-value \(p>95\%\)), the shape component is neglected.

7 Unfolding

The reconstruction-level measurements are extrapolated to their particle-level quantities using a procedure known as unfolding [116]. This procedure uses Monte Carlo signal samples to define the correlation between reconstruction-level and particle-level quantities. These correlations are described in terms of a response matrix C, such that

$$\begin{aligned} \vec {n}_{\text {reco}} = C \cdot \vec {n}_{\text {particle-level}} + \vec {n}_{\text {out-of-fiducial}}, \end{aligned}$$

where \(\vec {n}_{\text {reco}}\) is the vector of event counts in bins of the reconstruction-level quantity, \(\vec {n}_{\text {particle-level}}\) is the vector of event counts in bins of the particle-level quantity, and \(\vec {n}_{\text {out-of-fiducial}}\) is the vector of signal events that pass the selection at the reconstruction level but are not inside the fiducial region defined at the particle level. In practice, the response matrix is derived from the migration matrix, a two-dimensional distribution of simulated events encoding the correlation between the particle- and reconstruction-level quantities, by normalizing each element to the total number of events in the corresponding particle-level bin. In this way, acceptance, resolution and efficiency effects are encoded in the matrix. Assuming that the response derived from simulation is representative of the detector response in real data, the particle-level distributions in data can be obtained by applying the inverse of the response matrix to the measured data.

The practical application of this procedure faces a challenge related to the lack of regularization: the bin migrations between particle and reconstruction levels induce off-diagonal entries in the response matrix, leading to negative off-diagonal entries in its inverse and ultimately to the amplification of uncertainties.

Two different approaches to alleviate this are presented here. The first approach is an unfolding procedure that uses a Tikhonov-regularized [117] in-likelihood unfolding algorithm. The likelihood used extracts the reconstruction-level signal yields by means of signal strengths \(\mu _{j}\) multiplying the expected event yields \(s^{r,\text {exp}}_{j,m}\) in \(m_{\textrm{T}}\) bin m and observable bin j, and is of the form

Fig. 5
figure 5

Pre-fit (left) and post-fit (right) distribution of the transverse mass \(m_{\text {T}} \) of the Higgs boson in the last bin of the transverse momentum \(p_{\text {T}} ^{H} \): 120–1000 \(\text {GeV}\) for \(N_{\text {jet}} =0,1\). The last bin in \(m_{\text {T}} \) contains the overflow of events with \(m_{\text {T}} >160\) \(\text {GeV}\)

$$\begin{aligned} \begin{aligned} \mathcal {L}(\vec {\mu }, \vec {\theta })&= \displaystyle \prod _{j=1}^{N_{\text {bin}}}\prod _{m=1}^{N_{m_{\textrm{T}}}} P\left( N_{j}| \mu _{j}s_{j,m}^{r,\text {exp}}(\vec {\theta }) + \displaystyle \sum _{n=1}^{N_{\text {bg}}}\mu ^{b}_{n}b_{j,m,n}(\vec {\theta })\right) \\&\qquad \times \left\{ \displaystyle \prod _{c=1}^{N_{\text {CR}}} P(N_{c}|s_{c}^{r,\text {exp}}(\vec {\theta }) + \displaystyle \sum _{n=1}^{N_{\text {bg}}}\mu ^{b}_{n}b_{c,n}(\vec {\theta }))\right\} \times \displaystyle \prod _{k=1}^{N_{\theta }}N(\tilde{\theta }_{k}|\theta _{k}). \end{aligned} \end{aligned}$$

In the Poisson terms P, the quantities \(N_j\) and \(N_{c}\) are the numbers of observed events in bin j of the signal region, and in control region c, respectively; \(s_{c}^{r,\text {exp}}\) is the expected event count in CR c; and \(b_{j,m,n}\) and \(b_{c,n}\) are the expected event counts in observable bin j and \(m_{\textrm{T}}\) bin m of the SR and in CR c, respectively, of the n-th background process. Finally, \(\mu ^{b}_{n}\) is the normalization factor for the n-th background process (and is fixed to 1 for backgrounds not estimated using CRs). The \(N_{\theta }\) nuisance parameters in the Gaussian terms N are represented by \(\vec {\theta }\). The SR is further subdivided into \(e\mu \) and \(\mu e\), depending on which of the two leptons has the higher \(p_{\text {T}} \), but this is not shown explicitly in the above expression. Also, when not considering the \(N_{\text {jet}}\) observable, the \(N_\text {jet}\,{=}\,0\) and \(N_\text {jet}\,{=}\,1\) SRs are treated separately in the fit; this further subdivision is not displayed explicitly either.

The signal \(s^r_j = \mu _{j}s^{r,\text {exp}}_{j}\) of the reconstruction-level distribution in the j-th observable bin can be rewritten as

$$\begin{aligned} s^r_j&= \sum _{i} s^t_i \, C_{ij} + f_j \end{aligned}$$

where \(s_i^t\) is the signal yield in the particle-level bin i, \(f_j\) is the number of reconstructed events that are not in the fiducial region

$$\begin{aligned} f_j = s^{r,\text {exp}}_j - \sum _i M_{ij} \end{aligned}$$

and \(C_{ij}\) is the response matrix

$$\begin{aligned} C_{ij} = M_{ij} / s^{t,\text {exp}}_i \end{aligned}$$

given by the migration matrix \(M_{ij}\) and the predicted particle-level yield \(s^{t,\text {exp}}_i\). The quantities \(s^{r,\text {exp}}\), \(s^{t,\text {exp}}\), and M are obtained from Monte Carlo predictions, with full dependencies on nuisance parameters from theory and experimental sources, i.e. \(s^{r,\text {exp}}=s^{r,\text {exp}}(\vec \theta )\), \(s^{t,\text {exp}}=s^{t,\text {exp}}(\vec \theta )\), and \(M=M(\vec \theta )\), and these dependencies are fully maintained throughout the likelihood maximization.

A Tikhonov regularization term is included in the likelihood as a penalty term, taking the form

$$\begin{aligned} P(\vec {x}) = \exp \left( -\tau \cdot \left( \sum _{i=2}^{N_{\text {bin}}-1} \left( \left( x_{i-1} - x_i\right) - \left( x_{i} - x_{i+1}\right) \right) ^2 \right) \right) \end{aligned}$$

with x being the quantity for which the curvature should be regularized. For this analysis, the measured particle-level signal strength, \( x_i = s_i^{t}/s_i^{t,\text {exp}}\), is chosen as the regularized quantity (see also, e.g., Ref. [118]). This type of regularization is chosen for all single-differential distributions in this analysis. For the double-differential cross-section measurements, independent values of \(\tau \) are chosen for each of the two jet multiplicity bins. No regularization across jet bins is performed, as two bins are not enough to define a curvature.

The choice of a value for the regularization parameter \(\tau \) is a trade-off between minimizing statistical fluctuations on the one hand, and the potential bias induced by adding an artificial constraint to the measurement on the other. This compromise is reached by first constructing distributions with the number of signal plus background events in each particle-level bin, fluctuated randomly by its respective Poisson uncertainty. The ‘toy’ distributions are then unfolded using the original response matrix in order to obtain the unfolded particle-level distribution. A bias is then calculated by subtracting the original particle-level events from the unfolded particle-level events for 1000 toy simulations for each value of the regularization parameter \(\tau \). The average of the bias for all these toy simulations is compared with the statistical uncertainty for each value of \(\tau \) for each SR bin, and a value for \(\tau \) is chosen so as to attain the lowest bias relative to uncertainty for most of the bins. Values of \(\tau \) near 1 were investigated since previous studies using this technique showed this range to be optimal. The final \(\tau \) values range from 0.25 to 1.5. For the example of the \(p_{\text {T}} ^{H} \) variable, a \(\tau \) value of 0.25 was chosen, and corresponds to values of the maximum bias divided by the uncertainty of 0.06, 0.01, 0.01, and 0.02 for the lowest to highest bins, respectively.

Fig. 6
figure 6

Measured differential fiducial cross section for \(p_{\text {T}} ^{H}\) in the 0+1-jet fiducial region using the regularized in-likelihood unfolding method. Uncertainty bars on the data points include statistical and systematic uncertainties from experimental and theory sources as well as background normalization effects and shape effects from background and signal. Uncertainty bands on the predictions shown are dominated by normalization effects on the signal arising from showering, PDF models, \(\alpha _{\text {s}}\) and the QCD scale. The legend includes p-values quantifying the level of agreement between the data and the predictions, including all sources of uncertainty. The systematic uncertainties of the data are shown separately

Fig. 7
figure 7

Measured differential fiducial cross section for \(m_{\ell \ell }\) (left) and \(y_{\ell \ell }\) (right) in the 0+1-jet fiducial region using the regularized in-likelihood unfolding method. Uncertainty bars on the data points include statistical and systematic uncertainties from experimental and theory sources as well as background normalization effects and shape effects from background and signal. Uncertainty bars on the predictions shown are dominated by normalization effects on the signal arising from showering, PDF models, \(\alpha _{\text {s}}\) and the QCD scale. The legend includes p-values quantifying the level of agreement between the data and the predictions, including all sources of uncertainty. The systematic uncertainties of the data are shown separately

Fig. 8
figure 8

Measured differential fiducial cross section for \(p_{\text {T}} ^{\ell \ell }\) (left) and \(\Delta \phi _{\ell \ell }\) (right) in the 0+1-jet fiducial region using the regularized in-likelihood unfolding method. Uncertainty bars on the data points include statistical and systematic uncertainties from experimental and theory sources as well as background normalization effects and shape effects from background and signal. Uncertainty bands on the predictions shown are dominated by normalization effects on the signal arising from showering, PDF models, \(\alpha _{\text {s}}\) and the QCD scale. The legend includes p-values quantifying the level of agreement between the data and the predictions, including all sources of uncertainty. The systematic uncertainties of the data are shown separately

Fig. 9
figure 9

Measured differential fiducial cross section for \(p_{\text {T}} ^{\ell 0}\) (left) and \(\cos \theta ^*\) (right) in the 0+1-jet fiducial region using the regularized in-likelihood unfolding method. Uncertainty bars on the data points include statistical and systematic uncertainties from experimental and theory sources as well as background normalization effects and shape effects from background and signal. Uncertainty bands on the predictions shown are dominated by normalization effects on the signal arising from showering, PDF models, \(\alpha _{\text {s}}\) and the QCD scale. The legend includes p-values quantifying the level of agreement between the data and the predictions, including all sources of uncertainty. The systematic uncertainties of the data are shown separately

Fig. 10
figure 10

Measured differential fiducial cross section for \(|y_{j0}|\) in the 1-jet fiducial region using the regularized in-likelihood unfolding method. Uncertainty bars on the data points include statistical and systematic uncertainties from experimental and theory sources as well as background normalization effects and shape effects from background and signal. Uncertainty bands on the predictions shown are dominated by normalization effects on the signal arising from showering, PDF models, \(\alpha _{\text {s}}\) and the QCD scale. The legend includes p-values quantifying the level of agreement between the data and the predictions, including all sources of uncertainty. The systematic uncertainties of the data are shown separately

A second unfolding procedure was performed as a cross-check and yielded similar results. This procedure extracts the reconstructed differential cross sections from the same statistical model, but then performs an iterative Bayesian unfolding (IBU), which uses the number of iterations as the regularization parameter. Further details of this method and the results obtained using it are provided in the Appendix.

The statistical and systematic uncertainties are propagated to the final result consistently, including their effects on the detector response. The entries of the response matrix are subject to correlated uncertainties among the matrix elements, which are incorporated into the statistical model for the in-likelihood procedure. For the iterative Bayesian unfolding procedure, the post-fit values of all nuisance parameters are propagated to the detector response, and the uncertainties are included in the bin-by-bin covariances and checked for coverage using toy distributions. Cross-section normalization uncertainties from theory that affect the signal are excluded from the measurement and instead are included in the predictions for comparison, while effects on the shape are fully propagated alongside the experimental uncertainties.

Several tests were performed to assess the robustness of the unfolding procedure. The purpose of the tests is to determine whether the unfolding procedure is biased by variations in the input samples that account for different spectral shapes, different Monte Carlo generators, new-physics contributions and systematic correlations. For the first tests, fluctuated MC truth-level distributions were created by fluctuating individual observable bins within their respective Poisson uncertainties. The fluctuated distributions are folded, and then unfolded, and the resulting unfolded distributions are found to closely match the original fluctuated truth-level distribution, showing that no biases were introduced. The same migration matrix, derived from SM ggF simulation, is used to fold and unfold the distributions. Since the migration matrix is only composed of Monte Carlo signal events, biases could be introduced by correlated nuisance parameters that only affect the signal. Thus, for the second tests, toy datasets generated with random values of the nuisance parameters demonstrated that no such biases are introduced in the construction of the migration matrix.

8 Results

The results of the measurement are illustrated in the figures in this section. The \(m_{\text {T}}\) distributions are obtained for all bins of each observable: \(|y_{j0}|\), \(p_{\text {T}} ^{H} \), \(p_{\text {T}} ^{\ell 0}\), \(p_{\text {T}} ^{\ell \ell }\), \(m_{\ell \ell }\), \(y_{\ell \ell }\), \(\Delta \phi _{\ell \ell }\), and \(\cos \theta ^*\). Figures 2, 3, 4 and 5 show examples of the pre- and post-fit \(m_{\text {T}}\) distributions at the reconstruction level for bins of the \(p_{\text {T}} ^{H} \) observable, which are inputs to the unfolding procedure. The number of bins and bin edges were chosen in order to maximize the Signal/\(\sqrt{\text {Background}}\) significance measure, while minimizing the bin migrations and statistical errors. In the distributions obtained after the fitting procedure is performed, all systematic uncertainties are included, and the background is normalised using the normalisation factors extracted from the control regions. The Higgs boson transverse momentum, \(p_{\text {T}} ^{H} \), is defined as the \(p_{\text {T}} \) of the combined two-lepton system and missing transverse momentum at the reconstruction and particle levels, and is binned in \(p_{\text {T}} \) bins in the ranges 0–30 \(\text {GeV}\), 30–60 \(\text {GeV}\), 60–120 \(\text {GeV}\) and 120–1000 \(\text {GeV}\). The ggF Higgs production predictions are derived from Monte Carlo simulated samples normalized to the best SM predicted cross sections from Table 1. These figures show that the data agree with the Standard Model expectation for the Higgs boson signal and backgrounds within the uncertainties. After the fitting procedure is performed, there is a decrease in uncertainties due to the additional information that results from the comparison to data in both the signal and control regions. The fit induces correlations between the nuisance parameters, which reduce the total uncertainties in the post-fit distributions via the correlation matrix.

Fig. 11
figure 11

Measured differential fiducial cross section for \(m_{\ell \ell }\) (left) and \(\Delta \phi _{\ell \ell }\) (right) versus \(N_{\text {jet}}\) in the 0-jet and 1-jet fiducial regions using the regularized in-likelihood unfolding method. Uncertainty bars on the data points include statistical and systematic uncertainties from experimental and theory sources as well as background normalization effects and shape effects from background and signal. Uncertainty bands on the predictions shown are dominated by normalization effects on the signal arising from showering, PDF models, \(\alpha _{\text {s}}\) and the QCD scale. The legend includes p-values quantifying the level of agreement between the data and the predictions, including all sources of uncertainty. The systematic uncertainties of the data are shown separately

Fig. 12
figure 12

Measured differential fiducial cross section for \(y_{\ell \ell }\) (left) and \(p_{\text {T}} ^{\ell \ell }\) (right) versus \(N_{\text {jet}}\) in the 0-jet and 1-jet fiducial regions using the regularized in-likelihood unfolding method. Uncertainty bars on the data points include statistical and systematic uncertainties from experimental and theory sources as well as background normalization effects and shape effects from background and signal. Uncertainty bands on the predictions shown are dominated by normalization effects on the signal arising from showering, PDF models, \(\alpha _{\text {s}}\) and the QCD scale. The legend includes p-values quantifying the level of agreement between the data and the predictions, including all sources of uncertainty. The systematic uncertainties of the data are shown separately

Fig. 13
figure 13

Measured differential fiducial cross section for \(p_{\text {T}} ^{\ell 0}\) (left) and \(\cos \theta ^*\) (right) versus \(N_{\text {jet}}\) in the 0-jet and 1-jet fiducial regions using the regularized in-likelihood unfolding method. Uncertainty bars on the data points include statistical and systematic uncertainties from experimental and theory sources as well as background normalization effects and shape effects from background and signal. Uncertainty bands on the predictions shown are dominated by normalization effects on the signal arising from showering, PDF models, \(\alpha _{\text {s}}\) and the QCD scale. The legend includes p-values quantifying the level of agreement between the data and the predictions, including all sources of uncertainty. The systematic uncertainties of the data are shown separately

Fig. 14
figure 14

Post-fit correlations of the observed cross sections for the bins of transverse momentum \(p_{\text {T}} ^{H}\). The legend on the right shows numbers in percentages

The final differential fiducial cross sections for each observable, obtained using the regularized in-likelihood unfolding method are shown in Figs. 6, 7, 8, 9, 10 for the single-differential distributions, and Figs. 11, 12, 13 for the double-differential distributions. Figure 14 shows the correlations of the unfolded cross sections in the signal regions in the different \(p_{\text {T}} ^{H} \) intervals. The numbers in the legend on the right are shown in percentages.

The compatibility of the data and each set of predictions is characterized by the p-values, shown in the respective figure legends. These p-values are computed from the fit of the respective predictions, including uncertainties in normalization and shape, to the observed data. The total cross sections (sum over all bins) are compatible with each other for all observables. The data and predictions agree very well, as demonstrated by the high p-values for all of the plots. No significant differences are observed between the measured cross sections and their Standard Model Monte Carlo predictions.

The uncertainties with the largest impact on the results include uncertainties related to jet and muon reconstruction. Theory uncertainties associated with the top-quark and WW backgrounds, and with the difficulty of modelling \(V\gamma \) processes also play a leading role. For the prediction, the uncertainties were evaluated in several different regions of phase space and summed in quadrature as described in Ref. [6], whereas for the sample, they could only be evaluated inclusvely for each bin, resulting in a slightly less precise evaluation. Other leading uncertainties include ones affecting the data-driven background estimates for misidentified objects, as well as uncertainties related to normalizing the backgrounds from control regions. The uncertainties are listed in Table 5, which shows ranges of values that correspond the different bins of the measured observables. Table 6 shows a detailed breakdown of the uncertainties for the \(p_{\text {T}} ^{H} \) variable obtained after the final fitting procedure is performed.

The integrated fiducial cross section is obtained using the procedure described above but without binning in any observable (and consequently without any regularization). It is measured to be \(56.0^{+10.0}_{-9.5}\) fb.

Table 5 The statistical uncertainties of the data and Monte Carlo predictions, as well as experimental and theoretical uncertainties of the measured cross sections for each kinematic variable. The uncertainties vary depending on the \(m_{\text {T}}\) bin in which the cross sections are measured, as demonstrated by the range shown for each entry. The dominant sources of systematic uncertainties are discussed in the text
Table 6 Relative cross section uncertainties broken down for each bin in \(p_{\text {T}} ^{H}\)

9 Conclusion

Measuring the Higgs boson’s differential production cross section is an important aspect of measuring Higgs properties and further testing the Standard Model. This analysis has measured single- and double-differential cross sections for Higgs boson production via gluon–gluon fusion and decay into \(WW^{*} \rightarrow e\nu \mu \nu \) in bins of the final-state transverse mass, \(m_{\text {T}}\) . The measurement was performed using the full LHC Run 2 dataset of 13 \(\text {TeV}\) proton–proton collisions collected with the ATLAS detector during 2015–2018, corresponding to an integrated luminosity of 139 \({\text {fb}}^{-1}\). The resulting \(m_{\text {T}}\) distributions and measurements of the differential cross sections in fiducial regions are presented for the \(|y_{j0}|\), \(p_{\text {T}} ^{H} \), \(p_{\text {T}} ^{\ell 0}\), \(p_{\text {T}} ^{\ell \ell }\), \(m_{\ell \ell }\), \(y_{\ell \ell }\), \(\Delta \phi _{\ell \ell }\), and \(\cos \theta ^*\) observables. Likelihood unfolding with Tikhonov regularisation is used to transform the reconstruction-level quantities to their particle-level distributions in each of these eight observables, and in each of the last six of them versus jet multiplicity. Performing the measurement at the particle level facilitates direct comparison with theoretical predictions and minimizes the impact of the signal uncertainties on the final results. The results agree extremely well with Standard Model expectations, derived using the , and Monte Carlo generators. The leading uncertainties are related to jet and muon reconstruction, theoretical modelling of the WW and \(V\gamma \) backgrounds, and data-driven background estimates for misidentified objects. The results improve upon those previously obtained by ATLAS, mainly by using more data and analysing a larger suite of observables. In addition, these results were obtained by using a fitting procedure, unlike the previous version of the analysis, which relied on a simple subtraction of the expected background event yields from the observed event yields.