The gauge symmetry of the standard model (SM) fixes the triple gauge boson couplings that determine the self-interactions of W and Z bosons. The pair production of vector gauge bosons allows a direct test of the electroweak sector of the SM [1]. Observation of anomalous triple gauge boson couplings would be an indication of physics beyond the SM.

We report the first measurement of WW+WZ diboson production in pp collisions in the semileptonic final state at the Large Hadron Collider (LHC), where one W boson decays leptonically (ℓν, with =e,μ) while the other boson (W or Z) decays hadronically (jj), giving rise to two energetic jets in the final state. Previous measurements in this channel at the Tevatron \(\mathrm{p}\bar{\rm p}\) collider include the recent CDF [2] and D0 [3, 4] results. The advantage of reconstructing WW+WZ in the ℓνjj decay mode over the purely leptonic final states [58] is the larger branching fraction of W and Z bosons to quarks. This advantage is partially offset by the larger backgrounds in the ℓνjj channel, coming mainly from W+jets production. In contrast to the fully leptonic decay of WW pairs, the semileptonic process permits a direct measurement of the boson transverse momentum (p T). The sensitivity of WW production to the WWγ coupling and of WW and WZ production at high boson transverse momentum to the WWZ coupling makes these processes particularly useful as a probe of anomalous triple gauge boson couplings.

The data correspond to an integrated luminosity of 5.0±0.1 fb−1 collected in 2010 and 2011 with the Compact Muon Solenoid (CMS) detector in pp collisions at \(\sqrt{s} = 7~\mbox{TeV}\) at the CERN LHC. The CMS experiment [9] uses a right-handed coordinate system, with the origin at the nominal interaction point, the x axis pointing to the center of the LHC ring, the y axis pointing up, perpendicular to the plane of the LHC ring, and the z axis along the counterclockwise beam direction. The polar angle θ is measured from the positive z axis and the azimuthal angle ϕ is measured in the xy plane. The pseudorapidity is defined as η=−ln[tan(θ/2)]. The central feature of the CMS apparatus is a superconducting solenoid of 6 m internal diameter, providing a magnetic field of 3.8 T. Within the field volume are silicon pixel and strip trackers and several calorimeters. The tracking system covers the range |η|<2.5 and provides a track momentum resolution of 1 % at 100 GeV. The lead tungstate crystal electromagnetic calorimeter (ECAL) covers |η|<3 with an energy resolution of about \(3~\%/\sqrt{E}\), where E is in GeV [10]. The brass/scintillator hadron calorimeter (HCAL) covers |η|<3.0 with an energy resolution of \(100~\%/\sqrt{E}\). The muon system consists of gas-ionization detectors inside and around the steel return yoke, and is capable of reconstructing and identifying muons within |η|<2.4. Extensive forward calorimetry complements the coverage provided by the barrel and endcap detectors. The CMS detector is nearly hermetic, allowing for measurements of the missing transverse energy (\({E_{\mathrm{T}}^{\text{miss}}}\)) in the event. A two-tier trigger system selects the events of interest.

The data were collected with a suite of single-lepton triggers mostly using p T thresholds of 24 GeV for muons and 25–32 GeV for electrons. To preferentially select events with on-shell W bosons, the single-electron triggers also require minimum thresholds on \({E_{\mathrm{T}}^{\text{miss}}}\) in the range 0–25 GeV and on the transverse mass m T of the electron plus \({E_{\mathrm{T}}^{\text{miss}}}\) system in the range 0–50 GeV. The overall trigger efficiency is about 94 % (90 %) for muon (electron) data, with a small dependence (a few percent) on p T and η. Simulated events are corrected for the trigger efficiency as a function of lepton p T and η, and in the case of electrons also as a function of \({E_{\mathrm{T}}^{\text{miss}}}\). Simulated events are used to develop and validate the methods used in the analysis.

The MadGraph5 1.3.30 [11] event generator produces parton-level events with a W boson and up to four partons on the basis of matrix-element (ME) calculations. The ME–parton shower (ME–PS) matching scale μ is taken to be 20 GeV [12], and the factorization and renormalization scales are both set to \(q^{2} = M_{ \mathrm {W} }^{2} + p_{\mathrm{T}, \mathrm {W} }^{2}\). Samples of \(\mathrm {t}\overline {\mathrm {t}}\) and Drell–Yan events are also generated with MadGraph. Single-top production is modeled with powheg 1.0 [13]. Multijet and diboson samples (W W, W Z, Z Z) are generated with pythia 6.422 [14]. pythia provides the parton shower simulation in all cases, with parameters of the underlying event set to the Z2 tune [15]. The set of parton distribution functions used is cteq6ll [16]. A Geant4-based simulation [17] of the CMS detector is used in the production of all Monte Carlo (MC) samples. Multiple proton–proton interactions within a bunch crossing (pileup) are simulated, and the triggers are emulated. All simulated events are reconstructed and analyzed as measured collision events.

Events are selected with one well-identified and isolated lepton (muon or electron), large missing transverse energy, and exactly two high-p T jets. Muons are reconstructed within |η|<2.1 with the inner tracker and the muon system [18]. Electrons are reconstructed from tracks in the tracker pointing to energy clusters in the ECAL, within |η|<2.5, excluding the transition region between the barrel and endcap, 1.44<|η|<1.57 [19]. Muons (electrons) are required to have p T greater than 25 GeV (35 GeV). The lepton candidates are required to be consistent with having originated from the primary vertex of the event, which is chosen to be the vertex with the highest ∑p T 2 of its associated tracks. According to the simulation, this requirement provides the correct assignment for the primary vertex in more than 99 % of the cases in both signal and background events. Charged leptons from W boson decays are expected to be isolated from other activity in the event. The sum of transverse momentum or energy in the tracker, ECAL, and HCAL, within a surrounding cone of \(\varDelta R \equiv \sqrt{(\varDelta \eta)^{2}+(\varDelta \phi)^{2}} <0.3\), excluding the lepton itself, is required to be less than 10 % of the measured p T of the muon, or less than 5 % of the measured p T of the electron. Here Δη and Δϕ are the differences in pseudorapidity and in azimuthal angle, respectively. To reduce the backgrounds from fully leptonic decays, such as Drell–Yan and electroweak diboson processes, we exclude events in which there is any other loosely identified lepton (with p T>10 GeV for muons and p T>20 GeV for electrons) in the event.

Jets are reconstructed from calorimeter and tracker information using a particle-flow technique that combines information from several subdetectors [20]. The anti-k T clustering algorithm [21, 22] with a distance parameter of 0.5 is used. Jets that overlap with isolated leptons within ΔR=0.3 are not considered. Jet-energy corrections are applied to account for the nonlinear energy response of the calorimeters and for other instrumental effects [23]. These corrections are based on in situ measurements using dijet, γ+jet, and Z+jet data samples [24]. Pileup collisions and the underlying event add to the energy of the reconstructed jets. The median energy density from pileup is evaluated in each event and the corresponding energy is subtracted from each jet [25]. In addition, charged tracks that do not originate from the primary vertex are not considered for jet clustering [26]. We verified that these procedures successfully remove the dependence of jet response on the number of interactions in a single event. A jet-quality requirement, primarily based on the energy balance between charged and neutral hadrons in a jet, is applied to remove poorly reconstructed jets. Only events containing exactly two jets with p T>35 GeV and within |η|<2.4 are selected for the analysis. To reduce contamination from \(\mathrm {t}\overline {\mathrm {t}}\) background, events are discarded if one or more jets pass high-efficiency b-quark jet identification criteria based on the presence of a secondary vertex within the jet [27]. An accurate \({E_{\mathrm{T}}^{\text{miss}}}\) measurement is essential to distinguish the W signal from multijet backgrounds and to reconstruct the full event kinematics of the WW system. We use \({E_{\mathrm{T}}^{\text{miss}}}\) measured in the event using the full particle-flow reconstruction [28] and require \({E_{\mathrm{T}}^{\text{miss}}}>25\) (30) GeV for the muon (electron) channel. To reduce the background from processes that do not contain W→ℓν decays, we require that the transverse mass of the W candidate exceed 30 GeV (50 GeV) in muon (electron) data [29].

We measure the dijet mass (m jj ) distribution, as shown in Fig. 1(a). The relative contributions of the known SM processes are determined using an unbinned maximum-likelihood fit over the mass range 40–150 GeV. The fit is performed separately for the muon and electron channels since their background compositions differ. Table 1 lists the SM processes included in the fit. The normalization of the diboson WW+WZ contribution is a free parameter. The normalizations of the background components are allowed to vary within Gaussian constraints around their central values. For multijet events, this central value is obtained from an independent two-component fit to the \({E_{\mathrm{T}}^{\text{miss}}}\) distribution which determines the corresponding fraction in the data [29]. The fit uncertainty is used as a constraint on the multijet contribution. The central values for all other processes are obtained from next-to-leading-order (NLO) or higher-order calculations, and the constraints are taken from the theoretical uncertainties listed in Table 1. With the exception of multijet production, the shape of the m jj distribution for all processes is obtained from simulation. Multijet events contribute to the total background when jets are misidentified as isolated leptons. Their m jj shape can be derived from data events with lepton candidates that fail the isolation requirements. The fluctuations in the shapes and yields of subleading backgrounds have a minor impact on the overall fit.

Fig. 1
figure 1

(a) Distribution of the dijet invariant mass in data, with the binning chosen based on the resolution and fit projections of the relevant components overlaid. (b) The dijet invariant mass after subtraction of all components except the electroweak WW+WZ processes. The error bars represent the statistical uncertainties and the hatched bands represent the systematic uncertainties. (c) The normalized residual or pull: (data−fit)/(fit uncertainty) (Color figure online)

Table 1 Treatment of background m jj shapes and normalizations in a fit to the data. The cross section values are calculated with the programs cited on the corresponding rows. The background normalizations are constrained to Gaussian distributions with the listed central values and widths. The treatment of multijet events is described in the text

The m jj spectrum of the dominant W+jets component is described using the shape from MadGraph simulation after taking into account the uncertainties due to the factorization and renormalization scale (both equal to q) and ME–PS matching scale μ [36]:

(1)

where \(\mathcal{F}_{\text{W+jets}}\) denotes the m jj shape from simulation. The parameters μ 0 (μ′) and q 0 (q′) correspond to the default (alternative) values of μ and q, respectively. The parameters α and β are free to vary during the fit and remain within the physical ranges (0≤α,β≤1 and 1−αβ≥0). We take μ′=2μ 0 or 0.5μ 0 (q′=2q 0 or 0.5q 0), depending on which alternative sample provides a better fit to the data. Thus, the fit probes variations of a factor of two in both μ and q (with the corresponding shape fluctuations accounted for when setting exclusion limits).

Figure 1(a) shows the observed m jj distribution for both channels combined, together with the fitted projections of the contribution of various SM processes. Figure 1(b) shows the same distribution after subtracting all SM contributions from data except for WW+WZ events. Figure 1(c) shows the pull distribution, i.e. the normalized residual defined as (data−fit)/(fit uncertainty), where the fit uncertainty is computed at each data point by propagating the uncertainty in the normalization coefficients. The yields of various SM components, as determined by the fit, are reported in Table 2.

Table 2 Event yields determined from a maximum-likelihood fit to the data. The total uncertainty is computed using the full covariance matrix. Owing to a higher kinematic threshold the product of acceptance × efficiency is smaller for the electron channel. The term \(\mathcal{A}\varepsilon\) includes W and Z branching fractions [37]

In order to ensure robustness against fit parameters and to account for corresponding biases we validate the fit procedure by performing pseudo-experiments. In each experiment, we generate the m jj pseudo-data for the SM processes, taking into account the correlations between the yields, and then perform a fit to each pseudo-data m jj distribution. The results for both the muon and electron channels indicate that there is a small bias (−8.6 % and −6.6 %) in the WW+WZ yield, corresponding to less than 0.4 standard deviations, and that the fit slightly overestimates the uncertainty on the yield. These effects are corrected for in the final result. The validation procedure shows that biases in all background yields and errors are small. The fit results for the background components are statistically consistent with the expectations, with the exception of W+jets, where 11 % fewer events for muons and 15 % fewer events for electrons, compared to the expectation, are observed. Overall, the approach produces a high quality model of the data (Fig. 1(a)), where the pull distribution is consistent with 0 (Fig. 1(c)), and allows us to extract the diboson peak (Fig. 1(b)).

Systematic uncertainties arising from the jet energy are estimated from W bosons decaying hadronically in a sample of semileptonic \(\mathrm {t}\overline {\mathrm {t}}\) events. The mean and resolution of the reconstructed dijet mass distribution in data agree to within 0.6 % of the expectations from simulation (this discrepancy is accounted for as an explicit systematic uncertainty), with negligible effect on acceptance. A small difference in \({E_{\mathrm{T}}^{\text{miss}}}\) resolution [28] between data and simulation affects the signal acceptance at the 0.5 % level. Further systematic uncertainties on the signal yield are due to the uncertainty on the trigger efficiency in data (1 %), and on the lepton reconstruction and selection efficiencies (2 %) [29]. The uncertainty due to the b-jet veto is negligible. The uncertainty in the luminosity measurement is 2.2 % [38]. The uncertainty in acceptance arising from theoretical uncertainties (evaluated using MadGraph and mcfm samples), including parton distribution functions and additional jet rejection, is 4 %.

As listed in Table 2, we observe 2700±340 (stat.)±360 (syst.) WW+WZ events, in agreement with the SM expectation. This result corresponds to a significance of 8.8 standard deviations when computed using a simple likelihood ratio [39], where the background yield uncertainties (Table 2) and errors on α, β (Eq. (1)) are fixed to their fitted values. Using the profile likelihood ratio [39], where these parameters are allowed to vary, the significance becomes 4.3 standard deviations. We compute the WW+WZ total cross section as \(\sigma = N_{\text{Sig}}/ (\mathcal{A} \, \varepsilon \, {\mathcal{L}})\), where N Sig is the number of extracted signal events, \(\mathcal{A}\) is the signal acceptance corrected for the branching fractions, ε is the overall efficiency for event selection, and \({\mathcal{L}}\) is the integrated luminosity. In the acceptance calculation we assume the SM value for the WW to WZ production ratio. The values of N Sig and \(\mathcal{A}\varepsilon\) are given in Table 2 separately for the muon and electron channels. Combining the results from the muon and electron channels, we obtain σ(pp→WW+WZ)=68.9±8.7 (stat.)±9.7 (syst.)±1.5 (lum.) pb, which is in agreement with the NLO prediction [40], 65.6±2.2 pb, that includes the contribution from gg→WW. The total cross section computed in the muon channel, 73.8±15.1 pb, is consistent with that obtained in the electron channel, 60.8±21.5 pb, where the statistical and systematic uncertainties have been combined in quadrature.

Measurements of electroweak diboson production can be translated into measurements of gauge boson self-couplings, which are among the most fundamental aspects of the SM. At the leading order, only s-channel \(\mathrm{q}\overline{\mathrm{q}}\) annihilation diagrams have a three-boson vertex involving WWγ and WWZ couplings in WW production, and WWZ coupling in WZ production. Physics beyond the SM can modify these couplings, leading to observable differences in the cross section and the kinematic distributions [41]. We search for anomalous triple gauge couplings (aTGCs) using an effective Lagrangian described by the following HISZ (Hagiwara, Ishihara, Szalapski, Zeppenfeld) parametrization without form factors [1]: λ γ =λ Z =λ, \(\varDelta {\kappa_{Z}} = \varDelta {g_{1}^{Z}}-\varDelta {\kappa_{\gamma}} \cdot \tan^{2}\theta_{\mathrm{W}}\). We use the dijet p T distribution (with most of the discriminating power coming from the last bin), shown in Fig. 2, as the observable after requiring 75 GeV<m jj <95 GeV to enhance signal purity. The dependence of the distribution on specific aTGCs is modeled by reweighting the pythia simulation of WW+WZ to the mcfm [30] NLO predictions. We account for systematic uncertainties arising from luminosity, signal selection efficiency (via comparisons to mcfm samples employing alternate choices of PDFs as well as factorization and renormalization scales), signal shape, and from the normalization and shape of the SM processes. We find no evidence for aTGCs. Given the tight bound on the parameter \(\varDelta {g_{1}^{Z}}\) [37], we assume the SM value (\(\varDelta {g_{1}^{Z}}=0\)) and set limits on the two parameters λ and Δκ γ . Exclusion limits at 95 % confidence level (CL) in the two-dimensional λ-Δκ γ plane, computed using the modified frequentist CL S  [39, 42] technique with profile likelihood as the test statistic, are shown in Fig. 3. The limit setting procedure combines fit results from muon and electron channels We obtain the following one-dimensional observed 95 % CL limits assuming the SM value for the other parameter: −0.038<λ<0.030, −0.11<Δκ γ <0.14. These limits are competitive with, and in some cases improve upon, the sensitivity of the combined LEP experiments listed in Refs. [37, 4346]. The ATLAS Collaboration recently reported limits in the fully leptonic channel for WZ [7] and WW [8] production. Limits obtained from fully leptonic channels are weaker due to the smaller branching fractions.

Fig. 2
figure 2

Dijet p T distributions for (a) the muon and (b) the electron channels after full selection and with the requirement 75 GeV<m jj <95 GeV. The stacked histogram shapes are taken from simulation or, where applicable, from data-driven estimates. They are normalized according to the fit to the observed m jj spectrum in data. Below we show the Data/MC ratio with the (dashed) red lines corresponding to the shape uncertainty. The last bin includes the overflow (Color figure online)

Fig. 3
figure 3

Observed (solid) and expected (dashed) exclusion limits at 95 % CL for anomalous triple gauge couplings. The dark green and light yellow bands correspond to the one and two sigma intervals, respectively, in the expected limit distribution. The SM expectation is shown by the solid dot (Color figure online)

In summary, a measurement of the sum of the inclusive WW and WZ production cross sections has been performed using events containing a leptonically decaying W and two jets. The measured value for the cross section is σ(pp→WW+WZ)=68.9±8.7 (stat.)±9.7 (syst.)±1.5 (lum.) pb, which is consistent with the SM prediction. This is the first measurement of WW+WZ production in pp collisions using this signature. No evidence for anomalous triple gauge couplings is found, and limits are set on their magnitudes.