1 Introduction

We present measurements of the \(\mathrm {p}\mathrm {p}\rightarrow {\mathrm{W}}^{\pm } +X \rightarrow \mu ^{\pm }\nu +X\) differential cross section and the muon charge asymmetry that provide important constraints on the valence and sea quark distributions in the proton. Uncertainties in the parton distribution functions (PDF) have become a limiting factor for the precision of many inclusive and differential cross section calculations, given the development of precise theoretical tools describing hard scattering processes in \(\mathrm {p}\mathrm {p}\) collisions.

For each charge of the \(\mathrm {W}\) boson, the differential cross section,

$$\begin{aligned} \sigma ^{\pm }_{\eta }=\frac{{\mathrm{d}}\sigma }{{\mathrm{d}}\eta } (\mathrm {p}\mathrm {p}\rightarrow {\mathrm{W}}^{\pm } +X \rightarrow \mu ^{\pm }\nu +X), \end{aligned}$$
(1)

is measured in bins of muon pseudorapidity \(\eta = -\ln \tan (\theta /2)\) in the laboratory frame, where \(\theta \) is the polar angle of the muon direction with respect to the beam axis. Current theoretical calculations predict these cross sections with next-to-next-to-leading-order (NNLO) accuracy in perturbative quantum chromodynamics (QCD). The dominant \({\mathrm{W}}^{\pm } \) boson production occurs through the annihilation of a valence quark from one of the protons with a sea antiquark from the other: \({\mathrm{u}}\overline{{\mathrm{d}}}\rightarrow \mathrm {W^{+}}\) and \({\mathrm{d}}\overline{{\mathrm{u}}}\rightarrow \mathrm {W^{-}}\). Because of the presence of two valence \({\mathrm{u}}\) quarks in the proton, \(\mathrm {W^{+}}\) bosons are produced more often than \(\mathrm {W^{-}}\) bosons. Precise measurement of the charge asymmetry as a function of the muon \(\eta \),

$$\begin{aligned} \mathcal {A}(\eta ) = \frac{\sigma ^+_{\eta }-\sigma ^-_{\eta }}{\sigma ^+_{\eta }+\sigma ^-_{\eta }}, \end{aligned}$$
(2)

provides significant constraints on the ratio of \({\mathrm{u}}\) and \({\mathrm{d}}\) quark distributions in the proton for values of x, the Bjorken scaling variable [1], between \(10^{-3}\) and \(10^{-1}\).

The \({\mathrm{W}}^{\pm } \) boson production asymmetry was previously studied in \(\mathrm {p}{\overline{\mathrm{p}}}\) collisions by the CDF and D0 collaborations [26]. At the LHC, the first measurements of the lepton charge asymmetries were performed by the CMS, ATLAS, and LHCb experiments using data collected in 2010 [79]. The CMS experiment has further improved the measurement precision in both the electron and muon decay channels using data from \(\mathrm {p}\mathrm {p}\) collisions at \(\sqrt{s}=7\,\mathrm{TeV} \) corresponding to integrated luminosities of 0.84 and 4.7\(\,\text {fb}^{-1}\) in the electron [10] and muon [11] decay channels, respectively.

This measurement is based on a data sample of \(\mathrm {p}\mathrm {p}\) collisions at \(\sqrt{s}=8\,\mathrm{TeV} \) collected by CMS during 2012, corresponding to an integrated luminosity of 18.8\(\,\text {fb}^{-1}\). At \(\sqrt{s}=8\,\mathrm{TeV} \) the average value of the Bjorken scaling variable for the interacting partons in \({\mathrm{W}}^{\pm } \) boson production is lower than at \(\sqrt{s}=7\,\mathrm{TeV} \), which is expected to result in a lower \({\mathrm{W}}^{\pm } \) boson production charge asymmetry. This measurement provides important constraints on the proton PDFs, which is illustrated by the QCD analysis also presented in this paper.

2 CMS detector

The central feature of the CMS apparatus is a superconducting solenoid of 6\(\text {\,m}\) internal diameter, providing a magnetic field of 3.8\(\text {\,T}\). Within the solenoid volume are a silicon pixel and strip tracker, a lead tungstate crystal electromagnetic calorimeter, and a brass and scintillator hadron calorimeter, each composed of a barrel and two endcap sections. Muons are measured in gas-ionization detectors embedded in the steel flux-return yoke outside the solenoid. Extensive forward calorimetry complements the coverage provided by the barrel and endcap detectors. A more detailed description of the CMS detector can be found in Ref. [12].

3 Data selection and simulation

A \({\mathrm{W}}^{\pm } \rightarrow \mu ^{\pm }\nu \) event is characterized by an isolated muon with a high transverse momentum \(p_{\mathrm {T}} \) and a large missing transverse energy \(E_{\mathrm {T}}/\) associated with an undetected neutrino. Events in this sample are collected with an isolated single-muon trigger with a \(p_{\mathrm {T}} \) threshold of 24\(\,\mathrm{GeV}\). To reduce the background, identification and isolation criteria are applied to the reconstructed muons. These requirements are similar to those used in the previous measurement [11]. Muon tracks must be reconstructed in both the silicon tracker and the muon detectors. The global muon fit is required to have a \(\chi ^2\) per degree of freedom less than 10. The pseudorapidity coverage for reconstructed muons is restricted to \(|\eta |<2.4\). Cosmic ray contamination is largely reduced by rejecting the muon candidates with a large \(({>}0.2\,\text {cm})\) distance of closest approach to the primary vertex in the transverse plane. The isolation criterion is based on additional tracks reconstructed in a cone of \(\sqrt{{(\Delta \eta )^2+(\Delta \phi )^2}}<0.3\) around the muon, where \(\phi \) is the azimuthal angle (in radians) in the laboratory frame. The muon candidate is rejected if the scalar \(p_{\mathrm {T}} \) sum of these tracks is more than 10 % of the muon \(p_{\mathrm {T}} \). The selected muon candidate with the largest \(p_{\mathrm {T}} \), identified as a signal muon from the \(\mathrm {W}\) boson decay, is required to have a \(p_{\mathrm {T}} >25\,\mathrm{GeV} \) and also to be the particle that triggered the event. To reduce the background from Drell–Yan (DY) dimuon production, events containing a second identified muon with \(p_{\mathrm {T}} >15\,\mathrm{GeV} \) are rejected.

A total of about 61 million \(\mathrm {W^{+}}\rightarrow {{\mu }^{+}}{\nu }\) and 45 million \(\mathrm {W^{-}}\rightarrow {\mu ^{-}}\overline{\nu }\) candidate events are selected. The \({\mathrm{W}}^{\pm } \rightarrow \mu ^{\pm }\nu \) signal is contaminated with backgrounds that also produce a muon with high \(p_{\mathrm {T}} \). The major background sources are (i) multijet (QCD) events with high-\(p_{\mathrm {T}}\) muons produced in hadron decays (about 10 % of the selected sample), and (ii) \({\mathrm{Z}}/\gamma ^{*} \rightarrow {{\mu }^{+}}{\mu ^{-}} \) events (5 % of the sample). The contribution from other backgrounds, such as \({\mathrm{W}}^{\pm } \rightarrow \tau ^{\pm }\nu \) (2.6 %), \({\mathrm{Z}}/\gamma ^{*}\rightarrow \tau ^{+} \tau ^{-} \) (0.5 %), and \({\mathrm{t}}\overline{{\mathrm{t}}} \) (0.5 %) events, is relatively small. The contributions from single top quark (0.14 %) and diboson (0.07 %) events, as well as from cosmic muons (\(10^{-5}\)), are negligible.

Simulated samples are used to model the signal and background processes. The signal, as well as the electroweak and \({\mathrm{t}}\overline{{\mathrm{t}}}\) background samples, is based on the next-to-leading-order (NLO) matrix element calculations implemented in the powheg Monte Carlo (MC) event generator [1316], interfaced with pythia6 [17] for parton showering and hadronization, including electromagnetic final-state radiation (FSR). The CT10 NLO PDFs [18] are used. The \(\tau \) lepton decays in relevant processes are simulated with tauola  [19]. The QCD background is generated with pythia6 using CTEQ6L PDF [20].

The MC events are overlaid by simulated minimum-bias events to model additional \(\mathrm {p}\mathrm {p}\) interactions (pileup) present in data. The detector response to all generated particles is simulated with Geant4  [21]. Final-state particles are reconstructed with the same algorithms used for the data sample.

4 Corrections to the data and simulations

The fiducial cross sections are measured for muon \(p_{\mathrm {T}} >25\,\mathrm{GeV} \) in 11 bins of absolute pseudorapidity, covering the range \(|\eta |<2.4\). The \(| \eta | \) binning is such that the migration effects due to the finite \(\eta \) resolution are negligible. In each \(|\eta |\) bin, the number of \(\mathrm {W^{+}}\rightarrow \mu ^+\nu \) and \(\mathrm {W^{-}}\rightarrow \mu ^-\nu \) events is extracted by fitting the \(E_{\mathrm {T}}/\) distributions with signal and background distributions (templates). The template shapes and initial normalizations are derived from MC simulations. To improve the simulation, several corrections are applied to the MC samples. The corrections, which are similar to those used in the previous measurement [11], are briefly summarized below.

All simulated events are weighted to match the pileup distribution in data. The weight factors are based on the measured instantaneous luminosity and minimum-bias cross section leading to a good description of the average number of reconstructed vertices in the data.

Accurate calibration of the muon momentum is important for the proper modeling of the yields of \({\mathrm{W}}^{\pm } \) events and of the shapes of \(E_{\mathrm {T}}/\) templates. Dominant sources of the muon momentum mismeasurement are the mismodeling of the tracker alignment and the magnetic field. Muon momentum correction factors are derived using \({\mathrm{Z}}/\gamma ^{*} \rightarrow {{\mu }^{+}}{\mu ^{-}} \) events in several iterations [22]. First, “reference” distributions are defined based on the MC generated muons, with momenta smeared by the reconstruction resolution. Then, corrections to muon momentum in bins of \(\eta \) and \(\phi \) are extracted separately for positively and negatively charged muons. These corrections match the mean values of reconstructed \(1/p_{\mathrm {T}} \) spectra to the corresponding reference values. Finally, correction factors are tuned further by comparing the reconstructed dimuon invariant mass spectra in each \(\mu ^+\) and \(\mu ^-\) pseudorapidity bin with the reference. The correction factors are determined separately for data and simulated events following the same procedure.

The overall muon selection efficiency includes contributions from reconstruction, identification, isolation, and trigger efficiencies. Each component is measured from \({\mathrm{Z}}/\gamma ^*\rightarrow {{\mu }^{+}}{\mu ^{-}}\) events using the “tag-and-probe” method  [23, 24]. The efficiencies are measured in bins of \(\eta \) and \(p_{\mathrm {T}} \) for \(\mu ^+\) and \(\mu ^-\) separately. Each \(\eta \) bin of the efficiency measurement is fully contained in a single \(| \eta | \) bin used for the asymmetry measurement. The total average efficiency is about 85 % at central rapidities and drops to about 50 % in the last \(| \eta |\) bin. The ratio of the average \(\mu ^{+} \) and \(\mu ^{-} \) efficiencies varies within 0.6 % of unity in the first 10 \(| \eta | \) bins. In the last bin the ratio is 0.98. The same procedure is used in data and MC simulation, and scale factors are determined to match the MC simulation efficiencies to data.

Template shapes, used in the fits, are based on the missing transverse momentum (\({\vec {p}}_{\mathrm {T}}^{\text {miss}} \)) reconstructed with the particle-flow algorithm [25, 26]. The \({\vec {p}}_{\mathrm {T}}^{\text {miss}} \) is defined as the projection on the plane perpendicular to the beams of the negative vector sum of the momenta of all reconstructed particles in an event. A set of corrections is applied to \({\vec {p}}_{\mathrm {T}}^{\text {miss}} \) in order to improve the modeling of distributions of \(E_{\mathrm {T}}/=|{\vec {p}}_{\mathrm {T}}^{\text {miss}} |\) in data and MC templates. First, the average bias in the \({\vec {p}}_{\mathrm {T}}^{\text {miss}} \)-component along the direction of \({\vec {p}}_{\mathrm {T}} \)-sum of charged particles associated with the pileup vertices is removed [27]. Second, the muon momentum correction to \({\vec {p}}_{\mathrm {T}} \), described above, is added vectorially to \({\vec {p}}_{\mathrm {T}}^{\text {miss}} \). In addition, the “\(\phi \)-modulation” corrections, which increase linearly as a function of pileup, make the \(\phi ({\vec {p}}_{\mathrm {T}}^{\text {miss}})\) distributions uniform [27]. The above corrections are applied to both data and simulated events. The final set of corrections, derived from the “hadronic recoil” technique [28, 29], is applied to simulated \({\mathrm{W}}^{\pm } \rightarrow \mu ^{\pm }\nu \), \({\mathrm{Z}}/\gamma ^{*} \rightarrow {{\mu }^{+}}{\mu ^{-}} \), and QCD events to match the average \({\vec {p}}_{\mathrm {T}}^{\text {miss}} \) scale and resolution to data.

The modeling of the multijet events is further improved with a set of corrections derived from a QCD control sample selected by inverting the offline isolation requirement for events collected using a prescaled muon trigger with no isolation requirement. Muon \(p_{\mathrm {T}} \)-dependent weight factors are determined for the QCD simulation that match the muon \(p_{\mathrm {T}} \) distributions with data. The QCD control sample is also used to derive ratios between the yields with positive and negative muons in each muon \(| \eta | \) bin. These ratios are used to constrain the relative QCD contributions to \(\mathrm {W^{+}}\) and \(\mathrm {W^{-}}\) events, as described in Sect. 5.

5 Signal extraction

In each of the 11 muon \(|\eta |\) bins, yields of \(\mathrm {W^{+}}\) and \(\mathrm {W^{-}}\) events are obtained from the simultaneous \(\chi ^2\)-fit of the \(E_{\mathrm {T}}/\) distributions of \(\mu ^+\) and \(\mu ^-\) events. The definition of \(\chi ^2\) used in the fit takes into account the statistical uncertainties in the simulated templates. The shapes of the \(E_{\mathrm {T}}/\) distributions for the \({\mathrm{W}}^{\pm } \rightarrow \mu ^{\pm }\nu \) signal and the backgrounds are taken from the MC simulation after correcting for mismodeling of the detector response and for the \(p_{\mathrm {T}}\) distribution of \(\mathrm {W}\) bosons. All electroweak and \({\mathrm{t}}\overline{{\mathrm{t}}} \) background samples are normalized to the integrated luminosity using the theoretical cross sections calculated at NNLO. Each simulated event is also weighted with scale factors to match the average muon selection efficiencies in data. In addition, mass-dependent correction factors are applied to \({\mathrm{Z}}/\gamma ^{*} \rightarrow {{\mu }^{+}}{\mu ^{-}} \) simulated events to match the observed mass distribution of dimuon events in data.

The \(\mathrm {W^{+}}\) and \(\mathrm {W^{-}}\) signal yields and the total QCD background normalization are free parameters in each fit. The relative contributions of QCD background events in the \(\mathrm {W^{+}}\) and \(\mathrm {W^{-}}\) samples are constrained to values obtained from the QCD control sample. The \({\mathrm{W}}^{\pm } \rightarrow \tau ^{\pm }\nu \) background is normalized to the \({\mathrm{W}}^{\pm } \rightarrow \mu ^{\pm }\nu \) signal, for each charge, using the scale factors corresponding to the free parameters of the signal yield. The normalizations of the remaining electroweak and \({\mathrm{t}}\overline{{\mathrm{t}}} \) backgrounds are fixed in the fit.

Table 1 summarizes the fitted yields of \(\mathrm {W^{+}}\) (\(N^{+}\)) and \(\mathrm {W^{-}}\) (\(N^{-}\)) events, the correlation coefficient (\(\rho _{+,-}\)), and the \(\chi ^2\) value for each fit. Examples of fits for three \(|\eta |\) ranges are shown in Fig. 1. The ratio of the data to the final fit, shown below each distribution, demonstrates good agreement of the fits with data. It should be noted that the \(\chi ^2\) values reported in Table 1 are calculated using the statistical uncertainties of both data and simulated templates; systematic uncertainties are not taken into account.

Table 1 Summary of the fitted \(N^{+}\), \(N^{-}\), the correlation (\(\rho _{+,-}\)) between the uncertainties in \(N^{+}\) and \(N^{-}\), and the \(\chi ^{2}\) of the fit for each \(|\eta |\) bin. The number of degrees of freedom (\(n_{\text {dof}}\)) in each fit is 197. The quoted uncertainties are statistical and include statistical uncertainties in the templates. The correlation coefficients are expressed as percentages
Fig. 1
figure 1

Examples of fits in three \(| \eta | \) ranges: \(0.0<| \eta | <0.2\) (top), \(1.0<| \eta | <1.2\) (center), and \(2.1<| \eta | <2.4\) (bottom). For each \(\eta \) range, results for \(\mathrm {W^{+}}\) (left) and \(\mathrm {W^{-}}\) (right) are shown. The ratios between the data points and the final fits are shown at the bottom of each panel

For each muon charge and \(|\eta |\) bin, the fiducial cross section is calculated as

$$\begin{aligned} \sigma _\eta ^{\pm } = \frac{1}{2\Delta \eta }\frac{N^{\pm }}{\epsilon ^{\pm }\epsilon _{\mathrm {FSR}}^{\pm }\mathcal {L}_{\mathrm {int}}}, \end{aligned}$$
(3)

where \(\epsilon ^{\pm }\) is the average \(\mu ^{\pm }\) selection efficiency per \(|\eta |\) bin, \(\epsilon _{\mathrm {FSR}}\) takes into account the event loss within the muon \(p_{\mathrm {T}} \) acceptance due to the final-state photon emission, and \(\mathcal {L}_{\mathrm {int}}\) is the integrated luminosity of the data sample. Each \(\epsilon ^{\pm }_{\mathrm {FSR}}\) factor, defined as a ratio of the numbers of events within the \(p_{\mathrm {T}} \) acceptance after and before FSR, is evaluated using the signal MC samples.

6 Systematic uncertainties

To estimate the systematic uncertainties in the muon selection efficiencies, several variations are applied to the measured efficiency tables. First, the efficiency values in each \(\eta \)\(p_{\mathrm {T}} \) bin are varied within their statistical errors for data and simulation independently. In each pseudo-experiment the varied set of efficiencies is used to correct the MC simulation templates and extract the cross sections using Eq. (3). The standard deviation of the resulting distribution is taken as a systematic uncertainty for each charge and \(| \eta | \) bin. The statistical uncertainties between the two charges and all \(\eta \)\(p_{\mathrm {T}} \) bins are uncorrelated. Second, the offline muon selection efficiency scale factors are varied by \({\pm }0.5~\%\) coherently for both charges and all bins. The trigger selection efficiency scale factors are varied by \({\pm }0.2~\%\), assuming no correlations between the \(\eta \) bins, but \({+}100~\%\) correlations between the charges and \(p_{\mathrm {T}} \) bins. The above systematic variations take into account uncertainties associated with the tag-and-probe technique and are assessed by varying signal and background dimuon mass shapes, levels of background, and dimuon mass range and binning used in the fits. Finally, an additional \({\pm }100~\%\) correlated variation is applied based on the bin-by-bin difference between the true and measured efficiencies in the \({\mathrm{Z}}/\gamma ^{*} \rightarrow {{\mu }^{+}}{\mu ^{-}} \) MC sample. This difference changes gradually from about 0.5 % in the first bin to about \({-}2~\%\) in the last bin. This contribution is the main source of negative correlations in the systematic uncertainties between the central and high rapidity bins. The total systematic uncertainty in the efficiency is obtained by adding up the four covariance matrices corresponding to the above variations.

A possible mismeasurement of the charge of the muon could lead to a bias in the observed asymmetry between the \(\mathrm {W^{+}}\) and \(\mathrm {W^{-}}\) event rates. The muon charge misidentification rate has been studied in detail and was found to be negligible (\(10^{-5}\)) [7].

The muon momentum correction affects the yields and the shapes of the \(E_{\mathrm {T}}/\) distributions in both data and MC simulation. To estimate the systematic uncertainty, the muon correction parameters in each \(\eta \)\(\phi \) bin and overall scale are varied within their uncertainties. The standard deviation of the resulting cross section distribution for each charge and muon \(| \eta | \) bin is taken as the systematic uncertainty and the corresponding correlations are calculated. Finite detector resolution effects, which result in the migration of events around the \(p_{\mathrm {T}} \) threshold and between \(| \eta | \) bins, have been studied with the signal MC sample and found to have a negligible impact on the measured cross sections and asymmetries.

There are two sources of systematic uncertainties associated with the QCD background estimate. One is the uncertainty in the ratio of QCD background events in the \(\mathrm {W^{+}}\) and \(\mathrm {W^{-}}\) samples (\(R_{\pm }^{\mathrm {QCD}}\)). Whereas the total QCD normalization is one of the free parameters in the fit, \(R_{\pm }^{\mathrm {QCD}}\)is constrained to the value observed in the QCD control sample, which varies within 3 % of unity depending on the \(| \eta | \) bin. The corresponding systematic uncertainty is evaluated by changing it by \({\pm }5~\%\) in each \(| \eta | \) bin. This variation covers the maximum deviations indicated by the QCD MC simulation, as indicated in Fig. 2. The resulting systematic uncertainties are assumed to be uncorrelated between the \(| \eta | \) bins. Additionally, to take into account possible bias in this ratio due to different flavor composition in the signal and QCD control regions, the average difference of this ratio between the signal and QCD control regions is evaluated using the QCD MC simulation. This difference of about 3 % is taken as an additional 100 %-correlated systematic uncertainty in \(R_{\pm }^{\mathrm {QCD}}\). As a check, using the same shape for the QCD background in \({{\mu }^{+}}\) and \({\mu ^{-}}\) events, its normalization is allowed to float independently for the two charges. The resulting values of \(R_{\pm }^{\mathrm {QCD}}\)are covered by the above systematic uncertainties.

Fig. 2
figure 2

Distribution of \(R_{\pm }^{\mathrm {QCD}}\)in QCD control region for data (solid circles), QCD control region for simulation (solid squares), and signal region for simulation (open squares). Open circles show the \(R_{\pm }^{\mathrm {QCD}}\)distribution when QCD contributions in \(\mathrm {W^{+}}\) and \(\mathrm {W^{-}}\) events are not constrained. Shaded area indicates assigned systematic uncertainty

The other component of systematic uncertainty, associated with the QCD background, is the \(E_{\mathrm {T}}/\) shape. To estimate the systematic uncertainty in modeling the shape of the \(E_{\mathrm {T}}/\) distributions in QCD events, additional fits are performed using the \(E_{\mathrm {T}}/\) distributions without the hadronic recoil corrections and the \(p_{\mathrm {T}} \)-dependent scale factors; this results in a variation of about 2 % in the average \(E_{\mathrm {T}}/\) resolution. The resulting shifts in the extracted cross section values in each \(| \eta |\) bin are taken as systematic uncertainties. The correlations between the \(| \eta | \) bins and the two charges are assumed to be 100 %.

The normalization of the \({\mathrm{Z}}/\gamma ^{*} \rightarrow {{\mu }^{+}}{\mu ^{-}} \) background in the signal region is corrected with mass-dependent scale factors that match the dimuon mass distribution in MC simulation with data. The systematic uncertainty is the difference between the cross sections calculated with and without applying these corrections to the DY background normalizations. An uncertainty of 7 % is assigned to the \({\mathrm{t}}\overline{{\mathrm{t}}} \) theoretical cross section [30], which is used to normalize the \({\mathrm{t}}\overline{{\mathrm{t}}} \) background to the integrated luminosity of the data sample. In the fits, the \({\mathrm{W}}^{\pm } \rightarrow \tau ^{\pm }\nu \) background is normalized to the \({\mathrm{W}}^{\pm } \rightarrow \mu ^{\pm }\nu \) yields in data with a ratio obtained from the simulation. A \({\pm }2~\%\) uncertainty is assigned to the \({\mathrm{W}}^{\pm } \rightarrow \tau ^{\pm }\nu \) to \({\mathrm{W}}^{\pm } \rightarrow \mu ^{\pm }\nu \) ratio [31]. Each of the above variations is assumed to be fully correlated for the different bins and the two charges.

There are several sources of systematic uncertainty that affect the \(E_{\mathrm {T}}/\) shapes. The systematic uncertainty associated with the \(\phi \) modulation of \({\vec {p}}_{\mathrm {T}}^{\text {miss}} \) is small and is evaluated by removing the corresponding correction to \({\vec {p}}_{\mathrm {T}}^{\text {miss}} \). A 5 % uncertainty is assigned to the minimum bias cross section used to calculate the expected pileup distribution in data. To improve the agreement between data and simulation, the \(\mathrm {W}\) boson \(p_{\mathrm {T}} \) spectrum is weighted using factors determined by the ratios of the \(p_{\mathrm {T}}\) distributions in \({\mathrm{Z}}/\gamma ^{*} \rightarrow {{\mu }^{+}}{\mu ^{-}} \) events in data and MC simulation. The difference in measured cross sections with and without this correction is taken as a systematic uncertainty. Each of the sources above are assumed to be fully correlated between the two charges and different bins. Systematic uncertainties associated with the recoil corrections are evaluated by varying the average recoil and resolution parameters within their uncertainties. The standard deviation of the resulting cross section distribution is taken as the systematic uncertainty and correlations between the two charges and different bins are calculated.

The emission of FSR photons tends, on average, to reduce the muon \(p_{\mathrm {T}} \). The observed post-FSR cross sections within the \(p_{\mathrm {T}} \) acceptance are corrected using the \(\epsilon _{\mathrm {FSR}}^\pm \) factors derived from the signal MC sample. The difference between the \(p_{\mathrm {T}} \) spectra of positive and negative muons results in smaller charge asymmetries after FSR compared with those before FSR within the same \(p_{\mathrm {T}} >25\,\mathrm{GeV} \) acceptance. These differences, which vary between 0.07 and 0.11 % depending on the \(| \eta | \) bin, are corrected by the charge-dependent \(\epsilon _{\mathrm {FSR}}^\pm \) efficiency factors. The systematic uncertainty in the FSR modeling is estimated by reweighting events with radiated photons with correction factors that account for missing electroweak corrections in the parton shower [32, 33]. The \(\epsilon ^{\pm }_{\mathrm {FSR}}\) correction factors are reevaluated after such reweighting, and the difference between the cross sections, calculated with the new and default \(\epsilon _{\mathrm {FSR}}^\pm \) values, is taken as a systematic uncertainty. The correlations between the two charges and different \(| \eta | \) bins are assumed to be 100 %. The effects of migration between the \(| \eta | \) bins due to final-state photon emission have been evaluated with the signal MC sample and are found to be negligible.

The PDF uncertainties are evaluated by using the NLO MSTW2008 [34], CT10 [18], and NNPDF2.1 [35] PDF sets. All simulated events are weighted according to a given PDF set, varying both the template normalizations and shapes. For CT10 and MSTW2008 PDFs, asymmetric master equations are used [18, 34]. For the NNPDF2.1 PDF set, the standard deviation of the extracted cross section distributions is taken as a systematic uncertainty. For the CT10, the 90 % confidence level (CL) uncertainty is rescaled to 68 % CL using a factor of 1.645. The half-width of the total envelope of all three PDF uncertainty bands is taken as the PDF uncertainty. The CT10 error set is used to estimate the correlations between the two charges and different \(| \eta | \) bins.

Finally, a \({\pm }2.6~\%\) uncertainty [36] is assigned to the integrated luminosity of the data sample. The luminosity uncertainty is fully correlated between the \(| \eta | \) bins and two charges. Therefore, this uncertainty cancels in the measured charge asymmetries. The uncertainty in the normalization of the electroweak backgrounds due to the luminosity uncertainty has a negligible impact on the measurements.

Table 2 Systematic uncertainties in cross sections (\(\delta \sigma _\eta ^\pm \)) and charge asymmetry (\(\delta \mathcal {A}\)) for each \(|\eta |\) bin. The statistical and integrated luminosity uncertainties are also shown for comparison. A detailed description of each systematic uncertainty is given in the text
Table 3 Correlation matrices of systematic uncertainties for \(\sigma _\eta ^\pm \) and \(\mathcal {A}\). The statistical and integrated luminosity uncertainties are not included. The full \(22\times 22\) correlation matrix for \(\sigma _\eta ^\pm \) is presented as four blocks of \(11\times 11\) matrices, as shown in Eq. (4). The \(C_{++}\) and \(C_{--}\) blocks on the diagonal represent the bin-to-bin correlations of \(\delta \sigma _\eta ^+\) and \(\delta \sigma _\eta ^-\), respectively. The off-diagonal \(C_{+-}\) and \(C_{+-}^T\) blocks describe the correlations between the two charges. The values are expressed as percentages

Table 2 summarizes the systematic uncertainties in the measured cross sections and asymmetries. For comparison, the statistical and luminosity uncertainties are also shown. The uncertainty in the integrated luminosity dominates the total uncertainties in the measured cross sections, while the uncertainty in the QCD background estimation dominates the uncertainties in the charge asymmetries. The uncertainties for the muon charge asymmetries are calculated from those in the differential cross sections, taking into account the correlations between the two charges.

The correlations in the systematic uncertainty between the charges and different \(| \eta | \) bins are shown in Table 3. The full \(22{\times }22\) correlation matrix C is split into four \(11{\times }11\) blocks as

$$\begin{aligned} C= \begin{bmatrix} C_{++}&C_{+-} \\ C_{+-}^T&C_{--} \\ \end{bmatrix}, \end{aligned}$$
(4)

where the \(C_{++}\) and \(C_{--}\) matrices represent the bin-to-bin correlations of systematic uncertainties in \(\sigma _\eta ^+\) and \(\sigma _\eta ^-\), respectively, and \(C_{+-}\) describes the correlations between the two charges. To construct the total covariance matrix, the covariance matrix of the systematic uncertainties should be added to those of the statistical and integrated luminosity uncertainties. The latter are fully correlated between the two charges and \(| \eta | \) bins. For the statistical uncertainties bin-to-bin correlations are zero; the correlations between the two charges are shown in Table 1.

7 Results

The measured cross sections and charge asymmetries are summarized in Table 4 and displayed in Fig. 3. The error bars of the measurements represent both statistical and systematic uncertainties, including the uncertainty in the integrated luminosity. The measurements are compared with theoretical predictions based on several PDF sets. The predictions are obtained using the fewz 3.1 [37] NNLO MC calculation interfaced with CT10 [18], NNPDF3.0 [38], HERAPDF1.5 [39], MMHT2014 [40], and ABM12 [41] PDF sets. No electroweak corrections are included in these calculations. The error bars of the theoretical predictions represent the PDF uncertainty, which is the dominant source of uncertainty in these calculations. For the CT10, MMHT, HERA, and ABM PDFs, the uncertainties are calculated with their eigenvector sets using asymmetric master equations where applicable. For the NNPDF set the standard deviations over its 100 replicas are evaluated.

The numerical values of the predictions are also shown in Table 4. We note that the previous lepton charge asymmetries measured by CMS at \(\sqrt{s}=7\,\mathrm{TeV} \) have been included in the global PDF fits for the NNPDF3.0, MMHT2014, and ABM12 PDFs. The measured cross sections and charge asymmetries are well described by all considered PDF sets within their corresponding uncertainties.

Table 4 Summary of the measured differential cross sections \(\sigma _\eta ^\pm \) (pb) and charge asymmetry \(\mathcal {A}\). The first uncertainty is statistical, the second uncertainty is systematic, and the third is the integrated luminosity uncertainty. The theoretical predictions are obtained using the fewz 3.1 [37] NNLO MC tool interfaced with five different PDF sets
Fig. 3
figure 3

Comparison of the measured cross sections (upper plot for \(\sigma _\eta ^+\) and middle for \(\sigma _\eta ^-\)) and asymmetries (lower plot) to NNLO predictions calculated using the fewz 3.1 MC tool interfaced with different PDF sets. The right column shows the ratios (differences) between the theoretical predictions and the measured cross sections (asymmetries). The smaller vertical error bars on the data points represent the statistical and systematic uncertainties. The full error bars include the integrated luminosity uncertainty. The PDF uncertainty of each PDF set is shown by a shaded (or hatched) band and corresponds to 68 % CL

8 QCD analysis

The muon charge asymmetry measurements at 8\(\,\mathrm{TeV}\) presented here are used in a QCD analysis at NNLO together with the combined measurements of neutral- and charged-current cross sections of deep inelastic electron(positron)-proton scattering (DIS) at HERA [42]. The correlations of the experimental uncertainties for the muon charge asymmetry and for the inclusive DIS cross sections are taken into account. The theoretical predictions are calculated at NLO by using the mcfm 6.8 program [43, 44], which is interfaced to applgrid 1.4.56 [45]. The NNLO corrections are obtained by using k-factors, defined as ratios of the predictions at NNLO to the ones at NLO, both calculated with the fewz 3.1 [37] program, using NNLO CT10 [18] PDFs.

Version 1.1.1 of the open-source QCD fit framework for PDF determination herafitter [46, 47] is used with the partons evolved by using the Dokshitzer–Gribov–Lipatov–Altarelli–Parisi equations [4853] at NNLO, as implemented in the qcdnum 17-00/06 program [54].

The Thorne–Roberts [34, 55] general mass variable flavor number scheme at NNLO is used for the treatment of heavy-quark contributions with heavy-quark masses \(m_{{\mathrm{c}}} = 1.43\,\mathrm{GeV} \) and \(m_{{\mathrm{b}}} = 4.5\,\mathrm{GeV} \). The renormalization and factorization scales are set to Q, which denotes the four-momentum transfer in case of the DIS data and the mass of the \(\mathrm {W}\) boson in case of the muon charge asymmetry, respectively.

The strong coupling constant is set to \(\alpha _s (m_{{\mathrm{Z}}})\) = 0.118. The \(Q^2\) range of HERA data is restricted to \(Q^2 \ge Q^2_{\text {min}} = 3.5\,\mathrm{GeV} ^2\) to assure the applicability of perturbative QCD over the kinematic range of the fit.

The procedure for the determination of the PDFs follows the approach used in the analysis in Ref. [11]. The parameterised PDFs are the gluon distribution, \(x{\mathrm{g}} \), the valence quark distributions, \(x{\mathrm{u}}_v\), \(x{\mathrm{d}}_v\), and the \({\mathrm{u}}\)-type and \({\mathrm{d}}\)-type anti-quark distributions, \(x\overline{U}\), \(x\overline{D}\). At the initial scale of the QCD evolution \(Q_0^2 = 1.9\,\mathrm{GeV} ^2\), the PDFs are parametrized as:

$$\begin{aligned} x{\mathrm{g}} (x)&= A_{{\mathrm{g}}} x^{B_{{\mathrm{g}}}}\,(1-x)^{C_{{\mathrm{g}}}}\, (1+D_{{\mathrm{g}}} x) , \end{aligned}$$
(5)
$$\begin{aligned} x{\mathrm{u}}_v(x)&= A_{{\mathrm{u}}_v}x^{B_{{\mathrm{u}}_v}}\,(1-x)^{C_{{\mathrm{u}}_v}}\,(1+E_{{\mathrm{u}}_v}x^2) , \end{aligned}$$
(6)
$$\begin{aligned} x{\mathrm{d}}_v(x)&= A_{{\mathrm{d}}_v}x^{B_{{\mathrm{d}}_v}}\,(1-x)^{C_{{\mathrm{d}}_v}}, \end{aligned}$$
(7)
$$\begin{aligned} x\overline{U}(x)&= A_{\overline{U}}x^{B_{\overline{U}}}\, (1-x)^{C_{\overline{U}}}\, (1+E_{\overline{U}}x^2), \end{aligned}$$
(8)
$$\begin{aligned} x\overline{D}(x)&= A_{\overline{D}}x^{B_{\overline{D}}}\, (1-x)^{C_{\overline{D}}}, \end{aligned}$$
(9)

with the relations \(x\overline{U} = x\overline{{\mathrm{u}}}\) and \(x\overline{D} = x\overline{{\mathrm{d}}}+ x\overline{{\mathrm{s}}}\) assumed.

The normalization parameters \(A_{{\mathrm{u}}_{\mathrm {v}}}\), \(A_{{\mathrm{d}}_\mathrm {v}}\), and \(A_{{\mathrm{g}}}\) are determined by the QCD sum rules, the B parameter is responsible for small-x behavior of the PDFs, and the parameter C describes the shape of the distribution as \(x\,{\rightarrow }\,1\). Additional constraints \(B_{\overline{\mathrm {U}}} = B_{\overline{\mathrm {D}}}\) and \(A_{\overline{\mathrm {U}}} = A_{\overline{\mathrm {D}}}(1 - f_{{\mathrm{s}}})\) are imposed with \(f_{{\mathrm{s}}}\) being the strangeness fraction, \(f_{{\mathrm{s}}} = \overline{{\mathrm{s}}}/( \overline{{\mathrm{d}}}+ \overline{{\mathrm{s}}})\), which is fixed to \(f_{{\mathrm{s}}}=0.31\pm 0.08\) as in Ref. [34], consistent with the determination of the strangeness fraction by using the CMS measurements of \(\mathrm {W}\) + charm production [11]. The \(\chi ^2\) definition in the QCD analysis follows that of Eq. (32) of [42] without the logarithmic term. The parameters in Eqs. (5)–(9) were selected by first fitting with all D and E parameters set to zero. The other parameters were then included in the fit one at a time independently. The improvement of the \(\chi ^2\) of the fits was monitored and the procedure was stopped when no further improvement was observed. This led to a 13-parameter fit.

The PDF uncertainties are estimated according to the general approach of HERAPDF1.0 [56] in which the experimental, model, and parametrization uncertainties are taken into account. A tolerance criterion of \(\Delta \chi ^2 =1\) is adopted for defining the experimental uncertainties that originate from the measurements included in the analysis.

Model uncertainties arise from the variations in the values assumed for the heavy-quark masses \(m_{{\mathrm{b}}}\), \(m_{{\mathrm{c}}}\) with \(4.25\le m_{{\mathrm{b}}}\le 4.75\,\mathrm{GeV} \), \(1.37\le m_{{\mathrm{c}}}\le 1.49\,\mathrm{GeV} \), following Ref. [42], and the value of \(Q^2_{\text {min}}\) imposed on the HERA data, which is varied in the interval \(2.5 \le Q^2_{\text {min}}\le 5.0\,\mathrm{GeV} ^2\). The strangeness fraction \(f_{{\mathrm{s}}}\) is varied by its uncertainty.

The parametrization uncertainty is estimated by extending the functional form of all parton densities with additional parameters. The uncertainty is constructed as an envelope built from the maximal differences between the PDFs resulting from all the parametrization variations and the central fit at each x value. The total PDF uncertainty is obtained by adding experimental, model, and parametrization uncertainties in quadrature. In the following, the quoted uncertainties correspond to 68 % CL.

Table 5 Partial \(\chi ^2\) per number of data points, \(n_{\text {dp}}\), and the global \(\chi ^2\) per degrees of freedom, \(n_{\text {dof}}\), as obtained in the QCD analysis of HERA DIS and the CMS muon charge asymmetry data. For HERA measurements, the energy of the proton beam is listed for each data set, with electron energy being \(E_{{\mathrm{e}}}=27.5\,\mathrm{GeV} \)
Fig. 4
figure 4

Distributions of \({\mathrm{u}}\) valence (left) and \({\mathrm{d}}\) valence (right) quarks as functions of x at the scale \(Q^2=m^2_{\mathrm {W}}\). The results of the fit to the HERA data and muon asymmetry measurements (light shaded band), and to HERA data only (hatched band) are compared. The total PDF uncertainties are shown. In the bottom panels the distributions are normalized to 1 for a direct comparison of the uncertainties. The change of the PDFs with respect to the HERA-only fit is represented by a solid line

The global and partial \(\chi ^2\) values for the data sets used are listed in Table 5, illustrating the consistency among the data sets used. The somewhat high \(\chi ^2/n_{\mathrm {dof}}\) values for the combined DIS data are very similar to those observed in Ref. [42], where they are investigated in detail.

In the kinematic range probed, the final combined HERA DIS data currently provide the most significant constraints on the valence distributions. By adding these muon charge asymmetry measurements, the constraints can be significantly improved, as illustrated in Fig. 4 where the \(x {\mathrm{u}}\) and \(x {\mathrm{d}}\) valence distributions are shown at the scale of \(m^2_{\mathrm {W}}\), relevant for the \(\mathrm {W}\) boson production. The changes in shapes and the reduction of the uncertainties of the valence quark distributions with respect to those obtained with the HERA data are clear.

For direct comparison to the results of the earlier CMS QCD analysis [11] based on the \(\mathrm {W}\) asymmetry measured at \(\sqrt{s} = 7\,\mathrm{TeV} \) and the subset of HERA DIS data [56], an alternative PDF fit is performed at NLO, following exactly the data and model inputs of Ref. [11], but replacing the CMS measurements at \(\sqrt{s} = 7\,\mathrm{TeV} \) by those at \(\sqrt{s} = 8\,\mathrm{TeV} \). Also, a combined QCD analysis of both CMS data sets is performed. Very good agreement is observed between the CMS measurements of \(\mathrm {W}\) asymmetry at \(\sqrt{s}=7\,\mathrm{TeV} \) and \(\sqrt{s} = 8\,\mathrm{TeV} \) and a similar effect on the central values of the PDFs as reported in Ref. [11]. Compared to the PDFs obtained with HERA only data, the improvement of the precision in the valence quark distributions is more pronounced, when the measurements at \(\sqrt{s} = 8\,\mathrm{TeV} \) are used compared to the results of Ref. [11]. Due to somewhat lower Bjorken x probed by the measurements at \(8\,\mathrm{TeV} \), as compared to \(7\,\mathrm{TeV} \), the two data sets are complementary and should both be used in the future global QCD analyses.

9 Summary

In summary, we have measured the differential cross section and charge asymmetry of the \({\mathrm{W}}^{\pm } \rightarrow \mu ^{\pm }\nu \) production in \(\mathrm {p}\mathrm {p}\) collisions at \(\sqrt{s}=8\,\mathrm{TeV} \) using a data sample corresponding to an integrated luminosity of 18.8\(\,\text {fb}^{-1}\) collected with the CMS detector at the LHC. The measurements were performed in 11 bins of absolute muon pseudorapidity \(| \eta | \) for muons with \(p_{\mathrm {T}} >25\,\mathrm{GeV} \). The results have been incorporated into a QCD analysis at next-to-next-to-leading-order together with the inclusive deep inelastic scattering data from HERA. A significant improvement in the accuracy of the valence quark distributions is observed in the range \(10^{-3}< x <10^{-1}\), demonstrating the power of these muon charge asymmetry measurements to improve the main constraints on the valence distributions imposed by the HERA data, in the kinematics range probed. This strongly suggests the use of these measurements in future PDF determinations.