paper

Post-hoc regularisation of unfolded cross-section measurements

Published 11 October 2022 © 2022 IOP Publishing Ltd and Sissa Medialab
, , Citation Lukas Koch 2022 JINST 17 P10021 DOI 10.1088/1748-0221/17/10/P10021

1748-0221/17/10/P10021

Abstract

Neutrino cross-section measurements are often presented as unfolded binned distributions in "true" variables. The ill-posedness of the unfolding problem can lead to results with strong anti-correlations and fluctuations between bins, which make comparisons to theoretical models in plots difficult. To alleviate this problem, one can introduce regularisation terms in the unfolding procedure. These suppress the anti-correlations in the result, at the cost of introducing some bias towards the expected shape of the data.

This paper discusses a method using simple linear algebra, which makes it is possible to regularise any result that is presented as a central value and a covariance matrix. This "post-hoc" regularisation is generally much faster than repeating the unfolding method with different regularisation terms. The method also yields a regularisation matrix which connects the regularised to the unregularised result, and can be used to retain the full statistical power of the unregularised result when publishing a nicer looking regularised result.

In addition to the regularisation method, this paper also presents some thoughts on the presentation of correlated data in general. When using the proposed method, the bias of the regularisation can be understood as a data visualisation problem rather than a statistical one. The strength of the regularisation can be chosen by minimising the difference between the implicitly uncorrelated distribution shown in the plots and the actual distribution described by the unregularised central value and covariance.

Aside from minimising the difference between the shown and the actual result, additional information can be provided by showing the local log-likelihood gradient of the models shown in the plots. This adds more information about where the model is "pulled" by the data than just comparing the bin values to the data's central values.

Export citation and abstract BibTeX RIS

10.1088/1748-0221/17/10/P10021