In this thesis, we apply methods from replica theory to deal with two contemporary problems in the study of complex systems. In Part I, we discuss the behavior of the rare fluctuations of the observable of choice in most models of spin glasses: the free energy. Due to the quenched disorder, this and other thermodynamic quantities are self-averaging random variables, whose probability distribution can be evaluated within replica theory. In particular, the probability of the free energy fluctuations above its typical value shows an anomalous scaling with the number of degrees of freedom, at variance with the ordinary exponential suppression for fluctuations below. We explain how the introduction of a small magnetic field can remove this anomalous behavior. In Part II, we apply the replica formalism to the problem of linear classification of objects with a geometrical structure, in the context of machine learning. In particular, using combinatorial techniques we evaluate the number of dichotomies (binary classifications) of a set of structured inputs achievable by a linear classifier, as a function of the number of inputs to classify. We prove that this number shows an additional critical point beyond the usual storage capacity for isolated points, at which the number of admissible dichotomies becomes zero in the thermodynamic limit; the associated phase transition present a certain degree of replica symmetry breaking. This behavior is due to a trade-off between the increasing number of points to classify, and the increasing volume excluded by their geometrical structure. This approach goes in the direction of finding bounds on the generalization error of certain simple neural network architectures, more stringent for structured data than the ones known from Statistical Learning Theory.

REPLICAS IN COMPLEX SYSTEMS: APPLICATIONS TO LARGE DEVIATIONS AND NEURAL NETWORKS / M. Pastore ; tutor: S. Caracciolo ; coordinatore: M. Paris. Università degli Studi di Milano, 2020 Oct 27. 33. ciclo, Anno Accademico 2020. [10.13130/pastore-mauro_phd2020-10-27].

REPLICAS IN COMPLEX SYSTEMS: APPLICATIONS TO LARGE DEVIATIONS AND NEURAL NETWORKS

M. Pastore
2020

Abstract

In this thesis, we apply methods from replica theory to deal with two contemporary problems in the study of complex systems. In Part I, we discuss the behavior of the rare fluctuations of the observable of choice in most models of spin glasses: the free energy. Due to the quenched disorder, this and other thermodynamic quantities are self-averaging random variables, whose probability distribution can be evaluated within replica theory. In particular, the probability of the free energy fluctuations above its typical value shows an anomalous scaling with the number of degrees of freedom, at variance with the ordinary exponential suppression for fluctuations below. We explain how the introduction of a small magnetic field can remove this anomalous behavior. In Part II, we apply the replica formalism to the problem of linear classification of objects with a geometrical structure, in the context of machine learning. In particular, using combinatorial techniques we evaluate the number of dichotomies (binary classifications) of a set of structured inputs achievable by a linear classifier, as a function of the number of inputs to classify. We prove that this number shows an additional critical point beyond the usual storage capacity for isolated points, at which the number of admissible dichotomies becomes zero in the thermodynamic limit; the associated phase transition present a certain degree of replica symmetry breaking. This behavior is due to a trade-off between the increasing number of points to classify, and the increasing volume excluded by their geometrical structure. This approach goes in the direction of finding bounds on the generalization error of certain simple neural network architectures, more stringent for structured data than the ones known from Statistical Learning Theory.
27-ott-2020
Settore FIS/02 - Fisica Teorica, Modelli e Metodi Matematici
CARACCIOLO, SERGIO
PARIS, MATTEO
Doctoral Thesis
REPLICAS IN COMPLEX SYSTEMS: APPLICATIONS TO LARGE DEVIATIONS AND NEURAL NETWORKS / M. Pastore ; tutor: S. Caracciolo ; coordinatore: M. Paris. Università degli Studi di Milano, 2020 Oct 27. 33. ciclo, Anno Accademico 2020. [10.13130/pastore-mauro_phd2020-10-27].
File in questo prodotto:
File Dimensione Formato  
phd_unimi_R12067.pdf

accesso aperto

Tipologia: Tesi di dottorato completa
Dimensione 1.63 MB
Formato Adobe PDF
1.63 MB Adobe PDF Visualizza/Apri
Pubblicazioni consigliate

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2434/778413
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact