Unsupervised hierarchical clustering using the learning dynamics of restricted Boltzmann machines

Aurélien Decelle, Beatriz Seoane, and Lorenzo Rosset
Phys. Rev. E 108, 014110 – Published 7 July 2023

Abstract

Data sets in the real world are often complex and to some degree hierarchical, with groups and subgroups of data sharing common characteristics at different levels of abstraction. Understanding and uncovering the hidden structure of these data sets is an important task that has many practical applications. To address this challenge, we present a general method for building relational data trees by exploiting the learning dynamics of the restricted Boltzmann machine. Our method is based on the mean-field approach, derived from the Plefka expansion, and developed in the context of disordered systems. It is designed to be easily interpretable. We tested our method in an artificially created hierarchical data set and on three different real-world data sets (images of digits, mutations in the human genome, and a homologous family of proteins). The method is able to automatically identify the hierarchical structure of the data. This could be useful in the study of homologous protein sequences, where the relationships between proteins are critical for understanding their function and evolution.

  • Figure
  • Figure
  • Figure
  • Figure
  • Figure
  • Figure
  • Figure
11 More
  • Received 16 March 2023
  • Revised 19 May 2023
  • Accepted 9 June 2023

DOI:https://doi.org/10.1103/PhysRevE.108.014110

©2023 American Physical Society

Physics Subject Headings (PhySH)

Statistical Physics & ThermodynamicsInterdisciplinary Physics

Authors & Affiliations

Aurélien Decelle and Beatriz Seoane2

  • Departamento de Física Teórica, Universidad Complutense de Madrid, 28040 Madrid, Spain and Université Paris-Saclay, CNRS, INRIA Tau team, LISN, 91190 Gif-sur-Yvette, France

Lorenzo Rosset*

  • Departamento de Física Teórica, Universidad Complutense de Madrid, 28040 Madrid, Spain

  • *lrosset@ucm.es

Article Text (Subscription Required)

Click to Expand

References (Subscription Required)

Click to Expand
Issue

Vol. 108, Iss. 1 — July 2023

Reuse & Permissions
Access Options
Author publication services for translation and copyediting assistance advertisement

Authorization Required


×
×

Images

×

Sign up to receive regular email alerts from Physical Review E

Log In

Cancel
×

Search


Article Lookup

Paste a citation or DOI

Enter a citation
×