August 1, 2014
We present a thermodynamic theory for a generic population of $M$ individuals distributed into $N$ groups (clusters). We construct the ensemble of all distributions with fixed $M$ and $N$, introduce a selection functional that embodies the physics that governs the population, and obtain the distribution that emerges in the scaling limit as the most probable among all distributions consistent with the given physics. We develop the thermodynamics of the ensemble and establish a rigorous mapping to thermodynamics. We treat the emergence of a so-called "giant component" as a formal phase transition and show that the criteria for its emergence are entirely analogous to the equilibrium conditions in molecular systems. We demonstrate the theory by an analytic model and confirm the predictions by Monte Carlo simulation.
Similar papers 1
July 2, 2020
The appeal of thermodynamics to problems outside physics is undeniable, as is the growing recognition of its apparent universality, yet in the absence of a rigorous formalism divorced from the peculiarities of molecular systems all attempts to generalize thermodynamics remain qualitative and heuristic at best. In this paper we formulate a probabilistic theory of thermodynamics and and set the basis for its application to generic stochastic populations.
September 6, 2017
We show that model-based Bayesian clustering, the probabilistically most systematic approach to the partitioning of data, can be mapped into a statistical physics problem for a gas of particles, and as a result becomes amenable to a detailed quantitative analysis. A central role in the resulting statistical physics framework is played by an entropy function. We demonstrate that there is a relevant parameter regime where mean-field analysis of this function is exact, and that,...
October 5, 2018
We use statistical mechanics to study model-based Bayesian data clustering. In this approach, each partition of the data into clusters is regarded as a microscopic system state, the negative data log-likelihood gives the energy of each state, and the data set realisation acts as disorder. Optimal clustering corresponds to the ground state of the system, and is hence obtained from the free energy via a low `temperature' limit. We assume that for large sample sizes the free ene...
March 4, 2003
Clustering provides a common means of identifying structure in complex data, and there is renewed interest in clustering as a tool for the analysis of large data sets in many fields. A natural question is how many clusters are appropriate for the description of a given system. Traditional approaches to this problem are based either on a framework in which clusters of a particular shape are assumed as a model of the system or on a two-step procedure in which a clustering crite...
September 18, 2018
Statistical thermodynamics has a universal appeal that extends beyond molecular systems, and yet, as its tools are being transplanted to fields outside physics, the fundamental question, \textit{what is thermodynamics?}, has remained unanswered. We answer this question here. Generalized statistical thermodynamics is a variational calculus of probability distributions. It is independent of physical hypotheses but provides the means to incorporate our knowledge, assumptions and...
April 25, 2023
There exists a wide variety of works on the dynamics of large populations ranging from simple heuristic modeling to those based on advanced computer supported methods. Their interconnections, however, remain mostly vague, which significantly limits the effectiveness of using computer methods in this domain. The aim of the present publication is to propose a concept based on the experience elaborated in the nonequilibrium statistical mechanics of interacting physical particles...
November 15, 2014
Binary aggregation is known to lead, under certain kinetic rules, to the coexistence of two populations, one consisting of finite-size clusters (sol), and one that contains a single cluster that carries a finite fraction of the total mass (giant component or gel). The sol-gel transition is commonly discussed as a phase transition by qualitative analogy to vapor condensation. Here we show that the connection to thermodynamic phase transition is rigorous. We develop the statist...
July 15, 2009
The aim of this work is to put forward a statistical mechanics theory of social interaction, generalizing econometric discrete choice models. After showing the formal equivalence linking econometric multinomial logit models to equilibrium statical mechanics, a multi-population generalization of the Curie-Weiss model for ferromagnets is considered as a starting point in developing a model capable of describing sudden shifts in aggregate human behaviour. Existence of the ther...
July 27, 2015
In the past years, a remarkable mapping has been found between the dynamics of a population of M individuals undergoing random mutations and selection, and that of a single system in contact with a thermal bath with temperature 1/M. This correspondence holds under the somewhat restrictive condition that the population is dominated by a single type at almost all times, punctuated by rare successive mutations. Here we argue that such thermal dynamics will hold more generally, s...
April 4, 2008
We propose a unifying, analytical theory accounting for the self-organization of colloidal systems in nano- or micro-cluster phases. We predict the distribution of cluter sizes with respect to interaction parameters and colloid concentration. In particular, we anticipate a proportionality regime where the mean cluster size grows proportionally to the concentration, as observed in several experiments. We emphasize the interest of a predictive theory in soft matter, nano-techno...