We propose a simple method to extract the community structure of large networks. Our method is a heuristic method that is based on modularity optimization. It is shown to outperform all other known community detection methods in terms of computation time. Moreover, the quality of the communities detected is very good, as measured by the so-called modularity. This is shown first by identifying language communities in a Belgian mobile phone network of 2 million customers and by analysing a web graph of 118 million nodes and more than one billion links. The accuracy of our algorithm is also verified on ad hoc modular networks.
The International School for Advanced Studies (SISSA) was founded in 1978 and was the first institution in Italy to promote post-graduate courses leading to a Doctor Philosophiae (or PhD) degree. A centre of excellence among Italian and international universities, the school has around 65 teachers, 100 post docs and 245 PhD students, and is located in Trieste, in a campus of more than 10 hectares with wonderful views over the Gulf of Trieste.
SISSA hosts a very high-ranking, large and multidisciplinary scientific research output. The scientific papers produced by its researchers are published in high impact factor, well-known international journals, and in many cases in the world's most prestigious scientific journals such as Nature and Science. Over 900 students have so far started their careers in the field of mathematics, physics and neuroscience research at SISSA.
ISSN: 1742-5468
Journal of Statistical Mechanics: Theory and Experiment (JSTAT) is a multi-disciplinary, peer-reviewed international journal created by the International School for Advanced Studies (SISSA) and IOP Publishing (IOP). JSTAT covers all aspects of statistical physics, including experimental work that impacts on the subject.
Open all abstracts, in this tab
Vincent D Blondel et al J. Stat. Mech. (2008) P10008
Preetum Nakkiran et al J. Stat. Mech. (2021) 124003
We show that a variety of modern deep learning tasks exhibit a 'double-descent' phenomenon where, as we increase model size, performance first gets worse and then gets better. Moreover, we show that double descent occurs not just as a function of model size, but also as a function of the number of training epochs. We unify the above phenomena by defining a new complexity measure we call the effective model complexity and conjecture a generalized double descent with respect to this measure. Furthermore, our notion of model complexity allows us to identify certain regimes where increasing (even quadrupling) the number of train samples actually hurts test performance.
Lucas Böttcher and Gregory Wheeler J. Stat. Mech. (2024) 023401
Analyzing the geometric properties of high-dimensional loss functions, such as local curvature and the existence of other optima around a certain point in loss space, can help provide a better understanding of the interplay between neural-network structure, implementation attributes, and learning performance. In this paper, we combine concepts from high-dimensional probability and differential geometry to study how curvature properties in lower-dimensional loss representations depend on those in the original loss space. We show that saddle points in the original space are rarely correctly identified as such in the expected lower-dimensional representations if random projections are used. The principal curvature in the expected lower-dimensional representation is proportional to the mean curvature in the original loss space. Hence, the mean curvature in the original loss space determines if saddle points appear, on average, as either minima, maxima, or almost flat regions. We use the connection between expected curvature in random projections and mean curvature in the original space (i.e. the normalized Hessian trace) to compute Hutchinson-type trace estimates without calculating Hessian-vector products as in the original Hutchinson method. Because random projections are not suitable for correctly identifying saddle information, we propose to study projections along the dominant Hessian directions that are associated with the largest and smallest principal curvatures. We connect our findings to the ongoing debate on loss landscape flatness and generalizability. Finally, for different common image classifiers and a function approximator, we show and compare random and Hessian projections of loss landscapes with up to approximately parameters.
Jozef Sznajd J. Stat. Mech. (2024) 053202
Three 2D spin models made of frustrated zig-zag chains with competing interactions which, by exact summation with respect to some degrees of freedom, can be replaced by an effective temperature-dependent interaction, were considered. The first model, exactly solvable Ising chains coupled by only four-spin interactions, does not exhibit any finite temperature phase transition; nevertheless, temperature can trigger a frustration–no frustration crossover accompanied by gigantic specific heat. A similar effect was observed in several two-leg ladder models (Weiguo 2020 arXiv:2006.08921v2; 2020 2006.15087v1). The anisotropic Ising chains coupled by a direct interchain interaction and, competing with it, indirect interaction via spins located between chains, are analyzed using the exact Onsager's equation and linear perturbation renormalization group (LPRG). Depending on the parameter set, such a model exhibits one antiferromagnetic (AF) or ferromagnetic (FM) phase transition or three phase transitions with a re-entrant disordered phase between AF and FM ones. The LPRG method was also used to study coupled uniaxial XXZ chains which, for example, can be a minimal model to describe the magnetic properties of compounds in which uranium and rare earth atoms form zig-zag chains. As with the Ising model, for a certain set of parameters, the model can undergo three phase transitions. However, both intrachain and interchain plain interactions can eliminate the re-entrant disordered phase, and then only one transition takes place. Additionally, the XXZ model can undergo temperature-induced metamagnetic transition.
Nicolò Ruggeri et al J. Stat. Mech. (2024) 043403
Hypergraphs are widely adopted tools to examine systems with higher-order interactions. Despite recent advancements in methods for community detection in these systems, we still lack a theoretical analysis of their detectability limits. Here, we derive closed-form bounds for community detection in hypergraphs. Using a message-passing formulation, we demonstrate that detectability depends on the hypergraphs' structural properties, such as the distribution of hyperedge sizes or their assortativity. Our formulation enables a characterization of the entropy of a hypergraph in relation to that of its clique expansion, showing that community detection is enhanced when hyperedges highly overlap on pairs of nodes. We develop an efficient message-passing algorithm to learn communities and model parameters on large systems. Additionally, we devise an exact sampling routine to generate synthetic data from our probabilistic model. Using these methods, we numerically investigate the boundaries of community detection in synthetic datasets, and extract communities from real systems. Our results extend our understanding of the limits of community detection in hypergraphs and introduce flexible mathematical tools to study systems with higher-order interactions.
Tomoyuki Obuchi and Yoshiyuki Kabashima J. Stat. Mech. (2016) 053304
We investigate leave-one-out cross validation (CV) as a determinator of the weight of the penalty term in the least absolute shrinkage and selection operator (LASSO). First, on the basis of the message passing algorithm and a perturbative discussion assuming that the number of observations is sufficiently large, we provide simple formulas for approximately assessing two types of CV errors, which enable us to significantly reduce the necessary cost of computation. These formulas also provide a simple connection of the CV errors to the residual sums of squares between the reconstructed and the given measurements. Second, on the basis of this finding, we analytically evaluate the CV errors when the design matrix is given as a simple random matrix in the large size limit by using the replica method. Finally, these results are compared with those of numerical simulations on finite-size systems and are confirmed to be correct. We also apply the simple formulas of the first type of CV error to an actual dataset of the supernovae.
Gernot Akemann et al J. Stat. Mech. (2024) 053501
The two-dimensional (2D) Coulomb gas is a one-parameter family of random point processes, depending on the inverse temperature β. Based on previous work, it is proposed as a simple statistical measure to quantify the intra- and interspecies repulsion among three different highly territorial birds of prey. Using data from the area of the Teutoburger Wald over 20 years, we fit the nearest-neighbour and next-to-nearest neighbour spacing distributions between the respective nests of the goshawk, eagle owl and the previously examined common buzzard to β of the Coulomb gas. Within each species, the repulsion measured in this way deviates significantly from the Poisson process of independent points in the plane. In contrast, the repulsion amongst each of two species is found to be considerably lower and closer to Poisson. Methodologically, we investigate the influence of the terrain, of a shorter interaction range given by the 2D Yukawa interaction, and the statistical independence of the time moving average we use for the yearly ensembles of occupied nests. We also check that an artificial random displacement of the original nest positions of the order of the mean level spacing quickly destroys the repulsion measured by β > 0. A simple, approximate analytical expression for the nearest-neighbour spacing distribution derived from non-Hermitian random matrix theory proves to be very useful.
Bart Wijns et al J. Stat. Mech. (2024) 043203
A microscopic model for a translational Brownian motor, dubbed a Brownian translator, is introduced. It is inspired by the Brownian gyrator described by Filliger and Reimann (2007 Phys. Rev. Lett.99 230602). The Brownian translator consists of a spatially asymmetric object moving freely along a line due to perpetual collisions with a surrounding ideal gas. When this gas has an anisotropic temperature, both spatial and temporal symmetries are broken and the object acquires a nonzero drift. Onsager reciprocity implies the opposite phenomenon, that is dragging a spatially asymmetric object into an (initially at) equilibrium gas induces an energy flow that results in anisotropic gas temperatures. Expressions for the dynamical and energetic properties are derived as a series expansion in the mass ratio (of gas particle vs. object). These results are in excellent agreement with molecular dynamics simulations.
Vir B Bulchandani J. Stat. Mech. (2024) 043205
We point out that Percus's collision integral for one-dimensional hard rods (Percus 1969 Phys. Fluids12 1560–3) does not preserve the thermal equilibrium state in an external trapping potential. We derive a revised Enskog equation for hard rods and show that it preserves this thermal state exactly. In contrast to recent proposed kinetic equations for dynamics in integrability-breaking traps, both our kinetic equation and its thermal states are explicitly nonlocal in space. Our equation differs from earlier proposals at third order in spatial derivatives and we attribute this discrepancy to the choice of collision integral underlying our approach.
Lorenzo Correale and Alessandro Silva J. Stat. Mech. (2024) 053101
We study the non-equilibrium phase diagram of a fully-connected Ising p-spin model, for generic p > 2, and investigate its robustness with respect to the inclusion of spin-wave fluctuations, resulting from a ferromagnetic, short-range spin interaction. In particular, we investigate the dynamics of the mean-field model after a quantum quench: we observe a new dynamical phase transition which is either first or second order depending on the even or odd parity of p, in stark contrast with its thermal counterpart which is first order for all p. The dynamical phase diagram is qualitatively modified by the fluctuations introduced by a short-range interaction which drive the system always towards various prethermal paramagnetic phases determined by the strength of time dependent fluctuations of the magnetization.
Open all abstracts, in this tab
Ashot Matevosyan J. Stat. Mech. (2024) 053206
Angular momentum conservation influences equilibrium statistical mechanics, leading to a generalized microcanonical density for an isolated system and a generalized Gibbs density for a weakly coupled system. We study the stochastic decay of angular momentum due to the weakly imperfect rotational symmetry of the external potential that confines the isolated many-particle system. We present a mesoscopic description of the system, deriving Langevin and Fokker–Planck equations, which are consistent with equilibrium statistical mechanics when rotational symmetry is maintained. When the symmetry is weakly violated, we formulate a coarse-grained stochastic differential equation governing the decay of total angular momentum over time. To validate our analytical predictions, we conduct numerical simulations of the microcanonical ensemble, an isolated system undergoing thermalization due to weak two-body interactions. Our coarse-grained Langevin equation accurately characterizes both the decay of the angular momentum and its fluctuations in a steady state. Furthermore, we estimate the parameters of our mesoscopic model directly from simulations, providing insights into the dissipative phenomenological coefficients, such as friction. More generally, this study contributes to a deeper understanding of the behavior of the integrals of motion when the corresponding symmetry is weakly violated.
Maurizio Serva J. Stat. Mech. (2024) 053401
The particles that we describe here can only move at the speed of light c in three-dimensional space. The velocity, which randomly but continuously changes direction, can be represented as a point on the surface of a sphere of constant radius c, and its trajectories may only connect points of this variety. The Wiener process that we use to describe the velocity dynamics on the surface of the sphere is anisotropic since the infinitesimal variation of the velocity is not only always orthogonal to the velocity itself (which guarantees a constant speed), but also to the position. This choice for the infinitesimal variation of the velocity is the one that best slows down the diffusion of particles in space by random motion at the speed of light. As a result of these dynamics, the position of the particles spontaneously remain confined on the surface of an expanding sphere whose radius increases, for large times, as the square root of time.
Jianjin Wang et al J. Stat. Mech. (2024) 053207
Amorphous solids can be theoretically modeled by anharmonic disordered lattices, but most of the theoretical studies on thermal conductivity in such lattices only consider hard-type (HT) anharmonicity. In this study, we investigate the thermal conductivity κ of one-dimensional disordered lattices with both HT and soft-type (ST) anharmonic on-site potentials. Our results from molecular dynamics simulations and the quasi-harmonic Green–Kubo (QHGK) method show that while the HT model exhibits non-monotonic dependence of κ on anharmonicity, the ST model shows a monotonically increasing trend. This trend provides a novel approach to enhancing thermal conductivity in disordered systems. Additionally, the QHGK predictions for κ in the HT model are consistent with simulation results over a wide range of anharmonicity values; however, for the ST model, deviations appear as the anharmonicity becomes softer. This peculiar feature may be attributed to delocalization effects being dominant in contrast to the competing roles played by both delocalization and localization effects observed in the HT model.
Cheng Tian and Shiqi Zhou J. Stat. Mech. (2024) 053205
We suggest a coarse-grained water model for use in classical density functional theory (cDFT) to describe aqueous inorganic salt solutions that act as working electrolytes in electrical double-layer capacitors (EDLCs) with electrodes comprising two face-to-face doped carbon monolayers. Focus of the cDFT calculations lies on the influence of solvent electrode wettability (SEW) on capacitance and energy storage behaviors, while also considering its interaction with factors like electrolyte bulk concentration, pore size, electrode voltage, and temperature. New phenomena are disclosed theoretically. Remarkably, this study challenges the traditional notion that energy storage is consistently boosted by enhancing the electrode's ionophobicity. Contrarily, the SEW effect reduces energy storage below the standard aqueous electrochemical window voltage (around 1.2 V) and only enhances the energy storage as the voltage surpasses a certain threshold up to the optimal window voltage (2 V–2.5 V). Furthermore, a non-monotonic SEW effect on energy storage is demonstrated under appropriate conditions, shedding new light on the complex relationship between ionophobicity and energy storage. Moreover, the present coarse-grained water model enables the prediction of the experimentally observed inverse relationship between temperature and capacitance. In contrast, the widely used electrolyte primitive model predicts the existence of a maximum value. The decisive factor for the impact of SEW on capacitance and energy storage is identified as congestion within the electrode pore, while other factors contribute by affecting this congestion. The present research offers valuable insights, highlighting the significance of SEW in the innovative and strategic design of aqueous inorganic EDLC devices.
Federico Corberi and Luca Smaldone J. Stat. Mech. (2024) 053204
We investigate the aging properties of the one-dimensional voter model with long-range interactions in its ordering kinetics. In this system, an agent, , positioned at a lattice vertex i, copies the state of another one located at a distance r, selected randomly with a probability . Employing both analytical and numerical methods, we compute the two-time correlation function () between the state of a variable Si at time s and that of another one, at distance r, at time t. At time t, the memory of an agent of its former state at time s, expressed by the autocorrelation function, decays algebraically for α > 1 as , where L is a time-increasing coherence length and λ is the Fisher–Huse exponent. We find λ = 1 for α > 2, and for . For , instead, there is an exponential decay, as in the mean field. Then, in contrast with what is known for the related Ising model, here we find that λ increases upon decreasing α. The space-dependent correlation obeys a scaling symmetry for α > 2. Similarly, for , one has , where the length regulating two-time correlations now differs from the coherence length as , with .
Open all abstracts, in this tab
Annabel L Davies and Tobias Galla J. Stat. Mech. (2022) 11R001
Network meta-analysis (NMA) is a technique used in medical statistics to combine evidence from multiple medical trials. NMA defines an inference and information processing problem on a network of treatment options and trials connecting the treatments. We believe that statistical physics can offer useful ideas and tools for this area, including from the theory of complex networks, stochastic modelling and simulation techniques. The lack of a unique source that would allow physicists to learn about NMA effectively is a barrier to this. In this article we aim to present the 'NMA problem' and existing approaches to it coherently and in a language accessible to statistical physicists. We also summarise existing points of contact between statistical physics and NMA, and describe our ideas of how physics might make a difference for NMA in the future. The overall goal of the article is to attract physicists to this interesting, timely and worthwhile field of research.
Shamik Gupta et al J. Stat. Mech. (2014) R08001
The phenomenon of spontaneous synchronization, particularly within the framework of the Kuramoto model, has been a subject of intense research over the years. The model comprises oscillators with distributed natural frequencies interacting through a mean-field coupling, and serves as a paradigm to study synchronization. In this review, we put forward a general framework in which we discuss in a unified way known results with more recent developments obtained for a generalized Kuramoto model that includes inertial effects and noise. We describe the model from a different perspective, highlighting the long-range nature of the interaction between the oscillators, and emphasizing the equilibrium and out-of-equilibrium aspects of its dynamics from a statistical physics point of view. In this review, we first introduce the model and discuss both for the noiseless and noisy dynamics and for unimodal frequency distributions the synchronization transition that occurs in the stationary state. We then introduce the generalized model, and analyze its dynamics using tools from statistical mechanics. In particular, we discuss its synchronization phase diagram for unimodal frequency distributions. Next, we describe deviations from the mean-field setting of the Kuramoto model. To this end, we consider the generalized Kuramoto dynamics on a one-dimensional periodic lattice on the sites of which the oscillators reside and interact with one another with a coupling that decays as an inverse power-law of their separation along the lattice. For two specific cases, namely, in the absence of noise and inertia, and in the case when the natural frequencies are the same for all the oscillators, we discuss how the long-time transition to synchrony is governed by the dynamics of the mean-field mode (zero Fourier mode) of the spatial distribution of the oscillator phases.
Open all abstracts, in this tab
Maurizio Serva J. Stat. Mech. (2024) 053401
The particles that we describe here can only move at the speed of light c in three-dimensional space. The velocity, which randomly but continuously changes direction, can be represented as a point on the surface of a sphere of constant radius c, and its trajectories may only connect points of this variety. The Wiener process that we use to describe the velocity dynamics on the surface of the sphere is anisotropic since the infinitesimal variation of the velocity is not only always orthogonal to the velocity itself (which guarantees a constant speed), but also to the position. This choice for the infinitesimal variation of the velocity is the one that best slows down the diffusion of particles in space by random motion at the speed of light. As a result of these dynamics, the position of the particles spontaneously remain confined on the surface of an expanding sphere whose radius increases, for large times, as the square root of time.
Federico Corberi and Luca Smaldone J. Stat. Mech. (2024) 053204
We investigate the aging properties of the one-dimensional voter model with long-range interactions in its ordering kinetics. In this system, an agent, , positioned at a lattice vertex i, copies the state of another one located at a distance r, selected randomly with a probability . Employing both analytical and numerical methods, we compute the two-time correlation function () between the state of a variable Si at time s and that of another one, at distance r, at time t. At time t, the memory of an agent of its former state at time s, expressed by the autocorrelation function, decays algebraically for α > 1 as , where L is a time-increasing coherence length and λ is the Fisher–Huse exponent. We find λ = 1 for α > 2, and for . For , instead, there is an exponential decay, as in the mean field. Then, in contrast with what is known for the related Ising model, here we find that λ increases upon decreasing α. The space-dependent correlation obeys a scaling symmetry for α > 2. Similarly, for , one has , where the length regulating two-time correlations now differs from the coherence length as , with .
Sourav Manna et al J. Stat. Mech. (2024) 053301
We study many-body localization in a hardcore boson model in the presence of random disorder on finite generation fractal lattices with different Hausdorff dimensions and different local lattice structures. In particular, we consider the Vicsek, T-shaped, Sierpinski gasket, and modified Koch-curve fractal lattices. In the single-particle case, these systems display Anderson localization for arbitrary disorder strength if they are large enough. In the many-body case, the systems available to exact diagonalization exhibit a transition between a delocalized and localized regime, visible in the spectral and entanglement properties of these systems. The position of this transition depends on the Hausdorff dimension of the given fractal, as well as on its local structure.
Lorenzo Correale and Alessandro Silva J. Stat. Mech. (2024) 053101
We study the non-equilibrium phase diagram of a fully-connected Ising p-spin model, for generic p > 2, and investigate its robustness with respect to the inclusion of spin-wave fluctuations, resulting from a ferromagnetic, short-range spin interaction. In particular, we investigate the dynamics of the mean-field model after a quantum quench: we observe a new dynamical phase transition which is either first or second order depending on the even or odd parity of p, in stark contrast with its thermal counterpart which is first order for all p. The dynamical phase diagram is qualitatively modified by the fluctuations introduced by a short-range interaction which drive the system always towards various prethermal paramagnetic phases determined by the strength of time dependent fluctuations of the magnetization.
Gernot Akemann et al J. Stat. Mech. (2024) 053501
The two-dimensional (2D) Coulomb gas is a one-parameter family of random point processes, depending on the inverse temperature β. Based on previous work, it is proposed as a simple statistical measure to quantify the intra- and interspecies repulsion among three different highly territorial birds of prey. Using data from the area of the Teutoburger Wald over 20 years, we fit the nearest-neighbour and next-to-nearest neighbour spacing distributions between the respective nests of the goshawk, eagle owl and the previously examined common buzzard to β of the Coulomb gas. Within each species, the repulsion measured in this way deviates significantly from the Poisson process of independent points in the plane. In contrast, the repulsion amongst each of two species is found to be considerably lower and closer to Poisson. Methodologically, we investigate the influence of the terrain, of a shorter interaction range given by the 2D Yukawa interaction, and the statistical independence of the time moving average we use for the yearly ensembles of occupied nests. We also check that an artificial random displacement of the original nest positions of the order of the mean level spacing quickly destroys the repulsion measured by β > 0. A simple, approximate analytical expression for the nearest-neighbour spacing distribution derived from non-Hermitian random matrix theory proves to be very useful.
Jozef Sznajd J. Stat. Mech. (2024) 053202
Three 2D spin models made of frustrated zig-zag chains with competing interactions which, by exact summation with respect to some degrees of freedom, can be replaced by an effective temperature-dependent interaction, were considered. The first model, exactly solvable Ising chains coupled by only four-spin interactions, does not exhibit any finite temperature phase transition; nevertheless, temperature can trigger a frustration–no frustration crossover accompanied by gigantic specific heat. A similar effect was observed in several two-leg ladder models (Weiguo 2020 arXiv:2006.08921v2; 2020 2006.15087v1). The anisotropic Ising chains coupled by a direct interchain interaction and, competing with it, indirect interaction via spins located between chains, are analyzed using the exact Onsager's equation and linear perturbation renormalization group (LPRG). Depending on the parameter set, such a model exhibits one antiferromagnetic (AF) or ferromagnetic (FM) phase transition or three phase transitions with a re-entrant disordered phase between AF and FM ones. The LPRG method was also used to study coupled uniaxial XXZ chains which, for example, can be a minimal model to describe the magnetic properties of compounds in which uranium and rare earth atoms form zig-zag chains. As with the Ising model, for a certain set of parameters, the model can undergo three phase transitions. However, both intrachain and interchain plain interactions can eliminate the re-entrant disordered phase, and then only one transition takes place. Additionally, the XXZ model can undergo temperature-induced metamagnetic transition.
Bart Wijns et al J. Stat. Mech. (2024) 043203
A microscopic model for a translational Brownian motor, dubbed a Brownian translator, is introduced. It is inspired by the Brownian gyrator described by Filliger and Reimann (2007 Phys. Rev. Lett.99 230602). The Brownian translator consists of a spatially asymmetric object moving freely along a line due to perpetual collisions with a surrounding ideal gas. When this gas has an anisotropic temperature, both spatial and temporal symmetries are broken and the object acquires a nonzero drift. Onsager reciprocity implies the opposite phenomenon, that is dragging a spatially asymmetric object into an (initially at) equilibrium gas induces an energy flow that results in anisotropic gas temperatures. Expressions for the dynamical and energetic properties are derived as a series expansion in the mass ratio (of gas particle vs. object). These results are in excellent agreement with molecular dynamics simulations.
Vir B Bulchandani J. Stat. Mech. (2024) 043205
We point out that Percus's collision integral for one-dimensional hard rods (Percus 1969 Phys. Fluids12 1560–3) does not preserve the thermal equilibrium state in an external trapping potential. We derive a revised Enskog equation for hard rods and show that it preserves this thermal state exactly. In contrast to recent proposed kinetic equations for dynamics in integrability-breaking traps, both our kinetic equation and its thermal states are explicitly nonlocal in space. Our equation differs from earlier proposals at third order in spatial derivatives and we attribute this discrepancy to the choice of collision integral underlying our approach.
Kevin Bauerbach and Florian Gebhard J. Stat. Mech. (2024) 043102
We determine the impurity-induced free energy and the impurity-induced zero-field susceptibility of the symmetric single-impurity Kondo model from weak-coupling perturbation theory up to third order in the Kondo coupling at low temperatures and small magnetic fields. We reproduce the analytical structure of the zero-field magnetic susceptibility as obtained from Wilson's renormalization group method. This permits us to obtain analytically the first two Wilson numbers.
Nicolò Ruggeri et al J. Stat. Mech. (2024) 043403
Hypergraphs are widely adopted tools to examine systems with higher-order interactions. Despite recent advancements in methods for community detection in these systems, we still lack a theoretical analysis of their detectability limits. Here, we derive closed-form bounds for community detection in hypergraphs. Using a message-passing formulation, we demonstrate that detectability depends on the hypergraphs' structural properties, such as the distribution of hyperedge sizes or their assortativity. Our formulation enables a characterization of the entropy of a hypergraph in relation to that of its clique expansion, showing that community detection is enhanced when hyperedges highly overlap on pairs of nodes. We develop an efficient message-passing algorithm to learn communities and model parameters on large systems. Additionally, we devise an exact sampling routine to generate synthetic data from our probabilistic model. Using these methods, we numerically investigate the boundaries of community detection in synthetic datasets, and extract communities from real systems. Our results extend our understanding of the limits of community detection in hypergraphs and introduce flexible mathematical tools to study systems with higher-order interactions.