Critical Percolation as a Synthetic Data Model for Interpretability
📄 arXiv:2606.20347 · 📥 PDF · 2026-06-18 · cond-mat.dis-nn
Authors: Aryeh Brill, Tom Ingebretsen Carlson
🕰 Orloj analysis
The paper introduces a new family of synthetic datasets based on critical percolation, designed for evaluating neural network interpretability methods. These datasets mimic the hierarchical, multi-scale structure of natural data, with fractal clusters and a power-law size distribution, while remaining analytically tractable and efficient to generate.
💡 This model offers a promising, analytically tractable, and scalable testbed for AI interpretability, filling a gap in realistic synthetic data with robust properties.
Categories:
INF-7
INF-4
EMG-5
INF-2
EMG-6
MET-5
MET-2
✓ falsifiable, limit_reductions, modest_claims, analytically_tractable
⚠ code_availability_not_mentioned
📄 Abstract
Neural networks learn features that reflect the hierarchical, multi-scale structure of natural data. Synthetic datasets used to evaluate interpretability methods typically lack this structure, limiting their value as realistic toy models. To close this gap, we introduce a family of synthetic datasets consisting of hierarchical functions defined on critical mean-field percolation clusters embedded in a high-dimensional data space. The percolation data consists of sparse, low-dimensional fractal clusters with a power-law size distribution. Latent variables modeling a taxonomic hierarchy generate each data point's target value. The data model is analytically tractable with known critical exponents that fix its properties without requiring hyperparameter tuning. We leverage a mapping between percolation clusters, random trees, and additive coalescence to propose an almost linear-time algorithm to jointly sample a random tree and its hierarchical latent decomposition, enabling data generation at arbitrary scale. Using probing experiments, we find that the model's ground-truth latent variables can be linearly decoded from neural network activations. Together, sparsity, self-similarity, power-law statistics, and analytical tractability make critical percolation a principled testbed for interpretability research.
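The power-law cluster-size statistics mentioned in the abstract can be illustrated with a small simulation. The sketch below is not the paper's almost linear-time tree sampler. It is a naive stand-in that assumes a random graph with n nodes and n/2 uniformly random edges (mean degree 1) is an acceptable proxy for critical mean-field percolation, and it extracts cluster sizes with a union-find structure. The function name `critical_cluster_sizes` and its parameters are hypothetical. For mean-field percolation the textbook cluster-size exponent is τ = 5/2, so the histogram of sizes should fall off roughly as s^(-5/2) up to finite-size effects.

```python
import random
from collections import Counter


def critical_cluster_sizes(n, seed=0):
    """Return cluster sizes (largest first) of a random graph at mean degree 1.

    Assumption: G(n, m) with m = n // 2 random edges stands in for critical
    mean-field percolation. This is a naive sampler for illustration only,
    not the tree-based algorithm proposed in the paper.
    """
    rng = random.Random(seed)
    parent = list(range(n))

    def find(x):
        # Union-find lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for _ in range(n // 2):
        # Endpoints are drawn independently; a self-loop is a harmless no-op.
        a, b = rng.randrange(n), rng.randrange(n)
        ra, rb = find(a), find(b)
        if ra != rb:
            parent[ra] = rb

    # Count how many nodes share each root, then sort sizes in descending order.
    return sorted(Counter(find(i) for i in range(n)).values(), reverse=True)


sizes = critical_cluster_sizes(100_000)
histogram = Counter(sizes)  # cluster size -> number of clusters of that size
```

Plotting `histogram` on log-log axes is one way to eyeball the power law. Most nodes sit in very small clusters, while the largest cluster stays far below the system size, which is the sparsity the abstract refers to.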