← back

Critical Percolation as a Synthetic Data Model for Interpretability

📄 arXiv:2606.20347 · 📥 PDF · 2026-06-18 · cond-mat.dis-nn

Authors: Aryeh Brill [arXiv · scholar] , Tom Ingebretsen Carlson [arXiv · scholar]

🕰 Orloj analysis

7.9
Total score
8.5
Consistency
7.0
Quality
⭐⭐
AD relevance

Paper představuje novou rodinu syntetických datových sad založených na kritické perkolaci, určených k hodnocení metod interpretovatelnosti neuronových sítí. Tyto sady simulují hierarchickou, víceskalární strukturu přírodních dat s fraktálními shluky a mocninným rozdělením velikostí, přičemž jsou analyticky uchopitelné a efektivně generovatelné.

💡 Tento model nabízí slibný, analyticky uchopitelný a škálovatelný testovací rámec pro interpretovatelnost AI, vyplňující mezeru v realistických syntetických datech s robustními vlastnostmi.

Categories: INF-7 INF-4 EMG-5 INF-2 EMG-6 MET-5 MET-2

✓ falsifiable, limit_reductions, modest_claims, analytically_tractable

⚠ code_availability_not_mentioned

📄 Abstract

Neural networks learn features that reflect the hierarchical, multi-scale structure of natural data. Synthetic datasets used to evaluate interpretability methods typically lack this structure, limiting their value as realistic toy models. To close this gap, we introduce a family of synthetic datasets consisting of hierarchical functions defined on critical mean-field percolation clusters embedded in a high-dimensional data space. The percolation data consists of sparse, low-dimensional fractal clusters with a power-law size distribution. Latent variables modeling a taxonomic hierarchy generate each data point's target value. The data model is analytically tractable with known critical exponents that fix its properties without requiring hyperparameter tuning. We leverage a mapping between percolation clusters, random trees, and additive coalescence to propose an almost linear-time algorithm to jointly sample a random tree and its hierarchical latent decomposition, enabling data generation at arbitrary scale. Using probing experiments, we find that the model's ground-truth latent variables can be linearly decoded from neural network activations. Together, sparsity, self-similarity, power-law statistics, and analytical tractability make critical percolation a principled testbed for interpretability research.

📄 arXiv abstract page 📥 PDF