Experiments

V22: Intrinsic Predictive Gradient

Period: 2026-02-19. Substrate: V21 + within-lifetime gradient descent on energy prediction.

The key mechanism: Each environment step, the agent predicts its own energy delta, observes the truth, and updates its phenotype via SGD. The computational equivalent of the free energy principle: minimize surprise about your own persistence. No external reward, no human labels.

$$\text{loss} = (\hat{\Delta E} - \Delta E_{\text{actual}})^2 \quad \Rightarrow \quad \text{phenotype} \mathrel{-}= \text{lr} \cdot \nabla \text{loss}$$
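To make the update concrete, here is a minimal sketch of one within-lifetime step, assuming the phenotype carries a linear prediction head over its observation vector. The names (`w`, `b`, `obs`, `lr`) are illustrative, not the actual V22 fields.

```python
import numpy as np

def predictive_gradient_step(w, b, obs, delta_e_actual, lr):
    """One within-lifetime SGD step on the energy-prediction loss (sketch only).

    Assumes a linear prediction head: delta_e_hat = w . obs + b.
    """
    delta_e_hat = w @ obs + b            # predict own energy delta
    err = delta_e_hat - delta_e_actual   # compare against the observed delta
    loss = err ** 2                      # squared prediction error
    grad_w = 2.0 * err * obs             # d(loss)/dw
    grad_b = 2.0 * err                   # d(loss)/db
    w = w - lr * grad_w                  # phenotype -= lr * grad(loss)
    b = b - lr * grad_b
    return w, b, loss
```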
| Metric | Seed 42 | Seed 123 | Seed 7 | Mean |
|---|---|---|---|---|
| Mean robustness | 0.965 | 0.990 | 0.988 | 0.981 |
| Mean Φ (integrated information) | 0.106 | 0.100 | 0.085 | 0.097 |
| Mean prediction MSE | 6.4e-4 | 1.1e-4 | 4.0e-4 | 3.8e-4 |
| Final LR | 0.00483 | 0.00529 | 0.00437 | 0.00483 |

Within-lifetime learning is unambiguously working (100-15000x MSE improvement per lifetime, 3/3 seeds). LR not suppressed — evolution maintains learning. But robustness not improved over V20.

V22 trajectories: robustness, integration, population, and prediction MSE
V22 evolution trajectories. Top-left: robustness stays near 1.0 between droughts, drops sharply during them. Top-right: mean Φ ranges 0.05–0.20 — moderate integration that doesn't trend upward. Bottom-left: population dynamics with regular drought dips. Bottom-right: prediction MSE stays low (10⁻⁴ scale) — the gradient works, but better prediction doesn't translate to higher integration.
V22 agent evolution filmstrip showing grid state across cycles
V22 agent evolution (seed 42). Grid snapshots across evolution cycles C0–C29. Agents (colored dots) on a resource landscape (green). Population oscillates with drought cycles. The visual shows the substrate is working — agents persist, reproduce, and die in response to resource dynamics — but the spatial patterns alone don't reveal the internal integration story.

Prediction ≠ integration. The gradient makes agents better individual forecasters without creating cross-component coordination. A single linear prediction head can be satisfied by a subset of hidden units, with no cross-component coupling required. This is the decomposability problem: a linear readout always factors into independent per-unit contributions, so minimizing its loss never forces the hidden state to integrate.
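A toy illustration of that decomposability (not V22 code): when the target is readable from a few hidden units, SGD on a linear head's MSE only ever updates the weights the head reads, and the loss goes to zero while the remaining units stay uncoupled. The shapes and names below are invented for the example.

```python
import numpy as np

rng = np.random.default_rng(0)
H = rng.normal(size=(1000, 16))          # hidden states, 16 units
target = H[:, :4] @ rng.normal(size=4)   # energy delta depends on only 4 units

mask = np.zeros(16)
mask[:4] = 1.0                           # readout restricted to those 4 units
w = np.zeros(16)
lr = 0.01
for _ in range(2000):
    err = (H * mask) @ w - target                # masked linear readout
    w -= lr * mask * (H.T @ err) / len(H)        # gradient touches 4 weights only
print(float(np.mean(err ** 2)))                  # ~0: MSE minimized without the other 12 units
```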

Source code