Experiments / V2.663
V2.663
Dynamical Selection COMPLETE

V2.663 - Global Evidence Ratio — Bayesian Framework vs ΛCDM

V2.663: Global Evidence Ratio — Bayesian Framework vs ΛCDM

The Headline Number

Bayes factor = 50 (Planck alone) to 65,000 (4 independent datasets)

The framework (Ω_Λ = 0.6877, zero parameters) is statistically preferred over ΛCDM (Ω_Λ free, one parameter) by the Savage-Dickey density ratio. This is “Very strong” to “Decisive” evidence on the Jeffreys scale.

Method

The Savage-Dickey density ratio gives the Bayes factor between a nested model (framework, with Ω_Λ fixed) and the parent model (ΛCDM, with Ω_Λ free):

BF = P(Ω_Λ = 0.6877 | data, ΛCDM) / P(Ω_Λ = 0.6877 | prior, ΛCDM)

The numerator is the posterior density at the framework’s prediction (how well the prediction matches the data). The denominator is the prior density (how specific the prediction is relative to the prior range).

Individual Datasets

Datasetσ-tensionBFJeffreys
Planck 2018 CMB+0.4σ50.2Very strong
Planck + BAO+0.2σ69.6Very strong
ACT DR4 + WMAP+0.3σ32.0Very strong
SPT-3G 2018+0.1σ24.7Strong
DES Y3 (3×2pt)+0.9σ8.9Substantial
Pantheon+ SN+1.2σ10.7Strong
DESI BAO Y1+1.2σ13.7Strong

Every single dataset individually favors the framework over ΛCDM. The preference is strongest for CMB experiments (which constrain Ω_Λ most tightly) and weakest for low-z probes (which have larger errors and slightly higher tension).

Where the Preference Comes From

The Occam Factor

ΛCDM has Ω_Λ as a free parameter. The data constrains it to σ = 0.0073. With a flat prior on [0, 1], the Occam penalty is:

Occam factor = σ_posterior × √(2π) / Δ_prior = 0.0183

ΛCDM is penalized 55× for having a parameter that the data constrains to <1% of its prior range. The framework predicted this 1% range for free.

The Fit Quality

The framework’s prediction (0.6877) is 0.4σ above Planck’s best fit (0.6847). This costs a factor of exp(-0.17/2) = 0.92. A ~8% penalty for being slightly off, far outweighed by the 55× Occam reward.

Net: 55 × 0.92 ≈ 50× in favor of the framework (single dataset).

BIC and AIC Comparison

ModelkΔχ²ΔBICΔAIC
Framework5+0.2-7.7-1.8
ΛCDM60.00.00.0
w₀CDM7-1.2+6.6+0.8
w₀waCDM8-4.0+11.70.0
ΛCDM + N_eff7-0.5+7.3+1.5
ΛCDM + Σm_ν7-0.3+7.5+1.7

The framework is the only model that beats ΛCDM on BIC. All extensions (w₀, w₀wa, N_eff, Σm_ν) are penalized for extra parameters. ΔBIC = -7.7 qualifies as “strong evidence” on the Kass-Raftery scale (>6).

Future Forecast

Experimentσ(Ω_Λ)YearIf framework correctIf ΛCDM best-fit correct
Current (Planck)0.00732018BF = 55 (Very strong)BF = 50 (Very strong)
DESI DR3 + Planck0.0042027BF = 100 (Very strong)BF = 75 (Very strong)
Euclid + CMB-S40.0022030BF = 199 (Decisive)BF = 65 (Very strong)
Ultimate0.0012035BF = 399 (Decisive)BF = 4.4 (Substantial)

Key insight: If the framework is correct (Ω_Λ_true = 0.6877), the evidence grows monotonically — better data always helps. If ΛCDM’s best fit is correct (0.6847), the evidence initially stays high (Occam factor dominates) but eventually the 0.4σ offset grows to 3σ at ultimate precision, and the framework is disfavored. The two scenarios diverge decisively at σ ≈ 0.001 (circa 2035).

Honest Assessment

What IS established

  1. BF > 10 for every individual dataset — this is “Strong” evidence minimum
  2. ΔBIC = -7.7 — “Strong evidence” on Kass-Raftery (threshold = 6)
  3. The framework is the only model that beats ΛCDM on information criteria
  4. The preference comes from parsimony — the framework predicts, ΛCDM fits

What requires caution

  1. Prior dependence: The BF of 50 assumes a flat prior Ω_Λ ∈ [0, 1]. With a narrower prior (e.g., σ = 0.1 Gaussian), the advantage drops to ~5. The honest statement is BF ∈ [5, 55] depending on the prior.

  2. Combined BF is inflated: The combined BF of 65,000 from 4 “independent” datasets overstates the evidence. The datasets share calibration, physical models, and some systematics. The true independent BF is likely closer to 50-200, not 65,000. The single-dataset Planck BF of 50 is the most trustworthy number.

  3. Savage-Dickey assumes Gaussianity: The posterior on Ω_Λ is nearly Gaussian for Planck, but non-Gaussian tails from prior volume effects could modify the result at the 10-20% level.

  4. BF is relative, not absolute: A BF of 50 says the framework is 50× more probable than ΛCDM given the data. It does NOT say the framework is correct. Both models could be wrong; a third model could be better.

  5. The 0.4σ offset could be real: If the true Ω_Λ is 0.6847 (not 0.6877), the framework is slightly wrong. At current precision this is invisible. At Euclid precision (σ = 0.002), it becomes a 1.5σ tension. At ultimate precision (σ = 0.001), it becomes 3σ — potentially fatal.

  6. w₀waCDM: The DESI w≠-1 result is driven by SN systematics (V2.657), but if DESI DR3 confirms w≠-1 at >5σ with consistent SN samples, the framework is falsified regardless of the BF calculated here.

The bottom line

The framework’s statistical advantage over ΛCDM is real and quantifiable: BF ≈ 50 from Planck alone, ΔBIC = -7.7. This comes entirely from eliminating one free parameter while remaining within 0.4σ of the data. It is the simplest model that fits the data — Occam’s razor in action.

But the honest comparison is: framework = “we predicted the answer” vs ΛCDM = “we fit the answer.” A Bayes factor measures the value of that prediction. Whether the prediction is CORRECT (vs. coincidentally close) depends on future precision measurements, particularly Euclid and CMB-S4.

Files

  • src/evidence_ratio.py: Savage-Dickey computation, BIC/AIC, forecasts
  • tests/test_evidence_ratio.py: 27 tests, all passing
  • results.json: Full numerical output