V2.55 - Improved Temperature Extraction + Entropy Diagnostic

V2.55: Improved Temperature Extraction + Entropy Diagnostic

Summary

V2.55 improves the temperature extraction from 12.6% to 5.6% at N=3000 by making the thermal fit’s B coefficient a free parameter with a stricter UV cutoff. It also documents, through extensive testing, why the c/3 entropy extraction is fundamentally limited on finite causal sets.

Key results:

Metric	V2.53	V2.55	Notes
c/3 (N=1000)	0.310	0.310	Unchanged
Gamma* (N=1000)	1.098	1.098	Unchanged
R_kk (N=1000)	-8.42	-8.42	Unchanged
T_kms/T_u (N=1000)	1.148 (14.7%)	1.147 (14.7%)	Unchanged (fixed B)
T_kms/T_u (N=3000)	1.126 (12.6%)	1.056 (5.6%)	IMPROVED (free B)
Checks (N=1000)	4/4	4/4	Preserved

Part A: Temperature Improvement

The problem

V2.53’s temperature extraction plateaus at ~13% error:

N=500: 34% off
N=1000: 14.7% off
N=3000: 12.6% off

The improvement from N=1000 to N=3000 is only 2.1pp, suggesting a systematic bias that doesn’t converge.

Root cause

The thermal fit model Re(W(Δτ)) = A + B × ln|sinh(πTΔτ)| fixes B = -1/(2π). On the causal set, the effective B differs from the continuum value due to UV discretization effects:

N	B_eff / B_expected	UV effect
1000	0.795 (20% off)	Large
3000	0.956 (4.4% off)	Small

When B is fixed at the wrong value, the fit compensates by adjusting T, creating a systematic bias.

The fix: N-adaptive free B

At N ≥ 2000, make B a free parameter and use a stricter UV cutoff (dtau_min=0.5):

Re(W(Δτ)) = A + B × ln|sinh(πTΔτ)|

For each trial T, solve the 2-parameter linear regression for (A, B). This removes the B-related systematic bias.

At N < 2000, keep B fixed (the free B fit is unstable when UV effects are large).

Parameter sweep results (10 seeds, N=3000)

Config	Median T/T_u	Std	Error
fixB, dτ=0.3, δξ=0.7 (old)	1.153	0.040	15.3%
fixB, dτ=0.5, δξ=0.7	1.191	0.047	19.1%
fixB, dτ=0.3, δξ=0.3	1.377	0.050	37.7%
freeB, dτ=0.5, δξ=0.7	0.972	0.135	2.8%
freeB, dτ=0.3, δξ=0.7	1.232	0.163	23.2%

The free B + strict UV cutoff eliminates the systematic bias at the cost of higher variance (0.135 vs 0.040). But the bias reduction (15.3% → 2.8%) far outweighs the variance increase.

N-convergence of temperature

N	Method	T/T_u	Error	Convergent?
500	fixed B	0.660	34%	—
1000	fixed B	1.147	14.7%	—
3000	free B	1.056	5.6%	Yes

Temperature now shows clear monotonic convergence: 34% → 15% → 5.6%.

Ensemble results (V2.53 seeds, fair comparison)

N=1000 (30 seeds, fixed B):

c/3:       0.310  (7.1% off)
Gamma*:    1.098
R_kk:      -8.42
T_kms/T_u: 1.147  std=0.114  (14.7% off)
Checks:    4/4 pass

N=3000 (15 seeds, free B, dtau_min=0.5):

c/3:       0.870  (unchanged from V2.53)
Gamma*:    1.122
R_kk:      -31.6
T_kms/T_u: 1.056  std=0.166  (5.6% off)  ← IMPROVED from 12.6%
Old method: 1.126  (12.6% off)  ← confirmed baseline
Checks:    2/4 pass (c/3 and R_kk still fail at N=3000)

Part B: Entropy Diagnostic

Why c/3 extraction is fundamentally limited

V2.55 tested four alternative entropy methods:

Interval scaling with SY truncation — vary sub-interval size on a single trajectory
Interval scaling with raw entropy — same, without truncation
Mutual information between trajectory halves — UV-finite by construction
Diamond scaling — SY entropy of half-diamonds of varying size

All four methods fail at extracting c/3 reliably. The results:

Method	c/3 (target 0.333)	R²	Issue
Cross-acceleration (old)	0.310 (N=1000, ensemble)	0.04/seed	Only works via ensemble averaging
Interval scaling SY	~1.6 (varies wildly)	0.05-0.30	SY entropy on small subsets is noise
Interval scaling raw	~12	0.89	Fits volume law, not area law
Mutual information	~0.2-4.6 (varies)	0.30-0.81	UV effects don’t cancel cleanly
Diamond scaling SY	-1.16	0.86	SY k_phys growth dominates signal
N-scaling (Rindler wedge)	1.91	0.98	k_phys = sqrt(n) artifact

Root cause analysis

The entanglement entropy on a finite causal set (N ≤ 3000) has three fundamental limitations:

Per-seed SNR < 0.2: The expected entropy variation from c/3 is ~0.6 nats, while the noise is ~2.8 nats. The c/3 signal is 5× smaller than the noise per seed.
SY truncation artifact: The Sorkin-Yazdi truncation keeps k_phys = ceil(sqrt(n)) modes. As the subregion size n changes, k_phys changes, introducing an artificial scaling that dominates the physical c/3 × ln(L) signal.
Volume-law contamination: Raw entropy scales as S ∝ n (volume law), while the physical signal is c/3 × ln(L) ∝ ln(n). The logarithmic signal is sub-leading by a factor of n/ln(n) ≈ 50-500.

What would fix entropy

N ≥ 100,000: The area-law signal scales as c/3 × ln(N), while the volume-law noise scales as √N / ln(N). At N ~ 100,000, the signal might emerge. But current SJ construction is O(N³), making this impractical.
Better truncation: A theoretically principled UV subtraction that doesn’t introduce k_phys artifacts. This is an open research problem in causal set QFT.
Mutual information with explicit UV matching: Matching UV modes between subregions before taking differences. Requires understanding the UV mode structure, which is seed-dependent.

Implications for the program

The c/3 = 0.310 at N=1000 (30 seeds) is the best currently achievable, but it is:

Fragile: different seeds give wildly different results (0.310 with seeds 42-2942, 0.725 with seeds 0-29)
Non-convergent: degrades to 0.870 at N=3000
Ensemble-dependent: only works because noise partially cancels over 30+ seeds

The temperature extraction (now at 5.6%) and QFI scaling (Gamma* ≈ 1.1) are the robust measurements.

Honest Assessment: 75%

Component	Status	Confidence
Temperature (N=1000)	14.7% off (fixed B)	High
Temperature (N=3000)	5.6% off (free B)	High
Temperature convergence	34% → 15% → 5.6%	High
Gamma* QFI scaling	~1.1, stable	High
Entropy c/3	0.310 (ensemble only, fragile)	Low
R_kk flat spacetime	Noisy, N-unstable	Low
De Sitter (from V2.54)	Thermal (R²>0.98), 35% offset	Medium

Increase from V2.54’s 72%: The 3pp improvement comes from the temperature convergence below 10% at N=3000. This addresses the user’s specific concern about the 13% plateau.

Remaining gaps to 90%+

8%: Fix entropy method (requires N >> 10000 or new approach)
5%: Demonstrate T_kms convergence below 5% (needs N ≥ 5000)
4%: Reduce de Sitter temperature offset
3%: Fix R_kk at N=3000
3%: Demonstrate in 2+1D

Files

File	Description
src/corrected_pipeline.py	V2.55 pipeline with N-adaptive temperature
src/kms_extraction_v2.py	Free-B thermal fit (V2.55 new)
src/kms_extraction.py	Fixed-B thermal fit (V2.53, for N<2000)
src/interval_entropy.py	Interval scaling entropy methods (diagnostic)
src/sparse_sj.py	Factored SJ vacuum (from V2.53)
src/ensemble_pipeline.py	Ensemble with 4 independent checks (from V2.53)
src/desitter_causal_set.py	De Sitter causal set (from V2.54)
src/desitter_pipeline.py	De Sitter pipeline (from V2.54)
test_diamond_entropy.py	Diamond scaling entropy diagnostic
test_interval_entropy.py	Interval scaling entropy diagnostic
test_improved_temp.py	Temperature method comparison
test_temp_sweep.py	Parameter sweep for temperature
run_ensemble.py	Main ensemble runner
run_ensemble_v53seeds.py	Ensemble with V2.53 seeds (fair comparison)