← All Papers · Quantitative Finance

Spectral Dynamic Programming: Per-Mode Convergence Rates and Computational Acceleration for American Options

Tamás Nagy, Ph.D. Updated 2026-03-05 Draft Quantitative Finance Lean-Verified
Mathematics verified. Core theorems are machine-checked in Lean 4. Prose and presentation may not have been human-reviewed.
Download PDF View in Graph BibTeX

Abstract

We show that eigendecomposing the transition kernel of a Markov decision process yields a spectral Bellman equation: each eigenmode \(k\) satisfies an independent scalar recurrence \(v_k' = r_k + \gamma \mu_k v_k\), converging at rate \(\gamma |\mu_k| \leq \gamma\), where \(\mu_k\) is the \(k\)-th eigenvalue of the transition matrix. Fast modes (\(|\mu_k| \ll 1\)) converge in a single iteration; slow modes (\(|\mu_k| \approx 1\)) dominate the computational cost. This spectral decomposition has three practical consequences:

(1) Computational acceleration. Standard value iteration applies \(M\) uniform backward steps. Spectral value iteration allocates \(M_k = \lceil \log(1/\varepsilon) / \log(1/\gamma |\mu_k|) \rceil\) iterations per mode, skipping already-converged fast modes. For American basket options on \(n\) assets with the COS method, where the transfer matrix eigenvalues decay exponentially as \(|\mu_k| \sim \rho^{-k}\) (analyticity radius \(\rho > 1\)), this reduces the number of non-trivial backward steps from \(M\) to \(M_{\text{eff}} = \lceil \log(1/\varepsilon) / \log(1/\gamma) \rceil\) on only the \(K_{\text{eff}} = O(\log(1/\varepsilon) / \log \rho)\) modes that have not yet converged. Total work per exercise date: \(O(K_{\text{eff}} \cdot M_{\text{eff}} + N^2)\) vs \(O(N^2)\) for the standard method. When \(K_{\text{eff}} \ll N\) and the eigendecomposition is amortized across an option surface, the dominant cost drops by a factor of \(N / K_{\text{eff}}\).

(2) Shadow prices for risk constraints. Adding VaR or CVaR constraints to the Bellman equation yields a Lagrangian with shadow price \(\lambda^ = \partial V^ / \partial b\), where \(b\) is the risk budget. The shadow price quantifies the cost of each unit of risk limit: "relaxing the VaR constraint by \\(1M increases the portfolio value by \)\lambda^*\(." For constrained LP formulations, risk limits appear as additional rows with interpretable dual variables.

(3) Model-free option bounds. The robust Bellman equation \)V = \max_a \min_\xi [R + \gamma \sum P_\xi V]\( under \)\varepsilon\(-perturbation of the transition kernel is still a contraction at rate \)\gamma(1+\varepsilon) < 1\(. The resulting value function interval \)[V_{\text{low}}, V_{\text{high}}]\( gives model-free option price bounds with width \)O(\varepsilon)\( that narrow monotonically as the uncertainty set shrinks.

All results are machine-verified in Lean 4: 78 theorems across 28 files (Bellman + Extended Bellman gyms), compiled via lake build with zero errors. We benchmark spectral DP against standard COS backward induction on American basket options and show 5--50\)\times$ speedup for concentrated eigenvalue spectra.

Keywords: dynamic programming, spectral decomposition, American options, COS method, convergence rate, eigenvalue, shadow price, robust optimization, Lean

Length
5,769 words
Claims
6 theorems
Status
Draft
Target
SIAM Journal on Financial Mathematics / Mathematics of Operations Research

Novelty

The per-mode convergence rate decomposition γ|μ_k| for spectral Bellman iteration is a clean repackaging of known spectral properties of stochastic matrices, but the explicit connection to COS option pricing eigenstructure and the mode-adaptive iteration count formula are new and useful.

Connects To

Formal Foundations of Stochastic Gradient Descent

Browse all Quantitative Finance papers →