Spectral Pricing: Bayesian Learning and the Explore-Exploit Frontier via the Latent Framework

Nagy, Tamás

← All Papers · dynamic pricing / bayesian learning / revenue optimization

Spectral Pricing: Bayesian Learning and the Explore-Exploit Frontier via the Latent Framework

Dr. Tamás Nagy Updated 2026-04-10 Short Draft dynamic pricing / bayesian learning / revenue optimization Lean-Verified

Mathematics verified. Core theorems are machine-checked in Lean 4. Prose and presentation may not have been human-reviewed.

Download PDF View in Graph BibTeX

Abstract

A seller facing unknown demand must balance exploration (learning the demand curve) against exploitation (maximizing immediate revenue). We organize the explore-exploit tradeoff through the Latent Number \(\rho\) of the demand function. Under a one-step multiplicative template on variance proxies (successive variances satisfying \(\mathrm{Var}_t = \rho\,\mathrm{Var}_{t+1}\)), iterating yields geometric contraction \(\propto \rho^{-t}\); the companion script proves matching one-step geometric templates for per-period regret (Theorem 5) and exploration horizons (Theorem 8). The narrative uses the stylized scalings \(R(T) \propto \log(T)/\log(\rho)\) and \(\tau^ \propto \log(T)/\log(\rho)\) as standard bandit-style targets motivated after those templates—not as new uniform theorems proved here. Spectral language motivates truncating to \(N^\) components with error of order \(\rho^{-N}\) at the level of the encoded algebraic templates. Airline-style versus grocery-style scenarios (\(\rho \approx 4\) vs.\ \(\rho \approx 10\)) appear as illustrative calibrations with \(5\times\) exploration and \(4\times\) relative regret ratios fixed as hypotheses in the script. Fifteen lemmas in the companion proof file, machine-verified in the Lean 4 real-arithmetic kernel, zero user axioms.

Length

2,190 words

Claims

13 theorems

Status

Draft

Novelty

Framing the explore-exploit tradeoff in dynamic pricing through a single spectral complexity parameter (the Latent Number), though the fifteen verified lemmas are algebraic templates on fixed ratios rather than new stochastic or minimax results.

Connects To

Bounded Rationality and the Computational Complexity of Equi...

Browse all dynamic pricing / bayesian learning / revenue optimization papers →