License: arXiv.org perpetual non-exclusive license
arXiv:2603.17049v2 [physics.optics] 26 Jun 2026

Attractor-Keyed Memory

Natalia G. Berloff Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge CB3 0WA, United Kingdom N.G.Berloff@damtp.cam.ac.uk
Abstract

Physical selectors (lasers choosing a mode, Ising machines settling on a ground state, condensates occupying a spin state) produce high-dimensional signatures at the moment of decision: full field amplitudes, multimode interference patterns, or scattering responses. These signatures are richer than the winner’s index, yet they are routinely discarded. We show that when the signatures are repeatable across trials (stereotyped) and linearly independent across routes, a single linear decoder compiled from calibration data maps them to arbitrary payloads, merging selection and memory access into one event and eliminating the fetch that dominates latency and energy in sparse routing architectures. The construction requires one singular value decomposition of measured device responses, which certifies capability and bounds worst-case error for any downstream payload before the task is chosen. Runtime error separates into two independently diagnosable channels, decoding fidelity (controlled by dictionary conditioning) and routing reliability (controlled by the margin-to-noise ratio), each with a distinct physical origin and targeted remedy. We derive the full error decomposition, give Ising-machine selector constructions, and validate the predicted scalings on synthetic speckle-signature simulations across three measurement modalities. No hardware demonstration exists; we provide a falsifiable four-step experimental protocol specifying what a first experiment must measure. Whether real device signatures satisfy stereotypy is the central open question.

Architectures built around discrete selection, including mixture-of-experts models [7], neuromorphic processors [31, 14], and photonic classifiers [29, 32], perform two operations per input: a router selects which of KK experts to activate, then the associated DD-dimensional payload is fetched from digital memory [35, 12, 10, 34]. The fetch is the bottleneck: the payload read dominates both latency and energy [12, 31]. Yet the physics of selection has already produced a rich observable at the moment of decision.

When a laser selects a mode, a condensate occupies a state [1], or an Ising machine relaxes toward a ground-state configuration [18, 11, 30], the winning attractor carries a high-dimensional physical signature far richer than the winner’s index. This signature is discarded. We show that a fixed linear decoder can map it to arbitrary data, so that selection and memory access become a single event. We call this primitive attractor-keyed memory (AKM); retrieval without a separate memory fetch we term fetchless lookup.

The framework rests on one physical assumption and one algebraic condition. The assumption is route-conditioned repeatability (stereotypy): conditioned on route kk winning, the measured signature is approximately the same across inputs. Given stereotypy, the algebraic condition is that the KK calibrated mean signatures must be linearly independent. When they are, a minimum-norm pseudoinverse decoder [25] W=YΦ+W=Y\Phi^{+} recovers any desired payload table YY exactly, where ΦMsig×K\Phi\in\mathbb{R}^{M_{\mathrm{sig}}\times K} collects the KK mean signatures of dimension MsigM_{\mathrm{sig}}. Since Φ\Phi depends only on the hardware, changing the payload requires only recompiling WW.

For hardware used as a selector, AKM adds a design objective beyond producing a reliable winner: engineer the post-selection state so its signatures form a well-conditioned dictionary. This added objective yields three practical consequences. First, a task-independent certification protocol: a single singular value decomposition (SVD) of measured device responses determines rank(Φ)\operatorname{rank}(\Phi) and σmin(Φ)\sigma_{\min}(\Phi), certifying universal payload realizability and bounding worst-case error before deployment. Second, a two-channel error decomposition with distinct diagnostics and remedies: decoding fidelity controlled by σmin(Φ)\sigma_{\min}(\Phi), routing reliability controlled by the ratio Δ/Teff\Delta/T_{\mathrm{eff}} of the selection margin to effective noise temperature. Third, concrete hardware design targets: three predeployment failure modes (rank loss, conditioning collapse, margin collapse), each measurable and separately remediable.

The linear algebra is standard [25]; the rank criterion for Φ\Phi is the finite-dictionary analogue of the full-rank condition in reservoir readout training [15, 5]. In reservoir computing [3, 5], errors fold into a single empirical error budget; in Hopfield retrieval [27], the attractor is the stored object. The novelty is the object the algebra acts on: Φ\Phi is compiled once from measured device responses and predicts capability for any payload. Timing-based address selection in spiking networks [2] and driven-dissipative mode competition [1, 30] provide candidate physical selectors.

Refer to caption
Figure 1: Attractor-keyed memory versus reservoir computing. Both use a linear readout from a physical state, but differ structurally: reservoirs operate on a continuous driven trajectory, AKM on a discrete set of attractor signatures; reservoir states are input-sensitive (separability), AKM signatures are stereotyped (same pattern each time route kk wins); the reservoir decoder is trained on input–output pairs, the AKM decoder is compiled from calibration data via W=YΦ+W=Y\Phi^{+}; reservoir analyses fold routing-like failures into an overall readout error budget, AKM decomposes into two separately diagnosable channels (decoding fidelity σmin(Φ)\sigma_{\min}(\Phi), routing reliability Δ/Teff\Delta/T_{\mathrm{eff}}); reservoir analysis yields a capacity bound, AKM yields a predeployment certification protocol. In AKM the payload yky_{k} need not resemble the attractor state; the attractor is the key, not the stored value.

Evidence hierarchy. Four layers, in decreasing rigor: (i) exact theorem (full-rank dictionary gives universal payload realizability); (ii) approximate theory (conditioning, drift, and stereotypy bounds); (iii) phenomenology (Gibbs routing fit); (iv) synthetic validation (Monte Carlo on speckle-signature models, not physical experiment). Synthetic validation confirms internal consistency; it does not test physical assumptions. Figure 1 contrasts the architecture with reservoir computing; Fig. 2(a) illustrates a photonic realization. Whether real competitive selectors satisfy stereotypy is the central open experimental question; the formal statement and its quantitative relaxation appear below.

Route–decode abstraction.

Fetchless lookup factors into routing and readout:

xk^(x)ϕ~k^(x)y=Wϕ~k^(x).x\;\longmapsto\;\hat{k}(x)\;\longmapsto\;\tilde{\phi}_{\hat{k}(x)}\;\longmapsto\;y=W\tilde{\phi}_{\hat{k}(x)}. (1)

Here xNx\in\mathbb{R}^{N} is an NN-dimensional input; k^(x)\hat{k}(x) is the discrete route selected by the physical competition; ϕ~k^(x)Msig\tilde{\phi}_{\hat{k}(x)}\in\mathbb{R}^{M_{\mathrm{sig}}} is the single-shot measured signature of the winning state; and WD×MsigW\in\mathbb{R}^{D\times M_{\mathrm{sig}}} is a fixed linear decoder mapping signatures to DD-dimensional payloads. The theory requires only that routing returns a winner k^(x)\hat{k}(x) with a measurable margin; the internal structure of the selector is an implementation detail. In these terms, a route is the index k^(x)\hat{k}(x) returned by the selector; the payload yky_{k} is the data assigned to route kk; a fetch is the separate memory read that returns yky_{k} from a stored table once the route is known; and fetchless lookup eliminates that read by obtaining yy directly from the winner’s signature through WW, as in Eq. (1).

Score generation and routing.

One realization maps an input xx to a bank of MscoreKM_{\mathrm{score}}\geq K candidate scores:

h(x)=Bwidex,BwideMscore×N.h(x)=B_{\mathrm{wide}}\,x,\qquad B_{\mathrm{wide}}\in\mathbb{R}^{M_{\mathrm{score}}\times N}. (2)

In photonic platforms BwideB_{\mathrm{wide}} may be realized by a programmable linear optical transformation [19, 4, 22]. A fixed routing map RrouteK×MscoreR_{\mathrm{route}}\in\mathbb{R}^{K\times M_{\mathrm{score}}} compresses the wide scores into KK competing routes:

g(x)=Rrouteh(x)K,Ek(x)=gk(x),g(x)=R_{\mathrm{route}}\,h(x)\in\mathbb{R}^{K},\qquad E_{k}(x)=-g_{k}(x), (3)

where gk(x)g_{k}(x) is the score for route kk and Ek(x)E_{k}(x) the corresponding selector energy. The selector returns k^(x)argmaxkgk(x)\hat{k}(x)\in\operatorname*{arg\,max}_{k}g_{k}(x) with winner-to-runner-up margin

Δ(x)=g(1)(x)g(2)(x),\Delta(x)=g_{(1)}(x)-g_{(2)}(x), (4)

where g(1)g(2)g_{(1)}\geq g_{(2)}\geq\cdots are the ordered scores. Since Ek=gkE_{k}=-g_{k}, this score-space margin equals the energy-space gap E(2)E(1)E_{(2)}-E_{(1)}; we use Δ\Delta for both throughout. Equations (2)–(4) describe one realization; the theory requires only a winner k^(x)\hat{k}(x), a margin Δ(x)\Delta(x), and the testable assumption that misrouting decreases monotonically with Δ(x)\Delta(x).

Signature emission and decoding.

After routing converges to state u(k)u^{(k)}, a fixed measurement map \mathcal{M} produces the signature

ϕ~k=u(k)+δMsig,\tilde{\phi}_{k}=\mathcal{M}\,u^{(k)}+\delta\in\mathbb{R}^{M_{\mathrm{sig}}}, (5)

where δ\delta is zero-mean measurement noise (decomposed in Supp. Mat., Sec. S3D into an input-dependent residual εk(x)\varepsilon_{k}(x) and a stochastic component δk\delta_{k}). Conditioned on selecting route kk, the measured signature has mean ϕ¯k\bar{\phi}_{k} and covariance Σk\Sigma_{k}.

Assumption (route-conditioned repeatability / stereotypy). Conditioned on route kk winning, shot-to-shot variation in ϕ~k\tilde{\phi}_{k} is dominated by device noise, not by differences in the triggering input xx. Strong competition funnels all initial conditions toward a single final state; weak competition lets the winning state retain input memory, making Φ\Phi a poor summary. Mode hopping or near-degeneracy can break stereotypy and must be diagnosed (Supp. Mat., Sec. S3).

Calibration forces each route in turn and collects mean signatures and payloads:

Φ=[ϕ¯1ϕ¯K]Msig×K,Y=[y1yK]D×K.\Phi=[\bar{\phi}_{1}\cdots\bar{\phi}_{K}]\in\mathbb{R}^{M_{\mathrm{sig}}\times K},\qquad Y=[y_{1}\cdots y_{K}]\in\mathbb{R}^{D\times K}. (6)

The dictionary Φ\Phi is empirical, compiled from measured device responses. Runtime readout uses a fixed linear decoder y=Wϕ~k^(x),y=W\tilde{\phi}_{\hat{k}(x)}, with WD×Msig.W\in\mathbb{R}^{D\times M_{\mathrm{sig}}}.

Block-parallel architecture.

BB independent blocks operate in parallel, each running its own physical argmax\operatorname*{arg\,max} over KK routes; the rank condition applies per block. The layer output is the sum of decoded signatures across all BB blocks (Supp. Mat. [28]). Figure 2(a) illustrates the pipeline.

Proposition 1 (Universal payload realizability).

The system WΦ=YW\Phi=Y admits a solution for every YD×KY\in\mathbb{R}^{D\times K} if and only if rank(Φ)=K\operatorname{rank}(\Phi)=K. When this holds, MsigKM_{\mathrm{sig}}\geq K and the minimum-norm exact decoder is W=YΦ+W_{\star}=Y\Phi^{+} (proof and full decoder family in Supp. Mat. [28]). The condition is well known; what matters here is its physical interpretation: the KK routes must produce KK linearly independent measured patterns, and a single SVD of Φ\Phi checks this before any task is specified.

If rank(Φ)=r<K\operatorname{rank}(\Phi)=r<K, exact decoding may still hold for a particular YY whose rows lie in Row(Φ)\operatorname{Row}(\Phi). Rank deficiency rules out universal fetchless lookup, not every structured task.

Selector realizations.

The decoder theory is selector-agnostic, but two Ising constructions show the framework generates concrete hardware designs.

One-hot QUBO (quadratic unconstrained binary optimization) selector. Binary variables zi{0,1}z_{i}\in\{0,1\} define the energy

EWTA(z;x)=λ(i=1Kzi1)2i=1Kgi(x)zi,λ>0,E_{\mathrm{WTA}}(z;x)=\lambda\Big(\sum_{i=1}^{K}z_{i}-1\Big)^{\!2}-\sum_{i=1}^{K}g_{i}(x)\,z_{i},\qquad\lambda>0, (7)

where λ\lambda is a penalty weight enforcing the one-hot constraint. If λ\lambda exceeds the largest score magnitude, every global minimizer is one-hot: on the one-hot manifold, EWTA(ei;x)=gi(x)E_{\mathrm{WTA}}(e_{i};x)=-g_{i}(x), recovering the selector energy of Eq. (3). The standard spin transformation zi=(1+si)/2z_{i}=(1+s_{i})/2 yields an equivalent Ising Hamiltonian with dense antiferromagnetic couplings and local fields proportional to gi(x)g_{i}(x). Each block in the parallel architecture realizes its own such QUBO. The full proof is given in the Supp. Mat. [28].

Binary comparator (K=2K\!=\!2). An antiferromagnetically coupled spin pair with coupling J>0J>0 and local fields ha(x),hb(x)h_{a}(x),h_{b}(x) returns sign(hahb)\operatorname{sign}(h_{a}-h_{b}) [14]; when JJ exceeds the field magnitudes, the energy gap is Δcmp(x)=2|ha(x)hb(x)|\Delta_{\mathrm{cmp}}(x)=2|h_{a}(x)-h_{b}(x)|. Chaining ncn_{c} such comparators produces an ncn_{c}-bit address into a 2nc2^{n_{c}}-entry lookup table. Sweeping the field difference and comparing the empirical misrouting rate against Δcmp/Teff\Delta_{\mathrm{cmp}}/T_{\mathrm{eff}} is the simplest experimental test of fetchless lookup.

Driven-dissipative oscillators offer a third realization (Supp. Mat. [28]).

Robustness: two separable failure channels.

At run time two error channels remain: signature perturbation after the correct route wins, and selection of the wrong route. The decomposition is a diagnostic tool, not a claim of statistical independence (Supp. Mat. [28], Remark 3).

Conditional decoding error. If route kk wins and the single-shot signature is ϕ~k=ϕ¯k+δ\tilde{\phi}_{k}=\bar{\phi}_{k}+\delta (with perturbation δ\delta aggregating shot noise, detector noise, and drift), the minimum-norm decoder gives (using Wϕ¯k=ykW\bar{\phi}_{k}=y_{k})

Wϕ~kyk2=YΦ+δ2Y2σmin(Φ)δ2,\|W\tilde{\phi}_{k}-y_{k}\|_{2}=\|Y\Phi^{+}\delta\|_{2}\leq\frac{\|Y\|_{2}}{\sigma_{\min}(\Phi)}\,\|\delta\|_{2}, (8)

where Y2\|Y\|_{2} is the spectral norm of the payload table. For zero-mean fluctuations with covariance Σk\Sigma_{k},

𝔼Wϕ~kyk22=Tr[YΦ+Σk(Φ+)Y]Y22σmin(Φ)2TrΣk.\mathbb{E}\|W\tilde{\phi}_{k}-y_{k}\|_{2}^{2}=\operatorname{Tr}\!\big[Y\Phi^{+}\Sigma_{k}(\Phi^{+})^{\!\top}Y^{\!\top}\big]\leq\frac{\|Y\|_{2}^{2}}{\sigma_{\min}(\Phi)^{2}}\,\operatorname{Tr}\Sigma_{k}. (9)

When stereotypy is imperfect and the residual input-dependent shift has norm at most εstereo\varepsilon_{\mathrm{stereo}}, the additional decoding error is bounded by Y2εstereo/σmin(Φ)\|Y\|_{2}\,\varepsilon_{\mathrm{stereo}}/\sigma_{\min}(\Phi) (Supp. Mat. [28]), so stereotypy need only hold to within the device noise floor.

Dictionary drift. If the dictionary drifts from Φ0\Phi_{0} at calibration to Φ0+δΦ(t)\Phi_{0}+\delta\!\Phi(t) while the decoder remains W0=YΦ0+W_{0}=Y\Phi_{0}^{+},

W0[Φ0+δΦ(t)]Y2Y2σmin(Φ0)δΦ(t)2.\|W_{0}[\Phi_{0}+\delta\!\Phi(t)]-Y\|_{2}\leq\frac{\|Y\|_{2}}{\sigma_{\min}(\Phi_{0})}\,\|\delta\!\Phi(t)\|_{2}. (10)

Recalibration is needed once the drift exceeds tolerance εtolσmin(Φ0)\varepsilon_{\mathrm{tol}}\,\sigma_{\min}(\Phi_{0}). Each recalibration costs O(KR)O(KR) forced-route measurements (RR trials per route), so the duty-cycle overhead is this cost divided by the drift interval τdrift\tau_{\mathrm{drift}} over which δΦ(t)2\|\delta\!\Phi(t)\|_{2} stays within tolerance. Since τdrift\tau_{\mathrm{drift}} is device-set and presently uncharacterised, the overhead cannot be quantified without a drift measurement on a specific platform; the experimental protocol below measures it directly.

Routing error. If the intended route is kk^{\star} but the selector returns k\ell\neq k^{\star}, no decoder can repair the mistake. We use the Gibbs form as a phenomenological fit; the theory needs only that larger margin means lower misrouting:

P(k)=eEk/TeffjeEj/Teff,P(k)=\frac{e^{-E_{k}/T_{\mathrm{eff}}}}{\sum_{j}e^{-E_{j}/T_{\mathrm{eff}}}}, (11)

where Ek(x)=gk(x)E_{k}(x)=-g_{k}(x) is the selector energy defined in Eq. (3) and TeffT_{\mathrm{eff}} is an effective temperature fitted from repeated trials [18, 9, 30], the misrouting probability satisfies

Pmis(K1)eΔ/Teff1+(K1)eΔ/Teff,P_{\mathrm{mis}}\leq\frac{(K-1)\,e^{-\Delta/T_{\mathrm{eff}}}}{1+(K-1)\,e^{-\Delta/T_{\mathrm{eff}}}}, (12)

where Δ=minkk(EkEk)\Delta=\min_{k\neq k^{\star}}(E_{k}-E_{k^{\star}}) is the minimum energy gap to the next competing route (equal to the score-space margin of Eq. (4), since Ek=gkE_{k}=-g_{k}). The bound follows from bounding each competitor’s Boltzmann weight by eΔ/Teffe^{-\Delta/T_{\mathrm{eff}}}; for ΔTeff\Delta\gg T_{\mathrm{eff}}, log-odds of correct routing scale linearly in Δ/Teff\Delta/T_{\mathrm{eff}}. Near-degeneracy, where route statistics in Ising machines and SPIMs are known to depart from Boltzmann behaviour, is where the Gibbs form is least reliable; there the structural results (the two-channel decomposition and the certification) still hold, as they assume only monotonicity, while the specific bound (12) should be replaced by the empirically measured misrouting-versus-margin curve. With payload diameter dY=maxk,yky2d_{Y}=\max_{k,\ell}\|y_{k}-y_{\ell}\|_{2}, the triangle inequality gives a routing contribution at most Pmis(dY+Y2δ¯/σmin(Φ))P_{\mathrm{mis}}\,(d_{Y}+\|Y\|_{2}\,\bar{\delta}/\sigma_{\min}(\Phi)), where δ¯=maxkTrΣk\bar{\delta}=\max_{k}\sqrt{\operatorname{Tr}\Sigma_{k}} is the worst-case per-route noise level. A tighter second-moment decomposition is given in the Supp. Mat. (Remark 3); it requires the additional modeling assumption that route-conditioned emission noise has zero mean and covariance Σ\Sigma_{\ell} on each route \ell, independently of which route was intended (see Supp. Mat. for the precise statement). Figure 2(b–f) illustrates the decomposition on a controlled speckle-signature model (Monte Carlo; see Supp. Mat., Fig. S1). Because the simulation draws routes from the same Gibbs model used to derive Eq. (12), the agreement confirms the algebra, not the physical adequacy of the Gibbs assumption; that requires a goodness-of-fit test on real route frequencies, not yet available.

Calibration and training.

Fetchless lookup operates on two time scales. A slow calibration stage forces each route, measures signature statistics, forms Φ\Phi, and compiles W=YΦ+W=Y\Phi^{+}. Between recalibrations, Φ\Phi is fixed and online learning updates only YY (one column per sample, rank-1 update in WW); backpropagation through the argmax\operatorname*{arg\,max} uses a top-two surrogate gradient (Supp. Mat. [28]).

Refer to caption
Figure 2: Photonic implementation and numerical validation of attractor-keyed memory. (a) Schematic photonic pipeline. A spatial light modulator (SLM) encodes the input xx; a Fourier mask implements BwideB_{\mathrm{wide}}. A routing map RrouteR_{\mathrm{route}} compresses scores into KK competing routes. A spatial photonic Ising machine (SPIM) [26] selects the winner k^b\hat{k}_{b} with margin Δ\Delta. The selected mode’s field propagates through a scattering medium; a detector array records the speckle signature ϕ~k^Msig\tilde{\phi}_{\hat{k}}\in\mathbb{R}^{M_{\mathrm{sig}}}. A pre-compiled decoder W(b)W^{(b)} maps each signature to output; summing over BB blocks yields yy. (b)–(f) Error decomposition on a controlled speckle-signature model (Monte Carlo, not experiment; K=64K\!=\!64, Msig=128M_{\mathrm{sig}}\!=\!128, D=16D\!=\!16; RR: calibration trials per route; σread\sigma_{\mathrm{read}}: readout noise s.d.). (b) Relative reconstruction error WΦ^ΦYF/YF\|W_{\hat{\Phi}}\,\Phi-Y\|_{F}/\|Y\|_{F} versus calibration trials RR; four dictionary types (amplitude |z||z|, intensity |z|2|z|^{2}, heterodyne [z;z][\Re\,z;\Im\,z], orthogonal); σmin(Φ)\sigma_{\min}(\Phi) controls convergence rate. (c)–(f) All four panels use the amplitude-speckle dictionary. Solid lines: analytic RMS prediction 𝔼e22\sqrt{\mathbb{E}\|e\|_{2}^{2}} from the two-channel second-moment decomposition (Supp. Mat., Remark 3, Eq. (S17)); filled circles: Monte Carlo RMS (10,00010{,}000 trials); faint circles: Monte Carlo mean 𝔼[e2]\mathbb{E}[\|e\|_{2}]; dashed: per-trial worst-case bounds [Eq. (8) for decoding; dY+Y2δ¯/σmin(Φ)d_{Y}+\|Y\|_{2}\,\bar{\delta}/\sigma_{\min}(\Phi) for misrouting] weighted by PmisP_{\mathrm{mis}} from Eq. (12). (c) Routing-dominated regime (varying TeffT_{\mathrm{eff}} at fixed σread=0.03\sigma_{\mathrm{read}}\!=\!0.03). (d) Decoding-dominated regime (varying σread\sigma_{\mathrm{read}} at fixed Teff=0.05T_{\mathrm{eff}}\!=\!0.05). The two error channels are separable and each matches its predicted scaling. (e) Decoding error under stereotypy violation εstereo\varepsilon_{\mathrm{stereo}} (correct routing enforced). (f) Full pipeline at a realistic operating point; the crossover at Teff0.71T_{\mathrm{eff}}\!\approx\!0.71 separates the decoding-limited from the routing-limited regime. (g), (h) Dictionary conditioning σmin(Φ)\sigma_{\min}(\Phi) versus table size KK at fixed aspect ratio Mreal/KM_{\mathrm{real}}/K. Three measurement modalities, 30 random seeds per point; shaded bands: ±1\pm 1 s.d. (g) Mreal/K=2M_{\mathrm{real}}/K\!=\!2. (h) Mreal/K=4M_{\mathrm{real}}/K\!=\!4. From K=64K\!=\!64 onward, σmin\sigma_{\min} tracks the Bai–Yin prediction (dashed). At K=1024K\!=\!1024 the heterodyne value lies within 1%1\% of the asymptote at both aspect ratios. See Supp. Mat. [28] for full panel specifications.

Experimental protocol and falsifiability.

Four steps translate the theory into a falsifiable experiment: (1) Force each route in turn and record RR repeated signature measurements. This determines the sample-mean dictionary Φ^\hat{\Phi} (the finite-sample estimate of Φ\Phi), the covariances Σk\Sigma_{k}, the empirical rank, σmin(Φ^)\sigma_{\min}(\hat{\Phi}), and the stereotypy diagnostic εstereo/TrΣk\varepsilon_{\mathrm{stereo}}/\!\sqrt{\operatorname{Tr}\Sigma_{k}} (Supp. Mat., Sec. S3). Full rank is confirmed only if a bootstrap confidence interval for σmin(Φ^)\sigma_{\min}(\hat{\Phi}) excludes zero; the interval’s lower endpoint provides a conservative bound on dictionary conditioning. Under the hypothesis that centered signatures are sub-Gaussian with σsg,k2Σk2\sigma_{\mathrm{sg},k}^{2}\leq\|\Sigma_{k}\|_{2}, the number of trials needed to resolve σmin(Φ)\sigma_{\min}(\Phi) scales as R=O(KΣ2(Msig+logK)/σmin(Φ)2)R=O(K\,\|\Sigma\|_{2}\,(M_{\mathrm{sig}}+\log K)/\sigma_{\min}(\Phi)^{2}) (Supp. Mat., Proposition 2). If rank(Φ)<K\operatorname{rank}(\Phi)<K, no linear decoder can realize universal fetchless lookup. (2) Compile W=YΦ+W=Y\Phi^{+} for test payloads and verify WΦYW\Phi\approx Y on forced-route means; compare single-shot error scaling with σmin(Φ)\sigma_{\min}(\Phi) via Eqs. (8)–(9). (3) Release the selector, record winner frequencies versus the measured top-two margin, and fit TeffT_{\mathrm{eff}}. Assess the Gibbs fit by a goodness-of-fit test (e.g., χ2\chi^{2} on binned route frequencies); test Eq. (12). A necessary consistency check: if forced-route and free-running signatures differ significantly, the calibration model requires correction. (4) Monitor drift in mean signatures over time. Recalibrate when δΦ(t)2\|\delta\!\Phi(t)\|_{2} exceeds εtolσmin(Φ0)\varepsilon_{\mathrm{tol}}\,\sigma_{\min}(\Phi_{0}), the tolerance derived from Eq. (10).

No hardware demonstration exists to date; the protocol defines the criteria a first experiment must satisfy. The required ingredients exist separately in current platforms [3, 13, 19, 4, 22, 23, 18, 16, 30]; integrating them into a single device remains open. A forced-routing dictionary measurement is the natural first experiment, followed by a K=2K=2 routing test of the predicted Δcmp/Teff\Delta_{\mathrm{cmp}}/T_{\mathrm{eff}} dependence.

Scope and outlook.

How far the scheme scales depends on the oversampling ratio Msig/KM_{\mathrm{sig}}/K. The Bai–Yin law keeps σmin(Φ)\sigma_{\min}(\Phi) bounded away from zero when Msig/KM_{\mathrm{sig}}/K exceeds unity by a finite factor, yielding low error amplification at Msig/K=2M_{\mathrm{sig}}/K=2 [Eqs. (8)–(9)]. Real device signatures may exhibit spatial correlations that reduce the effective degrees of freedom below MsigM_{\mathrm{sig}}, worsening conditioning; the SVD diagnostic detects this. Among the three readout modalities tested [Fig. 2(b) and (g, h)], heterodyne achieves the best conditioning, improving σmin\sigma_{\min} by 3.4×3.4\times over amplitude-only measurement at matched complex-mode count, and remains within 1%1\% of the Bai–Yin asymptote up to K=1024K\!=\!1024 (Supp. Mat., Fig. S2).

AKM replaces an O(D)O(D) memory read with an MsigM_{\mathrm{sig}}-channel measurement and linear decode, favourable when the decode is absorbed into the measurement optics or when data-movement cost dominates compute [12, 31]; at Msig/K=2M_{\mathrm{sig}}/K=2 with digital decode, break-even requires native optical decode or DRAM-resident payloads (Supp. Mat., Sec. S13).

Any competitive physical selector that produces high-dimensional signatures and settles to stereotyped attractor states is a candidate platform: coherent Ising machines, polariton condensate networks, laser arrays, and spatial photonic Ising machines (SPIMs). Among these, SPIMs are closest to the requirements: focal-plane division now enables fully programmable Ising selection [33], and full-aperture wavefront correction removes the aberration bottleneck that previously limited effective MsigM_{\mathrm{sig}} [17]. What remains untested is the quantity AKM requires: within-attractor variance of the full speckle signature, conditioned on the same winning route.

Indirect evidence is encouraging: speckle physical unclonable functions (PUFs), photonic reservoirs, and polariton condensates achieve 94%\gtrsim\!94\% reproducibility of attractor-level observables [8, 24, 21, 3, 20, 6, 9], but the continuous high-dimensional state within a given attractor is almost never quantified [28]. A first experiment need only record continuous-valued output conditioned on the same attractor, yielding the four quantities Φ\Phi, σmin(Φ)\sigma_{\min}(\Phi), Σk\Sigma_{k}, and εstereo\varepsilon_{\mathrm{stereo}} that determine whether a platform supports fetchless lookup at a target scale.

Acknowledgements.
The author acknowledges support from HORIZON EIC-2022-PATHFINDERCHALLENGES-01 HEISINGBERG Project 101114978, from Weizmann–UK Make Connection Grant 142568, and from the EPSRC UK Multidisciplinary Centre for Neuromorphic Computing (grant UKRI982).

Patent and Implementation Notice. Certain systems, methods, hardware configurations, acceleration techniques, implementation architectures, and commercial applications related to the work described in this manuscript are the subject of pending or patent applications in progress. This manuscript is intended to describe the scientific concepts and experimental framework at a research level and does not disclose all proprietary engineering, hardware, system-integration, optimization, or commercial implementation details.

References

  • [1] N. G. Berloff et al. (2017) Realizing the classical XY hamiltonian in polariton condensates. Nature Materials 16, pp. 1120–1126. External Links: Document Cited by: Attractor-Keyed Memory, Attractor-Keyed Memory.
  • [2] N. G. Berloff (2026) Polychronous wave computing: timing-native address selection in spiking networks. External Links: 2601.13079, Document, Link Cited by: Attractor-Keyed Memory.
  • [3] D. Brunner, M. C. Soriano, C. R. Mirasso, and I. Fischer (2013) Parallel photonic information processing at gigabyte per second data rates using transient states. Nature Communications 4, pp. 1364. External Links: Document Cited by: Scope and outlook., Experimental protocol and falsifiability., Attractor-Keyed Memory.
  • [4] W. R. Clements, P. C. Humphreys, B. J. Metcalf, W. S. Kolthammer, and I. A. Walmsley (2016) Optimal design for universal multiport interferometers. Optica 3 (12), pp. 1460–1465. External Links: Document Cited by: Score generation and routing., Experimental protocol and falsifiability..
  • [5] J. Dambre, D. Verstraeten, B. Schrauwen, and S. Massar (2012) Information processing capacity of dynamical systems. Scientific Reports 2, pp. 514. External Links: Document Cited by: Attractor-Keyed Memory.
  • [6] Y. del Valle-Inclan Redondo, H. Ohadi, Y. G. Rubo, O. Beer, A. J. Ramsay, S. I. Tsintzos, Z. Hatzopoulos, P. G. Savvidis, and J. J. Baumberg (2018) Stochastic spin flips in polariton condensates: nonlinear tuning from GHz to sub-Hz. New Journal of Physics 20, pp. 075008. External Links: Document Cited by: Scope and outlook..
  • [7] W. Fedus, B. Zoph, and N. Shazeer (2021) Switch transformers: scaling to trillion parameter models with simple and efficient sparsity. Note: arXiv External Links: 2101.03961, Link Cited by: Attractor-Keyed Memory.
  • [8] Y. Gao, S. F. Al-Sarawi, and D. Abbott (2020) Physical unclonable functions. Nature Electronics 3 (2), pp. 81–91. External Links: Document Cited by: Scope and outlook..
  • [9] R. Hamerly, L. Bernstein, A. Sludds, M. Soljačić, and D. Englund (2019) Experimental investigation of performance differences between coherent ising machines and a quantum annealer. Physical Review X 9, pp. 021032. External Links: Document Cited by: Scope and outlook., Robustness: two separable failure channels..
  • [10] J. L. Hennessy and D. A. Patterson (2019) A new golden age for computer architecture. Communications of the ACM 62 (2), pp. 48–60. Cited by: Attractor-Keyed Memory.
  • [11] T. Honjo et al. (2021) 100,000-spin coherent ising machine. Science Advances 7 (40), pp. eabh0952. External Links: Document Cited by: Attractor-Keyed Memory.
  • [12] M. Horowitz (2014) 1.1 computing’s energy problem (and what we can do about it). In 2014 IEEE international solid-state circuits conference digest of technical papers (ISSCC), pp. 10–14. Cited by: Scope and outlook., Attractor-Keyed Memory.
  • [13] T. W. Hughes, I. A. D. Williamson, M. Minkov, and S. Fan (2019) Wave physics as an analog recurrent neural network. Science Advances 5 (12), pp. eaay6946. External Links: Document Cited by: Experimental protocol and falsifiability..
  • [14] E. Izhikevich (2025) Spiking manifesto. External Links: 2512.11843, Document, Link Cited by: Selector realizations., Attractor-Keyed Memory.
  • [15] H. Jaeger and H. Haas (2004) Harnessing nonlinearity: predicting chaotic systems and saving energy in wireless communication. Science 304, pp. 78–80. Cited by: Attractor-Keyed Memory.
  • [16] K. P. Kalinin and N. G. Berloff (2018) Simulating Ising and nn-state planar Potts models and external fields with nonequilibrium condensates. Physical Review Letters 121, pp. 235302. External Links: Document Cited by: Experimental protocol and falsifiability..
  • [17] D. Karanikolopoulos, P. Karavelas, L. Mouchliadis, A. Spiliotis, N. Pitanios, S. Gentilini, D. Veraldi, P. Charlesworth, D. Pierangeli, J. Sakellariou, et al. (2026) High-fidelity spatial photonic ising machines via precise wavefront shaping. arXiv preprint arXiv:2602.13714. Cited by: Scope and outlook..
  • [18] P. L. McMahon, A. Marandi, Y. Haribara, R. Hamerly, C. Langrock, S. Tamate, T. Inagaki, H. Takesue, S. Utsunomiya, K. Aihara, R. L. Byer, M. M. Fejer, H. Mabuchi, and Y. Yamamoto (2016) A fully programmable 100-spin coherent ising machine with all-to-all connections. Science 354 (6312), pp. 614–617. External Links: Document Cited by: Robustness: two separable failure channels., Experimental protocol and falsifiability., Attractor-Keyed Memory.
  • [19] D. A. B. Miller (2013) Self-configuring universal linear optical component [invited]. Photonics Research 1 (1), pp. 1–15. External Links: Document Cited by: Score generation and routing., Experimental protocol and falsifiability..
  • [20] H. Ohadi, A. Dreismann, Y. G. Rubo, F. Pinsker, Y. del Valle-Inclan Redondo, S. I. Tsintzos, Z. Hatzopoulos, P. G. Savvidis, and J. J. Baumberg (2015) Spontaneous spin bifurcations and ferromagnetic phase transitions in a spinor exciton-polariton condensate. Physical Review X 5, pp. 031002. External Links: Document Cited by: Scope and outlook..
  • [21] N. Oliver, T. Jüngling, and I. Fischer (2015) Consistency properties of a chaotic semiconductor laser driven by optical feedback. Physical Review Letters 114, pp. 123902. External Links: Document Cited by: Scope and outlook..
  • [22] T. Onodera, M. M. Stein, B. A. Ash, M. M. Sohoni, M. Bosch, R. Yanagimoto, M. Jankowski, T. P. McKenna, T. Wang, G. Shvets, M. R. Shcherbakov, L. G. Wright, and P. L. McMahon (2025) Arbitrary control over multimode wave propagation for machine learning. Nature Physics. External Links: Document Cited by: Score generation and routing., Experimental protocol and falsifiability..
  • [23] S. Pai, Z. Sun, T. W. Hughes, T. Park, B. Bartlett, I. A. D. Williamson, M. Minkov, M. Milanizadeh, N. Abebe, F. Morichetti, A. Melloni, S. Fan, O. Solgaard, and D. A. B. Miller (2023) Experimentally realized in situ backpropagation for deep learning in photonic neural networks. Science 380 (6643), pp. 398–404. External Links: Document Cited by: Experimental protocol and falsifiability..
  • [24] R. Pappu, B. Recht, J. Taylor, and N. Gershenfeld (2002) Physical one-way functions. Science 297, pp. 2026–2030. External Links: Document Cited by: Scope and outlook..
  • [25] R. Penrose (1955) A generalized inverse for matrices. Mathematical Proceedings of the Cambridge Philosophical Society 51 (3), pp. 406–413. External Links: Document Cited by: Attractor-Keyed Memory, Attractor-Keyed Memory.
  • [26] D. Pierangeli, G. Marcucci, and C. Conti (2019) Large-scale photonic ising machine by spatial light modulation. Physical Review Letters 122 (21), pp. 213902. External Links: Document Cited by: Figure 2.
  • [27] H. Ramsauer, B. Schäfl, J. Lehner, P. Seidl, M. Widrich, T. Adler, L. Gruber, M. Holzleitner, M. Pavlović, G. K. Sandve, et al. (2020) Hopfield networks is all you need. arXiv preprint arXiv:2008.02217. Cited by: Attractor-Keyed Memory.
  • [28] See supplemental material. Note: at [https://www.damtp.cam.ac.uk/user/ngb23/publications/SI_AKM.pdf] for soft-spin dynamics, binary comparator details, ridge regularization, training derivations, experiment specification, σmin\sigma_{\min} scaling analysis, and reproducible code Cited by: Figure 2, Scope and outlook., Block-parallel architecture., Proposition 1 (Universal payload realizability)., Selector realizations., Selector realizations., Robustness: two separable failure channels., Robustness: two separable failure channels., Calibration and training..
  • [29] Y. Shen, N. C. Harris, S. Skirlo, M. M. Prabhu, T. Baehr-Jones, M. Hochberg, X. Sun, S. Zhao, H. Larochelle, D. Englund, and M. Soljačić (2017) Deep learning with coherent nanophotonic circuits. Nature Photonics 11, pp. 441–446. External Links: Document Cited by: Attractor-Keyed Memory.
  • [30] N. Stroev and N. G. Berloff (2023) Analog photonics computing for information processing, inference, and optimization. Advanced Quantum Technologies 6 (9), pp. 2300055. External Links: Document Cited by: Robustness: two separable failure channels., Experimental protocol and falsifiability., Attractor-Keyed Memory, Attractor-Keyed Memory.
  • [31] V. Sze, Y. Chen, T. Yang, and J. S. Emer (2017) Efficient processing of deep neural networks: a tutorial and survey. Proceedings of the IEEE 105 (12), pp. 2295–2329. External Links: Document Cited by: Scope and outlook., Attractor-Keyed Memory.
  • [32] A. N. Tait, T. F. De Lima, E. Zhou, A. X. Wu, M. A. Nahmias, B. J. Shastri, and P. R. Prucnal (2017) Neuromorphic photonic networks using silicon photonic weight banks. Scientific Reports 7, pp. 7430. External Links: Document Cited by: Attractor-Keyed Memory.
  • [33] D. Veraldi, D. Pierangeli, S. Gentilini, M. Calvanese Strinati, J. Sakellariou, J. S. Cummins, A. Kamaletdinov, M. Syed, R. Z. Wang, N. G. Berloff, D. Karanikolopoulos, P. G. Savvidis, and C. Conti (2025) Fully programmable spatial photonic Ising machine by focal plane division. Physical Review Letters 134 (6), pp. 063802. External Links: Document Cited by: Scope and outlook..
  • [34] S. Williams, A. Waterman, and D. A. Patterson (2009) Roofline: an insightful visual performance model for floating-point programs and multicore architectures. Communications of the ACM 52 (4), pp. 65–76. External Links: Document Cited by: Attractor-Keyed Memory.
  • [35] W. A. Wulf and S. A. McKee (1995) Hitting the memory wall: implications of the obvious. ACM SIGARCH computer architecture news 23 (1), pp. 20–24. Cited by: Attractor-Keyed Memory.