Ye \surLiang
Delay-Penalty Comparison for Sequential Testing and Quickest Detection in State-Dependent Diffusion Models
Abstract
We study sequential testing and Bayesian quickest detection for diffusion observations whose drift changes between two alternatives while the signal-to-noise ratio may depend on the current observation. In this setting the posterior probability is generally not a closed one-dimensional Markov statistic: the natural sufficient state is the augmented process consisting of the posterior (or likelihood ratio) and the observed diffusion. We formulate both testing and quickest detection within this common filtering framework and identify the corresponding degenerate free-boundary problems. The main contribution is a delay-penalty comparison principle. For a common terminal false-alarm or terminal decision cost, a pointwise larger running delay penalty increases the value of continuation, shrinks the continuation region, and yields earlier stopping. When the stopping set has a one-sided posterior representation, this gives an order relation for the optimal alarm boundaries. The result applies to linear delay costs and to nonlinear marginal delay penalties after the appropriate Markovian augmentation, and is illustrated by a constant signal-to-noise Shiryaev example in which the alarm threshold is computed numerically and shown to be monotone in the delay cost. The framework clarifies how state-dependent information and nonlinear delay costs jointly affect the geometry of sequential testing and quickest-detection rules.
keywords:
sequential testing; quickest detection; disorder problem; diffusion process; delay penalty; comparison principle; optimal stopping; free-boundary problem; variational inequality.pacs:
[MSC Classification]62L10; 62L15; 60G40; 60J60; 93E11
1 Introduction
Sequential methods determine endogenously when accumulated data justify a terminal action, balancing the cost of further observation against the quality of the eventual decision [wald2004sequential, wald1948optimum, rincon2025sequential, wang2025analysis, chow1971great, griffith2021statistics, silva2020optimal, silva2015continuous, wang2025analysis1, fischer2026improving]. Two problems organize much of the field. In sequential testing an observer watches a process whose law is governed by one of two simple hypotheses and must choose, as a function of the data, both a stopping time and a terminal decision so as to trade sampling cost against the probabilities of a wrong decision [shiryaev2025optimal, pabbaraju2026simple, liu2025bidirectional]. In quickest detection, or the disorder problem, the law of the observation changes at an unobservable time, and the observer raises an alarm to trade the frequency of false alarms against the delay incurred in detecting a true change [shiryaev1963optimum, naha2023quickest, snow2024quickest, sha2025quickest, liang2025global]. Minimax counterparts of the detection problem replace the prior on the change time by a worst-case criterion and lead to the CUSUM and Shiryaev–Roberts procedures [page1954continuous, banerjee2024minimax, xie2022minimax, wang2026algebraic, fromont2023minimax, yang2024sequential, huselitz2026online, lorden1971procedures, tosun2023robust, yu2024network, moustakides1986optimal, pollak1985optimal, wang2026damage, polunchenko2018comparative, pollak2009optimality]. The present paper is Bayesian and optimal-stopping based.
Both problems arise across applied domains. In quantitative finance a regime shift in a return or volatility process must be detected promptly while controlling false alarms; in statistical process control a manufacturing stream must be monitored for a shift in mean; in structural health monitoring, surveillance, and intrusion detection a sensor stream must be screened for the onset of an anomaly; and in epidemiological monitoring an incidence series must be watched for the start of an outbreak. In each case the natural observation model is a continuously sampled diffusion, the cost of a missed or delayed detection is problem specific and frequently nonlinear, and the practitioner needs to understand how the rule—and in particular the alarm threshold—moves when the penalty structure changes. That comparative question, rather than the explicit solution of any single model, is the focus of this paper.
By filtering, both problems reduce to fully observed optimal stopping for a Markovian sufficient statistic, after which the value function solves a variational inequality and the optimal rule is a first-exit time from a continuation region [liptser1977statistics, peskir2006optimal, yu2026pattern, peskir2000sequential, epstein2022optimal, ekstrom2022multi, ankirchner2020bayesian, wang2026breakdown, shiryaev2025optimal]. For Brownian observations with constant drift alternatives the sufficient statistic is a one-dimensional posterior diffusion and the rules are explicit thresholds. The situation changes when the diffusion coefficients are state dependent: the signal-to-noise ratio then depends on the current observation, the posterior equation no longer closes in the posterior coordinate alone, and the sufficient statistic becomes multidimensional. This is the regime we study, and it is the regime in which the comparative-statics question is least understood.
Contribution.
This paper makes two related contributions. First, we give a unified formulation of sequential testing and Bayesian quickest detection for state-dependent diffusion observations. The formulation makes explicit that, unless the signal-to-noise ratio is constant or otherwise reducible, the posterior probability is not a closed Markov state; the closed sufficient statistic is the augmented process , or equivalently , with an explicit degenerate generator. Second, and more importantly, we prove a delay-penalty comparison theorem for the resulting optimal stopping problems. For a fixed terminal cost, increasing the running delay penalty increases the value function, shrinks the continuation region, and induces earlier stopping. Whenever the stopping region is known to be one-sided in the posterior coordinate, this gives a monotone ordering of the alarm boundaries. This comparative-static result applies directly to linear delay costs and extends to nonlinear marginal delay penalties after adding the appropriate penalty-memory state. A worked Shiryaev example computes the alarm threshold numerically by a finite-difference solution of the variational inequality and exhibits the predicted monotonicity.
Organization.
Section 3 develops the filtering reductions for both problems and proves the closed Markov-state result. Section 4 states the generic optimal stopping problem and the variational inequality in complementarity form. Section 5 proves the delay-penalty comparison theorem and its linear and nonlinear specializations and gives the sampling-cost analogue for testing. Section 6 gives the free-boundary interpretation and the martingale verification. Section 7 is the worked Shiryaev example with the numerical method described in full. Section 8 relates the framework to CUSUM and Shiryaev–Roberts procedures, and Section 9 concludes.
2 Literature Review
Relation to existing literature.
The literature relevant to the present work lies at the intersection of sequential analysis, quickest detection, filtering, and optimal stopping.
On the sequential-analysis side, the foundations were laid by Wald and coauthors through the theory of sequential testing and optimal stopping of experiments [wald2004sequential, wald1948optimum, yu2026rigorous, chow1971great, griffith2021statistics]. In parallel, Shiryaev initiated the Bayesian disorder problem, in which an unobservable change point must be detected as rapidly as possible while controlling false alarms [shiryaev1963optimum]. These developments evolved into the modern theory of quickest detection, encompassing both Bayesian and minimax formulations, with comprehensive treatments given by Shiryaev’s monographs and retrospective survey and by the unified treatment of Poor and Hadjiliadis and others [shiryaev2025optimal, poor2009quickest, shiryaev2010quickest, yu2026beyond, shiryaev2009stochastic, pollak1985diffusion, pollak1985optimal, moustakides1986optimal, shiryaev2019stochastic, cai2026optimal]. The Bayesian formulation is naturally expressed as a Markov optimal stopping problem after filtering, while minimax formulations lead to procedures such as CUSUM and Shiryaev–Roberts [page1954continuous, wang2025multi, lorden1971procedures, moustakides1986optimal, pollak1985optimal, pollak2009optimality, polunchenko2018comparative, gao2022rolling]. The field now provides a unified framework for applications ranging from engineering and finance to surveillance and epidemiology.
A second strand of literature concerns diffusion observations and state-dependent signal structures. For Brownian models with constant signal-to-noise ratio, filtering reduces the problem to a one-dimensional posterior diffusion and optimal rules are characterized by scalar thresholds. The situation becomes substantially more difficult when the signal-to-noise ratio depends on the current state of the observed diffusion. In this setting, the posterior probability alone generally fails to form a closed Markov state. A line of research initiated by Gapeev and Shiryaev [gapeev2011sequential, gapeev2013bayesian] developed sequential testing and quickest detection formulations for diffusion processes with state-dependent coefficients. Subsequent analyses by Johnson and Peskir [johnson2017quickest, yu2026from, johnson2018sequential] revealed the rich boundary structure that may arise even in special cases such as Bessel-process observations. Most recently, Ernst and Peskir [ernst2024gapeev] resolved the Gapeev–Shiryaev conjecture by proving that monotonicity of the signal-to-noise ratio implies monotonicity of the associated optimal stopping boundaries.
A related body of work studies multidimensional and multi-source detection problems. When observations are collected from multiple sensors or coupled systems, the filtering state becomes vector valued and the stopping region is typically a hypersurface rather than a scalar threshold. Such models arise in distributed surveillance, sensor networks, and structural monitoring, and provide natural examples where finite-dimensional sufficient statistics remain available but the geometry of the stopping rule becomes substantially more complex [ludkovski2012bayesian, zhang2014quickest, kurt2018multisensor, konev2017quickest, didi2024active, dayanik2016sequential].
Methodologically, the present paper belongs to the optimal-stopping and free-boundary literature [peskir2006optimal]. After filtering, both sequential testing and Bayesian quickest detection reduce to variational inequalities for Markov sufficient statistics, with optimal rules represented as first-entry times into stopping regions [peskir2000sequential, ekstrom2022multi, epstein2022optimal, ankirchner2020bayesian]. Much of the existing literature focuses on deriving explicit solutions, characterizing free boundaries, or establishing structural properties of stopping regions for a fixed penalty specification. In diffusion-based optimal stopping problem, the principal questions have traditionally been existence, regularity, smooth fit, and stochastic control of optimal boundaries [peskir2019continuity, arkin2009variational, oshima2006optimal, oksendal2019applied].
The present paper addresses a different question. We do not seek a new explicit solution of a particular diffusion stopping problem. Instead, we study how the optimal rule changes when the delay-penalty structure changes. Our object is therefore the comparative-statics map
Once a filtering reduction and optimal-stopping formulation are available, we show that larger marginal delay penalties increase the value function, shrink continuation regions, induce earlier stopping, and, whenever a one-sided boundary representation is known, produce a monotone ordering of alarm boundaries. Thus the contribution of the paper is not a new boundary formula but a structural comparison principle that applies across a broad class of diffusion-based sequential testing and quickest-detection models.
3 Filtering Reductions for Diffusion Sequential Problems
Throughout, carries a standard Brownian motion , the observation is a one-dimensional diffusion on a state space , and is the augmented right-continuous observation filtration. We write for the conditional laws under the two hypotheses and for the corresponding expectations.
3.1 Sequential testing
Under hypothesis , , the observation solves
| (1) |
with and . The hidden hypothesis has prior . A testing rule is a pair consisting of an -stopping time and an -measurable terminal decision . With unit sampling cost per unit time and error costs for the two types of error, the Bayes risk is
| (2) |
Let be the posterior probability of . For a fixed stopping time, the terminal-error part of (2) is minimized by deciding when its posterior cost is smaller, that is, with , and the resulting conditional terminal cost is
| (3) |
where is concave and piecewise linear with apex at . Substituting collapses (2) to the optimal stopping problem
| (4) |
Under on , Girsanov’s theorem gives the likelihood ratio
| (5) |
Introducing the signal-to-noise ratio
| (6) |
one has under , where is the -driving Brownian motion, so is a -martingale. The posterior odds are . The innovation process
| (7) |
is a standard -Brownian motion [liptser1977statistics], and the Kushner–Stratonovich equation for the two-valued hidden variable gives the posterior diffusion together with the observation in innovation form,
| (8) |
The odds process is the smooth image of under the bijection of onto ; we therefore use and interchangeably as state descriptors and do not record a separate stochastic differential for , which carries an Itô correction relative to (8).
3.2 Bayesian quickest detection
Now the drift switches at an unobservable change time :
| (9) |
with the standard prior placing an atom at the origin and an exponential tail,
| (10) |
For an alarm time the linear-delay Bayes risk weighs the probability of a false alarm against the expected detection delay,
| (11) |
Let . The false-alarm probability is , and, by Fubini and the optional projection,
| (12) |
Hence (11) becomes the optimal stopping problem
| (13) |
with running cost and terminal cost . The analytically convenient Shiryaev (weighted likelihood-ratio) statistic is the posterior odds, which admits the explicit representation
| (14) |
where is the post-change-to-pre-change likelihood ratio over formed from (5). The filtering equation for the posterior, with the compensator of induced by (10), is
| (15) |
with as in (8). As above, is the equivalent state under the bijection .
3.3 Closed Markov state and generator
The reductions (4) and (13) are optimal stopping problems driven by the posterior. Whether the posterior is by itself a closed Markov state depends on the signal-to-noise ratio.
Assumption 1.
Theorem 2 (Closed Markov state for diffusion observations).
Under Assumption 1, the pair is a time-homogeneous -Markov sufficient statistic for the sequential decision problem, in both the testing and quickest-detection settings. Its generator on is
| (16) | ||||
in the testing case, and in the quickest-detection case. If is constant, the posterior coordinate has closed one-dimensional Markov dynamics and the problem projects onto alone, with generator
| (17) |
(omitting the -drift in the testing case). If is state dependent, the posterior SDE is not closed in alone. Thus provides the natural closed Markov realization of the filtering state. We do not attempt to characterize exceptional projection cases in which the posterior marginal may nevertheless be Markov.
Proof.
Assumption 1 is Novikov’s criterion, so (5) is a true martingale and Girsanov’s theorem yields the odds representations of Sections 3.1–3.2. The innovation theorem [liptser1977statistics] gives the -Brownian motion of (7), and the Kushner–Stratonovich equation for the two-valued hidden variable produces (8) and, with the compensator of under the prior (10), (15). Writing in the innovation gives the joint dynamics, with quadratic covariation
| (18) |
Applying Itô’s formula to and collecting drift terms yields (16) and, with the extra posterior drift, ; the cross term in (16) is exactly (18). All coefficients are time-independent functions of the current value , so is a time-homogeneous Markov process; sufficiency for the decision problem is inherited from the posterior being a sufficient statistic for . If is constant and depends on only, (16) collapses to (17) and the marginal law of is determined by alone. When is state dependent, the coefficient of the posterior martingale term in (8)–(15) depends on , so the posterior equation is not autonomous in . The augmented process supplies a closed Markov state; possible exceptional Markovian projections are outside the scope of the present comparison result. ∎
Remark 3 (On the cross term and projection).
The cross term in (16) reflects the common innovation noise driving both the posterior and the observation. When is constant the posterior coordinate has closed one-dimensional Markov dynamics and the stopping problem can be projected onto alone; the cross term persists only in the redundant two-dimensional representation . State dependence of removes the projection and makes the operative state.
Remark 4 (Degeneracy and regularity).
The diffusion matrix in (16),
has determinant zero, so diffuses along a single direction in -space: both coordinates are driven by the one innovation . The operator is therefore degenerate elliptic rather than uniformly elliptic. Hypoellipticity nonetheless holds under Hörmander-type conditions on , which underlies the regularity and continuity of the resulting two-dimensional stopping boundaries [peskir2019continuity, ernst2024gapeev].
Example 5 (State-dependent signal-to-noise: Bessel dimension).
With and on , the process is a Bessel process of dimension under , and is state dependent. The likelihood ratio (5) becomes
and does not yield a closed one-dimensional posterior equation; the augmented pair is the closed state. The resulting two-dimensional stopping problem admits an analytic characterization in terms of special functions and the associated free-boundary conditions [johnson2017quickest, johnson2018sequential].
4 Generic Optimal Stopping Formulation
The reductions above are instances of a single optimal stopping problem. Let be a time-homogeneous Markov process on a state space with generator (for example , , or ). Given a running cost and a terminal cost , set
| (19) |
with continuation and stopping regions
| (20) |
For sequential testing and of (4); for quickest detection and of (13).
Standing assumptions.
We assume throughout the comparison results that, for the running and terminal costs under consideration, the value function (19) is finite on and the first-entry time is optimal in (19). These are mild and standard under, for instance, lower semicontinuity of , continuity of , and a moment or transience condition ensuring finiteness; they hold in the diffusion models considered here [peskir2006optimal].
The dynamic programming principle then gives
| (21) |
or, equivalently, the variational inequality
| (22) |
which is in turn equivalent to the complementarity system
| (23) |
Both arguments of the maximum in (22) are nonpositive, which makes the sign convention transparent: on one has , and on one has , equivalently .111The compact form is equivalent to (22)–(23); we adopt the maximum/complementarity form because both of its arguments are manifestly nonpositive and the equality on is then immediate. Interpretations are in the classical, Sobolev, or viscosity sense according to the regularity of . The optimal rule is .
5 Delay-Penalty Comparison
The central result orders sequential rules by their running cost. It is purely comparative and does not require solving (22).
Theorem 6 (Delay-penalty ordering of sequential rules).
Let be a Markov process on with generator , fix a common terminal cost , and for let be measurable running costs with value functions , regions as in (19)–(20), and optimal first-entry times . Assume pointwise on . Then:
-
[label=()]
-
1.
(Value) on .
-
2.
(Regions and stopping times) , , and -almost surely for every . Apart from the standing assumptions that the value functions are finite and that the displayed first-entry times are optimal, no monotonicity, smoothness, or one-sidedness of the stopping set is needed.
-
3.
(Boundaries) If, in addition, each stopping region is one-sided in the posterior coordinate, for boundary functions —the structure established under a monotone signal-to-noise condition by gapeev2011sequential, gapeev2013bayesian, ernst2024gapeev—then for all .
Proof.
(i) For each admissible , pathwise gives , hence ; taking the infimum over yields .
(ii) Choosing shows . If , i.e. , then by (i) , so ; thus and, complementarily, . Since and are first-entry times of the same process into , any entry of into already lies in ; hence almost surely.
(iii) Under the one-sided representation, reads for each fixed , which holds if and only if . ∎
The monotonicity of the optimal stopping boundary with respect to the marginal delay penalty is intuitively depicted in Figure 1. When the system faces a more stringent delay penalty (i.e., ), the decision-maker becomes more conservative, which structurally shrinks the continuation region. Consequently, the optimal threshold shifts downward, yielding for all given states .
Remark 7 (Comparative-statics reading).
Parts (i)–(ii) hold under only the standing assumptions and deliver the operational message: a uniformly larger marginal delay penalty makes continuation less attractive everywhere and triggers (weakly) earlier alarms, hence shorter expected delay at the cost of more false alarms. Part (iii) translates this into boundary geometry, but only once the stopping region is known to be one-sided—a property that need not hold for arbitrary state-dependent diffusions and is exactly what the monotone signal-to-noise results secure. We do not establish the one-sided structure here; we invoke it from gapeev2011sequential, gapeev2013bayesian, ernst2024gapeev.
5.1 Linear delay cost
The classical disorder problem has and on the common state . Theorem 6 with specializes as follows.
Corollary 8 (Linear delay cost).
Let in the Bayesian disorder problem with common state and terminal cost . Then
If the stopping set has the form , then for all . In particular, in the constant-SNR case the single threshold satisfies : a larger delay cost rate implies an earlier alarm.
5.2 Sampling cost in sequential testing
The same principle applies on the testing side, where the running cost is the sampling cost. Scaling the sampling rate to replaces (4) by , i.e. with common terminal cost .
Corollary 9 (Sampling cost).
Let in the sequential testing problem with common terminal cost . Then , , and almost surely: a more expensive observation stream induces an earlier terminal decision and a narrower continuation band around the indifference point .
5.3 Nonlinear marginal delay penalties
For a general delay profile the risk is with nondecreasing and . The running cost is then no longer a function of alone, because the marginal delay cost at time depends on the unobserved elapsed post-change duration . The next proposition isolates the relevant statistic.
Proposition 10 (Marginal-cost augmentation).
Let be absolutely continuous, nondecreasing, with , and suppose
for the stopping times under consideration. Then
| (24) |
Consequently the nonlinear-delay disorder problem is the optimal stopping problem (19) with terminal cost and running cost
| (25) |
Proof.
Whether yields a finite-dimensional Markov stopping problem depends on and is not automatic. Three cases are representative.
(a) Linear delay, . Then and , recovering on the state of Theorem 2.
(b) Exponential delay, with . Then and the process can be represented through a weighted likelihood-ratio statistic, . In state-dependent diffusion models this statistic must still be combined with the observation state to obtain a closed Markov state; the representation is not a universal dimension reduction, and the resulting alarm boundary is generally observation-dependent [gapeev2013bayesian].
(c) General . The statistic need not admit a finite-dimensional filter, and the state must be augmented with accumulated-penalty information for the stopping problem to be Markovian.
In every case in which a common Markov state carries the statistics for two penalties , the pointwise ordering of marginal penalties transfers, via (25) and the monotonicity of conditional expectation, to . Theorem 6 then applies and yields the corresponding ordering of value functions, continuation regions, stopping times, and—where the one-sided structure holds—alarm boundaries. The convex (e.g. exponential) case has increasing, so a larger produces a pointwise-larger marginal penalty and an earlier alarm than the linear benchmark.
6 Free-Boundary Interpretation and Verification
On the Markov state (or its one-dimensional reduction), (22) is a free-boundary problem: on , on , with matching conditions across the free boundary . The continuous-fit condition always holds; the smooth-fit condition holds when the boundary point is probabilistically regular for the interior of and the diffusion is nondegenerate there. For regular one-dimensional diffusions smooth fit is standard [peskir2006optimal]. In the present degenerate two-dimensional setting it may fail where the diffusion coefficient vanishes (at or where ) or where the boundary is otherwise irregular; there only continuous fit is available, and the boundary’s continuity is itself a delicate question [peskir2019continuity].
When is constant the free-boundary problem reduces to the ordinary differential equation on the continuation interval, with of (17), and the boundary is a single threshold determined by smooth fit; this is the setting of Section 7. When is state dependent, the operator is degenerate elliptic in , is a curve, and—consistent with the two-boundary structure found by gapeev2011sequential—the alarm is the first exit of the posterior from a region bounded by observation-dependent (stochastic) boundaries; explicit solutions exist only in special cases, and the boundary is otherwise characterized by systems of nonlinear integral equations arising from the change-of-variable formula with local time on curves, or computed numerically.
A candidate solution of (22) is confirmed optimal by martingale verification.
Proposition 11 (Verification).
Let be continuous on , of polynomial growth, across , and on the interiors of and , and suppose
Suppose moreover that for every admissible the local martingale in (26) below, stopped along a localizing sequence , is uniformly integrable in the limit (e.g. a square-integrability or sublinear-growth condition on ). Then , and is optimal whenever .
Proof.
For an admissible and a localizing sequence , the Itô–Tanaka formula applied to gives
| (26) |
where is a local martingale. The (smooth-fit) hypothesis across ensures that the local-time term on the free boundary, which would otherwise appear because has a jump in its second derivatives there, vanishes; where only continuous fit holds, the local-time term is nonnegative and is retained in the inequality below without affecting its direction. Since , taking expectations and letting (using the growth and uniform-integrability hypotheses so that ) gives , hence . For the integrand vanishes on and , so the inequalities are equalities and . Thus and is optimal. ∎
7 Worked Example: Threshold Monotonicity in the Shiryaev Diffusion Model
We illustrate Corollary 8 in the constant-SNR quickest-detection model, where the posterior is a closed one-dimensional diffusion
| (27) |
With running cost and terminal cost , the value function solves the variational inequality (22) with the one-dimensional generator (17). On the continuation region the equation reads
| (28) |
and on the stopping region with required by (23), i.e. .
Numerical method.
We solve the obstacle problem (23) numerically rather than relying on a closed entrance-boundary shooting condition. The interval was discretized with a uniform grid of size . The degenerate diffusion coefficient was evaluated at grid points and discretized by central differences; the drift term was upwinded (forward difference), which renders the discrete generator a monotone -matrix. The degenerate left end , where the diffusion coefficient vanishes and the drift is , supplies its own discrete relation through the upwinded operator, so no entrance-boundary derivative condition is imposed by hand; at we set . The obstacle problem was then solved by projected (policy) iteration—each sweep performs a Gauss–Seidel/successive-overrelaxation update of followed by the projection onto the obstacle—iterated until the active set stabilized and the update fell below . The reported thresholds were stable under halving of (grids of points agree to the displayed digits) and were cross-checked against an entrance-boundary shooting solution of (28); the two methods agree.
Results.
Table 1 reports the optimal posterior threshold for and , together with the implied likelihood-ratio threshold . The threshold decreases monotonically as the delay cost rate increases. One checks directly that the computed thresholds satisfy , so the candidate solves the variational inequality and is optimal by Proposition 11. Figure 2 shows the value functions peeling away from the obstacle—continuation regions shrinking as grows—and the monotone curve . We emphasize that the monotonicity observed in the computed thresholds is not used as evidence for Corollary 8; it only illustrates the theorem, which is proved independently in Section 5.
| threshold () | |||
|---|---|---|---|
| 0.05 | 1.0 | 0.5 | |
| 0.05 | 1.0 | 1.0 | |
| 0.05 | 1.0 | 2.0 | |
| 0.05 | 1.0 | 5.0 |
Interpretation.
The comparative-statics content is direct. A higher per-unit delay cost makes the observer less willing to wait, so the alarm is raised at a lower posterior probability of disorder: the threshold , and with it the likelihood-ratio threshold , decreases in . By Theorem 6(ii) the associated alarm times are ordered pathwise, for , so a costlier delay yields uniformly earlier alarms; the price is a higher false-alarm probability, since stopping at a lower posterior is more often premature. The same qualitative picture persists for state-dependent , where is replaced by an observation-dependent boundary ordered as in Theorem 6(iii).
8 Relation to CUSUM and Shiryaev–Roberts Procedures
The present comparison result is Bayesian and optimal-stopping based. It is therefore closest to the Shiryaev procedure and its limiting Shiryaev–Roberts forms [shiryaev1963optimum, pollak1985optimal, pollak2009optimality], and it is not a minimax optimality statement for CUSUM [lorden1971procedures, moustakides1986optimal]. Nevertheless, the same state-dependence issue appears in the minimax formulations: when the likelihood increments depend on the current diffusion state, the CUSUM and Shiryaev–Roberts statistics are not closed one-dimensional Markov processes unless the observation state is included, and exactly characterized rules then involve the joint process . A comparison principle for the minimax thresholds analogous to Theorem 6 would require monotonicity of the worst-case detection delay in the penalty parameters and is left for future work.
9 Conclusion
We have organized sequential testing and Bayesian quickest detection for state-dependent diffusion observations around two facts. The first is a closed Markov-state reduction identifying as the sufficient statistic whenever the signal-to-noise ratio is state dependent, with an explicit degenerate generator. The second, and the methodological core of the paper, is a delay-penalty comparison theorem: uniformly larger marginal delay costs raise the value, shrink the continuation region, order the stopping times pathwise, and—under a one-sided boundary representation—lower the alarm boundary. The same principle gives a sampling-cost comparison for sequential testing. A constant-SNR Shiryaev example, solved through the variational inequality, exhibits the predicted threshold monotonicity. The comparison is deliberately structural rather than constructive: it presumes a stopping formulation and, for the boundary statement, the one-sided structure secured by monotone signal-to-noise conditions [ernst2024gapeev], and it does not provide new explicit boundary solutions. Natural extensions include multi-source and multi-hypothesis detection, where the posterior lives on a simplex and the stopping regions are separated by hypersurfaces [dayanik2008multisource]; nonlinear penalties whose marginal statistic (25) requires genuine state augmentation; and minimax analogues of the comparison principle. In each case the comparison continues to apply whenever a common Markov state and terminal cost are available.