[1]\fnmSatoshi \surKanno

[1] \orgdivQuantum Information Technology Department, Quantum Technology Division, Product Research and Development Division, \orgnameSoftBank Corp., \orgaddress \street1-7-1 Kaigan, \cityMinato-ku, \postcode105-7529, \stateTokyo, \countryJapan

Hodge Spectral Surrogates for Topology-Constrained Optimization

satoshi.kanno06@g.softbank.co.jp \fnmYoshi-aki \surShimada yoshiaki.shimada01@g.softbank.co.jp *

Abstract

Topological information is widely used in data analysis, network design, and machine learning, and topological constraints naturally arise when optimizing or generating objects with prescribed homological structure. However, directly controlling Betti numbers and persistent homology is difficult because they are discrete and combinatorial. We propose a differentiable framework for topology-constrained optimization based on Hodge-spectral relaxations of homological constraints and low-pass spectral filters. From soft graphs and soft clique complexes, we construct Hodge-Laplacian-type spectral relaxations that unify graph clique complexes and Vietoris–Rips filtrations of point clouds. In the hard limit, the penalty-regularized ambient operator recovers the ordinary Hodge Laplacian on the active subcomplex, while in the soft regime it serves as a differentiable low-frequency spectral surrogate. Homological information is represented by zero and near-zero modes, and differentiable topological objectives are defined using heat filters, resolvent filters, and polynomial Laplacian moments. For point clouds, we show that the proposed Hodge spectral-filter losses yield more spatially distributed gradients, smoother scale-normalized behavior under persistence-pairing changes, and geometry-aware update directions than persistent-homology-based losses. For graph clique complexes, Laplacian moments control normalized first-Betti-type quantities and can be combined with ordinary graph-feature objectives. We also discuss connections to trace-based normalized Betti-number estimation, polynomial spectral methods, and possible quantum trace estimation.

keywords:

Applied and computational topology, Topological data analysis, Hodge Laplacian, Spectral filters, Topology-constrained optimization

1 Introduction

Understanding the shape of data is a fundamental problem in applied mathematics, data analysis, network science, and machine learning. In particular, the interaction between topology and artificial intelligence has become an important theme in applied and computational topology. Data such as point clouds, images, graphs, and time series often contain not only local metric information but also global structural information, including connected components, holes, loops, and higher-order relations. Homological invariants, such as Betti numbers and persistent homology ([edelsbrunner2002topological, zomorodian2004computing, cohen2005stability, edelsbrunner2008persistent, otter2017roadmap]), provide mathematical tools for describing such global structures and have been widely used as topological descriptors and features in artificial-intelligence and machine-learning pipelines (see [adams2017persistence, hofer2017deep, bubenik2020persistence, chazal2014stochastic, kalivsnik2019tropical]). Such topological features can improve representations or regularization when the underlying data distribution has nontrivial geometric or topological structure.

Topology is also important not only as a descriptor but also as a quantity to be controlled in optimization through loss functions or regularizers (see [moor2020topological, vandaele2021topologically, gabrielsson2020topology]). In many applications, one would like the output of a model or an optimization procedure to satisfy prescribed topological constraints. For example, in image segmentation, controlling the topology of predicted regions can reduce spurious connected components or undesired holes(see [clough2020topological, hu2019topology]). In graph and network design, the Betti numbers of clique complexes describe higher-order connectivity, redundancy, and cyclic structure, which are related to robustness and alternative pathways in networks. Thus, differentiable control of persistent homology and Betti numbers is relevant to topology-constrained optimization in a broad applied-mathematical sense. In this prescriptive setting, topology is optimized rather than only observed: the aim is to deform, generate, or regularize an object so that its homological structure satisfies a desired condition.

Persistent homology provides a natural mechanism for such topological control. Since a persistence diagram or barcode records the birth and death of homology classes across a filtration, one can define losses that penalize undesired bars, preserve selected bars, or match a target barcode. These barcode-based losses make it possible to incorporate persistent homology directly into optimization and learning procedures. However, directly controlling topological quantities is difficult because they are inherently discrete and combinatorial. Betti numbers of clique complexes depend on simplex inclusion and ranks of boundary operators. Even when graph edges are relaxed into continuous weights or probabilities, the resulting clique complex and its Betti numbers remain governed by combinatorial conditions.

For persistent homology, differentiable losses based on barcode or persistence-diagram representations have been studied, and such losses are useful when the desired constraint is naturally expressed in terms of persistent features. However, when topology is used as an optimization signal, such losses have several limitations: gradients may tend to be localized on a small number of critical simplices (see [nigmetov2024topological, nigmetov2024topological_2]), persistence-pairing changes may cause abrupt changes in gradient directions, and barcode representations may compress away geometric and spectral information that could provide smoother and more spatially distributed update directions.

We emphasize that our goal is not to replace persistent homology as a descriptor. Persistent homology remains the appropriate object when one wants stable barcode-level summaries of multiscale topology. Our focus is different: we study topology as a differentiable loss or constraint inside an optimization loop. In this setting, a smooth Hodge-spectral surrogate can be useful because it provides gradients through low-frequency spectral structure rather than only through selected birth–death simplices.

In this work, we address these difficulties by representing homology through the zero modes of Hodge Laplacians. For a simplicial complex, the $k$ -th Betti number is equal to the dimension of the kernel of the $k$ -th Hodge Laplacian. Instead of directly counting zero eigenvalues, we introduce smooth low-pass spectral filters that emphasize zero and near-zero eigenmodes. These filters provide differentiable spectral surrogates for topological information and allow homological information to be used as a continuous optimization signal.

More specifically, we construct Hodge-Laplacian-type spectral relaxations from soft graphs and soft clique complexes. This gives a unified formulation for two standard sources of finite simplicial complexes: graph data and Vietoris–Rips filtrations of point clouds. In the point-cloud setting, soft edge activations are defined from pairwise distances and scale parameters. In the graph setting, edge probabilities induce soft simplex weights and hence soft clique complexes. In both cases, topological losses are defined by applying low-pass spectral filters, such as heat filters, resolvent filters, and polynomial moment filters, to the corresponding Hodge-Laplacian-type spectral relaxations. In the hard limit, the proposed ambient operator recovers the ordinary Hodge Laplacian on the active subcomplex and hence retains the standard homological interpretation. In the soft regime, the same construction should be interpreted as a differentiable low-frequency spectral surrogate for homological structure.

The Laplacian-based formulation also connects naturally with polynomial spectral methods and trace estimation. In particular, trace-type quantities such as

\operatorname{Tr}(I-\alpha L_{q})^{d}

can be interpreted as low-frequency spectral masses and, in the hard ordinary-complex setting, as smooth surrogates for normalized Betti numbers, since zero modes contribute one to the trace while nonzero modes are suppressed by the polynomial filter. This structure is closely related to stochastic trace estimation and polynomial approximations of spectral projectors. It also has a structural connection to quantum trace-estimation approaches for normalized Betti numbers, which often formulate Betti numbers through the null space of combinatorial Laplacians. However, this paper does not implement a quantum algorithm and does not claim quantum advantage. We therefore treat the quantum connection as a possible computational direction rather than as a main contribution (see [lloyd2016quantum, akhalwaya2024comparing, gyurik2022towards, yamauchi2025quantum]).

The contributions of this work are summarized as follows.

1.

We construct an ambient Hodge-spectral relaxation for finite simplicial complexes. In the hard limit, the penalty-regularized ambient operator recovers the ordinary Hodge Laplacian on the active subcomplex and hence the corresponding Betti number.
2.

We define differentiable topological losses using heat filters, resolvent filters, and polynomial Laplacian moments. These objectives include spectral-filter matching losses and trace-type Betti surrogates.
3.

For Vietoris–Rips filtrations of point clouds, we numerically compare the proposed Hodge spectral-filter losses with persistent-homology-based losses and show that they provide more spatially distributed gradients, smoother scale-normalized behavior under persistence-pairing changes, and update directions that better reflect surrounding geometry.
4.

For graph clique complexes, we use Laplacian moments to control normalized first-Betti-type quantities and show that the proposed topological loss can be combined with ordinary graph-feature objectives.
5.

We discuss computational trade-offs and identify polynomial filtering, Chebyshev approximation, and trace-estimation-based implementations as possible routes toward larger-scale applications.

In summary, this work presents a Hodge-spectral framework for controlling homological constraints in point clouds and graphs. By replacing discrete topological quantities with smooth low-pass spectral surrogates, the proposed method enables topology to be treated not only as a descriptor of data, but also as a controllable object in optimization, network design, machine learning, and future quantum-assisted topological computation.

2 Preliminaries and Problem Setting

In this section, we summarize the topological quantities considered in this work and the difficulties that arise when using them as loss functions. We first introduce the Betti numbers of clique complexes constructed from graphs, and describe why it is difficult to directly use them as loss functions. We then introduce persistent homology for point-cloud data and loss functions based on barcode diagrams. Finally, we discuss three issues of barcode-based loss functions: gradient localization, discontinuity caused by changes in persistence pairings, and loss of geometric and spectral information.

2.1 Clique Complexes and Betti Numbers

We first introduce the Betti numbers of clique complexes as topological quantities for graph data. Let $G=(V,E)$ be a graph, where $V$ is the set of vertices and $E$ is the set of edges. The clique complex $X(G)$ of $G$ is a simplicial complex obtained by regarding each clique in the graph as a simplex. That is, a subset of vertices

\sigma=\{v_{0},\ldots,v_{k}\}\subset V

is a $k$ -simplex if, for any distinct $i,j$ ,

(v_{i},v_{j})\in E

holds. Equivalently, $\sigma$ must form a $(k+1)$ -clique. Therefore, the clique complex is defined as

X(G)=\{\sigma\subset V:\sigma\text{ is a clique in }G\}.

Let $C_{k}(X(G))$ be the $k$ -th chain group of $X(G)$ , and let

\partial_{k}:C_{k}(X(G))\to C_{k-1}(X(G))

be the boundary operator. The $k$ -th homology group is defined by

H_{k}(X(G))=\ker\partial_{k}/\operatorname{im}\partial_{k+1}.

The dimension of this homology group, $\beta_{k}(X(G))=\dim H_{k}(X(G))$ is called the $k$ -th Betti number. In particular, $\beta_{0}$ represents the number of connected components, while $\beta_{1}$ represents the number of independent one-dimensional cycles, namely loops that are not filled by triangles or higher-dimensional simplices.

The Betti numbers of clique complexes capture higher-order connectivity and cyclic structures of graphs. Therefore, controlling the Betti numbers of clique complexes in graph generation or network design is meaningful for controlling redundancy and robustness of graph structures.

2.2 Difficulties in Using Betti Numbers as Loss Functions

Suppose that we want to optimize a graph so that the Betti number of its clique complex approaches a target value. For example, for a target Betti number $\beta_{k}^{\mathrm{tar}}$ , one may consider the loss function

L_{\beta}(G)=\left(\beta_{k}(X(G))-\beta_{k}^{\mathrm{tar}}\right)^{2}.

This formulation is natural from an intuitive viewpoint. However, it is difficult to optimize this loss function using standard gradient-based methods.

The reason is that the map

G\longmapsto X(G)\longmapsto\beta_{k}(X(G))

is combinatorial and discrete. A small change in the presence or absence of edges can discontinuously change the set of simplices contained in the clique complex. As a result, the ranks of boundary operators and the dimensions of homology groups can also change discontinuously.

2.3 Persistent Homology

Next, we introduce persistent homology[edelsbrunner2002topological, zomorodian2004computing, cohen2005stability, edelsbrunner2008persistent, otter2017roadmap] as a topological quantity for point-cloud data and related data types. Suppose that, from data $X$ , we construct a sequence of simplicial complexes

K_{0}\subset K_{1}\subset\cdots\subset K_{m}

according to a scale parameter. Such a nested sequence of simplicial complexes is called a filtration. For example, for point-cloud data, one can construct a filtration using Vietoris–Rips complexes indexed by a distance scale $r$ .

At each scale $r$ , the $k$ -th homology group is

H_{k}(K_{r})=\ker\partial_{k}/\operatorname{im}\partial_{k+1}.

Persistent homology tracks when homology classes are born and when they die along the filtration. For a homology class $\gamma$ , let its birth time be $b_{\gamma}$ and its death time be $d_{\gamma}$ . Then the $k$ -th persistence diagram is represented as

\operatorname{Dgm}_{k}(X)=\{(b_{\gamma},d_{\gamma})\}_{\gamma}\subset\mathbb{R}^{2}.

The persistence of $\gamma$ is defined by $\operatorname{pers}(\gamma)=d_{\gamma}-b_{\gamma}$ .

Persistence diagrams and barcode diagrams provide compact summaries of the multiscale topological structure of data. Homology classes with long persistence are often interpreted as reflecting essential structures of the data rather than noise. For this reason, persistent homology is widely used for shape analysis of point clouds, images, time-series data, and related objects.

2.4 Loss Functions Based on Barcode Diagrams

One way to incorporate persistent homology into machine learning is to construct loss functions from barcode diagrams or persistence diagrams. In general, given a target persistence diagram $D^{\mathrm{tar}}$ , one may define a loss function for input data $X$ as

L_{\mathrm{PD}}(X)=\ell\left(\operatorname{Dgm}_{k}(X),D^{\mathrm{tar}}\right).

For example, if the goal is to remove undesired homology classes, let $U$ be the set of undesired classes. Then one can use a loss of the form

L_{\mathrm{PD}}(X)=\sum_{\gamma\in U}(d_{\gamma}-b_{\gamma})^{2}.

This loss acts to shorten the lifetimes of undesired homology classes.

On the other hand, if the goal is to preserve specific homology classes, let $T$ be the set of classes to be preserved, and let $\tau_{\gamma}$ be the target persistence. Then one can consider the loss function

L_{\mathrm{PD}}(X)=\sum_{\gamma\in T}\left((d_{\gamma}-b_{\gamma})-\tau_{\gamma}\right)^{2}.

Such barcode-based loss functions can be differentiated through birth and death values. The birth and death of each homology class $\gamma$ are determined by certain critical simplices. That is, one can write

b_{\gamma}=f(\sigma_{b}),\qquad d_{\gamma}=f(\sigma_{d}),

where $\sigma_{b}$ is the simplex that determines the birth, $\sigma_{d}$ is the simplex that determines the death, and $f$ is the function assigning filtration values to simplices.

Then the gradient of the loss function can formally be written as

\nabla_{X}L_{\mathrm{PD}}=\frac{\partial L_{\mathrm{PD}}}{\partial b_{\gamma}}\nabla_{X}f(\sigma_{b})+\frac{\partial L_{\mathrm{PD}}}{\partial d_{\gamma}}\nabla_{X}f(\sigma_{d}).

Thus, in barcode-based loss functions, gradients can be computed through the critical simplices that determine birth and death values.

2.5 Issues of Barcode-Based Loss Functions

This property makes it possible to use persistent homology as a loss function. However, this formulation has the following three important issues.

2.5.1 Gradient Localization

The first issue is that gradients tend to be localized on a small number of critical simplices(see [nigmetov2024topological, nigmetov2024topological_2]).

As described above, in barcode-based loss functions, the birth and death of each homology class are determined by critical simplices $\sigma_{b}$ and $\sigma_{d}$ . Therefore, the gradient of the loss mainly flows through

\nabla_{X}f(\sigma_{b}),\qquad\nabla_{X}f(\sigma_{d}).

In other words, the actual updates tend to be concentrated on the vertices contained in these critical simplices and their neighborhoods.

However, topological structures are inherently determined by the global configuration of the data. For example, when a point cloud forms a large loop, the birth or death of the loop may be represented by a small number of critical simplices, but geometrically the shape of the entire loop is important. Nevertheless, barcode-based losses concentrate gradients on a small number of simplices and may fail to provide smooth update directions for the entire data set.

Such gradient localization can make optimization unstable and may prevent geometrically natural deformations.

2.5.2 Discontinuity of Gradients Caused by Pairing Changes

The second issue is that gradients can change discontinuously when persistence pairings change due to small perturbations of filtration values.

In persistent homology, each homology class $\gamma$ is associated with a birth–death pair of simplices

(\sigma_{b},\sigma_{d}).

However, when a small perturbation is added to the input data $X$ , the ordering of filtration values of simplices may change. As a result, the persistence pairing may change as

(\sigma_{b},\sigma_{d})\longrightarrow(\sigma^{\prime}_{b},\sigma^{\prime}_{d}).

In this case, even if the persistence diagram itself remains stable, the simplices through which gradients flow switch from

\nabla_{X}f(\sigma_{b}),~\nabla_{X}f(\sigma_{d})\quad\longrightarrow\quad\nabla_{X}f(\sigma^{\prime}_{b}),~\nabla_{X}f(\sigma^{\prime}_{d}).

Consequently, even if the value of the loss function does not change significantly, the gradient direction may change discontinuously.

This issue is important from the viewpoint of optimization. Gradient-based methods update parameters using local changes of the loss function. If the gradient direction changes discontinuously under small perturbations, stable optimization becomes difficult.

2.5.3 Loss of Geometric and Spectral Information

The third issue is that barcode diagrams are representations specialized for birth–death information of homology classes, and they do not sufficiently preserve surrounding geometric and spectral information.

A persistence diagram is obtained from input data $X$ through the map

X\longmapsto\operatorname{Dgm}_{k}(X)=\{(b_{\gamma},d_{\gamma})\}_{\gamma}.

This representation compactly summarizes the scales at which each homology class is born and dies. However, in this process, much information is compressed, including the geometric locations supporting the homology classes, the surrounding neighborhood structure, and spectral information such as eigenvectors and low-frequency modes of Laplacians.

For example, two homology classes may have the same persistence, but they may be arranged differently in the data space, may have different degrees of geometric smoothness as cycles, or may contain different near-closed structures. Barcode diagrams do not sufficiently distinguish such differences.

Therefore, while barcode-based loss functions are effective for controlling topological summary quantities, they may be insufficient for providing geometrically natural and smooth update directions for the entire data.

2.6 Problem Addressed in This Work

The preceding subsections identify two related but distinct uses of topology in data analysis and optimization. The first is descriptive: one computes Betti numbers, persistent homology, or barcode diagrams of a fixed data set in order to describe its global structure. The second is prescriptive: one seeks to optimize a point cloud, graph, or model output so that its topology satisfies a desired condition. This paper focuses on the second use.

The difficulty is that the topological quantities considered above do not directly provide smooth optimization signals. For clique complexes of graphs, the map

G\longmapsto X(G)\longmapsto\beta_{k}(X(G))

is combinatorial, because the set of simplices and the ranks of the boundary operators can change discontinuously when edges are added or removed. For Vietoris–Rips filtrations, barcode-based losses make persistent homology differentiable through birth and death values, but the resulting gradients may be localized on critical simplices, may change abruptly when persistence pairings switch, and may discard geometric and spectral information that could be useful for optimization.

Our goal is therefore not merely to compute topological summaries, but to construct differentiable objectives that retain a clear relationship with homology while providing smoother and more spatially distributed optimization signals. We use the Hodge-theoretic identity between homology and zero modes of the combinatorial Laplacian as the starting point. For a fixed finite simplicial complex $K$ , the kernel of the $q$ -th Hodge Laplacian $L_{q}(K)$ is naturally isomorphic to $H_{q}(K)$ , and hence $\dim\ker L_{q}(K)=\beta_{q}(K)$ .

The main question is how to use this identity when the simplicial complex itself changes with continuous parameters. In the next section, we address this question by embedding changing complexes into a fixed ambient chain space, separating inactive simplex directions by a penalty term, and then replacing hard simplex indicators with soft simplex weights. This leads to a Hodge-Laplacian-type spectral relaxation that recovers the ordinary Hodge-theoretic interpretation in the hard limit and provides differentiable low-frequency spectral quantities in the soft regime.

3 Construction of Hodge-Laplacian-type spectral relaxation

In this section, we introduce ambient Hodge-spectral relaxations as the basic objects for constructing differentiable topological losses for both point-cloud data and graph data. The construction is exact in the hard simplicial-complex limit and is used as a smooth spectral surrogate in the soft regime.

The basic idea of this work is to represent both point-cloud data and graph data as soft graphs and then construct soft clique complexes from them. In graph generation, edge logits are used as optimization variables. In Vietoris–Rips filtrations, edge logits are defined from pairwise distances and a scale parameter. Therefore, both settings can be described by the common sequence

\theta\longmapsto a_{e}(\theta)\longmapsto p_{e}(\theta)\longmapsto w_{\sigma}(\theta)\longmapsto\widehat{L}_{q}(\theta).

Here, $a_{e}$ denotes an edge logit, $p_{e}$ denotes a soft edge activation, $w_{\sigma}$ denotes a simplex activation, and $\widehat{L}_{q}$ denotes a penalty-regularized Hodge-Laplacian-type spectral relaxation.

The terminology is important: for hard simplex indicators, the construction recovers an ordinary Hodge Laplacian on the active subcomplex. For soft simplex weights, the same formula should be interpreted as a differentiable Hodge-spectral surrogate rather than as the Hodge Laplacian of an exact chain complex.

For the ordinary Hodge Laplacian, zero modes correspond to homology classes, and the dimension of the kernel of the $q$ -th Hodge Laplacian is equal to the $q$ -th Betti number. However, when the simplicial complex changes according to input data or model parameters, the dimension and basis of the corresponding chain space may also change. Therefore, it is difficult to directly treat the ordinary Hodge Laplacian as a differentiable object on a fixed vector space.

To address this issue, we introduce a fixed ambient chain space containing all candidate simplices. On this space, we construct a hard Hodge Laplacian using projections and a penalty term so that the number of zero modes agrees with the Betti number of the active subcomplex. We then relax the projections using simplex activations obtained from a soft graph, yielding a Hodge-Laplacian-type spectral relaxation that depends smoothly on the parameters and recovers the ordinary Hodge-theoretic interpretation in the hard limit.

3.1 Ambient Chain Spaces and Boundary Operators

Let $K_{\max}$ be a finite fixed ambient simplicial complex containing all candidate simplices that may appear during optimization. For example, given a candidate edge set $E_{\max}$ , one may define the maximum candidate graph

G_{\max}=(V,E_{\max})

and use its clique complex $K_{\max}=X(G_{\max})$ as the ambient complex.

For each dimension $q$ , define the ambient $q$ -chain space by

C_{q}^{\mathrm{amb}}=C_{q}(K_{\max}).

This is a finite-dimensional real vector space generated by all oriented $q$ -simplices in $K_{\max}$ . After fixing orientations and equipping it with the standard inner product, $C_{q}^{\mathrm{amb}}$ can be regarded as a finite-dimensional Hilbert space.

Let $B_{q}:C_{q}^{\mathrm{amb}}\to C_{q-1}^{\mathrm{amb}}$ be the ambient boundary operator of the fixed complex $K_{\max}$ . For an oriented simplex $\sigma=[v_{0},\ldots,v_{q}]$ ,

\partial_{q}\sigma=\sum_{i=0}^{q}(-1)^{i}[v_{0},\ldots,\widehat{v_{i}},\ldots,v_{q}],

and the matrix representation of this operator is $B_{q}$ . Since the boundary of a boundary is zero, we have $B_{q}B_{q+1}=0$ .

Suppose that a hard subcomplex $K(\theta)\subset K_{\max}$ is determined by a parameter $\theta$ . For each $q$ -simplex $\sigma\in K_{\max}^{(q)}$ , define the activity indicator by

\chi_{\sigma}(\theta)=\begin{cases}1,&\sigma\in K(\theta),\\ 0,&\sigma\notin K(\theta).\end{cases}

The orthogonal projection onto the active $q$ -simplex directions is defined by

\Pi_{q}(\theta)=\operatorname{diag}\left(\chi_{\sigma}(\theta)\right)_{\sigma\in K_{\max}^{(q)}}.

Then the $q$ -chain space of $K(\theta)$ is embedded as

C_{q}(K(\theta))=\operatorname{Im}\Pi_{q}(\theta)\subset C_{q}^{\mathrm{amb}}.

Since $K(\theta)$ is a simplicial complex, every face of an active simplex is also active. Hence,

B_{q}\operatorname{Im}\Pi_{q}(\theta)\subset\operatorname{Im}\Pi_{q-1}(\theta),

or equivalently,

\Pi_{q-1}(\theta)B_{q}\Pi_{q}(\theta)=B_{q}\Pi_{q}(\theta).

We define the hard restricted boundary operator on the ambient space by

B_{q}^{\mathrm{hard}}(\theta)=\Pi_{q-1}(\theta)B_{q}\Pi_{q}(\theta).

On the active subspace $C_{q}(K(\theta))=\operatorname{Im}\Pi_{q}(\theta)$ , this agrees with the ordinary boundary map of the subcomplex $K(\theta)$ .

3.2 Penalty-Regularized Hard Ambient Hodge Laplacian

A naive Hodge operator on the fixed ambient space $C_{q}^{\mathrm{amb}}$ would be

\left(B_{q}^{\mathrm{hard}}(\theta)\right)^{\top}B_{q}^{\mathrm{hard}}(\theta)+B_{q+1}^{\mathrm{hard}}(\theta)\left(B_{q+1}^{\mathrm{hard}}(\theta)\right)^{\top}.

However, since this operator acts on the entire ambient space, inactive simplex directions may remain as spurious zero modes that do not correspond to homology classes of the active subcomplex. To remove them, we add an inactive-direction penalty. For $\mu>0$ , define

L_{q}^{\mathrm{hard}}(\theta)=\left(B_{q}^{\mathrm{hard}}(\theta)\right)^{\top}B_{q}^{\mathrm{hard}}(\theta)+B_{q+1}^{\mathrm{hard}}(\theta)\left(B_{q+1}^{\mathrm{hard}}(\theta)\right)^{\top}+\mu\left(I-\Pi_{q}(\theta)\right).

The last term assigns energy $\mu$ to inactive $q$ -simplex directions and removes them from the low-eigenvalue region.

With respect to the orthogonal decomposition

C_{q}^{\mathrm{amb}}=\operatorname{Im}\Pi_{q}(\theta)\oplus\ker\Pi_{q}(\theta),

we have

L_{q}^{\mathrm{hard}}(\theta)=L_{q}(K(\theta))\oplus\mu I_{\ker\Pi_{q}(\theta)}.

Therefore,

\ker L_{q}^{\mathrm{hard}}(\theta)=\ker L_{q}(K(\theta))\simeq H_{q}(K(\theta)).

Hence,

\dim\ker L_{q}^{\mathrm{hard}}(\theta)=\beta_{q}(K(\theta)).

Thus, the number of zero modes of the penalty-regularized hard ambient Hodge Laplacian agrees with the $q$ -th Betti number of the active subcomplex.

Proposition 1 (Hard-limit consistency of the ambient Hodge Laplacian).

Let $K(\theta)\subset K_{\max}$ be an active subcomplex and let $L_{q}^{\mathrm{hard}}(\theta)$ be the penalty-regularized hard ambient Hodge Laplacian defined above. Then, with respect to the orthogonal decomposition

C_{q}^{\mathrm{amb}}=C_{q}(K(\theta))\oplus C_{q}(K(\theta))^{\perp},

one has

L_{q}^{\mathrm{hard}}(\theta)=L_{q}(K(\theta))\oplus\mu I_{C_{q}(K(\theta))^{\perp}}.

Consequently,

\ker L_{q}^{\mathrm{hard}}(\theta)\simeq H_{q}(K(\theta)),\qquad\dim\ker L_{q}^{\mathrm{hard}}(\theta)=\beta_{q}(K(\theta)).

Proof.

On the active subspace $C_{q}(K(\theta))=\operatorname{Im}\Pi_{q}(\theta)$ , the projected boundary operators coincide with the ordinary boundary operators of the subcomplex $K(\theta)$ , because $K(\theta)$ is closed under taking faces. On the inactive orthogonal complement, the projected boundary terms vanish and the penalty term acts as multiplication by $\mu$ . Thus the operator decomposes as

L_{q}(K(\theta))\oplus\mu I_{C_{q}(K(\theta))^{\perp}}.

The statement follows from the finite-dimensional Hodge decomposition, which identifies $\ker L_{q}(K(\theta))$ with $H_{q}(K(\theta))$ . ∎

3.3 Soft Graphs and Hodge-Laplacian-type spectral relaxation

We now relax the hard projection $\Pi_{q}(\theta)$ using a soft graph and construct a differentiable spectral relaxation of the hard ambient operator. Let $V=\{1,\ldots,n\}$ be the vertex set, and let $E_{\max}\subset\binom{V}{2}$ be the set of candidate edges. For each candidate edge $e\in E_{\max}$ , define an edge logit $a_{e}(\theta)\in\mathbb{R}$ , and define the corresponding soft edge activation by

p_{e}(\theta)=\sigma(a_{e}(\theta))=\frac{1}{1+\exp(-a_{e}(\theta)/\varepsilon_{e})}.

We call $G_{\theta}=(V,E_{\max},p_{\theta})$ a soft graph.

The activation or soft weight of a simplex $\sigma$ is defined as the product of the activations of its constituent edges:

w_{\sigma}(\theta)=\prod_{e\subset\sigma}p_{e}(\theta).

Vertices are assumed to always exist, so $w_{v}(\theta)=1$ . In particular, for an edge $e$ , $w_{e}=p_{e}$ , and for a triangle $t=\{i,j,k\}$ ,

w_{t}=p_{ij}p_{ik}p_{jk}.

In the hard limit $p_{e}\to\mathbf{1}_{e\in E}$ , we have

w_{\sigma}(\theta)\to\mathbf{1}_{\sigma\in X(G)}.

Thus, this construction is a continuous relaxation of the ordinary clique complex at the level of simplex indicators.

For each dimension $q$ , define

W_{q}(\theta)=\operatorname{diag}\left(w_{\sigma}(\theta)\right)_{\sigma\in K_{\max}^{(q)}},\qquad R_{q}(\theta)=W_{q}(\theta)^{1/2}.

Using these matrices, define the weighted boundary-type operator by

\widetilde{B}_{q}(\theta)=R_{q-1}(\theta)B_{q}R_{q}(\theta).

In components,

\left(\widetilde{B}_{q}(\theta)\right)_{\tau,\sigma}=\sqrt{w_{\tau}(\theta)}(B_{q})_{\tau,\sigma}\sqrt{w_{\sigma}(\theta)}.

Thus, the boundary relation between a simplex and its face is smoothly weighted by their activations.

Remark 1 (Soft weighted boundaries).

For hard simplex indicators, the projected boundary operators recover the ordinary boundary maps of the active subcomplex. For soft weights, however, the matrices

\widetilde{B}_{q}(\theta)=R_{q-1}(\theta)B_{q}R_{q}(\theta)

do not generally satisfy

\widetilde{B}_{q-1}(\theta)\widetilde{B}_{q}(\theta)=0.

Thus, in the soft regime, these matrices should not be interpreted as boundary maps of an exact chain complex. They are instead boundary-inspired weighted operators used to define a smooth Hodge-Laplacian-type spectral relaxation. The exact homological interpretation is recovered in the hard limit.

We define the $q$ -th penalty-regularized Hodge-Laplacian-type spectral relaxation by

\widehat{L}_{q}(\theta)=\widetilde{B}_{q}(\theta)^{\top}\widetilde{B}_{q}(\theta)+\widetilde{B}_{q+1}(\theta)\widetilde{B}_{q+1}(\theta)^{\top}+\mu\left(I-W_{q}(\theta)\right).

The first term is the lower Laplacian-type term, the second term is the upper Laplacian-type term, and the third term is the inactive-direction penalty.

If $w_{\sigma}(\theta)\approx 1$ , the penalty has almost no effect on the corresponding direction. If $w_{\sigma}(\theta)\approx 0$ , the direction receives approximately energy $\mu$ . In the hard limit $w_{\sigma}(\theta)\to\chi_{\sigma}(\theta)$ , we have

W_{q}(\theta)\to\Pi_{q}(\theta),\qquad R_{q}(\theta)\to\Pi_{q}(\theta),\qquad\widehat{L}_{q}(\theta)\to L_{q}^{\mathrm{hard}}(\theta).

Therefore, $\widehat{L}_{q}(\theta)$ is a differentiable Hodge-spectral relaxation of the hard Hodge Laplacian that recovers the Betti-number interpretation in the hard limit. Away from the hard limit, its low-frequency spectrum should be interpreted as a spectral surrogate for homological structure rather than as an exact Betti-number representation.

Proposition 2 (Convergence to the hard ambient operator).

Let $w_{\sigma}^{(m)}\in[0,1]$ be a sequence of simplex weights converging to hard activity indicators $\chi_{\sigma}\in\{0,1\}$ for all simplices of $K_{\max}$ . Let $\widehat{L}_{q}(w^{(m)})$ be the corresponding penalty-regularized Hodge-Laplacian-type spectral relaxation. Then

\widehat{L}_{q}(w^{(m)})\longrightarrow L_{q}^{\mathrm{hard}}(K)

in operator norm, where

K=\{\sigma\in K_{\max}:\chi_{\sigma}=1\}.

Proof.

The ambient boundary matrices $B_{q}$ are fixed finite matrices. The diagonal matrices $W_{q}(w^{(m)})$ converge entrywise to $\Pi_{q}$ , and the square-root matrices $R_{q}(w^{(m)})=W_{q}(w^{(m)})^{1/2}$ converge entrywise to $\Pi_{q}$ . Since all matrices are finite-dimensional, entrywise convergence implies convergence in operator norm. Therefore,

R_{q-1}(w^{(m)})B_{q}R_{q}(w^{(m)})\longrightarrow\Pi_{q-1}B_{q}\Pi_{q},

and the corresponding lower, upper, and penalty terms converge to those of $L_{q}^{\mathrm{hard}}(K)$ . ∎

3.4 Logit Parameterization for Graph Clique Complexes

In graph generation or graph-structured optimization, the optimization variables are the edge logits

\theta=a=\{a_{e}\}_{e\in E_{\max}}.

Then $p_{e}(a)=\sigma(a_{e})$ and

w_{\sigma}(a)=\prod_{e\subset\sigma}p_{e}(a).

Thus, the Hodge-Laplacian-type spectral relaxation is obtained through

a\longmapsto p_{e}(a)\longmapsto w_{\sigma}(a)\longmapsto\widehat{L}_{q}(a).

For controlling one-dimensional cyclic structure, which is measured by the first Betti number in the hard clique-complex limit, we use $q=1$ and define

\widehat{L}_{1}(a)=\widetilde{B}_{1}(a)^{\top}\widetilde{B}_{1}(a)+\widetilde{B}_{2}(a)\widetilde{B}_{2}(a)^{\top}+\mu(I-W_{1}(a)).

This operator acts on the fixed ambient edge space $C_{1}^{\mathrm{amb}}$ . Hence, graph optimization or graph-structured topology control can be performed in a fixed-dimensional Hilbert space, even though the underlying hard clique complex changes combinatorially.

3.5 Soft-Graph Representation of Vietoris–Rips Filtrations

We next represent Vietoris–Rips filtrations of point-cloud data within the same soft-graph and Hodge-spectral framework. Let

X=\{x_{1},\ldots,x_{n}\}\subset\mathbb{R}^{d}

be a point cloud. In the ordinary Vietoris–Rips complex at scale $r>0$ , an edge $(i,j)$ is present when $\|x_{i}-x_{j}\|\leq r$ .

To relax this condition smoothly, define the stabilized distance with a small numerical parameter $\delta>0$

d_{ij}(X)=\sqrt{\|x_{i}-x_{j}\|^{2}+\delta},

and define the Vietoris–Rips edge logit by

a_{ij}^{(r)}(X)=\frac{r-d_{ij}(X)}{\varepsilon}.

Here, $\varepsilon>0$ controls the softness of the threshold. The corresponding soft edge activation is

p_{ij}^{(r)}(X)=\sigma\left(a_{ij}^{(r)}(X)\right)=\frac{1}{1+\exp\left(-\frac{r-d_{ij}(X)}{\varepsilon}\right)}.

This value is close to one when $d_{ij}(X)<r$ , and close to zero when $d_{ij}(X)>r$ . In the hard-threshold limit $\varepsilon\to 0$ ,

p_{ij}^{(r)}(X)\to\mathbf{1}_{d_{ij}(X)\leq r}.

The activation of a simplex $\sigma$ is defined by

w_{\sigma}^{(r)}(X)=\prod_{e\subset\sigma}p_{e}^{(r)}(X).

In the hard limit,

w_{\sigma}^{(r)}(X)\to\mathbf{1}_{\sigma\in\mathrm{VR}_{r}(X)},

because the Vietoris–Rips condition is equivalent to requiring all edges in $\sigma$ to have length at most $r$ , namely $\max_{e\subset\sigma}d_{e}(X)\leq r$ . Thus, the soft clique construction recovers the ordinary Vietoris–Rips complex at each fixed scale in the hard-threshold limit.

Thus, the Vietoris–Rips filtration can be represented as the soft clique complex of a soft graph whose edge logits are

a_{ij}^{(r)}(X)=\frac{r-d_{ij}(X)}{\varepsilon}.

For each scale $r$ , we obtain the sequence

X\longmapsto a^{(r)}(X)\longmapsto p^{(r)}(X)\longmapsto w^{(r)}(X)\longmapsto\widehat{L}_{q}^{(r)}(X).

More explicitly, define

W_{q}^{(r)}(X)=\operatorname{diag}\left(w_{\sigma}^{(r)}(X)\right)_{\sigma\in K_{\max}^{(q)}},\qquad R_{q}^{(r)}(X)=\left(W_{q}^{(r)}(X)\right)^{1/2},

and

\widetilde{B}_{q}^{(r)}(X)=R_{q-1}^{(r)}(X)B_{q}R_{q}^{(r)}(X).

Then

\widehat{L}_{q}^{(r)}(X)=\left(\widetilde{B}_{q}^{(r)}(X)\right)^{\top}\widetilde{B}_{q}^{(r)}(X)+\widetilde{B}_{q+1}^{(r)}(X)\left(\widetilde{B}_{q+1}^{(r)}(X)\right)^{\top}+\mu\left(I-W_{q}^{(r)}(X)\right).

This is the Vietoris–Rips instance of the Hodge-Laplacian-type spectral relaxation defined above. For controlling one-dimensional homology, we use $q=1$ .

3.6 Multi-Scale Vietoris–Rips Construction

Persistent homology describes the birth and death of homology classes across scales. Therefore, instead of using a single scale, we consider a sequence

r_{1}<r_{2}<\cdots<r_{M}.

For each scale $r_{m}$ , define

p_{ij}^{(r_{m})}(X)=\sigma\left(\frac{r_{m}-d_{ij}(X)}{\varepsilon}\right).

This yields a Hodge-Laplacian-type spectral relaxation

\widehat{L}_{q}^{(r_{m})}(X)

at each scale. The family

\left\{\widehat{L}_{q}^{(r_{m})}(X)\right\}_{m=1}^{M}

can be regarded as a Hodge-spectral smooth relaxation of the Vietoris–Rips filtration. In the next section, we apply low-pass spectral filters to these operators and define differentiable topological objectives for controlling Betti-type and persistent-homological structures.

4 Topological Objectives via Low-Pass Hodge Spectral Filters

In this section, we define differentiable topological objectives using the penalty-regularized Hodge-Laplacian-type spectral relaxation $\widehat{L}_{q}(\theta)$ constructed in the previous section.

The basic idea is to regard homology as the zero modes of the Hodge Laplacian and to extract zero and near-zero modes using smooth low-pass spectral filters. For an ordinary simplicial complex $K$ , the $q$ -th Betti number satisfies

\beta_{q}(K)=\dim\ker L_{q}(K).

However, directly counting zero eigenvalues, namely the map $L_{q}\mapsto\dim\ker L_{q}$ , is discontinuous and is not suitable for gradient-based optimization. Therefore, we use a smooth function $f:\mathbb{R}_{\geq 0}\to\mathbb{R}$ that emphasizes the low-eigenvalue region and use

f(\widehat{L}_{q}(\theta))\quad\text{or}\quad\operatorname{Tr}f(\widehat{L}_{q}(\theta))

as differentiable spectral surrogates for topological information. In the hard ordinary-complex case, these quantities approximate information about the kernel of the Hodge Laplacian. In the soft regime, they should be interpreted as low-frequency spectral quantities associated with the Hodge-Laplacian-type relaxation, rather than as exact Betti numbers.

We first introduce low-pass spectral filters and define two types of losses: spectral-filter matching losses and trace-type spectral-mass Betti surrogate losses. We then formulate normalized first-Betti-type control for graph clique complexes and topology control for Vietoris–Rips filtrations of point clouds within the same framework. All losses introduced in this section are differentiable with respect to $\widehat{L}_{q}(\theta)$ . Since $\widehat{L}_{q}(\theta)$ is constructed from soft-graph edge logits, gradients can be propagated analytically to the edge logits. In the Vietoris–Rips setting, the edge logits are functions of point coordinates through pairwise distances, and hence gradients can also be propagated to the point-cloud coordinates. In this section, we focus on the definitions and roles of the objectives; the detailed backward computation and gradient formulas are given in the Appendix.

4.1 Low-Pass Spectral Filters

Let

L(\theta)=\widehat{L}_{q}(\theta)

be the Hodge-Laplacian-type spectral relaxation, and let

L(\theta)=U\Lambda U^{\top},\qquad\Lambda=\operatorname{diag}(\lambda_{1},\ldots,\lambda_{N})

be its eigendecomposition. In the hard ordinary-complex case, homological modes correspond to zero eigenvalues. In addition, near-zero modes with small positive eigenvalues may reflect geometrically meaningful structures such as almost-closed cycles, weakly filled holes, or unstable higher-order structures. Thus, it is useful to treat not only exact zero modes but also the low-eigenvalue region smoothly.

We call a smooth function $f:\mathbb{R}_{\geq 0}\to\mathbb{R}$ a low-pass spectral filter if it emphasizes low eigenvalues and suppresses high eigenvalues. The corresponding matrix function is defined by

f(L)=Uf(\Lambda)U^{\top},\qquad f(\Lambda)=\operatorname{diag}(f(\lambda_{1}),\ldots,f(\lambda_{N})).

In this work, we use the following representative filters.

The heat filter is defined by

f_{\mathrm{heat}}(\lambda)=\exp\left(-\frac{\lambda}{\tau}\right),\qquad F_{\mathrm{heat}}(L)=\exp\left(-\frac{L}{\tau}\right),

where $\tau>0$ is a temperature parameter. It preserves zero modes and exponentially suppresses high-eigenvalue modes.

The resolvent filter is defined by

f_{\mathrm{res}}(\lambda)=\frac{\alpha}{\lambda+\alpha},\qquad F_{\mathrm{res}}(L)=\alpha(L+\alpha I)^{-1},

where $\alpha>0$ . It also emphasizes low-eigenvalue modes, but its decay is rational rather than exponential.

The polynomial moment filter is defined by

f_{\mathrm{mom}}(\lambda)=(1-\alpha\lambda)^{d},\qquad F_{\mathrm{mom}}(L)=(I-\alpha L)^{d},

where $\alpha>0$ and $d\in\mathbb{N}$ . A zero mode contributes one, while high-eigenvalue modes are suppressed under suitable choices of $\alpha$ and $d$ . More precisely, this polynomial behaves as a low-pass filter only when the positive spectrum is mapped inside the unit disk; this condition is made explicit below. Since this filter is expressed as a polynomial of the Laplacian, it does not require eigendecomposition and is compatible with matrix-vector products, stochastic trace estimation, and structural connections to possible quantum or quantum–classical hybrid computation.

As a more stable polynomial approximation, one may also use a Chebyshev polynomial filter. After normalizing the Laplacian by an upper bound or estimate of its largest eigenvalue, for example

\widetilde{L}=\frac{2L-\lambda_{\max}I}{\lambda_{\max}},

so that its spectrum is approximately contained in $[-1,1]$ , define

f_{\mathrm{cheb}}(L)=\sum_{\ell=0}^{D}c_{\ell}T_{\ell}(\widetilde{L}),

where $T_{\ell}$ is the $\ell$ -th Chebyshev polynomial, $c_{\ell}$ are coefficients approximating a desired low-pass function, and $D$ is the polynomial degree. This filter can be computed recursively without eigendecomposition and is useful for approximating low-frequency spectral quantities in large-scale problems.

4.2 Spectral-Filter Matching Loss

One way to use a low-pass filter is to compare low-energy subspaces themselves. Let $\theta_{\mathrm{tar}}$ be a target parameter and define

L_{\mathrm{tar}}=\widehat{L}_{q}(\theta_{\mathrm{tar}}).

For a low-pass filter $f$ , we define the spectral-filter matching loss by

\mathcal{L}_{\mathrm{sf}}(\theta)=\frac{1}{2}\left\|f(\widehat{L}_{q}(\theta))-f(L_{\mathrm{tar}})\right\|_{F}^{2}.

This loss compares not only the number of zero modes but also the orientation and distribution of the low-eigenvalue subspace. Therefore, compared with barcode-based losses, it can use richer geometric and spectral information.

For the heat and resolvent filters, this gives

\mathcal{L}_{\mathrm{heat}}(\theta)=\frac{1}{2}\left\|\exp\left(-\frac{\widehat{L}_{q}(\theta)}{\tau}\right)-\exp\left(-\frac{L_{\mathrm{tar}}}{\tau}\right)\right\|_{F}^{2}

and

\mathcal{L}_{\mathrm{res}}(\theta)=\frac{1}{2}\left\|\alpha(\widehat{L}_{q}(\theta)+\alpha I)^{-1}-\alpha(L_{\mathrm{tar}}+\alpha I)^{-1}\right\|_{F}^{2}.

These losses act to move the current structure toward the low-energy subspace of the target structure. Although the heat and resolvent filters approximate the spectral projector onto the kernel in suitable parameter limits, they are not idempotent projectors for finite parameter values. For this reason, we refer to these objectives as spectral-filter matching losses rather than projector losses.

4.3 Trace-Type Spectral-Mass Betti Surrogate Loss

Another way to use a low-pass filter is to construct a smooth low-frequency spectral-mass surrogate for Betti-type information from its trace. For a low-pass filter $f$ , define

S_{f}(\theta)=\operatorname{Tr}f(\widehat{L}_{q}(\theta))=\sum_{i=1}^{N}f(\lambda_{i}(\theta)).

If $f(0)=1$ and $f(\lambda)\approx 0$ for high eigenvalues, then $S_{f}(\theta)$ approximates the number of zero and low-eigenvalue modes and can be regarded as a smooth surrogate for $\beta_{q}$ in the hard ordinary-complex setting, and as a low-frequency spectral mass in the soft setting.

Proposition 3 (Trace approximation of Betti numbers).

Let $K$ be a finite simplicial complex and let the eigenvalues of the ordinary Hodge Laplacian $L_{q}(K)$ be

0=\lambda_{1}=\cdots=\lambda_{\beta_{q}}<\lambda_{\beta_{q}+1}\leq\cdots\leq\lambda_{N}.

Let $f:\mathbb{R}_{\geq 0}\to\mathbb{R}$ satisfy $f(0)=1$ , and define

\varepsilon_{f}=\max_{i>\beta_{q}}|f(\lambda_{i})|.

Then

\left|\operatorname{Tr}f(L_{q}(K))-\beta_{q}(K)\right|\leq(N-\beta_{q})\varepsilon_{f}.

Proof.

Since $f(L_{q}(K))$ has eigenvalues $f(\lambda_{i})$ , we have

\operatorname{Tr}f(L_{q}(K))=\sum_{i=1}^{N}f(\lambda_{i})=\beta_{q}(K)+\sum_{i=\beta_{q}+1}^{N}f(\lambda_{i}).

Taking absolute values gives the stated bound. ∎

Given a target spectral-mass value $\tau_{q}$ , define

\mathcal{L}_{\mathrm{trace}}(\theta)=\frac{1}{2}\left(S_{f}(\theta)-\tau_{q}\right)^{2}.

In particular, for the polynomial moment filter, set

S_{q,d}(\theta)=\operatorname{Tr}\left(I-\alpha\widehat{L}_{q}(\theta)\right)^{d}

and define

\mathcal{L}_{\mathrm{mom}}(\theta)=\frac{1}{2}\left(S_{q,d}(\theta)-\tau_{q}\right)^{2}.

For an ordinary hard complex, if

\rho=\max_{\lambda_{i}>0}|1-\alpha\lambda_{i}|<1,

then

\left|\operatorname{Tr}(I-\alpha L_{q}(K))^{d}-\beta_{q}(K)\right|\leq(N-\beta_{q})\rho^{d}.

Thus, the parameters $\alpha$ and $d$ should be chosen together with an appropriate spectral scaling of the Laplacian. This loss moves the low-frequency spectral mass toward the target value without explicitly counting Betti numbers.

4.4 Normalized Spectral-Moment Betti Surrogate

When graph sizes or ambient chain-space dimensions differ, directly comparing trace values introduces scale dependence due to the number of simplices. Therefore, let

N_{q}=\dim C_{q}^{\mathrm{amb}}

and define the normalized moment

\overline{S}_{q,d}(\theta)=\frac{1}{N_{q}}\operatorname{Tr}\left(I-\alpha\widehat{L}_{q}(\theta)\right)^{d}.

In the hard ordinary-complex setting, this normalization corresponds to comparing a Betti-type quantity per $q$ -simplex. In the soft ambient setting, it should be interpreted as normalized low-frequency spectral mass. Given a target normalized value $\bar{\tau}_{q}$ , define

\mathcal{L}_{\mathrm{norm}}(\theta)=\frac{1}{2}\left(\overline{S}_{q,d}(\theta)-\bar{\tau}_{q}\right)^{2}.

This normalization yields topological losses that are more comparable across graphs or ambient complexes of different sizes.

4.5 Normalized First-Betti-Type Control in Graph Clique Complexes

In graph generation or graph-structured topology control, the optimization variables are the edge logits

a=\{a_{e}\}_{e\in E_{\max}}.

The edge activations $p_{e}(a)=\sigma(a_{e})$ define a soft clique complex and hence a Hodge-Laplacian-type spectral relaxation $\widehat{L}_{q}(a)$ . To control one-dimensional cyclic structures in the hard clique-complex limit, we take $q=1$ and use

\widehat{L}_{1}(a)=\widetilde{B}_{1}(a)^{\top}\widetilde{B}_{1}(a)+\widetilde{B}_{2}(a)\widetilde{B}_{2}(a)^{\top}+\mu(I-W_{1}(a)).

Define the Laplacian moment and its normalized version by

S_{1,d}(a)=\operatorname{Tr}\left(I-\alpha\widehat{L}_{1}(a)\right)^{d},\qquad\overline{S}_{1,d}(a)=\frac{1}{N_{1}}\operatorname{Tr}\left(I-\alpha\widehat{L}_{1}(a)\right)^{d},

where $N_{1}=\dim C_{1}^{\mathrm{amb}}$ . The quantity $\overline{S}_{1,d}(a)$ can be regarded as a normalized $\beta_{1}$ -type surrogate in the hard-limit sense, and as a normalized low-frequency $H_{1}$ -type spectral mass in the soft setting. Given a target normalized spectral-mass value $\bar{\tau}_{1}$ , define the graph topological loss by

\mathcal{L}_{\mathrm{graph}}^{\mathrm{topo}}(a)=\frac{1}{2}\left(\overline{S}_{1,d}(a)-\bar{\tau}_{1}\right)^{2}.

In practical graph-generation tasks, one may combine this topological loss with ordinary graph objectives such as edge density, degree distribution, clustering coefficient, or task-dependent losses. If such a base loss is denoted by $\mathcal{L}_{\mathrm{base}}(a)$ , the total loss is

\mathcal{L}_{\mathrm{total}}(a)=\mathcal{L}_{\mathrm{base}}(a)+\lambda_{\mathrm{topo}}\mathcal{L}_{\mathrm{graph}}^{\mathrm{topo}}(a),

where $\lambda_{\mathrm{topo}}>0$ controls the strength of topological regularization. This formulation allows one to optimize ordinary graph-structural objectives while controlling cyclic structures corresponding to $\beta_{1}$ of the clique complex. The ambient normalization above is the abstract formulation used in this section. In the numerical graph experiments below, we use an ordinary-complex or weighted-current-complex implementation and normalize by an effective edge count; the relation between these choices is discussed in Section 6.

4.6 Topology Control for Vietoris–Rips Filtrations

Let

X=\{x_{1},\ldots,x_{n}\}\subset\mathbb{R}^{d}

be a point cloud, and let

r_{1}<r_{2}<\cdots<r_{M}

be a sequence of scales. For each scale $r_{m}$ , define the edge logits by

a_{ij}^{(r_{m})}(X)=\frac{r_{m}-d_{ij}(X)}{\varepsilon},\qquad d_{ij}(X)=\sqrt{\|x_{i}-x_{j}\|^{2}+\delta}.

These logits define a soft graph, a soft clique complex, and a Hodge-Laplacian-type spectral relaxation

\widehat{L}_{q}^{(r_{m})}(X)

at each scale. Since persistent homology describes the birth and death of homology classes across scales, the corresponding Laplacian-based losses are defined as sums over scales.

4.6.1 Trace-Type Vietoris–Rips Loss

Barcode-based losses often reduce undesired homology by penalizing persistence, for example

\mathcal{L}_{\mathrm{PD}}(X)=\sum_{\gamma\in U}(d_{\gamma}-b_{\gamma})^{2}.

In the Laplacian-based formulation, we instead use the low-frequency spectral mass at each scale,

S_{q}^{(r_{m})}(X)=\operatorname{Tr}f\left(\widehat{L}_{q}^{(r_{m})}(X)\right).

To suppress $q$ -dimensional homology, one can minimize

\mathcal{L}_{\mathrm{VR}}^{\mathrm{sup}}(X)=\sum_{m=1}^{M}\omega_{m}\operatorname{Tr}f\left(\widehat{L}_{q}^{(r_{m})}(X)\right),

where $\omega_{m}\geq 0$ are scale weights.

Conversely, to preserve or generate a prescribed topological structure, one can specify target low-frequency spectral masses $\tau_{q}^{(r_{m})}$ and define

\mathcal{L}_{\mathrm{VR}}^{\mathrm{trace}}(X)=\frac{1}{2}\sum_{m=1}^{M}\omega_{m}\left(\operatorname{Tr}f\left(\widehat{L}_{q}^{(r_{m})}(X)\right)-\tau_{q}^{(r_{m})}\right)^{2}.

This loss can be interpreted as a smooth Laplacian-based approximation of the low-frequency $H_{q}$ -type spectral mass present at each scale. In the hard limit and with sufficiently sharp filters, this quantity is related to the Betti profile across scales, but it is not itself a persistence-diagram loss.

4.6.2 Spectral-Filter Matching Vietoris–Rips Loss

When a target point cloud $X_{\mathrm{tar}}$ is given, we can match low-energy subspaces at each scale. Define

L_{m}(X)=\widehat{L}_{q}^{(r_{m})}(X),\qquad L_{m}^{\mathrm{tar}}=\widehat{L}_{q}^{(r_{m})}(X_{\mathrm{tar}}).

Then

\mathcal{L}_{\mathrm{VR}}^{\mathrm{sf}}(X)=\frac{1}{2}\sum_{m=1}^{M}\omega_{m}\left\|f(L_{m}(X))-f(L_{m}^{\mathrm{tar}})\right\|_{F}^{2}.

For the heat and resolvent filters, this gives

\mathcal{L}_{\mathrm{VR}}^{\mathrm{heat}}(X)=\frac{1}{2}\sum_{m=1}^{M}\omega_{m}\left\|\exp\left(-\frac{\widehat{L}_{q}^{(r_{m})}(X)}{\tau}\right)-\exp\left(-\frac{\widehat{L}_{q}^{(r_{m})}(X_{\mathrm{tar}})}{\tau}\right)\right\|_{F}^{2}

and

\mathcal{L}_{\mathrm{VR}}^{\mathrm{res}}(X)=\frac{1}{2}\sum_{m=1}^{M}\omega_{m}\left\|\alpha\left(\widehat{L}_{q}^{(r_{m})}(X)+\alpha I\right)^{-1}-\alpha\left(\widehat{L}_{q}^{(r_{m})}(X_{\mathrm{tar}})+\alpha I\right)^{-1}\right\|_{F}^{2}.

Unlike barcode-based losses, which compare birth–death pairs, this loss compares low-energy subspaces at each scale. Therefore, it is designed to reduce gradient localization on critical simplices and to provide update directions that retain more geometric and spectral information. Since the Vietoris–Rips edge logits are functions of the point-cloud coordinates, these losses are differentiable with respect to the point coordinates.

4.7 Practical Choice of Objectives

The objectives above are intended for different uses. Spectral-filter matching losses are appropriate when a target point cloud, graph, or simplicial complex is available and one wants to align low-frequency Hodge spectral structure with that target. Trace-type spectral-mass losses are appropriate when the goal is to control the amount of $q$ -dimensional topological structure without specifying a target low-eigenvalue subspace. Normalized moment losses are useful when comparing complexes with different numbers of simplices or when controlling graph clique complexes.

Table 1: Practical interpretation of the proposed Hodge-spectral objectives.

Goal	Suggested objective	Interpretation
Match a target point cloud or target complex	Spectral-filter matching loss	Aligns low-frequency Hodge spectral structure with that of a target object.
Control the amount of $q$ -dimensional topology	Trace-type spectral-mass loss	Moves the low-frequency spectral mass toward a prescribed value.
Control a Betti-type profile across scales	Scale-wise Vietoris–Rips spectral loss	Controls low-frequency $H_{q}$ -type structure over selected scales.
Control cyclic structure in graph clique complexes	Normalized polynomial moment loss	Controls a normalized $\beta_{1}$ -type spectral quantity.
Scale to larger complexes	Polynomial, Chebyshev, or trace-estimation-based filters	Avoids full eigendecomposition and uses matrix-vector or trace-estimation routines.

This table also clarifies the numerical experiments below. The Vietoris–Rips experiments examine whether spectral-filter losses provide useful gradients for point-cloud topology control, while the graph experiments examine whether normalized polynomial moments can control first-Betti-type quantities of clique complexes.

5 Numerical Experiments on Vietoris–Rips Topology Control

In this section, we compare topology control based on persistent homology with the proposed Hodge spectral-filter losses on Vietoris–Rips filtrations. The purpose of these experiments is to examine whether the proposed Hodge-spectral losses can control $H_{1}$ -type structures without explicitly optimizing barcode diagrams, and whether they alleviate some of the characteristic difficulties of persistence-diagram losses, such as gradient localization and sensitivity to persistence-pairing changes. These experiments are intended to study the behavior of the optimization signals rather than to replace persistent homology as a topological descriptor.

5.1 Experimental Setup

Let

X=\{x_{1},\ldots,x_{n}\}\subset\mathbb{R}^{2}

be a point cloud. For each scale $r$ , we consider the Vietoris–Rips complex

\mathrm{VR}_{r}(X).

In the persistent-homology baseline, the loss is defined directly from the $H_{1}$ barcode. For example, to suppress or control one-dimensional homological features, we use losses based on the persistence values

d_{\gamma}-b_{\gamma}

of $H_{1}$ bars.

For the proposed Hodge-based losses, we construct a Hodge-Laplacian-type spectral relaxation

\widehat{L}_{1}^{(r)}(X)

for each scale $r$ , and then apply a low-pass spectral filter. We mainly use the heat filter

F_{\tau}^{\mathrm{heat},(r)}(X)=\exp\left(-\frac{\widehat{L}_{1}^{(r)}(X)}{\tau}\right)

and the resolvent filter

F_{\epsilon}^{\mathrm{res},(r)}(X)=\epsilon\left(\widehat{L}_{1}^{(r)}(X)+\epsilon I\right)^{-1}.

For multi-scale objectives, we sum these quantities over a finite set of scales

I=\{r_{1},\ldots,r_{m}\}.

The Hodge spectral-filter losses are combined with mild geometric regularization terms to avoid degenerate point configurations such as excessive collapse or excessive spreading. These regularization terms are used only to keep the point-cloud geometry numerically well behaved; the topological signal itself is provided by the Hodge spectral filters.

5.2 Multi-Scale Topology Control from Random Point Clouds

We first test whether the Hodge spectral-filter losses can induce low-frequency $H_{1}$ -type structures across multiple Vietoris–Rips scales. We initialize $n=36$ points in the square $[-1,1]^{2}$ , and use the scale interval

I=\{0.42,0.46,0.50,0.54,0.58,0.62\}.

We compare a persistent-homology baseline, the Hodge heat objective, and the Hodge resolvent objective.

Figure 1 shows the initial point cloud and the optimized point clouds after 10 epochs. The Hodge objectives, especially the resolvent objective, produce point clouds with more pronounced loop-like structures than the PH baseline in this example.

Refer to caption — Figure 1: Multi-scale topology control from a random point cloud. The exact Betti-profile sum is used only as a diagnostic of the optimized point clouds.

We evaluate the exact Betti number over the scale interval by

\sum_{r\in I}\beta_{1}(\mathrm{VR}_{r}(X)).

This quantity is used as an exact Betti-profile diagnostic of the optimized point cloud; it is not the differentiable objective optimized by the Hodge spectral-filter losses. The initial point cloud has value $1$ , the PH baseline reaches $2$ , the Hodge heat objective reaches $10$ , and the Hodge resolvent objective reaches $28$ . Thus, the Hodge spectral-filter losses increase low-frequency $H_{1}$ -type spectral structure across the selected scale range without directly optimizing persistence diagrams. This result should be interpreted as a scale-wise Hodge-spectral effect rather than as direct optimization of a persistence diagram.

5.3 Gradient Localization

Next, we compare the localization of gradients for PH losses and Hodge spectral-filter losses. The PH loss is based on the squared sum of finite $H_{1}$ persistence values, while the Hodge losses are based on heat and resolvent trace sums over the selected scales.

Figure 2 visualizes the gradient directions for an input point cloud. The PH squared-sum gradient is concentrated on a small number of points, whereas the Hodge heat and Hodge resolvent gradients are distributed more globally around the point cloud for this input.

To quantify this behavior, we compute the entropy of the gradient norm distribution and the mass carried by the top $10\%$ of points with largest gradient norms. The results are shown in Figure 3. The PH squared-sum loss has gradient entropy $1.3863$ and top- $10\%$ mass $0.5000$ . The Hodge heat trace-sum loss has entropy $2.4623$ and top- $10\%$ mass $0.2049$ . The Hodge resolvent trace-sum loss has entropy $2.4740$ and top- $10\%$ mass $0.2018$ .

These results are consistent with the interpretation that PH losses tend to concentrate gradients around critical simplices, while Hodge spectral-filter losses provide more spatially distributed update directions. The comparison concerns the spatial distribution of gradient norms in these experiments, not a general dominance statement for all PH-based objectives.

5.4 Pairing Instability Stress Test

We next examine sensitivity to persistence-pairing changes. We construct a point cloud with two similar loop-like structures and vary a parameter $\alpha$ , which changes the relative sizes of the two loops. Representative point clouds for $\alpha=-0.14$ , $\alpha=0$ , and $\alpha=0.14$ are shown in Figure 4.

For the PH baseline, we use a loss based on the largest $H_{1}$ persistence value. Around $\alpha=0$ , the two dominant $H_{1}$ bars can exchange their order, causing a sharp change in which bar is selected by the loss. For the Hodge method, we use an interval Hodge spectral-filter loss over the scale interval $[0.98,1.12]$ .

Figure 5 shows the normalized losses and their derivatives with respect to $\alpha$ . The PH loss has a sharp derivative jump near the pairing switch, whereas the interval Hodge spectral-filter loss changes more smoothly after normalization by the loss range.

Quantitatively, the maximum raw derivative jump is $1.1773$ for the PH loss and $15.8489$ for the interval Hodge spectral-filter loss. Thus, the Hodge loss is not uniformly smoother in raw scale. However, the absolute magnitudes and dynamic ranges of the two losses are different. After normalization by the loss range, the maximum derivative jump is $11.2194$ for the PH loss and $1.6137$ for the Hodge loss. Therefore, in this experiment, the Hodge interval loss exhibits a substantially smaller derivative jump relative to its own scale. This supports the interpretation that the Hodge-spectral objective provides a smoother optimization signal in a scale-normalized sense.

5.5 Shape Synthesis by Analytic Gradients

We next test whether the analytic gradients of the losses can guide point-cloud shape synthesis. We parameterize a radial point cloud by

x_{i}(r_{i})=(r_{i}\cos\theta_{i},r_{i}\sin\theta_{i}),

where the angles $\theta_{i}$ are fixed and the radii $r_{i}$ are optimized. The target shape is a wavy ring. We compare PH gradient updates, Hodge heat filter gradient updates, and Hodge resolvent filter gradient updates.

Figure 6 shows the recovered radial profiles and the mean RMSE with respect to the target radial profile. The PH gradient update gives final RMSE

0.1504\pm 0.0172,

whereas the Hodge heat projector gives

0.0324\pm 0.0092,

and the Hodge resolvent projector gives

0.0224\pm 0.0054.

Thus, the Hodge spectral-filter gradients recover the target shape more accurately in this controlled radial-shape experiment.

Figure 7 visualizes the initial circle, target wavy shape, and the results obtained by the three update rules. The Hodge heat and resolvent filter updates follow the target deformation more closely than the PH update.

We also compute the cosine similarity between the first update direction and the desired target deformation direction. The PH gradient has mean cosine similarity $-0.2156$ , the Hodge heat filter has $0.2143$ , and the Hodge resolvent filter has $0.8311$ . This indicates that the resolvent filter provides an initial update direction strongly aligned with the desired geometric deformation in this experiment.

The first-gradient entropy is $1.2627$ for PH, $2.5367$ for Hodge heat, and $2.5274$ for Hodge resolvent. Again, the Hodge spectral-filter gradients are less localized and more globally distributed according to this entropy metric.

5.6 Runtime Comparison

Finally, we compare the cost of one loss-gradient evaluation. We vary the number of points as

n=12,16,20,24,28,32,36,40,48,56,64,70,80,90,100.

For each $n$ , we measure the runtime of one gradient step for PH, Hodge heat, and Hodge resolvent losses. The results are shown in Figure 8.

At $n=100$ , the PH gradient evaluation takes

3.4515\ \mathrm{s},

the Hodge heat gradient evaluation takes

60.4083\ \mathrm{s},

and the Hodge resolvent gradient evaluation takes

43.5290\ \mathrm{s}.

Thus, at this scale, Hodge heat is about $17.5$ times slower than PH, and Hodge resolvent is about $12.6$ times slower.

This increased cost is expected because the Hodge spectral-filter losses use richer spectral information. In the current implementation, dense matrix operations are used, and the number of ambient edges and triangles grows rapidly with $n$ . Therefore, applying the Hodge spectral-filter losses to larger problems will require more scalable implementations, such as sparse linear algebra, Chebyshev polynomial approximation, and stochastic trace estimation. Possible quantum or quantum–classical trace-estimation approaches are a separate future direction and are not used in the present experiments.

6 Normalized First-Betti-Type Control on Graph Clique Complexes via Laplacian Moments

In this section, we apply the Laplacian-moment topological loss introduced in Section 4 to normalized first-Betti-type control in graph clique complexes. In particular, we focus on the first homology of the clique complex of a graph and control a quantity corresponding to the normalized Betti number

\frac{\beta_{1}}{|K_{1}|},

where $K_{1}$ denotes the set of $1$ -simplices, namely the set of edges, in the clique complex. In the soft optimization problem below, this hard normalized Betti number is used as an evaluation quantity, while the differentiable loss is applied to a calibrated soft spectral moment.

In the theoretical formulation developed in the previous sections, we constructed a Hodge-Laplacian-type spectral relaxation on a fixed ambient chain space and introduced a penalty term to remove inactive directions. However, in the numerical experiments in this section, we do not use the full ambient penalty-regularized operator. Instead, we use a weighted-current-complex implementation: the edge and triangle spaces are represented by the current soft graph weights, weighted boundary matrices are assembled from these weights, and the moment is computed without adding an inactive ambient block. For sampled hard graphs, this construction reduces to the ordinary Hodge Laplacian on the sampled clique complex. In this case, the chain space itself corresponds to the current weighted or sampled clique complex, and inactive simplex directions do not exist as a separate ambient block. Therefore, no penalty term is required to push inactive directions away from the low-energy region.

We first explain why the Laplacian moment based on the ordinary Hodge Laplacian can be theoretically justified as a surrogate for the normalized Betti number in the hard-complex setting. We then discuss the relation between the ordinary-complex computation, the ambient formulation, and polynomial trace estimation. Finally, we present the results of our numerical experiments.

6.1 Normalized Betti Surrogate Based on the Ordinary Hodge Laplacian for Hard Clique Complexes

Let $G=(V,E)$ be a graph, and let its clique complex be $K=X(G)$ . Let $C_{q}(K)$ be the $q$ -th chain space, and let $B_{q}:C_{q}(K)\to C_{q-1}(K)$ be the boundary operator. The ordinary $q$ -th Hodge Laplacian is defined by

L_{q}(K)=B_{q}^{\top}B_{q}+B_{q+1}B_{q+1}^{\top}.

By the Hodge decomposition,

\ker L_{q}(K)\simeq H_{q}(K).

Therefore, $\dim\ker L_{q}(K)=\dim H_{q}(K)=\beta_{q}(K)$ . In particular, for $q=1$ , $\dim\ker L_{1}(K)=\beta_{1}(K)$ .

The important point is that $L_{q}(K)$ is defined not on an ambient space but on the actual chain space $C_{q}(K)$ of the clique complex $K$ . Hence, the inactive simplex directions that appear in the ambient formulation do not exist in this space. Therefore, when using the ordinary Hodge Laplacian, zero modes directly correspond to homology classes, and the penalty term

\mu(I-W_{q})

for removing inactive directions is unnecessary.

This subsection serves as the hard-complex reference. In the soft optimization experiments below, the differentiable objective is evaluated using weighted boundary matrices. The final topological evaluation is then performed by sampling hard graphs from the optimized edge probabilities and computing the ordinary Betti number of the sampled clique complexes.

6.2 Laplacian Moments and Normalized Betti Numbers

To treat normalized Betti numbers as differentiable objectives in the hard ordinary-complex setting, we use the polynomial moment filter

f_{d}(\lambda)=(1-\alpha\lambda)^{d},

where $\alpha>0$ and $d\in\mathbb{N}$ .

For the ordinary Hodge Laplacian $L_{q}(K)$ , define the Laplacian moment by

S_{q,d}(K)=\operatorname{Tr}\left(I-\alpha L_{q}(K)\right)^{d}.

We then normalize this quantity by the number of $q$ -simplices,

N_{q}=\dim C_{q}(K)=|K_{q}|,

and define

\overline{S}_{q,d}(K)=\frac{\operatorname{Tr}\left(I-\alpha L_{q}(K)\right)^{d}}{N_{q}}.

Let the eigenvalues of $L_{q}(K)$ be

0=\lambda_{1}=\cdots=\lambda_{\beta_{q}}<\lambda_{\beta_{q}+1}\leq\cdots\leq\lambda_{N_{q}}.

Then

S_{q,d}(K)=\sum_{i=1}^{N_{q}}(1-\alpha\lambda_{i})^{d}.

The contribution from each zero eigenvalue is equal to one. Therefore,

S_{q,d}(K)=\beta_{q}(K)+\sum_{i=\beta_{q}+1}^{N_{q}}(1-\alpha\lambda_{i})^{d}.

If $\alpha$ is chosen so that

\rho=\max_{\lambda_{i}>0}|1-\alpha\lambda_{i}|<1,

then

\left|S_{q,d}(K)-\beta_{q}(K)\right|\leq(N_{q}-\beta_{q})\rho^{d}.

Therefore, for the normalized quantity,

\left|\overline{S}_{q,d}(K)-\frac{\beta_{q}(K)}{N_{q}}\right|\leq\frac{N_{q}-\beta_{q}}{N_{q}}\rho^{d}\leq\rho^{d}.

Thus, when $\rho^{d}$ is sufficiently small,

\overline{S}_{q,d}(K)\approx\frac{\beta_{q}(K)}{N_{q}}.

In particular, for $q=1$ ,

\overline{S}_{1,d}(K)=\frac{\operatorname{Tr}\left(I-\alpha L_{1}(K)\right)^{d}}{|K_{1}|}\approx\frac{\beta_{1}(K)}{|K_{1}|}.

In this sense, the Laplacian moment based on the ordinary Hodge Laplacian is a theoretically justified surrogate for the normalized Betti number when the spectrum is scaled so that the positive eigenvalues are suppressed by the polynomial filter.

6.3 Relation to the Ambient Formulation

In the theoretical formulation of the previous sections, the Hodge-Laplacian-type spectral relaxation was defined on an ambient chain space containing all candidate simplices. In that setting, inactive simplex directions remain as ambient basis directions, and a penalty term is needed to exclude them from the low-energy region.

In the hard limit, the penalty-regularized ambient Hodge Laplacian decomposes as

\widehat{L}_{q}\to L_{q}(K)\oplus\mu I_{\mathrm{inactive}}.

Here, $L_{q}(K)$ is the ordinary Hodge Laplacian on the active complex, and $\mu I_{\mathrm{inactive}}$ is the block acting on inactive directions.

For the Laplacian moment, we obtain

\operatorname{Tr}\left(I-\alpha\widehat{L}_{q}\right)^{d}\to\operatorname{Tr}\left(I-\alpha L_{q}(K)\right)^{d}+N_{q}^{\mathrm{inactive}}(1-\alpha\mu)^{d}.

Therefore, if $\alpha$ and $\mu$ are chosen so that

|1-\alpha\mu|^{d}\ll 1,

then the forward contribution from the inactive block is negligible.

Moreover, the derivative of the moment filter is

f_{d}^{\prime}(\lambda)=-\alpha d(1-\alpha\lambda)^{d-1}.

Thus, the backward contribution from inactive directions is controlled by

\alpha d|1-\alpha\mu|^{d-1}.

Therefore, if

|1-\alpha\mu|^{d-1}\ll 1,

then inactive directions also have negligible influence on the gradient.

In particular, when

\alpha\mu\simeq 1,

the inactive block contributes almost nothing to either the forward or backward computation of the Laplacian moment. In this sense, the numerical computation in this section using the ordinary Hodge Laplacian can be viewed as the limit of the ambient formulation in which inactive directions are completely removed, or as the regime in which their contribution is negligible.

Lemma 4 (Ambient-to-ordinary reduction for polynomial moments).

Suppose that, in the hard limit, the ambient operator decomposes as

\widehat{L}_{q}=L_{q}(K)\oplus\mu I_{\mathrm{inactive}}.

Let $f_{d}(\lambda)=(1-\alpha\lambda)^{d}$ . Then

\operatorname{Tr}f_{d}(\widehat{L}_{q})=\operatorname{Tr}f_{d}(L_{q}(K))+N_{q}^{\mathrm{inactive}}(1-\alpha\mu)^{d}.

Moreover,

f_{d}^{\prime}(\mu)=-\alpha d(1-\alpha\mu)^{d-1}.

Thus, if $|1-\alpha\mu|^{d}$ and $|1-\alpha\mu|^{d-1}$ are small, both the forward moment contribution and the local filter derivative associated with the inactive block are negligible. In this regime, the polynomial moment of the ambient operator reduces to the ordinary-complex moment up to a negligible inactive-block contribution.

6.4 Relation to Polynomial Trace Estimation

The Laplacian moment

\operatorname{Tr}(I-\alpha L_{q})^{d}

has the same general form as polynomial trace quantities used in normalized Betti-number estimation. Since zero modes contribute one and suitably filtered nonzero modes are suppressed, such traces can approximate normalized kernel dimensions when the spectrum is properly scaled. This connection is useful computationally because polynomial filters can be evaluated by matrix-vector products and can be combined with stochastic trace estimation.

This viewpoint is closely related to recent trace-estimation approaches for estimating normalized Betti numbers of clique complexes [lloyd2016quantum, akhalwaya2024comparing, gyurik2022towards]. In such approaches, Betti numbers are represented through the null space of a combinatorial Laplacian or a related reflected operator, and polynomial traces are used to approximate the normalized dimension of this null space. The polynomial moment used in this paper has the same mathematical form, but here it is used as a differentiable objective inside a graph optimization loop.

The same polynomial-trace structure is also compatible with quantum trace-estimation viewpoints. However, this paper does not implement a quantum algorithm and does not claim quantum advantage. We therefore treat the quantum connection only as a structural observation and a possible future direction. The main contribution of this section is the use of Laplacian moments as differentiable first-Betti-type objectives for graph clique complexes.

6.5 Numerical Results

In the numerical experiments, each candidate edge $e=(i,j)$ is assigned an edge logit $a_{e}$ . The edge probability is defined by

p_{e}=\sigma(a_{e})=\frac{1}{1+\exp(-a_{e})}.

For a triangle $\sigma=(i,j,k)$ , the weight is defined by

w_{\sigma}=\prod_{e\subset\sigma}p_{e}.

This gives a weighted clique complex from a soft graph. We then compute the weighted-current-complex version of the first Hodge Laplacian.

Let

R_{1}=\operatorname{diag}(\sqrt{p_{e}})_{e\in E_{\max}},\qquad R_{2}=\operatorname{diag}(\sqrt{w_{\sigma}})_{\sigma\in K_{\max}^{(2)}}.

Since vertex weights are fixed to one, we use

\widetilde{B}_{1}=B_{1}R_{1},\qquad\widetilde{B}_{2}=R_{1}B_{2}R_{2}.

The soft weighted first Hodge-Laplian-type operator used in the experiments is

L_{1}^{\mathrm{soft}}=\widetilde{B}_{1}^{\top}\widetilde{B}_{1}+\widetilde{B}_{2}\widetilde{B}_{2}^{\top}.

Unlike the ambient operator in Section 3, this implementation does not include the inactive-direction penalty term.

We define $M=I-\alpha L_{1}^{\mathrm{soft}}$ and compute the Laplacian moment

S_{1,d}=\operatorname{Tr}(M^{d}).

For normalization, we use the effective edge count $N_{1}^{\mathrm{eff}}=\sum_{e}p_{e}$ , and define

\overline{S}_{1,d}=\frac{S_{1,d}}{N_{1}^{\mathrm{eff}}+\delta}.

Here $N_{1}^{\mathrm{eff}}$ is the soft analogue of the number of active edges $|K_{1}|$ , and $\delta>0$ is a small numerical stabilizer. Given a fixed soft moment target $\bar{s}_{1}^{\star}$ , the soft spectral-moment Betti loss is

\mathcal{L}_{\mathrm{Betti}}=\frac{1}{2}\left(\overline{S}_{1,d}-\bar{s}_{1}^{\star}\right)^{2}.

Here, $\bar{s}_{1}^{\star}$ is the target used in the differentiable soft optimization. It should be distinguished from the hard normalized Betti target $\bar{\beta}_{1}^{\star}$ , which is used to evaluate sampled hard graphs after optimization.

Soft and hard targets.

The soft moment target $\bar{s}_{1}^{\star}$ and the hard normalized Betti target $\bar{\beta}_{1}^{\star}$ are conceptually different quantities. The loss above optimizes the differentiable soft moment $\overline{S}_{1,d}$ toward $\bar{s}_{1}^{\star}$ . After optimization, we sample hard graphs from the final edge probabilities and compute the ordinary normalized Betti number $\beta_{1}/|K_{1}|$ of their clique complexes. This sampled hard value is then compared with the hard target $\bar{\beta}_{1}^{\star}$ .

In the present proof-of-concept experiments, the soft target values are specified before optimization and kept fixed during each run. We do not claim an analytic or universal conversion rule from $\bar{\beta}_{1}^{\star}$ to $\bar{s}_{1}^{\star}$ . Instead, the experiments test whether optimizing the prescribed soft spectral moment induces sampled hard graphs whose normalized first Betti numbers move toward the desired hard target. The soft targets used in each experiment are reported explicitly in the tables.

In the experiments, we used $\alpha=0.15$ as the basic setting, and mainly used $d=8$ .

The edge logits were optimized using an Adam-type update, and both forward and backward computations were performed using the ordinary Hodge Laplacian.

6.5.1 Convergence of the Path-Integral Estimator

We first compare the path-integral estimator with the exact value of the trace-gradient

\operatorname{Tr}(M^{d-1}C_{u})

for a fixed soft graph and a fixed direction $u$ . In this experiment, the number of nodes is set to $n=8$ , and the moment order is set to $d=6$ . The number of Monte Carlo samples is varied over

N_{\mathrm{MC}}\in\{100,300,1000,3000,10000\},

and the estimator is evaluated multiple times for each sample size.

Figure 9 compares the mean and standard deviation of the path-integral estimator with the exact trace-gradient. The dashed line represents the exact value, the blue curve represents the mean of the path-integral estimator, and the error bars represent the standard deviation. When the number of samples is small, the variance is large and the estimator deviates significantly from the exact value. As $N_{\mathrm{MC}}$ increases, the estimator approaches the exact value, and in particular, at $N_{\mathrm{MC}}=10000$ , it reaches the neighborhood of the exact value.

Figure 9 shows the mean absolute error with respect to the exact value. The mean absolute errors are $1.2014$ for $N_{\mathrm{MC}}=100$ , $0.4782$ for $N_{\mathrm{MC}}=300$ , $0.6399$ for $N_{\mathrm{MC}}=1000$ , $0.2626$ for $N_{\mathrm{MC}}=3000$ , and $0.1518$ for $N_{\mathrm{MC}}=10000$ .

Table 2: Mean absolute error of the path-integral trace-gradient estimator.

$N_{\mathrm{MC}}$	Mean absolute error
100	1.2014
300	0.4782
1000	0.6399
3000	0.2626
10000	0.1518

At $N_{\mathrm{MC}}=1000$ , the error temporarily increases. This is due to the variance caused by a finite number of Monte Carlo trials. For larger sample sizes, the error decreases again, confirming the tendency of the path-integral estimator to approach the exact trace-gradient. These results show that the proposed path-integral backward estimator functions as a Monte Carlo estimator of the trace-gradient.

6.5.2 Convergence of Betti Loss Optimization

Next, we verify whether the proposed Betti loss can actually be minimized. In this experiment, the number of nodes is set to $n=10$ , the hard normalized Betti target is set to $\bar{\beta}_{1}^{\star}=0.10$ , and the soft moment target used in the differentiable loss is fixed to $\bar{s}_{1}^{\star}=3.0717$ . The optimization is performed using the exact coordinate gradient and is terminated when the improvement of the loss reaches a plateau.

Figure 10 shows the evolution of the Betti loss over optimization steps. The loss rapidly decreases in the early stage and then converges near zero with small oscillations. This result shows that the proposed soft projector Betti loss provides an effective learning signal for the edge logits.

Figure 10 shows the corresponding trajectory of the soft Betti moment. The dotted line represents the fixed soft moment target $\bar{s}_{1}^{\star}$ . The soft moment initially decreases below the target, then oscillates and returns toward the target, eventually stabilizing in its neighborhood. This behavior demonstrates that the Betti loss directly controls the soft moment toward the prescribed soft target.

In this experiment, the optimization reaches a plateau after 45 steps. The final loss is $1.09\times 10^{-5}$ , the final soft spectral moment is $3.0763$ , and the fixed soft target is $\bar{s}_{1}^{\star}=3.0717$ . Thus, the absolute error with respect to the soft target is reduced to $0.0047$ .

Table 3: Final result of Betti loss optimization for hard normalized Betti target

0.10

. The loss is optimized against the fixed soft moment target

\bar{s}_{1}^{\star}=3.0717

. The hard target is used for evaluation after sampling hard graphs from the optimized edge probabilities.

Quantity	Value
Number of steps	45
Final loss	$1.09\times 10^{-5}$
Final soft spectral moment	3.0763
Fixed soft target $\bar{s}_{1}^{\star}$	3.0717
Soft target error	0.0047
Sampled hard normalized $\beta_{1}$ mean	0.1014
Sampled hard normalized $\beta_{1}$ std	0.0904
Hard target $\bar{\beta}_{1}^{\star}$	0.1000
Hard target error	0.0014
Sampled edge density mean	0.2564

These results confirm that minimizing the soft spectral-moment Betti loss drives the soft moment very close to the fixed soft target. Moreover, when hard graphs are sampled from the final soft graph, the mean ordinary hard normalized $\beta_{1}$ of the sampled clique complexes is $0.1014$ , yielding an error of only $0.0014$ with respect to the hard target $0.10$ . Therefore, in this example, the minimization of the soft Betti loss is reflected in the topology of the sampled hard graphs.

On the other hand, the hard $\beta_{1}$ of a single thresholded graph is $0.0$ . This is because converting a soft graph into a hard graph by a single threshold is a discrete operation and is sensitive to the threshold value. Therefore, in these experiments, we evaluate the final hard topology by sampling multiple hard graphs from the edge probabilities and computing the mean normalized Betti number. This evaluation better reflects the probabilistic output of the soft generative model.

6.5.3 Topology-Controlled Graph Generation

We next examine whether the generated graph topology follows different hard target normalized Betti values. In this experiment, the number of nodes is set to $n=15$ , and the hard target values are chosen as

\bar{\beta}_{1}^{\star}\in\{0.02,0.05,0.10,0.20\}.

For each hard target, we specify a fixed soft moment target $\bar{s}_{1}^{\star}$ used in the differentiable loss. These soft targets are reported in Table 4. For all targets, optimization is initialized from the same soft graph.

Figure 11 shows the relationship between the target normalized Betti value and the sampled hard normalized $\beta_{1}$ after optimization. The black dashed line indicates $y=x$ , the orange dotted line indicates the mean $\beta_{1}$ of the shared initial graph, and the green points with error bars indicate the mean and standard deviation of the sampled hard $\beta_{1}$ after optimization.

The mean sampled hard normalized $\beta_{1}$ of the initial graph is $0.1604$ , with standard deviation $0.0769$ . Therefore, for targets $0.02$ and $0.05$ , the optimization must decrease $\beta_{1}$ , while for target $0.20$ , it must increase $\beta_{1}$ . As shown in Figure 11, the optimized sampled hard $\beta_{1}$ increases monotonically with the target and moves from the initial graph value toward each target.

The final results are shown in Table 4.

Table 4: Topology-controlled graph generation. “Soft mom.” denotes the optimized soft spectral moment

\overline{S}_{1,d}

, and “Soft target” denotes the fixed soft moment target

\bar{s}_{1}^{\star}

used in the differentiable loss. Hard

\beta_{1}

values are computed from sampled hard clique complexes and compared with the hard target

\bar{\beta}_{1}^{\star}

Target $\beta_{1}$	Steps	Soft mom.	Soft target	Soft err.	Hard $\beta_{1}$ mean	Hard $\beta_{1}$ std	Hard err.	Density
0.02	176	0.2562	0.2492	0.0070	0.0068	0.0135	0.0132	0.5671
0.05	43	0.8180	0.8878	0.0699	0.0447	0.0457	0.0053	0.4713
0.10	50	1.3477	1.3480	0.0002	0.1173	0.0744	0.0173	0.3978
0.20	59	2.4705	2.4937	0.0231	0.1999	0.0831	0.0001	0.2997

Figure 12 compares the soft target error and the hard target error for each target. The soft error is relatively large for target $0.05$ , but the hard error remains small overall. In particular, for target $0.20$ , the hard error is almost zero. This indicates that the proposed method can generate graphs close to the desired target not only in terms of the prescribed soft moment but also in terms of the hard topology after sampling.

These results confirm that the final sampled hard normalized $\beta_{1}$ generally increases as the target increases. In particular, for targets $0.05$ and $0.20$ , the hard target errors are $0.0053$ and $0.0001$ , respectively, indicating that graphs very close to the target topology are generated. For target $0.10$ , the mean hard $\beta_{1}$ is $0.1173$ , with an error of $0.0173$ , but it still moves from the initial value $0.1604$ toward the target. For target $0.02$ , the final mean is $0.0068$ , which slightly underestimates the target, but the optimization successfully reduces $\beta_{1}$ significantly from the initial value.

The sampled edge density decreases as the target $\beta_{1}$ increases, taking values $0.5671$ , $0.4713$ , $0.3978$ , and $0.2997$ . This is because, when the edge density is too high, loops are more likely to be filled by triangles, reducing the first Betti number. Therefore, the proposed method does not merely increase or decrease edges. Rather, it controls $\beta_{1}$ through the balance between loops and filling triangles in the clique complex.

These results show that topology-controlled graph generation with different hard target normalized Betti values can be achieved using the soft spectral-moment Betti loss together with fixed soft moment targets.

6.5.4 Combination with Graph Feature Loss

Finally, we examine whether the soft spectral-moment Betti loss can be combined with another graph feature loss. As an independent graph feature, we use the soft degree variance. The objective function is defined as the sum of the soft spectral-moment Betti loss and the degree variance loss. The target hard normalized $\beta_{1}$ is fixed to $0.10$ , while the degree variance target is varied over $0.41$ , $0.45$ , and $0.49$ . For each condition, we run experiments with five random seeds.

As a baseline, we use a variance-only optimization method, which optimizes only the degree variance. This baseline approaches the degree variance target but does not use the Betti loss. Therefore, it serves as a control experiment to check whether the Betti topology is preserved.

Figure 13 shows the sampled hard normalized $\beta_{1}$ for the variance-only baseline and the joint optimization. The dashed horizontal line indicates the Betti target $0.10$ . In the variance-only baseline, the hard $\beta_{1}$ remains around $0.16$ for all variance targets, deviating significantly from the Betti target. In contrast, the joint optimization keeps the hard $\beta_{1}$ around $0.11$ , close to the target $0.10$ .

Figure 13 shows the achieved soft degree variance. The dashed line indicates the degree variance target. Both the variance-only baseline and the joint optimization follow the degree variance target well. In particular, the joint optimization does not significantly degrade the control of degree variance.

The numerical results are shown in Table 5.

Table 5: Variance-only and joint optimization.

Var. target	Method	Var. mean	Var. std	Hard $\beta_{1}$ mean	Hard $\beta_{1}$ std	Hard $\beta_{1}$ err.
0.41	Variance-only	0.4036	0.0079	0.1692	0.0121	0.0692
0.41	Joint	0.4112	0.0030	0.1100	0.0076	0.0108
0.45	Variance-only	0.4556	0.0158	0.1627	0.0106	0.0627
0.45	Joint	0.4504	0.0034	0.1125	0.0067	0.0125
0.49	Variance-only	0.4987	0.0172	0.1596	0.0089	0.0596
0.49	Joint	0.4892	0.0043	0.1088	0.0090	0.0102

The variance-only baseline achieves values close to the degree variance target, but the sampled hard $\beta_{1}$ remains around $0.16$ , far from the target $0.10$ . The hard $\beta_{1}$ error ranges from approximately $0.0596$ to $0.0692$ .

In contrast, the joint optimization achieves values close to the degree variance target while keeping the sampled hard $\beta_{1}$ in the range from $0.1088$ to $0.1125$ . The hard $\beta_{1}$ error is approximately between $0.0102$ and $0.0125$ , which is much smaller than that of the variance-only baseline.

These results show that the Betti loss is not merely a special regularization term that conflicts with other graph feature objectives. Rather, it functions as a topology-aware objective that can be combined with ordinary graph statistics. In other words, the proposed method can control a standard graph feature such as degree variance while simultaneously keeping the first-order topology of the clique complex near the target value.

7 Summary and Discussion

In this work, we proposed a framework for differentiably controlling homological constraints on finite simplicial complexes using the low-frequency spectrum of Hodge-Laplacian-type spectral relaxations. The central idea is not to treat Betti numbers or persistent homology directly as discrete quantities, but instead to extract zero and near-zero modes, which correspond to homological structures in the hard ordinary-complex limit, using smooth spectral filters. In the soft regime, these quantities should be interpreted as low-frequency Hodge-spectral surrogates rather than as exact homological invariants.

The main contributions of this work can be summarized as follows. First, we introduced an ambient Hodge-spectral relaxation on a fixed chain space and showed that, in the hard limit, the penalty-regularized operator recovers the ordinary Hodge Laplacian on the active subcomplex and hence the corresponding Betti number. Second, using low-pass spectral filters such as heat filters, resolvent filters, and polynomial moment filters, we defined spectral-filter matching losses and trace-type spectral-mass losses. Third, we applied these objectives to Vietoris–Rips complexes of point clouds and compared their optimization signals with persistence-diagram-based losses. Fourth, we used polynomial Laplacian moments to control normalized first-Betti-type quantities of graph clique complexes and showed that this topological objective can be combined with ordinary graph-feature objectives.

It is important to distinguish this contribution from the use of persistent homology as a descriptor. Persistent homology remains the appropriate object when the goal is to compute stable barcode-level summaries of multiscale topology. The proposed Hodge-spectral objectives are complementary: they are designed for optimization settings in which topology must provide a differentiable signal. In such settings, exact barcode information is not always the most convenient optimization object, and low-frequency spectral quantities can provide more distributed and geometry-aware gradients.

In the experiments on Vietoris–Rips filtrations, we observed that Hodge spectral-filter losses have properties different from persistent-homology-based losses. Hodge spectral-filter losses do not act only on birth–death pairs, but instead provide gradients through low-energy subspaces at each scale. As a result, compared with PH losses, the gradients were less localized around a small number of critical simplices and were more broadly distributed over the point cloud in the gradient-localization experiments. In the pairing-instability stress test, PH losses showed sharp changes when the maximal persistence bar switched, whereas the Hodge interval loss exhibited relatively smoother behavior after normalization by the loss range. In the shape synthesis experiment, Hodge spectral-filter gradients provided update directions more aligned with the target geometry and achieved lower reconstruction error than PH gradients in the controlled radial-shape setting.

In the graph clique-complex experiments, we used the Laplacian moment

\operatorname{Tr}\left(I-\alpha L_{1}\right)^{d}

to control a normalized first-Betti-type spectral quantity. When the ordinary Hodge Laplacian is used on a hard clique complex, the chain space consists only of active simplices. Therefore, unlike the ambient formulation, no inactive-direction penalty is required. In the soft numerical implementation, the moment is computed using a weighted-current-complex operator, and the normalized soft spectral moment is evaluated against a fixed soft target. The resulting sampled hard graphs are then evaluated using the ordinary normalized Betti number

\frac{\beta_{1}}{|K_{1}|}.

The numerical experiments showed that, for both a single target and multiple targets, optimizing edge logits with the soft spectral-moment Betti loss moves the sampled hard normalized Betti number toward the prescribed hard target value. We also confirmed that the soft spectral-moment Betti loss can be combined with ordinary graph-feature losses such as degree variance, enabling simultaneous control of graph statistics and topological quantities.

These results suggest that the low-frequency spectrum of Hodge-Laplacian-type operators provides an effective continuous representation for topology-constrained optimization. In particular, unlike barcode representations, the proposed Hodge-based losses are sensitive not only to the existence of homology classes but also to the surrounding spectral and geometric structure. Therefore, they can induce more geometrically natural deformations and updates, rather than merely matching topological summary statistics in the settings studied here. This should not be interpreted as a general dominance statement over barcode-based objectives; rather, the two approaches emphasize different information and are useful for different purposes.

At the same time, the proposed method has clear computational limitations. Hodge spectral-filter losses use richer spectral information than PH losses, but this also increases the computational cost. As the size of a point cloud or graph grows, the number of simplices can increase rapidly, and constructing the Hodge-Laplacian-type operator and evaluating spectral filters become expensive. In our experiments, dense matrix computations were used, and the runtime per gradient step for Hodge losses was larger than that for PH losses. Applying the method to larger datasets will require scalable implementations based on sparse linear algebra, low-rank approximation, Chebyshev approximation, stochastic trace estimation, and related techniques. Thus, the present experiments should be viewed as proof-of-concept demonstrations of the optimization behavior rather than as optimized large-scale implementations.

Another important point is that trace-type losses should be interpreted as low-frequency spectral masses rather than exact Betti numbers in the soft regime. This is both a limitation and an advantage. By including near-zero modes, the loss can respond not only to exact homology classes but also to weakly closed cycles, unstable holes, and approximate topological structures. This yields smoother and more geometry-aware update directions. On the other hand, hyperparameters such as filter temperature, moment degree, scale set, spectral scaling, and regularization strength affect the resulting optimization behavior. For polynomial moment filters, in particular, the choice of $\alpha$ and $d$ must be coordinated with the spectrum of the Laplacian so that positive eigenvalues are actually suppressed. A more systematic theoretical and empirical understanding of these parameters remains an important topic for future work.

The polynomial trace form of the moment loss also suggests possible connections with scalable trace-estimation methods. Classical stochastic trace estimation and polynomial filtering can reduce the need for full eigendecomposition, and they are natural candidates for scaling Hodge-spectral objectives to larger complexes. Quantum trace-estimation methods for normalized Betti numbers have a related mathematical structure, because they also represent Betti information through traces of polynomial functions of Laplacian-related operators. However, this work does not implement a quantum algorithm and does not claim quantum advantage. A careful study of block encodings, state preparation costs, sampling error, and gradient estimation would be required before making algorithmic claims in that direction.

On the application side, the proposed framework can be extended to many problems where topological constraints are important. Hodge spectral-filter losses for point clouds and images may be useful for medical image segmentation, shape interpolation, and geometric generative modeling. Normalized spectral-moment Betti losses for graphs may be useful for communication networks, molecular graphs, material structures, and social networks, where higher-order loops and redundancy affect functionality. In particular, Betti numbers of clique complexes describe higher-order graph structures that cannot be fully captured by degree distributions or clustering coefficients, and therefore may serve as useful objectives for controlling network robustness and higher-order connectivity. Developing such applications will require task-specific choices of filtrations, spectral filters, target quantities, and geometric or structural regularization terms.

In summary, this work presented a framework for treating topology not merely as an external descriptor computed after the fact, but as an object that can be directly optimized as a loss function. By using the low-frequency spectrum of Hodge-Laplacian-type spectral relaxations, discrete homological information can be embedded into smooth optimization problems, yielding a common mathematical structure across Vietoris–Rips complexes of point clouds and clique complexes of graphs. Further development of scalable spectral approximations and trace-estimation methods may enable the control of larger and more complex topological structures.

Statements and Declarations

Funding

The authors declare that no funds, grants, or other support were received during the preparation of this manuscript.

Competing interests

The authors have no relevant financial or non-financial interests to disclose.

Author contributions

S.K. conceived the study, developed the theoretical framework, and carried out the numerical computations. Y.S. made significant contributions through extensive discussions with S.K. and provided important guidance for refining the direction of the study. All authors reviewed and approved the final manuscript.

Data availability

The datasets generated and analyzed during the current study are available from the corresponding author upon reasonable request.

Code availability

The code used for the numerical experiments is available from the corresponding author upon reasonable request.

Ethics approval

Not applicable.

Consent to participate

Not applicable.

Consent for publication

Not applicable.

Appendix A Details of Backward Computation

In this appendix, we derive the backward computation for the low-pass spectral losses defined in the main text. The basic flow is as follows. First, from a loss function

\mathcal{J}=\mathcal{J}(\widehat{L}_{q}),

we compute the gradient with respect to the Hodge-Laplacian-type spectral relaxation:

G_{L}=\frac{\partial\mathcal{J}}{\partial\widehat{L}_{q}}.

Then, using the structure of $\widehat{L}_{q}$ , we propagate the gradient to the weighted boundary operators, simplex weights, edge probabilities, and edge logits. In the Vietoris–Rips setting, since the edge logits are defined as functions of point-cloud coordinates, the gradients are further propagated to the point coordinates.

Throughout this appendix, for a variable $z$ , we denote the gradient of $\mathcal{J}$ with respect to $z$ by

g_{z}=\frac{\partial\mathcal{J}}{\partial z}.

For example,

g_{\rho_{\sigma}}=\frac{\partial\mathcal{J}}{\partial\rho_{\sigma}},\qquad g_{w_{\sigma}}=\frac{\partial\mathcal{J}}{\partial w_{\sigma}},\qquad g_{p_{e}}=\frac{\partial\mathcal{J}}{\partial p_{e}},\qquad g_{a_{e}}=\frac{\partial\mathcal{J}}{\partial a_{e}}.

Here, $\rho_{\sigma}$ denotes the square root of the simplex weight $w_{\sigma}$ :

\rho_{\sigma}=\sqrt{w_{\sigma}}.

A.1 Notation and Overall Computational Path

For a fixed degree $q$ , we write the Hodge-Laplacian-type spectral relaxation as

L=\widehat{L}_{q}.

Let the loss function be

\mathcal{J}=\mathcal{J}(L).

The matrix gradient obtained from the spectral loss is denoted by

G_{L}=\frac{\partial\mathcal{J}}{\partial L}.

Since $L$ is symmetric, in numerical implementations one may symmetrize this gradient as

G_{L}\leftarrow\frac{1}{2}\left(G_{L}+G_{L}^{\top}\right).

The Hodge-Laplacian-type spectral relaxation is defined by

L=\widetilde{B}_{q}^{\top}\widetilde{B}_{q}+\widetilde{B}_{q+1}\widetilde{B}_{q+1}^{\top}+\mu(I-W_{q}).

For simplicity, we write

A=\widetilde{B}_{q},\qquad C=\widetilde{B}_{q+1}.

Then

L=A^{\top}A+CC^{\top}+\mu(I-W_{q}).

Each weighted boundary operator is defined by

\widetilde{B}_{p}=R_{p-1}B_{p}R_{p},

where

R_{p}=\operatorname{diag}(\rho_{\sigma})_{\sigma\in K_{\max}^{(p)}},\qquad\rho_{\sigma}=\sqrt{w_{\sigma}}.

In the soft clique complex, the simplex weights are given by

w_{\sigma}=\prod_{e\subset\sigma}p_{e},

and each edge probability is defined by

p_{e}=\sigma(a_{e})=\frac{1}{1+\exp(-a_{e})}.

Therefore, the backward computational path is

\mathcal{J}\longrightarrow L\longrightarrow\widetilde{B}_{q},\widetilde{B}_{q+1},W_{q}\longrightarrow\rho_{\sigma}\longrightarrow w_{\sigma}\longrightarrow p_{e}\longrightarrow a_{e}.

In the Vietoris–Rips setting, we further have

a_{ij}^{(r)}(X)=\frac{r-d_{ij}(X)}{\varepsilon},\qquad d_{ij}(X)=\sqrt{\|x_{i}-x_{j}\|^{2}+\delta}.

Thus, gradients are further propagated through

a_{ij}^{(r)}\longrightarrow d_{ij}\longrightarrow x_{i}.

A.2 From Spectral Losses to $G_{L}$

We first compute

G_{L}=\frac{\partial\mathcal{J}}{\partial L}

for representative low-pass spectral losses used in the main text.

A.2.1 Heat Projector Matching Loss

Let the heat projector be

F(L)=\exp\left(-\frac{L}{\tau}\right).

Let the target projector be

F_{\mathrm{tar}}=\exp\left(-\frac{L_{\mathrm{tar}}}{\tau}\right).

The loss function is

\mathcal{J}_{\mathrm{heat}}=\frac{1}{2}\left\|F(L)-F_{\mathrm{tar}}\right\|_{F}^{2}.

Define

D=F(L)-F_{\mathrm{tar}}.

Then

d\mathcal{J}_{\mathrm{heat}}=\langle D,dF\rangle_{F}.

Let the eigendecomposition of $L$ be

L=U\Lambda U^{\top},\qquad\Lambda=\operatorname{diag}(\lambda_{1},\ldots,\lambda_{N}).

Define

f(\lambda)=\exp\left(-\frac{\lambda}{\tau}\right).

Then

F(L)=Uf(\Lambda)U^{\top}.

By the Fréchet derivative of a matrix function,

U^{\top}(dF)U=K\odot\left(U^{\top}(dL)U\right),

where $\odot$ denotes the Hadamard product, and the Loewner matrix $K$ is given by

K_{ij}=\begin{cases}\dfrac{f(\lambda_{i})-f(\lambda_{j})}{\lambda_{i}-\lambda_{j}},&i\neq j,\\[8.53581pt] f^{\prime}(\lambda_{i}),&i=j.\end{cases}

For the heat filter,

f^{\prime}(\lambda_{i})=-\frac{1}{\tau}\exp\left(-\frac{\lambda_{i}}{\tau}\right).

Therefore,

d\mathcal{J}_{\mathrm{heat}}=\left\langle U^{\top}DU,K\odot\left(U^{\top}(dL)U\right)\right\rangle_{F}.

Using the inner-product property of the Hadamard product,

d\mathcal{J}_{\mathrm{heat}}=\left\langle K\odot\left(U^{\top}DU\right),U^{\top}(dL)U\right\rangle_{F}.

Furthermore, using

\left\langle M,U^{\top}(dL)U\right\rangle_{F}=\left\langle UMU^{\top},dL\right\rangle_{F},

we obtain

G_{L}=U\left[K\odot\left(U^{\top}DU\right)\right]U^{\top}.

Thus, for the heat projector matching loss,

G_{L}=\frac{\partial\mathcal{J}_{\mathrm{heat}}}{\partial L}=U\left[K\odot\left(U^{\top}\left(F(L)-F_{\mathrm{tar}}\right)U\right)\right]U^{\top}.

A.2.2 Resolvent Projector Matching Loss

Let the resolvent projector be

F(L)=\alpha(L+\alpha I)^{-1}.

Let the target projector be

F_{\mathrm{tar}}=\alpha(L_{\mathrm{tar}}+\alpha I)^{-1}.

The loss function is

\mathcal{J}_{\mathrm{res}}=\frac{1}{2}\left\|F(L)-F_{\mathrm{tar}}\right\|_{F}^{2}.

Define

D=F(L)-F_{\mathrm{tar}}.

Also define

M=L+\alpha I.

Then

F(L)=\alpha M^{-1}.

Using the differential formula for the inverse matrix,

dM^{-1}=-M^{-1}(dM)M^{-1}.

Since

dM=dL,

we have

dF=-\alpha M^{-1}(dL)M^{-1}.

The differential of the loss is

d\mathcal{J}_{\mathrm{res}}=\langle D,dF\rangle_{F}.

Therefore,

d\mathcal{J}_{\mathrm{res}}=-\alpha\operatorname{Tr}\left[D^{\top}M^{-1}(dL)M^{-1}\right].

By cyclicity of the trace,

d\mathcal{J}_{\mathrm{res}}=-\alpha\operatorname{Tr}\left[M^{-1}D^{\top}M^{-1}dL\right].

When $L$ , $M$ , and $D$ are symmetric,

D^{\top}=D.

Thus,

G_{L}=-\alpha M^{-1}DM^{-1}.

Equivalently,

G_{L}=\frac{\partial\mathcal{J}_{\mathrm{res}}}{\partial L}=-\alpha(L+\alpha I)^{-1}\left(F(L)-F_{\mathrm{tar}}\right)(L+\alpha I)^{-1}.

A.2.3 Moment Trace Loss

Let the moment surrogate be

S_{q,d}(L)=\operatorname{Tr}\left(I-\alpha L\right)^{d}.

Define

M=I-\alpha L.

Then

S_{q,d}(L)=\operatorname{Tr}(M^{d}).

We first compute

dS_{q,d}=d\,\operatorname{Tr}(M^{d}).

By the product rule,

d(M^{d})=\sum_{\ell=0}^{d-1}M^{\ell}(dM)M^{d-1-\ell}.

Therefore,

dS_{q,d}=\sum_{\ell=0}^{d-1}\operatorname{Tr}\left[M^{\ell}(dM)M^{d-1-\ell}\right].

By cyclicity of the trace,

\operatorname{Tr}\left[M^{\ell}(dM)M^{d-1-\ell}\right]=\operatorname{Tr}\left[M^{d-1}dM\right].

Hence,

dS_{q,d}=d\,\operatorname{Tr}\left[M^{d-1}dM\right].

Since

M=I-\alpha L,

we have

dM=-\alpha dL.

Therefore,

dS_{q,d}=-\alpha d\,\operatorname{Tr}\left[M^{d-1}dL\right].

Thus,

\frac{\partial S_{q,d}}{\partial L}=-\alpha d\left(M^{d-1}\right)^{\top}.

If $L$ is symmetric, then $M$ is also symmetric, and hence

\frac{\partial S_{q,d}}{\partial L}=-\alpha dM^{d-1}.

For the loss function

\mathcal{J}_{\mathrm{mom}}=\frac{1}{2}\left(S_{q,d}(L)-\tau_{q}\right)^{2},

the chain rule gives

G_{L}=\frac{\partial\mathcal{J}_{\mathrm{mom}}}{\partial L}=\left(S_{q,d}(L)-\tau_{q}\right)\frac{\partial S_{q,d}}{\partial L}.

Therefore,

G_{L}=\left(S_{q,d}(L)-\tau_{q}\right)\left(-\alpha dM^{d-1}\right).

When using the normalized moment surrogate

\overline{S}_{q,d}(L)=\frac{1}{N_{q}}S_{q,d}(L),

the loss function is

\mathcal{J}_{\mathrm{norm}}=\frac{1}{2}\left(\overline{S}_{q,d}(L)-\bar{\tau}_{q}\right)^{2}.

In this case,

G_{L}=\left(\overline{S}_{q,d}(L)-\bar{\tau}_{q}\right)\frac{1}{N_{q}}\left(-\alpha dM^{d-1}\right).

A.3 From the Hodge-Laplacian-type spectral relaxation to Weighted Boundaries

We now assume that

G_{L}=\frac{\partial\mathcal{J}}{\partial L}

has already been computed, and propagate it to the weighted boundary operators.

Recall that

L=A^{\top}A+CC^{\top}+\mu(I-W_{q}),

where

A=\widetilde{B}_{q},\qquad C=\widetilde{B}_{q+1}.

Taking the differential, we obtain

dL=d(A^{\top}A)+d(CC^{\top})-\mu dW_{q}.

The first two terms are

d(A^{\top}A)=(dA)^{\top}A+A^{\top}(dA)

and

d(CC^{\top})=(dC)C^{\top}+C(dC)^{\top}.

Thus,

dL=(dA)^{\top}A+A^{\top}(dA)+(dC)C^{\top}+C(dC)^{\top}-\mu dW_{q}.

The differential of the loss is

d\mathcal{J}=\langle G_{L},dL\rangle_{F}.

First, consider the terms involving $A$ :

d\mathcal{J}_{A}=\left\langle G_{L},(dA)^{\top}A+A^{\top}(dA)\right\rangle_{F}.

Using properties of the Frobenius inner product,

\left\langle G_{L},(dA)^{\top}A\right\rangle_{F}=\left\langle AG_{L}^{\top},dA\right\rangle_{F}

and

\left\langle G_{L},A^{\top}(dA)\right\rangle_{F}=\left\langle AG_{L},dA\right\rangle_{F}.

Since $G_{L}$ is symmetrized,

G_{L}^{\top}=G_{L}.

Therefore,

d\mathcal{J}_{A}=\left\langle 2AG_{L},dA\right\rangle_{F}.

Hence,

G_{A}=\frac{\partial\mathcal{J}}{\partial A}=2AG_{L}.

Next, consider the terms involving $C$ :

d\mathcal{J}_{C}=\left\langle G_{L},(dC)C^{\top}+C(dC)^{\top}\right\rangle_{F}.

Similarly,

\left\langle G_{L},(dC)C^{\top}\right\rangle_{F}=\left\langle G_{L}C,dC\right\rangle_{F}

and

\left\langle G_{L},C(dC)^{\top}\right\rangle_{F}=\left\langle G_{L}^{\top}C,dC\right\rangle_{F}.

Since $G_{L}^{\top}=G_{L}$ ,

d\mathcal{J}_{C}=\left\langle 2G_{L}C,dC\right\rangle_{F}.

Thus,

G_{C}=\frac{\partial\mathcal{J}}{\partial C}=2G_{L}C.

We also compute the contribution from the penalty term

\mu(I-W_{q}).

The differential of this term is

-\mu dW_{q}.

Therefore,

d\mathcal{J}_{\mathrm{pen}}=-\mu\langle G_{L},dW_{q}\rangle_{F}.

Since $W_{q}$ is diagonal,

W_{q}=\operatorname{diag}(w_{\sigma})_{\sigma\in K_{\max}^{(q)}},

we have

dW_{q}=\operatorname{diag}(dw_{\sigma})_{\sigma\in K_{\max}^{(q)}}.

Thus,

\langle G_{L},dW_{q}\rangle_{F}=\sum_{\sigma\in K_{\max}^{(q)}}(G_{L})_{\sigma\sigma}dw_{\sigma}.

Hence, the direct gradient contribution from the penalty term to a $q$ -simplex weight is

g_{w_{\sigma}}^{\mathrm{pen}}=-\mu(G_{L})_{\sigma\sigma},\qquad\sigma\in K_{\max}^{(q)}.

A.4 From Weighted Boundaries to Square-Root Weights

We next propagate the gradient through the weighted boundary

\widetilde{B}_{p}=R_{p-1}B_{p}R_{p}

to the square-root weights $\rho_{\sigma}$ .

Recall that

R_{p}=\operatorname{diag}(\rho_{\sigma})_{\sigma\in K_{\max}^{(p)}},\qquad\rho_{\sigma}=\sqrt{w_{\sigma}}.

Let

G_{\widetilde{B}_{p}}=\frac{\partial\mathcal{J}}{\partial\widetilde{B}_{p}}.

In components,

(\widetilde{B}_{p})_{\tau\sigma}=\rho_{\tau}(B_{p})_{\tau\sigma}\rho_{\sigma},

where

\tau\in K_{\max}^{(p-1)},\qquad\sigma\in K_{\max}^{(p)}.

First, consider the contribution to $\rho_{\tau}$ for a $(p-1)$ -simplex $\tau$ . We have

\frac{\partial(\widetilde{B}_{p})_{\tau\sigma}}{\partial\rho_{\tau}}=(B_{p})_{\tau\sigma}\rho_{\sigma}.

Therefore,

\frac{\partial\mathcal{J}}{\partial\rho_{\tau}}=\sum_{\sigma\in K_{\max}^{(p)}}(G_{\widetilde{B}_{p}})_{\tau\sigma}(B_{p})_{\tau\sigma}\rho_{\sigma}.

In vector form, the contribution from $\widetilde{B}_{p}$ to $\rho_{p-1}$ is

g_{\rho_{p-1}}^{(p,\mathrm{lower})}=\left(G_{\widetilde{B}_{p}}\odot B_{p}\right)\rho_{p}.

Next, consider the contribution to $\rho_{\sigma}$ for a $p$ -simplex $\sigma$ . We have

\frac{\partial(\widetilde{B}_{p})_{\tau\sigma}}{\partial\rho_{\sigma}}=\rho_{\tau}(B_{p})_{\tau\sigma}.

Therefore,

\frac{\partial\mathcal{J}}{\partial\rho_{\sigma}}=\sum_{\tau\in K_{\max}^{(p-1)}}(G_{\widetilde{B}_{p}})_{\tau\sigma}(B_{p})_{\tau\sigma}\rho_{\tau}.

In vector form, the contribution from $\widetilde{B}_{p}$ to $\rho_{p}$ is

g_{\rho_{p}}^{(p,\mathrm{upper})}=\left(G_{\widetilde{B}_{p}}\odot B_{p}\right)^{\top}\rho_{p-1}.

A key point is that the same $\rho_{p}$ appears in two weighted boundaries. It appears on the right side of

\widetilde{B}_{p}=R_{p-1}B_{p}R_{p}

and on the left side of

\widetilde{B}_{p+1}=R_{p}B_{p+1}R_{p+1}.

Therefore, the total gradient with respect to $\rho_{p}$ is the sum of the two contributions.

Thus, for a general degree $p$ , the full gradient with respect to $\rho_{p}$ is

g_{\rho_{p}}=\left(G_{\widetilde{B}_{p}}\odot B_{p}\right)^{\top}\rho_{p-1}+\left(G_{\widetilde{B}_{p+1}}\odot B_{p+1}\right)\rho_{p+1},

where terms corresponding to non-existent boundary operators at the ends of the complex are omitted.

For the most important case $q=1$ , we have

L=\widehat{L}_{1}=\widetilde{B}_{1}^{\top}\widetilde{B}_{1}+\widetilde{B}_{2}\widetilde{B}_{2}^{\top}+\mu(I-W_{1}).

Then

G_{\widetilde{B}_{1}}=2\widetilde{B}_{1}G_{L},\qquad G_{\widetilde{B}_{2}}=2G_{L}\widetilde{B}_{2}.

The gradient with respect to the square-root weights of edges, $\rho_{1}$ , is

g_{\rho_{1}}=\left(G_{\widetilde{B}_{1}}\odot B_{1}\right)^{\top}\rho_{0}+\left(G_{\widetilde{B}_{2}}\odot B_{2}\right)\rho_{2}.

The gradient with respect to the square-root weights of triangles, $\rho_{2}$ , is

g_{\rho_{2}}=\left(G_{\widetilde{B}_{2}}\odot B_{2}\right)^{\top}\rho_{1}.

The vertex weights are usually fixed, and hence gradients with respect to $\rho_{0}$ are not used for optimization.

A.5 From Square-Root Weights to Simplex Weights

We now compute the derivative from

\rho_{\sigma}=\sqrt{w_{\sigma}}

w_{\sigma}.

Since

\rho_{\sigma}=w_{\sigma}^{1/2},

we have

\frac{\partial\rho_{\sigma}}{\partial w_{\sigma}}=\frac{1}{2}w_{\sigma}^{-1/2}=\frac{1}{2\rho_{\sigma}}.

Therefore, the gradient contribution from the weighted boundaries to the simplex weight is

g_{w_{\sigma}}^{\mathrm{bdry}}=g_{\rho_{\sigma}}\frac{\partial\rho_{\sigma}}{\partial w_{\sigma}}=\frac{g_{\rho_{\sigma}}}{2\rho_{\sigma}}.

If $\sigma\in K_{\max}^{(q)}$ , then the penalty contribution

g_{w_{\sigma}}^{\mathrm{pen}}=-\mu(G_{L})_{\sigma\sigma}

must be added. Therefore,

g_{w_{\sigma}}=\frac{g_{\rho_{\sigma}}}{2\rho_{\sigma}}-\mu(G_{L})_{\sigma\sigma},\qquad\sigma\in K_{\max}^{(q)}.

For a $p$ -simplex $\sigma\in K_{\max}^{(p)}$ with $p\neq q$ , there is no direct penalty contribution, and hence

g_{w_{\sigma}}=\frac{g_{\rho_{\sigma}}}{2\rho_{\sigma}}.

Numerically, when $w_{\sigma}$ is very small, $\rho_{\sigma}$ also becomes very small and instability may occur. In such cases, one may use

\rho_{\sigma}=\sqrt{w_{\sigma}+\epsilon_{\mathrm{num}}}.

Then

g_{w_{\sigma}}^{\mathrm{bdry}}=\frac{g_{\rho_{\sigma}}}{2\sqrt{w_{\sigma}+\epsilon_{\mathrm{num}}}}.

A.6 From Simplex Weights to Edge Probabilities

In the soft clique complex, simplex weights are defined by

w_{\sigma}=\prod_{e\subset\sigma}p_{e}.

If an edge $e$ is contained in $\sigma$ , then

\frac{\partial w_{\sigma}}{\partial p_{e}}=\prod_{\begin{subarray}{c}e^{\prime}\subset\sigma\\ e^{\prime}\neq e\end{subarray}}p_{e^{\prime}}.

Equivalently,

\frac{\partial w_{\sigma}}{\partial p_{e}}=\frac{w_{\sigma}}{p_{e}}.

e\not\subset\sigma,

then

\frac{\partial w_{\sigma}}{\partial p_{e}}=0.

Therefore, the gradient with respect to the edge probability $p_{e}$ is the sum of the contributions from all simplices containing $e$ :

g_{p_{e}}=\frac{\partial\mathcal{J}}{\partial p_{e}}=\sum_{\sigma\supset e}g_{w_{\sigma}}\frac{\partial w_{\sigma}}{\partial p_{e}}.

Thus,

g_{p_{e}}=\sum_{\sigma\supset e}g_{w_{\sigma}}\frac{w_{\sigma}}{p_{e}}.

For numerical stability when $p_{e}$ is very small, one may use

g_{p_{e}}=\sum_{\sigma\supset e}g_{w_{\sigma}}\frac{w_{\sigma}}{p_{e}+\epsilon_{\mathrm{num}}}.

A.7 From Edge Probabilities to Edge Logits

The edge probability is defined from the edge logit $a_{e}$ by

p_{e}=\sigma(a_{e})=\frac{1}{1+\exp(-a_{e})}.

The derivative of the logistic function is

\frac{\partial p_{e}}{\partial a_{e}}=p_{e}(1-p_{e}).

Therefore,

g_{a_{e}}=\frac{\partial\mathcal{J}}{\partial a_{e}}=g_{p_{e}}p_{e}(1-p_{e}).

Substituting the result from the previous subsection gives

g_{a_{e}}=p_{e}(1-p_{e})\sum_{\sigma\supset e}g_{w_{\sigma}}\frac{w_{\sigma}}{p_{e}}.

When $p_{e}>0$ , this can be simplified as

g_{a_{e}}=(1-p_{e})\sum_{\sigma\supset e}g_{w_{\sigma}}w_{\sigma}.

However, for numerical stability, it is safer to use

g_{a_{e}}=p_{e}(1-p_{e})\sum_{\sigma\supset e}g_{w_{\sigma}}\frac{w_{\sigma}}{p_{e}+\epsilon_{\mathrm{num}}}.

Thus, gradients of arbitrary low-pass spectral losses can be propagated analytically to the edge logits

a_{e}.

A.8 Backward Computation for Graph Generation

In graph generation, the optimization variables are the edge logits

a=\{a_{e}\}_{e\in E_{\max}}.

Thus, the final output of the backward computation is

\nabla_{a}\mathcal{J}=\left(g_{a_{e}}\right)_{e\in E_{\max}}.

The computation proceeds as follows.

First, compute edge probabilities from edge logits:

p_{e}=\sigma(a_{e}).

Next, compute simplex weights:

w_{\sigma}=\prod_{e\subset\sigma}p_{e}.

Then construct

W_{q},\qquad R_{q},\qquad\widetilde{B}_{q},\qquad\widehat{L}_{q}.

Next, compute

G_{L}=\frac{\partial\mathcal{J}}{\partial\widehat{L}_{q}}

from the loss function. For example, for the normalized Laplacian moment loss

\mathcal{J}_{\mathrm{norm}}=\frac{1}{2}\left(\overline{S}_{q,d}-\bar{\tau}_{q}\right)^{2},

where

\overline{S}_{q,d}=\frac{1}{N_{q}}\operatorname{Tr}\left(I-\alpha\widehat{L}_{q}\right)^{d},

define

M=I-\alpha\widehat{L}_{q}.

Then

G_{L}=\left(\overline{S}_{q,d}-\bar{\tau}_{q}\right)\frac{1}{N_{q}}\left(-\alpha dM^{d-1}\right).

Using this $G_{L}$ , compute

G_{\widetilde{B}_{q}}=2\widetilde{B}_{q}G_{L}

and

G_{\widetilde{B}_{q+1}}=2G_{L}\widetilde{B}_{q+1}.

Then compute

g_{\rho_{p}}

using the formula in the previous subsection. In particular, for $q=1$ ,

g_{\rho_{1}}=\left(G_{\widetilde{B}_{1}}\odot B_{1}\right)^{\top}\rho_{0}+\left(G_{\widetilde{B}_{2}}\odot B_{2}\right)\rho_{2}.

Then compute

g_{w_{\sigma}}.

Next compute

g_{p_{e}}=\sum_{\sigma\supset e}g_{w_{\sigma}}\frac{w_{\sigma}}{p_{e}}.

Finally compute

g_{a_{e}}=g_{p_{e}}p_{e}(1-p_{e}).

This gives

\frac{\partial\mathcal{J}}{\partial a_{e}}=g_{a_{e}}.

Therefore, the edge logits can be updated by gradient-based optimization.

A.9 From Vietoris–Rips Edge Logits to Distances

In the Vietoris–Rips setting, edge logits are defined from point-cloud coordinates. At scale $r$ ,

a_{ij}^{(r)}(X)=\frac{r-d_{ij}(X)}{\varepsilon},

where

d_{ij}(X)=\sqrt{\|x_{i}-x_{j}\|^{2}+\delta}.

Assume that the gradient with respect to each edge logit has already been computed:

g_{a_{ij}^{(r)}}=\frac{\partial\mathcal{J}}{\partial a_{ij}^{(r)}}.

We now propagate this gradient to the distance $d_{ij}$ .

Since

a_{ij}^{(r)}=\frac{r-d_{ij}}{\varepsilon},

we have

\frac{\partial a_{ij}^{(r)}}{\partial d_{ij}}=-\frac{1}{\varepsilon}.

Therefore,

g_{d_{ij}}^{(r)}=\frac{\partial\mathcal{J}}{\partial d_{ij}}=g_{a_{ij}^{(r)}}\frac{\partial a_{ij}^{(r)}}{\partial d_{ij}}=-\frac{1}{\varepsilon}g_{a_{ij}^{(r)}}.

A.10 From Distances to Point-Cloud Coordinates

We next differentiate the stabilized distance

d_{ij}(X)=\sqrt{\|x_{i}-x_{j}\|^{2}+\delta}

with respect to the point-cloud coordinates.

Since

d_{ij}(X)^{2}=\|x_{i}-x_{j}\|^{2}+\delta,

we have

\nabla_{x_{i}}d_{ij}=\frac{x_{i}-x_{j}}{d_{ij}}

and

\nabla_{x_{j}}d_{ij}=\frac{x_{j}-x_{i}}{d_{ij}}.

At a single scale $r$ , the gradient with respect to a point $x_{i}$ is the sum of the contributions from all candidate edges incident to $x_{i}$ :

\nabla_{x_{i}}\mathcal{J}^{(r)}=\sum_{j:(i,j)\in E_{\max}}g_{d_{ij}}^{(r)}\frac{x_{i}-x_{j}}{d_{ij}}.

Substituting

g_{d_{ij}}^{(r)}=-\frac{1}{\varepsilon}g_{a_{ij}^{(r)}},

we obtain

\nabla_{x_{i}}\mathcal{J}^{(r)}=-\frac{1}{\varepsilon}\sum_{j:(i,j)\in E_{\max}}g_{a_{ij}^{(r)}}\frac{x_{i}-x_{j}}{d_{ij}}.

Furthermore, using

g_{a_{ij}^{(r)}}=p_{ij}^{(r)}\left(1-p_{ij}^{(r)}\right)\sum_{\sigma\supset(i,j)}g_{w_{\sigma}}^{(r)}\frac{w_{\sigma}^{(r)}}{p_{ij}^{(r)}},

we obtain

\nabla_{x_{i}}\mathcal{J}^{(r)}=-\frac{1}{\varepsilon}\sum_{j:(i,j)\in E_{\max}}\left[p_{ij}^{(r)}\left(1-p_{ij}^{(r)}\right)\sum_{\sigma\supset(i,j)}g_{w_{\sigma}}^{(r)}\frac{w_{\sigma}^{(r)}}{p_{ij}^{(r)}}\right]\frac{x_{i}-x_{j}}{d_{ij}}.

For numerical stability, one may replace

\frac{w_{\sigma}^{(r)}}{p_{ij}^{(r)}}

\frac{w_{\sigma}^{(r)}}{p_{ij}^{(r)}+\epsilon_{\mathrm{num}}}.

A.11 Multi-Scale Vietoris–Rips Backward Computation

In a Vietoris–Rips filtration, we use multiple scales

r_{1}<r_{2}<\cdots<r_{M}.

Suppose that the loss function is

\mathcal{J}(X)=\sum_{m=1}^{M}\omega_{m}\mathcal{J}_{m}\left(\widehat{L}_{q}^{(r_{m})}(X)\right).

For each scale $r_{m}$ , first compute

G_{L}^{(m)}=\frac{\partial\mathcal{J}_{m}}{\partial\widehat{L}_{q}^{(r_{m})}}.

Then, following the procedure above, compute

g_{a_{ij}^{(r_{m})}}=\frac{\partial\mathcal{J}_{m}}{\partial a_{ij}^{(r_{m})}}.

The edge logit at scale $r_{m}$ is

a_{ij}^{(r_{m})}(X)=\frac{r_{m}-d_{ij}(X)}{\varepsilon}.

The distance $d_{ij}(X)$ does not depend on the scale, whereas the edge-logit gradient

g_{a_{ij}^{(r_{m})}}

does depend on the scale.

The gradient of the total loss with respect to $x_{i}$ is the weighted sum of the contributions from all scales:

\nabla_{x_{i}}\mathcal{J}=\sum_{m=1}^{M}\omega_{m}\nabla_{x_{i}}\mathcal{J}_{m}.

Therefore,

\nabla_{x_{i}}\mathcal{J}=-\frac{1}{\varepsilon}\sum_{m=1}^{M}\omega_{m}\sum_{j:(i,j)\in E_{\max}}g_{a_{ij}^{(r_{m})}}\frac{x_{i}-x_{j}}{d_{ij}}.

Writing $g_{a_{ij}^{(r_{m})}}$ explicitly, we have

\nabla_{x_{i}}\mathcal{J}=-\frac{1}{\varepsilon}\sum_{m=1}^{M}\omega_{m}\sum_{j:(i,j)\in E_{\max}}\left[p_{ij}^{(r_{m})}\left(1-p_{ij}^{(r_{m})}\right)\sum_{\sigma\supset(i,j)}g_{w_{\sigma}}^{(m)}\frac{w_{\sigma}^{(r_{m})}}{p_{ij}^{(r_{m})}}\right]\frac{x_{i}-x_{j}}{d_{ij}}.

This shows that the multi-scale Vietoris–Rips loss is analytically differentiable with respect to the point-cloud coordinates $X$ .

A.12 Summary of Backward Computation

The backward computation derived in this appendix can be summarized as follows.

First, compute

G_{L}=\frac{\partial\mathcal{J}}{\partial\widehat{L}_{q}}

from the spectral loss.

Next, using

\widehat{L}_{q}=\widetilde{B}_{q}^{\top}\widetilde{B}_{q}+\widetilde{B}_{q+1}\widetilde{B}_{q+1}^{\top}+\mu(I-W_{q}),

compute

G_{\widetilde{B}_{q}}=2\widetilde{B}_{q}G_{L}

and

G_{\widetilde{B}_{q+1}}=2G_{L}\widetilde{B}_{q+1}.

For a general degree $p$ , the gradient with respect to $\rho_{p}$ is

g_{\rho_{p}}=\left(G_{\widetilde{B}_{p}}\odot B_{p}\right)^{\top}\rho_{p-1}+\left(G_{\widetilde{B}_{p+1}}\odot B_{p+1}\right)\rho_{p+1},

where non-existent boundary terms are omitted.

Then, since

\rho_{\sigma}=\sqrt{w_{\sigma}},

compute

g_{w_{\sigma}}^{\mathrm{bdry}}=\frac{g_{\rho_{\sigma}}}{2\rho_{\sigma}}.

For $\sigma\in K_{\max}^{(q)}$ , add the penalty contribution:

g_{w_{\sigma}}=\frac{g_{\rho_{\sigma}}}{2\rho_{\sigma}}-\mu(G_{L})_{\sigma\sigma}.

Next, using the soft clique relation

w_{\sigma}=\prod_{e\subset\sigma}p_{e},

compute

g_{p_{e}}=\sum_{\sigma\supset e}g_{w_{\sigma}}\frac{w_{\sigma}}{p_{e}}.

Finally, since

p_{e}=\sigma(a_{e}),

compute

g_{a_{e}}=g_{p_{e}}p_{e}(1-p_{e}).

In graph generation, this

g_{a_{e}}

is the gradient with respect to the optimization variable.

In the Vietoris–Rips setting, using

a_{ij}^{(r)}(X)=\frac{r-d_{ij}(X)}{\varepsilon}

and

d_{ij}(X)=\sqrt{\|x_{i}-x_{j}\|^{2}+\delta},

we obtain the point-coordinate gradient

\nabla_{x_{i}}\mathcal{J}=-\frac{1}{\varepsilon}\sum_{m=1}^{M}\omega_{m}\sum_{j:(i,j)\in E_{\max}}g_{a_{ij}^{(r_{m})}}\frac{x_{i}-x_{j}}{d_{ij}}.

Thus, all low-pass spectral losses introduced in the main text are analytically differentiable through the Hodge-Laplacian-type spectral relaxation, soft simplex weights, edge probabilities, edge logits, and, in the Vietoris–Rips setting, point-cloud coordinates.

Hodge Spectral Surrogates for Topology-Constrained Optimization

Abstract

keywords:

1 Introduction

2 Preliminaries and Problem Setting

2.1 Clique Complexes and Betti Numbers

2.2 Difficulties in Using Betti Numbers as Loss Functions

2.3 Persistent Homology

2.4 Loss Functions Based on Barcode Diagrams

2.5 Issues of Barcode-Based Loss Functions

2.5.1 Gradient Localization

2.5.2 Discontinuity of Gradients Caused by Pairing Changes

2.5.3 Loss of Geometric and Spectral Information

2.6 Problem Addressed in This Work

3 Construction of Hodge-Laplacian-type spectral relaxation

3.1 Ambient Chain Spaces and Boundary Operators

3.2 Penalty-Regularized Hard Ambient Hodge Laplacian

Proposition 1 (Hard-limit consistency of the ambient Hodge Laplacian).

Proof.

3.3 Soft Graphs and Hodge-Laplacian-type spectral relaxation

Remark 1 (Soft weighted boundaries).

Proposition 2 (Convergence to the hard ambient operator).

Proof.

3.4 Logit Parameterization for Graph Clique Complexes

3.5 Soft-Graph Representation of Vietoris–Rips Filtrations

3.6 Multi-Scale Vietoris–Rips Construction

4 Topological Objectives via Low-Pass Hodge Spectral Filters

4.1 Low-Pass Spectral Filters

4.2 Spectral-Filter Matching Loss

4.3 Trace-Type Spectral-Mass Betti Surrogate Loss

Proposition 3 (Trace approximation of Betti numbers).

Proof.

4.4 Normalized Spectral-Moment Betti Surrogate

4.5 Normalized First-Betti-Type Control in Graph Clique Complexes

4.6 Topology Control for Vietoris–Rips Filtrations

4.6.1 Trace-Type Vietoris–Rips Loss

4.6.2 Spectral-Filter Matching Vietoris–Rips Loss

4.7 Practical Choice of Objectives

5 Numerical Experiments on Vietoris–Rips Topology Control

5.1 Experimental Setup

5.2 Multi-Scale Topology Control from Random Point Clouds

5.3 Gradient Localization

5.4 Pairing Instability Stress Test

5.5 Shape Synthesis by Analytic Gradients

5.6 Runtime Comparison

6 Normalized First-Betti-Type Control on Graph Clique Complexes via Laplacian Moments

6.1 Normalized Betti Surrogate Based on the Ordinary Hodge Laplacian for Hard Clique Complexes

6.2 Laplacian Moments and Normalized Betti Numbers

6.3 Relation to the Ambient Formulation

Lemma 4 (Ambient-to-ordinary reduction for polynomial moments).

6.4 Relation to Polynomial Trace Estimation

6.5 Numerical Results

Soft and hard targets.

6.5.1 Convergence of the Path-Integral Estimator

6.5.2 Convergence of Betti Loss Optimization

6.5.3 Topology-Controlled Graph Generation

6.5.4 Combination with Graph Feature Loss

7 Summary and Discussion

Statements and Declarations

Funding

Competing interests

Author contributions

Data availability

Code availability

Ethics approval

Consent to participate

Consent for publication

Appendix A Details of Backward Computation

A.1 Notation and Overall Computational Path

A.2 From Spectral Losses to GLG_{L}

A.2.1 Heat Projector Matching Loss

A.2.2 Resolvent Projector Matching Loss

A.2.3 Moment Trace Loss

A.3 From the Hodge-Laplacian-type spectral relaxation to Weighted Boundaries

A.4 From Weighted Boundaries to Square-Root Weights

A.5 From Square-Root Weights to Simplex Weights

A.6 From Simplex Weights to Edge Probabilities

A.7 From Edge Probabilities to Edge Logits

A.8 Backward Computation for Graph Generation

A.9 From Vietoris–Rips Edge Logits to Distances

A.2 From Spectral Losses to $G_{L}$