License: CC BY 4.0
arXiv:2606.25194v1 [math.AT] 23 Jun 2026

[1]\fnmSatoshi \surKanno

[1] \orgdivQuantum Information Technology Department, Quantum Technology Division, Product Research and Development Division, \orgnameSoftBank Corp., \orgaddress \street1-7-1 Kaigan, \cityMinato-ku, \postcode105-7529, \stateTokyo, \countryJapan

Hodge Spectral Surrogates for Topology-Constrained Optimization

Abstract

Topological information is widely used in data analysis, network design, and machine learning, and topological constraints naturally arise when optimizing or generating objects with prescribed homological structure. However, directly controlling Betti numbers and persistent homology is difficult because they are discrete and combinatorial. We propose a differentiable framework for topology-constrained optimization based on Hodge-spectral relaxations of homological constraints and low-pass spectral filters. From soft graphs and soft clique complexes, we construct Hodge-Laplacian-type spectral relaxations that unify graph clique complexes and Vietoris–Rips filtrations of point clouds. In the hard limit, the penalty-regularized ambient operator recovers the ordinary Hodge Laplacian on the active subcomplex, while in the soft regime it serves as a differentiable low-frequency spectral surrogate. Homological information is represented by zero and near-zero modes, and differentiable topological objectives are defined using heat filters, resolvent filters, and polynomial Laplacian moments. For point clouds, we show that the proposed Hodge spectral-filter losses yield more spatially distributed gradients, smoother scale-normalized behavior under persistence-pairing changes, and geometry-aware update directions than persistent-homology-based losses. For graph clique complexes, Laplacian moments control normalized first-Betti-type quantities and can be combined with ordinary graph-feature objectives. We also discuss connections to trace-based normalized Betti-number estimation, polynomial spectral methods, and possible quantum trace estimation.

keywords:
Applied and computational topology, Topological data analysis, Hodge Laplacian, Spectral filters, Topology-constrained optimization

1 Introduction

Understanding the shape of data is a fundamental problem in applied mathematics, data analysis, network science, and machine learning. In particular, the interaction between topology and artificial intelligence has become an important theme in applied and computational topology. Data such as point clouds, images, graphs, and time series often contain not only local metric information but also global structural information, including connected components, holes, loops, and higher-order relations. Homological invariants, such as Betti numbers and persistent homology ([edelsbrunner2002topological, zomorodian2004computing, cohen2005stability, edelsbrunner2008persistent, otter2017roadmap]), provide mathematical tools for describing such global structures and have been widely used as topological descriptors and features in artificial-intelligence and machine-learning pipelines (see [adams2017persistence, hofer2017deep, bubenik2020persistence, chazal2014stochastic, kalivsnik2019tropical]). Such topological features can improve representations or regularization when the underlying data distribution has nontrivial geometric or topological structure.

Topology is also important not only as a descriptor but also as a quantity to be controlled in optimization through loss functions or regularizers (see [moor2020topological, vandaele2021topologically, gabrielsson2020topology]). In many applications, one would like the output of a model or an optimization procedure to satisfy prescribed topological constraints. For example, in image segmentation, controlling the topology of predicted regions can reduce spurious connected components or undesired holes(see [clough2020topological, hu2019topology]). In graph and network design, the Betti numbers of clique complexes describe higher-order connectivity, redundancy, and cyclic structure, which are related to robustness and alternative pathways in networks. Thus, differentiable control of persistent homology and Betti numbers is relevant to topology-constrained optimization in a broad applied-mathematical sense. In this prescriptive setting, topology is optimized rather than only observed: the aim is to deform, generate, or regularize an object so that its homological structure satisfies a desired condition.

Persistent homology provides a natural mechanism for such topological control. Since a persistence diagram or barcode records the birth and death of homology classes across a filtration, one can define losses that penalize undesired bars, preserve selected bars, or match a target barcode. These barcode-based losses make it possible to incorporate persistent homology directly into optimization and learning procedures. However, directly controlling topological quantities is difficult because they are inherently discrete and combinatorial. Betti numbers of clique complexes depend on simplex inclusion and ranks of boundary operators. Even when graph edges are relaxed into continuous weights or probabilities, the resulting clique complex and its Betti numbers remain governed by combinatorial conditions.

For persistent homology, differentiable losses based on barcode or persistence-diagram representations have been studied, and such losses are useful when the desired constraint is naturally expressed in terms of persistent features. However, when topology is used as an optimization signal, such losses have several limitations: gradients may tend to be localized on a small number of critical simplices (see [nigmetov2024topological, nigmetov2024topological_2]), persistence-pairing changes may cause abrupt changes in gradient directions, and barcode representations may compress away geometric and spectral information that could provide smoother and more spatially distributed update directions.

We emphasize that our goal is not to replace persistent homology as a descriptor. Persistent homology remains the appropriate object when one wants stable barcode-level summaries of multiscale topology. Our focus is different: we study topology as a differentiable loss or constraint inside an optimization loop. In this setting, a smooth Hodge-spectral surrogate can be useful because it provides gradients through low-frequency spectral structure rather than only through selected birth–death simplices.

In this work, we address these difficulties by representing homology through the zero modes of Hodge Laplacians. For a simplicial complex, the kk-th Betti number is equal to the dimension of the kernel of the kk-th Hodge Laplacian. Instead of directly counting zero eigenvalues, we introduce smooth low-pass spectral filters that emphasize zero and near-zero eigenmodes. These filters provide differentiable spectral surrogates for topological information and allow homological information to be used as a continuous optimization signal.

More specifically, we construct Hodge-Laplacian-type spectral relaxations from soft graphs and soft clique complexes. This gives a unified formulation for two standard sources of finite simplicial complexes: graph data and Vietoris–Rips filtrations of point clouds. In the point-cloud setting, soft edge activations are defined from pairwise distances and scale parameters. In the graph setting, edge probabilities induce soft simplex weights and hence soft clique complexes. In both cases, topological losses are defined by applying low-pass spectral filters, such as heat filters, resolvent filters, and polynomial moment filters, to the corresponding Hodge-Laplacian-type spectral relaxations. In the hard limit, the proposed ambient operator recovers the ordinary Hodge Laplacian on the active subcomplex and hence retains the standard homological interpretation. In the soft regime, the same construction should be interpreted as a differentiable low-frequency spectral surrogate for homological structure.

The Laplacian-based formulation also connects naturally with polynomial spectral methods and trace estimation. In particular, trace-type quantities such as

Tr(IαLq)d\operatorname{Tr}(I-\alpha L_{q})^{d}

can be interpreted as low-frequency spectral masses and, in the hard ordinary-complex setting, as smooth surrogates for normalized Betti numbers, since zero modes contribute one to the trace while nonzero modes are suppressed by the polynomial filter. This structure is closely related to stochastic trace estimation and polynomial approximations of spectral projectors. It also has a structural connection to quantum trace-estimation approaches for normalized Betti numbers, which often formulate Betti numbers through the null space of combinatorial Laplacians. However, this paper does not implement a quantum algorithm and does not claim quantum advantage. We therefore treat the quantum connection as a possible computational direction rather than as a main contribution (see [lloyd2016quantum, akhalwaya2024comparing, gyurik2022towards, yamauchi2025quantum]).

The contributions of this work are summarized as follows.

  1. 1.

    We construct an ambient Hodge-spectral relaxation for finite simplicial complexes. In the hard limit, the penalty-regularized ambient operator recovers the ordinary Hodge Laplacian on the active subcomplex and hence the corresponding Betti number.

  2. 2.

    We define differentiable topological losses using heat filters, resolvent filters, and polynomial Laplacian moments. These objectives include spectral-filter matching losses and trace-type Betti surrogates.

  3. 3.

    For Vietoris–Rips filtrations of point clouds, we numerically compare the proposed Hodge spectral-filter losses with persistent-homology-based losses and show that they provide more spatially distributed gradients, smoother scale-normalized behavior under persistence-pairing changes, and update directions that better reflect surrounding geometry.

  4. 4.

    For graph clique complexes, we use Laplacian moments to control normalized first-Betti-type quantities and show that the proposed topological loss can be combined with ordinary graph-feature objectives.

  5. 5.

    We discuss computational trade-offs and identify polynomial filtering, Chebyshev approximation, and trace-estimation-based implementations as possible routes toward larger-scale applications.

In summary, this work presents a Hodge-spectral framework for controlling homological constraints in point clouds and graphs. By replacing discrete topological quantities with smooth low-pass spectral surrogates, the proposed method enables topology to be treated not only as a descriptor of data, but also as a controllable object in optimization, network design, machine learning, and future quantum-assisted topological computation.

2 Preliminaries and Problem Setting

In this section, we summarize the topological quantities considered in this work and the difficulties that arise when using them as loss functions. We first introduce the Betti numbers of clique complexes constructed from graphs, and describe why it is difficult to directly use them as loss functions. We then introduce persistent homology for point-cloud data and loss functions based on barcode diagrams. Finally, we discuss three issues of barcode-based loss functions: gradient localization, discontinuity caused by changes in persistence pairings, and loss of geometric and spectral information.

2.1 Clique Complexes and Betti Numbers

We first introduce the Betti numbers of clique complexes as topological quantities for graph data. Let G=(V,E)G=(V,E) be a graph, where VV is the set of vertices and EE is the set of edges. The clique complex X(G)X(G) of GG is a simplicial complex obtained by regarding each clique in the graph as a simplex. That is, a subset of vertices

σ={v0,,vk}V\sigma=\{v_{0},\ldots,v_{k}\}\subset V

is a kk-simplex if, for any distinct i,ji,j,

(vi,vj)E(v_{i},v_{j})\in E

holds. Equivalently, σ\sigma must form a (k+1)(k+1)-clique. Therefore, the clique complex is defined as

X(G)={σV:σ is a clique in G}.X(G)=\{\sigma\subset V:\sigma\text{ is a clique in }G\}.

Let Ck(X(G))C_{k}(X(G)) be the kk-th chain group of X(G)X(G), and let

k:Ck(X(G))Ck1(X(G))\partial_{k}:C_{k}(X(G))\to C_{k-1}(X(G))

be the boundary operator. The kk-th homology group is defined by

Hk(X(G))=kerk/imk+1.H_{k}(X(G))=\ker\partial_{k}/\operatorname{im}\partial_{k+1}.

The dimension of this homology group, βk(X(G))=dimHk(X(G))\beta_{k}(X(G))=\dim H_{k}(X(G)) is called the kk-th Betti number. In particular, β0\beta_{0} represents the number of connected components, while β1\beta_{1} represents the number of independent one-dimensional cycles, namely loops that are not filled by triangles or higher-dimensional simplices.

The Betti numbers of clique complexes capture higher-order connectivity and cyclic structures of graphs. Therefore, controlling the Betti numbers of clique complexes in graph generation or network design is meaningful for controlling redundancy and robustness of graph structures.

2.2 Difficulties in Using Betti Numbers as Loss Functions

Suppose that we want to optimize a graph so that the Betti number of its clique complex approaches a target value. For example, for a target Betti number βktar\beta_{k}^{\mathrm{tar}}, one may consider the loss function

Lβ(G)=(βk(X(G))βktar)2.L_{\beta}(G)=\left(\beta_{k}(X(G))-\beta_{k}^{\mathrm{tar}}\right)^{2}.

This formulation is natural from an intuitive viewpoint. However, it is difficult to optimize this loss function using standard gradient-based methods.

The reason is that the map

GX(G)βk(X(G))G\longmapsto X(G)\longmapsto\beta_{k}(X(G))

is combinatorial and discrete. A small change in the presence or absence of edges can discontinuously change the set of simplices contained in the clique complex. As a result, the ranks of boundary operators and the dimensions of homology groups can also change discontinuously.

2.3 Persistent Homology

Next, we introduce persistent homology[edelsbrunner2002topological, zomorodian2004computing, cohen2005stability, edelsbrunner2008persistent, otter2017roadmap] as a topological quantity for point-cloud data and related data types. Suppose that, from data XX, we construct a sequence of simplicial complexes

K0K1KmK_{0}\subset K_{1}\subset\cdots\subset K_{m}

according to a scale parameter. Such a nested sequence of simplicial complexes is called a filtration. For example, for point-cloud data, one can construct a filtration using Vietoris–Rips complexes indexed by a distance scale rr.

At each scale rr, the kk-th homology group is

Hk(Kr)=kerk/imk+1.H_{k}(K_{r})=\ker\partial_{k}/\operatorname{im}\partial_{k+1}.

Persistent homology tracks when homology classes are born and when they die along the filtration. For a homology class γ\gamma, let its birth time be bγb_{\gamma} and its death time be dγd_{\gamma}. Then the kk-th persistence diagram is represented as

Dgmk(X)={(bγ,dγ)}γ2.\operatorname{Dgm}_{k}(X)=\{(b_{\gamma},d_{\gamma})\}_{\gamma}\subset\mathbb{R}^{2}.

The persistence of γ\gamma is defined by pers(γ)=dγbγ\operatorname{pers}(\gamma)=d_{\gamma}-b_{\gamma}.

Persistence diagrams and barcode diagrams provide compact summaries of the multiscale topological structure of data. Homology classes with long persistence are often interpreted as reflecting essential structures of the data rather than noise. For this reason, persistent homology is widely used for shape analysis of point clouds, images, time-series data, and related objects.

2.4 Loss Functions Based on Barcode Diagrams

One way to incorporate persistent homology into machine learning is to construct loss functions from barcode diagrams or persistence diagrams. In general, given a target persistence diagram DtarD^{\mathrm{tar}}, one may define a loss function for input data XX as

LPD(X)=(Dgmk(X),Dtar).L_{\mathrm{PD}}(X)=\ell\left(\operatorname{Dgm}_{k}(X),D^{\mathrm{tar}}\right).

For example, if the goal is to remove undesired homology classes, let UU be the set of undesired classes. Then one can use a loss of the form

LPD(X)=γU(dγbγ)2.L_{\mathrm{PD}}(X)=\sum_{\gamma\in U}(d_{\gamma}-b_{\gamma})^{2}.

This loss acts to shorten the lifetimes of undesired homology classes.

On the other hand, if the goal is to preserve specific homology classes, let TT be the set of classes to be preserved, and let τγ\tau_{\gamma} be the target persistence. Then one can consider the loss function

LPD(X)=γT((dγbγ)τγ)2.L_{\mathrm{PD}}(X)=\sum_{\gamma\in T}\left((d_{\gamma}-b_{\gamma})-\tau_{\gamma}\right)^{2}.

Such barcode-based loss functions can be differentiated through birth and death values. The birth and death of each homology class γ\gamma are determined by certain critical simplices. That is, one can write

bγ=f(σb),dγ=f(σd),b_{\gamma}=f(\sigma_{b}),\qquad d_{\gamma}=f(\sigma_{d}),

where σb\sigma_{b} is the simplex that determines the birth, σd\sigma_{d} is the simplex that determines the death, and ff is the function assigning filtration values to simplices.

Then the gradient of the loss function can formally be written as

XLPD=LPDbγXf(σb)+LPDdγXf(σd).\nabla_{X}L_{\mathrm{PD}}=\frac{\partial L_{\mathrm{PD}}}{\partial b_{\gamma}}\nabla_{X}f(\sigma_{b})+\frac{\partial L_{\mathrm{PD}}}{\partial d_{\gamma}}\nabla_{X}f(\sigma_{d}).

Thus, in barcode-based loss functions, gradients can be computed through the critical simplices that determine birth and death values.

2.5 Issues of Barcode-Based Loss Functions

This property makes it possible to use persistent homology as a loss function. However, this formulation has the following three important issues.

2.5.1 Gradient Localization

The first issue is that gradients tend to be localized on a small number of critical simplices(see [nigmetov2024topological, nigmetov2024topological_2]).

As described above, in barcode-based loss functions, the birth and death of each homology class are determined by critical simplices σb\sigma_{b} and σd\sigma_{d}. Therefore, the gradient of the loss mainly flows through

Xf(σb),Xf(σd).\nabla_{X}f(\sigma_{b}),\qquad\nabla_{X}f(\sigma_{d}).

In other words, the actual updates tend to be concentrated on the vertices contained in these critical simplices and their neighborhoods.

However, topological structures are inherently determined by the global configuration of the data. For example, when a point cloud forms a large loop, the birth or death of the loop may be represented by a small number of critical simplices, but geometrically the shape of the entire loop is important. Nevertheless, barcode-based losses concentrate gradients on a small number of simplices and may fail to provide smooth update directions for the entire data set.

Such gradient localization can make optimization unstable and may prevent geometrically natural deformations.

2.5.2 Discontinuity of Gradients Caused by Pairing Changes

The second issue is that gradients can change discontinuously when persistence pairings change due to small perturbations of filtration values.

In persistent homology, each homology class γ\gamma is associated with a birth–death pair of simplices

(σb,σd).(\sigma_{b},\sigma_{d}).

However, when a small perturbation is added to the input data XX, the ordering of filtration values of simplices may change. As a result, the persistence pairing may change as

(σb,σd)(σb,σd).(\sigma_{b},\sigma_{d})\longrightarrow(\sigma^{\prime}_{b},\sigma^{\prime}_{d}).

In this case, even if the persistence diagram itself remains stable, the simplices through which gradients flow switch from

Xf(σb),Xf(σd)Xf(σb),Xf(σd).\nabla_{X}f(\sigma_{b}),~\nabla_{X}f(\sigma_{d})\quad\longrightarrow\quad\nabla_{X}f(\sigma^{\prime}_{b}),~\nabla_{X}f(\sigma^{\prime}_{d}).

Consequently, even if the value of the loss function does not change significantly, the gradient direction may change discontinuously.

This issue is important from the viewpoint of optimization. Gradient-based methods update parameters using local changes of the loss function. If the gradient direction changes discontinuously under small perturbations, stable optimization becomes difficult.

2.5.3 Loss of Geometric and Spectral Information

The third issue is that barcode diagrams are representations specialized for birth–death information of homology classes, and they do not sufficiently preserve surrounding geometric and spectral information.

A persistence diagram is obtained from input data XX through the map

XDgmk(X)={(bγ,dγ)}γ.X\longmapsto\operatorname{Dgm}_{k}(X)=\{(b_{\gamma},d_{\gamma})\}_{\gamma}.

This representation compactly summarizes the scales at which each homology class is born and dies. However, in this process, much information is compressed, including the geometric locations supporting the homology classes, the surrounding neighborhood structure, and spectral information such as eigenvectors and low-frequency modes of Laplacians.

For example, two homology classes may have the same persistence, but they may be arranged differently in the data space, may have different degrees of geometric smoothness as cycles, or may contain different near-closed structures. Barcode diagrams do not sufficiently distinguish such differences.

Therefore, while barcode-based loss functions are effective for controlling topological summary quantities, they may be insufficient for providing geometrically natural and smooth update directions for the entire data.

2.6 Problem Addressed in This Work

The preceding subsections identify two related but distinct uses of topology in data analysis and optimization. The first is descriptive: one computes Betti numbers, persistent homology, or barcode diagrams of a fixed data set in order to describe its global structure. The second is prescriptive: one seeks to optimize a point cloud, graph, or model output so that its topology satisfies a desired condition. This paper focuses on the second use.

The difficulty is that the topological quantities considered above do not directly provide smooth optimization signals. For clique complexes of graphs, the map

GX(G)βk(X(G))G\longmapsto X(G)\longmapsto\beta_{k}(X(G))

is combinatorial, because the set of simplices and the ranks of the boundary operators can change discontinuously when edges are added or removed. For Vietoris–Rips filtrations, barcode-based losses make persistent homology differentiable through birth and death values, but the resulting gradients may be localized on critical simplices, may change abruptly when persistence pairings switch, and may discard geometric and spectral information that could be useful for optimization.

Our goal is therefore not merely to compute topological summaries, but to construct differentiable objectives that retain a clear relationship with homology while providing smoother and more spatially distributed optimization signals. We use the Hodge-theoretic identity between homology and zero modes of the combinatorial Laplacian as the starting point. For a fixed finite simplicial complex KK, the kernel of the qq-th Hodge Laplacian Lq(K)L_{q}(K) is naturally isomorphic to Hq(K)H_{q}(K), and hence dimkerLq(K)=βq(K)\dim\ker L_{q}(K)=\beta_{q}(K).

The main question is how to use this identity when the simplicial complex itself changes with continuous parameters. In the next section, we address this question by embedding changing complexes into a fixed ambient chain space, separating inactive simplex directions by a penalty term, and then replacing hard simplex indicators with soft simplex weights. This leads to a Hodge-Laplacian-type spectral relaxation that recovers the ordinary Hodge-theoretic interpretation in the hard limit and provides differentiable low-frequency spectral quantities in the soft regime.

3 Construction of Hodge-Laplacian-type spectral relaxation

In this section, we introduce ambient Hodge-spectral relaxations as the basic objects for constructing differentiable topological losses for both point-cloud data and graph data. The construction is exact in the hard simplicial-complex limit and is used as a smooth spectral surrogate in the soft regime.

The basic idea of this work is to represent both point-cloud data and graph data as soft graphs and then construct soft clique complexes from them. In graph generation, edge logits are used as optimization variables. In Vietoris–Rips filtrations, edge logits are defined from pairwise distances and a scale parameter. Therefore, both settings can be described by the common sequence

θae(θ)pe(θ)wσ(θ)L^q(θ).\theta\longmapsto a_{e}(\theta)\longmapsto p_{e}(\theta)\longmapsto w_{\sigma}(\theta)\longmapsto\widehat{L}_{q}(\theta).

Here, aea_{e} denotes an edge logit, pep_{e} denotes a soft edge activation, wσw_{\sigma} denotes a simplex activation, and L^q\widehat{L}_{q} denotes a penalty-regularized Hodge-Laplacian-type spectral relaxation.

The terminology is important: for hard simplex indicators, the construction recovers an ordinary Hodge Laplacian on the active subcomplex. For soft simplex weights, the same formula should be interpreted as a differentiable Hodge-spectral surrogate rather than as the Hodge Laplacian of an exact chain complex.

For the ordinary Hodge Laplacian, zero modes correspond to homology classes, and the dimension of the kernel of the qq-th Hodge Laplacian is equal to the qq-th Betti number. However, when the simplicial complex changes according to input data or model parameters, the dimension and basis of the corresponding chain space may also change. Therefore, it is difficult to directly treat the ordinary Hodge Laplacian as a differentiable object on a fixed vector space.

To address this issue, we introduce a fixed ambient chain space containing all candidate simplices. On this space, we construct a hard Hodge Laplacian using projections and a penalty term so that the number of zero modes agrees with the Betti number of the active subcomplex. We then relax the projections using simplex activations obtained from a soft graph, yielding a Hodge-Laplacian-type spectral relaxation that depends smoothly on the parameters and recovers the ordinary Hodge-theoretic interpretation in the hard limit.

3.1 Ambient Chain Spaces and Boundary Operators

Let KmaxK_{\max} be a finite fixed ambient simplicial complex containing all candidate simplices that may appear during optimization. For example, given a candidate edge set EmaxE_{\max}, one may define the maximum candidate graph

Gmax=(V,Emax)G_{\max}=(V,E_{\max})

and use its clique complex Kmax=X(Gmax)K_{\max}=X(G_{\max}) as the ambient complex.

For each dimension qq, define the ambient qq-chain space by

Cqamb=Cq(Kmax).C_{q}^{\mathrm{amb}}=C_{q}(K_{\max}).

This is a finite-dimensional real vector space generated by all oriented qq-simplices in KmaxK_{\max}. After fixing orientations and equipping it with the standard inner product, CqambC_{q}^{\mathrm{amb}} can be regarded as a finite-dimensional Hilbert space.

Let Bq:CqambCq1ambB_{q}:C_{q}^{\mathrm{amb}}\to C_{q-1}^{\mathrm{amb}} be the ambient boundary operator of the fixed complex KmaxK_{\max}. For an oriented simplex σ=[v0,,vq]\sigma=[v_{0},\ldots,v_{q}],

qσ=i=0q(1)i[v0,,vi^,,vq],\partial_{q}\sigma=\sum_{i=0}^{q}(-1)^{i}[v_{0},\ldots,\widehat{v_{i}},\ldots,v_{q}],

and the matrix representation of this operator is BqB_{q}. Since the boundary of a boundary is zero, we have BqBq+1=0B_{q}B_{q+1}=0.

Suppose that a hard subcomplex K(θ)KmaxK(\theta)\subset K_{\max} is determined by a parameter θ\theta. For each qq-simplex σKmax(q)\sigma\in K_{\max}^{(q)}, define the activity indicator by

χσ(θ)={1,σK(θ),0,σK(θ).\chi_{\sigma}(\theta)=\begin{cases}1,&\sigma\in K(\theta),\\ 0,&\sigma\notin K(\theta).\end{cases}

The orthogonal projection onto the active qq-simplex directions is defined by

Πq(θ)=diag(χσ(θ))σKmax(q).\Pi_{q}(\theta)=\operatorname{diag}\left(\chi_{\sigma}(\theta)\right)_{\sigma\in K_{\max}^{(q)}}.

Then the qq-chain space of K(θ)K(\theta) is embedded as

Cq(K(θ))=ImΠq(θ)Cqamb.C_{q}(K(\theta))=\operatorname{Im}\Pi_{q}(\theta)\subset C_{q}^{\mathrm{amb}}.

Since K(θ)K(\theta) is a simplicial complex, every face of an active simplex is also active. Hence,

BqImΠq(θ)ImΠq1(θ),B_{q}\operatorname{Im}\Pi_{q}(\theta)\subset\operatorname{Im}\Pi_{q-1}(\theta),

or equivalently,

Πq1(θ)BqΠq(θ)=BqΠq(θ).\Pi_{q-1}(\theta)B_{q}\Pi_{q}(\theta)=B_{q}\Pi_{q}(\theta).

We define the hard restricted boundary operator on the ambient space by

Bqhard(θ)=Πq1(θ)BqΠq(θ).B_{q}^{\mathrm{hard}}(\theta)=\Pi_{q-1}(\theta)B_{q}\Pi_{q}(\theta).

On the active subspace Cq(K(θ))=ImΠq(θ)C_{q}(K(\theta))=\operatorname{Im}\Pi_{q}(\theta), this agrees with the ordinary boundary map of the subcomplex K(θ)K(\theta).

3.2 Penalty-Regularized Hard Ambient Hodge Laplacian

A naive Hodge operator on the fixed ambient space CqambC_{q}^{\mathrm{amb}} would be

(Bqhard(θ))Bqhard(θ)+Bq+1hard(θ)(Bq+1hard(θ)).\left(B_{q}^{\mathrm{hard}}(\theta)\right)^{\top}B_{q}^{\mathrm{hard}}(\theta)+B_{q+1}^{\mathrm{hard}}(\theta)\left(B_{q+1}^{\mathrm{hard}}(\theta)\right)^{\top}.

However, since this operator acts on the entire ambient space, inactive simplex directions may remain as spurious zero modes that do not correspond to homology classes of the active subcomplex. To remove them, we add an inactive-direction penalty. For μ>0\mu>0, define

Lqhard(θ)=(Bqhard(θ))Bqhard(θ)+Bq+1hard(θ)(Bq+1hard(θ))+μ(IΠq(θ)).L_{q}^{\mathrm{hard}}(\theta)=\left(B_{q}^{\mathrm{hard}}(\theta)\right)^{\top}B_{q}^{\mathrm{hard}}(\theta)+B_{q+1}^{\mathrm{hard}}(\theta)\left(B_{q+1}^{\mathrm{hard}}(\theta)\right)^{\top}+\mu\left(I-\Pi_{q}(\theta)\right).

The last term assigns energy μ\mu to inactive qq-simplex directions and removes them from the low-eigenvalue region.

With respect to the orthogonal decomposition

Cqamb=ImΠq(θ)kerΠq(θ),C_{q}^{\mathrm{amb}}=\operatorname{Im}\Pi_{q}(\theta)\oplus\ker\Pi_{q}(\theta),

we have

Lqhard(θ)=Lq(K(θ))μIkerΠq(θ).L_{q}^{\mathrm{hard}}(\theta)=L_{q}(K(\theta))\oplus\mu I_{\ker\Pi_{q}(\theta)}.

Therefore,

kerLqhard(θ)=kerLq(K(θ))Hq(K(θ)).\ker L_{q}^{\mathrm{hard}}(\theta)=\ker L_{q}(K(\theta))\simeq H_{q}(K(\theta)).

Hence,

dimkerLqhard(θ)=βq(K(θ)).\dim\ker L_{q}^{\mathrm{hard}}(\theta)=\beta_{q}(K(\theta)).

Thus, the number of zero modes of the penalty-regularized hard ambient Hodge Laplacian agrees with the qq-th Betti number of the active subcomplex.

Proposition 1 (Hard-limit consistency of the ambient Hodge Laplacian).

Let K(θ)KmaxK(\theta)\subset K_{\max} be an active subcomplex and let Lqhard(θ)L_{q}^{\mathrm{hard}}(\theta) be the penalty-regularized hard ambient Hodge Laplacian defined above. Then, with respect to the orthogonal decomposition

Cqamb=Cq(K(θ))Cq(K(θ)),C_{q}^{\mathrm{amb}}=C_{q}(K(\theta))\oplus C_{q}(K(\theta))^{\perp},

one has

Lqhard(θ)=Lq(K(θ))μICq(K(θ)).L_{q}^{\mathrm{hard}}(\theta)=L_{q}(K(\theta))\oplus\mu I_{C_{q}(K(\theta))^{\perp}}.

Consequently,

kerLqhard(θ)Hq(K(θ)),dimkerLqhard(θ)=βq(K(θ)).\ker L_{q}^{\mathrm{hard}}(\theta)\simeq H_{q}(K(\theta)),\qquad\dim\ker L_{q}^{\mathrm{hard}}(\theta)=\beta_{q}(K(\theta)).
Proof.

On the active subspace Cq(K(θ))=ImΠq(θ)C_{q}(K(\theta))=\operatorname{Im}\Pi_{q}(\theta), the projected boundary operators coincide with the ordinary boundary operators of the subcomplex K(θ)K(\theta), because K(θ)K(\theta) is closed under taking faces. On the inactive orthogonal complement, the projected boundary terms vanish and the penalty term acts as multiplication by μ\mu. Thus the operator decomposes as

Lq(K(θ))μICq(K(θ)).L_{q}(K(\theta))\oplus\mu I_{C_{q}(K(\theta))^{\perp}}.

The statement follows from the finite-dimensional Hodge decomposition, which identifies kerLq(K(θ))\ker L_{q}(K(\theta)) with Hq(K(θ))H_{q}(K(\theta)). ∎

3.3 Soft Graphs and Hodge-Laplacian-type spectral relaxation

We now relax the hard projection Πq(θ)\Pi_{q}(\theta) using a soft graph and construct a differentiable spectral relaxation of the hard ambient operator. Let V={1,,n}V=\{1,\ldots,n\} be the vertex set, and let Emax(V2)E_{\max}\subset\binom{V}{2} be the set of candidate edges. For each candidate edge eEmaxe\in E_{\max}, define an edge logit ae(θ)a_{e}(\theta)\in\mathbb{R}, and define the corresponding soft edge activation by

pe(θ)=σ(ae(θ))=11+exp(ae(θ)/εe).p_{e}(\theta)=\sigma(a_{e}(\theta))=\frac{1}{1+\exp(-a_{e}(\theta)/\varepsilon_{e})}.

We call Gθ=(V,Emax,pθ)G_{\theta}=(V,E_{\max},p_{\theta}) a soft graph.

The activation or soft weight of a simplex σ\sigma is defined as the product of the activations of its constituent edges:

wσ(θ)=eσpe(θ).w_{\sigma}(\theta)=\prod_{e\subset\sigma}p_{e}(\theta).

Vertices are assumed to always exist, so wv(θ)=1w_{v}(\theta)=1. In particular, for an edge ee, we=pew_{e}=p_{e}, and for a triangle t={i,j,k}t=\{i,j,k\},

wt=pijpikpjk.w_{t}=p_{ij}p_{ik}p_{jk}.

In the hard limit pe𝟏eEp_{e}\to\mathbf{1}_{e\in E}, we have

wσ(θ)𝟏σX(G).w_{\sigma}(\theta)\to\mathbf{1}_{\sigma\in X(G)}.

Thus, this construction is a continuous relaxation of the ordinary clique complex at the level of simplex indicators.

For each dimension qq, define

Wq(θ)=diag(wσ(θ))σKmax(q),Rq(θ)=Wq(θ)1/2.W_{q}(\theta)=\operatorname{diag}\left(w_{\sigma}(\theta)\right)_{\sigma\in K_{\max}^{(q)}},\qquad R_{q}(\theta)=W_{q}(\theta)^{1/2}.

Using these matrices, define the weighted boundary-type operator by

B~q(θ)=Rq1(θ)BqRq(θ).\widetilde{B}_{q}(\theta)=R_{q-1}(\theta)B_{q}R_{q}(\theta).

In components,

(B~q(θ))τ,σ=wτ(θ)(Bq)τ,σwσ(θ).\left(\widetilde{B}_{q}(\theta)\right)_{\tau,\sigma}=\sqrt{w_{\tau}(\theta)}(B_{q})_{\tau,\sigma}\sqrt{w_{\sigma}(\theta)}.

Thus, the boundary relation between a simplex and its face is smoothly weighted by their activations.

Remark 1 (Soft weighted boundaries).

For hard simplex indicators, the projected boundary operators recover the ordinary boundary maps of the active subcomplex. For soft weights, however, the matrices

B~q(θ)=Rq1(θ)BqRq(θ)\widetilde{B}_{q}(\theta)=R_{q-1}(\theta)B_{q}R_{q}(\theta)

do not generally satisfy

B~q1(θ)B~q(θ)=0.\widetilde{B}_{q-1}(\theta)\widetilde{B}_{q}(\theta)=0.

Thus, in the soft regime, these matrices should not be interpreted as boundary maps of an exact chain complex. They are instead boundary-inspired weighted operators used to define a smooth Hodge-Laplacian-type spectral relaxation. The exact homological interpretation is recovered in the hard limit.

We define the qq-th penalty-regularized Hodge-Laplacian-type spectral relaxation by

L^q(θ)=B~q(θ)B~q(θ)+B~q+1(θ)B~q+1(θ)+μ(IWq(θ)).\widehat{L}_{q}(\theta)=\widetilde{B}_{q}(\theta)^{\top}\widetilde{B}_{q}(\theta)+\widetilde{B}_{q+1}(\theta)\widetilde{B}_{q+1}(\theta)^{\top}+\mu\left(I-W_{q}(\theta)\right).

The first term is the lower Laplacian-type term, the second term is the upper Laplacian-type term, and the third term is the inactive-direction penalty.

If wσ(θ)1w_{\sigma}(\theta)\approx 1, the penalty has almost no effect on the corresponding direction. If wσ(θ)0w_{\sigma}(\theta)\approx 0, the direction receives approximately energy μ\mu. In the hard limit wσ(θ)χσ(θ)w_{\sigma}(\theta)\to\chi_{\sigma}(\theta), we have

Wq(θ)Πq(θ),Rq(θ)Πq(θ),L^q(θ)Lqhard(θ).W_{q}(\theta)\to\Pi_{q}(\theta),\qquad R_{q}(\theta)\to\Pi_{q}(\theta),\qquad\widehat{L}_{q}(\theta)\to L_{q}^{\mathrm{hard}}(\theta).

Therefore, L^q(θ)\widehat{L}_{q}(\theta) is a differentiable Hodge-spectral relaxation of the hard Hodge Laplacian that recovers the Betti-number interpretation in the hard limit. Away from the hard limit, its low-frequency spectrum should be interpreted as a spectral surrogate for homological structure rather than as an exact Betti-number representation.

Proposition 2 (Convergence to the hard ambient operator).

Let wσ(m)[0,1]w_{\sigma}^{(m)}\in[0,1] be a sequence of simplex weights converging to hard activity indicators χσ{0,1}\chi_{\sigma}\in\{0,1\} for all simplices of KmaxK_{\max}. Let L^q(w(m))\widehat{L}_{q}(w^{(m)}) be the corresponding penalty-regularized Hodge-Laplacian-type spectral relaxation. Then

L^q(w(m))Lqhard(K)\widehat{L}_{q}(w^{(m)})\longrightarrow L_{q}^{\mathrm{hard}}(K)

in operator norm, where

K={σKmax:χσ=1}.K=\{\sigma\in K_{\max}:\chi_{\sigma}=1\}.
Proof.

The ambient boundary matrices BqB_{q} are fixed finite matrices. The diagonal matrices Wq(w(m))W_{q}(w^{(m)}) converge entrywise to Πq\Pi_{q}, and the square-root matrices Rq(w(m))=Wq(w(m))1/2R_{q}(w^{(m)})=W_{q}(w^{(m)})^{1/2} converge entrywise to Πq\Pi_{q}. Since all matrices are finite-dimensional, entrywise convergence implies convergence in operator norm. Therefore,

Rq1(w(m))BqRq(w(m))Πq1BqΠq,R_{q-1}(w^{(m)})B_{q}R_{q}(w^{(m)})\longrightarrow\Pi_{q-1}B_{q}\Pi_{q},

and the corresponding lower, upper, and penalty terms converge to those of Lqhard(K)L_{q}^{\mathrm{hard}}(K). ∎

3.4 Logit Parameterization for Graph Clique Complexes

In graph generation or graph-structured optimization, the optimization variables are the edge logits

θ=a={ae}eEmax.\theta=a=\{a_{e}\}_{e\in E_{\max}}.

Then pe(a)=σ(ae)p_{e}(a)=\sigma(a_{e}) and

wσ(a)=eσpe(a).w_{\sigma}(a)=\prod_{e\subset\sigma}p_{e}(a).

Thus, the Hodge-Laplacian-type spectral relaxation is obtained through

ape(a)wσ(a)L^q(a).a\longmapsto p_{e}(a)\longmapsto w_{\sigma}(a)\longmapsto\widehat{L}_{q}(a).

For controlling one-dimensional cyclic structure, which is measured by the first Betti number in the hard clique-complex limit, we use q=1q=1 and define

L^1(a)=B~1(a)B~1(a)+B~2(a)B~2(a)+μ(IW1(a)).\widehat{L}_{1}(a)=\widetilde{B}_{1}(a)^{\top}\widetilde{B}_{1}(a)+\widetilde{B}_{2}(a)\widetilde{B}_{2}(a)^{\top}+\mu(I-W_{1}(a)).

This operator acts on the fixed ambient edge space C1ambC_{1}^{\mathrm{amb}}. Hence, graph optimization or graph-structured topology control can be performed in a fixed-dimensional Hilbert space, even though the underlying hard clique complex changes combinatorially.

3.5 Soft-Graph Representation of Vietoris–Rips Filtrations

We next represent Vietoris–Rips filtrations of point-cloud data within the same soft-graph and Hodge-spectral framework. Let

X={x1,,xn}dX=\{x_{1},\ldots,x_{n}\}\subset\mathbb{R}^{d}

be a point cloud. In the ordinary Vietoris–Rips complex at scale r>0r>0, an edge (i,j)(i,j) is present when xixjr\|x_{i}-x_{j}\|\leq r.

To relax this condition smoothly, define the stabilized distance with a small numerical parameter δ>0\delta>0

dij(X)=xixj2+δ,d_{ij}(X)=\sqrt{\|x_{i}-x_{j}\|^{2}+\delta},

and define the Vietoris–Rips edge logit by

aij(r)(X)=rdij(X)ε.a_{ij}^{(r)}(X)=\frac{r-d_{ij}(X)}{\varepsilon}.

Here, ε>0\varepsilon>0 controls the softness of the threshold. The corresponding soft edge activation is

pij(r)(X)=σ(aij(r)(X))=11+exp(rdij(X)ε).p_{ij}^{(r)}(X)=\sigma\left(a_{ij}^{(r)}(X)\right)=\frac{1}{1+\exp\left(-\frac{r-d_{ij}(X)}{\varepsilon}\right)}.

This value is close to one when dij(X)<rd_{ij}(X)<r, and close to zero when dij(X)>rd_{ij}(X)>r. In the hard-threshold limit ε0\varepsilon\to 0,

pij(r)(X)𝟏dij(X)r.p_{ij}^{(r)}(X)\to\mathbf{1}_{d_{ij}(X)\leq r}.

The activation of a simplex σ\sigma is defined by

wσ(r)(X)=eσpe(r)(X).w_{\sigma}^{(r)}(X)=\prod_{e\subset\sigma}p_{e}^{(r)}(X).

In the hard limit,

wσ(r)(X)𝟏σVRr(X),w_{\sigma}^{(r)}(X)\to\mathbf{1}_{\sigma\in\mathrm{VR}_{r}(X)},

because the Vietoris–Rips condition is equivalent to requiring all edges in σ\sigma to have length at most rr, namely maxeσde(X)r\max_{e\subset\sigma}d_{e}(X)\leq r. Thus, the soft clique construction recovers the ordinary Vietoris–Rips complex at each fixed scale in the hard-threshold limit.

Thus, the Vietoris–Rips filtration can be represented as the soft clique complex of a soft graph whose edge logits are

aij(r)(X)=rdij(X)ε.a_{ij}^{(r)}(X)=\frac{r-d_{ij}(X)}{\varepsilon}.

For each scale rr, we obtain the sequence

Xa(r)(X)p(r)(X)w(r)(X)L^q(r)(X).X\longmapsto a^{(r)}(X)\longmapsto p^{(r)}(X)\longmapsto w^{(r)}(X)\longmapsto\widehat{L}_{q}^{(r)}(X).

More explicitly, define

Wq(r)(X)=diag(wσ(r)(X))σKmax(q),Rq(r)(X)=(Wq(r)(X))1/2,W_{q}^{(r)}(X)=\operatorname{diag}\left(w_{\sigma}^{(r)}(X)\right)_{\sigma\in K_{\max}^{(q)}},\qquad R_{q}^{(r)}(X)=\left(W_{q}^{(r)}(X)\right)^{1/2},

and

B~q(r)(X)=Rq1(r)(X)BqRq(r)(X).\widetilde{B}_{q}^{(r)}(X)=R_{q-1}^{(r)}(X)B_{q}R_{q}^{(r)}(X).

Then

L^q(r)(X)=(B~q(r)(X))B~q(r)(X)+B~q+1(r)(X)(B~q+1(r)(X))+μ(IWq(r)(X)).\widehat{L}_{q}^{(r)}(X)=\left(\widetilde{B}_{q}^{(r)}(X)\right)^{\top}\widetilde{B}_{q}^{(r)}(X)+\widetilde{B}_{q+1}^{(r)}(X)\left(\widetilde{B}_{q+1}^{(r)}(X)\right)^{\top}+\mu\left(I-W_{q}^{(r)}(X)\right).

This is the Vietoris–Rips instance of the Hodge-Laplacian-type spectral relaxation defined above. For controlling one-dimensional homology, we use q=1q=1.

3.6 Multi-Scale Vietoris–Rips Construction

Persistent homology describes the birth and death of homology classes across scales. Therefore, instead of using a single scale, we consider a sequence

r1<r2<<rM.r_{1}<r_{2}<\cdots<r_{M}.

For each scale rmr_{m}, define

pij(rm)(X)=σ(rmdij(X)ε).p_{ij}^{(r_{m})}(X)=\sigma\left(\frac{r_{m}-d_{ij}(X)}{\varepsilon}\right).

This yields a Hodge-Laplacian-type spectral relaxation

L^q(rm)(X)\widehat{L}_{q}^{(r_{m})}(X)

at each scale. The family

{L^q(rm)(X)}m=1M\left\{\widehat{L}_{q}^{(r_{m})}(X)\right\}_{m=1}^{M}

can be regarded as a Hodge-spectral smooth relaxation of the Vietoris–Rips filtration. In the next section, we apply low-pass spectral filters to these operators and define differentiable topological objectives for controlling Betti-type and persistent-homological structures.

4 Topological Objectives via Low-Pass Hodge Spectral Filters

In this section, we define differentiable topological objectives using the penalty-regularized Hodge-Laplacian-type spectral relaxation L^q(θ)\widehat{L}_{q}(\theta) constructed in the previous section.

The basic idea is to regard homology as the zero modes of the Hodge Laplacian and to extract zero and near-zero modes using smooth low-pass spectral filters. For an ordinary simplicial complex KK, the qq-th Betti number satisfies

βq(K)=dimkerLq(K).\beta_{q}(K)=\dim\ker L_{q}(K).

However, directly counting zero eigenvalues, namely the map LqdimkerLqL_{q}\mapsto\dim\ker L_{q}, is discontinuous and is not suitable for gradient-based optimization. Therefore, we use a smooth function f:0f:\mathbb{R}_{\geq 0}\to\mathbb{R} that emphasizes the low-eigenvalue region and use

f(L^q(θ))orTrf(L^q(θ))f(\widehat{L}_{q}(\theta))\quad\text{or}\quad\operatorname{Tr}f(\widehat{L}_{q}(\theta))

as differentiable spectral surrogates for topological information. In the hard ordinary-complex case, these quantities approximate information about the kernel of the Hodge Laplacian. In the soft regime, they should be interpreted as low-frequency spectral quantities associated with the Hodge-Laplacian-type relaxation, rather than as exact Betti numbers.

We first introduce low-pass spectral filters and define two types of losses: spectral-filter matching losses and trace-type spectral-mass Betti surrogate losses. We then formulate normalized first-Betti-type control for graph clique complexes and topology control for Vietoris–Rips filtrations of point clouds within the same framework. All losses introduced in this section are differentiable with respect to L^q(θ)\widehat{L}_{q}(\theta). Since L^q(θ)\widehat{L}_{q}(\theta) is constructed from soft-graph edge logits, gradients can be propagated analytically to the edge logits. In the Vietoris–Rips setting, the edge logits are functions of point coordinates through pairwise distances, and hence gradients can also be propagated to the point-cloud coordinates. In this section, we focus on the definitions and roles of the objectives; the detailed backward computation and gradient formulas are given in the Appendix.

4.1 Low-Pass Spectral Filters

Let

L(θ)=L^q(θ)L(\theta)=\widehat{L}_{q}(\theta)

be the Hodge-Laplacian-type spectral relaxation, and let

L(θ)=UΛU,Λ=diag(λ1,,λN)L(\theta)=U\Lambda U^{\top},\qquad\Lambda=\operatorname{diag}(\lambda_{1},\ldots,\lambda_{N})

be its eigendecomposition. In the hard ordinary-complex case, homological modes correspond to zero eigenvalues. In addition, near-zero modes with small positive eigenvalues may reflect geometrically meaningful structures such as almost-closed cycles, weakly filled holes, or unstable higher-order structures. Thus, it is useful to treat not only exact zero modes but also the low-eigenvalue region smoothly.

We call a smooth function f:0f:\mathbb{R}_{\geq 0}\to\mathbb{R} a low-pass spectral filter if it emphasizes low eigenvalues and suppresses high eigenvalues. The corresponding matrix function is defined by

f(L)=Uf(Λ)U,f(Λ)=diag(f(λ1),,f(λN)).f(L)=Uf(\Lambda)U^{\top},\qquad f(\Lambda)=\operatorname{diag}(f(\lambda_{1}),\ldots,f(\lambda_{N})).

In this work, we use the following representative filters.

The heat filter is defined by

fheat(λ)=exp(λτ),Fheat(L)=exp(Lτ),f_{\mathrm{heat}}(\lambda)=\exp\left(-\frac{\lambda}{\tau}\right),\qquad F_{\mathrm{heat}}(L)=\exp\left(-\frac{L}{\tau}\right),

where τ>0\tau>0 is a temperature parameter. It preserves zero modes and exponentially suppresses high-eigenvalue modes.

The resolvent filter is defined by

fres(λ)=αλ+α,Fres(L)=α(L+αI)1,f_{\mathrm{res}}(\lambda)=\frac{\alpha}{\lambda+\alpha},\qquad F_{\mathrm{res}}(L)=\alpha(L+\alpha I)^{-1},

where α>0\alpha>0. It also emphasizes low-eigenvalue modes, but its decay is rational rather than exponential.

The polynomial moment filter is defined by

fmom(λ)=(1αλ)d,Fmom(L)=(IαL)d,f_{\mathrm{mom}}(\lambda)=(1-\alpha\lambda)^{d},\qquad F_{\mathrm{mom}}(L)=(I-\alpha L)^{d},

where α>0\alpha>0 and dd\in\mathbb{N}. A zero mode contributes one, while high-eigenvalue modes are suppressed under suitable choices of α\alpha and dd. More precisely, this polynomial behaves as a low-pass filter only when the positive spectrum is mapped inside the unit disk; this condition is made explicit below. Since this filter is expressed as a polynomial of the Laplacian, it does not require eigendecomposition and is compatible with matrix-vector products, stochastic trace estimation, and structural connections to possible quantum or quantum–classical hybrid computation.

As a more stable polynomial approximation, one may also use a Chebyshev polynomial filter. After normalizing the Laplacian by an upper bound or estimate of its largest eigenvalue, for example

L~=2LλmaxIλmax,\widetilde{L}=\frac{2L-\lambda_{\max}I}{\lambda_{\max}},

so that its spectrum is approximately contained in [1,1][-1,1], define

fcheb(L)==0DcT(L~),f_{\mathrm{cheb}}(L)=\sum_{\ell=0}^{D}c_{\ell}T_{\ell}(\widetilde{L}),

where TT_{\ell} is the \ell-th Chebyshev polynomial, cc_{\ell} are coefficients approximating a desired low-pass function, and DD is the polynomial degree. This filter can be computed recursively without eigendecomposition and is useful for approximating low-frequency spectral quantities in large-scale problems.

4.2 Spectral-Filter Matching Loss

One way to use a low-pass filter is to compare low-energy subspaces themselves. Let θtar\theta_{\mathrm{tar}} be a target parameter and define

Ltar=L^q(θtar).L_{\mathrm{tar}}=\widehat{L}_{q}(\theta_{\mathrm{tar}}).

For a low-pass filter ff, we define the spectral-filter matching loss by

sf(θ)=12f(L^q(θ))f(Ltar)F2.\mathcal{L}_{\mathrm{sf}}(\theta)=\frac{1}{2}\left\|f(\widehat{L}_{q}(\theta))-f(L_{\mathrm{tar}})\right\|_{F}^{2}.

This loss compares not only the number of zero modes but also the orientation and distribution of the low-eigenvalue subspace. Therefore, compared with barcode-based losses, it can use richer geometric and spectral information.

For the heat and resolvent filters, this gives

heat(θ)=12exp(L^q(θ)τ)exp(Ltarτ)F2\mathcal{L}_{\mathrm{heat}}(\theta)=\frac{1}{2}\left\|\exp\left(-\frac{\widehat{L}_{q}(\theta)}{\tau}\right)-\exp\left(-\frac{L_{\mathrm{tar}}}{\tau}\right)\right\|_{F}^{2}

and

res(θ)=12α(L^q(θ)+αI)1α(Ltar+αI)1F2.\mathcal{L}_{\mathrm{res}}(\theta)=\frac{1}{2}\left\|\alpha(\widehat{L}_{q}(\theta)+\alpha I)^{-1}-\alpha(L_{\mathrm{tar}}+\alpha I)^{-1}\right\|_{F}^{2}.

These losses act to move the current structure toward the low-energy subspace of the target structure. Although the heat and resolvent filters approximate the spectral projector onto the kernel in suitable parameter limits, they are not idempotent projectors for finite parameter values. For this reason, we refer to these objectives as spectral-filter matching losses rather than projector losses.

4.3 Trace-Type Spectral-Mass Betti Surrogate Loss

Another way to use a low-pass filter is to construct a smooth low-frequency spectral-mass surrogate for Betti-type information from its trace. For a low-pass filter ff, define

Sf(θ)=Trf(L^q(θ))=i=1Nf(λi(θ)).S_{f}(\theta)=\operatorname{Tr}f(\widehat{L}_{q}(\theta))=\sum_{i=1}^{N}f(\lambda_{i}(\theta)).

If f(0)=1f(0)=1 and f(λ)0f(\lambda)\approx 0 for high eigenvalues, then Sf(θ)S_{f}(\theta) approximates the number of zero and low-eigenvalue modes and can be regarded as a smooth surrogate for βq\beta_{q} in the hard ordinary-complex setting, and as a low-frequency spectral mass in the soft setting.

Proposition 3 (Trace approximation of Betti numbers).

Let KK be a finite simplicial complex and let the eigenvalues of the ordinary Hodge Laplacian Lq(K)L_{q}(K) be

0=λ1==λβq<λβq+1λN.0=\lambda_{1}=\cdots=\lambda_{\beta_{q}}<\lambda_{\beta_{q}+1}\leq\cdots\leq\lambda_{N}.

Let f:0f:\mathbb{R}_{\geq 0}\to\mathbb{R} satisfy f(0)=1f(0)=1, and define

εf=maxi>βq|f(λi)|.\varepsilon_{f}=\max_{i>\beta_{q}}|f(\lambda_{i})|.

Then

|Trf(Lq(K))βq(K)|(Nβq)εf.\left|\operatorname{Tr}f(L_{q}(K))-\beta_{q}(K)\right|\leq(N-\beta_{q})\varepsilon_{f}.
Proof.

Since f(Lq(K))f(L_{q}(K)) has eigenvalues f(λi)f(\lambda_{i}), we have

Trf(Lq(K))=i=1Nf(λi)=βq(K)+i=βq+1Nf(λi).\operatorname{Tr}f(L_{q}(K))=\sum_{i=1}^{N}f(\lambda_{i})=\beta_{q}(K)+\sum_{i=\beta_{q}+1}^{N}f(\lambda_{i}).

Taking absolute values gives the stated bound. ∎

Given a target spectral-mass value τq\tau_{q}, define

trace(θ)=12(Sf(θ)τq)2.\mathcal{L}_{\mathrm{trace}}(\theta)=\frac{1}{2}\left(S_{f}(\theta)-\tau_{q}\right)^{2}.

In particular, for the polynomial moment filter, set

Sq,d(θ)=Tr(IαL^q(θ))dS_{q,d}(\theta)=\operatorname{Tr}\left(I-\alpha\widehat{L}_{q}(\theta)\right)^{d}

and define

mom(θ)=12(Sq,d(θ)τq)2.\mathcal{L}_{\mathrm{mom}}(\theta)=\frac{1}{2}\left(S_{q,d}(\theta)-\tau_{q}\right)^{2}.

For an ordinary hard complex, if

ρ=maxλi>0|1αλi|<1,\rho=\max_{\lambda_{i}>0}|1-\alpha\lambda_{i}|<1,

then

|Tr(IαLq(K))dβq(K)|(Nβq)ρd.\left|\operatorname{Tr}(I-\alpha L_{q}(K))^{d}-\beta_{q}(K)\right|\leq(N-\beta_{q})\rho^{d}.

Thus, the parameters α\alpha and dd should be chosen together with an appropriate spectral scaling of the Laplacian. This loss moves the low-frequency spectral mass toward the target value without explicitly counting Betti numbers.

4.4 Normalized Spectral-Moment Betti Surrogate

When graph sizes or ambient chain-space dimensions differ, directly comparing trace values introduces scale dependence due to the number of simplices. Therefore, let

Nq=dimCqambN_{q}=\dim C_{q}^{\mathrm{amb}}

and define the normalized moment

S¯q,d(θ)=1NqTr(IαL^q(θ))d.\overline{S}_{q,d}(\theta)=\frac{1}{N_{q}}\operatorname{Tr}\left(I-\alpha\widehat{L}_{q}(\theta)\right)^{d}.

In the hard ordinary-complex setting, this normalization corresponds to comparing a Betti-type quantity per qq-simplex. In the soft ambient setting, it should be interpreted as normalized low-frequency spectral mass. Given a target normalized value τ¯q\bar{\tau}_{q}, define

norm(θ)=12(S¯q,d(θ)τ¯q)2.\mathcal{L}_{\mathrm{norm}}(\theta)=\frac{1}{2}\left(\overline{S}_{q,d}(\theta)-\bar{\tau}_{q}\right)^{2}.

This normalization yields topological losses that are more comparable across graphs or ambient complexes of different sizes.

4.5 Normalized First-Betti-Type Control in Graph Clique Complexes

In graph generation or graph-structured topology control, the optimization variables are the edge logits

a={ae}eEmax.a=\{a_{e}\}_{e\in E_{\max}}.

The edge activations pe(a)=σ(ae)p_{e}(a)=\sigma(a_{e}) define a soft clique complex and hence a Hodge-Laplacian-type spectral relaxation L^q(a)\widehat{L}_{q}(a). To control one-dimensional cyclic structures in the hard clique-complex limit, we take q=1q=1 and use

L^1(a)=B~1(a)B~1(a)+B~2(a)B~2(a)+μ(IW1(a)).\widehat{L}_{1}(a)=\widetilde{B}_{1}(a)^{\top}\widetilde{B}_{1}(a)+\widetilde{B}_{2}(a)\widetilde{B}_{2}(a)^{\top}+\mu(I-W_{1}(a)).

Define the Laplacian moment and its normalized version by

S1,d(a)=Tr(IαL^1(a))d,S¯1,d(a)=1N1Tr(IαL^1(a))d,S_{1,d}(a)=\operatorname{Tr}\left(I-\alpha\widehat{L}_{1}(a)\right)^{d},\qquad\overline{S}_{1,d}(a)=\frac{1}{N_{1}}\operatorname{Tr}\left(I-\alpha\widehat{L}_{1}(a)\right)^{d},

where N1=dimC1ambN_{1}=\dim C_{1}^{\mathrm{amb}}. The quantity S¯1,d(a)\overline{S}_{1,d}(a) can be regarded as a normalized β1\beta_{1}-type surrogate in the hard-limit sense, and as a normalized low-frequency H1H_{1}-type spectral mass in the soft setting. Given a target normalized spectral-mass value τ¯1\bar{\tau}_{1}, define the graph topological loss by

graphtopo(a)=12(S¯1,d(a)τ¯1)2.\mathcal{L}_{\mathrm{graph}}^{\mathrm{topo}}(a)=\frac{1}{2}\left(\overline{S}_{1,d}(a)-\bar{\tau}_{1}\right)^{2}.

In practical graph-generation tasks, one may combine this topological loss with ordinary graph objectives such as edge density, degree distribution, clustering coefficient, or task-dependent losses. If such a base loss is denoted by base(a)\mathcal{L}_{\mathrm{base}}(a), the total loss is

total(a)=base(a)+λtopographtopo(a),\mathcal{L}_{\mathrm{total}}(a)=\mathcal{L}_{\mathrm{base}}(a)+\lambda_{\mathrm{topo}}\mathcal{L}_{\mathrm{graph}}^{\mathrm{topo}}(a),

where λtopo>0\lambda_{\mathrm{topo}}>0 controls the strength of topological regularization. This formulation allows one to optimize ordinary graph-structural objectives while controlling cyclic structures corresponding to β1\beta_{1} of the clique complex. The ambient normalization above is the abstract formulation used in this section. In the numerical graph experiments below, we use an ordinary-complex or weighted-current-complex implementation and normalize by an effective edge count; the relation between these choices is discussed in Section 6.

4.6 Topology Control for Vietoris–Rips Filtrations

Let

X={x1,,xn}dX=\{x_{1},\ldots,x_{n}\}\subset\mathbb{R}^{d}

be a point cloud, and let

r1<r2<<rMr_{1}<r_{2}<\cdots<r_{M}

be a sequence of scales. For each scale rmr_{m}, define the edge logits by

aij(rm)(X)=rmdij(X)ε,dij(X)=xixj2+δ.a_{ij}^{(r_{m})}(X)=\frac{r_{m}-d_{ij}(X)}{\varepsilon},\qquad d_{ij}(X)=\sqrt{\|x_{i}-x_{j}\|^{2}+\delta}.

These logits define a soft graph, a soft clique complex, and a Hodge-Laplacian-type spectral relaxation

L^q(rm)(X)\widehat{L}_{q}^{(r_{m})}(X)

at each scale. Since persistent homology describes the birth and death of homology classes across scales, the corresponding Laplacian-based losses are defined as sums over scales.

4.6.1 Trace-Type Vietoris–Rips Loss

Barcode-based losses often reduce undesired homology by penalizing persistence, for example

PD(X)=γU(dγbγ)2.\mathcal{L}_{\mathrm{PD}}(X)=\sum_{\gamma\in U}(d_{\gamma}-b_{\gamma})^{2}.

In the Laplacian-based formulation, we instead use the low-frequency spectral mass at each scale,

Sq(rm)(X)=Trf(L^q(rm)(X)).S_{q}^{(r_{m})}(X)=\operatorname{Tr}f\left(\widehat{L}_{q}^{(r_{m})}(X)\right).

To suppress qq-dimensional homology, one can minimize

VRsup(X)=m=1MωmTrf(L^q(rm)(X)),\mathcal{L}_{\mathrm{VR}}^{\mathrm{sup}}(X)=\sum_{m=1}^{M}\omega_{m}\operatorname{Tr}f\left(\widehat{L}_{q}^{(r_{m})}(X)\right),

where ωm0\omega_{m}\geq 0 are scale weights.

Conversely, to preserve or generate a prescribed topological structure, one can specify target low-frequency spectral masses τq(rm)\tau_{q}^{(r_{m})} and define

VRtrace(X)=12m=1Mωm(Trf(L^q(rm)(X))τq(rm))2.\mathcal{L}_{\mathrm{VR}}^{\mathrm{trace}}(X)=\frac{1}{2}\sum_{m=1}^{M}\omega_{m}\left(\operatorname{Tr}f\left(\widehat{L}_{q}^{(r_{m})}(X)\right)-\tau_{q}^{(r_{m})}\right)^{2}.

This loss can be interpreted as a smooth Laplacian-based approximation of the low-frequency HqH_{q}-type spectral mass present at each scale. In the hard limit and with sufficiently sharp filters, this quantity is related to the Betti profile across scales, but it is not itself a persistence-diagram loss.

4.6.2 Spectral-Filter Matching Vietoris–Rips Loss

When a target point cloud XtarX_{\mathrm{tar}} is given, we can match low-energy subspaces at each scale. Define

Lm(X)=L^q(rm)(X),Lmtar=L^q(rm)(Xtar).L_{m}(X)=\widehat{L}_{q}^{(r_{m})}(X),\qquad L_{m}^{\mathrm{tar}}=\widehat{L}_{q}^{(r_{m})}(X_{\mathrm{tar}}).

Then

VRsf(X)=12m=1Mωmf(Lm(X))f(Lmtar)F2.\mathcal{L}_{\mathrm{VR}}^{\mathrm{sf}}(X)=\frac{1}{2}\sum_{m=1}^{M}\omega_{m}\left\|f(L_{m}(X))-f(L_{m}^{\mathrm{tar}})\right\|_{F}^{2}.

For the heat and resolvent filters, this gives

VRheat(X)=12m=1Mωmexp(L^q(rm)(X)τ)exp(L^q(rm)(Xtar)τ)F2\mathcal{L}_{\mathrm{VR}}^{\mathrm{heat}}(X)=\frac{1}{2}\sum_{m=1}^{M}\omega_{m}\left\|\exp\left(-\frac{\widehat{L}_{q}^{(r_{m})}(X)}{\tau}\right)-\exp\left(-\frac{\widehat{L}_{q}^{(r_{m})}(X_{\mathrm{tar}})}{\tau}\right)\right\|_{F}^{2}

and

VRres(X)=12m=1Mωmα(L^q(rm)(X)+αI)1α(L^q(rm)(Xtar)+αI)1F2.\mathcal{L}_{\mathrm{VR}}^{\mathrm{res}}(X)=\frac{1}{2}\sum_{m=1}^{M}\omega_{m}\left\|\alpha\left(\widehat{L}_{q}^{(r_{m})}(X)+\alpha I\right)^{-1}-\alpha\left(\widehat{L}_{q}^{(r_{m})}(X_{\mathrm{tar}})+\alpha I\right)^{-1}\right\|_{F}^{2}.

Unlike barcode-based losses, which compare birth–death pairs, this loss compares low-energy subspaces at each scale. Therefore, it is designed to reduce gradient localization on critical simplices and to provide update directions that retain more geometric and spectral information. Since the Vietoris–Rips edge logits are functions of the point-cloud coordinates, these losses are differentiable with respect to the point coordinates.

4.7 Practical Choice of Objectives

The objectives above are intended for different uses. Spectral-filter matching losses are appropriate when a target point cloud, graph, or simplicial complex is available and one wants to align low-frequency Hodge spectral structure with that target. Trace-type spectral-mass losses are appropriate when the goal is to control the amount of qq-dimensional topological structure without specifying a target low-eigenvalue subspace. Normalized moment losses are useful when comparing complexes with different numbers of simplices or when controlling graph clique complexes.

Table 1: Practical interpretation of the proposed Hodge-spectral objectives.
Goal Suggested objective Interpretation
Match a target point cloud or target complex Spectral-filter matching loss Aligns low-frequency Hodge spectral structure with that of a target object.
Control the amount of qq-dimensional topology Trace-type spectral-mass loss Moves the low-frequency spectral mass toward a prescribed value.
Control a Betti-type profile across scales Scale-wise Vietoris–Rips spectral loss Controls low-frequency HqH_{q}-type structure over selected scales.
Control cyclic structure in graph clique complexes Normalized polynomial moment loss Controls a normalized β1\beta_{1}-type spectral quantity.
Scale to larger complexes Polynomial, Chebyshev, or trace-estimation-based filters Avoids full eigendecomposition and uses matrix-vector or trace-estimation routines.

This table also clarifies the numerical experiments below. The Vietoris–Rips experiments examine whether spectral-filter losses provide useful gradients for point-cloud topology control, while the graph experiments examine whether normalized polynomial moments can control first-Betti-type quantities of clique complexes.

5 Numerical Experiments on Vietoris–Rips Topology Control

In this section, we compare topology control based on persistent homology with the proposed Hodge spectral-filter losses on Vietoris–Rips filtrations. The purpose of these experiments is to examine whether the proposed Hodge-spectral losses can control H1H_{1}-type structures without explicitly optimizing barcode diagrams, and whether they alleviate some of the characteristic difficulties of persistence-diagram losses, such as gradient localization and sensitivity to persistence-pairing changes. These experiments are intended to study the behavior of the optimization signals rather than to replace persistent homology as a topological descriptor.

5.1 Experimental Setup

Let

X={x1,,xn}2X=\{x_{1},\ldots,x_{n}\}\subset\mathbb{R}^{2}

be a point cloud. For each scale rr, we consider the Vietoris–Rips complex

VRr(X).\mathrm{VR}_{r}(X).

In the persistent-homology baseline, the loss is defined directly from the H1H_{1} barcode. For example, to suppress or control one-dimensional homological features, we use losses based on the persistence values

dγbγd_{\gamma}-b_{\gamma}

of H1H_{1} bars.

For the proposed Hodge-based losses, we construct a Hodge-Laplacian-type spectral relaxation

L^1(r)(X)\widehat{L}_{1}^{(r)}(X)

for each scale rr, and then apply a low-pass spectral filter. We mainly use the heat filter

Fτheat,(r)(X)=exp(L^1(r)(X)τ)F_{\tau}^{\mathrm{heat},(r)}(X)=\exp\left(-\frac{\widehat{L}_{1}^{(r)}(X)}{\tau}\right)

and the resolvent filter

Fϵres,(r)(X)=ϵ(L^1(r)(X)+ϵI)1.F_{\epsilon}^{\mathrm{res},(r)}(X)=\epsilon\left(\widehat{L}_{1}^{(r)}(X)+\epsilon I\right)^{-1}.

For multi-scale objectives, we sum these quantities over a finite set of scales

I={r1,,rm}.I=\{r_{1},\ldots,r_{m}\}.

The Hodge spectral-filter losses are combined with mild geometric regularization terms to avoid degenerate point configurations such as excessive collapse or excessive spreading. These regularization terms are used only to keep the point-cloud geometry numerically well behaved; the topological signal itself is provided by the Hodge spectral filters.

5.2 Multi-Scale Topology Control from Random Point Clouds

We first test whether the Hodge spectral-filter losses can induce low-frequency H1H_{1}-type structures across multiple Vietoris–Rips scales. We initialize n=36n=36 points in the square [1,1]2[-1,1]^{2}, and use the scale interval

I={0.42,0.46,0.50,0.54,0.58,0.62}.I=\{0.42,0.46,0.50,0.54,0.58,0.62\}.

We compare a persistent-homology baseline, the Hodge heat objective, and the Hodge resolvent objective.

Figure 1 shows the initial point cloud and the optimized point clouds after 10 epochs. The Hodge objectives, especially the resolvent objective, produce point clouds with more pronounced loop-like structures than the PH baseline in this example.

Refer to caption
Figure 1: Multi-scale topology control from a random point cloud. The exact Betti-profile sum is used only as a diagnostic of the optimized point clouds.

We evaluate the exact Betti number over the scale interval by

rIβ1(VRr(X)).\sum_{r\in I}\beta_{1}(\mathrm{VR}_{r}(X)).

This quantity is used as an exact Betti-profile diagnostic of the optimized point cloud; it is not the differentiable objective optimized by the Hodge spectral-filter losses. The initial point cloud has value 11, the PH baseline reaches 22, the Hodge heat objective reaches 1010, and the Hodge resolvent objective reaches 2828. Thus, the Hodge spectral-filter losses increase low-frequency H1H_{1}-type spectral structure across the selected scale range without directly optimizing persistence diagrams. This result should be interpreted as a scale-wise Hodge-spectral effect rather than as direct optimization of a persistence diagram.

5.3 Gradient Localization

Next, we compare the localization of gradients for PH losses and Hodge spectral-filter losses. The PH loss is based on the squared sum of finite H1H_{1} persistence values, while the Hodge losses are based on heat and resolvent trace sums over the selected scales.

Figure 2 visualizes the gradient directions for an input point cloud. The PH squared-sum gradient is concentrated on a small number of points, whereas the Hodge heat and Hodge resolvent gradients are distributed more globally around the point cloud for this input.

Refer to caption
Figure 2: Gradient directions for PH and Hodge spectral-filter

losses.

To quantify this behavior, we compute the entropy of the gradient norm distribution and the mass carried by the top 10%10\% of points with largest gradient norms. The results are shown in Figure 3. The PH squared-sum loss has gradient entropy 1.38631.3863 and top-10%10\% mass 0.50000.5000. The Hodge heat trace-sum loss has entropy 2.46232.4623 and top-10%10\% mass 0.20490.2049. The Hodge resolvent trace-sum loss has entropy 2.47402.4740 and top-10%10\% mass 0.20180.2018.

Refer to caption
Figure 3: Gradient localization metrics. Higher entropy and lower top-10%10\% mass indicate a more spatially distributed gradient norm profile.

These results are consistent with the interpretation that PH losses tend to concentrate gradients around critical simplices, while Hodge spectral-filter losses provide more spatially distributed update directions. The comparison concerns the spatial distribution of gradient norms in these experiments, not a general dominance statement for all PH-based objectives.

5.4 Pairing Instability Stress Test

We next examine sensitivity to persistence-pairing changes. We construct a point cloud with two similar loop-like structures and vary a parameter α\alpha, which changes the relative sizes of the two loops. Representative point clouds for α=0.14\alpha=-0.14, α=0\alpha=0, and α=0.14\alpha=0.14 are shown in Figure 4.

Refer to caption
Figure 4: Point clouds used for the pairing-instability test.

For the PH baseline, we use a loss based on the largest H1H_{1} persistence value. Around α=0\alpha=0, the two dominant H1H_{1} bars can exchange their order, causing a sharp change in which bar is selected by the loss. For the Hodge method, we use an interval Hodge spectral-filter loss over the scale interval [0.98,1.12][0.98,1.12].

Figure 5 shows the normalized losses and their derivatives with respect to α\alpha. The PH loss has a sharp derivative jump near the pairing switch, whereas the interval Hodge spectral-filter loss changes more smoothly after normalization by the loss range.

Refer to caption
Figure 5: Loss and derivative under pairing stress. The comparison of derivative jumps is interpreted in a scale-normalized sense.

Quantitatively, the maximum raw derivative jump is 1.17731.1773 for the PH loss and 15.848915.8489 for the interval Hodge spectral-filter loss. Thus, the Hodge loss is not uniformly smoother in raw scale. However, the absolute magnitudes and dynamic ranges of the two losses are different. After normalization by the loss range, the maximum derivative jump is 11.219411.2194 for the PH loss and 1.61371.6137 for the Hodge loss. Therefore, in this experiment, the Hodge interval loss exhibits a substantially smaller derivative jump relative to its own scale. This supports the interpretation that the Hodge-spectral objective provides a smoother optimization signal in a scale-normalized sense.

5.5 Shape Synthesis by Analytic Gradients

We next test whether the analytic gradients of the losses can guide point-cloud shape synthesis. We parameterize a radial point cloud by

xi(ri)=(ricosθi,risinθi),x_{i}(r_{i})=(r_{i}\cos\theta_{i},r_{i}\sin\theta_{i}),

where the angles θi\theta_{i} are fixed and the radii rir_{i} are optimized. The target shape is a wavy ring. We compare PH gradient updates, Hodge heat filter gradient updates, and Hodge resolvent filter gradient updates.

Figure 6 shows the recovered radial profiles and the mean RMSE with respect to the target radial profile. The PH gradient update gives final RMSE

0.1504±0.0172,0.1504\pm 0.0172,

whereas the Hodge heat projector gives

0.0324±0.0092,0.0324\pm 0.0092,

and the Hodge resolvent projector gives

0.0224±0.0054.0.0224\pm 0.0054.

Thus, the Hodge spectral-filter gradients recover the target shape more accurately in this controlled radial-shape experiment.

Refer to caption
Figure 6: Recovered radial profiles and RMSE in the controlled radial-shape synthesis experiment

.

Figure 7 visualizes the initial circle, target wavy shape, and the results obtained by the three update rules. The Hodge heat and resolvent filter updates follow the target deformation more closely than the PH update.

Refer to caption
Figure 7: Shape synthesis by analytic gradients.

We also compute the cosine similarity between the first update direction and the desired target deformation direction. The PH gradient has mean cosine similarity 0.2156-0.2156, the Hodge heat filter has 0.21430.2143, and the Hodge resolvent filter has 0.83110.8311. This indicates that the resolvent filter provides an initial update direction strongly aligned with the desired geometric deformation in this experiment.

The first-gradient entropy is 1.26271.2627 for PH, 2.53672.5367 for Hodge heat, and 2.52742.5274 for Hodge resolvent. Again, the Hodge spectral-filter gradients are less localized and more globally distributed according to this entropy metric.

5.6 Runtime Comparison

Finally, we compare the cost of one loss-gradient evaluation. We vary the number of points as

n=12,16,20,24,28,32,36,40,48,56,64,70,80,90,100.n=12,16,20,24,28,32,36,40,48,56,64,70,80,90,100.

For each nn, we measure the runtime of one gradient step for PH, Hodge heat, and Hodge resolvent losses. The results are shown in Figure 8.

Refer to caption
Figure 8: Complex size and one-step gradient runtime. The Hodge spectral-filter losses are evaluated using dense matrix operations in this implementation.

At n=100n=100, the PH gradient evaluation takes

3.4515s,3.4515\ \mathrm{s},

the Hodge heat gradient evaluation takes

60.4083s,60.4083\ \mathrm{s},

and the Hodge resolvent gradient evaluation takes

43.5290s.43.5290\ \mathrm{s}.

Thus, at this scale, Hodge heat is about 17.517.5 times slower than PH, and Hodge resolvent is about 12.612.6 times slower.

This increased cost is expected because the Hodge spectral-filter losses use richer spectral information. In the current implementation, dense matrix operations are used, and the number of ambient edges and triangles grows rapidly with nn. Therefore, applying the Hodge spectral-filter losses to larger problems will require more scalable implementations, such as sparse linear algebra, Chebyshev polynomial approximation, and stochastic trace estimation. Possible quantum or quantum–classical trace-estimation approaches are a separate future direction and are not used in the present experiments.

6 Normalized First-Betti-Type Control on Graph Clique Complexes via Laplacian Moments

In this section, we apply the Laplacian-moment topological loss introduced in Section 4 to normalized first-Betti-type control in graph clique complexes. In particular, we focus on the first homology of the clique complex of a graph and control a quantity corresponding to the normalized Betti number

β1|K1|,\frac{\beta_{1}}{|K_{1}|},

where K1K_{1} denotes the set of 11-simplices, namely the set of edges, in the clique complex. In the soft optimization problem below, this hard normalized Betti number is used as an evaluation quantity, while the differentiable loss is applied to a calibrated soft spectral moment.

In the theoretical formulation developed in the previous sections, we constructed a Hodge-Laplacian-type spectral relaxation on a fixed ambient chain space and introduced a penalty term to remove inactive directions. However, in the numerical experiments in this section, we do not use the full ambient penalty-regularized operator. Instead, we use a weighted-current-complex implementation: the edge and triangle spaces are represented by the current soft graph weights, weighted boundary matrices are assembled from these weights, and the moment is computed without adding an inactive ambient block. For sampled hard graphs, this construction reduces to the ordinary Hodge Laplacian on the sampled clique complex. In this case, the chain space itself corresponds to the current weighted or sampled clique complex, and inactive simplex directions do not exist as a separate ambient block. Therefore, no penalty term is required to push inactive directions away from the low-energy region.

We first explain why the Laplacian moment based on the ordinary Hodge Laplacian can be theoretically justified as a surrogate for the normalized Betti number in the hard-complex setting. We then discuss the relation between the ordinary-complex computation, the ambient formulation, and polynomial trace estimation. Finally, we present the results of our numerical experiments.

6.1 Normalized Betti Surrogate Based on the Ordinary Hodge Laplacian for Hard Clique Complexes

Let G=(V,E)G=(V,E) be a graph, and let its clique complex be K=X(G)K=X(G). Let Cq(K)C_{q}(K) be the qq-th chain space, and let Bq:Cq(K)Cq1(K)B_{q}:C_{q}(K)\to C_{q-1}(K) be the boundary operator. The ordinary qq-th Hodge Laplacian is defined by

Lq(K)=BqBq+Bq+1Bq+1.L_{q}(K)=B_{q}^{\top}B_{q}+B_{q+1}B_{q+1}^{\top}.

By the Hodge decomposition,

kerLq(K)Hq(K).\ker L_{q}(K)\simeq H_{q}(K).

Therefore, dimkerLq(K)=dimHq(K)=βq(K)\dim\ker L_{q}(K)=\dim H_{q}(K)=\beta_{q}(K). In particular, for q=1q=1, dimkerL1(K)=β1(K)\dim\ker L_{1}(K)=\beta_{1}(K).

The important point is that Lq(K)L_{q}(K) is defined not on an ambient space but on the actual chain space Cq(K)C_{q}(K) of the clique complex KK. Hence, the inactive simplex directions that appear in the ambient formulation do not exist in this space. Therefore, when using the ordinary Hodge Laplacian, zero modes directly correspond to homology classes, and the penalty term

μ(IWq)\mu(I-W_{q})

for removing inactive directions is unnecessary.

This subsection serves as the hard-complex reference. In the soft optimization experiments below, the differentiable objective is evaluated using weighted boundary matrices. The final topological evaluation is then performed by sampling hard graphs from the optimized edge probabilities and computing the ordinary Betti number of the sampled clique complexes.

6.2 Laplacian Moments and Normalized Betti Numbers

To treat normalized Betti numbers as differentiable objectives in the hard ordinary-complex setting, we use the polynomial moment filter

fd(λ)=(1αλ)d,f_{d}(\lambda)=(1-\alpha\lambda)^{d},

where α>0\alpha>0 and dd\in\mathbb{N}.

For the ordinary Hodge Laplacian Lq(K)L_{q}(K), define the Laplacian moment by

Sq,d(K)=Tr(IαLq(K))d.S_{q,d}(K)=\operatorname{Tr}\left(I-\alpha L_{q}(K)\right)^{d}.

We then normalize this quantity by the number of qq-simplices,

Nq=dimCq(K)=|Kq|,N_{q}=\dim C_{q}(K)=|K_{q}|,

and define

S¯q,d(K)=Tr(IαLq(K))dNq.\overline{S}_{q,d}(K)=\frac{\operatorname{Tr}\left(I-\alpha L_{q}(K)\right)^{d}}{N_{q}}.

Let the eigenvalues of Lq(K)L_{q}(K) be

0=λ1==λβq<λβq+1λNq.0=\lambda_{1}=\cdots=\lambda_{\beta_{q}}<\lambda_{\beta_{q}+1}\leq\cdots\leq\lambda_{N_{q}}.

Then

Sq,d(K)=i=1Nq(1αλi)d.S_{q,d}(K)=\sum_{i=1}^{N_{q}}(1-\alpha\lambda_{i})^{d}.

The contribution from each zero eigenvalue is equal to one. Therefore,

Sq,d(K)=βq(K)+i=βq+1Nq(1αλi)d.S_{q,d}(K)=\beta_{q}(K)+\sum_{i=\beta_{q}+1}^{N_{q}}(1-\alpha\lambda_{i})^{d}.

If α\alpha is chosen so that

ρ=maxλi>0|1αλi|<1,\rho=\max_{\lambda_{i}>0}|1-\alpha\lambda_{i}|<1,

then

|Sq,d(K)βq(K)|(Nqβq)ρd.\left|S_{q,d}(K)-\beta_{q}(K)\right|\leq(N_{q}-\beta_{q})\rho^{d}.

Therefore, for the normalized quantity,

|S¯q,d(K)βq(K)Nq|NqβqNqρdρd.\left|\overline{S}_{q,d}(K)-\frac{\beta_{q}(K)}{N_{q}}\right|\leq\frac{N_{q}-\beta_{q}}{N_{q}}\rho^{d}\leq\rho^{d}.

Thus, when ρd\rho^{d} is sufficiently small,

S¯q,d(K)βq(K)Nq.\overline{S}_{q,d}(K)\approx\frac{\beta_{q}(K)}{N_{q}}.

In particular, for q=1q=1,

S¯1,d(K)=Tr(IαL1(K))d|K1|β1(K)|K1|.\overline{S}_{1,d}(K)=\frac{\operatorname{Tr}\left(I-\alpha L_{1}(K)\right)^{d}}{|K_{1}|}\approx\frac{\beta_{1}(K)}{|K_{1}|}.

In this sense, the Laplacian moment based on the ordinary Hodge Laplacian is a theoretically justified surrogate for the normalized Betti number when the spectrum is scaled so that the positive eigenvalues are suppressed by the polynomial filter.

6.3 Relation to the Ambient Formulation

In the theoretical formulation of the previous sections, the Hodge-Laplacian-type spectral relaxation was defined on an ambient chain space containing all candidate simplices. In that setting, inactive simplex directions remain as ambient basis directions, and a penalty term is needed to exclude them from the low-energy region.

In the hard limit, the penalty-regularized ambient Hodge Laplacian decomposes as

L^qLq(K)μIinactive.\widehat{L}_{q}\to L_{q}(K)\oplus\mu I_{\mathrm{inactive}}.

Here, Lq(K)L_{q}(K) is the ordinary Hodge Laplacian on the active complex, and μIinactive\mu I_{\mathrm{inactive}} is the block acting on inactive directions.

For the Laplacian moment, we obtain

Tr(IαL^q)dTr(IαLq(K))d+Nqinactive(1αμ)d.\operatorname{Tr}\left(I-\alpha\widehat{L}_{q}\right)^{d}\to\operatorname{Tr}\left(I-\alpha L_{q}(K)\right)^{d}+N_{q}^{\mathrm{inactive}}(1-\alpha\mu)^{d}.

Therefore, if α\alpha and μ\mu are chosen so that

|1αμ|d1,|1-\alpha\mu|^{d}\ll 1,

then the forward contribution from the inactive block is negligible.

Moreover, the derivative of the moment filter is

fd(λ)=αd(1αλ)d1.f_{d}^{\prime}(\lambda)=-\alpha d(1-\alpha\lambda)^{d-1}.

Thus, the backward contribution from inactive directions is controlled by

αd|1αμ|d1.\alpha d|1-\alpha\mu|^{d-1}.

Therefore, if

|1αμ|d11,|1-\alpha\mu|^{d-1}\ll 1,

then inactive directions also have negligible influence on the gradient.

In particular, when

αμ1,\alpha\mu\simeq 1,

the inactive block contributes almost nothing to either the forward or backward computation of the Laplacian moment. In this sense, the numerical computation in this section using the ordinary Hodge Laplacian can be viewed as the limit of the ambient formulation in which inactive directions are completely removed, or as the regime in which their contribution is negligible.

Lemma 4 (Ambient-to-ordinary reduction for polynomial moments).

Suppose that, in the hard limit, the ambient operator decomposes as

L^q=Lq(K)μIinactive.\widehat{L}_{q}=L_{q}(K)\oplus\mu I_{\mathrm{inactive}}.

Let fd(λ)=(1αλ)df_{d}(\lambda)=(1-\alpha\lambda)^{d}. Then

Trfd(L^q)=Trfd(Lq(K))+Nqinactive(1αμ)d.\operatorname{Tr}f_{d}(\widehat{L}_{q})=\operatorname{Tr}f_{d}(L_{q}(K))+N_{q}^{\mathrm{inactive}}(1-\alpha\mu)^{d}.

Moreover,

fd(μ)=αd(1αμ)d1.f_{d}^{\prime}(\mu)=-\alpha d(1-\alpha\mu)^{d-1}.

Thus, if |1αμ|d|1-\alpha\mu|^{d} and |1αμ|d1|1-\alpha\mu|^{d-1} are small, both the forward moment contribution and the local filter derivative associated with the inactive block are negligible. In this regime, the polynomial moment of the ambient operator reduces to the ordinary-complex moment up to a negligible inactive-block contribution.

6.4 Relation to Polynomial Trace Estimation

The Laplacian moment

Tr(IαLq)d\operatorname{Tr}(I-\alpha L_{q})^{d}

has the same general form as polynomial trace quantities used in normalized Betti-number estimation. Since zero modes contribute one and suitably filtered nonzero modes are suppressed, such traces can approximate normalized kernel dimensions when the spectrum is properly scaled. This connection is useful computationally because polynomial filters can be evaluated by matrix-vector products and can be combined with stochastic trace estimation.

This viewpoint is closely related to recent trace-estimation approaches for estimating normalized Betti numbers of clique complexes [lloyd2016quantum, akhalwaya2024comparing, gyurik2022towards]. In such approaches, Betti numbers are represented through the null space of a combinatorial Laplacian or a related reflected operator, and polynomial traces are used to approximate the normalized dimension of this null space. The polynomial moment used in this paper has the same mathematical form, but here it is used as a differentiable objective inside a graph optimization loop.

The same polynomial-trace structure is also compatible with quantum trace-estimation viewpoints. However, this paper does not implement a quantum algorithm and does not claim quantum advantage. We therefore treat the quantum connection only as a structural observation and a possible future direction. The main contribution of this section is the use of Laplacian moments as differentiable first-Betti-type objectives for graph clique complexes.

6.5 Numerical Results

In the numerical experiments, each candidate edge e=(i,j)e=(i,j) is assigned an edge logit aea_{e}. The edge probability is defined by

pe=σ(ae)=11+exp(ae).p_{e}=\sigma(a_{e})=\frac{1}{1+\exp(-a_{e})}.

For a triangle σ=(i,j,k)\sigma=(i,j,k), the weight is defined by

wσ=eσpe.w_{\sigma}=\prod_{e\subset\sigma}p_{e}.

This gives a weighted clique complex from a soft graph. We then compute the weighted-current-complex version of the first Hodge Laplacian.

Let

R1=diag(pe)eEmax,R2=diag(wσ)σKmax(2).R_{1}=\operatorname{diag}(\sqrt{p_{e}})_{e\in E_{\max}},\qquad R_{2}=\operatorname{diag}(\sqrt{w_{\sigma}})_{\sigma\in K_{\max}^{(2)}}.

Since vertex weights are fixed to one, we use

B~1=B1R1,B~2=R1B2R2.\widetilde{B}_{1}=B_{1}R_{1},\qquad\widetilde{B}_{2}=R_{1}B_{2}R_{2}.

The soft weighted first Hodge-Laplian-type operator used in the experiments is

L1soft=B~1B~1+B~2B~2.L_{1}^{\mathrm{soft}}=\widetilde{B}_{1}^{\top}\widetilde{B}_{1}+\widetilde{B}_{2}\widetilde{B}_{2}^{\top}.

Unlike the ambient operator in Section 3, this implementation does not include the inactive-direction penalty term.

We define M=IαL1softM=I-\alpha L_{1}^{\mathrm{soft}} and compute the Laplacian moment

S1,d=Tr(Md).S_{1,d}=\operatorname{Tr}(M^{d}).

For normalization, we use the effective edge count N1eff=epeN_{1}^{\mathrm{eff}}=\sum_{e}p_{e}, and define

S¯1,d=S1,dN1eff+δ.\overline{S}_{1,d}=\frac{S_{1,d}}{N_{1}^{\mathrm{eff}}+\delta}.

Here N1effN_{1}^{\mathrm{eff}} is the soft analogue of the number of active edges |K1||K_{1}|, and δ>0\delta>0 is a small numerical stabilizer. Given a fixed soft moment target s¯1\bar{s}_{1}^{\star}, the soft spectral-moment Betti loss is

Betti=12(S¯1,ds¯1)2.\mathcal{L}_{\mathrm{Betti}}=\frac{1}{2}\left(\overline{S}_{1,d}-\bar{s}_{1}^{\star}\right)^{2}.

Here, s¯1\bar{s}_{1}^{\star} is the target used in the differentiable soft optimization. It should be distinguished from the hard normalized Betti target β¯1\bar{\beta}_{1}^{\star}, which is used to evaluate sampled hard graphs after optimization.

Soft and hard targets.

The soft moment target s¯1\bar{s}_{1}^{\star} and the hard normalized Betti target β¯1\bar{\beta}_{1}^{\star} are conceptually different quantities. The loss above optimizes the differentiable soft moment S¯1,d\overline{S}_{1,d} toward s¯1\bar{s}_{1}^{\star}. After optimization, we sample hard graphs from the final edge probabilities and compute the ordinary normalized Betti number β1/|K1|\beta_{1}/|K_{1}| of their clique complexes. This sampled hard value is then compared with the hard target β¯1\bar{\beta}_{1}^{\star}.

In the present proof-of-concept experiments, the soft target values are specified before optimization and kept fixed during each run. We do not claim an analytic or universal conversion rule from β¯1\bar{\beta}_{1}^{\star} to s¯1\bar{s}_{1}^{\star}. Instead, the experiments test whether optimizing the prescribed soft spectral moment induces sampled hard graphs whose normalized first Betti numbers move toward the desired hard target. The soft targets used in each experiment are reported explicitly in the tables.

In the experiments, we used α=0.15\alpha=0.15 as the basic setting, and mainly used d=8d=8.

The edge logits were optimized using an Adam-type update, and both forward and backward computations were performed using the ordinary Hodge Laplacian.

6.5.1 Convergence of the Path-Integral Estimator

We first compare the path-integral estimator with the exact value of the trace-gradient

Tr(Md1Cu)\operatorname{Tr}(M^{d-1}C_{u})

for a fixed soft graph and a fixed direction uu. In this experiment, the number of nodes is set to n=8n=8, and the moment order is set to d=6d=6. The number of Monte Carlo samples is varied over

NMC{100,300,1000,3000,10000},N_{\mathrm{MC}}\in\{100,300,1000,3000,10000\},

and the estimator is evaluated multiple times for each sample size.

Figure 9 compares the mean and standard deviation of the path-integral estimator with the exact trace-gradient. The dashed line represents the exact value, the blue curve represents the mean of the path-integral estimator, and the error bars represent the standard deviation. When the number of samples is small, the variance is large and the estimator deviates significantly from the exact value. As NMCN_{\mathrm{MC}} increases, the estimator approaches the exact value, and in particular, at NMC=10000N_{\mathrm{MC}}=10000, it reaches the neighborhood of the exact value.

Refer to caption
Refer to caption
Figure 9: (left) Path-integral estimate compared with the exact trace-gradient. (right) Mean absolute error of the path-integral estimator.

Figure 9 shows the mean absolute error with respect to the exact value. The mean absolute errors are 1.20141.2014 for NMC=100N_{\mathrm{MC}}=100, 0.47820.4782 for NMC=300N_{\mathrm{MC}}=300, 0.63990.6399 for NMC=1000N_{\mathrm{MC}}=1000, 0.26260.2626 for NMC=3000N_{\mathrm{MC}}=3000, and 0.15180.1518 for NMC=10000N_{\mathrm{MC}}=10000.

Table 2: Mean absolute error of the path-integral trace-gradient estimator.
NMCN_{\mathrm{MC}} Mean absolute error
100 1.2014
300 0.4782
1000 0.6399
3000 0.2626
10000 0.1518

At NMC=1000N_{\mathrm{MC}}=1000, the error temporarily increases. This is due to the variance caused by a finite number of Monte Carlo trials. For larger sample sizes, the error decreases again, confirming the tendency of the path-integral estimator to approach the exact trace-gradient. These results show that the proposed path-integral backward estimator functions as a Monte Carlo estimator of the trace-gradient.

6.5.2 Convergence of Betti Loss Optimization

Next, we verify whether the proposed Betti loss can actually be minimized. In this experiment, the number of nodes is set to n=10n=10, the hard normalized Betti target is set to β¯1=0.10\bar{\beta}_{1}^{\star}=0.10, and the soft moment target used in the differentiable loss is fixed to s¯1=3.0717\bar{s}_{1}^{\star}=3.0717. The optimization is performed using the exact coordinate gradient and is terminated when the improvement of the loss reaches a plateau.

Figure 10 shows the evolution of the Betti loss over optimization steps. The loss rapidly decreases in the early stage and then converges near zero with small oscillations. This result shows that the proposed soft projector Betti loss provides an effective learning signal for the edge logits.

Refer to caption
Refer to caption
Figure 10: (left)Convergence of the Betti loss during optimization. (right) Convergence of the soft Betti moment toward the target.

Figure 10 shows the corresponding trajectory of the soft Betti moment. The dotted line represents the fixed soft moment target s¯1\bar{s}_{1}^{\star}. The soft moment initially decreases below the target, then oscillates and returns toward the target, eventually stabilizing in its neighborhood. This behavior demonstrates that the Betti loss directly controls the soft moment toward the prescribed soft target.

In this experiment, the optimization reaches a plateau after 45 steps. The final loss is 1.09×1051.09\times 10^{-5}, the final soft spectral moment is 3.07633.0763, and the fixed soft target is s¯1=3.0717\bar{s}_{1}^{\star}=3.0717. Thus, the absolute error with respect to the soft target is reduced to 0.00470.0047.

Table 3: Final result of Betti loss optimization for hard normalized Betti target 0.100.10. The loss is optimized against the fixed soft moment target s¯1=3.0717\bar{s}_{1}^{\star}=3.0717. The hard target is used for evaluation after sampling hard graphs from the optimized edge probabilities.
Quantity Value
Number of steps 45
Final loss 1.09×1051.09\times 10^{-5}
Final soft spectral moment 3.0763
Fixed soft target s¯1\bar{s}_{1}^{\star} 3.0717
Soft target error 0.0047
Sampled hard normalized β1\beta_{1} mean 0.1014
Sampled hard normalized β1\beta_{1} std 0.0904
Hard target β¯1\bar{\beta}_{1}^{\star} 0.1000
Hard target error 0.0014
Sampled edge density mean 0.2564

These results confirm that minimizing the soft spectral-moment Betti loss drives the soft moment very close to the fixed soft target. Moreover, when hard graphs are sampled from the final soft graph, the mean ordinary hard normalized β1\beta_{1} of the sampled clique complexes is 0.10140.1014, yielding an error of only 0.00140.0014 with respect to the hard target 0.100.10. Therefore, in this example, the minimization of the soft Betti loss is reflected in the topology of the sampled hard graphs.

On the other hand, the hard β1\beta_{1} of a single thresholded graph is 0.00.0. This is because converting a soft graph into a hard graph by a single threshold is a discrete operation and is sensitive to the threshold value. Therefore, in these experiments, we evaluate the final hard topology by sampling multiple hard graphs from the edge probabilities and computing the mean normalized Betti number. This evaluation better reflects the probabilistic output of the soft generative model.

6.5.3 Topology-Controlled Graph Generation

We next examine whether the generated graph topology follows different hard target normalized Betti values. In this experiment, the number of nodes is set to n=15n=15, and the hard target values are chosen as

β¯1{0.02,0.05,0.10,0.20}.\bar{\beta}_{1}^{\star}\in\{0.02,0.05,0.10,0.20\}.

For each hard target, we specify a fixed soft moment target s¯1\bar{s}_{1}^{\star} used in the differentiable loss. These soft targets are reported in Table 4. For all targets, optimization is initialized from the same soft graph.

Figure 11 shows the relationship between the target normalized Betti value and the sampled hard normalized β1\beta_{1} after optimization. The black dashed line indicates y=xy=x, the orange dotted line indicates the mean β1\beta_{1} of the shared initial graph, and the green points with error bars indicate the mean and standard deviation of the sampled hard β1\beta_{1} after optimization.

Refer to caption
Figure 11: Initial and final sampled hard normalized Betti values for different targets. The dashed black line indicates y=xy=x, the orange dotted line indicates the mean Betti value of the shared initial graph, and the green curve shows the optimized results with standard deviation error bars. The final hard Betti values follow the target values after optimization.

The mean sampled hard normalized β1\beta_{1} of the initial graph is 0.16040.1604, with standard deviation 0.07690.0769. Therefore, for targets 0.020.02 and 0.050.05, the optimization must decrease β1\beta_{1}, while for target 0.200.20, it must increase β1\beta_{1}. As shown in Figure 11, the optimized sampled hard β1\beta_{1} increases monotonically with the target and moves from the initial graph value toward each target.

The final results are shown in Table 4.

Table 4: Topology-controlled graph generation. “Soft mom.” denotes the optimized soft spectral moment S¯1,d\overline{S}_{1,d}, and “Soft target” denotes the fixed soft moment target s¯1\bar{s}_{1}^{\star} used in the differentiable loss. Hard β1\beta_{1} values are computed from sampled hard clique complexes and compared with the hard target β¯1\bar{\beta}_{1}^{\star}.
Target β1\beta_{1} Steps Soft mom. Soft target Soft err. Hard β1\beta_{1} mean Hard β1\beta_{1} std Hard err. Density
0.02 176 0.2562 0.2492 0.0070 0.0068 0.0135 0.0132 0.5671
0.05 43 0.8180 0.8878 0.0699 0.0447 0.0457 0.0053 0.4713
0.10 50 1.3477 1.3480 0.0002 0.1173 0.0744 0.0173 0.3978
0.20 59 2.4705 2.4937 0.0231 0.1999 0.0831 0.0001 0.2997

Figure 12 compares the soft target error and the hard target error for each target. The soft error is relatively large for target 0.050.05, but the hard error remains small overall. In particular, for target 0.200.20, the hard error is almost zero. This indicates that the proposed method can generate graphs close to the desired target not only in terms of the prescribed soft moment but also in terms of the hard topology after sampling.

Refer to caption
Figure 12: Final soft and hard target errors for topology-controlled graph generation. The soft target error is computed from the optimized soft moment, while the hard target error is computed from sampled hard graphs. The hard topology follows the target values with small errors.

These results confirm that the final sampled hard normalized β1\beta_{1} generally increases as the target increases. In particular, for targets 0.050.05 and 0.200.20, the hard target errors are 0.00530.0053 and 0.00010.0001, respectively, indicating that graphs very close to the target topology are generated. For target 0.100.10, the mean hard β1\beta_{1} is 0.11730.1173, with an error of 0.01730.0173, but it still moves from the initial value 0.16040.1604 toward the target. For target 0.020.02, the final mean is 0.00680.0068, which slightly underestimates the target, but the optimization successfully reduces β1\beta_{1} significantly from the initial value.

The sampled edge density decreases as the target β1\beta_{1} increases, taking values 0.56710.5671, 0.47130.4713, 0.39780.3978, and 0.29970.2997. This is because, when the edge density is too high, loops are more likely to be filled by triangles, reducing the first Betti number. Therefore, the proposed method does not merely increase or decrease edges. Rather, it controls β1\beta_{1} through the balance between loops and filling triangles in the clique complex.

These results show that topology-controlled graph generation with different hard target normalized Betti values can be achieved using the soft spectral-moment Betti loss together with fixed soft moment targets.

6.5.4 Combination with Graph Feature Loss

Finally, we examine whether the soft spectral-moment Betti loss can be combined with another graph feature loss. As an independent graph feature, we use the soft degree variance. The objective function is defined as the sum of the soft spectral-moment Betti loss and the degree variance loss. The target hard normalized β1\beta_{1} is fixed to 0.100.10, while the degree variance target is varied over 0.410.41, 0.450.45, and 0.490.49. For each condition, we run experiments with five random seeds.

As a baseline, we use a variance-only optimization method, which optimizes only the degree variance. This baseline approaches the degree variance target but does not use the Betti loss. Therefore, it serves as a control experiment to check whether the Betti topology is preserved.

Figure 13 shows the sampled hard normalized β1\beta_{1} for the variance-only baseline and the joint optimization. The dashed horizontal line indicates the Betti target 0.100.10. In the variance-only baseline, the hard β1\beta_{1} remains around 0.160.16 for all variance targets, deviating significantly from the Betti target. In contrast, the joint optimization keeps the hard β1\beta_{1} around 0.110.11, close to the target 0.100.10.

Refer to caption
Refer to caption
Figure 13: (left) Hard Betti preservation under joint optimization. (right) Achieved degree variance under joint optimization.

Figure 13 shows the achieved soft degree variance. The dashed line indicates the degree variance target. Both the variance-only baseline and the joint optimization follow the degree variance target well. In particular, the joint optimization does not significantly degrade the control of degree variance.

The numerical results are shown in Table 5.

Table 5: Variance-only and joint optimization.
Var. target Method Var. mean Var. std Hard β1\beta_{1} mean Hard β1\beta_{1} std Hard β1\beta_{1} err.
0.41 Variance-only 0.4036 0.0079 0.1692 0.0121 0.0692
0.41 Joint 0.4112 0.0030 0.1100 0.0076 0.0108
0.45 Variance-only 0.4556 0.0158 0.1627 0.0106 0.0627
0.45 Joint 0.4504 0.0034 0.1125 0.0067 0.0125
0.49 Variance-only 0.4987 0.0172 0.1596 0.0089 0.0596
0.49 Joint 0.4892 0.0043 0.1088 0.0090 0.0102

The variance-only baseline achieves values close to the degree variance target, but the sampled hard β1\beta_{1} remains around 0.160.16, far from the target 0.100.10. The hard β1\beta_{1} error ranges from approximately 0.05960.0596 to 0.06920.0692.

In contrast, the joint optimization achieves values close to the degree variance target while keeping the sampled hard β1\beta_{1} in the range from 0.10880.1088 to 0.11250.1125. The hard β1\beta_{1} error is approximately between 0.01020.0102 and 0.01250.0125, which is much smaller than that of the variance-only baseline.

These results show that the Betti loss is not merely a special regularization term that conflicts with other graph feature objectives. Rather, it functions as a topology-aware objective that can be combined with ordinary graph statistics. In other words, the proposed method can control a standard graph feature such as degree variance while simultaneously keeping the first-order topology of the clique complex near the target value.

7 Summary and Discussion

In this work, we proposed a framework for differentiably controlling homological constraints on finite simplicial complexes using the low-frequency spectrum of Hodge-Laplacian-type spectral relaxations. The central idea is not to treat Betti numbers or persistent homology directly as discrete quantities, but instead to extract zero and near-zero modes, which correspond to homological structures in the hard ordinary-complex limit, using smooth spectral filters. In the soft regime, these quantities should be interpreted as low-frequency Hodge-spectral surrogates rather than as exact homological invariants.

The main contributions of this work can be summarized as follows. First, we introduced an ambient Hodge-spectral relaxation on a fixed chain space and showed that, in the hard limit, the penalty-regularized operator recovers the ordinary Hodge Laplacian on the active subcomplex and hence the corresponding Betti number. Second, using low-pass spectral filters such as heat filters, resolvent filters, and polynomial moment filters, we defined spectral-filter matching losses and trace-type spectral-mass losses. Third, we applied these objectives to Vietoris–Rips complexes of point clouds and compared their optimization signals with persistence-diagram-based losses. Fourth, we used polynomial Laplacian moments to control normalized first-Betti-type quantities of graph clique complexes and showed that this topological objective can be combined with ordinary graph-feature objectives.

It is important to distinguish this contribution from the use of persistent homology as a descriptor. Persistent homology remains the appropriate object when the goal is to compute stable barcode-level summaries of multiscale topology. The proposed Hodge-spectral objectives are complementary: they are designed for optimization settings in which topology must provide a differentiable signal. In such settings, exact barcode information is not always the most convenient optimization object, and low-frequency spectral quantities can provide more distributed and geometry-aware gradients.

In the experiments on Vietoris–Rips filtrations, we observed that Hodge spectral-filter losses have properties different from persistent-homology-based losses. Hodge spectral-filter losses do not act only on birth–death pairs, but instead provide gradients through low-energy subspaces at each scale. As a result, compared with PH losses, the gradients were less localized around a small number of critical simplices and were more broadly distributed over the point cloud in the gradient-localization experiments. In the pairing-instability stress test, PH losses showed sharp changes when the maximal persistence bar switched, whereas the Hodge interval loss exhibited relatively smoother behavior after normalization by the loss range. In the shape synthesis experiment, Hodge spectral-filter gradients provided update directions more aligned with the target geometry and achieved lower reconstruction error than PH gradients in the controlled radial-shape setting.

In the graph clique-complex experiments, we used the Laplacian moment

Tr(IαL1)d\operatorname{Tr}\left(I-\alpha L_{1}\right)^{d}

to control a normalized first-Betti-type spectral quantity. When the ordinary Hodge Laplacian is used on a hard clique complex, the chain space consists only of active simplices. Therefore, unlike the ambient formulation, no inactive-direction penalty is required. In the soft numerical implementation, the moment is computed using a weighted-current-complex operator, and the normalized soft spectral moment is evaluated against a fixed soft target. The resulting sampled hard graphs are then evaluated using the ordinary normalized Betti number

β1|K1|.\frac{\beta_{1}}{|K_{1}|}.

The numerical experiments showed that, for both a single target and multiple targets, optimizing edge logits with the soft spectral-moment Betti loss moves the sampled hard normalized Betti number toward the prescribed hard target value. We also confirmed that the soft spectral-moment Betti loss can be combined with ordinary graph-feature losses such as degree variance, enabling simultaneous control of graph statistics and topological quantities.

These results suggest that the low-frequency spectrum of Hodge-Laplacian-type operators provides an effective continuous representation for topology-constrained optimization. In particular, unlike barcode representations, the proposed Hodge-based losses are sensitive not only to the existence of homology classes but also to the surrounding spectral and geometric structure. Therefore, they can induce more geometrically natural deformations and updates, rather than merely matching topological summary statistics in the settings studied here. This should not be interpreted as a general dominance statement over barcode-based objectives; rather, the two approaches emphasize different information and are useful for different purposes.

At the same time, the proposed method has clear computational limitations. Hodge spectral-filter losses use richer spectral information than PH losses, but this also increases the computational cost. As the size of a point cloud or graph grows, the number of simplices can increase rapidly, and constructing the Hodge-Laplacian-type operator and evaluating spectral filters become expensive. In our experiments, dense matrix computations were used, and the runtime per gradient step for Hodge losses was larger than that for PH losses. Applying the method to larger datasets will require scalable implementations based on sparse linear algebra, low-rank approximation, Chebyshev approximation, stochastic trace estimation, and related techniques. Thus, the present experiments should be viewed as proof-of-concept demonstrations of the optimization behavior rather than as optimized large-scale implementations.

Another important point is that trace-type losses should be interpreted as low-frequency spectral masses rather than exact Betti numbers in the soft regime. This is both a limitation and an advantage. By including near-zero modes, the loss can respond not only to exact homology classes but also to weakly closed cycles, unstable holes, and approximate topological structures. This yields smoother and more geometry-aware update directions. On the other hand, hyperparameters such as filter temperature, moment degree, scale set, spectral scaling, and regularization strength affect the resulting optimization behavior. For polynomial moment filters, in particular, the choice of α\alpha and dd must be coordinated with the spectrum of the Laplacian so that positive eigenvalues are actually suppressed. A more systematic theoretical and empirical understanding of these parameters remains an important topic for future work.

The polynomial trace form of the moment loss also suggests possible connections with scalable trace-estimation methods. Classical stochastic trace estimation and polynomial filtering can reduce the need for full eigendecomposition, and they are natural candidates for scaling Hodge-spectral objectives to larger complexes. Quantum trace-estimation methods for normalized Betti numbers have a related mathematical structure, because they also represent Betti information through traces of polynomial functions of Laplacian-related operators. However, this work does not implement a quantum algorithm and does not claim quantum advantage. A careful study of block encodings, state preparation costs, sampling error, and gradient estimation would be required before making algorithmic claims in that direction.

On the application side, the proposed framework can be extended to many problems where topological constraints are important. Hodge spectral-filter losses for point clouds and images may be useful for medical image segmentation, shape interpolation, and geometric generative modeling. Normalized spectral-moment Betti losses for graphs may be useful for communication networks, molecular graphs, material structures, and social networks, where higher-order loops and redundancy affect functionality. In particular, Betti numbers of clique complexes describe higher-order graph structures that cannot be fully captured by degree distributions or clustering coefficients, and therefore may serve as useful objectives for controlling network robustness and higher-order connectivity. Developing such applications will require task-specific choices of filtrations, spectral filters, target quantities, and geometric or structural regularization terms.

In summary, this work presented a framework for treating topology not merely as an external descriptor computed after the fact, but as an object that can be directly optimized as a loss function. By using the low-frequency spectrum of Hodge-Laplacian-type spectral relaxations, discrete homological information can be embedded into smooth optimization problems, yielding a common mathematical structure across Vietoris–Rips complexes of point clouds and clique complexes of graphs. Further development of scalable spectral approximations and trace-estimation methods may enable the control of larger and more complex topological structures.

Statements and Declarations

Funding

The authors declare that no funds, grants, or other support were received during the preparation of this manuscript.

Competing interests

The authors have no relevant financial or non-financial interests to disclose.

Author contributions

S.K. conceived the study, developed the theoretical framework, and carried out the numerical computations. Y.S. made significant contributions through extensive discussions with S.K. and provided important guidance for refining the direction of the study. All authors reviewed and approved the final manuscript.

Data availability

The datasets generated and analyzed during the current study are available from the corresponding author upon reasonable request.

Code availability

The code used for the numerical experiments is available from the corresponding author upon reasonable request.

Ethics approval

Not applicable.

Consent to participate

Not applicable.

Consent for publication

Not applicable.

Appendix A Details of Backward Computation

In this appendix, we derive the backward computation for the low-pass spectral losses defined in the main text. The basic flow is as follows. First, from a loss function

𝒥=𝒥(L^q),\mathcal{J}=\mathcal{J}(\widehat{L}_{q}),

we compute the gradient with respect to the Hodge-Laplacian-type spectral relaxation:

GL=𝒥L^q.G_{L}=\frac{\partial\mathcal{J}}{\partial\widehat{L}_{q}}.

Then, using the structure of L^q\widehat{L}_{q}, we propagate the gradient to the weighted boundary operators, simplex weights, edge probabilities, and edge logits. In the Vietoris–Rips setting, since the edge logits are defined as functions of point-cloud coordinates, the gradients are further propagated to the point coordinates.

Throughout this appendix, for a variable zz, we denote the gradient of 𝒥\mathcal{J} with respect to zz by

gz=𝒥z.g_{z}=\frac{\partial\mathcal{J}}{\partial z}.

For example,

gρσ=𝒥ρσ,gwσ=𝒥wσ,gpe=𝒥pe,gae=𝒥ae.g_{\rho_{\sigma}}=\frac{\partial\mathcal{J}}{\partial\rho_{\sigma}},\qquad g_{w_{\sigma}}=\frac{\partial\mathcal{J}}{\partial w_{\sigma}},\qquad g_{p_{e}}=\frac{\partial\mathcal{J}}{\partial p_{e}},\qquad g_{a_{e}}=\frac{\partial\mathcal{J}}{\partial a_{e}}.

Here, ρσ\rho_{\sigma} denotes the square root of the simplex weight wσw_{\sigma}:

ρσ=wσ.\rho_{\sigma}=\sqrt{w_{\sigma}}.

A.1 Notation and Overall Computational Path

For a fixed degree qq, we write the Hodge-Laplacian-type spectral relaxation as

L=L^q.L=\widehat{L}_{q}.

Let the loss function be

𝒥=𝒥(L).\mathcal{J}=\mathcal{J}(L).

The matrix gradient obtained from the spectral loss is denoted by

GL=𝒥L.G_{L}=\frac{\partial\mathcal{J}}{\partial L}.

Since LL is symmetric, in numerical implementations one may symmetrize this gradient as

GL12(GL+GL).G_{L}\leftarrow\frac{1}{2}\left(G_{L}+G_{L}^{\top}\right).

The Hodge-Laplacian-type spectral relaxation is defined by

L=B~qB~q+B~q+1B~q+1+μ(IWq).L=\widetilde{B}_{q}^{\top}\widetilde{B}_{q}+\widetilde{B}_{q+1}\widetilde{B}_{q+1}^{\top}+\mu(I-W_{q}).

For simplicity, we write

A=B~q,C=B~q+1.A=\widetilde{B}_{q},\qquad C=\widetilde{B}_{q+1}.

Then

L=AA+CC+μ(IWq).L=A^{\top}A+CC^{\top}+\mu(I-W_{q}).

Each weighted boundary operator is defined by

B~p=Rp1BpRp,\widetilde{B}_{p}=R_{p-1}B_{p}R_{p},

where

Rp=diag(ρσ)σKmax(p),ρσ=wσ.R_{p}=\operatorname{diag}(\rho_{\sigma})_{\sigma\in K_{\max}^{(p)}},\qquad\rho_{\sigma}=\sqrt{w_{\sigma}}.

In the soft clique complex, the simplex weights are given by

wσ=eσpe,w_{\sigma}=\prod_{e\subset\sigma}p_{e},

and each edge probability is defined by

pe=σ(ae)=11+exp(ae).p_{e}=\sigma(a_{e})=\frac{1}{1+\exp(-a_{e})}.

Therefore, the backward computational path is

𝒥LB~q,B~q+1,Wqρσwσpeae.\mathcal{J}\longrightarrow L\longrightarrow\widetilde{B}_{q},\widetilde{B}_{q+1},W_{q}\longrightarrow\rho_{\sigma}\longrightarrow w_{\sigma}\longrightarrow p_{e}\longrightarrow a_{e}.

In the Vietoris–Rips setting, we further have

aij(r)(X)=rdij(X)ε,dij(X)=xixj2+δ.a_{ij}^{(r)}(X)=\frac{r-d_{ij}(X)}{\varepsilon},\qquad d_{ij}(X)=\sqrt{\|x_{i}-x_{j}\|^{2}+\delta}.

Thus, gradients are further propagated through

aij(r)dijxi.a_{ij}^{(r)}\longrightarrow d_{ij}\longrightarrow x_{i}.

A.2 From Spectral Losses to GLG_{L}

We first compute

GL=𝒥LG_{L}=\frac{\partial\mathcal{J}}{\partial L}

for representative low-pass spectral losses used in the main text.

A.2.1 Heat Projector Matching Loss

Let the heat projector be

F(L)=exp(Lτ).F(L)=\exp\left(-\frac{L}{\tau}\right).

Let the target projector be

Ftar=exp(Ltarτ).F_{\mathrm{tar}}=\exp\left(-\frac{L_{\mathrm{tar}}}{\tau}\right).

The loss function is

𝒥heat=12F(L)FtarF2.\mathcal{J}_{\mathrm{heat}}=\frac{1}{2}\left\|F(L)-F_{\mathrm{tar}}\right\|_{F}^{2}.

Define

D=F(L)Ftar.D=F(L)-F_{\mathrm{tar}}.

Then

d𝒥heat=D,dFF.d\mathcal{J}_{\mathrm{heat}}=\langle D,dF\rangle_{F}.

Let the eigendecomposition of LL be

L=UΛU,Λ=diag(λ1,,λN).L=U\Lambda U^{\top},\qquad\Lambda=\operatorname{diag}(\lambda_{1},\ldots,\lambda_{N}).

Define

f(λ)=exp(λτ).f(\lambda)=\exp\left(-\frac{\lambda}{\tau}\right).

Then

F(L)=Uf(Λ)U.F(L)=Uf(\Lambda)U^{\top}.

By the Fréchet derivative of a matrix function,

U(dF)U=K(U(dL)U),U^{\top}(dF)U=K\odot\left(U^{\top}(dL)U\right),

where \odot denotes the Hadamard product, and the Loewner matrix KK is given by

Kij={f(λi)f(λj)λiλj,ij,f(λi),i=j.K_{ij}=\begin{cases}\dfrac{f(\lambda_{i})-f(\lambda_{j})}{\lambda_{i}-\lambda_{j}},&i\neq j,\\[8.53581pt] f^{\prime}(\lambda_{i}),&i=j.\end{cases}

For the heat filter,

f(λi)=1τexp(λiτ).f^{\prime}(\lambda_{i})=-\frac{1}{\tau}\exp\left(-\frac{\lambda_{i}}{\tau}\right).

Therefore,

d𝒥heat=UDU,K(U(dL)U)F.d\mathcal{J}_{\mathrm{heat}}=\left\langle U^{\top}DU,K\odot\left(U^{\top}(dL)U\right)\right\rangle_{F}.

Using the inner-product property of the Hadamard product,

d𝒥heat=K(UDU),U(dL)UF.d\mathcal{J}_{\mathrm{heat}}=\left\langle K\odot\left(U^{\top}DU\right),U^{\top}(dL)U\right\rangle_{F}.

Furthermore, using

M,U(dL)UF=UMU,dLF,\left\langle M,U^{\top}(dL)U\right\rangle_{F}=\left\langle UMU^{\top},dL\right\rangle_{F},

we obtain

GL=U[K(UDU)]U.G_{L}=U\left[K\odot\left(U^{\top}DU\right)\right]U^{\top}.

Thus, for the heat projector matching loss,

GL=𝒥heatL=U[K(U(F(L)Ftar)U)]U.G_{L}=\frac{\partial\mathcal{J}_{\mathrm{heat}}}{\partial L}=U\left[K\odot\left(U^{\top}\left(F(L)-F_{\mathrm{tar}}\right)U\right)\right]U^{\top}.

A.2.2 Resolvent Projector Matching Loss

Let the resolvent projector be

F(L)=α(L+αI)1.F(L)=\alpha(L+\alpha I)^{-1}.

Let the target projector be

Ftar=α(Ltar+αI)1.F_{\mathrm{tar}}=\alpha(L_{\mathrm{tar}}+\alpha I)^{-1}.

The loss function is

𝒥res=12F(L)FtarF2.\mathcal{J}_{\mathrm{res}}=\frac{1}{2}\left\|F(L)-F_{\mathrm{tar}}\right\|_{F}^{2}.

Define

D=F(L)Ftar.D=F(L)-F_{\mathrm{tar}}.

Also define

M=L+αI.M=L+\alpha I.

Then

F(L)=αM1.F(L)=\alpha M^{-1}.

Using the differential formula for the inverse matrix,

dM1=M1(dM)M1.dM^{-1}=-M^{-1}(dM)M^{-1}.

Since

dM=dL,dM=dL,

we have

dF=αM1(dL)M1.dF=-\alpha M^{-1}(dL)M^{-1}.

The differential of the loss is

d𝒥res=D,dFF.d\mathcal{J}_{\mathrm{res}}=\langle D,dF\rangle_{F}.

Therefore,

d𝒥res=αTr[DM1(dL)M1].d\mathcal{J}_{\mathrm{res}}=-\alpha\operatorname{Tr}\left[D^{\top}M^{-1}(dL)M^{-1}\right].

By cyclicity of the trace,

d𝒥res=αTr[M1DM1dL].d\mathcal{J}_{\mathrm{res}}=-\alpha\operatorname{Tr}\left[M^{-1}D^{\top}M^{-1}dL\right].

When LL, MM, and DD are symmetric,

D=D.D^{\top}=D.

Thus,

GL=αM1DM1.G_{L}=-\alpha M^{-1}DM^{-1}.

Equivalently,

GL=𝒥resL=α(L+αI)1(F(L)Ftar)(L+αI)1.G_{L}=\frac{\partial\mathcal{J}_{\mathrm{res}}}{\partial L}=-\alpha(L+\alpha I)^{-1}\left(F(L)-F_{\mathrm{tar}}\right)(L+\alpha I)^{-1}.

A.2.3 Moment Trace Loss

Let the moment surrogate be

Sq,d(L)=Tr(IαL)d.S_{q,d}(L)=\operatorname{Tr}\left(I-\alpha L\right)^{d}.

Define

M=IαL.M=I-\alpha L.

Then

Sq,d(L)=Tr(Md).S_{q,d}(L)=\operatorname{Tr}(M^{d}).

We first compute

dSq,d=dTr(Md).dS_{q,d}=d\,\operatorname{Tr}(M^{d}).

By the product rule,

d(Md)==0d1M(dM)Md1.d(M^{d})=\sum_{\ell=0}^{d-1}M^{\ell}(dM)M^{d-1-\ell}.

Therefore,

dSq,d==0d1Tr[M(dM)Md1].dS_{q,d}=\sum_{\ell=0}^{d-1}\operatorname{Tr}\left[M^{\ell}(dM)M^{d-1-\ell}\right].

By cyclicity of the trace,

Tr[M(dM)Md1]=Tr[Md1dM].\operatorname{Tr}\left[M^{\ell}(dM)M^{d-1-\ell}\right]=\operatorname{Tr}\left[M^{d-1}dM\right].

Hence,

dSq,d=dTr[Md1dM].dS_{q,d}=d\,\operatorname{Tr}\left[M^{d-1}dM\right].

Since

M=IαL,M=I-\alpha L,

we have

dM=αdL.dM=-\alpha dL.

Therefore,

dSq,d=αdTr[Md1dL].dS_{q,d}=-\alpha d\,\operatorname{Tr}\left[M^{d-1}dL\right].

Thus,

Sq,dL=αd(Md1).\frac{\partial S_{q,d}}{\partial L}=-\alpha d\left(M^{d-1}\right)^{\top}.

If LL is symmetric, then MM is also symmetric, and hence

Sq,dL=αdMd1.\frac{\partial S_{q,d}}{\partial L}=-\alpha dM^{d-1}.

For the loss function

𝒥mom=12(Sq,d(L)τq)2,\mathcal{J}_{\mathrm{mom}}=\frac{1}{2}\left(S_{q,d}(L)-\tau_{q}\right)^{2},

the chain rule gives

GL=𝒥momL=(Sq,d(L)τq)Sq,dL.G_{L}=\frac{\partial\mathcal{J}_{\mathrm{mom}}}{\partial L}=\left(S_{q,d}(L)-\tau_{q}\right)\frac{\partial S_{q,d}}{\partial L}.

Therefore,

GL=(Sq,d(L)τq)(αdMd1).G_{L}=\left(S_{q,d}(L)-\tau_{q}\right)\left(-\alpha dM^{d-1}\right).

When using the normalized moment surrogate

S¯q,d(L)=1NqSq,d(L),\overline{S}_{q,d}(L)=\frac{1}{N_{q}}S_{q,d}(L),

the loss function is

𝒥norm=12(S¯q,d(L)τ¯q)2.\mathcal{J}_{\mathrm{norm}}=\frac{1}{2}\left(\overline{S}_{q,d}(L)-\bar{\tau}_{q}\right)^{2}.

In this case,

GL=(S¯q,d(L)τ¯q)1Nq(αdMd1).G_{L}=\left(\overline{S}_{q,d}(L)-\bar{\tau}_{q}\right)\frac{1}{N_{q}}\left(-\alpha dM^{d-1}\right).

A.3 From the Hodge-Laplacian-type spectral relaxation to Weighted Boundaries

We now assume that

GL=𝒥LG_{L}=\frac{\partial\mathcal{J}}{\partial L}

has already been computed, and propagate it to the weighted boundary operators.

Recall that

L=AA+CC+μ(IWq),L=A^{\top}A+CC^{\top}+\mu(I-W_{q}),

where

A=B~q,C=B~q+1.A=\widetilde{B}_{q},\qquad C=\widetilde{B}_{q+1}.

Taking the differential, we obtain

dL=d(AA)+d(CC)μdWq.dL=d(A^{\top}A)+d(CC^{\top})-\mu dW_{q}.

The first two terms are

d(AA)=(dA)A+A(dA)d(A^{\top}A)=(dA)^{\top}A+A^{\top}(dA)

and

d(CC)=(dC)C+C(dC).d(CC^{\top})=(dC)C^{\top}+C(dC)^{\top}.

Thus,

dL=(dA)A+A(dA)+(dC)C+C(dC)μdWq.dL=(dA)^{\top}A+A^{\top}(dA)+(dC)C^{\top}+C(dC)^{\top}-\mu dW_{q}.

The differential of the loss is

d𝒥=GL,dLF.d\mathcal{J}=\langle G_{L},dL\rangle_{F}.

First, consider the terms involving AA:

d𝒥A=GL,(dA)A+A(dA)F.d\mathcal{J}_{A}=\left\langle G_{L},(dA)^{\top}A+A^{\top}(dA)\right\rangle_{F}.

Using properties of the Frobenius inner product,

GL,(dA)AF=AGL,dAF\left\langle G_{L},(dA)^{\top}A\right\rangle_{F}=\left\langle AG_{L}^{\top},dA\right\rangle_{F}

and

GL,A(dA)F=AGL,dAF.\left\langle G_{L},A^{\top}(dA)\right\rangle_{F}=\left\langle AG_{L},dA\right\rangle_{F}.

Since GLG_{L} is symmetrized,

GL=GL.G_{L}^{\top}=G_{L}.

Therefore,

d𝒥A=2AGL,dAF.d\mathcal{J}_{A}=\left\langle 2AG_{L},dA\right\rangle_{F}.

Hence,

GA=𝒥A=2AGL.G_{A}=\frac{\partial\mathcal{J}}{\partial A}=2AG_{L}.

Next, consider the terms involving CC:

d𝒥C=GL,(dC)C+C(dC)F.d\mathcal{J}_{C}=\left\langle G_{L},(dC)C^{\top}+C(dC)^{\top}\right\rangle_{F}.

Similarly,

GL,(dC)CF=GLC,dCF\left\langle G_{L},(dC)C^{\top}\right\rangle_{F}=\left\langle G_{L}C,dC\right\rangle_{F}

and

GL,C(dC)F=GLC,dCF.\left\langle G_{L},C(dC)^{\top}\right\rangle_{F}=\left\langle G_{L}^{\top}C,dC\right\rangle_{F}.

Since GL=GLG_{L}^{\top}=G_{L},

d𝒥C=2GLC,dCF.d\mathcal{J}_{C}=\left\langle 2G_{L}C,dC\right\rangle_{F}.

Thus,

GC=𝒥C=2GLC.G_{C}=\frac{\partial\mathcal{J}}{\partial C}=2G_{L}C.

We also compute the contribution from the penalty term

μ(IWq).\mu(I-W_{q}).

The differential of this term is

μdWq.-\mu dW_{q}.

Therefore,

d𝒥pen=μGL,dWqF.d\mathcal{J}_{\mathrm{pen}}=-\mu\langle G_{L},dW_{q}\rangle_{F}.

Since WqW_{q} is diagonal,

Wq=diag(wσ)σKmax(q),W_{q}=\operatorname{diag}(w_{\sigma})_{\sigma\in K_{\max}^{(q)}},

we have

dWq=diag(dwσ)σKmax(q).dW_{q}=\operatorname{diag}(dw_{\sigma})_{\sigma\in K_{\max}^{(q)}}.

Thus,

GL,dWqF=σKmax(q)(GL)σσdwσ.\langle G_{L},dW_{q}\rangle_{F}=\sum_{\sigma\in K_{\max}^{(q)}}(G_{L})_{\sigma\sigma}dw_{\sigma}.

Hence, the direct gradient contribution from the penalty term to a qq-simplex weight is

gwσpen=μ(GL)σσ,σKmax(q).g_{w_{\sigma}}^{\mathrm{pen}}=-\mu(G_{L})_{\sigma\sigma},\qquad\sigma\in K_{\max}^{(q)}.

A.4 From Weighted Boundaries to Square-Root Weights

We next propagate the gradient through the weighted boundary

B~p=Rp1BpRp\widetilde{B}_{p}=R_{p-1}B_{p}R_{p}

to the square-root weights ρσ\rho_{\sigma}.

Recall that

Rp=diag(ρσ)σKmax(p),ρσ=wσ.R_{p}=\operatorname{diag}(\rho_{\sigma})_{\sigma\in K_{\max}^{(p)}},\qquad\rho_{\sigma}=\sqrt{w_{\sigma}}.

Let

GB~p=𝒥B~p.G_{\widetilde{B}_{p}}=\frac{\partial\mathcal{J}}{\partial\widetilde{B}_{p}}.

In components,

(B~p)τσ=ρτ(Bp)τσρσ,(\widetilde{B}_{p})_{\tau\sigma}=\rho_{\tau}(B_{p})_{\tau\sigma}\rho_{\sigma},

where

τKmax(p1),σKmax(p).\tau\in K_{\max}^{(p-1)},\qquad\sigma\in K_{\max}^{(p)}.

First, consider the contribution to ρτ\rho_{\tau} for a (p1)(p-1)-simplex τ\tau. We have

(B~p)τσρτ=(Bp)τσρσ.\frac{\partial(\widetilde{B}_{p})_{\tau\sigma}}{\partial\rho_{\tau}}=(B_{p})_{\tau\sigma}\rho_{\sigma}.

Therefore,

𝒥ρτ=σKmax(p)(GB~p)τσ(Bp)τσρσ.\frac{\partial\mathcal{J}}{\partial\rho_{\tau}}=\sum_{\sigma\in K_{\max}^{(p)}}(G_{\widetilde{B}_{p}})_{\tau\sigma}(B_{p})_{\tau\sigma}\rho_{\sigma}.

In vector form, the contribution from B~p\widetilde{B}_{p} to ρp1\rho_{p-1} is

gρp1(p,lower)=(GB~pBp)ρp.g_{\rho_{p-1}}^{(p,\mathrm{lower})}=\left(G_{\widetilde{B}_{p}}\odot B_{p}\right)\rho_{p}.

Next, consider the contribution to ρσ\rho_{\sigma} for a pp-simplex σ\sigma. We have

(B~p)τσρσ=ρτ(Bp)τσ.\frac{\partial(\widetilde{B}_{p})_{\tau\sigma}}{\partial\rho_{\sigma}}=\rho_{\tau}(B_{p})_{\tau\sigma}.

Therefore,

𝒥ρσ=τKmax(p1)(GB~p)τσ(Bp)τσρτ.\frac{\partial\mathcal{J}}{\partial\rho_{\sigma}}=\sum_{\tau\in K_{\max}^{(p-1)}}(G_{\widetilde{B}_{p}})_{\tau\sigma}(B_{p})_{\tau\sigma}\rho_{\tau}.

In vector form, the contribution from B~p\widetilde{B}_{p} to ρp\rho_{p} is

gρp(p,upper)=(GB~pBp)ρp1.g_{\rho_{p}}^{(p,\mathrm{upper})}=\left(G_{\widetilde{B}_{p}}\odot B_{p}\right)^{\top}\rho_{p-1}.

A key point is that the same ρp\rho_{p} appears in two weighted boundaries. It appears on the right side of

B~p=Rp1BpRp\widetilde{B}_{p}=R_{p-1}B_{p}R_{p}

and on the left side of

B~p+1=RpBp+1Rp+1.\widetilde{B}_{p+1}=R_{p}B_{p+1}R_{p+1}.

Therefore, the total gradient with respect to ρp\rho_{p} is the sum of the two contributions.

Thus, for a general degree pp, the full gradient with respect to ρp\rho_{p} is

gρp=(GB~pBp)ρp1+(GB~p+1Bp+1)ρp+1,g_{\rho_{p}}=\left(G_{\widetilde{B}_{p}}\odot B_{p}\right)^{\top}\rho_{p-1}+\left(G_{\widetilde{B}_{p+1}}\odot B_{p+1}\right)\rho_{p+1},

where terms corresponding to non-existent boundary operators at the ends of the complex are omitted.

For the most important case q=1q=1, we have

L=L^1=B~1B~1+B~2B~2+μ(IW1).L=\widehat{L}_{1}=\widetilde{B}_{1}^{\top}\widetilde{B}_{1}+\widetilde{B}_{2}\widetilde{B}_{2}^{\top}+\mu(I-W_{1}).

Then

GB~1=2B~1GL,GB~2=2GLB~2.G_{\widetilde{B}_{1}}=2\widetilde{B}_{1}G_{L},\qquad G_{\widetilde{B}_{2}}=2G_{L}\widetilde{B}_{2}.

The gradient with respect to the square-root weights of edges, ρ1\rho_{1}, is

gρ1=(GB~1B1)ρ0+(GB~2B2)ρ2.g_{\rho_{1}}=\left(G_{\widetilde{B}_{1}}\odot B_{1}\right)^{\top}\rho_{0}+\left(G_{\widetilde{B}_{2}}\odot B_{2}\right)\rho_{2}.

The gradient with respect to the square-root weights of triangles, ρ2\rho_{2}, is

gρ2=(GB~2B2)ρ1.g_{\rho_{2}}=\left(G_{\widetilde{B}_{2}}\odot B_{2}\right)^{\top}\rho_{1}.

The vertex weights are usually fixed, and hence gradients with respect to ρ0\rho_{0} are not used for optimization.

A.5 From Square-Root Weights to Simplex Weights

We now compute the derivative from

ρσ=wσ\rho_{\sigma}=\sqrt{w_{\sigma}}

to

wσ.w_{\sigma}.

Since

ρσ=wσ1/2,\rho_{\sigma}=w_{\sigma}^{1/2},

we have

ρσwσ=12wσ1/2=12ρσ.\frac{\partial\rho_{\sigma}}{\partial w_{\sigma}}=\frac{1}{2}w_{\sigma}^{-1/2}=\frac{1}{2\rho_{\sigma}}.

Therefore, the gradient contribution from the weighted boundaries to the simplex weight is

gwσbdry=gρσρσwσ=gρσ2ρσ.g_{w_{\sigma}}^{\mathrm{bdry}}=g_{\rho_{\sigma}}\frac{\partial\rho_{\sigma}}{\partial w_{\sigma}}=\frac{g_{\rho_{\sigma}}}{2\rho_{\sigma}}.

If σKmax(q)\sigma\in K_{\max}^{(q)}, then the penalty contribution

gwσpen=μ(GL)σσg_{w_{\sigma}}^{\mathrm{pen}}=-\mu(G_{L})_{\sigma\sigma}

must be added. Therefore,

gwσ=gρσ2ρσμ(GL)σσ,σKmax(q).g_{w_{\sigma}}=\frac{g_{\rho_{\sigma}}}{2\rho_{\sigma}}-\mu(G_{L})_{\sigma\sigma},\qquad\sigma\in K_{\max}^{(q)}.

For a pp-simplex σKmax(p)\sigma\in K_{\max}^{(p)} with pqp\neq q, there is no direct penalty contribution, and hence

gwσ=gρσ2ρσ.g_{w_{\sigma}}=\frac{g_{\rho_{\sigma}}}{2\rho_{\sigma}}.

Numerically, when wσw_{\sigma} is very small, ρσ\rho_{\sigma} also becomes very small and instability may occur. In such cases, one may use

ρσ=wσ+ϵnum.\rho_{\sigma}=\sqrt{w_{\sigma}+\epsilon_{\mathrm{num}}}.

Then

gwσbdry=gρσ2wσ+ϵnum.g_{w_{\sigma}}^{\mathrm{bdry}}=\frac{g_{\rho_{\sigma}}}{2\sqrt{w_{\sigma}+\epsilon_{\mathrm{num}}}}.

A.6 From Simplex Weights to Edge Probabilities

In the soft clique complex, simplex weights are defined by

wσ=eσpe.w_{\sigma}=\prod_{e\subset\sigma}p_{e}.

If an edge ee is contained in σ\sigma, then

wσpe=eσeepe.\frac{\partial w_{\sigma}}{\partial p_{e}}=\prod_{\begin{subarray}{c}e^{\prime}\subset\sigma\\ e^{\prime}\neq e\end{subarray}}p_{e^{\prime}}.

Equivalently,

wσpe=wσpe.\frac{\partial w_{\sigma}}{\partial p_{e}}=\frac{w_{\sigma}}{p_{e}}.

If

eσ,e\not\subset\sigma,

then

wσpe=0.\frac{\partial w_{\sigma}}{\partial p_{e}}=0.

Therefore, the gradient with respect to the edge probability pep_{e} is the sum of the contributions from all simplices containing ee:

gpe=𝒥pe=σegwσwσpe.g_{p_{e}}=\frac{\partial\mathcal{J}}{\partial p_{e}}=\sum_{\sigma\supset e}g_{w_{\sigma}}\frac{\partial w_{\sigma}}{\partial p_{e}}.

Thus,

gpe=σegwσwσpe.g_{p_{e}}=\sum_{\sigma\supset e}g_{w_{\sigma}}\frac{w_{\sigma}}{p_{e}}.

For numerical stability when pep_{e} is very small, one may use

gpe=σegwσwσpe+ϵnum.g_{p_{e}}=\sum_{\sigma\supset e}g_{w_{\sigma}}\frac{w_{\sigma}}{p_{e}+\epsilon_{\mathrm{num}}}.

A.7 From Edge Probabilities to Edge Logits

The edge probability is defined from the edge logit aea_{e} by

pe=σ(ae)=11+exp(ae).p_{e}=\sigma(a_{e})=\frac{1}{1+\exp(-a_{e})}.

The derivative of the logistic function is

peae=pe(1pe).\frac{\partial p_{e}}{\partial a_{e}}=p_{e}(1-p_{e}).

Therefore,

gae=𝒥ae=gpepe(1pe).g_{a_{e}}=\frac{\partial\mathcal{J}}{\partial a_{e}}=g_{p_{e}}p_{e}(1-p_{e}).

Substituting the result from the previous subsection gives

gae=pe(1pe)σegwσwσpe.g_{a_{e}}=p_{e}(1-p_{e})\sum_{\sigma\supset e}g_{w_{\sigma}}\frac{w_{\sigma}}{p_{e}}.

When pe>0p_{e}>0, this can be simplified as

gae=(1pe)σegwσwσ.g_{a_{e}}=(1-p_{e})\sum_{\sigma\supset e}g_{w_{\sigma}}w_{\sigma}.

However, for numerical stability, it is safer to use

gae=pe(1pe)σegwσwσpe+ϵnum.g_{a_{e}}=p_{e}(1-p_{e})\sum_{\sigma\supset e}g_{w_{\sigma}}\frac{w_{\sigma}}{p_{e}+\epsilon_{\mathrm{num}}}.

Thus, gradients of arbitrary low-pass spectral losses can be propagated analytically to the edge logits

ae.a_{e}.

A.8 Backward Computation for Graph Generation

In graph generation, the optimization variables are the edge logits

a={ae}eEmax.a=\{a_{e}\}_{e\in E_{\max}}.

Thus, the final output of the backward computation is

a𝒥=(gae)eEmax.\nabla_{a}\mathcal{J}=\left(g_{a_{e}}\right)_{e\in E_{\max}}.

The computation proceeds as follows.

First, compute edge probabilities from edge logits:

pe=σ(ae).p_{e}=\sigma(a_{e}).

Next, compute simplex weights:

wσ=eσpe.w_{\sigma}=\prod_{e\subset\sigma}p_{e}.

Then construct

Wq,Rq,B~q,L^q.W_{q},\qquad R_{q},\qquad\widetilde{B}_{q},\qquad\widehat{L}_{q}.

Next, compute

GL=𝒥L^qG_{L}=\frac{\partial\mathcal{J}}{\partial\widehat{L}_{q}}

from the loss function. For example, for the normalized Laplacian moment loss

𝒥norm=12(S¯q,dτ¯q)2,\mathcal{J}_{\mathrm{norm}}=\frac{1}{2}\left(\overline{S}_{q,d}-\bar{\tau}_{q}\right)^{2},

where

S¯q,d=1NqTr(IαL^q)d,\overline{S}_{q,d}=\frac{1}{N_{q}}\operatorname{Tr}\left(I-\alpha\widehat{L}_{q}\right)^{d},

define

M=IαL^q.M=I-\alpha\widehat{L}_{q}.

Then

GL=(S¯q,dτ¯q)1Nq(αdMd1).G_{L}=\left(\overline{S}_{q,d}-\bar{\tau}_{q}\right)\frac{1}{N_{q}}\left(-\alpha dM^{d-1}\right).

Using this GLG_{L}, compute

GB~q=2B~qGLG_{\widetilde{B}_{q}}=2\widetilde{B}_{q}G_{L}

and

GB~q+1=2GLB~q+1.G_{\widetilde{B}_{q+1}}=2G_{L}\widetilde{B}_{q+1}.

Then compute

gρpg_{\rho_{p}}

using the formula in the previous subsection. In particular, for q=1q=1,

gρ1=(GB~1B1)ρ0+(GB~2B2)ρ2.g_{\rho_{1}}=\left(G_{\widetilde{B}_{1}}\odot B_{1}\right)^{\top}\rho_{0}+\left(G_{\widetilde{B}_{2}}\odot B_{2}\right)\rho_{2}.

Then compute

gwσ.g_{w_{\sigma}}.

Next compute

gpe=σegwσwσpe.g_{p_{e}}=\sum_{\sigma\supset e}g_{w_{\sigma}}\frac{w_{\sigma}}{p_{e}}.

Finally compute

gae=gpepe(1pe).g_{a_{e}}=g_{p_{e}}p_{e}(1-p_{e}).

This gives

𝒥ae=gae.\frac{\partial\mathcal{J}}{\partial a_{e}}=g_{a_{e}}.

Therefore, the edge logits can be updated by gradient-based optimization.

A.9 From Vietoris–Rips Edge Logits to Distances

In the Vietoris–Rips setting, edge logits are defined from point-cloud coordinates. At scale rr,

aij(r)(X)=rdij(X)ε,a_{ij}^{(r)}(X)=\frac{r-d_{ij}(X)}{\varepsilon},

where

dij(X)=xixj2+δ.d_{ij}(X)=\sqrt{\|x_{i}-x_{j}\|^{2}+\delta}.

Assume that the gradient with respect to each edge logit has already been computed:

gaij(r)=𝒥aij(r).g_{a_{ij}^{(r)}}=\frac{\partial\mathcal{J}}{\partial a_{ij}^{(r)}}.

We now propagate this gradient to the distance dijd_{ij}.

Since

aij(r)=rdijε,a_{ij}^{(r)}=\frac{r-d_{ij}}{\varepsilon},

we have

aij(r)dij=1ε.\frac{\partial a_{ij}^{(r)}}{\partial d_{ij}}=-\frac{1}{\varepsilon}.

Therefore,

gdij(r)=𝒥dij=gaij(r)aij(r)dij=1εgaij(r).g_{d_{ij}}^{(r)}=\frac{\partial\mathcal{J}}{\partial d_{ij}}=g_{a_{ij}^{(r)}}\frac{\partial a_{ij}^{(r)}}{\partial d_{ij}}=-\frac{1}{\varepsilon}g_{a_{ij}^{(r)}}.

A.10 From Distances to Point-Cloud Coordinates

We next differentiate the stabilized distance

dij(X)=xixj2+δd_{ij}(X)=\sqrt{\|x_{i}-x_{j}\|^{2}+\delta}

with respect to the point-cloud coordinates.

Since

dij(X)2=xixj2+δ,d_{ij}(X)^{2}=\|x_{i}-x_{j}\|^{2}+\delta,

we have

xidij=xixjdij\nabla_{x_{i}}d_{ij}=\frac{x_{i}-x_{j}}{d_{ij}}

and

xjdij=xjxidij.\nabla_{x_{j}}d_{ij}=\frac{x_{j}-x_{i}}{d_{ij}}.

At a single scale rr, the gradient with respect to a point xix_{i} is the sum of the contributions from all candidate edges incident to xix_{i}:

xi𝒥(r)=j:(i,j)Emaxgdij(r)xixjdij.\nabla_{x_{i}}\mathcal{J}^{(r)}=\sum_{j:(i,j)\in E_{\max}}g_{d_{ij}}^{(r)}\frac{x_{i}-x_{j}}{d_{ij}}.

Substituting

gdij(r)=1εgaij(r),g_{d_{ij}}^{(r)}=-\frac{1}{\varepsilon}g_{a_{ij}^{(r)}},

we obtain

xi𝒥(r)=1εj:(i,j)Emaxgaij(r)xixjdij.\nabla_{x_{i}}\mathcal{J}^{(r)}=-\frac{1}{\varepsilon}\sum_{j:(i,j)\in E_{\max}}g_{a_{ij}^{(r)}}\frac{x_{i}-x_{j}}{d_{ij}}.

Furthermore, using

gaij(r)=pij(r)(1pij(r))σ(i,j)gwσ(r)wσ(r)pij(r),g_{a_{ij}^{(r)}}=p_{ij}^{(r)}\left(1-p_{ij}^{(r)}\right)\sum_{\sigma\supset(i,j)}g_{w_{\sigma}}^{(r)}\frac{w_{\sigma}^{(r)}}{p_{ij}^{(r)}},

we obtain

xi𝒥(r)=1εj:(i,j)Emax[pij(r)(1pij(r))σ(i,j)gwσ(r)wσ(r)pij(r)]xixjdij.\nabla_{x_{i}}\mathcal{J}^{(r)}=-\frac{1}{\varepsilon}\sum_{j:(i,j)\in E_{\max}}\left[p_{ij}^{(r)}\left(1-p_{ij}^{(r)}\right)\sum_{\sigma\supset(i,j)}g_{w_{\sigma}}^{(r)}\frac{w_{\sigma}^{(r)}}{p_{ij}^{(r)}}\right]\frac{x_{i}-x_{j}}{d_{ij}}.

For numerical stability, one may replace

wσ(r)pij(r)\frac{w_{\sigma}^{(r)}}{p_{ij}^{(r)}}

by

wσ(r)pij(r)+ϵnum.\frac{w_{\sigma}^{(r)}}{p_{ij}^{(r)}+\epsilon_{\mathrm{num}}}.

A.11 Multi-Scale Vietoris–Rips Backward Computation

In a Vietoris–Rips filtration, we use multiple scales

r1<r2<<rM.r_{1}<r_{2}<\cdots<r_{M}.

Suppose that the loss function is

𝒥(X)=m=1Mωm𝒥m(L^q(rm)(X)).\mathcal{J}(X)=\sum_{m=1}^{M}\omega_{m}\mathcal{J}_{m}\left(\widehat{L}_{q}^{(r_{m})}(X)\right).

For each scale rmr_{m}, first compute

GL(m)=𝒥mL^q(rm).G_{L}^{(m)}=\frac{\partial\mathcal{J}_{m}}{\partial\widehat{L}_{q}^{(r_{m})}}.

Then, following the procedure above, compute

gaij(rm)=𝒥maij(rm).g_{a_{ij}^{(r_{m})}}=\frac{\partial\mathcal{J}_{m}}{\partial a_{ij}^{(r_{m})}}.

The edge logit at scale rmr_{m} is

aij(rm)(X)=rmdij(X)ε.a_{ij}^{(r_{m})}(X)=\frac{r_{m}-d_{ij}(X)}{\varepsilon}.

The distance dij(X)d_{ij}(X) does not depend on the scale, whereas the edge-logit gradient

gaij(rm)g_{a_{ij}^{(r_{m})}}

does depend on the scale.

The gradient of the total loss with respect to xix_{i} is the weighted sum of the contributions from all scales:

xi𝒥=m=1Mωmxi𝒥m.\nabla_{x_{i}}\mathcal{J}=\sum_{m=1}^{M}\omega_{m}\nabla_{x_{i}}\mathcal{J}_{m}.

Therefore,

xi𝒥=1εm=1Mωmj:(i,j)Emaxgaij(rm)xixjdij.\nabla_{x_{i}}\mathcal{J}=-\frac{1}{\varepsilon}\sum_{m=1}^{M}\omega_{m}\sum_{j:(i,j)\in E_{\max}}g_{a_{ij}^{(r_{m})}}\frac{x_{i}-x_{j}}{d_{ij}}.

Writing gaij(rm)g_{a_{ij}^{(r_{m})}} explicitly, we have

xi𝒥=1εm=1Mωmj:(i,j)Emax[pij(rm)(1pij(rm))σ(i,j)gwσ(m)wσ(rm)pij(rm)]xixjdij.\nabla_{x_{i}}\mathcal{J}=-\frac{1}{\varepsilon}\sum_{m=1}^{M}\omega_{m}\sum_{j:(i,j)\in E_{\max}}\left[p_{ij}^{(r_{m})}\left(1-p_{ij}^{(r_{m})}\right)\sum_{\sigma\supset(i,j)}g_{w_{\sigma}}^{(m)}\frac{w_{\sigma}^{(r_{m})}}{p_{ij}^{(r_{m})}}\right]\frac{x_{i}-x_{j}}{d_{ij}}.

This shows that the multi-scale Vietoris–Rips loss is analytically differentiable with respect to the point-cloud coordinates XX.

A.12 Summary of Backward Computation

The backward computation derived in this appendix can be summarized as follows.

First, compute

GL=𝒥L^qG_{L}=\frac{\partial\mathcal{J}}{\partial\widehat{L}_{q}}

from the spectral loss.

Next, using

L^q=B~qB~q+B~q+1B~q+1+μ(IWq),\widehat{L}_{q}=\widetilde{B}_{q}^{\top}\widetilde{B}_{q}+\widetilde{B}_{q+1}\widetilde{B}_{q+1}^{\top}+\mu(I-W_{q}),

compute

GB~q=2B~qGLG_{\widetilde{B}_{q}}=2\widetilde{B}_{q}G_{L}

and

GB~q+1=2GLB~q+1.G_{\widetilde{B}_{q+1}}=2G_{L}\widetilde{B}_{q+1}.

For a general degree pp, the gradient with respect to ρp\rho_{p} is

gρp=(GB~pBp)ρp1+(GB~p+1Bp+1)ρp+1,g_{\rho_{p}}=\left(G_{\widetilde{B}_{p}}\odot B_{p}\right)^{\top}\rho_{p-1}+\left(G_{\widetilde{B}_{p+1}}\odot B_{p+1}\right)\rho_{p+1},

where non-existent boundary terms are omitted.

Then, since

ρσ=wσ,\rho_{\sigma}=\sqrt{w_{\sigma}},

compute

gwσbdry=gρσ2ρσ.g_{w_{\sigma}}^{\mathrm{bdry}}=\frac{g_{\rho_{\sigma}}}{2\rho_{\sigma}}.

For σKmax(q)\sigma\in K_{\max}^{(q)}, add the penalty contribution:

gwσ=gρσ2ρσμ(GL)σσ.g_{w_{\sigma}}=\frac{g_{\rho_{\sigma}}}{2\rho_{\sigma}}-\mu(G_{L})_{\sigma\sigma}.

Next, using the soft clique relation

wσ=eσpe,w_{\sigma}=\prod_{e\subset\sigma}p_{e},

compute

gpe=σegwσwσpe.g_{p_{e}}=\sum_{\sigma\supset e}g_{w_{\sigma}}\frac{w_{\sigma}}{p_{e}}.

Finally, since

pe=σ(ae),p_{e}=\sigma(a_{e}),

compute

gae=gpepe(1pe).g_{a_{e}}=g_{p_{e}}p_{e}(1-p_{e}).

In graph generation, this

gaeg_{a_{e}}

is the gradient with respect to the optimization variable.

In the Vietoris–Rips setting, using

aij(r)(X)=rdij(X)εa_{ij}^{(r)}(X)=\frac{r-d_{ij}(X)}{\varepsilon}

and

dij(X)=xixj2+δ,d_{ij}(X)=\sqrt{\|x_{i}-x_{j}\|^{2}+\delta},

we obtain the point-coordinate gradient

xi𝒥=1εm=1Mωmj:(i,j)Emaxgaij(rm)xixjdij.\nabla_{x_{i}}\mathcal{J}=-\frac{1}{\varepsilon}\sum_{m=1}^{M}\omega_{m}\sum_{j:(i,j)\in E_{\max}}g_{a_{ij}^{(r_{m})}}\frac{x_{i}-x_{j}}{d_{ij}}.

Thus, all low-pass spectral losses introduced in the main text are analytically differentiable through the Hodge-Laplacian-type spectral relaxation, soft simplex weights, edge probabilities, edge logits, and, in the Vietoris–Rips setting, point-cloud coordinates.

References