License: arXiv.org perpetual non-exclusive license
arXiv:2606.25696v1 [cs.CE] 24 Jun 2026

A Two-Stage Decision Support System for Sustainability-Aware Long–Short Portfolio Optimization

Giacomo di Tollo g.ditollo@staff.univpm.it Massimiliano Kaucic Filippo Piccotto filippo.piccotto@deams.units.it Marche Polytechnic University, Department of Management, Piazza Roma 22, Ancona, Italy University of Trieste, Department of Economics, Business, Mathematics and Statistics, Via Valerio 4/1, Trieste, Italy
Abstract

This paper proposes a two-stage decision support system for long-short portfolio optimization under environmental, social, and governance (ESG) considerations. In the first stage, assets are evaluated using a multi-criteria procedure based on TODIMSort, with criterion weights derived using the MEREC (Removal Effects of Criteria) method. This allows assets to be assigned to classes ordered according to preferences that respond to market conditions and investor priorities, thus generating sets of long and short opportunities that dynamically adapt to the prevailing regime. In the second stage, we formulate a non-convex portfolio optimization problem that maximizes the Omega ratio while respecting budget, bound and leverage constraints. To solve it, we introduce an adaptive particle swarm solver equipped with a controller that selects, at each iteration, the most suitable recombination operator from a diverse pool of operators and combines it with a projection-based repair mechanism for constraint management. The empirical study, conducted on 421 stocks in the STOXX Europe 600 index, examines both the exploration capabilities and solution quality of the proposed solver compared to state-of-the-art benchmarks, as well as the ex post profitability of the resulting portfolio strategies. The results show that ESG-enhanced long-short portfolios offer competitive and often superior performance compared to their non-ESG counterparts and the market-value-weighted benchmark.

keywords:
Portfolio Optimization; Decision Support Systems; Evolutionary Computation
journal: Omega

1 Introduction

The mean-variance analysis (Markowitz (1952)) represents the cornerstone of modern portfolio theory and is widely accepted as a wealth management tool in the financial industry (Guerard (2009); Kalayci et al. (2019)). Specifically, this model illustrates the portfolio selection problem as a quadratic program that minimizes portfolio variance given a level of expected return. Beyond this foundational framework, several studies proposed alternative model formulations and incorporated various types of constraints to account for specific investment guidelines and institutional features of the investment process. On the one hand, advances in portfolio modeling include using other measures to quantify portfolio risk, such as mean absolute deviation, value-at-risk, and conditional value-at-risk (Konno and Yamazaki (1991); Rockafellar and Uryasev (2000); Krokhmal et al. (2002)). In addition, investors are typically interested in the reward component represented by the expected return of the portfolio, and in the performance of the proposed investment strategy compared to a market benchmark. Hence, many authors proposed optimization models in which benchmark-related reward-to-risk measures guide investors’ decisions. Some examples include the Sharpe ratio and its extensions (Kaucic et al. (2024)), the Omega ratio (Guastaroba et al. (2016); Sharma et al. (2017)), and other performance metrics that incorporate tail-risk measures (Farinelli et al. (2008); Corazza et al. (2021)). On the other hand, a substantial body of literature introduced real-world portfolio constraints in addition to standard budget, no short-selling, and floor/ceiling constraints. First, cardinality constraints limit a portfolio to have a maximum number of constituents (Chang et al., 2000). Then, turnover constraints control the magnitude of trades during portfolio rebalancements (Soleimani et al. (2009); Woodside-Oriakhi et al. (2011)), liquidity constraints impose a limit on the amount an individual can borrow (Lo et al. (2006)), or transaction costs constraints control the amount paid when buying or selling portfolio positions (Borkovec et al. (2010); Ruiz-Torrubiano and Suarez (2015)). Finally, to foster weights diversification, some studies introduced constraints on portfolio norms (DeMiguel et al. (2009)), or included risk-budgeting constraints in portfolio design (see Kaucic (2019) and references therein).

Although long-only portfolio selection simplifies optimization and is the standard practice among scholars and practitioners, it may lead to several pitfalls. In fact, forbidding short positions restricts diversification and limits potential opportunities to exploit negative views on assets and construct market-neutral strategies. Jacobs et al. (1999) proposed a long-short framework applied to the Markowitz mean-variance analysis, laying the foundations for the subsequent long-short portfolio management literature. Gilli et al. (2011) developed a portfolio model that maximizes the Omega ratio subject to cardinality constraints, which employs the 130/30 rule, where short sales are allowed up to 30%\% of the net asset value of the overall portfolio. However, enriching the mean-variance framework through alternative performance measures and realistic constraints significantly increases the complexity of the portfolio selection problem, often leading to mixed-integer, non-linear, and non-convex formulations, for which finding feasible and optimal solutions becomes computationally challenging using exact solvers (Moral-Escudero et al. (2006)). For these reasons, metaheuristic approaches, such as evolutionary and swarm intelligence algorithms equipped with hybrid constraint-handling techniques, have been developed to optimize complex portfolio optimization problems (see Erwin and Engelbrecht (2023) for a comprehensive literature review). In particular, several authors used metaheuristic algorithms in long-short portfolio formulations (Pai and Michel (2012); Chou et al. (2021); di Tollo et al. (2024)).

In parallel, a rapidly growing stream of research recently explored the use of machine-learning methods, and in particular learn-to-rank (LTR) algorithms, for long–short portfolio construction. These approaches aim at generating ordered lists of assets from which long and short legs are derived directly. Early contributions such as Zhang et al. (2022) propose list-wise ranking architectures in which the model is trained to maximize the separation between top- and bottom-ranked assets, while more recent developments refine this idea through enhanced ranking losses and feature engineering (Linger et al., 2025). Although promising, this line of work revealed important limitations, since many LTR-based long–short strategies tend to produce unstable rankings and exhibit pronounced sensitivity to noise in the predictors, which ultimately undermines the robustness of the resulting allocations and makes performance heavily dependent on the particular choice of hyperparameters and training windows. To address these issues, Kouloumpris et al. (2024) introduced SABER, a stochastic-aware bootstrap ensemble ranking method designed to improve stability and mitigate overfitting by aggregating multiple perturbed ranking estimates. While SABER enhances robustness relative to single-model LTR methods, it still operates as a purely data-driven ranking engine and does not explicitly incorporate behavioral or financial-interpretation principles in the formation of long and short sets.

To overcome these limitations, in this paper we propose an integrated two-stage decision support system (DSS) to construct equity portfolios that also allows for short selling. We consider the asset allocation problem from the perspective of institutional investors and fund managers operating in equity markets, who base their financial decisions on the Omega ratio performance measure and incorporate ESG considerations in their decision process. The central premise of our contribution is that the definition of long and short legs should arise from a data-driven and behaviorally coherent decision process rather than from ad-hoc statistical ranking procedures.

In the proposed architecture, stage 1 performs stock screening using multi-criteria decision-making (MCDM) theory. This module starts by identifying the current market phase, distinguishing among bull, bear, and sideways regime. Then, it integrates phase-independent and phase-dependent criteria in the stock-picking process. To objectively determine the relative importance of the considered criteria, we employ the MEREC method (Ghorabaee et al. (2021)). Then, we obtain assets’ rankings through the so-called TODIM (the portuguese acronym for interactive multi-criteria decision making), a MCDM method introduced by Gomes and Lima (1991) that integrates behavioral finance principles to evaluate criteria alternatives. This method has been successfully applied in the financial field, due to its capacity to reflect behavioral characteristics of decision-makers (Alali and Tolga (2019); Wu et al. (2022)). However, to construct long-short portfolios, a ranking is not sufficient, since we need to assign assets to three classes, namely long, neutral, and short. To achieve this goal, we utilize the TODIMSort (Wang et al. (2023)), an enhanced version of the TODIM method that assigns assets to ordered preference classes through limiting profiles. A key novelty of our approach is that the limiting profiles are not specified solely on the basis of subjective preferences, as in traditional TODIMSort implementations, but are instead constructed dynamically by combining investor-specified percentile thresholds with empirical distributions of criteria conditioned on the prevailing market regime. As a result, limiting profiles are both behaviorally meaningful and consistent with the prevailing market regime. Once the investment set is defined, stage 2 consists of a non-convex portfolio optimization model that maximizes the Omega ratio under long–short budget and bound constraints. To solve this problem, we develop an adaptive multi-operator particle swarm optimization algorithm (AMPSO), which combines time-varying acceleration coefficients (Ratnaweera et al., 2004), an adaptive operator selection mechanism (Filograsso and di Tollo, 2023), and a projection-based constraint-handling procedure to ensure feasibility (Maculan et al., 2003). The Supplementary Material contains a graphical chart that illustrates our two-stage DSS architecture.

To evaluate the proposed framework, we conduct a threefold extensive empirical analysis employing a large panel of European equities over a 10–year horizon, integrating daily prices, monthly ESG indicators, and regulatory constraints on short selling. First, we examine the behavior of the MEREC–TODIMSort module, analyzing how market phases influence criteria weights, the shape of limiting profiles, and the resulting long, neutral, and short sets. Second, we evaluate the optimization stage through a benchmark suite of long–short portfolio instances, comparing AMPSO with multiple PSO-based variants under identical computational budgets. Finally, we assess the out-of-sample performance of the resulting strategies under different leverage levels and limiting-profile configurations. Additional portfolios that exclude ESG-based criteria are constructed to isolate the marginal contribution of sustainability-related information to overall performance.

The main contributions of this study can be summarized as follows.

  • 1.

    We design a unified architecture for long-short portfolios in which a multi-criteria screening procedure and a non-convex optimization model are structurally interconnected, with ESG indicators directly embedded in the pre-selection stage.

  • 2.

    We introduce a novel implementation of TODIMSort in which limiting profiles are constructed dynamically using investor-defined percentile thresholds combined with empirical criterion distributions conditioned on the prevailing market phase. This yields a data-driven, market-coherent and behaviorally consistent segmentation of assets into short, neutral and long classes.

  • 3.

    We develop a variant of the particle swarm optimization algorithm that integrates time-varying acceleration coefficients, adaptive operator selection, and projection repair mechanisms.

The remainder of the paper is organized as follows. Section 2 positions our contribution within the existing literature on multi-criteria decision support systems and long–short portfolio optimization. Section 3 introduces the proposed long–short portfolio formulation based on the Omega ratio and discusses its financial interpretation. Sections 4 and 5 describe the MEREC-TODIMSort screening framework and the AMPSO algorithm, respectively. Section 6 reports the empirical analysis, covering data specification, screening behavior, algorithmic evaluation, and ex-post profitability. Finally, Section 7 concludes and indicates directions for future research.

2 Literature Positioning

Table 1 reports some recent selected studies that propose two-stage decision support systems for constructing portfolio strategies. We selected the papers that respected two conditions: (i) the proposal of a DSS in the field of portfolio optimization, and (ii) the presence of a long-short portfolio formulation, and/or the presence of ESG considerations. Only four papers in the recent literature satisfied these requirements. As an exception, we report the work of Xidonas et al. (2021), since it represents a seminal contribution in this field. In this paper, the authors employ four MCDM methods, namely ELECTRE III, PROMETHEE II, MAUT, and TOPSIS, to perform a pre-selection of 60 stocks from a subset of 516 NYSE-listed assets. Then, they introduce a long-only multi-objective portfolio model that maximizes the expected return while minimizing the portfolio risk, also accounting for many realistic constraints, and solve the resulting problem using a Genetic Algorithm (GA). Banerjee et al. (2024) propose a two-stage DSS where a fuzzy extension of TODIM performs the stock selection. In addition, they consider a mean–variance model with cardinality, round lots, transaction costs, and budget constraints that also allows for short sales. They employ a GA algorithm to find optimal solutions and assess the capabilities of this model across three major equity markets, namely Nifty 100 (India), S&P 500 (USA), and FTSE 100 (UK). The studies of Jain et al. (2025a, b) propose a two-stage architecture to construct optimal portfolios with an ESG mandate using data from the Nifty 50 and Nifty 100 indices, respectively. In these works, ESG considerations enter the portfolio construction in the constraints set, as the authors include in the model a minimum threshold for the ESG score of the portfolio. Moreover, their objective function is a weighted function of higher-order moments of the return distribution. For the optimization phase, both studies employ a tailored version of the teaching-learning-based optimization algorithm (TLBO), with a repair constraint-handling strategy for bound constraints. However, they do not allow for short-sales in their models. Finally, in the work of Van Wallendael et al. (2026), the authors first define ESG-informed returns as a linear combination of asset returns and ESG scores, then employ these returns in portfolio selection by proposing a long-only mean-CVaR formulation. In addition, they also use ESG as a screening tool, eliminating assets under a minimum ESG acceptance threshold in the pre-selection phase.

Reference MCDM Opt. Model Constraints Solver+CH ESG-Aware Long-Short
Xidonas et al. (2021) 4 MCDM methods + Entropy Weighting Mean-Variance MOP Cardinality, budget, no short-selling, bounds Genetic Algorithm (GA) - no CH reported
Banerjee et al. (2024) Fuzzy TODIM Mean-Variance MOP Cardinality, Budget, Round lot, Bounds, Transaction costs Genetic Algorithm (GA) - no CH reported
Jain et al. (2025a) VIKOR + Entropy Weighting Weighted function of skewness, kurtosis, Gini index Minimum expected return, Budget, no short-selling, Bounds, Minimum ESG Teaching–Learning-Based Optimization (TLBO) + repair CH
Jain et al. (2025b) TODIM + Entropy Weighting Weighted function of skewness, kurtosis, Gini index, conditional value at risk, semi-variance Minimum expected return, Budget, no short-selling, Bounds, Minimum ESG Teaching-Learning-Based Optimization (TLBO) + repair CH
Van Wallendael et al. (2026) Machine learning-based stock selection Minimum CVaR model Expected return, Budget, no short-selling Not reported
This paper TODIMSort + MEREC weighting Omega ratio maximization Budget, long-short bounds Particle swarm optimizer (PSO) + adaptive operator selection + projection CH
Table 1: Selected literature on two-stage decision support systems in portfolio selection with multi-criteria decision making screening. “CH” stands for constraint-handling.

This review highlights several open questions in the existing literature. Our study addresses these issues along the following dimensions.

  • 1.

    Although the introduction of two-stage decision support systems for portfolio construction is a topic of growing interest in the current literature, the application of this architecture is at an early stage of development, and no existing studies have proposed DSS in a long-short setting also accounting for ESG considerations.

  • 2.

    Analyzed studies use ESG information either in the definition of portfolio returns or to impose minimum ESG requirements within the optimization model. Our contribution embeds ESG indicators at a decision-making level, using them to obtain explicit criteria for the pre-selection stage.

  • 3.

    The proposed optimization models usually involve standard mean-variance or CVaR formulations. Our work considers the Omega ratio as the objective function, introducing non-convexity component in the optimization problem.

  • 4.

    While four out of the five works employ metaheuristic solvers to find optimal solutions of the proposed models, they devote limited attention to analyzing their convergence behavior, feasibility attainment, and constraint-handling effectiveness. In this study, we explicitly assess the performance of our proposed AMPSO, also introducing comparisons with other state-of-the art solvers.

Summing up, this work fits the intersection of (i) MCDM methods for asset screening, (ii) non-convex portfolio optimization, and (iii) advanced metaheuristic design, using these tools for constructing ESG-aware long-short portfolios.

3 Portfolio Optimization Framework

In this study, we consider an investment framework composed of a financial market with NN risky assets and define a portfolio as a NN-dimensional vector of asset weights wNw\in\mathbb{R}^{N}. We assume a single-period investment horizon, meaning that the investor allocates wealth at an initial time and evaluates performance at a subsequent time corresponding to the chosen horizon. Within this mono-periodic setting, let (Ξ,𝔉,P)(\Xi,\mathfrak{F},P) be the probability space on which the random variables are defined and let R=(R1,,RN)R=(R_{1},\ldots,R_{N})^{\top} be the random vector of asset returns. Hence, Rw=wRR_{w}=w^{\top}R represents the portfolio return. Similarly, we denote by RbR_{b} the stochastic return of a benchmark index. Moreover, investor decisions are guided by the Omega ratio, a performance measure introduced in Keating and Shadwick (2002). Formally, it is defined as

Ω(Rw;Rb)=𝔼((RwRb)+)𝔼((RbRw)+)\Omega(R_{w};R_{b})=\frac{\mathbb{E}\left((R_{w}-R_{b})_{+}\right)}{\mathbb{E}\left((R_{b}-R_{w})_{+}\right)} (3.1)

where (z)+=max{z,0}(z)_{+}=\max\{z,0\}, with zz\in\mathbb{R}. Intuitively, the numerator captures the expected gains, while the denominator quantifies the expected losses with respect to the benchmark. The Omega ratio satisfies the following desirable axiomatic properties for performance measures, as discussed in Rachev et al. (2008):

  • 1.

    Monotonicity: if a portfolio ww stochastically dominates another portfolio ww^{\prime}, then Ω(Rw;Rb)Ω(Rw;Rb)\Omega(R_{w};R_{b})\geq\Omega(R_{w^{\prime}};R_{b});

  • 2.

    Quasi-concavity: the upper level sets {wN:Ω(Rw;Rb)α}\{w\in\mathbb{R}^{N}:\Omega(R_{w};R_{b})\geq\alpha\} are convex;

  • 3.

    Scale-invariance: Ω(λRw;Rb)=Ω(Rw;Rb)\Omega(\lambda R_{w};R_{b})=\Omega(R_{w};R_{b}) for all λ>0\lambda>0;

  • 4.

    Translation-invariance: shifting all returns by a constant affects the ratio in a predictable way.

The fact that the Omega ratio satisfies these properties makes it particularly suitable for being optimized in portfolio selection models, especially in a non-Gaussian framework that considers higher-order moments and tail behavior.

3.1 Problem Statement

We consider a portfolio selection setting in which short-selling is allowed. Let IlongI_{\text{long}} and IshortI_{\text{short}} denote the sets of assets assigned, respectively, to the long and short legs of the strategy, with cardinalities nlongn_{\text{long}} and nshortn_{\text{short}}, where nlong+nshort=nNn_{\text{long}}+n_{\text{short}}=n\leq N. These sets are determined endogenously through the classification procedure described in Section 4, and therefore reflect both investor preferences and market conditions. Then, the long–short portfolio optimization problem is formulated as

maxwn\displaystyle\max_{w\in\mathbb{R}^{n}} Ω(Rw;Rb)\displaystyle\Omega(R_{w};\,R_{b}) (3.2)
s.t. iIlongwi=1+s\displaystyle\sum_{i\in I_{\text{long}}}w_{i}=1+s
iIshortwi=s\displaystyle\sum_{i\in I_{\text{short}}}w_{i}=-s
0wiwiu,iIlong\displaystyle 0\leq w_{i}\leq w_{i}^{u},\quad i\in I_{\text{long}}
wiwi0,iIshort\displaystyle w_{i}^{\ell}\leq w_{i}\leq 0,\quad i\in I_{\text{short}}

where s[0,1]s\in[0,1] is a parameter that controls the intensity of short exposure, wiu>0w_{i}^{u}>0, and wi<0w_{i}^{\ell}<0.

The first two constraints ensure that the net investment is equal to one, with a long exposure of 1+s1+s financed by a short exposure of ss. In particular, when s=0s=0 and Ishort=I_{\text{short}}=\emptyset, the model reduces to a standard long-only allocation with iIlongwi=1\sum_{i\in I_{\text{long}}}w_{i}=1. For s>0s>0, the short leg generates income of size ss, which finance an additional long exposure of the same size. Thus, the total long capital employed becomes 1+s1+s, while maintaining a net investment of one. Higher values of ss correspond to more aggressive hedge-fund-like strategies, increasing directional exposure and effective leverage. Upper and lower bounds wiuw_{i}^{u}, wiw_{i}^{\ell} impose realistic position limits on long and short weights.

When short-selling is temporarily suspended (for example, during periods of market stress or regulatory intervention) the admissible portfolio reduces to the following long-only allocation:

maxwnlong-only\displaystyle\max_{w\in\mathbb{R}^{n_{\text{long-only}}}} Ω(Rw;Rb)\displaystyle\Omega(R_{w};R_{b}) (3.3)
s.t. iIlong-onlywi=1\displaystyle\sum_{i\in I_{\text{long-only}}}w_{i}=1
0wiwiu,iIlong-only\displaystyle 0\leq w_{i}\leq w_{i}^{u},\quad i\in I_{\text{long-only}}

where Ilong-onlyI_{\text{long-only}} is the set of indices that correspond to the portfolio constituents in the long-only setting. To ensure comparable net exposure between the long–short model (3.2) and the long-only regime, the admissible number of long-only assets is adjusted as nlong-only=nlong1+sn_{\text{long-only}}=\left\lfloor\dfrac{n_{\text{long}}}{1+s}\right\rfloor where nlongn_{\text{long}} is the cardinality of the long set identified in the classification stage. The constituents of Ilong-onlyI_{\text{long-only}} are selected based on the TODIM ranking, thereby maintaining consistency with the pre-screening procedure. This specification reflects the fact that, in the absence of short-selling, the long leg cannot be expanded through internal financing. The adjustment in nlong-onlyn_{\text{long-only}} ensures that the effective risk concentration remains comparable to the leveraged long–short configuration with parameter ss.

3.1.1 Empirical Estimation of the Omega Ratio

In practical applications, the true distribution of returns is unknown. Consequently, we estimate the Omega ratio using historical observations of asset returns (Guastaroba et al., 2016). Given an in-sample window of TT observed trading periods, we denote by rj,tr_{j,t} the realized return of asset jj observed in period tt, with t=1,,Tt=1,\dots,T. Thus, rt=(r1,t,,rn,t)r_{t}=(r_{1,t},\dots,r_{n,t})^{\top} is the return vector for period tt, and the corresponding portfolio return is computed as rw,t=wrtr_{w,t}=w^{\top}r_{t}. Let rb,tr_{b,t} be the realized return of the benchmark, the estimation of the Omega ratio is given by:

Ω^w=t=1Tmax{rw,trb,t,0}t=1Tmax{rb,trw,t,0}.\widehat{\Omega}_{w}=\frac{\sum_{t=1}^{T}\max\{r_{w,t}-r_{b,t},0\}}{\sum_{t=1}^{T}\max\{r_{b,t}-r_{w,t},0\}}. (3.4)

This formulation corresponds to a discrete approximation of the metric given in Eqn. (3.1), where the numerator aggregates the gains and the denominator aggregates the losses. If the denominator is zero (a situation that occurs when all observed portfolio returns exceed the benchmark threshold), the ratio value is set to ++\infty. Notice that the empirical Omega ratio is non-convex and non-smooth in ww, which motivates the use of derivative-free optimization methods such as the swarm-based algorithm we propose.

4 Stock Screening Framework

The proposed procedure begins by identifying the securities in the reference market that will represent the investable universe. The process is structured into three components:

  1. (i)

    the MEREC method, which determines the relative importance of screening criteria;

  2. (ii)

    the TODIM asset ranking, which produces a dominance-based ordering of alternatives for long-only strategies;

  3. (iii)

    the TODIMSort classifier, which assigns assets to preference-ordered categories for long–short strategies.

These components jointly define the MEREC–TODIMSort architecture. Let NN denote the number of candidate assets and mm the number of screening criteria. Define the decision matrix X=(xij)N×mX=(x_{ij})\in\mathbb{R}^{N\times m}, where each row corresponds to an asset and each column corresponds to a criterion. Criteria are classified either as benefits, for which higher values indicate better performance, or as costs, for which lower values are preferable.

4.1 MEREC-Based Weighting of Screening Criteria

The MEREC weighting scheme quantifies the relative importance of the screening criteria. The computational steps of the procedure are summarized below.

  • 1.

    Positivity adjustment. Financial data may include zero or negative values, which are incompatible with the logarithmic ratios used in MEREC. Consequently, the decision matrix XX must contain strictly positive entries. To enforce positivity, we propose for each criterion jj to compute its minimum value xjmin=minixijx_{j}^{\min}=\min_{i}x_{ij} and apply

    xijxijxjmin+1,i=1,,N.x_{ij}\leftarrow x_{ij}-x_{j}^{\min}+1,\quad i=1,\dots,N. (4.1)
  • 2.

    Normalization. The translated matrix is normalized into Z=(zij)Z=(z_{ij}) as

    zij={xijmaxixij,if j is a benefit criterion,minixijxij,if j is a cost criterion.z_{ij}=\begin{cases}\dfrac{x_{ij}}{\max_{i}x_{ij}},&\text{if $j$ is a benefit criterion},\\[8.0pt] \dfrac{\min_{i}x_{ij}}{x_{ij}},&\text{if $j$ is a cost criterion}.\end{cases} (4.2)
  • 3.

    Aggregated performance. For each alternative ii, we compute

    Si=ln(1+1mj=1m|ln(zij)|).S_{i}=\ln\!\left(1+\frac{1}{m}\sum_{j=1}^{m}\big|\ln(z_{ij})\big|\right). (4.3)
  • 4.

    Performance after criterion removal. For each criterion jj, the performance of asset ii after removing criterion jj is defined as

    Si(j)=ln(1+1m1h=1hjm|ln(zih)|).S^{(-j)}_{i}=\ln\!\Bigg(1+\frac{1}{m-1}\sum_{\begin{subarray}{c}h=1\\ h\neq j\end{subarray}}^{m}\big|\ln(z_{ih})\big|\Bigg). (4.4)
  • 5.

    Removal effect and weights. The removal effect of criterion jj is

    Ej=i=1N|Si(j)Si|.E_{j}=\sum_{i=1}^{N}\big|S^{(-j)}_{i}-S_{i}\big|. (4.5)

    Finally, we normalize to obtain the MEREC weights

    πj=Ejh=1mEh,j=1,,m.\pi_{j}=\frac{E_{j}}{\sum_{h=1}^{m}E_{h}},\qquad j=1,\dots,m. (4.6)

Criteria whose removal causes larger changes in aggregated performance receive higher weights, reflecting their stronger discriminating capability.

4.2 Ranking Assets for Stock Screening via TODIM

In our context, the TODIM method is used to establish a dominance-based ranking of NN assets according to mm criteria. The procedure consists of the following steps:

  • 1.

    Normalization. The decision matrix XX is normalized on a [0,1][0,1] scale as:

    zi,j={xijminixijmaxixijminixij,if j is a benefit criterion,maxixijxijmaxixijminixi,j,if j is a cost criterion.z^{\prime}_{i,j}=\begin{cases}\dfrac{x_{ij}-\min_{i}x_{ij}}{\max_{i}x_{ij}-\min_{i}x_{ij}},&\text{if $j$ is a benefit criterion},\\[6.0pt] \dfrac{\max_{i}x_{ij}-x_{ij}}{\max_{i}x_{ij}-\min_{i}x_{i,j}},&\text{if $j$ is a cost criterion}.\end{cases} (4.7)
  • 2.

    Pairwise dominance. For each pair of assets (i,k)(i,k) and criterion jj, compute the comparison score:

    CSj(i,k)={πj(zijzk,j)η1,if zijzkj,ξπj(zkjzij)η2,if zij<zkjCS_{j}(i,k)=\begin{cases}\pi_{j}\,(z^{\prime}_{ij}-z^{\prime}_{k,j})^{\eta_{1}},&\text{if }z^{\prime}_{ij}\geq z^{\prime}_{kj},\\[6.0pt] -\xi\,\pi_{j}\,(z^{\prime}_{kj}-z^{\prime}_{ij})^{\eta_{2}},&\text{if }z^{\prime}_{ij}<z^{\prime}_{kj}\end{cases} (4.8)

    where πj\pi_{j} are the criterion weights obtained from MEREC, η1,η2>0\eta_{1},\eta_{2}>0 are sensitivity parameters for gains and losses, and ξ>1\xi>1 models loss aversion.

  • 3.

    Global dominance score. For each asset ii, aggregate the pairwise dominance contributions across all criteria and opponent assets:

    GSraw(i)=k=1Nj=1mCSj(i,k)GS_{\mathrm{raw}}(i)=\sum_{k=1}^{N}\sum_{j=1}^{m}CS_{j}(i,k) (4.9)

    where the self-comparison term k=ik=i is null by construction. Finally, apply a min–max normalization to obtain scores in [0,1][0,1]:

    GS(i)=GSraw(i)minGSraw()maxGSraw()minGSraw().GS(i)=\frac{GS_{\mathrm{raw}}(i)-\min_{\ell}GS_{\mathrm{raw}}(\ell)}{\max_{\ell}GS_{\mathrm{raw}}(\ell)-\min_{\ell}GS_{\mathrm{raw}}(\ell)}. (4.10)

The resulting scores enable a psychologically consistent ranking of assets, integrating performance and behavioral considerations into the selection process. This step provides a robust pre-screening phase that precedes the portfolio optimization problem (3.3).

4.3 Classifying Assets for Stock Screening via TODIMSort

Although the ranking produced by the TODIM method is well suited for long-only stock selection (Kaucic et al., 2025), the construction of long-short portfolios requires a classification scheme that assigns each asset to one of several categories according to ordered preference. To this end, we employ TODIMSort (Wang et al., 2023), an extension of TODIM designed to classify alternatives into predefined classes using limiting profiles as reference points.

Class structure

Let 𝒞={short,neutral,long}\mathcal{C}=\{\text{short},\text{neutral},\text{long}\} denote the ordered set of classes. Following the standard TODIMSort framework, the boundaries between adjacent classes are determined by the two limiting profiles LP1=(lp1,1,,lp1,m)LP_{1}=(lp_{1,1},\ldots,lp_{1,m}) and LP2=(lp2,1,,lp2,m),LP_{2}=(lp_{2,1},\ldots,lp_{2,m}), where LP1LP_{1} separates the short and neutral classes and LP2LP_{2} separates the neutral and long classes. Each limiting profile represents a hypothetical alternative that defines the minimum performance required to enter the corresponding higher-ranked class.

Dynamic, data-driven limiting profiles

A key novelty of our approach is that the limiting profiles are not specified solely on the basis of subjective preferences, as in traditional TODIMSort implementations, but are instead constructed by combining investor preferences with empirical information extracted from historical data. This yields limiting profiles that are both behaviorally meaningful and consistent with the prevailing market regime.

Given the market phase ϕ{bull,bear,sideways}\phi\in\{\text{bull},\text{bear},\text{sideways}\} prevailing at the evaluation time, we identify all historical evaluation dates whose market phase coincides with ϕ\phi. For each such date tt^{\prime}, let A(t)=(xij(t))N×mA^{(t^{\prime})}=(x_{ij}^{(t^{\prime})})\in\mathbb{R}^{N\times m} denote the corresponding decision matrix of criterion values. Collecting these matrices yields the set {A(t):phase(t)=ϕ}\{A^{(t^{\prime})}:\text{phase}(t^{\prime})=\phi\}. For each criterion j=1,,mj=1,\ldots,m, we construct the empirical sample 𝒳j={xij(t):t𝒦ϕ,i=1,,N}\mathcal{X}_{j}=\{\,x_{ij}^{(t^{\prime})}:t^{\prime}\in\mathcal{K}_{\phi},\;i=1,\ldots,N\,\}, where 𝒦ϕ\mathcal{K}_{\phi} denotes the set of historical evaluation dates with phase ϕ\phi. This sample represents the historical behavior of the criterion jj under the same market conditions as those that prevail at the current evaluation date.

The investor’s preference structure is then incorporated through the two quantile-based thresholds lp1,j=Qj(p1)lp_{1,j}=Q_{j}(p_{1}) and lp2,j=Qj(p2)lp_{2,j}=Q_{j}(p_{2}), where Qj()Q_{j}(\cdot) denotes the empirical quantile function associated with 𝒳j\mathcal{X}_{j}, and (p1,p2)(p_{1},p_{2}) are the investor-specified percentiles. For cost criteria, the two thresholds are swapped to preserve the correct preference ordering. This process yields two dynamic limiting profiles LP1LP_{1} and LP2LP_{2} that adapt to the prevailing market conditions while reflecting the investor’s preference structure.

Building on these definitions, the TODIMSort procedure unfolds in the following steps.

  • 1.

    Joint normalization. Let 𝒜\mathcal{A} denote the set of assets and 𝒫={LP1,LP2}\mathcal{P}=\{LP_{1},LP_{2}\} the set of limiting profiles. Form the combined set 𝒰=𝒜𝒫\mathcal{U}=\mathcal{A}\cup\mathcal{P} and normalize all criteria jointly to the interval [0,1][0,1]:

    zαj′′={xαjminγ𝒰xγjmaxγ𝒰xγjminγ𝒰xγj,if j is a benefit criterion,maxγ𝒰xγjxαjmaxγ𝒰xγjminγ𝒰xγj,if j is a cost criterion.z^{\prime\prime}_{\alpha j}=\begin{cases}\dfrac{x_{\alpha j}-\min_{\gamma\in\mathcal{U}}x_{\gamma j}}{\max_{\gamma\in\mathcal{U}}x_{\gamma j}-\min_{\gamma\in\mathcal{U}}x_{\gamma j}},&\text{if $j$ is a benefit criterion},\\[10.0pt] \dfrac{\max_{\gamma\in\mathcal{U}}x_{\gamma j}-x_{\alpha j}}{\max_{\gamma\in\mathcal{U}}x_{\gamma j}-\min_{\gamma\in\mathcal{U}}x_{\gamma j}},&\text{if $j$ is a cost criterion}.\end{cases} (4.11)
  • 2.

    Local dominance. For any pair (α,β)𝒰×𝒰(\alpha,\beta)\in\mathcal{U}\times\mathcal{U} and any criterion jj, compute the local dominance contribution:

    Lj(α,β)={πj(zαj′′zβj′′)η1,if zαj′′zβj′′,ξπj(zβj′′zαj′′)η2,if zαj′′<zβj′′L_{j}(\alpha,\beta)=\begin{cases}\pi_{j}\,(z^{\prime\prime}_{\alpha j}-z^{\prime\prime}_{\beta j})^{\eta_{1}},&\text{if }z^{\prime\prime}_{\alpha j}\geq z^{\prime\prime}_{\beta j},\\[6.0pt] -\,\xi\,\pi_{j}\,(z^{\prime\prime}_{\beta j}-z^{\prime\prime}_{\alpha j})^{\eta_{2}},&\text{if }z^{\prime\prime}_{\alpha j}<z^{\prime\prime}_{\beta j}\end{cases} (4.12)

    where πj\pi_{j} are the MEREC weights and (η1,η2,ξ)(\eta_{1},\eta_{2},\xi) are the TODIM sensitivity and loss-aversion parameters.

  • 3.

    Global dominance and scaling. For each object α𝒰\alpha\in\mathcal{U}, aggregate the local dominances across all opponents and all criteria:

    Δ(α)=β𝒰j=1mLj(α,β).\Delta(\alpha)=\sum_{\beta\in\mathcal{U}}\sum_{j=1}^{m}L_{j}(\alpha,\beta). (4.13)

    Then rescale dominance scores jointly over assets and profiles:

    s(α)=Δ(α)minγ𝒰Δ(γ)maxγ𝒰Δ(γ)minγ𝒰Δ(γ).s(\alpha)=\frac{\Delta(\alpha)-\min_{\gamma\in\mathcal{U}}\Delta(\gamma)}{\max_{\gamma\in\mathcal{U}}\Delta(\gamma)-\min_{\gamma\in\mathcal{U}}\Delta(\gamma)}. (4.14)
  • 4.

    Class assignment. Let l1=s(LP1)l_{1}=s(LP_{1}) and l2=s(LP2)l_{2}=s(LP_{2}), with 0l1l210\leq l_{1}\leq l_{2}\leq 1. Each asset i𝒜i\in\mathcal{A} is assigned to a class according to:

    s(i)<l1ishort,l1s(i)<l2ineutral,s(i)l2ilong.s(i)<l_{1}\ \Rightarrow\ i\in\text{short},\qquad l_{1}\leq s(i)<l_{2}\ \Rightarrow\ i\in\text{neutral},\qquad s(i)\geq l_{2}\ \Rightarrow\ i\in\text{long}.

This procedure yields an endogenous partition (Ishort,Ineutral,Ilong)(I_{\text{short}},I_{\text{neutral}},I_{\text{long}}) whose cardinalities are entirely determined by the empirical dominance structure induced by the data. The resulting segmentation provides a preference-consistent and market-coherent investable universe, which serves as the initialization for the long–short portfolio optimization problem (3.2).

Remark 4.1

The normalization adopted for MEREC follows the original protocol of the method, which requires strictly positive values due to the logarithmic aggregation of performance scores. By contrast, the TODIMSort procedure employs min–max normalization, which is more suitable for comparing alternatives and limiting profiles in a psychological ordering framework, and is fully consistent with the [0,1][0,1] scale required by the method. The use of different normalization schemes is, therefore, justified by the distinct nature and objectives of the two approaches.

4.4 Definition of Screening Criteria

The MEREC–TODIMSort architecture requires a structured set of quantitative criteria to build the decision matrices used in the weighting, dominance, and classification stages. These criteria integrate both structural features of assets and information that depends on the prevailing market environment, and are organized into two groups: phase‑independent indicators and phase‑dependent indicators.

4.4.1 Market–Phase Identification

A central component of this construction is the identification of the market phase, which determines when regime‑sensitive criteria are activated and governs the definition of the dynamic limiting profiles used by TODIMSort. To detect regime changes, we rely on a combination of standard technical indicators widely used in quantitative asset management. Their analytical definitions are reported in the Supplementary Material. Let {pb,t}t=1T\{p_{b,t}\}_{t=1}^{T} denote the sequence of closing prices of the benchmark. The market phase at time tt is determined by combining three elementary signals: trend direction, short–term momentum, and trend strength.

(i) Trend direction

Define a short–term and a long–term moving average

SMAt(S)=SMA({pb,t^}t^=1t,20),SMAt(L)=SMA({pb,t^}t^=1t,100)\mathrm{SMA}^{(S)}_{t}=\mathrm{SMA}(\{p_{b,\hat{t}}\}_{\hat{t}=1}^{t},20),\qquad\mathrm{SMA}^{(L)}_{t}=\mathrm{SMA}(\{p_{b,\hat{t}}\}_{\hat{t}=1}^{t},100)

and let Δt=SMAt(S)SMAt(L)\Delta_{t}=\mathrm{SMA}^{(S)}_{t}-\mathrm{SMA}^{(L)}_{t} be their difference. To avoid spurious signals, we introduce a dynamic tolerance threshold proportional to the long–term level, θt=0.005SMAt(L)\theta_{t}=0.005\,\mathrm{SMA}^{(L)}_{t}. A positive (resp. negative) deviation significantly above (resp. below) this threshold signals the presence of an upward (resp. downward) trend.

(ii) Short–term momentum

Recent price momentum is captured by the 2020–day return rb,t(20)=(pb,tpb,t201)r^{(20)}_{b,t}=(\frac{p_{b,t}}{p_{b,t-20}}-1), which reflects the instantaneous direction of the most recent movements.

(iii) Trend strength

The Average Directional Index (ADX) is computed over a standard 1414–day horizon and denoted by ADXt\mathrm{ADX}_{t}. A threshold of 2020 is commonly adopted in practice to distinguish trending markets from low–directionality environments. Then, the condition ADXt>20\mathrm{ADX}_{t}>20 indicates the presence of a trend.

Combining the above conditions, the market phase is classified as

Phase(t)={bull,if Δt>θt,rb,t(20)>0,ADXt>20,bear,if Δt<θt,rb,t(20)<0,ADXt>20,sideways,otherwise.\mathrm{Phase}(t)=\begin{cases}\text{bull},&\text{if }\Delta_{t}>\theta_{t},\;r^{(20)}_{b,t}>0,\;\mathrm{ADX}_{t}>20,\\ \text{bear},&\text{if }\Delta_{t}<-\theta_{t},\;r^{(20)}_{b,t}<0,\;\mathrm{ADX}_{t}>20,\\ \text{sideways},&\text{otherwise.}\end{cases}

The moving–average spread Δt\Delta_{t} captures the medium–term direction of the market, smoothing out high–frequency noise. The short–term return rb,t(20)r^{(20)}_{b,t} incorporates the most recent price dynamics, ensuring that the detected trend is not an artifact of past information. The ADX term provides a measure of directional strength: markets may exhibit positive or negative drift without being in a genuine trending state, and ADXt\mathrm{ADX}_{t} filters out these situations. The combined rule thus balances robustness and responsiveness.

Although not directly used in the classification rule, we also compute a volatility indicator based on the relative magnitude of short‑ and long‑term benchmark return variability. Specifically, σt(20)=std(rt19,,rt)\sigma^{(20)}_{t}=\operatorname{std}\!\left(r_{t-19},\dots,r_{t}\right) and σt(all)=std(r1,,rt)\sigma^{(\mathrm{all})}_{t}=\operatorname{std}\!\left(r_{1},\dots,r_{t}\right) denote the 20‑day realized volatility of the benchmark and its long‑horizon (3‑year) realized volatility, respectively. Then, we calculate the volatility ratio as ϱt=σt(20)/σt(all)\varrho_{t}=\nicefrac{{\sigma^{(20)}_{t}}}{{\sigma^{(\mathrm{all})}_{t}}}. Finally, we classify the volatility regime as high if ϱt>1.2\varrho_{t}>1.2, as low if ϱt<0.8\varrho_{t}<0.8, and as normal otherwise. The volatility classification complements the market‑phase signal and plays a direct role in defining the regime‑concordant technical score introduced in the sequel.

4.4.2 Phase–Independent Criteria

Let {ri,t}\{r_{i,t}\} denote the return series of asset ii and {rb,t}\{r_{b,t}\} the benchmark returns. Phase–independent criteria point out structural properties of assets that are informative across market regimes.

Momentum. Medium–term momentum measures recent compounding performance and is a robust predictor of cross–sectional returns (Daniel and Moskowitz, 2016). Over a lookback window of length HmomH_{\text{mom}}, we define

MOMi,t=h=0Hmom1(1+ri,th)1.\mathrm{MOM}_{i,t}=\prod_{h=0}^{H_{\text{mom}}-1}\!\bigl(1+r_{i,t-h}\bigr)-1.

Assets with higher momentum values are considered more attractive.

Upside–to–downside beta ratio. It measures the asymmetric responsiveness of asset ii to positive and negative benchmark movements (Fisher and D’Alessandro, 2021). Let τ\tau be a state threshold. Over a lookback window of length HudrH_{\text{udr}}, we define the state partitions 𝒯+={u:rb,u>τ}\mathcal{T}^{+}=\{u:\,r_{b,u}>\tau\} and 𝒯={u:rb,u<τ}\mathcal{T}^{-}=\{u:\,r_{b,u}<\tau\}. For each asset ii, compute the conditional betas

βi+=Cov(ri,u,rb,uu𝒯+)Var(rb,uu𝒯+),βi=Cov(ri,u,rb,uu𝒯)Var(rb,uu𝒯)\beta^{+}_{i}=\frac{\mathrm{Cov}\!\bigl(r_{i,u},r_{b,u}\mid u\in\mathcal{T}^{+}\bigr)}{\mathrm{Var}(r_{b,u}\mid u\in\mathcal{T}^{+})},\qquad\beta^{-}_{i}=\frac{\mathrm{Cov}\!\bigl(r_{i,u},r_{b,u}\mid u\in\mathcal{T}^{-}\bigr)}{\mathrm{Var}(r_{b,u}\mid u\in\mathcal{T}^{-})}

and the asymmetry ratio UDRi,tτ=βi+βi.\mathrm{UDR}_{i,t}^{\tau}=\frac{\beta^{+}_{i}}{\beta^{-}_{i}}. Higher values of UDR indicate that the asset tends to load more strongly on favorable benchmark states than on unfavorable ones, reflecting a desirable form of asymmetry in risk–return behavior.

Drawdown–recovery metrics. Let {pi,u}\{p_{i,u}\} denote the price series of asset ii over a lookback window of length HddH_{\text{dd}}, and define the running peak and drawdown processes as

Pi,umax=maxkupi,k,DDi,u=1pi,uPi,umax.P^{\max}_{i,u}=\max_{k\leq u}p_{i,k},\qquad\mathrm{DD}_{i,u}=1-\frac{p_{i,u}}{P^{\max}_{i,u}}.

In this study, we employ two recovery–based downside–risk indicators (Chekhlov et al., 2005): recovery time and recovery factor. A drawdown is initiated at index uu whenever pi,u<Pi,umaxp_{i,u}<P^{\max}_{i,u}. For such a drawdown, its recovery time is defined as m(u)=min{m>u:pi,mPi,umax}m(u)=\min\{m>u:p_{i,m}\geq P^{\max}_{i,u}\}, i.e., the first time the price returns to its previous peak. Let 𝒰\mathcal{U} denote the set of drawdown onset points that recover within the window. Average recovery time is then

RTi,t=1|𝒰|u𝒰(m(u)u).\mathrm{RT}_{i,t}=\frac{1}{|\mathcal{U}|}\sum_{u\in\mathcal{U}}\bigl(m(u)-u\bigr).

Lower values of RTi,t\mathrm{RT}_{i,t} indicate faster rebounds following adverse movements and are therefore preferred.

Let MaxDDi,t=max1uHddDDi,u\mathrm{MaxDD}_{i,t}=\max_{1\leq u\leq H_{\text{dd}}}\mathrm{DD}_{i,u} denote the maximum drawdown over the same window, and define the cumulative return as CRi,t=u=1Hdd(1+ri,u)1\mathrm{CR}_{i,t}=\prod_{u=1}^{H_{\text{dd}}}\left(1+r_{i,u}\right)-1. The recovery factor is then given by

RFi,t=CRi,tMaxDDi,t.\mathrm{RF}_{i,t}=\frac{\mathrm{CR}_{i,t}}{\mathrm{MaxDD}_{i,t}}.

Higher values of RFi,t\mathrm{RF}_{i,t} correspond to more efficient compensation of losses and are therefore preferred.

ESG–based indicators. Environmental, Social, and Governance assessments provide complementary information on firms’ non‑financial performance and long‑term sustainability. Let {ESGi,u}\{\mathrm{ESG}_{i,u}\} denote the monthly ESG score series of asset ii. Given a lookback parameter HesgH_{\mathrm{esg}}, we consider two classes of indicators (Nagy et al., 2016): ESG momentum and ESG volatility. The ESG momentum of asset ii over a horizon of length HesgH_{\mathrm{esg}} is defined as ESG-MOMi,tHesg=ESGi,tESGi,tHesg\mathrm{ESG\text{-}MOM}_{i,t}^{H_{\mathrm{esg}}}=\mathrm{ESG}_{i,t}-\mathrm{ESG}_{i,t-H_{\mathrm{esg}}}. This indicator captures the direction and magnitude of recent changes in a firm’s sustainability profile. Higher values of ESG-MOMi,t\mathrm{ESG\text{-}MOM}_{i,t} reflect improvements in ESG performance and are therefore preferred. ESG volatility measures the stability of a firm’s sustainability performance over the same horizon, and is defined as

ESG-VOLi,tHesg=1Hesgh=0Hesg1(ESGi,thESG¯i,t)2\mathrm{ESG\text{-}VOL}_{i,t}^{H_{\mathrm{esg}}}=\sqrt{\frac{1}{H_{\mathrm{esg}}}\sum_{h=0}^{H_{\mathrm{esg}}-1}\bigl(\mathrm{ESG}_{i,t-h}-\overline{\mathrm{ESG}}_{i,t}\bigr)^{2}}

where ESG¯i,t=1Hesgh=0Hesg1ESGi,th\overline{\mathrm{ESG}}_{i,t}=\frac{1}{H_{\mathrm{esg}}}\sum_{h=0}^{H_{\mathrm{esg}}-1}\mathrm{ESG}_{i,t-h}. Lower values of ESG-VOLi,t\mathrm{ESG\text{-}VOL}_{i,t} indicate a more stable ESG behavior and are therefore preferred.

4.4.3 Phase–Dependent Criteria

These indicators adapt the screening to the prevailing regime ϕ{bull,bear,sideways}\phi\in\{\text{bull},\text{bear},\text{sideways}\} detected by the market–phase identifier. They modulate diversification and tracking behaviour in a manner consistent with current conditions.

Normalized Mutual Information with the benchmark. Let {ri,u}\{r_{i,u}\} and {rb,u}\{r_{b,u}\} denote, respectively, the daily return series of asset ii and of the benchmark over a lookback window of length HnmiH_{\text{nmi}}. To quantify the degree of dependence between each asset and the market, we compute the normalized mutual information (NMI) between the return series of the asset and the benchmark:

NMIi,t=2MI(ri;rb)HSh(ri)+HSh(rb)[0,1]\mathrm{NMI}_{i,t}=\frac{2\,MI(r_{i};r_{b})}{H^{\text{Sh}}(r_{i})+H^{\text{Sh}}(r_{b})}\in[0,1]

where MI(,)MI(\cdot,\cdot) denotes mutual information and HSh()H^{\text{Sh}}(\cdot) Shannon entropy (their definitions are provided in the Supplementary Material). This formulation symmetrically normalizes the dependence measure, ensuring comparability across assets and preventing scale distortions due to heterogeneous return distributions. For each asset, NMIi,t\mathrm{NMI}_{i,t} is estimated non-parametrically using the procedure described in Kraskov et al. (2004). To incorporate regime–specific preferences into the NMI measure, we transform the score according to the prevailing phase:

NMI~i,t={NMIi,t,ϕt{bull,bear},1|NMIi,t0.5|,ϕt=sideways.\widetilde{\mathrm{NMI}}_{i,t}=\begin{cases}\mathrm{NMI}_{i,t},&\phi_{t}\in\{\text{bull},\text{bear}\},\\[4.0pt] 1-\lvert\mathrm{NMI}_{i,t}-0.5\rvert,&\phi_{t}=\text{sideways}.\end{cases}

In bull markets, strong co-movement with the benchmark is desirable, so NMI is used as a benefit criterion. In bear markets, high dependence makes the asset more exposed to market downturns; therefore, NMI is marked as a cost criterion. In sideways regimes, neither very high nor very low co-movement is optimal, as range–bound environments reward moderately correlated behavior. Transformation 1|NMIi,t0.5|1-\lvert\mathrm{NMI}_{i,t}-0.5\rvert achieves this by assigning the highest score to values near 0.50.5 and penalizing extreme co-movement. The transformed quantity is then treated as a benefit criterion.

Regime‑concordant technical signals. We construct for each asset a composite regime–concordant technical score Si,ttechS^{\mathrm{tech}}_{i,t}, designed to summarize short– and medium–term price information in a manner that reflects the prevailing market phase ϕt{bull,bear,sideways}\phi_{t}\in\{\text{bull},\text{bear},\text{sideways}\} and its associated volatility condition υt{low,normal,high}\upsilon_{t}\in\{\text{low},\text{normal},\text{high}\}. The score is obtained by aggregating the outputs of phase–specific signal functions applied to asset price series {pi,u}\{p_{i,u}\} over a lookback window of HtechH_{\text{tech}} days, each relying on a set of standard technical indicators whose operational definitions are reported in the Supplementary Material. For each asset ii, the signaling rule returns a raw indicator score sci,t\mathrm{sc}_{i,t}, which we subsequently transform into a continuous measure Si,ttech[1,1]S^{\mathrm{tech}}_{i,t}\in[-1,1] suitable for integration within the MEREC–TODIMSort machinery. In trending regimes (ϕt{bull,bear}\phi_{t}\in\{\text{bull},\text{bear}\}), each asset is evaluated through three elementary components: (i) trend strength via ADX, (ii) directional momentum via MACD, and (iii) a momentum–confirmation filter based on RSI. Let ADXi,t\mathrm{ADX}_{i,t}, MACDi,t\mathrm{MACD}_{i,t}, and RSIi,t\mathrm{RSI}_{i,t} denote the corresponding indicator values. The score sci,t{0,1,2,3}\mathrm{sc}_{i,t}\in\{0,1,2,3\} counts how many of the following conditions are satisfied:

  1. 1.

    ADXi,t>20\mathrm{ADX}_{i,t}>20 (trend strength);

  2. 2.

    MACDi,t>0\mathrm{MACD}_{i,t}>0 in bull markets, MACDi,t<0\mathrm{MACD}_{i,t}<0 in bear markets (directional momentum);

  3. 3.

    RSIi,t[30,70]\mathrm{RSI}_{i,t}\in[30,70] (momentum confirmation).

To obtain a unified directional scale, the raw score is mapped to [1,1][-1,1] as

Si,ttech={sci,t1.51.5,ϕt=bull,sci,t1.51.5,ϕt=bearS^{\mathrm{tech}}_{i,t}=\begin{cases}\displaystyle\frac{\mathrm{sc}_{i,t}-1.5}{1.5},&\phi_{t}=\text{bull},\\[6.0pt] \displaystyle-\frac{\mathrm{sc}_{i,t}-1.5}{1.5},&\phi_{t}=\text{bear}\end{cases}

so that higher values always favor long positions and lower values favor short positions, independently of the trend direction.

When the market is sideways and volatility is high (ϕt=sideways,υt=high\phi_{t}=\text{sideways},\ \upsilon_{t}=\text{high}), the scoring rule targets range–bound dynamics with large excursions, using Bollinger–band signals, the Commodity Channel Index (CCI), and RSI. Let BBsigi,t{1,0,1}\mathrm{BBsig}_{i,t}\in\{-1,0,1\}, CCIi,t\mathrm{CCI}_{i,t}, and RSIi,t\mathrm{RSI}_{i,t} represent the signals obtained from these indicators. We compute two partial scores, buySci,t\mathrm{buySc}_{i,t} and sellSci,t\mathrm{sellSc}_{i,t}, by counting the number of indicators simultaneously supporting oversold or overbought conditions, respectively, using the thresholds CCI[100,100]\mathrm{CCI}\in[-100,100] and RSI[30,70]\mathrm{RSI}\in[30,70]. The continuous technical score is then defined as

Si,ttech=buySci,tsellSci,t3,Si,ttech[1,1].S^{\mathrm{tech}}_{i,t}=\frac{\mathrm{buySc}_{i,t}-\mathrm{sellSc}_{i,t}}{3},\qquad S^{\mathrm{tech}}_{i,t}\in[-1,1].

When the market is sideways with low or normal volatility (ϕt=sideways,υt{low,normal}\phi_{t}=\text{sideways},\ \upsilon_{t}\in\{\text{low},\text{normal}\}), we adopt a finer range–trading logic based on the RSI, the Stochastic Oscillator (KKDD crossing), and the CCI. Let RSIi,t\mathrm{RSI}_{i,t}, Ki,tK_{i,t}, Di,tD_{i,t}, CCIi,t\mathrm{CCI}_{i,t} be the corresponding signal values, buy–oriented evidence is recorded when RSIi,t<30\mathrm{RSI}_{i,t}<30, Ki,t>Di,tK_{i,t}>D_{i,t}, and CCIi,t<100\mathrm{CCI}_{i,t}<-100; sell–oriented evidence when RSIi,t>70\mathrm{RSI}_{i,t}>70, Ki,t<Di,tK_{i,t}<D_{i,t}, and CCIi,t>100\mathrm{CCI}_{i,t}>100. Letting buySci,t\mathrm{buySc}_{i,t} and sellSci,t\mathrm{sellSc}_{i,t} denote the associated counts, the continuous regime–concordant score is

Si,ttech=buySci,tsellSci,t3[1,1].S^{\mathrm{tech}}_{i,t}=\frac{\mathrm{buySc}_{i,t}-\mathrm{sellSc}_{i,t}}{3}\in[-1,1].

5 Optimization Framework

We now introduce the adaptive multi-operator particle swarm optimization algorithm with projection-based repair mechanism (shortly, AMPSO), designed to solve the proposed portfolio optimization problems. The algorithm integrates three components:

  • (i)

    a particle swarm optimization with time-varying acceleration coefficients (PSO-TVAC) serving as the core search engine;

  • (ii)

    an adaptive operator selection strategy (AOS) that dynamically combines PSO dynamics with a set of variation operators based on performance feedback;

  • (iii)

    a constraint-repair technique that ensures feasibility with portfolio constraints.

Although our portfolio optimization problems are formulated as maximization of the Omega ratio, in practice we solve equivalent minimization problems by minimizing the function f(w)=Ω(Rw;Rb)f(w)=-\Omega(R_{w};R_{b}) while keeping the feasible regions unchanged. This transformation is standard in the metaheuristic optimization literature, where solvers are typically designed to minimize objective functions.

5.1 PSO Algorithm with Time-Varying Acceleration Coefficients

Particle swarm optimization is a population-based stochastic search method inspired by social learning and collective behavior. In its canonical form, PSO maintains a swarm of particles, each representing a candidate solution in an nn-dimensional search space. At iteration g=0,1,,gmaxg=0,1,\dots,g_{\max}, particle jj has position yj(g)ny_{j}(g)\in\mathbb{R}^{n} and velocity vj(g)nv_{j}(g)\in\mathbb{R}^{n}. The update mechanism consists of two coupled equations:

vj(g+1)\displaystyle v_{j}(g+1) =ω(g)vj(g)+c1(g)r1(g)(yj(g)yj(g))+c2(g)r2(g)(y^(g)yj(g))\displaystyle=\omega(g)\,v_{j}(g)+c_{1}(g)\,r_{1}(g)\odot\big(y_{j}^{\star}(g)-y_{j}(g)\big)+c_{2}(g)\,r_{2}(g)\odot\big(\hat{y}(g)-y_{j}(g)\big) (5.1)
yj(g+1)\displaystyle y_{j}(g+1) =yj(g)+vj(g+1)\displaystyle=y_{j}(g)+v_{j}(g+1) (5.2)

where \odot denotes the element-wise product, r1(g),r2(g)U(0,1)nr_{1}(g),r_{2}(g)\sim U(0,1)^{n} are random vectors, with each component independently drawn from the uniform distribution on [0,1][0,1], and yj(g)y_{j}^{\star}(g) and y^(g)\hat{y}(g) represent the best positions found by particle jj and by the entire swarm, respectively.

The inertia weight ω(g)\omega(g) regulates the trade-off between global exploration and local exploitation by scaling the contribution of the previous velocity. Large values of ω(g)\omega(g) favor exploration, while small values promote convergence. Similarly, the acceleration coefficients c1(g)c_{1}(g) and c2(g)c_{2}(g) control the cognitive and social components, weighting the attraction towards the particle’s own best position and the global best position, respectively.

To achieve a dynamic balance between exploration and exploitation, the PSO-TVAC algorithm employs linearly adaptive schedules (see Ratnaweera et al. (2004), and Deng et al. (2012) for an application in the field of portfolio selection):

ω(g)\displaystyle\omega(g) =ωmaxωmaxωmingmaxg,\displaystyle=\omega_{\max}-\frac{\omega_{\max}-\omega_{\min}}{g_{\max}}\,g, (5.3)
c1(g)\displaystyle c_{1}(g) =c1,maxc1,maxc1,mingmaxg,\displaystyle=c_{1,\max}-\frac{c_{1,\max}-c_{1,\min}}{g_{\max}}\,g, (5.4)
c2(g)\displaystyle c_{2}(g) =c2,min+c2,maxc2,mingmaxg\displaystyle=c_{2,\min}+\frac{c_{2,\max}-c_{2,\min}}{g_{\max}}\,g (5.5)

where ωmax>ωmin\omega_{\max}>\omega_{\min}, c1,max>c1,minc_{1,\max}>c_{1,\min}, and c2,max>c2,minc_{2,\max}>c_{2,\min}.

In our version of the algorithm, to prevent divergence and maintain stability, velocities are clamped component-wise vj(g+1)min{max(vj(g+1),vmin),vmax}v_{j}(g+1)\leftarrow\min\big\{\max\big(v_{j}(g+1),v_{\min}\big),v_{\max}\big\}, where vminv_{\min} and vmaxv_{\max} denote the component-wise lower and upper velocity bounds (Shi and Eberhart, 1998).

The PSO-TVAC procedure can thus be summarized as follows:

  1. 1.

    Initialize particle positions uniformly within the search bounds and set initial velocities to zero.

  2. 2.

    Evaluate the objective function for all particles; initialize yjy_{j}^{\star} and y^\hat{y}.

  3. 3.

    For g=1,,gmaxg=1,\dots,g_{\max}:

    1. (a)

      Update ω(g)\omega(g), c1(g)c_{1}(g), and c2(g)c_{2}(g) according to the TVAC schedules (5.3)–(5.5).

    2. (b)

      Compute new velocities using Eq. (5.1) and apply velocity clamping.

    3. (c)

      Update positions using Eq. (5.2).

    4. (d)

      Evaluate the objective function and update yjy_{j}^{\star} and y^\hat{y}.

  4. 4.

    Return y^\hat{y} as the best solution found.

The adaptation in time of ω\omega, c1c_{1}, and c2c_{2} promotes global search during early iterations and accelerates convergence in later stages, mitigating the risk of getting trapped in local optima and improving the quality of solutions in high-dimensional, non‑convex landscapes, such as long–short portfolio optimization.

5.2 Adaptive Operator Selection Strategy

The proposed metaheuristic employs an adaptive operator selection (AOS) mechanism to regulate the balance between exploration and exploitation. AOS operates on a heterogeneous set of variation operators, including real-coded crossovers, multi-parent recombination schemes, discrete operators, and mutation mechanisms. Detailed definitions and parameter settings for all operators are reported in the Supplementary Material. At each generation, AOS evaluates the relative performance of the available operators and probabilistically selects the one expected to offer the best trade-off between solution quality and population diversity. The controller follows a four-stage reinforcement-learning scheme consisting of criteria aggregation, reward computation, credit assignment, and operator selection.

Criteria aggregation

At generation gg, a subset of nparn_{\mathrm{par}} particles is sampled and each operator in 𝒪={o1,,onop}\mathcal{O}=\{o_{1},\ldots,o_{n_{\mathrm{op}}}\} is applied to this subset, generating trial offspring. For operator oko_{k}, we record Qg(k)Q_{g}^{(k)} (i.e., the mean objective value of the offspring) and Dg(k)D_{g}^{(k)} (i.e., the mean Euclidean dispersion of the offspring), where dispersion is measured as the average distance from the population centroid.111Recall that throughout this section all performance statistics are computed with respect to the minimization objective f(w)=Ω(Rw;Rb)f(w)=-\Omega(R_{w};R_{b}). To reduce short-term noise, improvements are aggregated over a sliding window of length TwT_{\text{w}}:

ΔQ(k)(g)=1Tw1t=gTw+1g1(Qt+1(k)Qt(k)),ΔD(k)(g)=1Tw1t=gTw+1g1(Dt+1(k)Dt(k)).\Delta_{Q}^{(k)}(g)=\frac{1}{T_{\text{w}}-1}\sum_{t=g-T_{\text{w}}+1}^{g-1}\bigl(Q_{t+1}^{(k)}-Q_{t}^{(k)}\bigr),\qquad\Delta_{D}^{(k)}(g)=\frac{1}{T_{\text{w}}-1}\sum_{t=g-T_{\text{w}}+1}^{g-1}\bigl(D_{t+1}^{(k)}-D_{t}^{(k)}\bigr).
Reward computation

Rewards are computed using the compass method, which evaluates the alignment of the vector (ΔQ(k),ΔD(k))\left(\Delta_{Q}^{(k)},\Delta_{D}^{(k)}\right) with a target search direction parameterized by an angle φg[0,π/2]\varphi_{g}\in[0,\pi/2] (Filograsso and di Tollo, 2023). The controller alternates exploration and exploitation through the schedule

φg={π2,g0.2gmax,0,0.2gmax<g0.4gmax,π2,0.4gmax<g0.6gmax,0,0.6gmax<g0.8gmax,π2,g>0.8gmax.\varphi_{g}=\begin{cases}\frac{\pi}{2},&g\leq 0.2\,g_{\max},\\ 0,&0.2\,g_{\max}<g\leq 0.4\,g_{\max},\\ \frac{\pi}{2},&0.4\,g_{\max}<g\leq 0.6\,g_{\max},\\ 0,&0.6\,g_{\max}<g\leq 0.8\,g_{\max},\\ \frac{\pi}{2},&g>0.8\,g_{\max}.\end{cases} (5.6)

Let m=tan(φg)m=\tan(\varphi_{g}) be the slope of the target direction in the (ΔQ,ΔD)(\Delta_{Q},\Delta_{D}) plane. For operator oko_{k} we compute d=(ΔQ(k))2+(ΔD(k))2d=\sqrt{\bigl(\Delta_{Q}^{(k)}\bigr)^{2}+\bigl(\Delta_{D}^{(k)}\bigr)^{2}}, and d=|ΔD(k)mΔQ(k)|1+m2d_{\perp}=\frac{\bigl|\Delta_{D}^{(k)}-m\,\Delta_{Q}^{(k)}\bigr|}{\sqrt{1+m^{2}}}. Then, the quantity reward(k){reward}^{(k)} is computed as d2d2\sqrt{d^{2}-d_{\perp}^{2}}, which corresponds to the projection of (ΔQ(k),ΔD(k))\left(\Delta_{Q}^{(k)},\Delta_{D}^{(k)}\right) onto the target direction.

Credit assignment

Credits are obtained by averaging rewards over the sliding window

ck=1Twt=gTw+1greward(k)(t).c_{k}=\frac{1}{T_{\text{w}}}\sum_{t=g-T_{\text{w}}+1}^{g}{reward}^{(k)}(t).
Operator selection

The next operator is selected by probability matching:

pk=pmin+(1noppmin)ckj=1nopcj,p_{k}=p_{\min}+(1-n_{\mathrm{op}}\,p_{\min})\frac{c_{k}}{\sum_{j=1}^{n_{\mathrm{op}}}c_{j}}\,,

where pmin>0p_{\min}>0 ensures non-zero probability for all operators. The operator corresponding to the highest pkp_{k} is applied to each particle with probability pappp_{\mathrm{app}}. This mechanism enables the algorithm to favor operators that consistently improve quality when φg=π/2\varphi_{g}=\pi/2 and those that increase diversity when φg=0\varphi_{g}=0, thus maintaining a robust balance between exploration and exploitation throughout the search.

5.3 Constraint Handling via Projection Methods

The portfolio optimization problems introduced in Section 3 involve equality and bound constraints that define a closed and convex feasible region. To guarantee that every particle generated by variation operators remains feasible, we adopt a projection-based repair strategy that maps any infeasible vector to its orthogonal projection onto the constraint set. This approach preserves feasibility without altering the objective function and is well suited to linear budget constraints coupled with box bounds.

5.3.1 Projection for the long-short model

After classifying the assets into the two disjoint index sets IlongI_{\text{long}} and IshortI_{\text{short}}, the feasible region of the long–short problem (3.2) is

S={wn:iIlongwi=1+s, 0wiwiuiIlong,iIshortwi=s,wiwi0iIshort}.S=\biggl\{w\in\mathbb{R}^{n}:\sum_{i\in I_{\text{long}}}w_{i}=1+s,\;0\leq w_{i}\leq w_{i}^{u}\;\forall i\in I_{\text{long}},\;\sum_{i\in I_{\text{short}}}w_{i}=-s,\;w_{i}^{\ell}\leq w_{i}\leq 0\;\forall i\in I_{\text{short}}\biggr\}.

Since IlongIshort=I_{\text{long}}\cap I_{\text{short}}=\emptyset and the Euclidean norm is separable, the projection of a vector ewne_{w}\in\mathbb{R}^{n} onto SS decomposes into two independent subproblems:

PS(ew)=(PSL(ew,(L)),PSS(ew,(S)))\mathrm{PS}(e_{w})=\bigl(\mathrm{PS}_{L}(e_{w,(L)}),\;\mathrm{PS}_{S}(e_{w,(S)})\bigr)

where ew,(L)=(ew,i)iIlonge_{w,(L)}=(e_{w,i})_{i\in I_{\text{long}}} and ew,(S)=(ew,i)iIshorte_{w,(S)}=(e_{w,i})_{i\in I_{\text{short}}}.

Long block

The projection for the long block solves

minunlong12uew,(L)22s.t.iIlongui=1+s, 0uiwiu.\min_{u\in\mathbb{R}^{n_{\text{long}}}}\frac{1}{2}\|u-e_{w,(L)}\|_{2}^{2}\quad\text{s.t.}\quad\sum_{i\in I_{\text{long}}}u_{i}=1+s,\;0\leq u_{i}\leq w_{i}^{u}.

By the Karush–Kuhn–Tucker conditions, the optimal solution has the form

ui=[ew,iλ] 0wiu,iIlongu_{i}=\bigl[e_{w,i}-\lambda\bigr]^{\,w_{i}^{u}}_{\,0},\qquad i\in I_{\text{long}} (5.7)

where [θ]ab=min{max{θ,a},b}[\theta]^{b}_{a}=\min\{\max\{\theta,a\},b\} denotes clipping, and λ\lambda is the unique scalar satisfying iIlong[ew,iλ] 0wiu=1+s.\sum_{i\in I_{\text{long}}}\bigl[e_{w,i}-\lambda\bigr]^{\,w_{i}^{u}}_{\,0}=1+s.

Short block

Similarly, the short block projection solves

minvnshort12vew,(S)22s.t.iIshortvi=s,wivi0\min_{v\in\mathbb{R}^{n_{\text{short}}}}\frac{1}{2}\|v-e_{w,(S)}\|_{2}^{2}\quad\text{s.t.}\quad\sum_{i\in I_{\text{short}}}v_{i}=-s,\;w_{i}^{\ell}\leq v_{i}\leq 0

whose solution is

vi=[ew,iλ]wi 0,iIshortv_{i}=\bigl[e_{w,i}-\lambda^{\prime}\bigr]^{\,0}_{\,w_{i}^{\ell}},\qquad i\in I_{\text{short}} (5.8)

with λ\lambda^{\prime} uniquely determined by iIshort[ew,iλ]wi 0=s.\sum_{i\in I_{\text{short}}}\bigl[e_{w,i}-\lambda^{\prime}\bigr]^{\,0}_{\,w_{i}^{\ell}}=-s.

Existence and uniqueness of λ\lambda and λ\lambda^{\prime} follow from the fact that the left-hand sides of the above equations define continuous, strictly decreasing functions whose ranges cover the imposed budgets. In practice, the values of λ\lambda and λ\lambda^{\prime} can be efficiently computed using the O(n)O(n) algorithm for projecting a vector onto the intersection of a hyperplane and a box, as described in Maculan et al. (2003). This guarantees a numerically stable and computationally scalable implementation of operator PS.

5.3.2 Long-only model

Under short-selling suspension, the feasible region reduces to

S={wnlong-only:iIlong-onlywi=1, 0wiwiu}.S^{\prime}=\biggl\{w\in\mathbb{R}^{n_{\text{long-only}}}:\sum_{i\in I_{\text{long-only}}}w_{i}=1,\;0\leq w_{i}\leq w_{i}^{u}\biggr\}.

The projection onto SS^{\prime} is the specialization of the long-block projector (5.7):

wi=[ew,iλ] 0wiu,iIlong-onlyw_{i}=\bigl[e_{w,i}-\lambda\bigr]^{\,w_{i}^{u}}_{\,0},\qquad i\in I_{\text{long-only}}

where λ\lambda is chosen so that iIlong-only[ew,iλ] 0wiu=1\sum_{i\in I_{\text{long-only}}}\bigl[e_{w,i}-\lambda\bigr]^{\,w_{i}^{u}}_{\,0}=1.

5.4 AMPSO

The AMPSO solver integrates the components introduced in Sections 5.1-5.3.1 into a unified multi‑operator swarm framework. At each iteration, particles evolve according to the PSO–TVAC dynamics and are subsequently projected onto the feasible region to enforce budget and long–short constraints. The adaptive operator selection (AOS) mechanism then complements the PSO update by choosing, with probability matching, the variation operator that has shown the most favorable balance between quality improvement and population dispersion over a sliding performance window.

This integration allows AMPSO to alternate between exploitation phases, dominated by PSO–TVAC updates, and exploration phases, driven by the adaptive activation of mutation and crossover operators. The overall procedure yields a solver capable of adjusting its search behavior in response to the topology of the optimization landscape while maintaining feasibility at every iteration. A complete pseudocode of AMPSO, together with the full specification of the variation operators, is reported in the Supplementary Material.

6 Experimental Analysis

This section presents the empirical assessment of the proposed long–short portfolio optimization framework. First, we introduce the data set used in the study and the parameters adopted for the screening criteria. Second, we examine the effectiveness of the MEREC–TODIMSort procedure in assigning assets to long, neutral, and short categories. Third, we evaluate the performance of the AMPSO algorithm and compare its behavior with standard evolutionary benchmarks. Finally, we assess the ex‑post profitability of the portfolios generated by the full pipeline.

6.1 Data and Parameter Specification

The empirical analysis is conducted on a European equity universe composed of a subset of the constituents of the STOXX Europe 600 index. Daily closing prices, market capitalization, and ESG scores are retrieved from Refinitiv, and all series are aligned across the sample to ensure temporal consistency. The data set spans the period from 1 June 2013 to 30 June 2023. Assets with missing observations within any relevant estimation window are removed, resulting in an investment universe of N=421N=421 stocks.

The backtesting procedure follows a monthly rebalancing scheme. On each rebalancing date, defined as the last trading day of the month, we utilize a 3-year rolling in-sample window to obtain the historical information needed to estimate the empirical Omega ratio and to compute the quantities involved in the screening and classification stages. Then, we allocate the optimal portfolio to the subsequent 1-month out-of-sample period, in line with the single-period framework described in Section 3. In addition to price-based information, the pre-selection stage requires the construction of regime-conditioned decision matrices used to define the dynamic limiting profiles of the TODIMSort procedure (Section 4.3). Since at least two historical matrices are needed for each market phase, we impose a minimum of 12 initial months before the first evaluation date. Combined with the 3-year rolling window, this implies that the empirical study is carried out on 72 admissible rebalancing dates, from 31 July 2017 to 30 June 2023. This trimming step ensures that all computations rely on statistically coherent price histories and regime-specific information sets that remain representative of the prevailing market conditions.

The data set also incorporates relevant regulatory constraints. During the COVID-19 turmoil, the Italian market authority CONSOB introduced a temporary ban on creating or increasing net short positions. The measure, enacted by Resolution No. 21303 on March 17, 2020 and endorsed by ESMA, remained in force from March 18 to June 18, 2020. Hence, we treated the rebalancing windows intersecting March, April, and May 2020 as long-only, ensuring full adherence to the regulatory environment.

The implementation of the MEREC–TODIMSort framework requires a consistent parameter specification for all screening criteria, that we report in Table 2. Medium-term momentum is computed using monthly returns over a 1-year lookback horizon, with a 1-month lag to mitigate short-term reversal effects. The upside-to-downside beta ratio is estimated using two alternative benchmark state thresholds τ{q0.25(rb),q0.50(rb)}\tau\in\bigl\{q_{0.25}(r_{b}),\;q_{0.50}(r_{b})\bigr\}, where qδ(rb)q_{\delta}(r_{b}) denotes the empirical δ\delta-quantile of benchmark returns. The threshold τ=q0.25(rb)\tau=q_{0.25}(r_{b}) captures adverse market events, while τ=q0.50(rb)\tau=q_{0.50}(r_{b}) provides a balanced state partition consistent with typical market conditions. Drawdown-related indicators are evaluated using an daily observations on a 3-year in-sample window. ESG-based indicators are computed from end-of-month Refinitiv scores: ESG momentum uses a 12-month horizon, while ESG volatility is computed over 24- and 36-month windows, with a 1-month lag applied uniformly to prevent look-ahead bias. Regime-dependent indicators, including Normalized Mutual Information (NMI) and regime-concordant technical score StechS_{\text{tech}}, rely on the market-phase detection mechanism of Section 4.4.1 and therefore inherit the same rolling estimation windows for return- and price-based inputs. This ensures that all regime-conditioned decision matrices remain synchronized and constructed exclusively from information available at each rebalancing date.

Table 2: Summary of screening criteria and parameter settings used in the MEREC–TODIMSort framework.
Criterion Symbol Lookback Type
Momentum MOMi\mathrm{MOM}_{i} 1 year (monthly) and 1-month lag Benefit
Upside–Downside Beta Ratio
(0.25 quantile of benchmark returns) UDRi0.25\mathrm{UDR}^{0.25}_{i} 2 years (daily) Benefit
Upside–Downside Beta Ratio
(0.50 quantile of benchmark returns) UDRi0.50\mathrm{UDR}^{0.50}_{i} 2 years (daily) Benefit
Recovery Time RTi\mathrm{RT}_{i} 3 years (daily) Cost
Recovery Factor RFi\mathrm{RF}_{i} 3 years (daily) Benefit
ESG Momentum ESG-MOMi\mathrm{ESG\text{-}MOM}_{i} 1 year (monthly) and 1-month lag Benefit
ESG Volatility (24m) ESG-VOLi24\mathrm{ESG\text{-}VOL}^{24}_{i} 2 years (monthly) and 1-month lag Cost
ESG Volatility (36m) ESG-VOLi36\mathrm{ESG\text{-}VOL}^{36}_{i} 3 years (monthly) and 1-month lag Cost
Normalized Mutual Information NMIi\mathrm{NMI}_{i} 2 years (daily) Phase-dependent
Regime-Concordant Technical Score Stech,iS_{\mathrm{tech},i} Indicator-specific Benefit

For benchmarking purposes, a synthetic market-value-weighted index constructed from the 421 assets in the investment universe is used as the market proxy throughout the empirical analysis.

6.2 Analysis of the MEREC–TODIMSort Procedure

This subsection examines the behavior of the screening and classification component, detailing how regime identification, MEREC weighting patterns, and TODIMSort limiting profiles jointly shape the construction of long, neutral, and short sets.

6.2.1 Market-phase identifier

A central component of the screening and classification architecture is the market-phase identifier, which governs both the activation of regime-dependent criteria (such as NMI and the regime-concordant technical score) and the construction of the limiting profiles used in TODIMSort. Its reliability is therefore essential to ensure that the criteria matrix and the class boundaries remain consistent with prevailing market conditions. Figure 1 shows the benchmark together with the phases detected at each rebalancing date. The alternation between bull, bear, and sideways states aligns closely with observable shifts in trend direction, short‑term momentum, and directional strength, indicating that the identifier captures regime changes in a timely and coherent manner. This alignment is obtained without anticipatory information: all labels are produced under a strict no–look–ahead policy and rely exclusively on data available at each evaluation date. The evidence in Figure 1 indicates that a classical yet suitably filtered set of technical indicators can generate a parsimonious and stable regime map, providing a coherent foundation for regime‑conditioned screening and classification decisions.

Refer to caption
Figure 1: Evolution of the benchmark index over the experimental period with soft‑colored bands indicating the market phase detected at each rebalancing date (bull, bear, sideways). Regime labels are based exclusively on information available up to the corresponding date.

6.2.2 MEREC: regime-dependent relevance of screening criteria

Next, we examine how MEREC allocates importance across criteria and how this allocation varies with market conditions. From the joint inspection of the box plots in Figure 2 and the phase-specific averages reported in Table 3, a clear and economically coherent hierarchy of criteria emerges.

First, downside- and recovery-related indicators dominate the screening stage across all market regimes. Recovery time (RT) is the most influential criterion, with average weights between 0.21 and 0.23 and a pronounced peak in bear markets, underscoring the central role of resilience and post-drawdown behavior under stressed conditions. Recovery factor (RF) consistently follows as the second most relevant indicator, with weights around 0.18–0.19 and limited dispersion, confirming the stability of longer-horizon drawdown metrics in providing cross-sectional discrimination. Then, a second group of criteria contributes more uniformly across regimes. Specifically, ESG-related measures display stable and meaningful weights: ESG momentum ranges between 0.084 and 0.097, while ESGVOL24\mathrm{ESG\!-\!VOL}^{24} and ESGVOL36\mathrm{ESG\!-\!VOL}^{36} lie in the 0.09–0.12 interval, with no pronounced regime-specific peaks. This pattern indicates that short- and medium-horizon ESG variability provide consistent discriminative power independently of market conditions. Similarly, the two asymmetric beta ratios retain significant importance, particularly UDR0.25\mathrm{UDR}^{0.25}, while exhibiting the highest temporal variability (standard deviations around 0.04–0.05), reflecting the inherently time-varying nature of conditional betas and their sensitivity to shifts in market states. Finally, a third group comprises criteria whose contribution is strongly regime dependent. Normalized mutual information (NMI) attains its lowest relevance in bear markets (0.0082), compared to bull (0.0203) and sideways (0.0214) phases, suggesting that benchmark co-movement becomes less informative precisely when market-wide risk intensifies and cross-sectional dispersion narrows. In contrast, the technical score StechS^{\mathrm{tech}} becomes substantially more informative during downturns (0.0830), compared to bull (0.0257) and sideways (0.0471) regimes. This behavior indicates that trend asymmetries, volatility-driven signals, and range-trading dynamics contain richer discriminative content under stressed environments.

Refer to caption
Figure 2: Box plot of the criterion weights assigned by the MEREC method across all rebalancing periods.
Table 3: Average MEREC weights across market phases, with standard deviations in parentheses.
Criterion Bull Bear Sideways
MOM\mathrm{MOM} 0.0663 (0.0178) 0.0616 (0.0152) 0.0606 (0.0168)
UDR0.25\mathrm{UDR}^{0.25} 0.1079 (0.0463) 0.0997 (0.0473) 0.0904 (0.0365)
UDR0.50\mathrm{UDR}^{0.50} 0.0583 (0.0214) 0.0480 (0.0184) 0.0605 (0.0285)
RT\mathrm{RT} 0.2153 (0.0278) 0.2292 (0.0351) 0.2098 (0.0230)
RF\mathrm{RF} 0.1837 (0.0094) 0.1874 (0.0070) 0.1845 (0.0075)
ESGMOM\mathrm{ESG\!-\!MOM} 0.0949 (0.0133) 0.0844 (0.0150) 0.0969 (0.0132)
ESGVOL24\mathrm{ESG\!-\!VOL}^{24} 0.1054 (0.0196) 0.0900 (0.0170) 0.1068 (0.0189)
ESGVOL36\mathrm{ESG\!-\!VOL}^{36} 0.1222 (0.0175) 0.1086 (0.0150) 0.1221 (0.0186)
NMI\mathrm{NMI} 0.0203 (0.0061) 0.0082 (0.0030) 0.0214 (0.0049)
StechS^{\mathrm{tech}} 0.0257 (0.0093) 0.0830 (0.0141) 0.0471 (0.0146)

Our findings show that MEREC assigns weights in a statistically stable and economically interpretable manner: recovery-related metrics dominate the hierarchy; ESG, momentum, and asymmetric betas provide a consistent intermediate layer of information; and regime-sensitive criteria — especially NMI and StechS^{\mathrm{tech}} — gain or lose relevance according to market conditions.

6.2.3 TODIMSort: limiting profiles and regime sensitivity

The TODIMSort stage converts the weighted multi‑criteria information into an endogenous assignment of assets to short, neutral, and long categories. A key modeling choice concerns the construction of the limiting profiles, which are defined through regime‑conditioned empirical quantiles. In this respect, the three quantile pairs (p1,p2){(5,50),(10,60),(20,70)}(p_{1},p_{2})\in\{(5,50),(10,60),(20,70)\} allow us to vary the selectivity of the profiles in a controlled manner that enables a direct assessment of the robustness of the classification mechanism.

The impact of these alternative specifications on portfolio rotation can be quantified by tracking changes in class membership among consecutive rebalancing dates. To this end, we measure a Hamming‑type distance between the label vectors at tt and t+1t{+}1. Let t,i{1,0,+1}\ell_{t,i}\in\{-1,0,+1\} denote the class assigned to asset ii at date tt, and define the full label vector t=(t,1,,t,N)\boldsymbol{\ell}_{t}=(\ell_{t,1},\ldots,\ell_{t,N})^{\top}.222Specifically, t,i=+1\ell_{t,i}=+1 if iIlongi\in I_{\text{long}}, it is 1-1 if iIshorti\in I_{\text{short}}, and 0 otherwise. The intertemporal Hamming distance is

dHamm(t,t+1)=i=1N𝕀(t,it+1,i),d_{\text{Hamm}}(t,t{+}1)=\sum_{i=1}^{N}\mathbb{I}\!\left(\ell_{t,i}\neq\ell_{t+1,i}\right), (6.1)

with 𝕀\mathbb{I} representing the indicator function. This distance counts how many assets change class between tt and t+1t{+}1. Since our empirical analysis focuses on the rotation of the long and short legs separately, we apply the same definition to the corresponding subsets of indices. Specifically, dHammlong(t,t+1)d_{\text{Hamm}}^{\text{long}}(t,t{+}1) is computed by restricting the sum in Eq. (6.1) to the assets that belong to the long leg at either tt or t+1t{+}1, and analogously for dHammshort(t,t+1)d_{\text{Hamm}}^{\text{short}}(t,t{+}1). This makes explicit that the leg-specific distances shown in Figure 3 are projections of the general Hamming measure onto the buy-list and sell-list, respectively. In particular, we can see that the three configurations (5,50)(5,50), (10,60)(10,60), and (20,70)(20,70) produce box plots that largely overlap in both panels, indicating that the construction of regime-conditioned profiles is not excessively sensitive to moderate changes in (p1,p2)(p_{1},p_{2}). In practice, most of the observed rotation originates from genuine shifts in dominance and from regime changes rather than from arbitrary threshold choices.

Refer to caption
Refer to caption
Figure 3: Box plot of the Hamming distance for long (left) and short (right) legs under the three limiting-profile configurations.

Table 4 reports the average number of assets assigned to the short, neutral, and long categories under each market regime and each (p1,p2)(p_{1},p_{2}) setting. Two implications stand out. First, the neutral class consistently contains the majority of the universe (roughly 330–364 assets), indicating that TODIMSort concentrates the tails of the dominance distribution into actionable long and short opportunity sets while preserving a sizable middle region of borderline assets. Second, class sizes respond to regime conditions in an economically coherent way: bear markets yield a larger short set and a smaller long set compared to bull and sideways phases. Moreover, changing the quantile thresholds leads to predictable and contained variations in the sizes of the long and short sets, thus providing a clear way to balance the degree of selectivity against the breadth of the investable universe, without disrupting the stability of the rotation mechanism.

Table 4: Average number of assets assigned to long, neutral, and short categories under each market regime (bull, bear, sideways) across the three parameter settings (5,50)(5,50), (10,60)(10,60), and (20,70)(20,70).
(5,50) (10,60) (20,70)
Category Bull Bear Side Bull Bear Side Bull Bear Side
Short 32 51 31 44 37 39 52 38 48
Neutral 330 349 332 340 364 345 342 358 345
Long 59 20 57 37 20 38 27 25 28

Overall, evidences provided by Figures 1, 2, 3, and the summary statistics reported in Tables 3 and 4, show that the combined MEREC–TODIMSort stage remains coherent with the prevailing market regimes. In addition, it exhibits economically interpretable shifts in criterion relevance, and it is robust to reasonable variations in the quantile parameters of the limiting-profiles, both in terms of rotation stability and in terms of long/short opportunity-set sizes.

6.3 Evaluation of the AMPSO Algorithm

We evaluate the algorithmic efficiency of the proposed solver on a benchmark suite constructed to reflect heterogeneous long-short portfolio configurations. We randomly sample without replacement 10 out of the 69 admissible rebalancing dates of the empirical study (the three dates with short-selling ban has not been considered). For each selected date, the MEREC-TODIMSort procedure with (p1,p2)=(5,50)(p_{1},p_{2})=(5,50) for the limiting-profile configurations identified the sets of investable assets. Then, three independent instances of the long–short portfolio optimization problem were generated by varying the leverage parameter in the set s{0.10,0.20,0.30}s\in\{0.10,0.20,0.30\}, while setting the upper bound for the weights of assets in the long leg to wiu=1+sw_{i}^{u}=1+s and the lower bound for those in the short leg to wi=sw_{i^{\prime}}^{\ell}=-s. This design yields a total of 30 instances of the problem 3.2, each characterized by a different market regime, a different composition of the investable universe, and a feasible region that varies with the leverage level. Alternative limiting-profile configurations (10,60)(10,60) and (20,70)(20,70) exhibit consistent trends with those observed in the above setting, hence we report them in the Supplementary Material.

In the analysis, we compare the proposed AMPSO algorithm with the original PSO‑TVAC, as well as with two PSO‑TVAC variants enhanced through our adaptive operator selection (AOS) strategy: one employing exclusively crossover operators (PSO‑TVAC‑CROSS) and the other relying solely on mutation operators (PSO‑TVAC‑MUT). Table 5 reports the parameter settings used for the PSO‑related and AOS components of the algorithms. The initial population size depends on the number of decision variables nn according to the rule NP=min{100,max{20,4log2n}}NP=\min\{100,\max\{20,4\log_{2}n\}\}, which follows the recommendations of Piotrowski et al. (2020). The parameter values for the TVAC component are those suggested in Deng et al. (2012). The algorithm implementation and the evaluation of the benchmark suite was conducted in MATLAB r2025b on a workstation with an Intel Core i9‑7900X CPU and 16 GB of RAM. The optimization process terminates when the maximum number of function evaluations is reached, i.e. MAXFES=10000×n\mathrm{MAX}_{FES}=10000\times n. To ensure a fair and statistically robust comparison, each solver was executed for 30 independent runs on every problem in the benchmark suite. Random seeds were varied in runs, while all other experimental conditions were kept fixed.

Table 5: Parameter settings for AMPSO and baseline PSO variants.
Parameter Value
Swarm Size NP=min{100,max{20, 4log2n}}NP=\min\{100,\,\max\{20,\,4\log_{2}n\}\}
Max Fitness Eval MAXFES=10000×n\mathrm{MAX}_{FES}=10000\times n
TVAC ωmin=0.4\omega_{\min}=0.4, ωmax=0.9\omega_{\max}=0.9, c1,min=c2,min=0.5c_{1,\min}=c_{2,\min}=0.5,
c1,max=c2,max=2.5c_{1,\max}=c_{2,\max}=2.5
AOS npar=min{10,NP}n_{\text{par}}=\min\{10,NP\}, Tw=50T_{\text{w}}=50, pmin=0.02p_{\min}=0.02, papp=0.5p_{\text{app}}=0.5

Figure 4 provides a synthetic view of the operator‑selection behavior of AMPSO across the entire set of benchmark instances. The heatmap reports the relative frequency with which each operator is selected within 15 equally spaced segments of the total computational budget. A clear pattern emerges. The two mutation operators — Gaussian and Levy — together with the vertical crossover dominate the selection profile throughout most phases of the search. Their consistently high activation rates indicate that AMPSO relies primarily on these mechanisms to balance local refinement and global exploration. In contrast, the remaining crossover operators contribute only marginally. Their activation frequencies remain low and relatively stable across all evaluation intervals, suggesting that they play a secondary role in the dynamics of the algorithm. This behavior is coherent with the structure of the optimization landscape: the combination of mutation‑based perturbations and the vertical crossover appears to provide the most effective means of navigating the feasible region induced by the long–short constraints.

Refer to caption
Figure 4: Aggregated heatmap of AMPSO operator usage for configuration (5,50)(5,50) over 15 normalized fitness-evaluation intervals. The first row aggregates the 12 crossover operators into the category “Other XOs”, while the remaining rows report the usage of the vertical crossover (VX), Gaussian mutation (GAUSS), and Lévy mutation (LEVY). Cell intensities and annotations indicate the proportion of operator applications within each interval.

The pairwise comparison among algorithms was first assessed in terms of final performance scores using the Wilcoxon signed ranks test, following Derrac et al. (2011). Table 6 reports, for each competing method, the number of test instances in which AMPSO performs significantly better, equivalent, or worse at the 5%5\% significance level. The comparison with PSO-TVAC is straightforward: AMPSO secures a large majority of significant wins and virtually no losses, clearly outperforming the baseline variant. A similar, though slightly less pronounced, pattern is observed against PSO-TVAC-MUT, where AMPSO still achieves substantially more significant wins than defeats. In contrast, the comparison with PSO-TVAC-CROSS is notably more balanced. The numbers of significant wins and losses are comparable, and most cases fall under statistical equivalence. This indicates that the crossover-only variant remains competitive with AMPSO across the benchmark suite, with neither method exhibiting a decisive statistical advantage at the 5%5\% level.

Table 6: Number of instances in which AMPSO performs better, equally, or worse than the reference algorithms according to the Wilcoxon signed-rank test at the 5% significance level.
Algorithm Better Equal Worse
PSO-TVAC 30 0 0
PSO-TVAC-CROSS 4 22 4
PSO-TVAC-MUT 29 1 0

To assess differences in convergence behavior across solvers, we employ Page’s trend test as in Derrac et al. (2014). For each optimization problem and each algorithm, we record objective function values at 15 equally spaced cut points and compute, at each cut point, the rank of the competing methods. Let LL denote the Page statistic obtained by aggregating these ranks with increasing weights that emphasize later stages of the optimization process. We test the ordered one-sided hypothesis

H0:AMPSO and the competitor have the same convergence pace\displaystyle H_{0}:\text{AMPSO and the competitor have the same convergence pace}
H1:AMPSO converges faster than the competitor\displaystyle H_{1}:\text{AMPSO converges faster than the competitor}

which corresponds to smaller objective values in our case. In our case, since the empirical Omega ratio produces non-convex and non-smooth surfaces without guaranteed plateaus, and objective values typically continue to evolve until the evaluation budget is exhausted, we can not apply the “Alternative” Page trend test formulation reported in Derrac et al. (2014). We only report the results for the standard form of the Page statistic in Table 7. In this context, the emerging pattern is fully consistent with the evidence from the Wilcoxon analysis. AMPSO exhibits a clear and systematic convergence advantage over PSO-TVAC, with pp-values that strongly reject the null hypothesis of equal convergence profiles, indicating that it not only reaches better final objective function values but also progresses more rapidly throughout the optimization process. A similar advantage is observed in comparison with PSO-TVAC-MUT, where the trend test again favors AMPSO in most instances. The mutation-only variant does not match the rate at which AMPSO reduces the objective value, suggesting that adaptive selection of variation operators provides a more effective exploratory–exploitative balance. In contrast, the comparison with PSO-TVAC-CROSS yields more tempered results. In several cases, the pp-values do not indicate a statistically significant trend in favor of either method, suggesting that the crossover-only variant achieves a convergence profile comparable to that of AMPSO.

Collectively, the results of the Page trend test confirm that the improvements achieved by AMPSO relative to PSO-TVAC and PSO-TVAC-MUT derive not only from superior end-of-run performance, but also from a consistently faster convergence pattern. However, PSO-TVAC-CROSS remains competitive in convergence speed, showing no systematic disadvantage when the temporal evolution of the optimization process is taken into account.

Table 7: Page trend test comparing the convergence speed of AMPSO with baseline solvers. The statistic LL is computed as the weighted sum of within-problem ranks across the 15 cut points. The one-sided pp–value tests the ordered alternative that AMPSO exhibits faster convergence than the competitor. Significant values (<5%<5\%) are in bold.
Comparison LL ZZ pp–value
AMPSO vs PSO–TVAC 30375 3.8414 6.12e-05
AMPSO vs PSO–TVAC–CROSS 29220 1.0235 0.1530
AMPSO vs PSO–TVAC–MUT 30060 3.0729 0.0011

6.4 Ex-Post Profitability Analysis

This subsection evaluates the ex‑post performance of the long–short portfolios produced by the MEREC–TODIMSort–AMPSO framework. Building on the optimization setup introduced in Section 6.3, we retain the same three leverage levels, s{0.10,0.20,0.30}s\in\{0.10,0.20,0.30\}, and focus here on how portfolio quality is affected by the pre‑selection stage. Specifically, we examine three limiting‑profile configurations for TODIMSort, (p1,p2){(5,50),(10,60),(20,70)}(p_{1},p_{2})\in\{(5,50),\,(10,60),\,(20,70)\}, which span increasing degrees of selectivity in defining the long and short opportunity sets and thus enable a controlled assessment of how stringent classification rules shape portfolio characteristics. Moreover, to assess the contribution of sustainability-related information to long–short portfolio construction, the performance of the ESG-enhanced strategies is compared both with portfolios built under the same parameterization but excluding ESG criteria in the pre-selection phase and with the market-value-weighted index. This design enables us to evaluate whether the incorporation of ESG-based indicators yields systematic improvements in risk-adjusted performance and downside-risk behavior.

6.4.1 Ex-post performance and risk measures

Following established principles in performance‑measurement theory, we employ a multidimensional set of ex‑post indicators that capture return efficiency, total risk, downside risk, and persistence of drawdowns (Reilly et al., 2018). Ex‑post performance is evaluated on the monthly sequence of optimized portfolio returns {rtopt}\{r_{t}^{\text{opt}}\} generated in the M=72M=72 investment windows. Let r¯\bar{r} and σM\sigma_{M} denote, respectively, the sample monthly mean and volatility of {rtopt}\{r_{t}^{\text{opt}}\}. The performance metrics considered are as follows.

Compound annual growth rate. Let WM=t=1M(1+rtopt)W_{M}=\prod_{t=1}^{M}(1+r_{t}^{\text{opt}}) be the terminal wealth from unit initial capital. The annualized growth rate is

CAGR=WM 12/M1.\mathrm{CAGR}=W_{M}^{\,12/M}-1.

Annualized Sharpe ratio. This measure represents the compensation earned by the investor per unit of risk, and is computed as

Sharpeann=r¯σM12.\mathrm{Sharpe}_{\mathrm{ann}}=\frac{\bar{r}}{\sigma_{M}}\,\sqrt{12}.

Annualized Sortino ratio. To penalize only downside variability, we first define the lower semideviation as σM=1Mt=1M(min{rtoptMAR,0})2\sigma_{M}^{-}=\sqrt{\frac{1}{M}\sum_{t=1}^{M}\bigl(\min\{r_{t}^{\text{opt}}-\mathrm{MAR},0\}\bigr)^{2}}, where MARMAR denotes the minimum acceptable return. Then, the annualized Sortino ratio is

Sortinoann=r¯σM12.\mathrm{Sortino}_{\mathrm{ann}}=\frac{\bar{r}}{\sigma_{M}^{-}}\,\sqrt{12}.

Following standard practice in empirical performance analysis, we set MAR=0MAR=0, which yields a downside-risk assessment relative to the break even level and avoids introducing additional noise from short-term interest-rate fluctuations.

Rachev ratio. Let q1αq_{1-\alpha} and qβq_{\beta} be the empirical (1α)(1-\alpha) and β\beta quantiles of {rtopt}\{r_{t}^{\text{opt}}\}, respectively. We define expected tail gain (ETG) as the average of portfolio returns that exceed q1αq_{1-\alpha}, and expected tail loss (ETL) as the average of returns that fall below qβq_{\beta}. Then, the Rachev ratio is

Rachevα,β=ETGETL.\mathrm{Rachev}_{\alpha,\beta}=\frac{\mathrm{ETG}}{\mathrm{ETL}}.

While Sharpe and Sortino summarize central tendencies and downside dispersion, the Rachev ratio contrasts extreme gains with extreme losses, thus capturing tail asymmetry. In our implementation, we set tail parameters α=β=0.05\alpha=\beta=0.05.

Annualized standard deviation. Overall volatility is reported as

σann=σM12.\sigma_{\mathrm{ann}}=\sigma_{M}\,\sqrt{12}.

Maximum drawdown. Let VtV_{t} denote the wealth process implied by {rtopt}\{r_{t}^{\text{opt}}\} and Pt=maxstVsP_{t}=\max_{s\leq t}V_{s} its running peak. The maximum drawdown is

MDD=maxtPtVtPt.\mathrm{MDD}=\max_{t}\frac{P_{t}-V_{t}}{P_{t}}.

Ulcer index. With Dt=(PtVt)/PtD_{t}=(P_{t}-V_{t})/P_{t} denoting the drawdown relative to the preceding peak, the Ulcer index captures both the depth and the persistence of drawdowns, and is defined as

UI=1Mt=1MDt2.\mathrm{UI}=\sqrt{\frac{1}{M}\sum_{t=1}^{M}D_{t}^{2}}.

Average turnover. Let wt,jw_{t,j} be the post-trade weight of asset jj at time tt, and w~t,j\widetilde{w}_{t^{-},j} the pre-trade weight, then the one-way turnover at time tt, with t=2,Mt=2,\ldots M, is

Turnovert=12j=1N|wt,jw~t,j|.\mathrm{Turnover}_{t}=\frac{1}{2}\sum_{j=1}^{N}|w_{t,j}-\widetilde{w}_{t^{-},j}|.

To assess trading activity and approximate transaction costs in the ex‑post period, we rely on the average monthly turnover.

6.4.2 Discussion of Results

Table 8 reports ex-post performance and downside-risk measures of optimized portfolios for the three configurations of the limiting-profiles and leverage levels. A cross-configuration inspection reveals several consistent patterns. First, moderate and higher leverage levels generally translate into improved risk-adjusted performance, as reflected by the Sharpe and Sortino ratios, with configuration (20,70)(20,70) producing the most stable gains across ss. Second, downside-risk indicators (maximum drawdown and Ulcer index) remain well controlled and exhibit limited dispersion despite the increased long exposure 1+s1+s. Finally, portfolio turnover remains moderate and increases only gradually with leverage, suggesting that the combination of MEREC–TODIMSort screening and AMPSO optimization yields stable long–short allocations even when the limiting-profile becomes more selective.

Table 8: Ex-post performance and risk measures for the optimized portfolio across the three limiting-profile configurations (5,50)(5,50), (10,60)(10,60), and (20,70)(20,70), evaluated at leverage levels s=0.10s=0.10, 0.200.20, and 0.300.30.
Config ss CAGR Sharpe (ann.) Sortino (ann.) Rachev Std (ann.) Max DD Ulcer Avg Turnover
(5,50) 0.100.10 0.1785 1.0096 1.5521 0.8724 0.1799 0.2336 0.0826 0.7240
0.200.20 0.1440 0.7985 1.2031 0.8532 0.1929 0.3830 0.1438 0.8389
0.300.30 0.1726 0.8675 1.3417 1.0172 0.2104 0.3813 0.1383 0.9557
(10,60) 0.100.10 0.1436 0.8372 1.2549 0.7921 0.1810 0.2973 0.1197 0.6670
0.200.20 0.1269 0.7583 1.1067 0.7702 0.1800 0.3623 0.1461 0.7568
0.300.30 0.1320 0.7828 1.1308 0.7692 0.1804 0.3009 0.1266 0.9288
(20,70) 0.100.10 0.1438 0.8124 1.2266 0.8818 0.1882 0.3894 0.1532 0.6104
0.200.20 0.1698 0.9083 1.4306 0.9741 0.1946 0.3462 0.1286 0.7346
0.300.30 0.1860 0.9293 1.5107 1.0878 0.2080 0.3265 0.1087 0.8588

Table 9 shows the corresponding results for portfolios constructed without ESG criteria. Eliminating ESG-based indicators generally leads to a deterioration in both performance and downside-risk behavior. Across most configurations and leverage levels, Sharpe and Sortino ratios decrease, while drawdown measures increase, indicating a reduced ability to mitigate adverse outcomes. Turnover is consistently higher, reflecting less stable long–short allocations when sustainability-related information is excluded.

Table 9: Ex-post performance and risk measures for the comparison models corresponding to the limiting-profile configurations (5,50)(5,50), (10,60)(10,60), and (20,70)(20,70), evaluated at leverage levels s=0.10s=0.10, 0.200.20, and 0.300.30, together with the benchmark portfolio. In all comparison models, metrics are obtained by excluding ESG criteria from the MEREC-TODIMSort preselection stage.
Config ss CAGR Sharpe (ann.) Sortino (ann.) Rachev Std (ann.) Max DD Ulcer Avg Turnover
(5,50) 0.100.10 0.1066 0.5886 0.8702 0.9054 0.2106 0.3719 0.1468 0.8182
0.200.20 0.1497 0.7904 1.2242 0.9587 0.2037 0.3769 0.1485 0.9384
0.300.30 0.1588 0.7737 1.2702 1.1279 0.2235 0.3770 0.1441 1.1613
(10,60) 0.100.10 0.1722 0.9230 1.5106 1.0478 0.1933 0.2981 0.0978 0.7420
0.200.20 0.1079 0.6301 0.9609 0.9333 0.1927 0.4015 0.1463 0.8885
0.300.30 0.1325 0.6967 1.0589 0.9561 0.2118 0.3614 0.1321 1.0572
(20,70) 0.100.10 0.1512 0.7979 1.2159 0.9083 0.2035 0.3729 0.1440 0.7528
0.200.20 0.1368 0.8043 1.3161 1.1184 0.1801 0.3301 0.1242 0.8140
0.300.30 0.1646 0.8643 1.3734 1.0496 0.2006 0.3259 0.1118 0.9504
Benchmark 0.1964 1.0561 1.8839 1.3669 0.1873 0.2942 0.0965

Overall, while the benchmark achieves the strongest tail‑sensitive performance, ESG‑enhanced portfolios exhibit systematically lower downside risk and more stable risk‑adjusted returns. These improvements are consistent with the evidence in Magnani et al. (2024), who show that firms with more stable ESG profiles — captured through lower ESG‑score volatility — earn higher Sharpe ratios and significant alphas relative to their less stable counterparts. This suggests that ESG‑based filtering can contribute to more resilient long–short allocations by favouring issuers with sustained and predictable ESG trajectories. Finally, a formal statistical assessment of the differences in Sharpe ratios and return variances across ESG, no‑ESG, and benchmark portfolios was also conducted using HAC‑robust bootstrap procedures and log‑variance tests. The detailed numerical results, together with the corresponding pp‑values for the three limiting‑profile configurations and leverage levels, are reported in the Supplementary Material. These tests corroborate the qualitative evidence discussed above: ESG‑enhanced portfolios tend to exhibit lower downside risk and more stable risk‑adjusted performance than their no‑ESG counterparts, especially under the baseline configuration (5,50)(5,50) and moderate leverage.

7 Conclusions and Future Works

This study demonstrates how a modular combination of multi‑criteria asset classification and adaptive swarm‑based optimization can be effectively used to design ESG‑aware long–short portfolios. At the screening and classification level, the TODIMSort framework, enhanced with MEREC objective weights, assigns assets to preference‑ordered categories in a manner that naturally supports both long‑only and long–short constructions. At the optimization level, the AMPSO solver maximizes the Omega ratio subject to realistic budget, bound, and leverage constraints. To address this non-convex setting, we develop AMPSO, an adaptive particle swarm variant equipped with a multi-operator selection controller and a projection-based repair mechanism for constraint handling. The resulting MEREC–TODIMSort–AMPSO process offers a flexible architecture in which screening, classification, and optimization remain distinct but fully integrable components, allowing each module to be adapted to alternative sets of criteria, realistic constraints, and specific investor objectives.

Empirical analysis of European equity data illustrates the effectiveness of the proposed system. The MEREC–TODIMSort module provides stable long, neutral, and short partitions that adapt to market phases and preserve economic interpretability. The AMPSO solver demonstrates robust convergence behavior and high performance in navigating non-convex and constrained search scenarios. Finally, ex post performance evaluation shows that ESG-aware strategies remain competitive and often outperform their non-ESG counterparts and the market-value-weighted benchmark, particularly in terms of downside protection and drawdown mitigation.

Despite its strengths, the framework presents several limitations that motivate future research. First, the empirical study focuses exclusively on the European market, where ESG data disclosure is relatively advanced. Extending the analysis to regions with different regulatory contexts would offer a broader perspective on the role of sustainability information in heterogeneous markets. Second, turnover limits and explicit transaction cost constraints are not modeled in the current formulation. Although this omission is less restrictive for institutional investors, the integration of such frictions would improve the operational relevance of the framework and further enhance its practical applicability.

References

  • F. Alali and A. C. Tolga (2019) Portfolio allocation with the TODIM method. Expert Systems with Applications 124, pp. 341–348. External Links: Document Cited by: §1.
  • A. K. Banerjee, H. K. Pradhan, A. Sensoy, F. Fabozzi, and B. Mahapatra (2024) Robust portfolio optimization with fuzzy TODIM, genetic algorithm and multi-criteria constraints. Annals of Operations Research 337, pp. 1–22. External Links: Document Cited by: Table 1, §2.
  • M. Borkovec, I. Domowitz, B. Kiernan, and V. Serbin (2010) Portfolio optimization and the cost of trading. The Journal of Investing 19 (2), pp. 63–72. External Links: Document Cited by: §1.
  • T. J. Chang, N. Meade, J. E. Beasley, and Y. M. Sharaiha (2000) Heuristics for cardinality constrained portfolio optimisation. Computers and Operations Research 27 (13), pp. 1271–1302. External Links: Document Cited by: §1.
  • A. Chekhlov, S. Uryasev, and M. Zabarankin (2005) Drawdown measure in portfolio optimization. International Journal of Theoretical and Applied Finance 8 (1), pp. 13–58. External Links: Document Cited by: §4.4.2.
  • Y.-H. Chou, Y.-C. Jiang, and S.-Y. Kuo (2021) Portfolio optimization in both long and short selling trading using trend ratios and quantum-inspired evolutionary algorithms. IEEE Access 9, pp. 152115–152130. External Links: Document Cited by: §1.
  • M. Corazza, G. di Tollo, G. Fasano, and R. Pesenti (2021) A novel hybrid pso-based metaheuristic for costly portfolio selection problems. Annals of Operations Research 304 (1), pp. 109–137. External Links: Document Cited by: §1.
  • K. Daniel and T. J. Moskowitz (2016) Momentum crashes. Journal of Financial Economics 122 (2), pp. 221–247. External Links: Document Cited by: §4.4.2.
  • V. DeMiguel, L. Garlappi, F. J. Nogales, and R. Uppal (2009) Portfolio optimization: improving performance by constraining portfolio norms. Management Science 55 (5), pp. 798–812. External Links: Document Cited by: §1.
  • G. F. Deng, W. T. Lin, and C. C. Lo (2012) Markowitz-based portfolio selection with cardinality constraints using improved particle swarm optimization. Expert Systems with Applications 39 (4), pp. 4558–4566. External Links: Document Cited by: §5.1, §6.3.
  • J. Derrac, S. García, S. Hui, P. N. Suganthan, and F. Herrera (2014) Analyzing convergence performance of evolutionary algorithms: a statistical approach. Information Sciences 289, pp. 41–58. External Links: Document Cited by: §6.3, §6.3.
  • J. Derrac, S. García, D. Molina, and F. Herrera (2011) A practical tutorial on the use of nonparametric statistical tests as a methodology for comparing evolutionary and swarm intelligence algorithms. Swarm and Evolutionary Computation 1 (1), pp. 3–18. External Links: Document Cited by: §6.3.
  • G. di Tollo, G. Fattoruso, and G. Filograsso (2024) An adaptive evolutionary strategy for long–short portfolio construction. Decisions in Economics and Finance.. External Links: Document Cited by: §1.
  • K. Erwin and A. Engelbrecht (2023) Meta-heuristics for portfolio optimization. Soft Computing 27, pp. 19045–19073. External Links: Document Cited by: §1.
  • S. Farinelli, M. Ferreira, D. Rossello, M. Thoeny, and L. Tibiletti (2008) Beyond Sharpe ratio: optimal asset allocation using different performance ratios. Journal of Banking and Finance 32 (10), pp. 2057–2063. External Links: Document Cited by: §1.
  • G. Filograsso and G. di Tollo (2023) Adaptive evolutionary algorithms for portfolio selection problems. Computational Management Science 20 (1), pp. 7. External Links: Document Cited by: §1, §5.2.
  • J. D. Fisher and J. D’Alessandro (2021) Portfolio upside and downside risk—both matter. The Journal of Portfolio Management 47 (10), pp. 158–171. External Links: Document Cited by: §4.4.2.
  • M. Ghorabaee, M. Amiri, E. K. Zavadskas, Z. Turskis, and J. Antucheviciene (2021) Determination of objective weights using a new method based on the removal effects of criteria (MEREC). Symmetry 13 (4), pp. 525. External Links: Document Cited by: §1.
  • M. Gilli, E. Schumann, G. di Tollo, and G. Cabej (2011) Constructing 130/30-portfolios with the Omega ratio. Journal of Asset Management 12 (2), pp. 94–108. External Links: Document Cited by: §1.
  • L. F. A. M. Gomes and M. M. P. P. Lima (1991) TODIM: basics and application to multicriteria ranking. Foundations of Computing and Decision Sciences 16 (3–4), pp. 113–127. External Links: Link Cited by: §1.
  • G. Guastaroba, R. Mansini, W. Ogryczak, and M. G. Speranza (2016) Linear programming models based on Omega ratio for the enhanced index tracking problem. European Journal of Operational Research 251, pp. 938–956. External Links: Document Cited by: §1, §3.1.1.
  • J. B. Guerard (Ed.) (2009) Handbook of portfolio construction: contemporary applications of markowitz techniques. Springer, New York. Note: ISBN: 978-0-387-77438-1 Cited by: §1.
  • B. I. Jacobs, K. N. Levy, and D. Starer (1999) Long–short portfolio management: an integrated approach. The Journal of Portfolio Management 25, pp. 23–32. External Links: Document Cited by: §1.
  • V. Jain, R. R. Sahay, and Nupur (2025a) Integrated robust EM and TODIM approach with teaching learning based portfolio optimization. Computational Economics. External Links: Document Cited by: Table 1, §2.
  • V. Jain, R. R. Sahay, and Nupur (2025b) Leveraging ESG rating and advanced analytics for portfolio optimization. International Journal of System Assurance Engineering and Management. External Links: Document Cited by: Table 1, §2.
  • C. B. Kalayci, O. Ertenlice, and M. A. Akbay (2019) A comprehensive review of deterministic models and applications for mean-variance portfolio optimization. Expert Systems with Applications 125, pp. 345–368. External Links: Document Cited by: §1.
  • M. Kaucic, F. Piccotto, and G. Sbaiz (2024) A constrained swarm optimization algorithm for large-scale long-run investments using sharpe ratio-based performance measures. Computational Management Science 21 (1), pp. 6. External Links: Document Cited by: §1.
  • M. Kaucic (2019) Equity portfolio management with cardinality constraints and risk parity control using multi-objective particle swarm optimization. Computers and Operations Research 109, pp. 300–316. External Links: Document Cited by: §1.
  • M. Kaucic, R. Pelessoni, and F. Piccotto (2025) An automated decision support system for portfolio allocation based on mutual information and financial criteria. Entropy 27 (5), pp. 480. External Links: Document Cited by: §4.3.
  • C. Keating and W. F. Shadwick (2002) A universal performance measure. Journal of Performance Measurement 6, pp. 59–84. External Links: Link Cited by: §3.
  • H. Konno and H. Yamazaki (1991) Mean-absolute deviation portfolio optimization model and its applications to tokyo stock market. Management Science 37 (5), pp. 519–531. External Links: Link Cited by: §1.
  • E. Kouloumpris, K. Moutsianas, and I. Vlahavas (2024) SABER: stochastic-aware bootstrap ensemble ranking for portfolio management. Expert Systems with Applications 249, pp. 123637. External Links: Document Cited by: §1.
  • A. Kraskov, H. Stögbauer, and P. Grassberger (2004) Estimating mutual information. Physical Review E—Statistical, Nonlinear, and Soft Matter Physics 69 (6), pp. 066138. External Links: Document Cited by: §4.4.3.
  • P. Krokhmal, J. Palmquist, and S. Uryasev (2002) Portfolio optimization with conditional value-at-risk objective and constraints. Journal of Risk 4, pp. 43–68. External Links: Document Cited by: §1.
  • M. Linger, T. Metz, K. Sbai, and G. Boulanger (2025) Enhancing long–short portfolios: a refined approach using learn-to-rank algorithms.. Journal of Financial Data Science 7 (1). External Links: Document Cited by: §1.
  • A. W. Lo, C. Petrov, and M. Wierzbicki (2006) It’s 11 pm—do you know where your liquidity is? The mean–variance–liquidity frontier. In The World of Risk Management, H. G. Fong (Ed.), pp. 47–92. External Links: Document Cited by: §1.
  • N. Maculan, C. P. Santiago, E. Macambira, and M. Jardim (2003) An O(n) algorithm for projecting a vector on the intersection of a hyperplane and a box in n\mathbb{R}^{n}. Journal of Optimization Theory and Applications 117 (3), pp. 553–574. External Links: Link Cited by: §1, §5.3.1.
  • M. Magnani, M. Guidolin, and I. Berk (2024) Strong vs. stable: the impact of ESG ratings momentum and their volatility on the cost of equity capital. Journal of Asset Management 25 (7), pp. 666–699. External Links: Document Cited by: §6.4.2.
  • H. Markowitz (1952) Portfolio selection. The Journal of Finance 7, pp. 77–91. External Links: Document Cited by: §1.
  • R. Moral-Escudero, R. Ruiz-Torrubiano, and A. Suárez (2006) Selection of optimal investment portfolios with cardinality constraints. In Proceedings of the IEEE International Conference on Evolutionary Computation (CEC 2006), pp. 2382–2388. External Links: Document Cited by: §1.
  • Z. Nagy, A. Kassam, and L.-E. Lee (2016) Can ESG add alpha? an analysis of ESG tilt and momentum strategies. The Journal of Investing 25 (2), pp. 113–124. External Links: Document Cited by: §4.4.2.
  • G. V. Pai and T. Michel (2012) Integrated metaheuristic optimization of 130–30 investment-strategy-based long-short portfolios. Intelligent Systems in Accounting, Finance and Management 19 (1), pp. 43–74. External Links: Document Cited by: §1.
  • A. P. Piotrowski, J. J. Napiorkowski, and A. E. Piotrowska (2020) Population size in particle swarm optimization. Swarm and Evolutionary Computation 58, pp. 100718. External Links: Document Cited by: §6.3.
  • S. Rachev, S. Ortobelli, S. Stoyanov, F. J. Fabozzi, and A. Biglova (2008) Desirable properties of an ideal risk measure in portfolio theory. International Journal of Theoretical and Applied Finance 11, pp. 19–54. External Links: Document Cited by: §3.
  • A. Ratnaweera, S. K. Halgamuge, and H. C. Watson (2004) Self-organizing hierarchical particle swarm optimizer with time-varying acceleration coefficients. IEEE Transactions on Evolutionary Computation 8, pp. 240–255. External Links: Document Cited by: §1, §5.1.
  • F. K. Reilly, K. C. Brown, and S. J. Leeds (2018) Investment analysis and portfolio management. 11 edition, Cengage Learning, Boston, MA. Note: ISBN: 978-1305262997 Cited by: §6.4.1.
  • R. T. Rockafellar and S. Uryasev (2000) Optimization of conditional value-at-risk. Journal of Risk 2, pp. 21–41. External Links: Document Cited by: §1.
  • R. Ruiz-Torrubiano and A. Suarez (2015) A memetic algorithm for cardinality-constrained portfolio optimization with transaction costs. Applied Soft Computing 36, pp. 125–142. External Links: Document Cited by: §1.
  • A. Sharma, S. Utz, and A. Mehra (2017) Omega-CVaR portfolio optimization and its worst case analysis. OR Spectrum 39 (2), pp. 505–539. External Links: Document Cited by: §1.
  • Y. Shi and R. Eberhart (1998) A modified particle swarm optimizer. In Proceedings of the IEEE International Congress on Evolutionary Computation, Anchorage, USA. External Links: Document Cited by: §5.1.
  • H. Soleimani, H. R. Golmakani, and M. H. Salimi (2009) Markowitz-based portfolio selection with minimum transaction lots, cardinality constraints and regarding sector capitalization using genetic algorithm. Expert Systems with Applications 36 (3), pp. 5058–5063. External Links: Document Cited by: §1.
  • L. Van Wallendael, L. Vanneste, Y. Zhang, A. Stevens, and J. De Smedt (2026) Mitigating downside risk: ESG integration in portfolio construction with stock preselection using machine learning and mean-CVaR optimization. The Quarterly Review of Economics and Finance 106 (102118). External Links: Document Cited by: Table 1, §2.
  • L. Wang, Z. Zhang, A. Ishizaka, Y. Wang, and L. Martínez (2023) TODIMSort: a TODIM-based method for sorting problems. Omega 115, pp. 102771. External Links: Document Cited by: §1, §4.3.
  • M. Woodside-Oriakhi, C. Lucas, and J. E. Beasley (2011) Heuristic algorithms for the cardinality constrained efficient frontier. European Journal of Operational Research 213 (3), pp. 538–550. External Links: Document Cited by: §1.
  • Q. Wu, X. Liu, J. Qin, L. Zhou, A. Mardani, and M. Deveci (2022) An integrated generalized TODIM model for portfolio selection based on financial performance of firms. Knowledge-Based Systems 249 (108794). External Links: Document Cited by: §1.
  • P. Xidonas, H. Doukas, and E. Sarmas (2021) A python-based multicriteria portfolio selection DSS. RAIRO - Operations Research 55, pp. S3009–S3034. External Links: Document Cited by: Table 1, §2.
  • X. Zhang, L. Wu, and Z. Chen (2022) Constructing long-short stock portfolio with a new listwise learn-to-rank algorithm. Quantitative Finance 22 (2), pp. 321–331. External Links: Document Cited by: §1.