CC<i>k</i>EL: Compensation-based correlated <i>k</i>-labelsets for classifying imbalanced multi-label data

Qianpeng Xiao; Changbin Shao; Sen Xu; Xibei Yang; Hualong Yu

doi:10.3934/era.2024139

Electronic Research Archive

2024, Volume 32, Issue 5: 3038-3058. doi: 10.3934/era.2024139


Research article


1. School of Computer, Jiangsu University of Science and Technology, Zhenjiang, Jiangsu, China
2. Jiangsu Key Laboratory of Media Design and Software Technology, Jiangnan University, Wuxi, Jiangsu, China
3. School of Information Engineering, Yancheng Institute of Technology, Yancheng, Jiangsu, China

Received: 07 February 2024 Revised: 03 April 2024 Accepted: 11 April 2024 Published: 23 April 2024

Imbalanced data distribution and label correlation are two intrinsic characteristics of multi-label data. This occurs because in this type of data, instances associated with certain labels may be sparse, and some labels may be associated with others, posing a challenge for traditional machine learning techniques. To simultaneously adapt imbalanced data distribution and label correlation, this study proposed a novel algorithm called compensation-based correlated k-labelsets (CCkEL). First, for each label, the CCkEL selects the k-1 strongest correlated labels in the label space to constitute multiple correlated k-labelsets; this improves its efficiency in comparison with the random k-labelsets (RAkEL) algorithm. Then, the CCkEL transforms each k-labelset into a multiclass issue. Finally, it uses a fast decision output compensation strategy to address class imbalance in the decoded multi-label decision space. We compared the performance of the proposed CCkEL algorithm with that of multiple popular multi-label imbalance learning algorithms on 10 benchmark multi-label datasets, and the results show its effectiveness and superiority.


Citation: Qianpeng Xiao, Changbin Shao, Sen Xu, Xibei Yang, Hualong Yu. CCkEL: Compensation-based correlated k-labelsets for classifying imbalanced multi-label data[J]. Electronic Research Archive, 2024, 32(5): 3038-3058. doi: 10.3934/era.2024139



1. Introduction

Parallel machine scheduling has received extensive attention since 1950, given the wide diversity of real-world systems it represents. A variety of criteria have been considered. Among the most studied criteria are makespan (maximum completion time) and total completion time, which can measure the effective utilization of the machines. A second set of criteria is related to meeting due dates and thus takes the system's customers into account. If the criteria are not specified, we can consider two types of general objective functions: min-sum and min-max.

In real production, decision makers may need to consider a number of criteria simultaneously before arriving at a decision. However, it is often the case that different criteria are in conflict. A solution which is optimal with respect to a given criterion might be a poor candidate for some other criterion. Thus, in the last two decades, multicriteria optimization approaches and techniques have been increasingly applied to provide solutions where the criteria are balanced in an acceptable and profitable way ^[1,2].

An important subclass in multicriteria optimization is bicriteria optimization where only two criteria, say ${{\gamma }^{1}}$ and ${{\gamma }^{2}}$ , are considered. There are four popular approaches in the literature:

(a) Positive combination optimization: find a schedule to minimize the positive linear combination of ${{\gamma }^{1}}$ and ${{\gamma }^{2}}$ .

(b) Constrained optimization: find a schedule to minimize ${{\gamma }^{2}}$ under an upper bound on ${{\gamma }^{1}}$ .

(c) Pareto optimization (also called simultaneous optimization): find all Pareto-optimal solutions for ${{\gamma }^{1}}$ and ${{\gamma }^{2}}$ . A feasible schedule $\sigma$ is (strict) Pareto-optimal for ${{\gamma }^{1}}$ and ${{\gamma }^{2}}$ if there is no feasible schedule ${\sigma }'$ such that ${{\gamma }^{1}}({\sigma }')\le {{\gamma }^{1}}(\sigma)$ and ${{\gamma }^{2}}({\sigma }')\le {{\gamma }^{2}}(\sigma)$ , where at least one of the inequalities is strict. The objective vector $({{\gamma }^{1}}(\sigma), {{\gamma }^{2}}(\sigma))$ of a Pareto-optimal schedule $\sigma$ is called a Pareto-optimal point ^[1]. A feasible schedule $\sigma$ is weak Pareto-optimal for ${{\gamma }^{1}}$ and ${{\gamma }^{2}}$ if there is no feasible schedule ${\sigma }'$ such that ${{\gamma }^{1}}({\sigma }') < {{\gamma }^{1}}(\sigma)$ and ${{\gamma }^{2}}({\sigma }') < {{\gamma }^{2}}(\sigma)$ .

(d) Hierarchical optimization (also called lexicographical optimization): find a schedule to minimize ${{\gamma }^{2}}$ among the set of optimal schedules minimizing ${{\gamma }^{1}}$ . In hierarchical bicriteria scheduling problems, the two criteria have different levels of importance, so they are optimized in a lexicographic fashion. Such problems appear naturally in situations where there are several optimal solutions with respect to a specific objective and the decision maker needs to select from among these solutions the one with the best second objective.
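The strict and weak notions of Pareto optimality defined above can be checked mechanically on a finite list of objective vectors. The following is a minimal Python sketch; the function names and the sample points are illustrative assumptions and do not come from the paper.

```python
def dominates(a, b):
    """True if a is no worse than b in both criteria and strictly better in one."""
    return a[0] <= b[0] and a[1] <= b[1] and (a[0] < b[0] or a[1] < b[1])


def strictly_dominates(a, b):
    """True if a is strictly better than b in both criteria."""
    return a[0] < b[0] and a[1] < b[1]


def pareto_points(points):
    """Keep the points that no other point dominates (strict Pareto optimality)."""
    return [p for p in points if not any(dominates(q, p) for q in points)]


def weak_pareto_points(points):
    """Keep the points that no other point beats in both criteria at once."""
    return [p for p in points
            if not any(strictly_dominates(q, p) for q in points)]


# Hypothetical objective vectors (gamma^1, gamma^2), both to be minimized.
points = [(38, 8), (38, 7), (40, 7), (41, 5), (45, 5), (42, 9)]
print(pareto_points(points))
print(weak_pareto_points(points))
```

Every Pareto-optimal point is also weak Pareto-optimal, but not conversely: in the sample data, (38, 8) is weak Pareto-optimal although (38, 7) dominates it.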

In this paper, we consider the bicriteria problem of scheduling equal-processing-time jobs with release dates non-preemptively on identical machines. We apply a Pareto optimization approach to minimize the total completion time (and makespan) and maximum cost simultaneously, and we apply a hierarchical optimization approach to minimize two general min-max criteria hierarchically.

Formally speaking, we are given a set of $n$ jobs, $\mathcal{J} = \{{{J}_{1}}, {{J}_{2}}, \ldots, {{J}_{n}}\}$ , to be processed on $m$ identical machines. The machines run in parallel and each machine can process at most one job at a time. All jobs have the same processing time $p > 0$ . Each job, ${{J}_{j}}\in \mathcal{J}$ , has a release date ${{r}_{j}}\ge 0$ before which it cannot be processed, as well as two cost functions ${{f}_{j}}(t)$ and ${{g}_{j}}(t)$ which denote the costs incurred if the job is completed at time $t$ . We assume that all ${{f}_{j}}$ and ${{g}_{j}}$ are regular, i.e., ${{f}_{j}}$ and ${{g}_{j}}$ are non-decreasing in the job completion times ^[3].

A schedule assigns each job ${{J}_{j}}$ to exactly one machine and specifies its completion time ${{C}_{j}}$ on the machine. Given a schedule $\sigma$ , let ${{f}_{j}}({{C}_{j}}(\sigma))$ and ${{g}_{j}}({{C}_{j}}(\sigma))$ be two scheduling costs of ${{J}_{j}}$ . Then ${{f}_{\max }}(\sigma) = {{\max }_{j}}{{f}_{j}}({{C}_{j}}(\sigma))$ and ${{g}_{\max }}(\sigma) = {{\max }_{j}}{{g}_{j}}({{C}_{j}}(\sigma))$ are two maximum costs of $\sigma$ . Two important special cases of maximum cost are the makespan ${{C}_{\max }}(\sigma) = {{\max }_{j}}\{{{C}_{j}}(\sigma)\}$ and the maximum lateness ${{L}_{\max }}(\sigma) = {{\max }_{j}}\{{{C}_{j}}(\sigma)-{{d}_{j}}\}$ , where $d_j$ denotes the due date of job $J_j$ . We omit the argument $\sigma$ when it is clear to which schedule we are referring.
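As a small illustration of these definitions, the makespan and the maximum lateness can both be obtained from one generic maximum-cost routine by choosing $f_j(t) = t$ and $f_j(t) = t - d_j$ , respectively. The sketch below assumes hypothetical completion times and due dates; the helper name `max_cost` is not from the paper.

```python
def max_cost(completion, cost_fns):
    """Return max_j f_j(C_j) for completion times C_j and cost functions f_j."""
    return max(f(c) for f, c in zip(cost_fns, completion))


# Hypothetical data: completion times C_j and due dates d_j of six jobs.
completion = [4, 4, 5, 8, 8, 9]
due = [4, 4, 3, 2, 1, 1]

# f_j(t) = t gives the makespan; f_j(t) = t - d_j gives the maximum lateness.
identity_costs = [lambda t: t for _ in completion]
lateness_costs = [lambda t, d=d: t - d for d in due]  # d=d binds each due date

c_max = max_cost(completion, identity_costs)
l_max = max_cost(completion, lateness_costs)
print(c_max, l_max)
```

Both cost families are regular in the sense used above, since they are non-decreasing in $t$ .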

The first bicriteria problem we consider is to determine Pareto-optimal schedules which simultaneously minimize the total completion time $\sum\nolimits_{j = 1}^{n}{{{C}_{j}}}$ (and makespan ${{C}_{\max }}$ ) and maximum cost ${{f}_{\max }}$ . Following the notation schemes of ^[1,2,3], it can be denoted by $P|{{r}_{j}}, {{p}_{j}} = p|(\sum\nolimits_{j = 1}^{n}{{{C}_{j}}}, {{f}_{\max }})$ (and $P|{{r}_{j}}, {{p}_{j}} = p|({{C}_{\max }}, {{f}_{\max }})$ ).

The second bicriteria problem we consider is to determine a lexicographical optimal schedule such that the secondary criterion ${{g}_{\max }}$ is minimized under the constraint that the primary criterion ${{f}_{\max }}$ is minimized. Following the notation schemes of ^[1,2,3], it can be denoted by $P|{{r}_{j}}, {{p}_{j}} = p|Lex({{f}_{\max }}, {{g}_{\max }})$ , where the criterion mentioned first in the argument of $Lex$ is the more important one.

For parallel machine scheduling that considers multiple criteria, see the surveys ^[4,5,6]. Examples of Pareto optimization and hierarchical optimization scheduling on parallel machines can be found in ^[7,8,9,10,11] and ^[12,13,14], respectively.

Bruno et al. ^[15] proved that problem $P\vert \vert Lex(\sum\nolimits_{j = 1}^{n}{{{C}_{j}}}, {{C}_{\max }})$ (the jobs have unequal processing times and equal release dates) is NP-hard. Gupta et al. ^[16] further gave a complexity result: they showed that $P\vert \vert Lex(\sum\nolimits_{j = 1}^{n}{{{C}_{j}}}, {{C}_{\max }})$ is strongly NP-hard. Hence, Pareto optimization problem $P\vert \vert (\sum\nolimits_{j = 1}^{n}{{{C}_{j}}}, {{f}_{\max }})$ is also strongly NP-hard. Since the single criterion problem $P\vert \vert {{C}_{\max }}$ is strongly NP-hard ^[17], the lexicographical optimization problem $P\vert \vert Lex({{f}_{\max }}, {{g}_{\max }})$ is strongly NP-hard, too. Also, problem $1|{{r}_{j}}|(\sum\nolimits_{j = 1}^{n}{{{C}_{j}}}, {{f}_{\max }})$ (the single machine case where the jobs have arbitrary processing times and release dates) is strongly NP-hard, due to the strong NP-hardness results by Lenstra et al. ^[18] for problems $1|{{r}_{j}}|\sum\nolimits_{j = 1}^{n}{{{C}_{j}}}$ and $1|{{r}_{j}}|{{L}_{\max }}$ . Thus, we are interested in the special case where all jobs have equal processing times.

Although the problem setting of equal-processing-time jobs appears simple, it captures important aspects of a wide range of applications. For example, in standardized systems in practice, the products consistently have the same processing times. In networking and information systems, transmission packets also often have a constant length ^[19]. Since products and data packets usually arrive dynamically, it is reasonable to consider the jobs with release dates.

For single criterion scheduling, Kravchenko and Werner ^[19,20] surveyed the approaches and exposed the problems with an open complexity status for scheduling jobs with equal processing times on parallel machines. Brucker and Shakhlevich ^[21] characterized optimal schedules for scheduling jobs with unit processing times on parallel machines by providing necessary and sufficient conditions of optimality. Hong et al. ^[22] studied the problem of scheduling jobs with equal processing times and eligibility restrictions on identical machines to minimize total completion time. For the problem with a fixed number of machines, they provided a polynomial time dynamic programming algorithm. For the problem with an arbitrary number of machines, they provided two polynomial time approximation algorithms with approximation ratios of $3/5$ and $1.4$ . Vakhania ^[23] studied the problem of scheduling jobs with equal processing times on identical machines to minimize the maximum delivery completion time, which is defined to be the time by which all jobs are delivered. He presented an algorithm which can be considered as either pseudo-polynomial with time complexity $O({{q}_{\max }}mn\log n)$ or as polynomial with time complexity $O(m\kappa n)$ , where ${{q}_{\max }}$ denotes the maximum delivery time of all jobs and $\kappa < n$ is a parameter which is known only after the termination of the algorithm. The maximum cost minimization problem $P|{{r}_{j}}, {{p}_{j}} = p, {{\bar{d}}_{j}}|{{f}_{\max }}$ can be solved by the polynomial time algorithm developed in ^[19] by Kravchenko and Werner, where ${{\bar{d}}_{j}}$ denotes the deadline of job ${{J}_{j}}$ before which ${{J}_{j}}$ must be completed in any feasible schedule. Vakhania and Werner ^[24] studied the problem of scheduling jobs with equal processing times on uniform machines (processing jobs at different speeds) to minimize the maximum delivery completion time. For this problem, whose complexity status has remained open for a long time, they presented an $O(\lambda {{m}^{2}}n\log n)$ -time algorithm which is optimal under an explicitly stated special condition, where $\lambda$ can be either of the magnitudes $n$ or ${{q}_{\max }}$ .

Tuzikov et al. ^[25] studied the bicriteria problems of scheduling jobs with equal processing times on uniform machines, denoted as $Q\vert {{p}_{j}} = p\vert({{f}_{\max }}, {{g}_{\max }})$ and $Q\vert{{p}_{j}} = p\vert(\sum\nolimits_{j = 1}^{n}{{{f}_{j}}}, {{g}_{\max }})$ , where ${{f}_{\max }}$ and ${{g}_{\max }}$ are two min-max criteria, and $\sum\nolimits_{j = 1}^{n}{{{f}_{j}}}$ is a min-sum criterion. They showed that problems $Q\vert {{p}_{j}} = p\vert ({{f}_{\max }}, {{g}_{\max }})$ and $Q\vert {{p}_{j}} = p\vert (\sum\nolimits_{j = 1}^{n}{{{f}_{j}}}, {{g}_{\max }})$ can be solved iteratively in $O({{n}^{4}})$ and $O({{n}^{5}})$ time, respectively. Note that they discussed only the case of equal release dates. In this paper, we apply the framework used in ^[25] and extend the results in ^[25] to deal with release dates. In fact, the main contribution of this paper is the incorporation of the release dates and general maximum costs into the problem.

Sarin and Prakash ^[26] studied the lexicographical optimization problem of scheduling jobs with equal processing times and equal release dates on identical machines for various pairwise combinations of primary and secondary criteria $f$ and $g$ , where $f, g\in \{{{T}_{\max }}, \sum\nolimits_{j}{{{T}_{j}}}, \sum\nolimits_{j}{{{U}_{j}}}, \sum\nolimits_{j}{{{C}_{j}}}, \sum\nolimits_{j}{{{w}_{j}}{{C}_{j}}}\}$ . (Please refer to ^[3] for the definitions.) Apart from $P\vert {{p}_{j}} = p\vert Lex(\sum\nolimits_{j}{{{U}_{j}}}, \sum\nolimits_{j}{{{w}_{j}}{{C}_{j}}})$ whose computational complexity was left open in ^[26], all other problems $P\vert {{p}_{j}} = p\vert Lex(f, g)$ studied in ^[26] are solvable in polynomial time. Zhao and Yuan ^[27] revisited the bicriteria problems of scheduling jobs with equal processing times on uniform machines. They presented a comprehensive study on the problems with respect to various regular criteria. Particularly, they obtained an $O({{n}^{3}})$ -time algorithm for $P\vert {{p}_{j}} = p\vert Lex(\sum\nolimits_{j}{{{U}_{j}}}, \sum\nolimits_{j}{{{w}_{j}}{{C}_{j}}})$ , solving the open problem posed in ^[26].

As for the parallel machines case with release dates and equal processing times, Simons ^[28] proposed the first polynomial algorithm running in $O({{n}^{3}}\log \log n)$ time for $P\vert {{r}_{j}}, {{p}_{j}} = p, {{\bar{d}}_{j}}\vert \sum\nolimits_{j = 1}^{n}{{{C}_{j}}}$ . (The algorithm also solves the feasibility problem $P|{{r}_{j}}, {{p}_{j}} = p, {{\bar{d}}_{j}}|-$ .) Simons and Warmuth ^[29] further improved this bound to $O(m{{n}^{2}})$ . For the same problem, Dürr and Hurand ^[30] and López-Ortiz and Quimper ^[31] gave algorithms that run in $O({{n}^{3}})$ and $O(\min \{1, p/m\}{{n}^{2}})$ time, respectively. These schedules all minimize both the objectives $\sum\nolimits_{j = 1}^{n}{{{C}_{j}}}$ and ${{C}_{\max }}$ . Since the maximum lateness ${{L}_{\max }}$ is upper-bounded by $\left\lceil np/m \right\rceil$ , Fahimi and Quimper ^[32] remarked that problems $P\vert {{r}_{j}}, {{p}_{j}} = p\vert (\sum\nolimits_{j = 1}^{n}{{{C}_{j}}}, {{L}_{\max }})$ and $P\vert {{r}_{j}}, {{p}_{j}} = p\vert ({{C}_{\max }}, {{L}_{\max }})$ can be solved in polynomial time with time complexity $O(\log (np/m)\min \{1, p/m\}{{n}^{2}})$ by using a binary search that calls the algorithm in ^[31] at most $\log (np/m)$ times. They also extended the algorithm presented in ^[31] for $P\vert {{r}_{j}}, {{p}_{j}} = p, {{\bar{d}}_{j}}\vert \sum\nolimits_{j = 1}^{n}{{{C}_{j}}}$ to solve a variation of the problem where the number of machines fluctuates over time. They further proved that minimizing the total cost of the jobs, i.e., $\sum\nolimits_{j = 1}^{n}{{{f}_{j}}({{S}_{j}})}$ , for arbitrary functions ${{f}_{j}}(t)$ is NP-hard, where $S_j$ denotes the start time of job $J_j$ . They then specialized this objective function to the case that it is merely contingent on the time and showed that although this case is pseudo-polynomially solvable, one can derive polynomial time algorithms for either a monotonic or periodic cost function.

To the best of our knowledge, problems $P\vert {{r}_{j}}, {{p}_{j}} = p\vert (\sum\nolimits_{j = 1}^{n}{{{C}_{j}}}, {{f}_{\max }})$ and $P\vert {{r}_{j}}, {{p}_{j}} = p\vert Lex({{f}_{\max }}, {{g}_{\max }})$ have not been studied to date. Note that here ${{f}_{\max }}$ and ${{g}_{\max }}$ are two general min-max criteria. The above-mentioned results ^[28,29,30,31,32] discussed only $\sum\nolimits_{j = 1}^{n}{{{C}_{j}}}$ , ${{C}_{\max }}$ or ${{L}_{\max }}$ .

In Section 2, we present an $O({{n}^{3}})$ -time algorithm for $P\vert {{r}_{j}}, {{p}_{j}} = p\vert (\sum\nolimits_{j = 1}^{n}{{{C}_{j}}}, {{f}_{\max }})$ . The algorithm also solves problem $P\vert {{r}_{j}}, {{p}_{j}} = p\vert ({{C}_{\max }}, {{f}_{\max }})$ . Consequently, problem $P|{{r}_{j}}, {{p}_{j}} = p|{{f}_{\max }}$ can be solved in $O({{n}^{3}})$ time, which has its own independent interest. In Section 3, we adapt the algorithm to solve $P\vert {{r}_{j}}, {{p}_{j}} = p\vert Lex({{f}_{\max }}, {{g}_{\max }})$ in $O({{n}^{3}})$ time. The final generated schedule also has the minimum total completion time and minimum makespan among the lexicographical optimal schedules for ${{f}_{\max }}$ and ${{g}_{\max }}$ . Finally, we draw some concluding remarks in the last section.

2. An algorithm for $P\vert {{r}_{j}}, {{p}_{j}}=p\vert(\sum\nolimits_{j=1}^{n}{{{C}_{j}}}, {{f}_{\max }})$

In this section we will present an $O({{n}^{3}})$ -time algorithm for $P\vert {{r}_{j}}, {{p}_{j}} = p\vert (\sum\nolimits_{j = 1}^{n}{{{C}_{j}}}, {{f}_{\max }})$ . As a by-product, the last schedule constructed by the algorithm is optimal for $P\vert {{r}_{j}}, {{p}_{j}} = p\vert {{f}_{\max }}$ .

For ease of discussion, throughout the paper, we always represent a feasible schedule $\sigma$ by a sequence ${{J}_{\sigma (1)}}, {{J}_{\sigma (2)}}, \cdots, {{J}_{\sigma (n)}}$ of jobs, where ${{J}_{\sigma (i)}}$ is the job scheduled at the $i$ -th position in $\sigma$ , $i = 1, 2, \ldots, n$ . The positions in $\sigma$ are indexed from 1 to $n$ in non-decreasing order of their start times in $\sigma$ (ties broken in favor of the job on the machine with the smallest index).

Intuitively, if a job has a large cost when it completes late, then we need to move it to the left (i.e., start it earlier) to decrease its cost, even if it has a large release date. Therefore, it often happens that in a feasible schedule, some jobs with larger release dates start earlier than some jobs with smaller release dates.

To recover a schedule from its job sequence representation, we need the following lemma whose proof can be found in ^[28]. Though simple, this lemma plays a non-negligible role in our algorithms. It allows us to focus on the positions of the jobs in a schedule; we do not worry about their release dates.

Let ${{S}_{\sigma (i)}}$ denote the start time of ${{J}_{\sigma (i)}}$ in $\sigma = ({{J}_{\sigma (1)}}, {{J}_{\sigma (2)}}, \cdots, {{J}_{\sigma (n)}})$ , $i = 1, 2, \ldots, n$ .

Lemma 2.1. (^[28]) For any feasible schedule, a solution $\sigma$ identical except in machine assignment exists and is cyclic, i.e., for any $i$ , ${{J}_{\sigma (i)}}, {{J}_{\sigma (i+m)}}, \ldots$ are scheduled on the same machine. Moreover, ${{S}_{\sigma (1)}} = {{r}_{\sigma (1)}}$ , ${{S}_{\sigma (i)}} = \max \{{{S}_{\sigma (i-1)}}, {{r}_{\sigma (i)}}\}$ ( $i = 2, 3, \ldots, m$ ) and ${{S}_{\sigma (i)}} = \max \{{{S}_{\sigma (i-1)}}, {{r}_{\sigma (i)}}, {{S}_{\sigma (i-m)}}+p\}$ ( $i = m+1, m+2, \ldots, n$ ).
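The recursion in Lemma 2.1 translates directly into code. The following minimal sketch uses 0-based positions (so position $i$ in the lemma is index $i-1$ here) and assumes the release dates are already listed in sequence order; the names `start_times` and `machine_of` are illustrative, not from the paper.

```python
def start_times(release_in_order, m, p):
    """Start times of a cyclic schedule given release dates in position order.

    Position i (0-based) runs on machine i % m, so the job m positions
    earlier is its predecessor on the same machine.
    """
    starts = []
    for i, r in enumerate(release_in_order):
        s = r                                  # cannot start before release
        if i >= 1:
            s = max(s, starts[i - 1])          # start times are non-decreasing
        if i >= m:
            s = max(s, starts[i - m] + p)      # wait for the same machine
        starts.append(s)
    return starts


def machine_of(position, m):
    """Machine index of a 0-based position under the cyclic assignment."""
    return position % m


# Hypothetical instance: six jobs, three machines, processing time 4.
print(start_times([0, 0, 1, 2, 3, 3], m=3, p=4))
```

Each start time depends only on two earlier entries, so recovering a schedule from its sequence takes $O(n)$ time.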

The $\varepsilon$ -constraint method (see, e.g., ^[1,2]) provides a general way to find Pareto-optimal points: let $y$ be the optimal value of constrained optimization problem $\alpha |f\le \hat{x}|g$ , and let $x$ be the optimal value of constrained optimization problem $\alpha |g\le y|f$ . Then $(x, y)$ is a Pareto-optimal point for problem $\alpha ||(f, g)$ .
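To make the two-stage structure of the $\varepsilon$ -constraint method concrete, the sketch below applies it to a finite, explicitly listed set of objective vectors $(f, g)$ . This is only an illustration under that assumption: in the scheduling problems considered here the feasible set is not enumerated, and the function name and sample points are hypothetical.

```python
def epsilon_constraint(points, x_hat):
    """Return a Pareto-optimal point (x, y) found from the bound f <= x_hat.

    Stage 1: y = min g subject to f <= x_hat.
    Stage 2: x = min f subject to g <= y.
    Returns None if no point satisfies f <= x_hat.
    """
    feasible = [(f, g) for f, g in points if f <= x_hat]
    if not feasible:
        return None
    y = min(g for _, g in feasible)
    x = min(f for f, g in points if g <= y)
    return (x, y)


# Hypothetical objective vectors (f, g), both to be minimized.
points = [(38, 8), (38, 7), (40, 7), (41, 5), (45, 5), (42, 9)]
print(epsilon_constraint(points, x_hat=40))
print(epsilon_constraint(points, x_hat=45))
```

Varying the bound `x_hat` yields different Pareto-optimal points, which is how the method is used repeatedly to trace the whole Pareto set.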

The algorithm follows the framework used in ^[25] which repeatedly uses the $\varepsilon$ -constraint method to construct the Pareto-optimal schedules. Please see Figure 1 for an illustration. Similar figures and illustrations can be found, e.g., in ^[25,33]. All circles (white solid, black solid and white dashed) in Figure 1 represent weak Pareto-optimal points (schedules), but only the black solid circles represent Pareto-optimal points (schedules). The weak Pareto-optimal schedules ( ${{\sigma }_{1}}, {{\sigma }_{2}}, {{\sigma }_{3}}, \ldots$ , which are generated in turn in Algorithm ${{M}_{1}}$ ) are constructed in strictly decreasing order of their ${{f}_{\max }}$ -values, and within that order in non-decreasing order of their $\sum\nolimits_{j = 1}^{n}{{{C}_{j}}}$ -values. Since the constraint ${{f}_{\max }} < y$ is used instead of ${{f}_{\max }}\le y$ , all white dashed circles will be ignored by Algorithm ${{M}_{1}}$ . The Pareto-optimal schedules output by Algorithm ${{M}_{1}}$ are ${{\pi }_{1}}, {{\pi }_{2}}, {{\pi }_{3}}, \ldots$ . The last Pareto-optimal schedule has the minimum ${{f}_{\max }}$ -value.

Figure 1. Illustration of Algorithm ${{M}_{1}}$ .

Let $\Omega (\mathcal{J})$ denote the Pareto set which consists of all Pareto-optimal points together with their corresponding Pareto-optimal schedules. Let $\Pi \left(\mathcal{J} \right)$ denote the set of all feasible schedules for $\mathcal{J}$ . Let $\Pi \left(\mathcal{J}, y \right)\subseteq \Pi \left(\mathcal{J} \right)$ denote the set of the schedules with maximum cost ( ${{f}_{\max }}$ -value) less than $y$ , where $y$ denotes a given threshold value. Obviously, we have that $\Pi \left(\mathcal{J}, +\infty \right) = \Pi \left(\mathcal{J} \right)$ .

Below is the algorithm called Algorithm ${{M}_{1}}$ for constructing the Pareto set $\Omega (\mathcal{J})$ for $P|{{r}_{j}}, {{p}_{j}} = p|(\sum\nolimits_{j = 1}^{n}{{{C}_{j}}}, {{f}_{\max }})$ . It first assigns the unassigned job with the largest release date to the $i$ -th position ( $i = n, n-1, \ldots, 1$ ), ignoring the scheduling cost ${{f}_{\max }}$ (see the initial schedule $\hat{\sigma }$ below). It then repeatedly decreases the ${{f}_{\max }}$ -value of the current schedule until the cost cannot be further improved. During the process, all Pareto-optimal schedules are constructed one by one.

The initial schedule is $\hat{\sigma } = \{{{J}_{\hat{\sigma }(1)}}, {{J}_{\hat{\sigma }(2)}}, \ldots, {{J}_{\hat{\sigma }(n)}}\}$ , where the jobs ${{J}_{\hat{\sigma }(1)}}, {{J}_{\hat{\sigma }(2)}}, \ldots, {{J}_{\hat{\sigma }(n)}}$ are in non-decreasing order of their release dates (ties broken arbitrarily). Note that this order is also the non-decreasing order of their start times in $\hat{\sigma }$ (ties broken in favor of the job on the machine with the smallest index). It is easy to see that $\hat{\sigma }$ is optimal for $P\vert {{r}_{j}}, {{p}_{j}} = p\vert \sum\nolimits_{j = 1}^{n}{{{C}_{j}}}$ and $P\vert {{r}_{j}}, {{p}_{j}} = p\vert {{C}_{\max }}$ . (Set the start times of the jobs in $\hat{\sigma }$ by Lemma 2.1.)
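The construction of the initial schedule can be sketched as a stable sort by release date followed by the start-time recursion of Lemma 2.1. The code below is a minimal illustration with 0-based job indices; the instance data and the helper names are assumptions made for the example.

```python
def initial_schedule(release):
    """Job indices in non-decreasing order of release date (stable sort)."""
    return sorted(range(len(release)), key=lambda j: release[j])


def completion_times(seq, release, m, p):
    """Completion times, in position order, via the recursion of Lemma 2.1."""
    starts = []
    for i, j in enumerate(seq):
        s = release[j]
        if i >= 1:
            s = max(s, starts[i - 1])
        if i >= m:
            s = max(s, starts[i - m] + p)
        starts.append(s)
    return [s + p for s in starts]


# Hypothetical instance: release dates of six jobs, m = 3 machines, p = 4.
release = [2, 0, 3, 1, 0, 3]
seq = initial_schedule(release)
comp = completion_times(seq, release, m=3, p=4)
total_completion, makespan = sum(comp), max(comp)
print(seq, comp, total_completion, makespan)
```

Python's `sorted` is stable, so jobs with equal release dates keep their input order, which is one valid way to break ties arbitrarily.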

The basic idea of our algorithms is as follows: schedule the jobs backwardly (from the right to the left) and at each decision point always select the job with the largest release date from among the candidate jobs (check the choice of $\hat{\sigma }$ in Step 1 of Algorithm ${{M}_{1}}$ and Step 3 of Procedure ${{A}_{1}}(\ldots)$ , as well as the choice of ${{\pi }^{*}}$ in Step 1 of Algorithm ${{M}_{2}}$ and Step 3 of Procedure ${{A}_{2}}(\ldots)$ ). To coincide with this idea, we treat the initial schedule $\hat{\sigma } = \{{{J}_{\hat{\sigma }(1)}}, {{J}_{\hat{\sigma }(2)}}, \cdots, {{J}_{\hat{\sigma }(n)}}\}$ as a schedule in which the jobs ${{J}_{\hat{\sigma }(n)}}, {{J}_{\hat{\sigma }(n-1)}}, \ldots, {{J}_{\hat{\sigma }(1)}}$ are in non-increasing order of their release dates.

Algorithm ${{M}_{1}}$ :

Input: An instance of $P|{{r}_{j}}, {{p}_{j}} = p|(\sum\nolimits_{j = 1}^{n}{{{C}_{j}}}, {{f}_{\max }})$ .

Output: The Pareto set $\Omega (\mathcal{J})$ .

Step 1. Initially, set $s = 1$ . Let ${{\sigma }_{s}} = \{{{J}_{{{\sigma }_{s}}(1)}}, {{J}_{{{\sigma }_{s}}(2)}}, \cdots, {{J}_{{{\sigma }_{s}}(n)}}\} = \hat{\sigma }$ , ${{y}_{s}} = {{f}_{\max }}({{\sigma }_{s}})$ . Let $\Omega (\mathcal{J}) = \varnothing$ , $k = 0$ .

Step 2. Run Procedure ${{A}_{1}}({{y}_{s}})$ to get a schedule ${{\sigma }_{s+1}}$ , using ${{\sigma }_{s}}$ as the input schedule.

Step 3. If ${{\sigma }_{s+1}}\ne \varnothing$ , then do the following:

(i) Set ${{y}_{s+1}} = {{f}_{\max }}({{\sigma }_{s+1}})$ .

(ii) If $\sum\nolimits_{j = 1}^{n}{{{C}_{j}}}({{\sigma }_{s}}) < \sum\nolimits_{j = 1}^{n}{{{C}_{j}}}({{\sigma }_{s+1}})$ , then set $k = k+1$ and ${{\pi }_{k}} = {{\sigma }_{s}}$ . Incorporate $(\sum\nolimits_{j = 1}^{n}{{{C}_{j}}}({{\pi }_{k}}), {{f}_{\max }}({{\pi }_{k}}), {{\pi }_{k}})$ into $\Omega (\mathcal{J})$ .

(iii) Set $s = s+1$ . Go to Step 2.

Step 4. If ${{\sigma }_{s+1}} = \varnothing$ , then set $k = k+1$ and ${{\pi }_{k}} = {{\sigma }_{s}}$ . Incorporate $(\sum\nolimits_{j = 1}^{n}{{{C}_{j}}}({{\pi }_{k}}), {{f}_{\max }}({{\pi }_{k}}), {{\pi }_{k}})$ into $\Omega (\mathcal{J})$ and return $\Omega (\mathcal{J})$ .
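The control flow of Steps 1 to 4 can be sketched independently of how the improving schedule is found. In the sketch below, `evaluate` and `improve` are hypothetical callables supplied by the caller: `evaluate(schedule)` returns the pair (total completion time, $f_{\max}$ ), and `improve(schedule, y)` plays the role of the procedure called in Step 2, returning a schedule of minimum total completion time among those with $f_{\max} < y$ , or `None` if none exists. The toy lookup table at the bottom is a stand-in for illustration only.

```python
def algorithm_m1(initial, evaluate, improve):
    """Return the Pareto set as a list of (total, f_max, schedule) triples."""
    pareto = []
    sigma = initial
    total, y = evaluate(sigma)
    while True:
        nxt = improve(sigma, y)              # Step 2
        if nxt is None:                      # Step 4: no schedule with f_max < y
            pareto.append((total, y, sigma))
            return pareto
        nxt_total, nxt_y = evaluate(nxt)     # Step 3(i)
        if total < nxt_total:                # Step 3(ii): sigma is Pareto-optimal
            pareto.append((total, y, sigma))
        sigma, total, y = nxt, nxt_total, nxt_y   # Step 3(iii)


# Toy stand-in: three abstract schedules with known objective values.
table = {"a": (38, 8), "b": (38, 7), "c": (41, 5)}


def toy_evaluate(s):
    return table[s]


def toy_improve(s, y):
    ok = [k for k in table if table[k][1] < y]
    return min(ok, key=lambda k: table[k]) if ok else None


result = algorithm_m1("a", toy_evaluate, toy_improve)
print(result)
```

The loop terminates as long as `improve` only returns schedules whose $f_{\max}$ -value is strictly below the threshold, because the threshold then decreases in every round.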

Procedure ${{A}_{1}}({{y}_{s}})$ :

Input: Schedule ${{\sigma }_{s}} = \{{{J}_{{{\sigma }_{s}}(1)}}, {{J}_{{{\sigma }_{s}}(2)}}, \cdots, {{J}_{{{\sigma }_{s}}(n)}}\}$ with ${{y}_{s}} = {{f}_{\max }}({{\sigma }_{s}})$ .

Output: Schedule ${{\sigma }_{s+1}} = \{{{J}_{{{\sigma }_{s+1}}(1)}}, {{J}_{{{\sigma }_{s+1}}(2)}}, \cdots, {{J}_{{{\sigma }_{s+1}}(n)}}\}$ which has the minimum total completion time (and minimum makespan) among all schedules in $\Pi \left(\mathcal{J}, {{y}_{s}} \right)$ .

Step 1. Initially, set $h = 0$ . Let ${{\sigma }^{h}} = \{{{J}_{{{\sigma }^{h}}(1)}}, {{J}_{{{\sigma }^{h}}(2)}}, \cdots, {{J}_{{{\sigma }^{h}}(n)}}\}$ , where ${{\sigma }^{h}}(i) = {{\sigma }_{s}}(i)$ , $i = 1, 2, \ldots, n$ .

Step 2. Set the start times of the jobs in ${{\sigma }^{h}}$ (by Lemma 2.1): ${{S}_{{{\sigma }^{h}}(1)}} = {{r}_{{{\sigma }^{h}}(1)}}$ ; for $i = 2, 3, \ldots, m$ , let ${{S}_{{{\sigma }^{h}}(i)}} = \max \{{{S}_{{{\sigma }^{h}}(i-1)}}, {{r}_{{{\sigma }^{h}}(i)}}\}$ ; for $i = m+1, m+2, \ldots, n$ , let ${{S}_{{{\sigma }^{h}}(i)}} = \max \{{{S}_{{{\sigma }^{h}}(i-1)}}, {{r}_{{{\sigma }^{h}}(i)}}, {{S}_{{{\sigma }^{h}}(i-m)}}+p\}$ . Update the cost ${{f}_{j}}({{C}_{j}}({{\sigma }^{h}}))$ of job $j$ in ${{\sigma }^{h}}$ , ${{J}_{j}}\in \mathcal{J}$ .

Step 3. Adjust ${{\sigma }^{h}}$ :

IF for all $i$ , the inequality ${{f}_{{{\sigma }^{h}}(i)}}({{C}_{{{\sigma }^{h}}(i)}}) < {{y}_{s}}$ holds, THEN Return ${{\sigma }_{s+1}} = {{\sigma }^{h}}$ .

ELSE Pick a job ${{J}_{{{\sigma }^{h}}(i)}}$ such that ${{f}_{{{\sigma }^{h}}(i)}}({{C}_{{{\sigma }^{h}}(i)}})\ge {{y}_{s}}$ . Let $E({{\sigma }^{h}}(i)) = \{l|1\le l\le i\wedge {{f}_{{{\sigma }^{h}}(l)}}({{C}_{{{\sigma }^{h}}(i)}}) < {{y}_{s}}\}$ denote the set of the candidate jobs at time ${{C}_{{{\sigma }^{h}}(i)}}$ .

IF $E({{\sigma }^{h}}(i)) = \varnothing$ , THEN Return ${{\sigma }_{s+1}} = \varnothing$ .

ELSE Find the job with the largest release date in $E({{\sigma }^{h}}(i))$ , say ${{J}_{{{\sigma }^{h}}(e)}}$ . Let ${{J}_{{{\sigma }^{h}}(e)}}$ be scheduled at the $i$ -th position instead of ${{J}_{{{\sigma }^{h}}(i)}}$ . Set ${{J}_{x}} = {{J}_{{{\sigma }^{h}}(i)}}$ . For $q = i-1, i-2, \ldots, e+1$ (this ordering is used crucially), let ${{J}_{{{\sigma }^{h}}(q)}}$ be the job in $\{{{J}_{{{\sigma }^{h}}(q)}}, {{J}_{x}}\}$ with the larger release date, and let ${{J}_{x}}$ be the other job. Finally, let ${{J}_{x}}$ be scheduled at the $e$ -th position.

Let ${{\sigma }^{h+1}} = {{\sigma }^{h}}$ and then set $h = h+1$ . Go to Step 2.
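The procedure above can be sketched in Python as follows. Several details are assumptions of this sketch rather than statements of the paper: positions and job indices are 0-based, the violating job picked is the rightmost one, and ties for the largest release date among candidates are broken in favor of the rightmost position. The instance at the bottom (three machines, six jobs, processing time 4, lateness costs) is illustrative.

```python
def completion_times(seq, release, m, p):
    """Completion times in position order, via the recursion of Lemma 2.1."""
    starts = []
    for i, j in enumerate(seq):
        s = release[j]
        if i >= 1:
            s = max(s, starts[i - 1])
        if i >= m:
            s = max(s, starts[i - m] + p)
        starts.append(s)
    return [s + p for s in starts]


def procedure_a1(seq, release, cost_fns, m, p, y):
    """Return a sequence with every job cost below y, or None if stuck."""
    seq = list(seq)
    while True:
        comp = completion_times(seq, release, m, p)            # Step 2
        violating = [i for i, j in enumerate(seq)
                     if cost_fns[j](comp[i]) >= y]
        if not violating:
            return seq                                         # all costs < y
        i = violating[-1]              # assumption: pick the rightmost violator
        candidates = [l for l in range(i)
                      if cost_fns[seq[l]](comp[i]) < y]
        if not candidates:
            return None                                        # E is empty
        # Candidate with the largest release date (ties: rightmost position).
        e = max(candidates, key=lambda l: (release[seq[l]], l))
        x = seq[i]                     # job in hand
        seq[i] = seq[e]
        for q in range(i - 1, e, -1):  # q = i-1, ..., e+1, in this order
            if release[x] > release[seq[q]]:
                seq[q], x = x, seq[q]  # keep the larger release date at q
        seq[e] = x


release = [0, 0, 1, 2, 3, 3]
due = [4, 4, 3, 2, 1, 1]
lateness = [lambda t, d=d: t - d for d in due]
sigma_2 = procedure_a1([0, 1, 2, 3, 4, 5], release, lateness, m=3, p=4, y=8)
sigma_3 = procedure_a1(sigma_2, release, lateness, m=3, p=4, y=7)
print(sigma_2, sigma_3)
```

On this instance the first call moves the last job to the fourth position, and the second call returns `None`: with threshold 7 the sketch finds no schedule whose maximum lateness is below 7.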

Remark 1. Let us illustrate Step 3 of Procedure ${{A}_{1}}({{y}_{s}})$ in more detail. Suppose that we find a job ${{J}_{{{\sigma }^{h}}(i)}}$ violating its inequality. We remove ${{J}_{{{\sigma }^{h}}(i)}}$ from position $i$ and select the candidate job ${{J}_{{{\sigma }^{h}}(e)}}$ from $E({{\sigma }^{h}}(i))$ which has the largest release date. Let ${{J}_{{{\sigma }^{h}}(e)}}$ be scheduled at position $i$ . Treat ${{J}_{{{\sigma }^{h}}(i)}}$ as the job in hand, denoted by ${{J}_{x}}$ , for which we need to find a suitable position. We compare ${{J}_{x}}$ and ${{J}_{{{\sigma }^{h}}(i-1)}}$ , and let the one with the larger release date be scheduled at position $i-1$ . Let ${{J}_{x}}$ be the other job. As will be seen in Lemma 2.2 below, the job at position $i-1$ now has a release date that is not less than the largest release date among the candidate jobs in $E({{\sigma }^{h}}(i-1))$ . Its cost may not be less than ${{y}_{s}}$ at time ${{C}_{{{\sigma }^{h}}(i-1)}}$ . However, we do not worry about this possibility, since the job can be moved further to the left in the next iterations. We continue to compare ${{J}_{x}}$ and ${{J}_{{{\sigma }^{h}}(i-2)}}$ , and so on. In this way, we can find suitable positions for jobs ${{J}_{{{\sigma }^{h}}(i)}}, {{J}_{{{\sigma }^{h}}(i-1)}}, \ldots, {{J}_{{{\sigma }^{h}}(e+1)}}$ . Thus, we accomplish the idea mentioned before: schedule the jobs backwardly and at each decision point always select the job with the largest release date from among the candidate jobs.

Example: We now give an example to illustrate Algorithm ${{M}_{1}}$ . There are three machines and six jobs ${{J}_{1}}, {{J}_{2}}, \ldots, {{J}_{6}}$ with processing time $p = 4$ , where ${{r}_{1}} = {{r}_{2}} = 0$ , ${{r}_{3}} = 1$ , ${{r}_{4}} = 2$ , ${{r}_{5}} = {{r}_{6}} = 3$ ; ${{d}_{1}} = {{d}_{2}} = 4$ , ${{d}_{3}} = 3$ , ${{d}_{4}} = 2$ , ${{d}_{5}} = {{d}_{6}} = 1$ . The maximum cost considered is the maximum lateness ${{L}_{\max }}$ . Algorithm ${{M}_{1}}$ works as follows:

(1) ${{\sigma }_{1}} = \hat{\sigma } = \{{{J}_{1}}, {{J}_{2}}, {{J}_{3}}, {{J}_{4}}, {{J}_{5}}, {{J}_{6}}\}$ , where the jobs ${{J}_{1}}, {{J}_{2}}, \ldots, {{J}_{6}}$ are in non-decreasing order of their release dates. The schedule is recovered by Lemma 2.1 with $\sum{{{C}_{j}}({{\sigma }_{1}})} = 38$ and ${{L}_{\max }}({{\sigma }_{1}}) = 8$ .

(2) Run Procedure ${{A}_{1}}({{y}_{1}})$ , where ${{y}_{1}} = 8$ .

Initially, ${{\sigma }^{0}} = {{\sigma }_{1}} = \{{{J}_{1}}, {{J}_{2}}, {{J}_{3}}, {{J}_{4}}, {{J}_{5}}, {{J}_{6}}\}$ . The job violating the inequality is ${{J}_{6}}$ . The set of the candidate jobs at time ${{C}_{6}}$ is $E({{\sigma }^{0}}(6)) = \{{{J}_{1}}, {{J}_{2}}, {{J}_{3}}, {{J}_{4}}\}$ . Since ${{J}_{4}}$ has the largest release date among the jobs in $E({{\sigma }^{0}}(6))$ , it is scheduled at the sixth position instead of ${{J}_{6}}$ . Job ${{J}_{6}}$ becomes the job in hand. We compare ${{J}_{6}}$ and ${{J}_{5}}$ . Since ${{r}_{5}}\ge {{r}_{6}}$ , ${{J}_{5}}$ stays at the fifth position. We continue to consider the fourth position. The fourth position is not occupied because ${{J}_{4}}$ has been moved from this position to the sixth position. Therefore, ${{J}_{6}}$ is scheduled at the fourth position. Set the start times of the jobs by Lemma 2.1. We get: ${{\sigma }^{1}} = \{{{J}_{1}}, {{J}_{2}}, {{J}_{3}}, {{J}_{6}}, {{J}_{5}}, {{J}_{4}}\}$ with $\sum{{{C}_{j}}}({{\sigma }^{1}}) = 38$ and ${{L}_{\max }}({{\sigma }^{1}}) = 7$ . Since ${{L}_{\max }}({{\sigma }_{1}}) = 8 > 7 = {{L}_{\max }}({{\sigma }^{1}})$ , Procedure ${{A}_{1}}({{y}_{1}})$ returns ${{\sigma }_{2}} = {{\sigma }^{1}} = \{{{J}_{1}}, {{J}_{2}}, {{J}_{3}}, {{J}_{6}}, {{J}_{5}}, {{J}_{4}}\}$ with $\sum{{{C}_{j}}}({{\sigma }_{2}}) = 38$ and ${{L}_{\max }}({{\sigma }_{2}}) = 7$ . Since $\sum{{{C}_{j}}({{\sigma }_{1}})} = 38 = \sum{{{C}_{j}}({{\sigma }_{2}})}$ , by Step 3 of Algorithm ${{M}_{1}}$ , we get rid of ${{\sigma }_{1}}$ since it cannot be a Pareto-optimal schedule.

(3) Run Procedure ${{A}_{1}}({{y}_{2}})$ , where ${{y}_{2}} = 7$ .

(i) Initially, ${{\sigma }^{0}} = {{\sigma }_{2}} = \{{{J}_{1}}, {{J}_{2}}, {{J}_{3}}, {{J}_{6}}, {{J}_{5}}, {{J}_{4}}\}$ . The jobs violating the inequalities are ${{J}_{4}}, {{J}_{5}}, {{J}_{6}}$ . The set of the candidate jobs at time ${{C}_{4}}$ is $E({{\sigma }^{0}}(6)) = \{{{J}_{1}}, {{J}_{2}}, {{J}_{3}}\}$ . Since ${{J}_{3}}$ has the largest release date among the jobs in $E({{\sigma }^{0}}(6))$ , it is scheduled at the sixth position instead of ${{J}_{4}}$ . Job ${{J}_{4}}$ becomes the job in hand. We compare ${{J}_{4}}$ and ${{J}_{5}}$ . Next, we compare ${{J}_{4}}$ and ${{J}_{6}}$ , and then decide to schedule ${{J}_{4}}$ at the third position. Set the start times of the jobs by Lemma 2.1. We get ${{\sigma }^{1}} = \{{{J}_{1}}, {{J}_{2}}, {{J}_{4}}, {{J}_{6}}, {{J}_{5}}, {{J}_{3}}\}$ with $\sum{{{C}_{j}}({{\sigma }^{1}})} = 40$ and ${{L}_{\max }}({{\sigma }^{1}}) = 7$ .

(ii) Now, the jobs violating the inequalities are ${{J}_{3}}, {{J}_{5}}, {{J}_{6}}$ . We select ${{J}_{3}}$ as the job in hand and adjust ${{\sigma }^{1}}$ . We get ${{\sigma }^{2}} = \{{{J}_{1}}, {{J}_{3}}, {{J}_{4}}, {{J}_{6}}, {{J}_{5}}, {{J}_{2}}\}$ with $\sum{{{C}_{j}}}({{\sigma }^{2}}) = 42$ and ${{L}_{\max }}({{\sigma }^{2}}) = 8$ .

(iii) Since ${{y}_{2}} = 7$ , the jobs violating the inequalities are ${{J}_{5}}, {{J}_{6}}$ . We select ${{J}_{5}}$ as the job in hand and adjust ${{\sigma }^{2}}$ . We get: ${{\sigma }^{3}} = \{{{J}_{1}}, {{J}_{4}}, {{J}_{5}}, {{J}_{6}}, {{J}_{3}}, {{J}_{2}}\}$ with $\sum{{{C}_{j}}}({{\sigma }^{3}}) = 47$ and ${{L}_{\max }}({{\sigma }^{3}}) = 8$ .

(iv) Since ${{y}_{2}} = 7$ , the jobs violating the inequalities are ${{J}_{2}}, {{J}_{3}}, {{J}_{6}}$ . We select ${{J}_{2}}$ as the job in hand and adjust ${{\sigma }^{3}}$ . Since $E({{\sigma }^{3}}(6)) = \varnothing$ , Procedure ${{A}_{1}}({{y}_{2}})$ returns ${{\sigma }_{3}} = \varnothing$ , implying that $\Pi \left(\mathcal{J}, {{y}_{2}} \right) = \varnothing$ .

By Step 4 of Algorithm ${{M}_{1}}$ , we get: ${{\pi }_{1}} = {{\sigma }_{2}} = \{{{J}_{1}}, {{J}_{2}}, {{J}_{3}}, {{J}_{6}}, {{J}_{5}}, {{J}_{4}}\}$ with $\sum{{{C}_{j}}}({{\pi }_{1}}) = 38$ and ${{L}_{\max }}({{\pi }_{1}}) = 7$ . Finally, Algorithm ${{M}_{1}}$ returns the Pareto set $\Omega (\mathcal{J}) = \{(38, 7, {{\pi }_{1}})\}$ .
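The first two schedules of the example can be checked with a short script. It is only a sketch under stated assumptions: Lemma 2.1 is not restated in this section, so the start-time rule in the hypothetical helper `recover` (each job starts as early as possible, not before its release date, not before the start of the preceding job, and not before the job three positions earlier has completed) is an assumption, as are the common processing time $p = 4$ , the cost ${{f}_{j}} = {{L}_{j}} = {{C}_{j}}-{{d}_{j}}$ , and the helper `candidates`, which returns job indices rather than positions.

```python
def recover(seq, r, p, m):
    """Completion times for a job sequence on m identical machines.

    Assumed rule: the job at each position starts as early as possible,
    not before its release date, not before the previous job's start,
    and not before the job m positions earlier has completed.
    """
    starts = []
    for pos, job in enumerate(seq):
        start = r[job]
        if pos >= 1:
            start = max(start, starts[pos - 1])
        if pos >= m:
            start = max(start, starts[pos - m] + p)
        starts.append(start)
    return {job: s + p for job, s in zip(seq, starts)}


def candidates(seq, i, completion, due, y):
    """Jobs in positions 0..i whose lateness, if completed at the
    completion time of position i, would be strictly below y."""
    t = completion[seq[i]]
    return [job for job in seq[: i + 1] if t - due[job] < y]


r = {1: 0, 2: 0, 3: 1, 4: 2, 5: 3, 6: 3}   # release dates
d = {1: 4, 2: 4, 3: 3, 4: 2, 5: 1, 6: 1}   # due dates
sigma_1 = [1, 2, 3, 4, 5, 6]
sigma_2 = [1, 2, 3, 6, 5, 4]

c1 = recover(sigma_1, r, p=4, m=3)
c2 = recover(sigma_2, r, p=4, m=3)
total_1, lmax_1 = sum(c1.values()), max(c1[j] - d[j] for j in c1)
total_2, lmax_2 = sum(c2.values()), max(c2[j] - d[j] for j in c2)
cand = candidates(sigma_1, 5, c1, d, y=8)
print(total_1, lmax_1, total_2, lmax_2, cand)
```

Under these assumptions the script gives total completion time 38 and maximum lateness 8 for ${{\sigma }_{1}}$ , 38 and 7 for ${{\sigma }_{2}}$ , and the candidate set $\{{{J}_{1}}, {{J}_{2}}, {{J}_{3}}, {{J}_{4}}\}$ at the sixth position of ${{\sigma }_{1}}$ , in agreement with the values reported above.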

Step 1 of Procedure ${{A}_{1}}({{y}_{s}})$ can be implemented in $O(n)$ time. Steps 2 and 3 can be implemented in $O(n)$ time in each iteration. Here, an iteration refers to a job inequality violation adjustment. In each iteration, there is a job which has to be moved to the left because of the inequality violation. Later, by Lemma 2.2 we will know that this job cannot be moved back again. Hence, since there are $n$ jobs and each job goes through at most $n-1$ positions (from the rightmost to the leftmost), the total number of iterations is $O({{n}^{2}})$ . The running time of Procedure ${{A}_{1}}({{y}_{s}})$ is $O({{n}^{3}})$ .

The running time of Algorithm ${{M}_{1}}$ is also $O({{n}^{3}})$ : although it is not clear how many times Algorithm ${{M}_{1}}$ calls Procedure ${{A}_{1}}(...)$ , the total number of iterations over all calls of Procedure ${{A}_{1}}(...)$ is still $O({{n}^{2}})$ , and each iteration can be done in $O(n)$ time.
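The counting argument can be written out as a short calculation. It only restates the bound above and adds nothing new:

```latex
\underbrace{n}_{\text{jobs}} \cdot \underbrace{(n-1)}_{\text{leftward moves per job}} = O(n^{2}) \ \text{iterations},
\qquad
\underbrace{O(n^{2})}_{\text{iterations}} \cdot \underbrace{O(n)}_{\text{time per iteration}} = O(n^{3}).
```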

In Lemma 2.2 below, we will prove the following: (1) Algorithm ${{M}_{1}}$ schedules the jobs backwardly and always selects the job with the largest release date from among the candidate jobs; (2) in the course of the algorithm, no job moved to the left can be moved back again; (3) at each iteration, Procedure ${{A}_{1}}({{y}_{s}})$ constructs a schedule in which the $i$ -th job, counting from the left, starts no later than the $i$ -th job in any feasible schedule in $\Pi \left(\mathcal{J}, {{y}_{s}} \right)$ , $i = 1, 2, \ldots, n$ .

Therefore, the schedule obtained at each iteration of Procedure ${{A}_{1}}({{y}_{s}})$ is no worse than any feasible schedule in $\Pi \left(\mathcal{J}, {{y}_{s}} \right)$ for the objectives $\sum\nolimits_{j = 1}^{n}{{{C}_{j}}}$ and ${{C}_{\max }}$ . Consequently, the final schedule (i.e., ${{\sigma }_{s+1}}$ , if it exists) obtained by Procedure ${{A}_{1}}({{y}_{s}})$ at the last iteration is in $\Pi \left(\mathcal{J}, {{y}_{s}} \right)$ and is optimal for $\sum\nolimits_{j = 1}^{n}{{{C}_{j}}}$ and ${{C}_{\max }}$ .

Lemma 2.2. Let ${{\sigma }^{h}} = ({{J}_{{{\sigma }^{h}}(1)}}, {{J}_{{{\sigma }^{h}}(2)}}, \cdots, {{J}_{{{\sigma }^{h}}(n)}})$ be the schedule obtained at iteration $h$ ( $h = 0, 1, \ldots$ ) of Procedure ${{A}_{1}}({{y}_{s}})$ . Let $\sigma = ({{J}_{\sigma (1)}}, {{J}_{\sigma (2)}}, \cdots, {{J}_{\sigma (n)}})$ be any schedule in $\Pi \left(\mathcal{J}, {{y}_{s}} \right)$ . Then for $i = 1, 2, \ldots, n$ , we have the following: (1) ${{r}_{{{\sigma }^{h}}(i)}}\ge \max \{{{r}_{j}}|j\in E({{\sigma }^{h}}(i))\}$ ; (2) ${{S}_{{{\sigma }^{h}}(i)}}\le {{S}_{\sigma (i)}}$ (and thus ${{C}_{{{\sigma }^{h}}(i)}}\le {{C}_{\sigma (i)}}$ ); (3) ${{C}_{{{\sigma }^{h}}(i)}}\le {{C}_{{{\sigma }^{h+1}}(i)}}$ .

Proof. We prove the lemma by induction on $s$ and $h$ .

First, we consider the input schedule for Procedure ${{A}_{1}}({{y}_{1}})$ , which is the initial schedule $\hat{\sigma }$ . Property (1) of the lemma clearly holds. We are going to prove property (2) for the base case $h = 0$ of the call for Procedure ${{A}_{1}}({{y}_{1}})$ . Let $\sigma$ be any schedule in $\Pi \left(\mathcal{J}, {{y}_{1}} \right)$ . We compare $\hat{\sigma }$ and $\sigma$ backwardly (from the right to the left) looking for a difference between the jobs. Suppose that the first difference occurs at the $k$ -th position, which is occupied by jobs ${{J}_{a}}$ and ${{J}_{b}}$ in $\hat{\sigma }$ and $\sigma$ , respectively. By the construction of $\hat{\sigma }$ , we know that ${{r}_{a}}\ge {{r}_{b}}$ . Since ${{J}_{a}}$ is processed earlier than ${{J}_{b}}$ in $\sigma$ , we can safely interchange ${{J}_{a}}$ and ${{J}_{b}}$ in $\sigma$ (regardless of whether ${{J}_{a}}$ can be scheduled at the $k$ -th position in $\sigma$ ), without increasing the job start or completion time at any position. Repetition of this argument shows that $\sigma$ can be safely transformed into $\hat{\sigma }$ . Thus, for $i = 1, 2, \ldots, n$ , we have that ${{S}_{\hat{\sigma }(i)}}\le {{S}_{\sigma (i)}}$ , proving property (2) for the base case $h = 0$ of the call for Procedure ${{A}_{1}}({{y}_{1}})$ . Hence, the lemma holds for the 0-th iteration of Procedure ${{A}_{1}}({{y}_{1}})$ .

Assume that the lemma holds for the first $h$ iterations of Procedure ${{A}_{1}}({{y}_{1}})$ . We now consider the $(h+1)$ -th iteration. More precisely, we observe ${{\sigma }^{h+1}}$ at the moment that it is being constructed to adjust ${{\sigma }^{h}}$ during Step 3 of Procedure ${{A}_{1}}({{y}_{1}})$ , but the next round of Step 2 has not been executed yet. That is, ${{\sigma }^{h+1}}$ is obtained from ${{\sigma }^{h}}$ by performing an inequality violation adjustment, but the completion times and the costs of the jobs in ${{\sigma }^{h+1}}$ have not been updated yet.

As described in Step 3 of Procedure ${{A}_{1}}({{y}_{1}})$ , for adjusting ${{\sigma }^{h}}$ , we pick a job ${{J}_{{{\sigma }^{h}}(i)}}$ in ${{\sigma }^{h}}$ such that ${{f}_{{{\sigma }^{h}}(i)}}({{C}_{{{\sigma }^{h}}(i)}})\ge {{y}_{s}}$ , and find a job ${{J}_{{{\sigma }^{h}}(e)}}$ which has the largest release date in $E({{\sigma }^{h}}(i))$ . Let ${{J}_{{{\sigma }^{h}}(e)}}$ be scheduled at the $i$ -th position instead of ${{J}_{{{\sigma }^{h}}(i)}}$ . Clearly, ${{r}_{{{\sigma }^{h}}(e)}} = \max \{{{r}_{j}}|j\in E({{\sigma }^{h}}(i))\}$ . By the inductive assumption, if we do not consider ${{J}_{{{\sigma }^{h}}(i)}}$ , then we have that ${{r}_{{{\sigma }^{h}}(i-1)}}\ge \max \{{{r}_{j}}|j\in E({{\sigma }^{h}}(i-1))\}$ . By comparing ${{J}_{x}} = {{J}_{{{\sigma }^{h}}(i)}}$ and ${{J}_{{{\sigma }^{h}}(i-1)}}$ , letting the one with the larger release date be scheduled at position $i-1$ , and letting ${{J}_{x}}$ be the other job, we can ensure that ${{r}_{{{\sigma }^{h}}(i-1)}}\ge \max \{{{r}_{j}}|j\in E({{\sigma }^{h}}(i-1))\}$ after ${{J}_{{{\sigma }^{h}}(i)}}$ has been taken into consideration. We continue to deal with ${{J}_{x}}$ and ${{J}_{{{\sigma }^{h}}(i-2)}}$ , and so on, as described in Step 3. Therefore, we prove property (1) for the $(h+1)$ -th iteration of Procedure ${{A}_{1}}({{y}_{1}})$ .

To prove property (2) for the $(h+1)$ -th iteration of Procedure ${{A}_{1}}({{y}_{1}})$ , we compare ${{\sigma }^{h+1}}$ and $\sigma$ backwardly looking for a difference between the jobs. Suppose that the first difference occurs at the $k$ -th position, which is occupied by jobs ${{J}_{a}}$ and ${{J}_{b}}$ in ${{\sigma }^{h+1}}$ and $\sigma$ , respectively. By the inductive assumption, ${{C}_{{{\sigma }^{h}}(k)}}\le {{C}_{\sigma (k)}}$ and thus ${{f}_{b}}({{C}_{{{\sigma }^{h}}(k)}})\le {{f}_{b}}({{C}_{\sigma (k)}}) < {{y}_{1}}$ (test the feasibility of job ${{J}_{b}}$ when it is completed at ${{C}_{{{\sigma }^{h}}(k)}}$ ), which means that job ${{J}_{b}}$ is also a candidate job at time ${{C}_{{{\sigma }^{h}}(k)}}$ . By the rule of selecting a candidate job in favor of the largest release date (which has just been proved), we know that ${{r}_{a}}\ge {{r}_{b}}$ . Since ${{J}_{a}}$ is processed earlier than ${{J}_{b}}$ in $\sigma$ , we can safely interchange ${{J}_{a}}$ and ${{J}_{b}}$ in $\sigma$ , without increasing the job start or completion time at any position. Repetition of this argument shows that $\sigma$ can be safely transformed into ${{\sigma }^{h+1}}$ , without increasing the job start or completion time at any position. Thus, for $i = 1, 2, \ldots, n$ we have that ${{S}_{{{\sigma }^{h+1}}(i)}}\le {{S}_{\sigma (i)}}$ , proving property (2) of the lemma for the $(h+1)$ -th iteration of Procedure ${{A}_{1}}({{y}_{1}})$ .

Finally, we observe ${{\sigma }^{h+1}}$ at the moment that it has been obtained and the next round of Step 2 has been executed already, that is, the moment when the completion times and the costs of the jobs in ${{\sigma }^{h+1}}$ have been updated. Clearly, for $e+1\le l\le i$ we have that ${{r}_{{{\sigma }^{h}}(l)}}\ge {{r}_{{{\sigma }^{h}}(e)}}$ . (Otherwise, since ${{J}_{{{\sigma }^{h}}(e)}}$ is a candidate job at time ${{C}_{{{\sigma }^{h}}(l)}}$ , by the rule of selecting a candidate job in favor of the largest release date, ${{J}_{{{\sigma }^{h}}(e)}}$ should have been scheduled at the $l$ -th position instead of ${{J}_{{{\sigma }^{h}}(l)}}$ .) After scheduling jobs ${{J}_{{{\sigma }^{h}}(i)}}, {{J}_{{{\sigma }^{h}}(i-1)}}, \ldots, {{J}_{{{\sigma }^{h}}(e)}}$ at the suitable positions, only the release date at the $i$ -th position may become smaller. The release dates at all the other positions remain unchanged or become larger. Since ${{J}_{{{\sigma }^{h}}(1)}}, {{J}_{{{\sigma }^{h}}(2)}}, \cdots, {{J}_{{{\sigma }^{h}}(n)}}$ are processed in non-decreasing order of their start times in ${{\sigma }^{h}}$ , by Lemma 2.1, during the adjustment of ${{\sigma }^{h}}$ , none among ${{C}_{{{\sigma }^{h}}(1)}}, {{C}_{{{\sigma }^{h}}(2)}}, \cdots, {{C}_{{{\sigma }^{h}}(n)}}$ can decrease. It follows that for all $i$ , ${{C}_{{{\sigma }^{h}}(i)}}\le {{C}_{{{\sigma }^{h+1}}(i)}}$ . Therefore, we prove property (3) for the $(h+1)$ -th iteration of Procedure ${{A}_{1}}({{y}_{1}})$ . Note that property (1) still holds, since the increasing completion times can only reduce the set of candidate jobs, and thus only make property (1) easier to satisfy. Property (3) also holds, since scheduling the jobs in a given sequence as described in Lemma 2.1 is optimal.

Summarizing the above, we have proved the lemma for Procedure ${{A}_{1}}({{y}_{1}})$ . Assume that the lemma holds for Procedures ${{A}_{1}}({{y}_{1}})$ , ${{A}_{1}}({{y}_{2}}), \ldots$ , ${{A}_{1}}({{y}_{s}})$ . We now consider Procedure ${{A}_{1}}({{y}_{s+1}})$ . Let $\sigma = ({{J}_{\sigma (1)}}, {{J}_{\sigma (2)}}, \cdots, {{J}_{\sigma (n)}})$ be any schedule in $\Pi \left(\mathcal{J}, {{y}_{s+1}} \right)$ . Since ${{y}_{s+1}} < {{y}_{s}}$ , we have that $\Pi \left(\mathcal{J}, {{y}_{s+1}} \right)\subseteq \Pi \left(\mathcal{J}, {{y}_{s}} \right)$ . The input schedule for Procedure ${{A}_{1}}({{y}_{s+1}})$ is just the output schedule of Procedure ${{A}_{1}}({{y}_{s}})$ . By the inductive assumption, the lemma holds for this schedule and $\sigma$ . Assume that the lemma holds for the first $h$ iterations of Procedure ${{A}_{1}}({{y}_{s+1}})$ . We now consider ${{\sigma }^{h+1}}$ and $\sigma$ . In almost the same manner as described above, we can prove that the lemma holds for the $(h+1)$ -th iteration of Procedure ${{A}_{1}}({{y}_{s+1}})$ .

By the principle of induction, we complete the proof. □

We get the following:

Lemma 2.3. Let ${{\sigma }^{last}}$ (i.e., ${{\sigma }_{s+1}}$ ) be the schedule obtained at the last iteration of Procedure ${{A}_{1}}({{y}_{s}})$ . If ${{\sigma }^{last}} = \varnothing$ , then $\Pi \left(\mathcal{J}, {{y}_{s}} \right) = \varnothing$ ; otherwise ${{\sigma }^{last}}$ is a schedule which has the minimum total completion time (and minimum makespan) among all schedules in $\Pi \left(\mathcal{J}, {{y}_{s}} \right)$ .

Proof. If ${{\sigma }^{last}} = \varnothing$ , then by Step 3 of Procedure ${{A}_{1}}({{y}_{s}})$ , at the last iteration, there is a job ${{J}_{{{\sigma }^{h}}(i)}}$ such that ${{f}_{{{\sigma }^{h}}(i)}}({{C}_{{{\sigma }^{h}}(i)}})\ge {{y}_{s}}$ and $E({{\sigma }^{h}}(i)) = \varnothing$ , where $E({{\sigma }^{h}}(i)) = \{l|1\le l\le i\wedge {{f}_{{{\sigma }^{h}}(l)}}({{C}_{{{\sigma }^{h}}(i)}}) < {{y}_{s}}\}$ denotes the set of the candidate jobs at time ${{C}_{{{\sigma }^{h}}(i)}}$ . Therefore, the first $i$ jobs in ${{\sigma }^{last-1}}$ can only be scheduled at the first $i$ positions in any schedule in $\Pi \left(\mathcal{J}, {{y}_{s}} \right)$ , but none of them can be scheduled at the $i$ -th position. This contradiction tells us that $\Pi \left(\mathcal{J}, {{y}_{s}} \right) = \varnothing$ .

If ${{\sigma }^{last}}\ne \varnothing$ , then by Lemma 2.2, we have the following: ${{C}_{{{\sigma }^{last}}(i)}}\le {{C}_{\sigma (i)}}$ , $i = 1, 2, \ldots, n$ , where $\sigma = ({{J}_{\sigma (1)}}, {{J}_{\sigma (2)}}, \cdots, {{J}_{\sigma (n)}})$ denotes any schedule in $\Pi \left(\mathcal{J}, {{y}_{s}} \right)$ . Hence, ${{\sigma }^{last}}$ is a schedule which has the minimum total completion time (and minimum makespan) among all schedules in $\Pi \left(\mathcal{J}, {{y}_{s}} \right)$ .

□
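For completeness, the step from the position-wise inequality of Lemma 2.2 to the two optimality claims can be spelled out. This is a routine calculation using only the symbols already defined; $\sigma$ is any schedule in $\Pi \left(\mathcal{J}, {{y}_{s}} \right)$ :

```latex
C_{\sigma^{last}(i)} \le C_{\sigma(i)} \quad (i = 1, 2, \ldots, n)
\;\Longrightarrow\;
\sum_{i=1}^{n} C_{\sigma^{last}(i)} \le \sum_{i=1}^{n} C_{\sigma(i)}
\quad\text{and}\quad
C_{\max}(\sigma^{last}) = \max_{1\le i\le n} C_{\sigma^{last}(i)} \le \max_{1\le i\le n} C_{\sigma(i)} = C_{\max}(\sigma).
```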

Algorithm ${{M}_{1}}$ applies the $\varepsilon$ -constraint method of Pareto optimization. The following theorem shows its correctness, the proof of which is based on Lemma 2.3. We omit the proof since it is standard and implied in Figure 1 and, e.g., ^[1,25].

Theorem 2.4. Algorithm ${{M}_{1}}$ constructs all Pareto-optimal points together with the corresponding Pareto-optimal schedules for $P|{{r}_{j}}, {{p}_{j}} = p|(\sum\nolimits_{j = 1}^{n}{{{C}_{j}}}, {{f}_{\max }})$ in $O({{n}^{3}})$ time. Consequently, problem $P|{{r}_{j}}, {{p}_{j}} = p|{{f}_{\max }}$ can also be solved in $O({{n}^{3}})$ time.

Moreover, by Lemma 2.3, Algorithm ${{M}_{1}}$ also solves $P\vert {{r}_{j}}, {{p}_{j}} = p\vert ({{C}_{\max }}, {{f}_{\max }})$ in $O({{n}^{3}})$ time. We only need to change the obtained Pareto-optimal points. The Pareto-optimal schedules for ${{C}_{\max }}$ and ${{f}_{\max }}$ are the same as those for $\sum\nolimits_{j = 1}^{n}{{{C}_{j}}}$ and ${{f}_{\max }}$ .
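The $\varepsilon$ -constraint loop can be sketched generically as follows. The steps of Algorithm ${{M}_{1}}$ are not restated in this section, so this is an assumption-laden sketch rather than the authors' algorithm: the names `pareto_points` and `improve` are hypothetical, `improve(sigma, y)` stands in for Procedure ${{A}_{1}}(y)$ and returns `None` when no feasible schedule exists, and the rule for keeping a schedule (keep ${{\sigma }_{s}}$ only if the next schedule has a strictly larger total completion time or does not exist) is inferred from the worked example.

```python
def pareto_points(initial, improve, total_cost, max_cost):
    """Collect Pareto-optimal (total_cost, max_cost, schedule) triples.

    improve(sigma, y) is assumed to return a schedule with max_cost < y
    and minimum total_cost, or None if no such schedule exists.
    """
    points = []
    sigma = initial
    while sigma is not None:
        nxt = improve(sigma, max_cost(sigma))
        # sigma is dominated if nxt achieves the same total cost
        # with a strictly smaller maximum cost.
        if nxt is None or total_cost(nxt) > total_cost(sigma):
            points.append((total_cost(sigma), max_cost(sigma), sigma))
        sigma = nxt
    return points


# Toy stand-in reproducing the example: sigma_1 -> sigma_2 -> infeasible.
values = {"sigma_1": (38, 8), "sigma_2": (38, 7)}
chain = {"sigma_1": "sigma_2", "sigma_2": None}
result = pareto_points(
    "sigma_1",
    improve=lambda s, y: chain[s],
    total_cost=lambda s: values[s][0],
    max_cost=lambda s: values[s][1],
)
print(result)
```

With the toy lookup tables, ${{\sigma }_{1}}$ is discarded because ${{\sigma }_{2}}$ has the same total completion time and a smaller maximum cost, leaving the single point $(38, 7)$ as in the example.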

3. An algorithm for $P\vert {{r}_{j}}, {{p}_{j}}=p\vert Lex({{f}_{\max }}, {{g}_{\max }})$

In this section we will adapt Algorithm ${{M}_{1}}$ to solve $P\vert {{r}_{j}}, {{p}_{j}} = p\vert Lex({{f}_{\max }}, {{g}_{\max }})$ in $O({{n}^{3}})$ time. Note that Lemma 2.1 still holds for this problem.

During the run of Algorithm ${{M}_{1}}$ , we only care about the criteria $\sum\nolimits_{j = 1}^{n}{{{C}_{j}}}$ and ${{f}_{\max }}$ , totally ignoring ${{g}_{\max }}$ . To solve $P|{{r}_{j}}, {{p}_{j}} = p|Lex({{f}_{\max }}, {{g}_{\max }})$ , we need to incorporate ${{g}_{\max }}$ into the framework.

Let schedule ${{\pi }^{*}}$ be the last schedule obtained upon the completion of Algorithm ${{M}_{1}}$ . Let ${{f}^{*}} = {{f}_{\max }}({{\pi }^{*}})$ . By Theorem 2.4, ${{\pi }^{*}}$ is an optimal schedule for $P\vert {{r}_{j}}, {{p}_{j}} = p\vert {{f}_{\max }}$ , i.e., ${{f}^{*}} = {{\min }_{\sigma \in \Pi \left(\mathcal{J} \right)}}{{f}_{\max }}(\sigma)$ . Let $\sigma$ be any schedule in $\Pi \left(\mathcal{J} \right)$ with ${{f}_{\max }}(\sigma) = {{f}^{*}}$ . By Lemma 2.2, the $i$ -th job in ${{\pi }^{*}}$ , counting from the left, starts no later than the $i$ -th job in $\sigma$ , $i = 1, 2, \ldots, n$ . As we saw in the last section, this property plays a key role in solving $P\vert {{r}_{j}}, {{p}_{j}} = p\vert (\sum\nolimits_{j = 1}^{n}{{{C}_{j}}}, {{f}_{\max }})$ . We will maintain a similar property for solving $P\vert {{r}_{j}}, {{p}_{j}} = p\vert Lex({{f}_{\max }}, {{g}_{\max }})$ .

Let $\Pi \left(\mathcal{J}, {{f}^{*}}, y \right)$ denote the set of the schedules in $\Pi \left(\mathcal{J} \right)$ whose ${{f}_{\max }}$ -values are equal to ${{f}^{*}}$ and ${{g}_{\max }}$ -values are less than $y$ .

Below is the algorithm (Algorithm ${{M}_{2}}$ ) for $P|{{r}_{j}}, {{p}_{j}} = p|Lex({{f}_{\max }}, {{g}_{\max }})$ . (The basic idea of the algorithm has been illustrated in the preceding section before the description of Algorithm ${{M}_{1}}$ .) The initial schedule is ${{\pi }^{*}}$ , which is the optimal schedule for $P|{{r}_{j}}, {{p}_{j}} = p|{{f}_{\max }}$ obtained by Algorithm ${{M}_{1}}$ .

Algorithm $M_2$ :

Input: An instance of $P|{{r}_{j}}, {{p}_{j}} = p|Lex({{f}_{\max }}, {{g}_{\max }})$ .

Output: A lexicographically optimal schedule such that ${{g}_{\max }}$ is minimized under the constraint that the ${{f}_{\max }}$ -value is equal to ${{f}^{*}}$ .

Step 1. Initially, set $s = 1$ . Let ${{\sigma }_{s}} = {{\pi }^{*}}$ , ${{y}_{s}} = {{g}_{\max }}({{\sigma }_{s}})$ .

Step 2. Run Procedure ${{A}_{2}}({{y}_{s}})$ to get a schedule ${{\sigma }_{s+1}}$ in $\Pi \left(\mathcal{J}, {{f}^{*}}, {{y}_{s}} \right)$ , using ${{\sigma }_{s}}$ as the input schedule.

Step 3. If ${{\sigma }_{s+1}}\ne \varnothing$ , then set ${{y}_{s+1}} = {{g}_{\max }}({{\sigma }_{s+1}})$ , $s = s+1$ . Go to Step 2. Otherwise, return ${{\sigma }_{s}}$ .
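Steps 1 to 3 of Algorithm ${{M}_{2}}$ amount to a simple loop, sketched below. The function name `lex_optimal` and the callable `procedure_a2`, which stands in for Procedure ${{A}_{2}}({{y}_{s}})$ and is assumed to return `None` in place of $\varnothing$ , are hypothetical; the toy data at the end are made up purely to exercise the loop.

```python
def lex_optimal(pi_star, procedure_a2, g_max):
    """Outer loop of Algorithm M_2 (sketch).

    pi_star:      schedule that is optimal for f_max (Step 1)
    procedure_a2: callable (sigma, y) -> schedule with g_max < y, or None
    g_max:        callable returning the g_max-value of a schedule
    """
    sigma = pi_star
    while True:
        nxt = procedure_a2(sigma, g_max(sigma))   # Step 2
        if nxt is None:                           # Step 3: no improvement
            return sigma
        sigma = nxt                               # Step 3: tighten the bound


# Made-up toy chain: each call yields a schedule with a smaller g_max.
g_values = {"a": 5, "b": 3, "c": 2}
chain = {"a": "b", "b": "c", "c": None}
best = lex_optimal("a", lambda s, y: chain[s], lambda s: g_values[s])
print(best)
```

Because each successful call strictly decreases the bound ${{y}_{s}}$ , the loop returns the last schedule for which Procedure ${{A}_{2}}$ still found a feasible improvement.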

Procedure ${{A}_{2}}({{y}_{s}})$ :

Input: Schedule ${{\sigma }_{s}} = \{{{J}_{{{\sigma }_{s}}(1)}}, {{J}_{{{\sigma }_{s}}(2)}}, \cdots, {{J}_{{{\sigma }_{s}}(n)}}\}$ with ${{f}_{\max }}({{\sigma }_{s}}) = {{f}^{*}}$ and ${{y}_{s}} = {{g}_{\max }}({{\sigma }_{s}})$ .

Step 3. Adjust ${{\sigma }^{h}}$ :

IF for all $i$ , the inequalities ${{f}_{{{\sigma }^{h}}(i)}}({{C}_{{{\sigma }^{h}}(i)}})\le {{f}^{*}}$ and ${{g}_{{{\sigma }^{h}}(i)}}({{C}_{{{\sigma }^{h}}(i)}}) < {{y}_{s}}$ hold,

THEN Return ${{\sigma }_{s+1}} = {{\sigma }^{h}}$ .

ELSE Pick a job ${{J}_{{{\sigma }^{h}}(i)}}$ such that ${{f}_{{{\sigma }^{h}}(i)}}({{C}_{{{\sigma }^{h}}(i)}}) > {{f}^{*}}$ or ${{g}_{{{\sigma }^{h}}(i)}}({{C}_{{{\sigma }^{h}}(i)}})\ge {{y}_{s}}$ . Let $E({{\sigma }^{h}}(i)) = \{l|1\le l\le i\wedge {{f}_{{{\sigma }^{h}}(l)}}({{C}_{{{\sigma }^{h}}(i)}})\le {{f}^{*}}\wedge {{g}_{{{\sigma }^{h}}(l)}}({{C}_{{{\sigma }^{h}}(i)}}) < {{y}_{s}}\}$ denote the set of the candidate jobs at time ${{C}_{{{\sigma }^{h}}(i)}}$ .

IF $E({{\sigma }^{h}}(i)) = \varnothing$ , THEN Return ${{\sigma }_{s+1}} = \varnothing$ .

Let ${{\sigma }^{h+1}} = {{\sigma }^{h}}$ and then set $h = h+1$ . Go to Step 2.
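The candidate set used in Step 3 of Procedure ${{A}_{2}}({{y}_{s}})$ differs from that of Procedure ${{A}_{1}}({{y}_{s}})$ only in that both cost constraints must hold. A minimal sketch follows; the helper name `candidates_a2`, the 0-based positions, the choice to return job ids instead of positions, and all numbers in the toy instance are assumptions for illustration only.

```python
def candidates_a2(seq, i, t, f, g, f_star, y):
    """Jobs in positions 0..i that could complete at time t without
    violating f <= f_star or g < y.

    f, g: callables (job, time) -> cost, assumed non-decreasing in time.
    """
    return [job for job in seq[: i + 1]
            if f(job, t) <= f_star and g(job, t) < y]


# Hypothetical toy instance with two lateness-type cost functions.
d1 = {1: 4, 2: 3, 3: 1}     # due dates defining f
d2 = {1: 1, 2: 3, 3: 4}     # due dates defining g
cands = candidates_a2(
    [1, 2, 3], i=2, t=5,
    f=lambda j, t: t - d1[j],
    g=lambda j, t: t - d2[j],
    f_star=2, y=3,
)
print(cands)
```

In the toy instance only job 2 satisfies both constraints at time 5: job 1 fails the strict bound on $g$ and job 3 exceeds ${{f}^{*}}$ .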

We omit the proof of Lemma 3.1 because it is very similar to that of Lemma 2.2.

Lemma 3.1. Let ${{\sigma }^{h}} = ({{J}_{{{\sigma }^{h}}(1)}}, {{J}_{{{\sigma }^{h}}(2)}}, \cdots, {{J}_{{{\sigma }^{h}}(n)}})$ be the schedule obtained at iteration $h$ ( $h = 0, 1, \ldots$ ) of Procedure ${{A}_{2}}({{y}_{s}})$ . Let $\sigma = ({{J}_{\sigma (1)}}, {{J}_{\sigma (2)}}, \cdots, {{J}_{\sigma (n)}})$ be any schedule in $\Pi \left(\mathcal{J}, {{f}^{*}}, {{y}_{s}} \right)$ . Then for $i = 1, 2, \ldots, n$ , we have the following: (1) ${{r}_{{{\sigma }^{h}}(i)}}\ge \max \{{{r}_{j}}|j\in E({{\sigma }^{h}}(i))\}$ ; (2) ${{S}_{{{\sigma }^{h}}(i)}}\le {{S}_{\sigma (i)}}$ (and thus ${{C}_{{{\sigma }^{h}}(i)}}\le {{C}_{\sigma (i)}}$ ); (3) ${{C}_{{{\sigma }^{h}}(i)}}\le {{C}_{{{\sigma }^{h+1}}(i)}}$ .

We get the following:

Lemma 3.2. Let ${{\sigma }^{last}}$ (i.e., ${{\sigma }_{s+1}}$ ) be the schedule obtained at the last iteration of Procedure ${{A}_{2}}({{y}_{s}})$ . If ${{\sigma }^{last}} = \varnothing$ , then $\Pi \left(\mathcal{J}, {{f}^{*}}, {{y}_{s}} \right) = \varnothing$ ; otherwise ${{\sigma }^{last}}$ is a schedule which has the minimum total completion time and minimum makespan among all schedules in $\Pi \left(\mathcal{J}, {{f}^{*}}, {{y}_{s}} \right)$ .

Proof. If ${{\sigma }^{last}} = \varnothing$ , then by Step 3 of Procedure ${{A}_{2}}({{y}_{s}})$ , at the last iteration, there is a job ${{J}_{{{\sigma }^{h}}(i)}}$ such that ${{f}_{{{\sigma }^{h}}(i)}}({{C}_{{{\sigma }^{h}}(i)}}) > {{f}^{*}}$ or ${{g}_{{{\sigma }^{h}}(i)}}({{C}_{{{\sigma }^{h}}(i)}})\ge {{y}_{s}}$ and $E({{\sigma }^{h}}(i)) = \varnothing$ , where $E({{\sigma }^{h}}(i)) = \{l|1\le l\le i\wedge {{f}_{{{\sigma }^{h}}(l)}}({{C}_{{{\sigma }^{h}}(i)}})\le {{f}^{*}}\wedge {{g}_{{{\sigma }^{h}}(l)}}({{C}_{{{\sigma }^{h}}(i)}}) < {{y}_{s}}\}$ denotes the set of candidate jobs at time ${{C}_{{{\sigma }^{h}}(i)}}$ . Therefore, the first $i$ jobs in ${{\sigma }^{last-1}}$ can only be scheduled at the first $i$ positions in any schedule in $\Pi \left(\mathcal{J}, {{f}^{*}}, {{y}_{s}} \right)$ , but none of them can be scheduled at the $i$ -th position. This contradiction tells us that $\Pi \left(\mathcal{J}, {{f}^{*}}, {{y}_{s}} \right) = \varnothing$ .

If ${{\sigma }^{last}}\ne \varnothing$ , then by Lemma 3.1, ${{\sigma }^{last}}$ is a schedule which has the minimum total completion time and minimum makespan among all schedules in $\Pi \left(\mathcal{J}, {{f}^{*}}, {{y}_{s}} \right)$ .

□

Based on Lemma 3.2, we get the following:

Theorem 3.3. Algorithm ${{M}_{2}}$ solves $P\vert {{r}_{j}}, {{p}_{j}} = p\vert Lex({{f}_{\max }}, {{g}_{\max }})$ in $O({{n}^{3}})$ time.

Moreover, by Lemma 3.2, the last schedule generated by Algorithm ${{M}_{2}}$ has the minimum total completion time and minimum makespan among the lexicographically optimal schedules for $P\vert {{r}_{j}}, {{p}_{j}} = p\vert Lex({{f}_{\max }}, {{g}_{\max }})$ .

4. Conclusions

In this paper we studied the bicriteria problem of scheduling equal-processing-time jobs with release dates on identical machines to minimize total completion time (and makespan) and maximum cost simultaneously, or to minimize the two general min-max criteria hierarchically. We presented $O({{n}^{3}})$ -time algorithms for the two problems. For future research on Pareto optimization, it would be interesting to consider more general min-sum objective functions instead of the total completion time, such as the total weighted completion time, in combination with a maximum cost or another min-sum objective function. For lexicographical optimization, we can try to extend the results in ^[26,27] to the case of unequal release dates.

Use of AI tools declaration

The authors declare they have not used Artificial Intelligence (AI) tools in the creation of this article.

Acknowledgments

We thank the editor and reviewers for their helpful suggestions.

This work is supported by the Natural Science Foundation of Shandong Province, China (No. ZR2020MA030), and the Shandong Soft Science Project (No. 2021RKY02040).

The authors express their gratitude to Princess Nourah bint Abdulrahman University Researchers Supporting Project number (PNURSP2023R61), Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia.

Conflict of interest

The authors declare that there is no conflict of interest.

References

[1]	M. L. Zhang, Z. H. Zhou, A review on multi-label learning algorithms, IEEE Trans. Knowl. Data Eng., 26 (2013), 1819–1837. https://doi.org/10.1109/TKDE.2013.39 doi: 10.1109/TKDE.2013.39
[2]	Z. Shao, W. Zhou, X. Deng, M. Zhang, Q. Cheng, Multilabel remote sensing image retrieval based on fully convolutional network, IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens., 13 (2020), 318–328. https://doi.org/10.1109/JSTARS.2019.2961634 doi: 10.1109/JSTARS.2019.2961634
[3]	Z. Zhang, Q. Zou, Y. Lin, L. Chen, S. Wang, Improved deep hashing with soft pairwise similarity for multi-label image retrieval, IEEE Trans. Multimedia, 22 (2019), 540–553. https://doi.org//10.1109/TMM.2019.2929957 doi: 10.1109/TMM.2019.2929957
[4]	X. Zhang, J. Xu, C. Soh, L. Chen, LA-HCN: label-based attention for hierarchical multi-label text classification neural network, Expert Syst. Appl., 187 (2022), 115922. https://doi.org/10.1016/j.eswa.2021.115922 doi: 10.1016/j.eswa.2021.115922
[5]	Z. Yang, F. Emmert-Streib, Optimal performance of Binary Relevance CNN in targeted multi-label text classification, Knowledge-Based Syst., 284 (2023), 111286. https://doi.org/10.1016/j.knosys.2023.111286 doi: 10.1016/j.knosys.2023.111286
[6]	R. Su, H. Yang, L. Wei, S. Chen, Q. Zou, A multi-label learning model for predicting drug-induced pathology in multi-organ based on toxicogenomics data, PLoS Comput. Biol., 18 (2022), e1010402. https://doi.org/10.1371/journal.pcbi.1010402 doi: 10.1371/journal.pcbi.1010402
[7]	S. Wan, M. K. Mak, S. Y. Kung, mGOASVM: Multi-label protein subcellular localization based on gene ontology and support vector machines, BMC Bioinf., 13 (2012), 1–16. https://doi.org/10.1186/1471-2105-13-290 doi: 10.1186/1471-2105-13-290
[8]	K. C. Chou, Advances in predicting subcellular localization of multi-label proteins and its implication for developing multi-target drugs, Curr. Med. Chem., 26 (2019), 4918–4943. https://doi.org/10.2174/0929867326666190507082559 doi: 10.2174/0929867326666190507082559
[9]	H. Wang, L. Yan, H. Huang, C. Ding, From protein sequence to protein function via multi-label linear discriminant analysis, IEEE/ACM Trans. Comput. Biol. Bioinf., 14 (2016), 503–513. https://doi.org/10.1109/TCBB.2016.2591529 doi: 10.1109/TCBB.2016.2591529
[10]	M. R. G. A. De Oliveira, P. M. Ciarelli, E. Oliveira, Recommendation of programming activities by multi-label classification for a formative assessment of students, Expert Syst. Appl., 40 (2013), 6641–6651. https://doi.org/10.1016/j.eswa.2013.06.011 doi: 10.1016/j.eswa.2013.06.011
[11]	M. L. Zhang, Y. K. Li, X. Y. Liu, X. Geng, Binary relevance for multi-label learning: an overview, Front. Comput. Sci., 12 (2018), 191–202. https://doi.org/10.1007/s11704-017-7031-7 doi: 10.1007/s11704-017-7031-7
[12]	J. Fürnkranz, E. Hüllermeie, E. Loza Mencía, K. Brinker, Multilabel classification via calibrated label ranking, Mach. Learn., 73 (2008), 133–153. https://doi.org/10.1007/s10994-008-5064-8 doi: 10.1007/s10994-008-5064-8
[13]	M. L. Zhang, Z. H. Zhou, ML-KNN: A lazy learning approach to multi-label learning, Pattern Recognit., 40 (2007), 2038–2048. https://doi.org/10.1016/j.patcog.2006.12.019 doi: 10.1016/j.patcog.2006.12.019
[14]	M. L. Zhang, Z. H. Zhou, Multilabel neural networks with applications to functional genomics and text categorization, IEEE Trans. Knowl. Data Eng., 18 (2006), 1338–1351. https://doi.org/10.1109/TKDE.2006.162 doi: 10.1109/TKDE.2006.162
[15]	M. R. Boutell, J. Luo, X. Shen, C. M. Brown, Learning multi-label scene classification, Pattern Recognit., 37 (2004), 1757–1771. https://doi.org/10.1016/j.patcog.2004.03.009 doi: 10.1016/j.patcog.2004.03.009
[16]	J. Read, B. Pfahringer, G. Holmes, E. Frank, Classifier chains for multi-label classification, Mach. Learn., 85 (2011), 333–359. https://doi.org/10.1007/s10994-011-5256-5 doi: 10.1007/s10994-011-5256-5
[17]	G. Tsoumakas, I. Katakis, I. Vlahavas, Random k-labelsets for multilabel classification, IEEE Trans. Knowl. Data Eng., 23 (2010), 1079–1089. https://doi.org/10.1109/TKDE.2010.164 doi: 10.1109/TKDE.2010.164
[18]	A. N. Tarekegn, M. Giacobini, K. Michalak, A review of methods for imbalanced multi-label classification, Pattern Recognit., 118 (2021), 107965. https://doi.org/10.1016/j.patcog.2021.107965 doi: 10.1016/j.patcog.2021.107965
[19]	A. Zhang, H. Yu, S. Zhou, Z. Huan, X. Yang, Instance weighted SMOTE by indirectly exploring the data distribution, Knowledge-Based Syst., 249 (2022), 108919. https://doi.org/10.1016/j.knosys.2022.108919 doi: 10.1016/j.knosys.2022.108919
[20]	A. Zhang, H. Yu, Z. Huan, X. Yang, S. Zheng, S. Gao, SMOTE-RkNN: A hybrid re-sampling method based on SMOTE and reverse k-nearest neighbors, Inf. Sci., 595 (2022), 70–88. https://doi.org/10.1016/j.ins.2022.02.038 doi: 10.1016/j.ins.2022.02.038
[21]	K. E. Bennin, J. Keung, P. Phannachitta, A. Monden, S. Mensah, MAHAKIL: Diversity based oversampling approach to alleviate the class imbalance issue in software defect prediction, IEEE Trans. Software Eng., 44 (2018), 534–550. https://doi.org/10.1109/TSE.2017.2731766 doi: 10.1109/TSE.2017.2731766
[22]	M. Zhang, T. Li, X. Zheng, Q. Yu, C. Chen, D. D. Zhou, et al., UFFDFR: Undersampling framework with denoising, fuzzy c-means clustering, and representative sample selection for imbalanced data classsification, Inf. Sci., 576 (2021), 658–680. https://doi.org/10.1016/j.ins.2021.07.053 doi: 10.1016/j.ins.2021.07.053
[23]	R. Batuwita, V. Palade, FSVM-CIL: fuzzy support vector machines for class imbalance learning, IEEE Trans. Fuzzy Syst., 18 (2010), 558–571. https://doi.org/10.1109/TFUZZ.2010.2042721 doi: 10.1109/TFUZZ.2010.2042721
[24]	C. L. Castro, A. P. Braga, Novel cost-sensitive approach to improve the multilayer perceptron performance on imbalanced data, IEEE Trans. Neural Networks Learn. Syst., 24 (2013), 888–899. https://doi.org/10.1109/TNNLS.2013.2246188 doi: 10.1109/TNNLS.2013.2246188
[25]	Z. H. Zhou, X. Y. Liu, Training cost-sensitive neural networks with methods addressing the class imbalance problem, IEEE Trans. Knowl. Data Eng., 18 (2005), 63–77. https://doi.org/10.1109/TKDE.2006.17 doi: 10.1109/TKDE.2006.17
[26]	H. Yu, C. Mu, C. Sun, W. Yang, X. Yang, X. Zuo, Support vector machine-based optimized decision threshold adjustment strategy for classifying imbalanced data, Knowledge-Based Syst., 76 (2015), 67–78. https://doi.org/10.1016/j.knosys.2014.12.007
[27]	H. Yu, C. Sun, X. Yang, W. Yang, J. Shen, Y. Qi, ODOC-ELM: Optimal decision outputs compensation-based extreme learning machine for classifying imbalanced data, Knowledge-Based Syst., 92 (2016), 55–70. https://doi.org/10.1016/j.knosys.2015.10.012
[28]	G. Collell, D. Prelec, K. R. Patil, A simple plug-in bagging ensemble based on threshold-moving for classifying binary and multiclass imbalanced data, Neurocomputing, 275 (2018), 330–340. https://doi.org/10.1016/j.neucom.2017.08.035
[29]	P. Lim, C. K. Goh, K. C. Tan, Evolutionary cluster-based synthetic oversampling ensemble (ECO-Ensemble) for imbalance learning, IEEE Trans. Cybern., 47 (2017), 2850–2861. https://doi.org/10.1109/TCYB.2016.2579658
[30]	S. E. Roshan, S. Asadi, Improvement of Bagging performance for classification of imbalanced datasets using evolutionary multi-objective optimization, Eng. Appl. Artif. Intell., 87 (2020), 103319. https://doi.org/10.1016/j.engappai.2019.103319
[31]	H. Yu, J. Ni, An improved ensemble learning method for classifying high-dimensional and imbalanced biomedicine data, IEEE/ACM Trans. Comput. Biol. Bioinf., 11 (2014), 657–666. https://doi.org/10.1109/TCBB.2014.2306838
[32]	H. G. Zefrehi, H. Altincay, Imbalance learning using heterogeneous ensembles, Expert Syst. Appl., 142 (2020), 113005. https://doi.org/10.1016/j.eswa.2019.113005
[33]	X. M. An, S. Xu, A selective evolutionary heterogeneous ensemble algorithm for classifying imbalanced data, Electron. Res. Arch., 31 (2023), 2733–2757. https://doi.org/10.3934/era.2023138
[34]	F. Charte, A. J. Rivera, M. J. del Jesus, F. Herrera, Addressing imbalance in multilabel classification: Measures and random resampling algorithms, Neurocomputing, 163 (2015), 3–16. https://doi.org/10.1016/j.neucom.2014.08.091
[35]	F. Charte, A. J. Rivera, M. J. del Jesus, F. Herrera, ML-SMOTE: Approaching imbalanced multi-label learning through synthetic instance generation, Knowledge-Based Syst., 89 (2015), 385–397. https://doi.org/10.1016/j.knosys.2015.07.019
[36]	M. Zhang, Y. K. Li, H. Yang, Towards class-imbalance aware multi-label learning, IEEE Trans. Cybern., 52 (2020), 4459–4471. https://doi.org/10.1109/TCYB.2020.3027509
[37]	B. Liu, G. Tsoumakas, Dealing with class imbalance in classifier chains via random undersampling, Knowledge-Based Syst., 192 (2020), 105292. https://doi.org/10.1016/j.knosys.2019.105292
[38]	Y. Peng, E. Huang, G. Chen, C. Wang, J. Xie, A general framework for multi-label learning towards class correlations and class imbalance, Intell. Data Anal., 23 (2019), 371–383. https://doi.org/10.3233/IDA-183932
[39]	J. Rice, R. J. Belland, A simulation study of moss floras using Jaccard's coefficient of similarity, J. Biogeogr., 9 (1982), 411–419. https://doi.org/10.2307/2844573
[40]	J. R. Quinlan, Improved use of continuous attributes in C4.5, J. Artif. Intell. Res., 4 (1996), 77–90. https://doi.org/10.1613/jair.279
[41]	J. Demsar, Statistical comparisons of classifiers over multiple data sets, J. Mach. Learn. Res., 7 (2006), 1–30. https://doi.org/10.1007/s10846-005-9016-2
[42]	S. García, A. Fernández, J. Luengo, F. Herrera, Advanced nonparametric tests for multiple comparisons in the design of experiments in computational intelligence and data mining: Experimental analysis of power, Inf. Sci., 180 (2010), 2044–2064. https://doi.org/10.1016/j.ins.2009.12.010
[43]	S. Pandya, T. R. Gadekallu, P. K. Reddy, W. Wang, M. Alazab, InfusedHeart: A novel knowledge-infused learning framework for diagnosis of cardiovascular events, IEEE Trans. Comput. Social Syst., 2022 (2022), 1–10. https://doi.org/10.1109/TCSS.2022.3151643
[44]	L. Zhang, J. Wang, W. Wang, Z. Jin, Y. Su, H. Chen, Smart contract vulnerability detection combined with multi-objective detection, Comput. Networks, 217 (2022), 109289. https://doi.org/10.1016/j.comnet.2022.109289
[45]	X. Liu, T. Shi, G. Zhou, M. Liu, Z. Yin, L. Yin, et al., Emotion classification for short texts: an improved multi-label method, Humanit. Social Sci. Commun., 10 (2023), 1–9. https://doi.org/10.1057/s41599-023-01816-6

© 2024 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)