In 2005, Rodríguez [1] used the Lyapunov-Schmidt method and the Brouwer fixed-point theorem to discuss the following discrete Sturm-Liouville boundary value problem
$$\begin{cases}\Delta[p(t-1)\Delta y(t-1)]+q(t)y(t)+\lambda y(t)=f(y(t)), & t\in[a+1,b+1]_{\mathbb{Z}},\\ a_{11}y(a)+a_{12}\Delta y(a)=0,\quad a_{21}y(b+1)+a_{22}\Delta y(b+1)=0,\end{cases}$$
where λ is an eigenvalue of the corresponding linear problem and the nonlinearity f is bounded.
Furthermore, in 2007, Ma [2] studied the following discrete boundary value problem
$$\begin{cases}\Delta[p(t-1)\Delta y(t-1)]+q(t)y(t)+\lambda y(t)=f(t,y(t))+h(t), & t\in[a+1,b+1]_{\mathbb{Z}},\\ a_{11}y(a)+a_{12}\Delta y(a)=0,\quad a_{21}y(b+1)+a_{22}\Delta y(b+1)=0,\end{cases}$$
where f is subject to the sublinear growth condition
$$|f(t,s)|\le A|s|^{\alpha}+B,\quad s\in\mathbb{R}$$
for some 0≤α<1 and A,B∈(0,∞). Additional results on the existence of solutions to the related continuous and discrete problems, both at nonresonance and at resonance, can be found in [3,4,5,6,7,8,9,10,11,12,13] and the references therein. For example, Li and Shu [14] considered the existence of solutions to continuous Sturm-Liouville problems with random impulses and boundary value conditions using Dhage's fixed-point theorem, and in [15] they established the existence of upper and lower solutions to a second-order random impulsive differential equation by the monotone iterative method.
Inspired by the above literature, we use the connectivity theory of the solution sets of compact vector fields [16] to consider the existence of solutions to the discrete resonance problem
$$\begin{cases}-\Delta[p(t-1)\Delta y(t-1)]+q(t)y(t)=\lambda_k r(t)y(t)+f(t,y(t))+\gamma\psi_k(t)+\bar g(t), & t\in[1,T]_{\mathbb{Z}},\\ (a_0\lambda_k+b_0)y(0)=(c_0\lambda_k+d_0)\Delta y(0),\\ (a_1\lambda_k+b_1)y(T+1)=(c_1\lambda_k+d_1)\nabla y(T+1),\end{cases}\tag{1.1}$$
where p:[0,T]Z→(0,∞), q:[1,T]Z→R, ¯g:[1,T]Z→R, r(t)>0 for t∈[1,T]Z, and (λk,ψk) is an eigenpair of the corresponding linear problem
$$\begin{cases}-\Delta[p(t-1)\Delta y(t-1)]+q(t)y(t)=\lambda r(t)y(t), & t\in[1,T]_{\mathbb{Z}},\\ (a_0\lambda+b_0)y(0)=(c_0\lambda+d_0)\Delta y(0),\\ (a_1\lambda+b_1)y(T+1)=(c_1\lambda+d_1)\nabla y(T+1).\end{cases}\tag{1.2}$$
It is worth noting that the difference between the problem (1.1) and the problems above is that the eigenvalue appears not only in the equation but also in the boundary conditions, which causes considerable difficulties. Furthermore, such problems arise in a number of physical settings, including heat conduction and vibrating strings. For instance, Fulton and Pruess [17] discussed a heat conduction problem with eigenparameter-dependent boundary conditions. However, to discuss this kind of problem, we need to know the spectrum of the problem (1.2). Fortunately, in 2016, Gao and Ma [18] obtained the eigenvalue theory of problem (1.2) under the following conditions:
(A1) δ0:=a0d0−b0c0<0,c0≠0, d1−b1≠0,
(A2) δ1:=a1d1−b1c1>0,c1≠0, b0+d0≠0,
which laid a theoretical foundation for this paper.
Under the conditions (A1) and (A2), we assume the following conditions hold:
(H1) (Sublinear growth condition) f:[1,T]Z×R→R is continuous and there exist α∈[0,1) and A,B∈(0,∞) such that
$$|f(t,y)|\le A|y|^{\alpha}+B,$$
(H2) (Sign condition) There exists ω>0 such that
$$yf(t,y)>0,\quad t\in[1,T]_{\mathbb{Z}}\ \text{for}\ |y|>\omega,\tag{1.3}$$
or
$$yf(t,y)<0,\quad t\in[1,T]_{\mathbb{Z}}\ \text{for}\ |y|>\omega,\tag{1.4}$$
(H3) ¯g:[1,T]Z→R satisfies
$$\sum_{s=1}^{T}\bar g(s)\psi_k(s)=0,\tag{1.5}$$
(H4) f:[1,T]Z×R→R is continuous and
lim|y|→∞f(t,y)=0 |
uniformly for t∈[1,T]Z.
The organization of this paper is as follows. In the second section, we construct a new inner product space. In this space, we discuss the self-adjointness of the corresponding linear operator and the properties of the eigenpairs of (1.2). Then the Lyapunov-Schmidt method is used to decompose the inner product space and transform our problem into an equivalent system; that is, finding the solutions of (1.1) is equivalent to finding the solutions of this system. Under the sublinear growth and sign conditions on the nonlinear term, an existence result for solutions of the problem (1.1) is obtained using Schauder's fixed-point theorem and the connectivity theory of the solution sets of compact vector fields. Based on the first result, the existence of two solutions of the problem (1.1) is also obtained.
Definition 2.1. ([19]) A linear operator P from a linear space X to itself is called a projection operator if P2=P.
Lemma 2.2. ([16]) Let C be a bounded closed convex set in a Banach space E and let T:[α,β]×C→C (α<β) be a continuous compact mapping. Then the set
Sα,β={(ρ,x)∈[α,β]×C|T(ρ,x)=x} |
contains a connected branch connecting {α}×C and {β}×C.
Lemma 2.3. ([20]) (Schauder) Let D be a bounded closed convex set in E and let A:D→D be completely continuous. Then A has a fixed point in D.
First, we construct the inner product space needed in this paper.
Let
Y:={u|u:[1,T]Z→R}, |
then Y is a Hilbert space under the inner product
$$\langle y,z\rangle_Y=\sum_{t=1}^{T}y(t)z(t)$$
and its norm is $\|y\|_Y:=\sqrt{\langle y,y\rangle_Y}$.
Furthermore, consider the space H:=Y⊕ℝ². Define the inner product as follows:
$$\langle[y,\alpha,\beta]^{\top},[z,\zeta,\rho]^{\top}\rangle=\langle y,z\rangle_Y+\frac{p(0)}{|\delta_0|}\alpha\zeta+\frac{p(T)}{|\delta_1|}\beta\rho,$$
whose norm is defined as
$$\|y^{*}\|=\langle[y,\alpha,\beta]^{\top},[y,\alpha,\beta]^{\top}\rangle^{\frac{1}{2}},$$
where ⊤ denotes the transpose.
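Although the paper contains no numerics, this weighted inner product is easy to experiment with. The following Python sketch (illustrative only; T, p, δ0 and δ1 are placeholder values, not data from the paper) stores an element [y,α,β]⊤ as an array of length T+2 and evaluates ⟨·,·⟩:

```python
import numpy as np

# Illustrative placeholder data: T grid points, p on [0, T], delta_0 < 0, delta_1 > 0.
T = 3
p = np.array([1.0, 1.0, 1.0, 1.0])         # p(0), ..., p(T)
delta0, delta1 = -1.0, 1.0                  # delta_0 = a0*d0 - b0*c0, delta_1 = a1*d1 - b1*c1

# Weights of the inner product on H = Y + R^2:
# <[y,a,b],[z,c,d]> = sum_t y(t)z(t) + p(0)/|delta0| * a*c + p(T)/|delta1| * b*d
weights = np.concatenate([np.ones(T), [p[0] / abs(delta0), p[T] / abs(delta1)]])

def inner_H(u, v):
    """Inner product of two elements of H stored as (y(1), ..., y(T), alpha, beta)."""
    return float(np.sum(weights * u * v))

u = np.array([1.0, 2.0, -1.0, 0.5, 0.3])    # (y(1), y(2), y(3), alpha, beta)
print(np.sqrt(inner_H(u, u)))               # the norm ||u|| induced by the inner product
```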
Let
$$y_{0,0}=b_0y(0)-d_0\Delta y(0),\qquad y_{0,1}=a_0y(0)-c_0\Delta y(0)$$
and
$$y_{T+1,0}=b_1y(T+1)-d_1\nabla y(T+1),\qquad y_{T+1,1}=a_1y(T+1)-c_1\nabla y(T+1).$$
For y∗=[y,α,β]⊤, define an operator L:D→H as follows:
$$Ly^{*}=\begin{bmatrix}-\Delta[p(t-1)\Delta y(t-1)]+q(t)y(t)\\ -y_{0,0}\\ -y_{T+1,0}\end{bmatrix}:=\begin{bmatrix}Ly\\ -y_{0,0}\\ -y_{T+1,0}\end{bmatrix},$$
where D={[y,α,β]⊤ : y∈Y, y_{0,1}=α, y_{T+1,1}=β}. Define S:D→H as follows:
$$Sy^{*}=S\begin{bmatrix}y\\ \alpha\\ \beta\end{bmatrix}=\begin{bmatrix}ry\\ \alpha\\ \beta\end{bmatrix}.$$
Then, the problem (1.2) is equivalent to the eigenvalue problem as follows:
Ly∗=λSy∗, | (2.1) |
that is, if (λk,y) is an eigenpair of the problem (1.2), then (λk,y∗) is an eigenpair of the operator L. Conversely, if (λk,y∗) is an eigenpair of the operator L, then (λk,y) is an eigenpair of the problem (1.2).
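For readers who wish to experiment, the eigenvalue problem (2.1) can be assembled as a matrix pencil in the unknowns (y(0),…,y(T+1)): the interior rows encode the difference equation and the first and last rows the λ-dependent boundary conditions. The sketch below is not part of the original analysis; the coefficients a_i, b_i, c_i, d_i, p, q, r are placeholders chosen only so that (A1) and (A2) hold, and the row layout should be checked against one's own data.

```python
import numpy as np
from scipy.linalg import eig

# Placeholder data satisfying (A1)-(A2): delta0 = a0*d0 - b0*c0 < 0, delta1 = a1*d1 - b1*c1 > 0.
T = 3
p = np.ones(T + 1)            # p(0), ..., p(T) > 0
q = np.zeros(T + 1)           # q(1), ..., q(T)   (index 0 unused)
r = np.ones(T + 1)            # r(1), ..., r(T) > 0 (index 0 unused)
a0, b0, c0, d0 = 0.0, 1.0, 1.0, 0.0     # boundary data at t = 0
a1, b1, c1, d1 = 2.0, 0.0, 1.0, 2.0     # boundary data at t = T+1

n = T + 2                     # unknowns y(0), ..., y(T+1)
Lm = np.zeros((n, n))
Sm = np.zeros((n, n))

# Row 0: -(b0*y(0) - d0*(y(1)-y(0))) = lambda * (a0*y(0) - c0*(y(1)-y(0)))
Lm[0, 0], Lm[0, 1] = -(b0 + d0), d0
Sm[0, 0], Sm[0, 1] = a0 + c0, -c0

# Interior rows t = 1, ..., T: -Delta[p(t-1) Delta y(t-1)] + q(t) y(t) = lambda r(t) y(t)
for t in range(1, T + 1):
    Lm[t, t - 1] = -p[t - 1]
    Lm[t, t] = p[t - 1] + p[t] + q[t]
    Lm[t, t + 1] = -p[t]
    Sm[t, t] = r[t]

# Row T+1: -(b1*y(T+1) - d1*(y(T+1)-y(T))) = lambda * (a1*y(T+1) - c1*(y(T+1)-y(T)))
Lm[T + 1, T], Lm[T + 1, T + 1] = -d1, d1 - b1
Sm[T + 1, T], Sm[T + 1, T + 1] = c1, a1 - c1

vals, vecs = eig(Lm, Sm)      # generalized eigenvalue problem  L v = lambda S v
print(np.sort(vals.real[np.isfinite(vals.real)]))
```

Since L is self-adjoint with respect to the weighted inner product (Lemma 2.5 below), the computed eigenvalues should be essentially real; noticeable imaginary parts would signal an assembly error.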
Finally, we define A:D→H as follows:
Ay∗=F(t,y∗)+[γψk+¯g,0,0]⊤,
where F(t,y∗)=F(t,[y,α,β]⊤)=[f(t,y),0,0]⊤. Obviously, the problem (1.1) is equivalent to the following operator equation
Ly∗=λkSy∗+Ay∗. | (2.2) |
It can be seen that there is a one-to-one correspondence (λk,y)↔(λk,y∗) between solutions of the problem (1.1) and solutions of the operator equation (2.2).
Next, we are committed to obtaining the orthogonality of the eigenfunction.
Lemma 2.4. Assume that (λ,y∗) and (μ,z∗) are eigenpairs of L, then
⟨y∗,Lz∗⟩−⟨Ly∗,z∗⟩=(μ−λ)⟨y∗,Sz∗⟩. |
Proof Let y∗=[y,α,β]⊤∈D, z∗=[z,ζ,ρ]⊤∈D, then
$$\begin{aligned}\langle y^{*},Lz^{*}\rangle&=\langle[y,\alpha,\beta]^{\top},[Lz,-z_{0,0},-z_{T+1,0}]^{\top}\rangle\\&=\langle y,Lz\rangle_Y+\frac{p(0)}{|\delta_0|}\alpha(-z_{0,0})+\frac{p(T)}{|\delta_1|}\beta(-z_{T+1,0})\\&=\mu\langle y,rz\rangle_Y+\frac{p(0)}{|\delta_0|}\alpha(\mu\zeta)+\frac{p(T)}{|\delta_1|}\beta(\mu\rho)\\&=\mu\langle y^{*},Sz^{*}\rangle.\end{aligned}\tag{2.3}$$
Similarly, we have
$$\begin{aligned}\langle Ly^{*},z^{*}\rangle&=\langle[Ly,-y_{0,0},-y_{T+1,0}]^{\top},[z,\zeta,\rho]^{\top}\rangle\\&=\langle Ly,z\rangle_Y+\frac{p(0)}{|\delta_0|}(-y_{0,0})\zeta+\frac{p(T)}{|\delta_1|}(-y_{T+1,0})\rho\\&=\lambda\langle ry,z\rangle_Y+\frac{p(0)}{|\delta_0|}\lambda\alpha\zeta+\frac{p(T)}{|\delta_1|}\lambda\beta\rho\\&=\lambda\langle y^{*},Sz^{*}\rangle.\end{aligned}\tag{2.4}$$
It can be seen from (2.3) and (2.4)
⟨y∗,Lz∗⟩−⟨Ly∗,z∗⟩=(μ−λ)⟨y∗,Sz∗⟩. |
Lemma 2.5. The operator L is the self-adjoint operator in H.
Proof For y∗=[y,α,β]⊤∈D, z∗=[z,ζ,ρ]⊤∈D, we just need to prove that ⟨y∗,Lz∗⟩=⟨Ly∗,z∗⟩. By the definition of the inner product in H, we obtain
$$\langle y^{*},Lz^{*}\rangle=\langle y,Lz\rangle_Y+\frac{p(0)}{|\delta_0|}\alpha(-z_{0,0})+\frac{p(T)}{|\delta_1|}\beta(-z_{T+1,0}),$$
and
$$\langle Ly^{*},z^{*}\rangle=\langle Ly,z\rangle_Y+\frac{p(0)}{|\delta_0|}(-y_{0,0})\zeta+\frac{p(T)}{|\delta_1|}(-y_{T+1,0})\rho.$$
Therefore,
$$\langle y^{*},Lz^{*}\rangle-\langle Ly^{*},z^{*}\rangle=\langle y,Lz\rangle_Y-\langle Ly,z\rangle_Y+\frac{p(0)}{|\delta_0|}\big[\alpha(-z_{0,0})-(-y_{0,0})\zeta\big]+\frac{p(T)}{|\delta_1|}\big[\beta(-z_{T+1,0})-(-y_{T+1,0})\rho\big],$$
where
$$\begin{aligned}\langle y,Lz\rangle_Y&=\sum_{t=1}^{T}y(t)\big(-\Delta[p(t-1)\Delta z(t-1)]+q(t)z(t)\big)\\&=\sum_{t=1}^{T}y(t)p(t-1)\Delta z(t-1)-\sum_{t=1}^{T}y(t)p(t)\Delta z(t)+\sum_{t=1}^{T}q(t)y(t)z(t)\\&=\sum_{t=0}^{T-1}y(t+1)p(t)\Delta z(t)-\sum_{t=1}^{T}y(t)p(t)\Delta z(t)+\sum_{t=1}^{T}q(t)y(t)z(t)\\&=\sum_{t=0}^{T-1}p(t)\Delta y(t)\Delta z(t)+p(0)y(0)\Delta z(0)-p(T)y(T)\Delta z(T)+\sum_{t=1}^{T}q(t)y(t)z(t)\end{aligned}$$
and
$$\langle Ly,z\rangle_Y=\sum_{t=0}^{T-1}p(t)\Delta y(t)\Delta z(t)+p(0)\Delta y(0)z(0)-p(T)\Delta y(T)z(T)+\sum_{t=1}^{T}q(t)y(t)z(t).$$
Moreover, from
$$\begin{aligned}\alpha(-z_{0,0})-(-y_{0,0})\zeta&=[a_0y(0)-c_0\Delta y(0)][d_0\Delta z(0)-b_0z(0)]-[d_0\Delta y(0)-b_0y(0)][a_0z(0)-c_0\Delta z(0)]\\&=(a_0d_0-b_0c_0)[y(0)\Delta z(0)-\Delta y(0)z(0)]\end{aligned}$$
and
$$\begin{aligned}\beta(-z_{T+1,0})-(-y_{T+1,0})\rho&=[a_1y(T+1)-c_1\nabla y(T+1)][-b_1z(T+1)+d_1\nabla z(T+1)]\\&\quad-[-b_1y(T+1)+d_1\nabla y(T+1)][a_1z(T+1)-c_1\nabla z(T+1)]\\&=(a_1d_1-b_1c_1)[y(T+1)\nabla z(T+1)-\nabla y(T+1)z(T+1)],\end{aligned}$$
we have
$$\begin{aligned}\langle y^{*},Lz^{*}\rangle-\langle Ly^{*},z^{*}\rangle&=p(0)\begin{vmatrix}y(0)&\Delta y(0)\\ z(0)&\Delta z(0)\end{vmatrix}-p(T)\begin{vmatrix}y(T)&\Delta y(T)\\ z(T)&\Delta z(T)\end{vmatrix}\\&\quad-p(0)\begin{vmatrix}y(0)&\Delta y(0)\\ z(0)&\Delta z(0)\end{vmatrix}+p(T)\begin{vmatrix}y(T+1)&\nabla y(T+1)\\ z(T+1)&\nabla z(T+1)\end{vmatrix}=0.\end{aligned}$$
In order to obtain the orthogonality of the eigenfunctions, we define a weighted inner product related to the weight function r(t) on H. First, we define the inner product on Y as
$$\langle y,z\rangle_r=\sum_{t=1}^{T}r(t)y(t)z(t).$$
Similarly, the inner product associated with the weight function r(t) in the space H is defined as follows:
$$\langle[y,\alpha,\beta]^{\top},[z,\zeta,\rho]^{\top}\rangle_r=\langle y,z\rangle_r+\frac{p(0)}{|\delta_0|}\alpha\zeta+\frac{p(T)}{|\delta_1|}\beta\rho.$$
Lemma 2.6. (Orthogonality theorem) Assume that (A1) and (A2) hold. If (λ,y∗) and (μ,z∗) are two different eigenpairs of L, then y∗ and z∗ are orthogonal with respect to the weighted inner product associated with the weight function r(t).
Proof Assume that (λ,y∗) and (μ,z∗) are eigenpairs of L. Then it follows from Lemmas 2.4 and 2.5 that
$$0=(\mu-\lambda)\langle y^{*},Sz^{*}\rangle=(\mu-\lambda)\langle y^{*},z^{*}\rangle_r.$$
Therefore, if λ≠μ, then ⟨y∗,z∗⟩r=0, which implies that y∗ and z∗ are orthogonal with respect to the inner product defined by the weight function r(t).
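Lemma 2.5 also lends itself to a quick numerical sanity check on random data. A minimal sketch (placeholder coefficients satisfying (A1)–(A2), not taken from the paper) verifies the identity ⟨y∗,Lz∗⟩=⟨Ly∗,z∗⟩ up to rounding:

```python
import numpy as np

rng = np.random.default_rng(0)

# Illustrative coefficients satisfying (A1)-(A2).
T = 6
p = rng.uniform(0.5, 2.0, T + 1)                 # p(0), ..., p(T) > 0
q = rng.uniform(-1.0, 1.0, T + 1)                # q(1), ..., q(T)
a0, b0, c0, d0 = 0.0, 1.0, 1.0, 0.0              # delta0 = a0*d0 - b0*c0 = -1 < 0
a1, b1, c1, d1 = 2.0, 0.0, 1.0, 2.0              # delta1 = a1*d1 - b1*c1 =  4 > 0
delta0, delta1 = a0 * d0 - b0 * c0, a1 * d1 - b1 * c1

def embed(y):
    """Map grid values y(0), ..., y(T+1) to the element [y|_[1,T], alpha, beta] of D."""
    alpha = a0 * y[0] - c0 * (y[1] - y[0])            # y_{0,1}
    beta = a1 * y[T + 1] - c1 * (y[T + 1] - y[T])     # y_{T+1,1}
    return np.concatenate([y[1:T + 1], [alpha, beta]])

def L_op(y):
    """Apply L: the interior difference expression plus the entries -y_{0,0}, -y_{T+1,0}."""
    ly = np.array([-(p[t] * (y[t + 1] - y[t]) - p[t - 1] * (y[t] - y[t - 1])) + q[t] * y[t]
                   for t in range(1, T + 1)])
    y00 = b0 * y[0] - d0 * (y[1] - y[0])
    yT0 = b1 * y[T + 1] - d1 * (y[T + 1] - y[T])
    return np.concatenate([ly, [-y00, -yT0]])

w = np.concatenate([np.ones(T), [p[0] / abs(delta0), p[T] / abs(delta1)]])
inner = lambda u, v: float(np.sum(w * u * v))    # the inner product on H

y = rng.standard_normal(T + 2)
z = rng.standard_normal(T + 2)
print(abs(inner(embed(y), L_op(z)) - inner(L_op(y), embed(z))))   # ~0 up to rounding
```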
Lemma 2.7. ([18]) Suppose that (A1) and (A2) hold. Then (1.2) has at least T or at most T+2 simple eigenvalues.
In this paper, we assume that λk is a simple eigenvalue; that is, the corresponding eigenspace is one-dimensional. Let ψ∗k=[ψk,α,β]⊤∈D be the eigenfunction corresponding to λk, normalized so that
⟨ψ∗k,ψ∗k⟩=1. | (2.5) |
Denote 𝕃:=L−λkS; then the operator equation (2.2) is transformed into
$$\mathbb{L}y^{*}=Ay^{*}.\tag{2.6}$$
Define P:D→D by
(Px∗)(t)=ψ∗k(t)⟨ψ∗k(t),x∗(t)⟩. |
Lemma 2.8. P is a projection operator and Im(P)=Ker(𝕃).
Proof Obviously, P is a linear operator, next, we need to prove P2=P.
$$\begin{aligned}(P^{2}x^{*})(t)&=P(Px^{*})(t)=\psi_k^{*}(t)\langle\psi_k^{*}(t),Px^{*}(t)\rangle=\psi_k^{*}(t)\big\langle\psi_k^{*}(t),\psi_k^{*}(t)\langle\psi_k^{*}(t),x^{*}(t)\rangle\big\rangle\\&=\psi_k^{*}(t)\langle\psi_k^{*}(t),x^{*}(t)\rangle\langle\psi_k^{*}(t),\psi_k^{*}(t)\rangle=\psi_k^{*}(t)\langle\psi_k^{*}(t),x^{*}(t)\rangle=(Px^{*})(t).\end{aligned}$$
It follows from Definition 2.1 that P is a projection operator. In addition, Im(P)=span{ψ∗k}=Ker(𝕃).
Define H:H→H by
$$H\left(\begin{bmatrix}y\\ \alpha\\ \beta\end{bmatrix}\right)=\begin{bmatrix}y\\ \alpha\\ \beta\end{bmatrix}-\left\langle\begin{bmatrix}y\\ \alpha\\ \beta\end{bmatrix},\psi_k^{*}\right\rangle\psi_k^{*}.$$
Lemma 2.9. H is a projection operator and Im(H)=Im(𝕃).
Proof Obviously, H is a linear operator, next, we need to prove that H2=H.
$$\begin{aligned}H^{2}([y,\alpha,\beta]^{\top})&=H\big(H[y,\alpha,\beta]^{\top}\big)=H[y,\alpha,\beta]^{\top}-\langle H[y,\alpha,\beta]^{\top},\psi_k^{*}\rangle\psi_k^{*}\\&=[y,\alpha,\beta]^{\top}-\langle[y,\alpha,\beta]^{\top},\psi_k^{*}\rangle\psi_k^{*}-\big\langle[y,\alpha,\beta]^{\top}-\langle[y,\alpha,\beta]^{\top},\psi_k^{*}\rangle\psi_k^{*},\psi_k^{*}\big\rangle\psi_k^{*}\\&=[y,\alpha,\beta]^{\top}-2\langle[y,\alpha,\beta]^{\top},\psi_k^{*}\rangle\psi_k^{*}+\big\langle\langle[y,\alpha,\beta]^{\top},\psi_k^{*}\rangle\psi_k^{*},\psi_k^{*}\big\rangle\psi_k^{*}\\&=[y,\alpha,\beta]^{\top}-2\langle[y,\alpha,\beta]^{\top},\psi_k^{*}\rangle\psi_k^{*}+\langle[y,\alpha,\beta]^{\top},\psi_k^{*}\rangle\langle\psi_k^{*},\psi_k^{*}\rangle\psi_k^{*}\\&=H([y,\alpha,\beta]^{\top}).\end{aligned}$$
It follows from Definition 2.1 that H is a projection operator. On the one hand, for any [y,α,β]⊤∈H, we have
$$\Big\langle H[y,\alpha,\beta]^{\top},\psi_k^{*}\Big\rangle=\Big\langle[y,\alpha,\beta]^{\top}-\langle[y,\alpha,\beta]^{\top},\psi_k^{*}\rangle\psi_k^{*},\psi_k^{*}\Big\rangle=\langle[y,\alpha,\beta]^{\top},\psi_k^{*}\rangle-\big\langle\langle[y,\alpha,\beta]^{\top},\psi_k^{*}\rangle\psi_k^{*},\psi_k^{*}\big\rangle=0;$$
thus, Im(H)⊂Im(𝕃). On the other hand, for any y∗∈Im(𝕃), we have
$$\langle y^{*},\psi_k^{*}\rangle=0.$$
In summary, Im(H)=Im(𝕃).
Let I denote the identity operator; then
$$D=\operatorname{Im}(P)\oplus\operatorname{Im}(I-P),\qquad H=\operatorname{Im}(H)\oplus\operatorname{Im}(I-H).$$
The restriction 𝕃|Im(I−P) of the operator 𝕃 to Im(I−P) is a bijection from Im(I−P) onto Im(H). Define M:Im(H)→Im(I−P) by
$$M:=\big(\mathbb{L}|_{\operatorname{Im}(I-P)}\big)^{-1}.$$
It can be seen from Ker 𝕃=span{ψ∗k} that there is a unique decomposition for any y∗=[y,α,β]⊤∈D
y∗=ρψ∗k+x∗, |
where ρ∈R,x∗=[x,α,β]⊤∈Im(I−P).
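In coordinates, this splitting is simply a rank-one projection with respect to the weighted inner product. A minimal numpy sketch (the weights and ψ∗k below are illustrative stand-ins, not the actual eigenfunction):

```python
import numpy as np

rng = np.random.default_rng(1)

# Weights of the inner product on H (illustrative values).
T = 6
w = np.concatenate([np.ones(T), [2.0, 0.5]])   # last two: p(0)/|delta0|, p(T)/|delta1|
inner = lambda u, v: float(np.sum(w * u * v))

psi = rng.standard_normal(T + 2)               # stand-in for psi_k^*
psi /= np.sqrt(inner(psi, psi))                # normalization <psi, psi> = 1, as in (2.5)

ystar = rng.standard_normal(T + 2)
rho = inner(psi, ystar)                        # component along Ker(L - lambda_k S) = span{psi_k^*}
xstar = ystar - rho * psi                      # component in Im(I - P)

print(abs(inner(psi, xstar)))                  # ~0: x* is orthogonal to psi_k^*
```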
Lemma 2.10. The operator Eq (2.6) is equivalent to the following system
x∗=MHA(ρψ∗k+x∗), | (2.7) |
$$\sum_{t=1}^{T}\psi_k(t)f(t,\rho\psi_k(t)+x(t))=\gamma\Big(\frac{p(0)}{|\delta_0|}\alpha^{2}+\frac{p(T)}{|\delta_1|}\beta^{2}-1\Big):=\theta,\tag{2.8}$$
where α=a0ψk(0)−c0Δψk(0),β=a1ψk(T+1)−c1∇ψk(T+1).
Proof (ⅰ) For any y∗=ρψ∗k+x∗, we have
$$\begin{aligned}\mathbb{L}y^{*}=Ay^{*}\ &\Longleftrightarrow\ H\big(\mathbb{L}(\rho\psi_k^{*}+x^{*})-A(\rho\psi_k^{*}+x^{*})\big)=0\\&\Longleftrightarrow\ \mathbb{L}x^{*}-HA(\rho\psi_k^{*}+x^{*})=0\\&\Longleftrightarrow\ x^{*}=MHA(\rho\psi_k^{*}+x^{*}).\end{aligned}$$
(ⅱ) Since ⟨𝕃y∗,ψ∗k⟩=0, we have ⟨Ay∗,ψ∗k⟩=0. Therefore,
$$\langle f(t,y)+\gamma\psi_k+\bar g,\psi_k\rangle_Y=\sum_{t=1}^{T}f(t,\rho\psi_k(t)+x(t))\psi_k(t)+\sum_{t=1}^{T}\gamma\psi_k(t)\psi_k(t)+\sum_{t=1}^{T}\bar g(t)\psi_k(t)=0.$$
Combining (H3) with (2.5), we have
$$\sum_{t=1}^{T}\psi_k(t)f(t,\rho\psi_k(t)+x(t))=\gamma\Big(\frac{p(0)}{|\delta_0|}\alpha^{2}+\frac{p(T)}{|\delta_1|}\beta^{2}-1\Big)=\theta,$$
where α=a0ψk(0)−c0Δψk(0),β=a1ψk(T+1)−c1∇ψk(T+1).
Let
A+={t∈{1,2,⋯,T} s.t. ψk(t)>0}, |
A−={t∈{1,2,⋯,T} s.t. ψk(t)<0}. |
Obviously,
A+∪A−≠∅, min{|ψk(t)||t∈A+∪A−}>0. |
Lemma 3.1. Suppose that (H1) holds. Then there exist constants M0 and M1 such that
$$\|x^{*}\|\le M_1(|\rho|\|\psi_k\|_Y)^{\alpha},$$
whenever (ρ,x∗) is a solution of (2.7) satisfying |ρ|≥M0.
Proof Since
A(ρψ∗k+x∗)=F(t,ρψ∗k+x∗)+[γψk+¯g,0,0]⊤=[f(t,ρψk+x)+γψk+¯g,0,0]⊤, |
we have
$$\begin{aligned}\|x^{*}\|&\le\|M\|_{\operatorname{Im}(H)\to\operatorname{Im}(I-P)}\|H\|_{H\to\operatorname{Im}(H)}\big[\|\bar g\|_Y+\gamma\|\psi_k\|_Y+A(|\rho|\|\psi_k\|_Y+\|x\|_Y)^{\alpha}+B\big]\\&=\|M\|_{\operatorname{Im}(H)\to\operatorname{Im}(I-P)}\|H\|_{H\to\operatorname{Im}(H)}\Big[\|\bar g\|_Y+A(|\rho|\|\psi_k\|_Y)^{\alpha}\Big(1+\frac{\|x\|_Y}{|\rho|\|\psi_k\|_Y}\Big)^{\alpha}+B-\theta\Big]\\&\le\|M\|_{\operatorname{Im}(H)\to\operatorname{Im}(I-P)}\|H\|_{H\to\operatorname{Im}(H)}\Big[\|\bar g\|_Y+A(|\rho|\|\psi_k\|_Y)^{\alpha}\Big(1+\frac{\alpha\|x\|_Y}{|\rho|\|\psi_k\|_Y}\Big)+B-\theta\Big]\\&=\|M\|_{\operatorname{Im}(H)\to\operatorname{Im}(I-P)}\|H\|_{H\to\operatorname{Im}(H)}\Big[\|\bar g\|_Y+A(|\rho|\|\psi_k\|_Y)^{\alpha}\Big(1+\frac{\alpha}{(|\rho|\|\psi_k\|_Y)^{1-\alpha}}\cdot\frac{\|x\|_Y}{(|\rho|\|\psi_k\|_Y)^{\alpha}}\Big)+B-\theta\Big].\end{aligned}$$
Denote
$$D_0=\|M\|_{\operatorname{Im}(H)\to\operatorname{Im}(I-P)}\|H\|_{H\to\operatorname{Im}(H)}(\|\bar g\|_Y+B-\theta),\qquad D_1=A\|M\|_{\operatorname{Im}(H)\to\operatorname{Im}(I-P)}\|H\|_{H\to\operatorname{Im}(H)}.$$
Furthermore, we have
$$\frac{\|x^{*}\|}{(|\rho|\|\psi_k\|_Y)^{\alpha}}\le\frac{D_0}{(|\rho|\|\psi_k\|_Y)^{\alpha}}+D_1+\frac{\alpha D_1}{(|\rho|\|\psi_k\|_Y)^{1-\alpha}}\cdot\frac{\|x\|_Y}{(|\rho|\|\psi_k\|_Y)^{\alpha}}\le\frac{D_0}{(|\rho|\|\psi_k\|_Y)^{\alpha}}+D_1+\frac{\alpha D_1}{(|\rho|\|\psi_k\|_Y)^{1-\alpha}}\cdot\frac{\|x^{*}\|}{(|\rho|\|\psi_k\|_Y)^{\alpha}}.$$
So, if we let
$$\frac{\alpha D_1}{(|\rho|\|\psi_k\|_Y)^{1-\alpha}}\le\frac{1}{2},$$
i.e.,
$$|\rho|\ge\frac{(2\alpha D_1)^{\frac{1}{1-\alpha}}}{\|\psi_k\|_Y}:=M_0,$$
then
$$\frac{\|x^{*}\|}{(|\rho|\|\psi_k\|_Y)^{\alpha}}\le\frac{2D_0}{(M_0\|\psi_k\|_Y)^{\alpha}}+2D_1:=M_1.$$
This implies that
$$\|x^{*}\|\le M_1(|\rho|\|\psi_k\|_Y)^{\alpha}.$$
Lemma 3.2. Suppose that (H1) holds. Then there exist constants M0 and Γ such that
$$\|x^{*}\|\le\Gamma\big(|\rho|\min\{|\psi_k(t)|:t\in A^{+}\cup A^{-}\}\big)^{\alpha},$$
whenever (ρ,x∗) is a solution of (2.7) satisfying |ρ|≥M0.
According to Lemma 3.2, choose a constant ρ0 such that
$$\rho_0>\max\Big\{M_0,\ \Gamma\big(|\rho_0|\min\{|\psi_k(t)|:t\in A^{+}\cup A^{-}\}\big)^{\alpha}\Big\}.\tag{3.1}$$
Let
K:={x∗∈Im(I−P)|x∗=MHA(ρψ∗k+x∗),|ρ|≤ρ0}. |
Then, for sufficiently large ρ≥ρ0, we have
ρψk(t)+x(t)≥ω, ∀t∈A+,x∗∈K, | (3.2) |
ρψk(t)+x(t)≤−ω, ∀t∈A−,x∗∈K, | (3.3) |
and for sufficiently small ρ≤−ρ0, we have
ρψk(t)+x(t)≤−ω, ∀t∈A+,x∗∈K, | (3.4) |
ρψk(t)+x(t)≥ω, ∀t∈A−,x∗∈K. | (3.5) |
Theorem 3.3. Suppose that (A1), (A2) and (H1)–(H3) hold, then there exists a non-empty bounded set Ω¯g⊂R, such that the problem (1.1) has a solution if and only if θ∈Ω¯g. Furthermore, Ω¯g contains θ=0 and has a non-empty interior.
Proof We prove only the case of (1.3) in (H2), and the case of (1.4) can be similarly proved.
From (1.3) and (3.2)–(3.5), it is not difficult to see that
f(t,ρψk(t)+x(t))>0, ∀t∈A+, x∗∈K, |
f(t,ρψk(t)+x(t))<0, ∀t∈A−, x∗∈K, |
for sufficiently large ρ≥ρ0, while for sufficiently small ρ≤−ρ0,
f(t,ρψk(t)+x(t))<0, ∀t∈A+, x∗∈K, |
f(t,ρψk(t)+x(t))>0, ∀t∈A−, x∗∈K. |
Therefore, if ρ≥ρ0 is sufficiently large,
ψk(t)f(t,ρψk(t)+x(t))>0, ∀t∈A+∪A−, x∗∈K, | (3.6) |
if ρ≤−ρ0 is sufficiently small,
ψk(t)f(t,ρψk(t)+x(t))<0, ∀t∈A+∪A−, x∗∈K. | (3.7) |
Let
C:={x∗∈Im(I−P)|‖x∗‖≤ρ0}. |
Define Tρ:Im(I−P)→Im(I−P) by
$$T_\rho x^{*}:=MHA(\rho\psi_k^{*}+x^{*}).$$
Obviously, Tρ is completely continuous. By (3.1), for x∗∈C and ρ∈[−ρ0,ρ0],
$$\|T_\rho x^{*}\|\le\Gamma\big(|\rho|\min\{|\psi_k(t)|:t\in A^{+}\cup A^{-}\}\big)^{\alpha}\le\Gamma\big(|\rho_0|\min\{|\psi_k(t)|:t\in A^{+}\cup A^{-}\}\big)^{\alpha}\le\rho_0,$$
i.e.,
Tρ(C)⊆C. |
According to Schauder's fixed-point theorem, Tρ has a fixed point x∗ in C, i.e., Tρx∗=x∗. It can be seen from Lemma 2.10 that the problem (1.1) is equivalent to the following system
$$\Psi(\rho,x^{*})=\theta,\quad(\rho,x^{*})\in S_{\bar g},$$
where
S¯g:={(ρ,x∗)∈R×Im(I−P)|x∗=MHA(ρψ∗k+x∗)}, |
$$\Psi(\rho,x^{*}):=\sum_{s=1}^{T}\psi_k(s)f(s,\rho\psi_k(s)+x(s)).$$
The set Ω¯g in Theorem 3.3 can be given by Ω¯g=Ψ(S¯g); the problem (1.1) then has a solution for every θ∈Ω¯g.
From (3.6), (3.7) and A+∪A−≠∅, we can deduce that for any x∗∈K
$$\sum_{s=1}^{T}\psi_k(s)f(s,-\rho_0\psi_k(s)+x(s))<0,\qquad\sum_{s=1}^{T}\psi_k(s)f(s,\rho_0\psi_k(s)+x(s))>0.$$
Thus,
Ψ(−ρ0,x∗)<0<Ψ(ρ0,x∗), ∀x∗∈K. | (3.8) |
According to Lemma 2.2, S¯g ⊂ ℝ×B̄ρ0 contains a connected branch ξ−ρ0,ρ0 connecting {−ρ0}×C and {ρ0}×C. Combined with (3.8), Ω¯g contains θ=0 and has a non-empty interior.
Theorem 3.4. Suppose that (A1), (A2) and (H2)–(H4) hold, and let Ω¯g be as in Theorem 3.3. Then there exists a nonempty set Ω∗¯g⊂Ω¯g∖{0} such that the problem (1.1) has at least two solutions for every θ∈Ω∗¯g.
Proof We prove only the case of (1.3); the case of (1.4) can be proved similarly. Since the condition (H4) implies (H1), by Theorem 3.3 there exists ρ0>0 such that
Ψ(ρ0,x∗)>0, ∀x∗∈K. |
Let
δ:=min{Ψ(ρ0,x∗)|x∗∈K}, |
then δ>0.
Next, we prove that problem (1.1) has at least two solutions for any θ∈(0,δ).
Let
S¯g:={(ρ,x∗)∈R×Im(I−P)|x∗=MHA(ρψ∗k+x∗)}, |
¯K:={x∗∈Im(I−P)|(ρ,x∗)∈S¯g}. |
By (H4), there exists a constant A0 such that
‖x∗‖≤A0, ∀x∗∈K. |
Similar to the derivation of Theorem 3.3, there exists ρ∗>ρ0 such that the following results hold:
(ⅰ) For ρ≥ρ∗, there is
ψk(t)f(t,ρψk(t)+x(t))>0, ∀t∈A+∪A−, x∗∈¯K, | (3.9) |
(ⅱ) For ρ≤−ρ∗, there is
ψk(t)f(t,ρψk(t)+x(t))<0, ∀t∈A+∪A−, x∗∈¯K. | (3.10) |
Let
C∗:={x∗∈Im(I−P)|‖x∗‖≤A0}. |
According to (H4), (3.9) and (3.10), we have
$$\lim_{|\rho|\to\infty}\sum_{s=1}^{T}\psi_k(s)f(s,\rho\psi_k(s)+x(s))=0$$
uniformly for x∗∈¯K, i.e.
lim|ρ|→∞Ψ(ρ,x∗)=0, x∗∈¯K. |
Therefore, there exists a constant l with l>ρ∗>ρ0>0 such that S¯g contains a connected branch ξ−l,l connecting {−l}×C∗ and {l}×C∗, and
$$\max\{|\Psi(\rho,x^{*})|:\rho=\pm l,\ (\rho,x^{*})\in\xi_{-l,l}\}\le\max\{|\Psi(\rho,x^{*})|:(\rho,x^{*})\in\{-l,l\}\times\overline{K}\}\le\frac{\theta}{3}.$$
It can be seen from the connectivity of ξ−l,l that there exist (ρ1,x∗1) and (ρ2,x∗2) in ξ−l,l(⊂S¯g), such that
Ψ(ρ1,x∗1)=θ, Ψ(ρ2,x∗2)=θ, |
where ρ1∈(−l,ρ0),ρ2∈(ρ0,l). It can be proved that ρ1ψ∗k+x∗1 and ρ2ψ∗k+x∗2 are two different solutions of problem (1.1).
In this section, we give concrete examples of the application of our main results, Theorems 3.3 and 3.4. We choose T=3, a0=d0=b1=c1=0 and a1=d1=b0=c0=1, which implies that the interval becomes [1,3]Z and that the conditions (A1), (A2) hold.
First, we consider the eigenpairs of the corresponding linear problem
$$\begin{cases}-\Delta^{2}y(t-1)=\lambda y(t), & t\in[1,3]_{\mathbb{Z}},\\ y(0)=\lambda\Delta y(0),\quad \lambda y(4)=\nabla y(4).\end{cases}\tag{4.1}$$
Define the equivalent matrix of (4.1) as follows:
$$A_\lambda=\begin{pmatrix}\lambda-2+\dfrac{\lambda}{1+\lambda} & 1 & 0\\ 1 & \lambda-2 & 1\\ 0 & 1 & \lambda-2+\dfrac{1}{1-\lambda}\end{pmatrix}.$$
Consequently, A_λ y=0 is equivalent to (4.1). Setting |A_λ|=0, we obtain
λ1=−1.4657,λ2=0.1149,λ3=0.8274,λ4=2.0911,λ5=3.4324, |
which are the eigenvalues of (4.1). Next, we choose λ=λ1=−1.4657, then we obtain the corresponding eigenfunction
$$\psi_1(t)=\begin{cases}1, & t=1,\\ 3.4657, & t=2,\\ 3.4657^{2}-1, & t=3.\end{cases}$$
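As a cross-check, these eigenvalues are the roots of |A_λ|=0. A short SymPy sketch (not part of the paper) clears the denominators and computes the roots numerically, which can be compared with λ1,…,λ5 above:

```python
import sympy as sp

lam = sp.symbols('lambda')
a = lam - 2 + lam / (1 + lam)          # (1,1) entry of A_lambda
b = lam - 2                            # (2,2) entry
c = lam - 2 + 1 / (1 - lam)            # (3,3) entry
A = sp.Matrix([[a, 1, 0], [1, b, 1], [0, 1, c]])

det = sp.together(A.det())             # determinant as a single rational function
numer, _ = sp.fraction(det)            # its zeros are the zeros of the numerator
roots = sp.Poly(sp.expand(numer), lam).nroots()
print([sp.N(root, 5) for root in roots])   # compare with lambda_1, ..., lambda_5 above
```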
Example 4.1. Consider the following problem
$$\begin{cases}-\Delta^{2}y(t-1)=-1.4657y(t)+f(t,y(t))+\psi_1(t)+\bar g(t), & t\in[1,3]_{\mathbb{Z}},\\ y(0)=-1.4657\Delta y(0),\quad -1.4657y(4)=\nabla y(4),\end{cases}\tag{4.2}$$
where
$$f(t,s)=\begin{cases}ts^{3}, & s\in[-1,1],\\ t\sqrt[5]{s}, & s\in(-\infty,-1)\cup(1,+\infty),\end{cases}$$
and
$$\bar g(t)=\begin{cases}0, & t=1,\\ 3.4657^{2}-1, & t=2,\\ -3.4657, & t=3.\end{cases}$$
Then, for f(t,y(t)), we have $|f(t,y(t))|\le 3|y(t)|^{1/3}$. If we choose ω=1, then yf(t,y)>0 for |y(t)|>1. For ¯g(t), we have $\sum_{s=1}^{3}\bar g(s)\psi_1(s)=0$.
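Indeed, the orthogonality condition (H3) can be spelled out from the stated values of ¯g and ψ1:
$$\sum_{s=1}^{3}\bar g(s)\psi_1(s)=0\cdot 1+(3.4657^{2}-1)\cdot 3.4657+(-3.4657)\cdot(3.4657^{2}-1)=0.$$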
Therefore, the problem (4.2) satisfies the conditions (A1),(A2), (H1)–(H3), which implies that the problem (4.2) has at least one solution by Theorem 3.3.
Example 4.2. Consider the following problem
$$\begin{cases}-\Delta^{2}y(t-1)=-1.4657y(t)+f(t,y(t))+\psi_1(t)+\bar g(t), & t\in[1,3]_{\mathbb{Z}},\\ y(0)=-1.4657\Delta y(0),\quad -1.4657y(4)=\nabla y(4),\end{cases}\tag{4.3}$$
where
$$f(t,s)=\frac{ts}{e^{|s|}},\quad t\in[1,3]_{\mathbb{Z}},$$
and
$$\bar g(t)=\begin{cases}0, & t=1,\\ 1-3.4657^{2}, & t=2,\\ 3.4657, & t=3.\end{cases}$$
Then, for f(t,y(t)), we always have yf(t,y)>0 for all y≠0, f is continuous, and it satisfies
$$\lim_{|y|\to\infty}f(t,y)=0.$$
For ¯g(t), we have $\sum_{s=1}^{3}\bar g(s)\psi_1(s)=0$.
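Again, (H3) and (H4) can be spelled out from the stated data (using the form of f given above):
$$\sum_{s=1}^{3}\bar g(s)\psi_1(s)=0\cdot 1+(1-3.4657^{2})\cdot 3.4657+3.4657\cdot(3.4657^{2}-1)=0,\qquad |f(t,s)|=\frac{|t|\,|s|}{e^{|s|}}\le\frac{3|s|}{e^{|s|}}\to 0\ \text{as}\ |s|\to\infty.$$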
Therefore, the problem (4.3) satisfies the conditions (A1),(A2), (H2)–(H4), which implies that the problem (4.3) has at least two solutions by Theorem 3.4.
The authors declare that they have not used Artificial Intelligence (AI) tools in the creation of this article.
This work was supported by the National Natural Science Foundation of China (Grant No. 11961060) and the Natural Science Foundation of Qinghai Province (No. 2024-ZJ-931).
The authors declare that there are no conflicts of interest.