
Deep learning (DL), a branch of machine learning and artificial intelligence, is now considered a core technology. Originating from artificial neural networks, DL has become a hot topic in computing because of its ability to learn from data, and it is widely applied across application areas. However, building an appropriate DL model is a challenging task due to the dynamic nature of, and variation in, real-world problems and data. The aim of this work is to develop a new method for selecting an appropriate DL model using complex spherical fuzzy rough sets (CSFRSs). The connectivity of two or more complex spherical fuzzy rough numbers can be defined by using the Hamacher t-norm and t-conorm, and the Hamacher operational laws with operational parameters provide exceptional flexibility in dealing with uncertainty in data. Based on the Hamacher t-norm and t-conorm, we define a series of Hamacher averaging and geometric aggregation operators for CSFRSs and establish their fundamental properties. Building on the proposed aggregation operators, we then present a group decision-making approach for solving decision-making problems. Finally, a comparative analysis with existing methods demonstrates the distinctive features of our proposed method.
Citation: Muhammad Ali Khan, Saleem Abdullah, Alaa O. Almagrabi. Analysis of deep learning technique using a complex spherical fuzzy rough decision support model. AIMS Mathematics, 2023, 8(10): 23372-23402. doi: 10.3934/math.20231188
This paper considers the following heteroscedastic model:
$$Y_i = f(X_i)U_i + g(X_i), \qquad i \in \{1, \cdots, n\}. \tag{1.1}$$
In this equation, $g(x)$ is a known mean function, and the variance function $r(x)$ ($r(x) := f^2(x)$) is unknown. Both the mean function $g(x)$ and the variance function $r(x)$ are defined on $[0,1]$. The random variables $U_1, \dots, U_n$ are independent and identically distributed (i.i.d.) with $E[U_i] = 0$ and $V[U_i] = 1$. Furthermore, the random variable $X_i$ is independent of $U_i$ for any $i \in \{1, \cdots, n\}$. The purpose of this paper is to estimate the $m$th derivative function $r^{(m)}(x)$ ($m \in \mathbb{N}$) from the observed data $(X_1, Y_1), \cdots, (X_n, Y_n)$ by a wavelet method.
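To make the model concrete, the following minimal Python sketch simulates data from (1.1); the particular choices $f(x) = \sin(2\pi x)$ and $g(x) = x^2$ are hypothetical illustrations, not functions used in this paper.

```python
# Minimal simulation sketch of model (1.1): Y_i = f(X_i) U_i + g(X_i).
# The choices of f and g below are illustrative assumptions only.
import numpy as np

rng = np.random.default_rng(0)
n = 4096
X = rng.uniform(0.0, 1.0, n)           # uniform design, as assumed later in A4
U = rng.standard_normal(n)             # E[U] = 0, V[U] = 1
f = lambda x: np.sin(2 * np.pi * x)    # hypothetical f; variance r(x) = f(x)**2
g = lambda x: x ** 2                   # hypothetical known mean function
Y = f(X) * U + g(X)                    # observed responses
```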
Heteroscedastic models are widely used in economics, engineering, biology, physical sciences and so on; see Box [1], Carroll and Ruppert [2], Härdle and Tsybakov [3], Fan and Yao [4], Quevedo and Vining [5] and Amerise [6]. For the above estimation model (1.1), the most popular method is the kernel method. Many important and interesting results of kernel estimators have been obtained by Wang et al. [7], Kulik and Wichelhaus [8] and Shen et al. [9]. However, the optimal bandwidth parameter of the kernel estimator is not easily obtained in some cases, especially when the function has some sharp spikes. Because of its good local properties in both time and frequency domains, the wavelet method has been widely used in nonparametric estimation problems; see Donoho and Johnstone [10], Cai [11], Nason et al. [12], Cai and Zhou [13], Abry and Didier [14] and Li and Zhang [15]. For the estimation problem (1.1), Kulik and Raimondo [16] studied the adaptive properties of warped wavelet nonlinear approximations over a wide range of Besov scales. Zhou et al. [17] developed wavelet estimators for detecting and estimating jumps and cusps in the mean function. Palanisamy and Ravichandran [18] proposed a data-driven estimator by applying wavelet thresholding along with the technique of sparse representation. The asymptotic normality of wavelet estimators of the variance function under an $\alpha$-mixing condition was obtained by Ding and Chen [19].
In this paper, we focus on nonparametric estimation of the derivative function $r^{(m)}(x)$ of the variance function $r(x)$. It is well known that derivative estimation plays an important and useful role in many practical applications (Woltring [20], Zhou and Wolfe [21], Chacón and Duong [22], Wei et al. [23]). For the estimation model (1.1), a linear wavelet estimator and an adaptive nonlinear wavelet estimator for the derivative function $r^{(m)}(x)$ are constructed. Moreover, the convergence rates over $L^{\tilde p}$ ($1 \le \tilde p < \infty$) risk of the two wavelet estimators are proved in the Besov space $B^s_{p,q}(\mathbb{R})$ under some mild conditions. Finally, numerical experiments are carried out, where an automatic selection method is used to obtain the best parameters of the two wavelet estimators. According to the simulation study, both wavelet estimators can efficiently estimate the derivative function, and the nonlinear wavelet estimator shows better performance than the linear estimator.
This paper considers wavelet estimation of a derivative function in Besov space. We first introduce some basic concepts of wavelets. Let $\phi$ be an orthonormal scaling function, and denote the corresponding wavelet function by $\psi$. It is well known that $\{\phi_{\tau,k} := 2^{\tau/2}\phi(2^{\tau}x - k),\ \psi_{j,k} := 2^{j/2}\psi(2^{j}x - k),\ j \ge \tau,\ k \in \mathbb{Z}\}$ forms an orthonormal basis of $L^2(\mathbb{R})$. This paper uses the Daubechies wavelet, which is compactly supported. Then, for any integer $j_*$, a function $h(x) \in L^2([0,1])$ can be expanded into a wavelet series as
$$h(x) = \sum_{k \in \Lambda_{j_*}} \alpha_{j_*,k}\,\phi_{j_*,k}(x) + \sum_{j=j_*}^{\infty}\sum_{k \in \Lambda_j} \beta_{j,k}\,\psi_{j,k}(x), \qquad x \in [0,1]. \tag{1.2}$$
In this equation, $\Lambda_j = \{0, 1, \dots, 2^j - 1\}$, $\alpha_{j_*,k} = \langle h, \phi_{j_*,k} \rangle_{[0,1]}$ and $\beta_{j,k} = \langle h, \psi_{j,k} \rangle_{[0,1]}$.
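As a discrete analogue of the expansion (1.2), the following sketch decomposes a sampled function into Daubechies wavelet coefficients and reconstructs it. It assumes the PyWavelets package; the wavelet `'db4'`, the decomposition level and the sampling grid are illustrative choices.

```python
# Sketch: discrete wavelet expansion and reconstruction of a sampled h on [0,1],
# the discrete counterpart of the series (1.2). 'db4' is an illustrative choice.
import numpy as np
import pywt

x = np.linspace(0.0, 1.0, 1024, endpoint=False)
h = np.sin(2 * np.pi * x)                    # any h in L2([0,1])
coeffs = pywt.wavedec(h, 'db4', level=5)     # [approx (alpha), details (beta) per level]
h_rec = pywt.waverec(coeffs, 'db4')          # inverse transform
assert np.allclose(h, h_rec[:len(h)], atol=1e-8)
```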
Lemma 1.1. Let a scaling function $\phi$ be $t$-regular (i.e., $\phi \in C^t$ and $|D^{\alpha}\phi(x)| \le c(1+|x|^2)^{-l}$ for each $l \in \mathbb{Z}$ and $\alpha = 0, 1, \dots, t$). If $\{\alpha_k\} \in \ell^p$ and $1 \le p \le \infty$, then there exist $c_2 \ge c_1 > 0$ such that
$$c_1 2^{j(\frac{1}{2} - \frac{1}{p})}\|(\alpha_k)\|_p \le \Big\|\sum_{k \in \Lambda_j} \alpha_k 2^{j/2}\phi(2^j x - k)\Big\|_p \le c_2 2^{j(\frac{1}{2} - \frac{1}{p})}\|(\alpha_k)\|_p.$$
Besov spaces contain many classical function spaces, such as the well-known Sobolev and Hölder spaces. The following lemma gives an important equivalent definition of a Besov space. More details about wavelets and Besov spaces can be found in Meyer [24] and Härdle et al. [25].
Lemma 1.2. Let $\phi$ be $t$-regular and $h \in L^p([0,1])$. Then, for $p, q \in [1,\infty)$ and $0 < s < t$, the following assertions are equivalent:
(i) $h \in B^s_{p,q}([0,1])$;
(ii) $\{2^{js}\|h - P_j h\|_p\} \in \ell^q$;
(iii) $\{2^{j(s - \frac{1}{p} + \frac{1}{2})}\|\beta_{j,k}\|_p\} \in \ell^q$.
The Besov norm of $h$ can be defined by
$$\|h\|_{B^s_{p,q}} = \|(\alpha_{\tau,k})\|_p + \Big\|\big(2^{j(s - \frac{1}{p} + \frac{1}{2})}\|\beta_{j,k}\|_p\big)_{j \ge \tau}\Big\|_q,$$
where $\|\beta_{j,k}\|_p^p = \sum_{k \in \Lambda_j}|\beta_{j,k}|^p$.
In this section, we construct our wavelet estimators and state the main theorem of this paper, which gives the convergence rates of the estimators under some mild assumptions. We first list the technical assumptions on the estimation model (1.1).
A1: The variance function $r\colon [0,1] \to \mathbb{R}$ is bounded.
A2: For any $i \in \{0, \dots, m-1\}$, the variance function $r$ satisfies $r^{(i)}(0) = r^{(i)}(1) = 0$.
A3: The mean function $g\colon [0,1] \to \mathbb{R}$ is bounded and known.
A4: The random variable $X$ satisfies $X \sim U([0,1])$.
A5: The random variable $U$ has a moment of order $2\tilde p$ ($\tilde p \ge 1$).
In the above assumptions, A1 and A3 are conventional conditions for nonparametric estimation. Condition A2 is used to prove the unbiasedness of the following wavelet estimators. In addition, A4 and A5 are technical assumptions, which will be used in Lemmas 4.3 and 4.5.
According to the model (1.1), our linear wavelet estimator is constructed by
$$\hat r_n^{lin}(x) := \sum_{k \in \Lambda_{j_*}} \hat\alpha_{j_*,k}\,\phi_{j_*,k}(x). \tag{2.1}$$
In this definition, the scale parameter $j_*$ will be given in the following main theorem, and
$$\hat\alpha_{j,k} := \frac{1}{n}\sum_{i=1}^{n} Y_i^2(-1)^m\phi_{j,k}^{(m)}(X_i) - \int_0^1 g^2(x)(-1)^m\phi_{j,k}^{(m)}(x)\,dx. \tag{2.2}$$
More importantly, it should be pointed out that this linear wavelet estimator is an unbiased estimator of the derivative function $r^{(m)}(x)$ by Lemma 4.1 and the properties of wavelets.
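The sketch below evaluates the empirical coefficient (2.2) in the simplest case $m = 0$ (variance estimation), approximating the scaling function on a dyadic grid via PyWavelets. The wavelet `'db4'`, the interpolation scheme and the midpoint quadrature are assumptions made for illustration, not part of the paper's construction.

```python
# Sketch of the empirical coefficient (2.2) for m = 0, where
# phi_{j,k}(x) = 2^{j/2} phi(2^j x - k). 'db4' and the grid level are assumptions.
import numpy as np
import pywt

wav = pywt.Wavelet('db4')
phi_vals, _, t = wav.wavefun(level=10)       # samples of phi on its support

def phi_jk(x, j, k):
    """Evaluate phi_{j,k}(x) by linear interpolation of the sampled phi."""
    return 2 ** (j / 2) * np.interp(2 ** j * x - k, t, phi_vals, left=0.0, right=0.0)

def alpha_hat(j, k, X, Y, g, n_quad=4096):
    """Empirical alpha_{j,k} from data (X, Y) and the known mean function g."""
    u = (np.arange(n_quad) + 0.5) / n_quad   # midpoint rule on [0,1]
    correction = np.mean(g(u) ** 2 * phi_jk(u, j, k))  # approx. of the integral term
    return np.mean(Y ** 2 * phi_jk(X, j, k)) - correction
```

For $m \ge 1$ one would replace $\phi_{j,k}$ by its $m$th derivative (with the $(-1)^m$ sign), which requires a smoother wavelet family.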
On the other hand, a nonlinear wavelet estimator is defined by
$$\hat r_n^{non}(x) := \sum_{k \in \Lambda_{j_*}} \hat\alpha_{j_*,k}\,\phi_{j_*,k}(x) + \sum_{j=j_*}^{j_1}\sum_{k \in \Lambda_j} \hat\beta_{j,k}\,I_{\{|\hat\beta_{j,k}| \ge \kappa t_n\}}\,\psi_{j,k}(x). \tag{2.3}$$
In this equation, $I_A$ denotes the indicator function of an event $A$, $t_n = 2^{mj}\sqrt{\ln n/n}$,
$$\hat\beta_{j,k} := \frac{1}{n}\sum_{i=1}^{n}\big(Y_i^2(-1)^m\psi_{j,k}^{(m)}(X_i) - w_{j,k}\big)\,I_{\{|Y_i^2(-1)^m\psi_{j,k}^{(m)}(X_i) - w_{j,k}| \le \rho_n\}}, \tag{2.4}$$
$\rho_n = 2^{mj}\sqrt{n/\ln n}$, and $w_{j,k} = \int_0^1 g^2(x)(-1)^m\psi_{j,k}^{(m)}(x)\,dx$. The positive integers $j_*$ and $j_1$ will also be given in our main theorem, and the constant $\kappa$ will be chosen in Lemma 4.5. In addition, we adopt the following notation: $x_+ := \max\{x, 0\}$; $A \lesssim B$ denotes $A \le cB$ for some constant $c > 0$; $A \gtrsim B$ means $B \lesssim A$; and $A \sim B$ stands for both $A \lesssim B$ and $B \lesssim A$.
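The hard-thresholding rule in (2.3) and (2.4) can be sketched in a few lines; the value of $\kappa$ below is an arbitrary illustrative choice, since the theory only requires $\kappa$ to be large enough (Lemma 4.5).

```python
# Sketch of the level-j hard threshold used in (2.3):
# keep beta_hat_{j,k} only when |beta_hat_{j,k}| >= kappa * t_n,
# with t_n = 2^{m j} * sqrt(ln n / n). kappa = 2.0 is an illustrative choice.
import numpy as np

def threshold_betas(beta_hat, j, m, n, kappa=2.0):
    t_n = 2.0 ** (m * j) * np.sqrt(np.log(n) / n)
    return np.where(np.abs(beta_hat) >= kappa * t_n, beta_hat, 0.0)
```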
We can now state the convergence rates of the two wavelet estimators in the following main theorem.
Main theorem. Consider the estimation model (1.1) with the assumptions A1–A5, $r^{(m)}(x) \in B^s_{p,q}([0,1])$ ($p, q \in [1,\infty)$, $s > 0$) and $1 \le \tilde p < \infty$. If $\{p > \tilde p \ge 1,\ s > 0\}$ or $\{1 \le p \le \tilde p,\ s > 1/p\}$, then:
(a) the linear wavelet estimator $\hat r_n^{lin}(x)$ with $s' = s - (\frac{1}{p} - \frac{1}{\tilde p})_+$ and $2^{j_*} \sim n^{\frac{1}{2s' + 2m + 1}}$ satisfies
$$E\big[\|\hat r_n^{lin}(x) - r^{(m)}(x)\|_{\tilde p}^{\tilde p}\big] \lesssim n^{-\frac{\tilde p s'}{2s' + 2m + 1}}; \tag{2.5}$$
(b) the nonlinear wavelet estimator $\hat r_n^{non}(x)$ with $2^{j_*} \sim n^{\frac{1}{2t + 2m + 1}}$ ($t > s$) and $2^{j_1} \sim \big(\frac{n}{\ln n}\big)^{\frac{1}{2m+1}}$ satisfies
$$E\big[\|\hat r_n^{non}(x) - r^{(m)}(x)\|_{\tilde p}^{\tilde p}\big] \lesssim (\ln n)^{\tilde p - 1}\Big(\frac{\ln n}{n}\Big)^{\tilde p \delta}, \tag{2.6}$$
where
$$\delta = \min\Big\{\frac{s}{2s + 2m + 1},\ \frac{s - 1/p + 1/\tilde p}{2(s - 1/p) + 2m + 1}\Big\} = \begin{cases} \dfrac{s}{2s + 2m + 1}, & p > \dfrac{\tilde p(2m+1)}{2s + 2m + 1},\\[2mm] \dfrac{s - 1/p + 1/\tilde p}{2(s - 1/p) + 2m + 1}, & p \le \dfrac{\tilde p(2m+1)}{2s + 2m + 1}. \end{cases}$$
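For a quick numerical reading of the theorem, the exponent $\delta$ is easy to evaluate; the sketch below simply transcribes the case distinction above (the sample parameter values are arbitrary).

```python
# Sketch: the rate exponent delta from the main theorem, as a function of
# smoothness s, Besov index p, risk index ptilde, and derivative order m.
def delta(s, p, ptilde, m):
    if p > ptilde * (2 * m + 1) / (2 * s + 2 * m + 1):
        return s / (2 * s + 2 * m + 1)
    return (s - 1 / p + 1 / ptilde) / (2 * (s - 1 / p) + 2 * m + 1)

print(delta(s=2.0, p=2, ptilde=2, m=1))   # illustrative parameter values
```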
Remark 1. Note that $n^{-\frac{s\tilde p}{2s+1}}$ (resp. $n^{-\frac{(s - 1/p + 1/\tilde p)\tilde p}{2(s - 1/p) + 1}}$) is the optimal convergence rate over $L^{\tilde p}$ ($1 \le \tilde p < +\infty$) risk for nonparametric wavelet estimation (Donoho et al. [26]). The linear wavelet estimator can obtain the optimal convergence rate when $p > \tilde p \ge 1$ and $m = 0$.
Remark 2. When $m = 0$, this derivative estimation problem reduces to the classical variance function estimation. Then, the convergence rates of the nonlinear wavelet estimator are the same as the optimal convergence rates of nonparametric wavelet estimation up to a $\ln n$ factor in all cases.
Remark 3. According to main theorem (a) and the definition of the linear wavelet estimator, it is easy to see that the construction of the linear wavelet estimator depends on the smoothness parameter $s$ of the unknown derivative function $r^{(m)}(x)$, which means that the linear estimator is not adaptive. In contrast, the nonlinear wavelet estimator depends only on the observed data and the sample size; hence, the nonlinear estimator is adaptive. More importantly, the nonlinear wavelet estimator has a better convergence rate than the linear estimator in the case $p \le \tilde p$.
In order to illustrate the empirical performance of the proposed estimators, we present a numerical study using an adaptive selection method to obtain the best parameters of the wavelet estimators. For the problem (1.1), we choose three common functions, HeaviSine, Corner and Spikes, as the mean function $g(x)$; see Figure 1. These functions are often used in the wavelet literature. On the other hand, we choose the function $f(x)$ to be $f_1(x) = 3(4x-2)^2 e^{-(4x-2)^2}$, $f_2(x) = \sin(2\pi\sin\pi x)$ and $f_3(x) = -(2x-1)^2 + 1$, respectively. In addition, we assume that the random variable $U$ satisfies $U \sim N(0,1)$. The aim is to estimate the derivative function $r^{(m)}(x)$ of the variance function $r(x)$ ($r = f^2$) from the observed data $(X_1, Y_1), \dots, (X_n, Y_n)$. In this section, we adopt $r_1(x) = [f_1(x)]^2$, $r_2(x) = [f_2(x)]^2$ and $r_3(x) = [f_3(x)]^2$. For the sake of simplicity, our simulation study focuses on the derivative function $r'(x)$ ($m = 1$) and on $r(x)$ ($m = 0$), with sample size $n = 4096$. Furthermore, we use the mean squared error $\mathrm{MSE}(\hat r, r) = \frac{1}{n}\sum_{i=1}^{n}(\hat r(X_i) - r(X_i))^2$ and the average magnitude of error $\mathrm{AME}(\hat r, r) = \frac{1}{n}\sum_{i=1}^{n}|\hat r(X_i) - r(X_i)|$ to evaluate the performance of the wavelet estimators.
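Both criteria are one-liners; the following sketch simply restates the two formulas above.

```python
# Sketch of the two error criteria used in the simulation study.
import numpy as np

def mse(r_hat_vals, r_vals):
    """Mean squared error over the sample points."""
    return np.mean((r_hat_vals - r_vals) ** 2)

def ame(r_hat_vals, r_vals):
    """Average magnitude of error over the sample points."""
    return np.mean(np.abs(r_hat_vals - r_vals))
```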
For the linear and nonlinear wavelet estimators, the scale parameter $j_*$ and the threshold value $\lambda$ ($\lambda = \kappa t_n$) play important roles in the function estimation problem. In order to obtain the optimal scale parameter and threshold value of the wavelet estimators, this section uses the two-fold cross validation (2FCV) approach (Nason [27], Navarro and Saumard [28]). In the first example of the simulation study, we choose HeaviSine as the mean function $g(x)$, and $f_1(x) = 3(4x-2)^2 e^{-(4x-2)^2}$. The estimation results of the two wavelet estimators are presented in Figure 2. For the optimal scale parameter $j_*$ of the linear wavelet estimator, we consider the candidates $j_* = 1, \dots, \log_2(n) - 1$. The best parameter $j_*$ is selected by minimizing a 2FCV criterion denoted by $\mathrm{2FCV}(j_*)$; see Figure 2(a). According to Figure 2(a), both $\mathrm{2FCV}(j_*)$ and the MSE attain their minimum at $j_* = 4$. For the nonlinear wavelet estimator, the best threshold value $\lambda$ is also obtained by the $\mathrm{2FCV}(\lambda)$ criterion in Figure 2(b). Meanwhile, the parameter $j_*$ is the same as for the linear estimator, and the parameter $j_1$ is chosen as the maximum scale parameter $\log_2(n) - 1$. From Figure 2(c) and 2(d), the linear and nonlinear wavelet estimators both perform well with the best scale parameter and threshold value. More importantly, the nonlinear wavelet estimator shows better performance than the linear estimator.
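A simplified variant of the 2FCV idea can be sketched as follows: split the sample into two halves, fit on one half and score on the other, and average the two directions. The `fit` and `score` callables are hypothetical interfaces standing in for the wavelet estimator and an error criterion, and the split by even/odd ranks is one simple choice; the criterion of Nason [27] is more refined.

```python
# Simplified two-fold cross-validation sketch for choosing a tuning parameter
# (e.g., the scale j* or the threshold lambda). 'fit' and 'score' are
# hypothetical user-supplied callables, not functions from the paper.
import numpy as np

def two_fold_cv(X, Y, fit, score, param_grid):
    order = np.argsort(X)                    # split by even/odd ranks of X
    a, b = order[::2], order[1::2]
    best_param, best_err = None, np.inf
    for param in param_grid:
        err = (score(fit(X[a], Y[a], param), X[b], Y[b]) +
               score(fit(X[b], Y[b], param), X[a], Y[a])) / 2.0
        if err < best_err:
            best_param, best_err = param, err
    return best_param
```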
In the following simulation study, more numerical experiments are presented to further verify the performance of the wavelet method. According to Figures 3–10, both wavelet estimators obtain good performance in the different cases. In particular, the nonlinear wavelet estimator gives better estimation results than the linear estimator. The MSE and AME of the wavelet estimators in all examples are provided in Table 1, from which it is easy to see that the nonlinear wavelet estimators perform better than the linear estimators.
Table 1. MSE and AME of the wavelet estimators for each mean function $g$ (HeaviSine, Corner, Spikes) and variance function ($r_1$, $r_2$, $r_3$).

| | HeaviSine | | | Corner | | | Spikes | | |
| | $r_1$ | $r_2$ | $r_3$ | $r_1$ | $r_2$ | $r_3$ | $r_1$ | $r_2$ | $r_3$ |
| $\mathrm{MSE}(\hat r_{lin}, r)$ | 0.0184 | 0.0073 | 0.0071 | 0.0189 | 0.0075 | 0.0064 | 0.0189 | 0.0069 | 0.0052 |
| $\mathrm{MSE}(\hat r_{non}, r)$ | 0.0048 | 0.0068 | 0.0064 | 0.0044 | 0.0070 | 0.0057 | 0.0042 | 0.0061 | 0.0046 |
| $\mathrm{MSE}(\hat r'_{lin}, r')$ | 0.7755 | 0.0547 | 0.0676 | 0.7767 | 0.1155 | 0.0737 | 0.7360 | 0.2566 | 0.0655 |
| $\mathrm{MSE}(\hat r'_{non}, r')$ | 0.2319 | 0.0573 | 0.0560 | 0.2204 | 0.0644 | 0.0616 | 0.2406 | 0.2868 | 0.0539 |
| $\mathrm{AME}(\hat r_{lin}, r)$ | 0.0935 | 0.0653 | 0.0652 | 0.0973 | 0.0667 | 0.0615 | 0.0964 | 0.0621 | 0.0550 |
| $\mathrm{AME}(\hat r_{non}, r)$ | 0.0506 | 0.0641 | 0.0619 | 0.0486 | 0.0649 | 0.0583 | 0.0430 | 0.0595 | 0.0518 |
| $\mathrm{AME}(\hat r'_{lin}, r')$ | 0.6911 | 0.1876 | 0.2348 | 0.7021 | 0.2686 | 0.2451 | 0.6605 | 0.4102 | 0.2320 |
| $\mathrm{AME}(\hat r'_{non}, r')$ | 0.3595 | 0.1862 | 0.2125 | 0.3450 | 0.2020 | 0.2229 | 0.3696 | 0.4198 | 0.2095 |
Now, we provide some lemmas for the proof of the main theorem.
Lemma 4.1. For the model (1.1) with A2 and A4,
$$E[\hat\alpha_{j,k}] = \alpha_{j,k}, \tag{4.1}$$
$$E\Big[\frac{1}{n}\sum_{i=1}^{n}\big(Y_i^2(-1)^m\psi_{j,k}^{(m)}(X_i) - w_{j,k}\big)\Big] = \beta_{j,k}. \tag{4.2}$$
Proof. According to the definition of $\hat\alpha_{j,k}$,
$$\begin{aligned} E[\hat\alpha_{j,k}] &= E\Big[\frac{1}{n}\sum_{i=1}^{n} Y_i^2(-1)^m\phi_{j,k}^{(m)}(X_i) - \int_0^1 g^2(x)(-1)^m\phi_{j,k}^{(m)}(x)\,dx\Big] \\ &= \frac{1}{n}\sum_{i=1}^{n} E\big[Y_i^2(-1)^m\phi_{j,k}^{(m)}(X_i)\big] - \int_0^1 g^2(x)(-1)^m\phi_{j,k}^{(m)}(x)\,dx \\ &= E\big[Y_1^2(-1)^m\phi_{j,k}^{(m)}(X_1)\big] - \int_0^1 g^2(x)(-1)^m\phi_{j,k}^{(m)}(x)\,dx \\ &= E\big[r(X_1)U_1^2(-1)^m\phi_{j,k}^{(m)}(X_1)\big] + 2E\big[f(X_1)U_1 g(X_1)(-1)^m\phi_{j,k}^{(m)}(X_1)\big] \\ &\quad + E\big[g^2(X_1)(-1)^m\phi_{j,k}^{(m)}(X_1)\big] - \int_0^1 g^2(x)(-1)^m\phi_{j,k}^{(m)}(x)\,dx. \end{aligned}$$
Then, it follows from A4 that
$$E\big[g^2(X_1)(-1)^m\phi_{j,k}^{(m)}(X_1)\big] = \int_0^1 g^2(x)(-1)^m\phi_{j,k}^{(m)}(x)\,dx.$$
Using the independence of $U_i$ and $X_i$,
$$E\big[r(X_1)U_1^2(-1)^m\phi_{j,k}^{(m)}(X_1)\big] = E[U_1^2]\,E\big[r(X_1)(-1)^m\phi_{j,k}^{(m)}(X_1)\big],$$
$$E\big[f(X_1)U_1 g(X_1)(-1)^m\phi_{j,k}^{(m)}(X_1)\big] = E[U_1]\,E\big[f(X_1)g(X_1)(-1)^m\phi_{j,k}^{(m)}(X_1)\big].$$
Meanwhile, the conditions $V[U_1] = 1$ and $E[U_1] = 0$ imply $E[U_1^2] = 1$. Hence, one gets
$$E[\hat\alpha_{j,k}] = E\big[r(X_1)(-1)^m\phi_{j,k}^{(m)}(X_1)\big] = \int_0^1 r(x)(-1)^m\phi_{j,k}^{(m)}(x)\,dx = (-1)^m\int_0^1 r(x)\phi_{j,k}^{(m)}(x)\,dx = \int_0^1 r^{(m)}(x)\phi_{j,k}(x)\,dx = \alpha_{j,k}$$
by the assumption A2.
On the other hand, replacing $\phi$ by $\psi$ and $\int_0^1 g^2(x)(-1)^m\phi_{j,k}^{(m)}(x)\,dx$ by $w_{j,k}$, the second equation can be proved by similar arguments.
Lemma 4.2. (Rosenthal's inequality) Let $X_1, \dots, X_n$ be independent random variables such that $E[X_i] = 0$ and $E[|X_i|^p] < \infty$. Then,
$$E\Big[\Big|\sum_{i=1}^{n} X_i\Big|^p\Big] \lesssim \begin{cases} \displaystyle\sum_{i=1}^{n} E[|X_i|^p] + \Big(\sum_{i=1}^{n} E[|X_i|^2]\Big)^{p/2}, & p > 2,\\[2mm] \Big(\displaystyle\sum_{i=1}^{n} E[|X_i|^2]\Big)^{p/2}, & 1 \le p \le 2. \end{cases}$$
Lemma 4.3. For the model (1.1) with A1–A5, $2^j \le n$ and $1 \le \tilde p < \infty$,
$$E\big[|\hat\alpha_{j,k} - \alpha_{j,k}|^{\tilde p}\big] \lesssim n^{-\tilde p/2}\,2^{\tilde p m j}, \tag{4.3}$$
$$E\big[|\hat\beta_{j,k} - \beta_{j,k}|^{\tilde p}\big] \lesssim \Big(\frac{\ln n}{n}\Big)^{\tilde p/2} 2^{\tilde p m j}. \tag{4.4}$$
Proof. By (4.1) and the independence of the random variables $X_i$ and $U_i$, one has
$$\begin{aligned} |\hat\alpha_{j,k} - \alpha_{j,k}| &= \Big|\frac{1}{n}\sum_{i=1}^{n} Y_i^2(-1)^m\phi_{j,k}^{(m)}(X_i) - \int_0^1 g^2(x)(-1)^m\phi_{j,k}^{(m)}(x)\,dx - E[\hat\alpha_{j,k}]\Big| \\ &= \frac{1}{n}\Big|\sum_{i=1}^{n}\Big(Y_i^2(-1)^m\phi_{j,k}^{(m)}(X_i) - E\big[Y_i^2(-1)^m\phi_{j,k}^{(m)}(X_i)\big]\Big)\Big| = \frac{1}{n}\Big|\sum_{i=1}^{n} A_i\Big|, \end{aligned}$$
where $A_i := Y_i^2(-1)^m\phi_{j,k}^{(m)}(X_i) - E[Y_i^2(-1)^m\phi_{j,k}^{(m)}(X_i)]$.
According to the definition of $A_i$, one knows that $E[A_i] = 0$ and
$$\begin{aligned} E[|A_i|^{\tilde p}] &= E\Big[\big|Y_i^2(-1)^m\phi_{j,k}^{(m)}(X_i) - E[Y_i^2(-1)^m\phi_{j,k}^{(m)}(X_i)]\big|^{\tilde p}\Big] \lesssim E\big[|Y_i^2(-1)^m\phi_{j,k}^{(m)}(X_i)|^{\tilde p}\big] \\ &\lesssim E\big[\big|(r(X_1)U_1^2 + g^2(X_1))(-1)^m\phi_{j,k}^{(m)}(X_i)\big|^{\tilde p}\big] \lesssim E[U_1^{2\tilde p}]\,E\big[|r(X_1)\phi_{j,k}^{(m)}(X_i)|^{\tilde p}\big] + E\big[|g^2(X_1)\phi_{j,k}^{(m)}(X_i)|^{\tilde p}\big]. \end{aligned}$$
The assumption A5 shows $E[U_1^{2\tilde p}] \lesssim 1$. Furthermore, it follows from A1 and A3 that
$$E[U_1^{2\tilde p}]\,E\big[|r(X_1)\phi_{j,k}^{(m)}(X_1)|^{\tilde p}\big] \lesssim E\big[|\phi_{j,k}^{(m)}(X_1)|^{\tilde p}\big], \qquad E\big[g^{2\tilde p}(X_1)|\phi_{j,k}^{(m)}(X_1)|^{\tilde p}\big] \lesssim E\big[|\phi_{j,k}^{(m)}(X_1)|^{\tilde p}\big].$$
In addition, A4 and the properties of the wavelet functions imply that
$$E\big[|\phi_{j,k}^{(m)}(X_i)|^{\tilde p}\big] = \int_0^1 |\phi_{j,k}^{(m)}(x)|^{\tilde p}\,dx = 2^{j(\tilde p/2 + m\tilde p - 1)}\int_0^1 |\phi^{(m)}(2^j x - k)|^{\tilde p}\,d(2^j x - k) = 2^{j(\tilde p/2 + m\tilde p - 1)}\,\|\phi^{(m)}\|_{\tilde p}^{\tilde p} \lesssim 2^{j(\tilde p/2 + m\tilde p - 1)}.$$
Hence,
$$E[|A_i|^{\tilde p}] \lesssim 2^{j(\tilde p/2 + m\tilde p - 1)}.$$
In particular, for $\tilde p = 2$, $E[|A_i|^2] \lesssim 2^{2mj}$.
Using Rosenthal's inequality and $2^j \le n$,
$$\begin{aligned} E\big[|\hat\alpha_{j,k} - \alpha_{j,k}|^{\tilde p}\big] &= \frac{1}{n^{\tilde p}}E\Big[\Big|\sum_{i=1}^{n} A_i\Big|^{\tilde p}\Big] \lesssim \begin{cases} \frac{1}{n^{\tilde p}}\Big(\sum_{i=1}^{n} E[|A_i|^{\tilde p}] + \big(\sum_{i=1}^{n} E[|A_i|^2]\big)^{\tilde p/2}\Big), & \tilde p > 2,\\[1mm] \frac{1}{n^{\tilde p}}\big(\sum_{i=1}^{n} E[|A_i|^2]\big)^{\tilde p/2}, & 1 \le \tilde p \le 2, \end{cases} \\ &\lesssim \begin{cases} \frac{1}{n^{\tilde p}}\big(n\cdot 2^{j(\tilde p/2 + m\tilde p - 1)} + (n\cdot 2^{2mj})^{\tilde p/2}\big), & \tilde p > 2,\\[1mm] \frac{1}{n^{\tilde p}}(n\cdot 2^{2mj})^{\tilde p/2}, & 1 \le \tilde p \le 2, \end{cases} \\ &\lesssim n^{-\tilde p/2}\,2^{\tilde p m j}. \end{aligned}$$
Then, the first inequality is proved.
For the second inequality, note that
$$\beta_{j,k} = E\Big[\frac{1}{n}\sum_{i=1}^{n}\big(Y_i^2(-1)^m\psi_{j,k}^{(m)}(X_i) - w_{j,k}\big)\Big] = \frac{1}{n}\sum_{i=1}^{n} E\Big[Y_i^2(-1)^m\psi_{j,k}^{(m)}(X_i) - \int_0^1 g^2(x)(-1)^m\psi_{j,k}^{(m)}(x)\,dx\Big] = \frac{1}{n}\sum_{i=1}^{n} E[K_i]$$
with (4.2) and $K_i := Y_i^2(-1)^m\psi_{j,k}^{(m)}(X_i) - \int_0^1 g^2(x)(-1)^m\psi_{j,k}^{(m)}(x)\,dx$.
Let $B_i := K_i I_{\{|K_i| \le \rho_n\}} - E[K_i I_{\{|K_i| \le \rho_n\}}]$. Then, by the definition of $\hat\beta_{j,k}$ in (2.4),
$$|\hat\beta_{j,k} - \beta_{j,k}| = \Big|\frac{1}{n}\sum_{i=1}^{n} K_i I_{\{|K_i| \le \rho_n\}} - \beta_{j,k}\Big| \le \frac{1}{n}\Big|\sum_{i=1}^{n} B_i\Big| + \frac{1}{n}\sum_{i=1}^{n} E\big[|K_i| I_{\{|K_i| > \rho_n\}}\big]. \tag{4.5}$$
Similar to the arguments for $A_i$, it is easy to see that $E[B_i] = 0$ and
$$E[|B_i|^{\tilde p}] \lesssim E\big[|K_i I_{\{|K_i| \le \rho_n\}}|^{\tilde p}\big] \lesssim E[|K_i|^{\tilde p}] \lesssim 2^{j(\tilde p/2 + m\tilde p - 1)}.$$
In particular, in the case $\tilde p = 2$, one can obtain $E[|B_i|^2] \lesssim 2^{2mj}$. On the other hand,
$$E\big[|K_i| I_{\{|K_i| > \rho_n\}}\big] \lesssim E\Big[|K_i|\cdot\frac{|K_i|}{\rho_n}\Big] = \frac{E[K_1^2]}{\rho_n} \lesssim \frac{2^{2mj}}{\rho_n} = t_n = 2^{mj}\sqrt{\frac{\ln n}{n}}. \tag{4.6}$$
According to Rosenthal's inequality and $2^j \le n$,
$$\begin{aligned} E\big[|\hat\beta_{j,k} - \beta_{j,k}|^{\tilde p}\big] &\lesssim \frac{1}{n^{\tilde p}}E\Big[\Big|\sum_{i=1}^{n} B_i\Big|^{\tilde p}\Big] + (t_n)^{\tilde p} \\ &\lesssim \begin{cases} \frac{1}{n^{\tilde p}}\Big(\sum_{i=1}^{n} E[|B_i|^{\tilde p}] + \big(\sum_{i=1}^{n} E[|B_i|^2]\big)^{\tilde p/2}\Big) + (t_n)^{\tilde p}, & \tilde p > 2,\\[1mm] \frac{1}{n^{\tilde p}}\big(\sum_{i=1}^{n} E[|B_i|^2]\big)^{\tilde p/2} + (t_n)^{\tilde p}, & 1 \le \tilde p \le 2, \end{cases} \\ &\lesssim \begin{cases} \frac{1}{n^{\tilde p}}\big(n\cdot 2^{j(\tilde p/2 + m\tilde p - 1)} + (n\cdot 2^{2mj})^{\tilde p/2}\big) + \big(\frac{\ln n}{n}\big)^{\tilde p/2} 2^{\tilde p mj}, & \tilde p > 2,\\[1mm] \frac{1}{n^{\tilde p}}(n\cdot 2^{2mj})^{\tilde p/2} + \big(\frac{\ln n}{n}\big)^{\tilde p/2} 2^{\tilde p mj}, & 1 \le \tilde p \le 2, \end{cases} \\ &\lesssim \Big(\frac{\ln n}{n}\Big)^{\tilde p/2} 2^{\tilde p m j}. \end{aligned}$$
Then, the second inequality is proved.
Lemma 4.4. (Bernstein's inequality) Let $X_1, \dots, X_n$ be independent random variables such that $E[X_i] = 0$, $|X_i| < M$ and $E[|X_i|^2] := \sigma^2$. Then, for each $\nu > 0$,
$$P\Big(\frac{1}{n}\Big|\sum_{i=1}^{n} X_i\Big| \ge \nu\Big) \le 2\exp\Big\{-\frac{n\nu^2}{2(\sigma^2 + \nu M/3)}\Big\}.$$
Lemma 4.5. For the model (1.1) with A1–A5 and $1 \le \tilde p < +\infty$, there exists a constant $\kappa > 1$ such that
$$P\big(|\hat\beta_{j,k} - \beta_{j,k}| \ge \kappa t_n\big) \lesssim n^{-\tilde p}. \tag{4.7}$$
Proof. According to (4.5), one has $K_i = Y_i^2(-1)^m\psi_{j,k}^{(m)}(X_i) - \int_0^1 g^2(x)(-1)^m\psi_{j,k}^{(m)}(x)\,dx$, $B_i = K_i I_{\{|K_i| \le \rho_n\}} - E[K_i I_{\{|K_i| \le \rho_n\}}]$ and
$$|\hat\beta_{j,k} - \beta_{j,k}| \le \frac{1}{n}\Big|\sum_{i=1}^{n} B_i\Big| + \frac{1}{n}\sum_{i=1}^{n} E\big[|K_i| I_{\{|K_i| > \rho_n\}}\big].$$
Meanwhile, (4.6) shows that there exists $c > 0$ such that $E[|K_i| I_{\{|K_i| > \rho_n\}}] \le c\,t_n$. Furthermore, the following inclusions hold:
$$\{|\hat\beta_{j,k} - \beta_{j,k}| \ge \kappa t_n\} \subseteq \Big\{\Big[\frac{1}{n}\Big|\sum_{i=1}^{n} B_i\Big| + \frac{1}{n}\sum_{i=1}^{n} E\big(|K_i| I_{\{|K_i| > \rho_n\}}\big)\Big] \ge \kappa t_n\Big\} \subseteq \Big\{\frac{1}{n}\Big|\sum_{i=1}^{n} B_i\Big| \ge (\kappa - c)t_n\Big\}.$$
Note that the definition of $B_i$ implies that $|B_i| \lesssim \rho_n$ and $E[B_i] = 0$. Using the arguments of Lemma 4.3, $E[B_i^2] := \sigma^2 \lesssim 2^{2mj}$. Furthermore, by Bernstein's inequality,
$$P\Big(\frac{1}{n}\Big|\sum_{i=1}^{n} B_i\Big| \ge (\kappa - c)t_n\Big) \lesssim \exp\Big\{-\frac{n(\kappa - c)^2 t_n^2}{2(\sigma^2 + (\kappa - c)t_n\rho_n/3)}\Big\} \lesssim \exp\Big\{-\frac{n(\kappa - c)^2\, 2^{2mj}\,\frac{\ln n}{n}}{2(2^{2mj} + (\kappa - c)\cdot 2^{2mj}/3)}\Big\} = \exp\Big\{-\frac{(\ln n)(\kappa - c)^2}{2(1 + (\kappa - c)/3)}\Big\} = n^{-\frac{(\kappa - c)^2}{2(1 + (\kappa - c)/3)}}.$$
Then, one can choose $\kappa$ large enough such that
$$P\big(|\hat\beta_{j,k} - \beta_{j,k}| \ge \kappa t_n\big) \lesssim n^{-\frac{(\kappa - c)^2}{2(1 + (\kappa - c)/3)}} \lesssim n^{-\tilde p}.$$
Proof of (a): Note that
$$\|\hat r_n^{lin}(x) - r^{(m)}(x)\|_{\tilde p}^{\tilde p} \lesssim \|\hat r_n^{lin}(x) - P_{j_*}r^{(m)}(x)\|_{\tilde p}^{\tilde p} + \|P_{j_*}r^{(m)}(x) - r^{(m)}(x)\|_{\tilde p}^{\tilde p}.$$
Hence,
$$E\big[\|\hat r_n^{lin}(x) - r^{(m)}(x)\|_{\tilde p}^{\tilde p}\big] \lesssim E\big[\|\hat r_n^{lin}(x) - P_{j_*}r^{(m)}(x)\|_{\tilde p}^{\tilde p}\big] + \|P_{j_*}r^{(m)}(x) - r^{(m)}(x)\|_{\tilde p}^{\tilde p}. \tag{4.8}$$
◼ The stochastic term $E\big[\|\hat r_n^{lin}(x) - P_{j_*}r^{(m)}(x)\|_{\tilde p}^{\tilde p}\big]$.
It follows from Lemma 1.1 that
$$E\big[\|\hat r_n^{lin}(x) - P_{j_*}r^{(m)}(x)\|_{\tilde p}^{\tilde p}\big] = E\Big[\Big\|\sum_{k \in \Lambda_{j_*}}(\hat\alpha_{j_*,k} - \alpha_{j_*,k})\phi_{j_*,k}(x)\Big\|_{\tilde p}^{\tilde p}\Big] \sim 2^{j_*(\frac{1}{2} - \frac{1}{\tilde p})\tilde p}\sum_{k \in \Lambda_{j_*}} E\big[|\hat\alpha_{j_*,k} - \alpha_{j_*,k}|^{\tilde p}\big].$$
Then, according to (4.3), $|\Lambda_{j_*}| \sim 2^{j_*}$ and $2^{j_*} \sim n^{\frac{1}{2s' + 2m + 1}}$, one gets
$$E\big[\|\hat r_n^{lin}(x) - P_{j_*}r^{(m)}(x)\|_{\tilde p}^{\tilde p}\big] \sim 2^{\frac{j_*\tilde p}{2}(2m+1)}\cdot n^{-\tilde p/2} \sim n^{-\frac{\tilde p s'}{2s' + 2m + 1}}. \tag{4.9}$$
◼ The bias term $\|P_{j_*}r^{(m)}(x) - r^{(m)}(x)\|_{\tilde p}^{\tilde p}$.
When $p > \tilde p \ge 1$, $s' = s - (\frac{1}{p} - \frac{1}{\tilde p})_+ = s$. Using the Hölder inequality, Lemma 1.2 and $r^{(m)} \in B^s_{p,q}([0,1])$,
$$\|P_{j_*}r^{(m)}(x) - r^{(m)}(x)\|_{\tilde p}^{\tilde p} \lesssim \|P_{j_*}r^{(m)}(x) - r^{(m)}(x)\|_p^{\tilde p} \lesssim 2^{-j_*\tilde p s} = 2^{-j_*\tilde p s'} \sim n^{-\frac{\tilde p s'}{2s' + 2m + 1}}.$$
When $1 \le p \le \tilde p$ and $s > \frac{1}{p}$, one knows that $B^s_{p,q}([0,1]) \subseteq B^{s'}_{\tilde p,\infty}([0,1])$ and
$$\|P_{j_*}r^{(m)}(x) - r^{(m)}(x)\|_{\tilde p}^{\tilde p} \lesssim 2^{-j_*\tilde p s'} \sim n^{-\frac{\tilde p s'}{2s' + 2m + 1}}.$$
Hence, the following inequality holds in both cases:
$$\|P_{j_*}r^{(m)}(x) - r^{(m)}(x)\|_{\tilde p}^{\tilde p} \lesssim n^{-\frac{\tilde p s'}{2s' + 2m + 1}}. \tag{4.10}$$
Finally, the results (4.8)–(4.10) show
$$E\big[\|\hat r_n^{lin}(x) - r^{(m)}(x)\|_{\tilde p}^{\tilde p}\big] \lesssim n^{-\frac{\tilde p s'}{2s' + 2m + 1}}.$$
Proof of (b): By the definitions of $\hat r_n^{lin}(x)$ and $\hat r_n^{non}(x)$, one has
$$\begin{aligned} \|\hat r_n^{non}(x) - r^{(m)}(x)\|_{\tilde p}^{\tilde p} &\lesssim \|\hat r_n^{lin}(x) - P_{j_*}r^{(m)}(x)\|_{\tilde p}^{\tilde p} + \|r^{(m)}(x) - P_{j_1+1}r^{(m)}(x)\|_{\tilde p}^{\tilde p} \\ &\quad + \Big\|\sum_{j=j_*}^{j_1}\sum_{k \in \Lambda_j}\big(\hat\beta_{j,k}I_{\{|\hat\beta_{j,k}| \ge \kappa t_n\}} - \beta_{j,k}\big)\psi_{j,k}(x)\Big\|_{\tilde p}^{\tilde p}. \end{aligned}$$
Furthermore,
$$E\big[\|\hat r_n^{non}(x) - r^{(m)}(x)\|_{\tilde p}^{\tilde p}\big] \lesssim T_1 + T_2 + Q, \tag{4.11}$$
where
$$T_1 := E\big[\|\hat r_n^{lin}(x) - P_{j_*}r^{(m)}(x)\|_{\tilde p}^{\tilde p}\big], \qquad T_2 := \|r^{(m)}(x) - P_{j_1+1}r^{(m)}(x)\|_{\tilde p}^{\tilde p},$$
$$Q := E\Big[\Big\|\sum_{j=j_*}^{j_1}\sum_{k \in \Lambda_j}\big(\hat\beta_{j,k}I_{\{|\hat\beta_{j,k}| \ge \kappa t_n\}} - \beta_{j,k}\big)\psi_{j,k}(x)\Big\|_{\tilde p}^{\tilde p}\Big].$$
◼ For $T_1$. According to (4.9) and $2^{j_*} \sim n^{\frac{1}{2t + 2m + 1}}$ ($t > s$),
$$T_1 \sim 2^{\frac{j_*\tilde p}{2}(2m+1)}\cdot n^{-\tilde p/2} \sim n^{-\frac{\tilde p t}{2t + 2m + 1}} < n^{-\frac{\tilde p s}{2s + 2m + 1}} \le n^{-\tilde p\delta}. \tag{4.12}$$
◼ For $T_2$. Using similar arguments as in (4.10), when $p > \tilde p \ge 1$, one can obtain $T_2 := \|r^{(m)}(x) - P_{j_1+1}r^{(m)}(x)\|_{\tilde p}^{\tilde p} \lesssim 2^{-j_1\tilde p s}$. This with $2^{j_1} \sim \big(\frac{n}{\ln n}\big)^{\frac{1}{2m+1}}$ leads to
$$T_2 \lesssim 2^{-j_1\tilde p s} < \Big(\frac{\ln n}{n}\Big)^{\frac{\tilde p s}{2m+1}} \le \Big(\frac{\ln n}{n}\Big)^{\frac{\tilde p s}{2s + 2m + 1}} \le \Big(\frac{\ln n}{n}\Big)^{\tilde p\delta}.$$
On the other hand, when $1 \le p \le \tilde p$ and $s > \frac{1}{p}$, one has $B^s_{p,q}([0,1]) \subseteq B^{s - \frac{1}{p} + \frac{1}{\tilde p}}_{\tilde p,\infty}([0,1])$ and
$$T_2 \lesssim 2^{-j_1\tilde p(s - 1/p + 1/\tilde p)} \sim \Big(\frac{\ln n}{n}\Big)^{\frac{\tilde p(s - 1/p + 1/\tilde p)}{2m+1}} < \Big(\frac{\ln n}{n}\Big)^{\frac{\tilde p(s - 1/p + 1/\tilde p)}{2(s - 1/p) + 2m + 1}} \le \Big(\frac{\ln n}{n}\Big)^{\tilde p\delta}.$$
Therefore, for each $1 \le \tilde p < \infty$,
$$T_2 \lesssim \Big(\frac{\ln n}{n}\Big)^{\tilde p\delta}. \tag{4.13}$$
◼ For $Q$. According to the Hölder inequality and Lemma 1.1,
$$Q \lesssim (j_1 - j_* + 1)^{\tilde p - 1}\sum_{j=j_*}^{j_1} E\Big[\Big\|\sum_{k \in \Lambda_j}\big(\hat\beta_{j,k}I_{\{|\hat\beta_{j,k}| \ge \kappa t_n\}} - \beta_{j,k}\big)\psi_{j,k}(x)\Big\|_{\tilde p}^{\tilde p}\Big] \lesssim (j_1 - j_* + 1)^{\tilde p - 1}\sum_{j=j_*}^{j_1} 2^{j(\frac{1}{2} - \frac{1}{\tilde p})\tilde p}\sum_{k \in \Lambda_j} E\big[|\hat\beta_{j,k}I_{\{|\hat\beta_{j,k}| \ge \kappa t_n\}} - \beta_{j,k}|^{\tilde p}\big].$$
Note that
$$\begin{aligned} \big|\hat\beta_{j,k}I_{\{|\hat\beta_{j,k}| \ge \kappa t_n\}} - \beta_{j,k}\big|^{\tilde p} &= |\hat\beta_{j,k} - \beta_{j,k}|^{\tilde p}\, I_{\{|\hat\beta_{j,k}| \ge \kappa t_n,\ |\beta_{j,k}| < \frac{\kappa t_n}{2}\}} + |\hat\beta_{j,k} - \beta_{j,k}|^{\tilde p}\, I_{\{|\hat\beta_{j,k}| \ge \kappa t_n,\ |\beta_{j,k}| \ge \frac{\kappa t_n}{2}\}} \\ &\quad + |\beta_{j,k}|^{\tilde p}\, I_{\{|\hat\beta_{j,k}| < \kappa t_n,\ |\beta_{j,k}| > 2\kappa t_n\}} + |\beta_{j,k}|^{\tilde p}\, I_{\{|\hat\beta_{j,k}| < \kappa t_n,\ |\beta_{j,k}| \le 2\kappa t_n\}}. \end{aligned}$$
Meanwhile,
$$\{|\hat\beta_{j,k}| \ge \kappa t_n,\ |\beta_{j,k}| < \tfrac{\kappa t_n}{2}\} \subseteq \{|\hat\beta_{j,k} - \beta_{j,k}| > \tfrac{\kappa t_n}{2}\}, \qquad \{|\hat\beta_{j,k}| < \kappa t_n,\ |\beta_{j,k}| > 2\kappa t_n\} \subseteq \{|\hat\beta_{j,k} - \beta_{j,k}| > \kappa t_n\} \subseteq \{|\hat\beta_{j,k} - \beta_{j,k}| > \tfrac{\kappa t_n}{2}\}.$$
Then, $Q$ can be decomposed as
$$Q \lesssim (j_1 - j_* + 1)^{\tilde p - 1}(Q_1 + Q_2 + Q_3), \tag{4.14}$$
where
$$\begin{aligned} Q_1 &:= \sum_{j=j_*}^{j_1} 2^{j(\frac{1}{2} - \frac{1}{\tilde p})\tilde p}\sum_{k \in \Lambda_j} E\big[|\hat\beta_{j,k} - \beta_{j,k}|^{\tilde p}\, I_{\{|\hat\beta_{j,k} - \beta_{j,k}| > \frac{\kappa t_n}{2}\}}\big],\\ Q_2 &:= \sum_{j=j_*}^{j_1} 2^{j(\frac{1}{2} - \frac{1}{\tilde p})\tilde p}\sum_{k \in \Lambda_j} E\big[|\hat\beta_{j,k} - \beta_{j,k}|^{\tilde p}\, I_{\{|\beta_{j,k}| \ge \frac{\kappa t_n}{2}\}}\big],\\ Q_3 &:= \sum_{j=j_*}^{j_1} 2^{j(\frac{1}{2} - \frac{1}{\tilde p})\tilde p}\sum_{k \in \Lambda_j} |\beta_{j,k}|^{\tilde p}\, I_{\{|\beta_{j,k}| \le 2\kappa t_n\}}. \end{aligned}$$
◼ For $Q_1$. It follows from the Hölder inequality that
$$E\big[|\hat\beta_{j,k} - \beta_{j,k}|^{\tilde p}\, I_{\{|\hat\beta_{j,k} - \beta_{j,k}| > \frac{\kappa t_n}{2}\}}\big] \le \big(E[|\hat\beta_{j,k} - \beta_{j,k}|^{2\tilde p}]\big)^{\frac{1}{2}}\Big[P\Big(|\hat\beta_{j,k} - \beta_{j,k}| > \frac{\kappa t_n}{2}\Big)\Big]^{\frac{1}{2}}.$$
By Lemma 4.3, one gets
$$E\big[|\hat\beta_{j,k} - \beta_{j,k}|^{2\tilde p}\big] \lesssim \Big(\frac{\ln n}{n}\Big)^{\tilde p}\cdot 2^{2\tilde p m j}.$$
This with Lemma 4.5, $|\Lambda_j| \sim 2^j$ and $2^{j_1} \sim \big(\frac{n}{\ln n}\big)^{\frac{1}{2m+1}}$ shows that
$$Q_1 \lesssim \sum_{j=j_*}^{j_1} 2^{j(\frac{1}{2} - \frac{1}{\tilde p})\tilde p}\, 2^j\cdot\Big(\frac{\ln n}{n}\Big)^{\tilde p/2} 2^{\tilde p m j}\cdot n^{-\tilde p/2} \lesssim n^{-\tilde p/2} < n^{-\tilde p\delta}. \tag{4.15}$$
◼ For $Q_2$. One defines
$$2^{j'} \sim \Big(\frac{n}{\ln n}\Big)^{\frac{1}{2s + 2m + 1}}.$$
Clearly, $2^{j_*} \sim n^{\frac{1}{2t + 2m + 1}}$ ($t > s$) $\le 2^{j'} \sim \big(\frac{n}{\ln n}\big)^{\frac{1}{2s + 2m + 1}} < 2^{j_1} \sim \big(\frac{n}{\ln n}\big)^{\frac{1}{2m+1}}$. Furthermore, one rewrites
$$Q_2 = \Big(\sum_{j=j_*}^{j'} + \sum_{j=j'+1}^{j_1}\Big) 2^{j(\frac{1}{2} - \frac{1}{\tilde p})\tilde p}\sum_{k \in \Lambda_j} E\big[|\hat\beta_{j,k} - \beta_{j,k}|^{\tilde p}\, I_{\{|\beta_{j,k}| \ge \frac{\kappa t_n}{2}\}}\big] := Q_{21} + Q_{22}. \tag{4.16}$$
◼ For $Q_{21}$. By Lemma 4.3 and $2^{j'} \sim \big(\frac{n}{\ln n}\big)^{\frac{1}{2s + 2m + 1}}$,
$$\begin{aligned} Q_{21} &:= \sum_{j=j_*}^{j'} 2^{j(\frac{1}{2} - \frac{1}{\tilde p})\tilde p}\sum_{k \in \Lambda_j} E\big[|\hat\beta_{j,k} - \beta_{j,k}|^{\tilde p}\, I_{\{|\beta_{j,k}| \ge \frac{\kappa t_n}{2}\}}\big] \le \sum_{j=j_*}^{j'} 2^{j(\frac{1}{2} - \frac{1}{\tilde p})\tilde p}\sum_{k \in \Lambda_j} E\big[|\hat\beta_{j,k} - \beta_{j,k}|^{\tilde p}\big] \\ &\lesssim \Big(\frac{\ln n}{n}\Big)^{\tilde p/2}\sum_{j=j_*}^{j'} 2^{\frac{j(2m+1)\tilde p}{2}} \lesssim \Big(\frac{\ln n}{n}\Big)^{\tilde p/2}\, 2^{\frac{j'(2m+1)\tilde p}{2}} \sim \Big(\frac{\ln n}{n}\Big)^{\frac{\tilde p s}{2s + 2m + 1}} \le \Big(\frac{\ln n}{n}\Big)^{\tilde p\delta}. \end{aligned} \tag{4.17}$$
◼ For $Q_{22}$. Using Lemma 4.3, one has
$$Q_{22} := \sum_{j=j'+1}^{j_1} 2^{j(\frac{1}{2} - \frac{1}{\tilde p})\tilde p}\sum_{k \in \Lambda_j} E\big[|\hat\beta_{j,k} - \beta_{j,k}|^{\tilde p}\, I_{\{|\beta_{j,k}| \ge \frac{\kappa t_n}{2}\}}\big] \lesssim \Big(\frac{\ln n}{n}\Big)^{\tilde p/2}\sum_{j=j'+1}^{j_1} 2^{j(\frac{1}{2} - \frac{1}{\tilde p})\tilde p + \tilde p m j}\sum_{k \in \Lambda_j} I_{\{|\beta_{j,k}| \ge \frac{\kappa t_n}{2}\}}.$$
When $p > \tilde p \ge 1$, by the Hölder inequality, $t_n = 2^{mj}\sqrt{\ln n/n}$, $2^{j'} \sim \big(\frac{n}{\ln n}\big)^{\frac{1}{2s + 2m + 1}}$ and Lemma 1.2, one can obtain that
$$\begin{aligned} Q_{22} &\lesssim \Big(\frac{\ln n}{n}\Big)^{\tilde p/2}\sum_{j=j'+1}^{j_1} 2^{j(\frac{1}{2} - \frac{1}{\tilde p})\tilde p + \tilde p m j}\sum_{k \in \Lambda_j}\Big(\frac{|\beta_{j,k}|}{\kappa t_n/2}\Big)^{\tilde p} \lesssim \sum_{j=j'+1}^{j_1} 2^{j(\frac{1}{2} - \frac{1}{\tilde p})\tilde p}\sum_{k \in \Lambda_j}|\beta_{j,k}|^{\tilde p} = \sum_{j=j'+1}^{j_1} 2^{j(\frac{1}{2} - \frac{1}{\tilde p})\tilde p}\,\|\beta_{j,k}\|_{\tilde p}^{\tilde p} \\ &\le \sum_{j=j'+1}^{j_1} 2^{j(\frac{1}{2} - \frac{1}{\tilde p})\tilde p}\cdot 2^{j(1 - \frac{\tilde p}{p})}\|\beta_{j,k}\|_p^{\tilde p} \lesssim \sum_{j=j'+1}^{j_1} 2^{-j\tilde p s} \lesssim 2^{-j'\tilde p s} \sim \Big(\frac{\ln n}{n}\Big)^{\frac{\tilde p s}{2s + 2m + 1}} \le \Big(\frac{\ln n}{n}\Big)^{\tilde p\delta}. \end{aligned} \tag{4.18}$$
When $1 \le p \le \tilde p$, it follows from Lemma 1.2 that
$$\begin{aligned} Q_{22} &\lesssim \Big(\frac{\ln n}{n}\Big)^{\tilde p/2}\sum_{j=j'+1}^{j_1} 2^{j(\frac{1}{2} - \frac{1}{\tilde p})\tilde p + \tilde p m j}\sum_{k \in \Lambda_j}\Big(\frac{|\beta_{j,k}|}{\kappa t_n/2}\Big)^{p} \lesssim \Big(\frac{\ln n}{n}\Big)^{\frac{\tilde p - p}{2}}\sum_{j=j'+1}^{j_1} 2^{j(\frac{1}{2} - \frac{1}{\tilde p})\tilde p + j(\tilde p - p)m}\,\|\beta_{j,k}\|_p^p \\ &\le \Big(\frac{\ln n}{n}\Big)^{\frac{\tilde p - p}{2}}\sum_{j=j'+1}^{j_1} 2^{-j(sp + \frac{p}{2} - \frac{\tilde p}{2} - (\tilde p - p)m)}. \end{aligned} \tag{4.19}$$
Take
$$\epsilon := sp - \frac{(\tilde p - p)(2m+1)}{2}.$$
Then, (4.19) can be rewritten as
$$Q_{22} \lesssim \Big(\frac{\ln n}{n}\Big)^{\frac{\tilde p - p}{2}}\sum_{j=j'+1}^{j_1} 2^{-j\epsilon}. \tag{4.20}$$
Note that $\epsilon > 0$ holds if and only if $p > \frac{\tilde p(2m+1)}{2s + 2m + 1}$; in this case, $\delta = \frac{s}{2s + 2m + 1}$ and
$$Q_{22} \lesssim \Big(\frac{\ln n}{n}\Big)^{\frac{\tilde p - p}{2}}\, 2^{-j'\epsilon} \sim \Big(\frac{\ln n}{n}\Big)^{\frac{\tilde p s}{2s + 2m + 1}} = \Big(\frac{\ln n}{n}\Big)^{\tilde p\delta}. \tag{4.21}$$
Similarly, $\epsilon \le 0$ holds if and only if $p \le \frac{\tilde p(2m+1)}{2s + 2m + 1}$; in this case, $\delta = \frac{s - 1/p + 1/\tilde p}{2(s - 1/p) + 2m + 1}$. Define
$$2^{j''} \sim \Big(\frac{n}{\ln n}\Big)^{\frac{\delta}{s - 1/p + 1/\tilde p}} = \Big(\frac{n}{\ln n}\Big)^{\frac{1}{2(s - 1/p) + 2m + 1}},$$
and obviously, $2^{j'} \sim \big(\frac{n}{\ln n}\big)^{\frac{1}{2s + 2m + 1}} < 2^{j''} \sim \big(\frac{n}{\ln n}\big)^{\frac{\delta}{s - 1/p + 1/\tilde p}} < 2^{j_1} \sim \big(\frac{n}{\ln n}\big)^{\frac{1}{2m+1}}$. Furthermore, one rewrites
$$Q_{22} = \Big(\sum_{j=j'+1}^{j''} + \sum_{j=j''+1}^{j_1}\Big) 2^{j(\frac{1}{2} - \frac{1}{\tilde p})\tilde p}\sum_{k \in \Lambda_j} E\big[|\hat\beta_{j,k} - \beta_{j,k}|^{\tilde p}\, I_{\{|\beta_{j,k}| \ge \frac{\kappa t_n}{2}\}}\big] := Q_{221} + Q_{222}. \tag{4.22}$$
For $Q_{221}$. Note that $\frac{\tilde p - p}{2} + \frac{\delta\epsilon}{s - 1/p + 1/\tilde p} = \tilde p\delta$ in the case $\epsilon \le 0$. Then, by the same arguments as for (4.20), one gets
$$Q_{221} \lesssim \Big(\frac{\ln n}{n}\Big)^{\frac{\tilde p - p}{2}}\sum_{j=j'+1}^{j''} 2^{-j\epsilon} \lesssim \Big(\frac{\ln n}{n}\Big)^{\frac{\tilde p - p}{2}}\, 2^{-j''\epsilon} \sim \Big(\frac{\ln n}{n}\Big)^{\tilde p\delta}. \tag{4.23}$$
For $Q_{222}$. The conditions $1 \le p \le \tilde p$ and $s > 1/p$ imply $B^s_{p,q}([0,1]) \subset B^{s - \frac{1}{p} + \frac{1}{\tilde p}}_{\tilde p,q}([0,1])$. Similar to (4.18), one obtains
$$Q_{222} \lesssim \Big(\frac{\ln n}{n}\Big)^{\tilde p/2}\sum_{j=j''+1}^{j_1} 2^{j(\frac{1}{2} - \frac{1}{\tilde p})\tilde p + \tilde p m j}\sum_{k \in \Lambda_j}\Big(\frac{|\beta_{j,k}|}{\kappa t_n/2}\Big)^{\tilde p} \lesssim \sum_{j=j''+1}^{j_1} 2^{j(\frac{1}{2} - \frac{1}{\tilde p})\tilde p}\,\|\beta_{j,k}\|_{\tilde p}^{\tilde p} \lesssim \sum_{j=j''+1}^{j_1} 2^{j(\frac{1}{2} - \frac{1}{\tilde p})\tilde p}\cdot 2^{-j(s - \frac{1}{p} + \frac{1}{2})\tilde p} \lesssim 2^{-j''(s - \frac{1}{p} + \frac{1}{\tilde p})\tilde p} \sim \Big(\frac{\ln n}{n}\Big)^{\tilde p\delta}. \tag{4.24}$$
Combining (4.18), (4.21), (4.23) and (4.24),
$$Q_{22} \lesssim \Big(\frac{\ln n}{n}\Big)^{\tilde p\delta}.$$
This with (4.16) and (4.17) shows that
$$Q_2 \lesssim \Big(\frac{\ln n}{n}\Big)^{\tilde p\delta}. \tag{4.25}$$
◼ For $Q_3$. According to the definition of $2^{j'}$, one can write
$$Q_3 = \Big(\sum_{j=j_*}^{j'} + \sum_{j=j'+1}^{j_1}\Big) 2^{j(\frac{1}{2} - \frac{1}{\tilde p})\tilde p}\sum_{k \in \Lambda_j}|\beta_{j,k}|^{\tilde p}\, I_{\{|\beta_{j,k}| \le 2\kappa t_n\}} := Q_{31} + Q_{32}.$$
◼ For $Q_{31}$. It is easy to see that
$$Q_{31} := \sum_{j=j_*}^{j'} 2^{j(\frac{1}{2} - \frac{1}{\tilde p})\tilde p}\sum_{k \in \Lambda_j}|\beta_{j,k}|^{\tilde p}\, I_{\{|\beta_{j,k}| \le 2\kappa t_n\}} \le \sum_{j=j_*}^{j'} 2^{j(\frac{1}{2} - \frac{1}{\tilde p})\tilde p}\sum_{k \in \Lambda_j}(2\kappa t_n)^{\tilde p} \lesssim \Big(\frac{\ln n}{n}\Big)^{\tilde p/2}\cdot 2^{\frac{(2m+1)j'\tilde p}{2}} \sim \Big(\frac{\ln n}{n}\Big)^{\frac{\tilde p s}{2s + 2m + 1}} \le \Big(\frac{\ln n}{n}\Big)^{\tilde p\delta}.$$
◼ For $Q_{32}$. One rewrites $Q_{32} = \sum_{j=j'+1}^{j_1} 2^{j(\frac{1}{2} - \frac{1}{\tilde p})\tilde p}\sum_{k \in \Lambda_j}|\beta_{j,k}|^{\tilde p}\, I_{\{|\beta_{j,k}| \le 2\kappa t_n\}}$. When $p > \tilde p \ge 1$, using the Hölder inequality and Lemma 1.2,
$$Q_{32} \le \sum_{j=j'+1}^{j_1} 2^{j(\frac{1}{2} - \frac{1}{\tilde p})\tilde p}\sum_{k \in \Lambda_j}|\beta_{j,k}|^{\tilde p} \lesssim 2^{-j'\tilde p s} \sim \Big(\frac{\ln n}{n}\Big)^{\frac{\tilde p s}{2s + 2m + 1}} \le \Big(\frac{\ln n}{n}\Big)^{\tilde p\delta}.$$
When $1 \le p \le \tilde p$, one has
$$\begin{aligned} Q_{32} &\le \sum_{j=j'+1}^{j_1} 2^{j(\frac{1}{2} - \frac{1}{\tilde p})\tilde p}\sum_{k \in \Lambda_j}|\beta_{j,k}|^{\tilde p}\Big(\frac{2\kappa t_n}{|\beta_{j,k}|}\Big)^{\tilde p - p} \lesssim \Big(\frac{\ln n}{n}\Big)^{\frac{\tilde p - p}{2}}\sum_{j=j'+1}^{j_1} 2^{j(\frac{1}{2} - \frac{1}{\tilde p})\tilde p + j(\tilde p - p)m}\,\|\beta_{j,k}\|_p^p \\ &\le \Big(\frac{\ln n}{n}\Big)^{\frac{\tilde p - p}{2}}\sum_{j=j'+1}^{j_1} 2^{-j(sp + \frac{p}{2} - \frac{\tilde p}{2} - (\tilde p - p)m)} = \Big(\frac{\ln n}{n}\Big)^{\frac{\tilde p - p}{2}}\sum_{j=j'+1}^{j_1} 2^{-j\epsilon}. \end{aligned}$$
For the case $\epsilon > 0$, one easily obtains $\delta = \frac{s}{2s + 2m + 1}$ and
$$Q_{32} \lesssim \Big(\frac{\ln n}{n}\Big)^{\frac{\tilde p - p}{2}}\, 2^{-j'\epsilon} \sim \Big(\frac{\ln n}{n}\Big)^{\frac{\tilde p s}{2s + 2m + 1}} = \Big(\frac{\ln n}{n}\Big)^{\tilde p\delta}.$$
When $\epsilon \le 0$, $\delta = \frac{s - 1/p + 1/\tilde p}{2(s - 1/p) + 2m + 1}$. Moreover, by the definition of $2^{j''}$, one rewrites
$$Q_{32} = \Big(\sum_{j=j'+1}^{j''} + \sum_{j=j''+1}^{j_1}\Big) 2^{j(\frac{1}{2} - \frac{1}{\tilde p})\tilde p}\sum_{k \in \Lambda_j}|\beta_{j,k}|^{\tilde p}\, I_{\{|\beta_{j,k}| \le 2\kappa t_n\}} := Q_{321} + Q_{322}.$$
Note that
$$Q_{321} \lesssim \Big(\frac{\ln n}{n}\Big)^{\frac{\tilde p - p}{2}}\sum_{j=j'+1}^{j''} 2^{-j\epsilon} \lesssim \Big(\frac{\ln n}{n}\Big)^{\frac{\tilde p - p}{2}}\, 2^{-j''\epsilon} \sim \Big(\frac{\ln n}{n}\Big)^{\tilde p\delta}.$$
On the other hand, similar to the arguments for (4.24), one has
$$Q_{322} \le \sum_{j=j''+1}^{j_1} 2^{j(\frac{1}{2} - \frac{1}{\tilde p})\tilde p}\sum_{k \in \Lambda_j}|\beta_{j,k}|^{\tilde p} = \sum_{j=j''+1}^{j_1} 2^{j(\frac{1}{2} - \frac{1}{\tilde p})\tilde p}\,\|\beta_{j,k}\|_{\tilde p}^{\tilde p} \lesssim \Big(\frac{\ln n}{n}\Big)^{\tilde p\delta}.$$
Therefore, in all of the above cases,
$$Q_3 \lesssim \Big(\frac{\ln n}{n}\Big)^{\tilde p\delta}. \tag{4.26}$$
Finally, combining the results (4.14), (4.15), (4.25) and (4.26), one gets
$$Q \lesssim (j_1 - j_* + 1)^{\tilde p - 1}\Big(\frac{\ln n}{n}\Big)^{\tilde p\delta} \lesssim (\ln n)^{\tilde p - 1}\Big(\frac{\ln n}{n}\Big)^{\tilde p\delta}.$$
This with (4.11)–(4.13) shows
$$E\big[\|\hat r_n^{non}(x) - r^{(m)}(x)\|_{\tilde p}^{\tilde p}\big] \lesssim (\ln n)^{\tilde p - 1}\Big(\frac{\ln n}{n}\Big)^{\tilde p\delta}.$$
This paper considers wavelet estimation of the derivative $r^{(m)}(x)$ of the variance function $r(x)$ in a heteroscedastic model. The upper bounds over $L^{\tilde p}$ ($1 \le \tilde p < \infty$) risk of the wavelet estimators are derived under some mild assumptions. The results show that the linear wavelet estimator attains the optimal convergence rate in the case $p > \tilde p \ge 1$. When $p \le \tilde p$, the nonlinear wavelet estimator has a better convergence rate than the linear estimator. Moreover, the nonlinear wavelet estimator is adaptive. Finally, some numerical experiments are presented to verify the good performance of the wavelet estimators.
We would like to thank the reviewers for their valuable comments and suggestions, which helped us to improve the quality of the manuscript. This paper is supported by the Guangxi Natural Science Foundation (No. 2022JJA110008), National Natural Science Foundation of China (No. 12001133), Center for Applied Mathematics of Guangxi (GUET), and Guangxi Colleges and Universities Key Laboratory of Data Analysis and Computation.
All authors declare that they have no conflicts of interest.