
Survival analysis is a branch of statistics for analyzing time-to-event data. When examining survival data, one frequently encounters the problem of competing risks, in which samples are subject to multiple kinds of failure. The Cox proportional hazards model, introduced by Cox [1], is popular in survival analysis for describing the relationship between the distributions of survival times and covariates and is commonly employed to analyze cause-specific survival data. The traditional approach is to separately fit a Cox proportional hazards model to the data for each failure type, treating observations with other kinds of failure as censored. However, this conventional method has drawbacks: the estimates are hard to interpret and the confidence bands of the estimated hazards are wide, because the method does not cover all failure types [2,3].
An alternative approach is to fit competing risks data with a mixture model that incorporates the distinct types of failure to partition the population into groups; it assumes an individual will fail from each risk with probabilities given by the proportions of the respective groups. Moreover, the mixture approach is helpful for estimating the effects of covariates in each group through parametric proportional hazard regressions such as Cox's model. McLachlan and Peel [4] noted that a mixture model allows for both dependent and independent competing risks and can improve a model's fit to the data relative to the traditional approach, in which the causes of failure are assumed to be independent. Mixture models are popular in competing risks analysis because their resultant estimates, although complex, are easy to interpret [2].
Semi-parametric mixture models are a generalization of parametric mixture models and have become a prominent approach for modelling data with competing risks. Semiparametric approaches to mixture models are preferable for their ability to adjust for the associated variables and allow for assessing the effects of these variables on both the probabilities of eventual causes of failure through a logistic model and the relevant conditional hazard functions by applying the Cox proportional hazards model (cf. [2]). Below, we review the existing semiparametric methods of mixture models for competing risks data.
Ng and McLachlan [5] proposed an ECM-based semi-parametric mixture method that analyzes competing risks data without specifying the baseline hazard function. They noted that when the component-baseline hazard is not monotonic increasing, their semi-parametric approach consistently produces less biased estimates than fully parametric approaches. Moreover, when the component-baseline hazard is monotonic increasing, the two approaches show comparable efficiency in parameter estimation under mild and moderate censoring. Chang et al. [6] studied non-parametric maximum-likelihood estimators through a semiparametric mixture model for competing risks data. Their model is feasible for right censored data and can provide estimates of quantities such as a covariate-specific fatality rate or a covariate-specific expected time length. Moreover, Lu and Peng [7] set up a semiparametric mixture regression model to analyze competing risks data under the ordinary mechanism of conditionally independent censoring. Choi and Huang [8] offered a maximum likelihood scheme for semiparametric mixture models to make efficient and reliable estimates of the cumulative hazard function. One advantage of their approach is that the joint estimation of the model parameters connects all considered competing risks under the constraint that the probabilities of failing from the respective causes sum to 1 given any covariates. Other studies of competing risks data are likewise based on semiparametric mixture models, e.g., [5,6,7,8].
Although the mixture hazard model is preferable to direct approaches, two important but challenging issues frequently encountered in the applications are the determination of the number of risk types and the identification of the failure type of each individual.
It is understandable that the results of a mixture model analysis depend heavily on the number of components. It is also conceivably hard to cover all types of competing risks in a mixture model. Validity indices are a vital technique in model selection. A cluster validity index is a criterion function for determining the optimal number of clusters. Some cluster validity indices presented in [9,10,11] are designed to find an optimal cluster number for fuzzy clustering algorithms; some depend only on the memberships, while others also take into account the distances between the data points and the cluster centers. Wu et al. [12] proposed median-type validity indices, which are robust to noise and outliers. Zhou et al. [13] introduced a weighted-summation type of validity index for fuzzy clustering, but these indices are infeasible for mixture regression models. Conversely, Henson et al. [14] evaluated the ability of several statistical criteria, such as the Akaike information criterion (AIC) and Bayesian information criterion (BIC), to produce a proper number of components for latent variable mixture modeling. However, AIC and BIC may over- and under-estimate the number of components, respectively [15].
As to the identification of failure types, many studies on competing risks, e.g., [5,6,7,8], assumed that the failure type of an individual is known if the subject's failure time is observed; but if an individual is censored and only the censored time is known, then the failure type from which the subject fails is unknown. In fact, even if one observes the failure time, the true cause of failure might not be clear and needs further investigation. Thus, deciding the number of competing risks and recognizing the failure type of each individual are critical in competing risks analysis, but scant work has been done on them.
Besides the above problems, another critical issue, particular to mixture Cox hazard models, is the estimation of the baseline hazard function. The Cox proportional hazards model consists of two parts: the baseline hazard function and the proportional regression model. Bender et al. [16] assumed that the baseline hazard function follows a specific lifetime distribution, but this assumption is obviously restrictive. A single lifetime distribution may not adequately explain all data; for example, the failure rate may be neither monotonic increasing nor decreasing. Alternatively, some scholars adopted nonparametric approaches to estimate the baseline hazard function, which are more flexible. Ng and McLachlan [5] assumed the baseline hazard function to be piecewise constant by treating each observed survival time as a cut-off point, but the piecewise constant assumption has the disadvantage that the estimated curve is not smooth, while smoothing is required in several applications [17]. In fact, our simulation results also show that the estimates derived from a piecewise constant hazard function are not sufficient in some cases (e.g., model 4 in Figure 4). Understandably, an inadequate estimate of the baseline function affects the selection of the number of model components and hence leads to insufficient estimates of the model parameters.
In order to solve the above-mentioned problems with Cox mixture hazard modelling for competing risks data, we propose four validity indices and a kernel estimator for the baseline hazard function in this paper. Validity indices are a vital technique in model selection, but they have been little utilized for deciding the number of components of a mixture regression model. By using posterior probabilities and residual functions, we propose four validity indices that are applicable to regression models. Under the EM-based mixture model, the posterior probabilities play an important role in classifying data, taking the role that data memberships play in fuzzy clustering. Unlike the traditional regression model, the survival model does not meet the assumption that the survival time variation is constant at each covariate. Therefore, we combine functions of the posterior probabilities and the sum of standardized residuals to construct the new validity indices, and we verify the effectiveness of the proposed indices through extensive simulations. Moreover, we extend the kernel method of Guilloux et al. [18] to estimate the baseline hazard function smoothly and hence more accurately.
The remainder of this paper is organized as follows. Section 2 introduces the mixture Cox proportional hazards regression model, develops an EM-based algorithm to estimate the model parameters, and also discusses kernel estimations for the baseline hazard function. Section 3 constructs four validity indices for selecting the number of model components in a mixture Cox proportional hazards regression model. Section 4 carries out several simulations and assesses the effectiveness of our validity indices. Section 5 analyzes a practical data set of prostate cancer patients treated with different dosages of the drug diethylstilbestrol. Finally, Section 6 states conclusions and a discussion.
For mixture model analysis, suppose each member of a population can be categorized into g mutually exclusive clusters according to its failure type. Let D = \{({t_j}, \mathit{\boldsymbol{X}}_j^T, {\delta _j}):j = 1, \cdots, n\} be a sample drawn from this population, where T denotes the transpose of a vector, {t_j} is the failure or right-censoring time, {\mathit{\boldsymbol{X}}_j} = {({x_{j1}}, {x_{j2}}, ..., {x_{jd}})^T} is a d-dimensional vector of covariates, and:
{\delta _j} = \begin{cases} 1, & {\text{if the }}j{\text{-th individual is uncensored}}, \\ 0, & {\text{if the }}j{\text{-th individual is censored}}. \end{cases} |
The mixture probability density function (pdf) of t is defined by:
f(t) = \sum\limits_{i = 1}^g {{p_i} \cdot {f_i}(t)}, \quad {\text{subject to }}\sum\limits_{i = 1}^g {{p_i}} = 1, | (1) |
where pi is the mixing probability of failure due to the ith type of risk and g is the number of model components.
In the ith component, the hazard function {h_i}(t|{\mathit{\boldsymbol{X}}_j}, {\mathit{\boldsymbol{\beta }}_i}) given covariate {\mathit{\boldsymbol{X}}_j} follows a Cox proportional hazards model defined by
{h_i}(t|{\mathit{\boldsymbol{X}}_j}, {\mathit{\boldsymbol{\beta }}_i}) = {h_{0i}}(t)\exp (\mathit{\boldsymbol{X}}_j^T{\mathit{\boldsymbol{\beta }}_i}), | (2) |
where {\mathit{\boldsymbol{\beta }}_i} = {({\beta _{i1}}, {\beta _{i2}}, ..., {\beta _{id}})^T} is the vector of regression coefficients, and {h_{0i}}(t) is the baseline hazard function of the ith component. We define the ith component-survival function and pdf by:
{S_i}(t|{\mathit{\boldsymbol{X}}_j}, {\mathit{\boldsymbol{\beta }}_i}) = \exp \left[ { - {H_{0i}}(t)\exp (\mathit{\boldsymbol{X}}_j^T{\mathit{\boldsymbol{\beta }}_i})} \right] |
and
{f_i}(t|{\mathit{\boldsymbol{X}}_j}, {\mathit{\boldsymbol{\beta }}_i}) = {h_{0i}}(t)\exp \left[ {\mathit{\boldsymbol{X}}_j^T{\mathit{\boldsymbol{\beta }}_i} - {H_{0i}}(t)\exp (\mathit{\boldsymbol{X}}_j^T{\mathit{\boldsymbol{\beta }}_i})} \right], |
where {H_{0i}}(t) = \int_0^t {{h_{0i}}(s)\, ds} is the ith component-cumulative baseline hazard function.
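For illustration, the component survival function and density above can be evaluated directly once {h_{0i}} and {H_{0i}} are supplied; a minimal sketch (the function names and the Weibull example values are ours, not from the paper):

```python
import numpy as np

def component_survival(t, H0, X, beta):
    """S_i(t|X) = exp(-H0(t) * exp(X^T beta)) for one component.
    H0 is a callable cumulative baseline hazard."""
    return np.exp(-H0(t) * np.exp(X @ beta))

def component_density(t, h0, H0, X, beta):
    """f_i(t|X) = h0(t) * exp(X^T beta) * S_i(t|X)."""
    eta = np.exp(X @ beta)
    return h0(t) * eta * np.exp(-H0(t) * eta)

# Example: Weibull baseline h0(t) = lam*rho*t^(rho-1), H0(t) = lam*t^rho
lam, rho = 0.5, 2.0
h0 = lambda t: lam * rho * t ** (rho - 1)
H0 = lambda t: lam * t ** rho
X = np.array([0.3, -0.2])
beta = np.array([0.8, 0.5])

S = component_survival(1.0, H0, X, beta)
f = component_density(1.0, h0, H0, X, beta)
```

Note that f factors as the hazard times the survival function, which the test below checks.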
The unknown parameters are the mixing probabilities \mathit{\boldsymbol{p}} = {({p_1}, {p_2}, ..., {p_{g - 1}})^T} and regression coefficients \mathit{\boldsymbol{\beta }} = {(\mathit{\boldsymbol{\beta }}_1^T, \mathit{\boldsymbol{\beta }}_2^T, ..., \mathit{\boldsymbol{\beta }}_g^T)^T} , where {\mathit{\boldsymbol{\beta }}_i} = {({\beta _{i1}}, {\beta _{i2}}, ..., {\beta _{id}})^T} . Based on (1) and Zhou [19], the log-likelihood function under the mixture hazards model with right censored data is given by
l(\mathit{\boldsymbol{p}}, \mathit{\boldsymbol{\beta }}) = \sum\limits_{j = 1}^n {\left\{ {{\delta _j}\log f({t_j}|{\mathit{\boldsymbol{X}}_j}, \mathit{\boldsymbol{\beta }}) + (1 - {\delta _j})\log S({t_j}|{\mathit{\boldsymbol{X}}_j}, \mathit{\boldsymbol{\beta }})} \right\}}, |
where f({t_j}|{\mathit{\boldsymbol{X}}_j}, \mathit{\boldsymbol{\beta }}) = \sum\limits_{i = 1}^g {{p_i} \cdot {f_i}({t_j}|{\mathit{\boldsymbol{X}}_j}, {\mathit{\boldsymbol{\beta }}_i})} and S({t_j}|{\mathit{\boldsymbol{X}}_j}, \mathit{\boldsymbol{\beta }}) = \sum\limits_{i = 1}^g {{p_i} \cdot {S_i}({t_j}|{\mathit{\boldsymbol{X}}_j}, {\mathit{\boldsymbol{\beta }}_i})} .
Assume that the true causes of failure for an individual are unobserved, and hence the data are incomplete. We introduce the latent variable zij as:
{z_{ij}} = \begin{cases} 1, & {\text{if the }}j{\text{-th individual fails due to the }}i{\text{-th type of risk}}; \\ 0, & {\text{otherwise}}. \end{cases} |
The complete-data log-likelihood function is given by:
{l_c}(\mathit{\boldsymbol{p}}, \mathit{\boldsymbol{\beta }}) = \sum\limits_{j = 1}^n {\sum\limits_{i = 1}^g {{z_{ij}}\left\{ {{\delta _j}\log [{p_i} \cdot {f_i}({t_j}|{\mathit{\boldsymbol{X}}_j}, {\mathit{\boldsymbol{\beta }}_i})] + (1 - {\delta _j})\log [{p_i} \cdot {S_i}({t_j}|{\mathit{\boldsymbol{X}}_j}, {\mathit{\boldsymbol{\beta }}_i})]} \right\}} }. | (3) |
Subsequently, the parameters are estimated through the expectation and maximization (EM) algorithm.
E-step: On the (k+1)th iteration of E-step, we calculate the conditional expectation of the complete-data log-likelihood (3) given the current estimates of the parameters, i.e.:
\begin{aligned} E[{l_c}(\mathit{\boldsymbol{p}}, \mathit{\boldsymbol{\beta }})|{\mathit{\boldsymbol{p}}^{(k)}}, {\mathit{\boldsymbol{\beta }}^{(k)}}] &= \sum\limits_{j = 1}^n {\sum\limits_{i = 1}^g {z_{ij}^{(k)}\left\{ {{\delta _j}\log [{p_i} \cdot {f_i}({t_j}|{\mathit{\boldsymbol{X}}_j}, {\mathit{\boldsymbol{\beta }}_i})] + (1 - {\delta _j})\log [{p_i} \cdot {S_i}({t_j}|{\mathit{\boldsymbol{X}}_j}, {\mathit{\boldsymbol{\beta }}_i})]} \right\}} } \\ &= \sum\limits_{j = 1}^n {\sum\limits_{i = 1}^g {z_{ij}^{(k)}\log {p_i}} } + \sum\limits_{j = 1}^n {z_{1j}^{(k)}[{\delta _j}\log {f_1}({t_j}|{\mathit{\boldsymbol{X}}_j}, {\mathit{\boldsymbol{\beta }}_1}) + (1 - {\delta _j})\log {S_1}({t_j}|{\mathit{\boldsymbol{X}}_j}, {\mathit{\boldsymbol{\beta }}_1})]} \\ &\quad + \cdots + \sum\limits_{j = 1}^n {z_{gj}^{(k)}[{\delta _j}\log {f_g}({t_j}|{\mathit{\boldsymbol{X}}_j}, {\mathit{\boldsymbol{\beta }}_g}) + (1 - {\delta _j})\log {S_g}({t_j}|{\mathit{\boldsymbol{X}}_j}, {\mathit{\boldsymbol{\beta }}_g})]} \\ &= {Q_0} + {Q_1} + \cdots + {Q_g}. \end{aligned} | (4) |
Here, {\mathit{\boldsymbol{p}}^{(k)}} and {\mathit{\boldsymbol{\beta }}^{(k)}} are the estimates of \mathit{\boldsymbol{p}} and \mathit{\boldsymbol{\beta }} , respectively, in the kth iteration. By Bayes' theorem, we have:
z_{ij}^{(k)} = E({z_{ij}}|{\mathit{\boldsymbol{p}}^{(k)}}, {\mathit{\boldsymbol{\beta }}^{(k)}}) = \frac{{p_i^{(k)}{f_i}{{({t_j}|{\mathit{\boldsymbol{X}}_j}, \mathit{\boldsymbol{\beta }}_i^{(k)})}^{{\delta _j}}}{S_i}{{({t_j}|{\mathit{\boldsymbol{X}}_j}, \mathit{\boldsymbol{\beta }}_i^{(k)})}^{1 - {\delta _j}}}}}{{\sum\limits_{l = 1}^g {p_l^{(k)}{f_l}{{({t_j}|{\mathit{\boldsymbol{X}}_j}, \mathit{\boldsymbol{\beta }}_l^{(k)})}^{{\delta _j}}}{S_l}{{({t_j}|{\mathit{\boldsymbol{X}}_j}, \mathit{\boldsymbol{\beta }}_l^{(k)})}^{1 - {\delta _j}}}} }}, | (5) |
which is the posterior probability that the jth individual with survival time tj fails due to the ith type of risk.
M-step: The (k+1)th iteration of the M-step provides the updated estimates {\mathit{\boldsymbol{p}}^{(k + 1)}} and {\mathit{\boldsymbol{\beta }}^{(k + 1)}} that maximize (4) with respect to \mathit{\boldsymbol{p}} and \mathit{\boldsymbol{\beta }} .
Under the constraint \sum\limits_{i = 1}^g {{p_i}} = 1 , maximizing {Q_0} = \sum\limits_{j = 1}^n {\sum\limits_{i = 1}^g {z_{ij}^{(k)}\log {p_i}} } from (4) yields the estimate of the mixing probability:
p_i^{(k + 1)} = \frac{{\sum\limits_{j = 1}^n {z_{ij}^{(k)}} }}{n}. | (6) |
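As a concrete illustration, the E-step posterior (5) and the mixing-probability update (6) can be sketched in a few lines; the helper names and the toy numbers are ours:

```python
import numpy as np

def e_step(p, dens, surv, delta):
    """Posterior z_ij as in Eq.(5): the numerator is
    p_i * f_i^delta_j * S_i^(1-delta_j), normalized over components.
    dens, surv: (g, n) arrays of f_i(t_j|X_j) and S_i(t_j|X_j)."""
    num = p[:, None] * dens ** delta * surv ** (1 - delta)
    return num / num.sum(axis=0, keepdims=True)

def update_p(z):
    """Mixing probabilities as in Eq.(6): p_i = (sum_j z_ij) / n."""
    return z.mean(axis=1)

# Toy example with g=2 components and n=4 subjects (illustrative numbers)
p = np.array([0.5, 0.5])
dens = np.array([[0.30, 0.10, 0.25, 0.05],
                 [0.05, 0.20, 0.10, 0.30]])
surv = np.array([[0.60, 0.40, 0.50, 0.20],
                 [0.20, 0.50, 0.30, 0.70]])
delta = np.array([1, 1, 0, 1])          # censoring indicators

z = e_step(p, dens, surv, delta)        # posteriors, one column per subject
p_new = update_p(z)
```

For censored subjects (delta = 0) only the survival factor enters the posterior, exactly as in (5).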
The term {Q_i} in (4) for i = 1, ..., g can be written as:
{Q_i} = \sum\limits_{j = 1}^n {z_{ij}^{(k)}\left\{ {{\delta _j}[\log {h_{0i}}({t_j}) + \mathit{\boldsymbol{X}}_j^T{\mathit{\boldsymbol{\beta }}_i}] - \exp (\mathit{\boldsymbol{X}}_j^T{\mathit{\boldsymbol{\beta }}_i}){H_{0i}}({t_j})} \right\}}. | (7) |
Define the score vector U({\mathit{\boldsymbol{\beta }}_i}) for i = 1, ..., g as the first derivative of (7) with respect to the vector {\mathit{\boldsymbol{\beta }}_i} with {H_{0i}}(t) fixed at H_{0i}^{(k + 1)}(t) ; the estimate \mathit{\boldsymbol{\beta }}_i^{(k + 1)} then satisfies the equation:
U({\mathit{\boldsymbol{\beta }}_i}) = {\left. {\frac{{\partial {Q_i}}}{{\partial {\mathit{\boldsymbol{\beta }}_i}}}} \right|_{{H_{0i}}({t_j}) = H_{0i}^{(k + 1)}({t_j})}} = \sum\limits_{j = 1}^n {z_{ij}^{(k)}\left[ {{\delta _j} - \exp (\mathit{\boldsymbol{X}}_j^T{\mathit{\boldsymbol{\beta }}_i})H_{0i}^{(k + 1)}({t_j})} \right]{\mathit{\boldsymbol{X}}_j}} = \mathit{\boldsymbol{0}}. | (8) |
To estimate the baseline hazard function under the mixture hazards model, we propose a kernel estimator. Define {N_j}(t) = I({t_j} \leqslant t \wedge {\delta _j} = 1) as an event counting process and {Y_j}(t) = I({t_j} \geqslant t) as the at-risk process. The updated kernel estimator of the ith component-baseline hazard function {h_{0i}}(t) on the (k+1)th iteration is defined by:
h_{0i}^{(k + 1)}(t|\mathit{\boldsymbol{X}}, \mathit{\boldsymbol{Z}}_i^{(k)}, \mathit{\boldsymbol{\beta }}_i^{(k)}, {b^{(k)}}) = \frac{1}{{{b^{(k)}}}}\int_0^\tau {K\left( {\frac{{t - u}}{{{b^{(k)}}}}} \right)dH_{0i}^{(k + 1)}(u|\mathit{\boldsymbol{X}}, \mathit{\boldsymbol{Z}}_i^{(k)}, \mathit{\boldsymbol{\beta }}_i^{(k)})}, \quad \tau \geqslant 0, | (9) |
where K:\mathbb{R} \to \mathbb{R} is a kernel function, and {b^{(k)}} is a positive parameter called the bandwidth. Several types of kernel functions are commonly used; they appear in Table 1 and Figure 1. We try these kernel functions in the simulated examples and find no significant differences among them. In this paper, we choose the biweight kernel to estimate the baseline hazard function.
Kernel function | K(u)
Gaussian | K(u) = \frac{1}{{\sqrt {2\pi } }}{e^{ - \frac{1}{2}{u^2}}}, \; - \infty < u < \infty
Epanechnikov | K(u) = \frac{3}{4}(1 - {u^2}), \; |u| \leqslant 1
Biweight | K(u) = \frac{{15}}{{16}}{(1 - {u^2})^2}, \; |u| \leqslant 1
Triweight | K(u) = \frac{{35}}{{32}}{(1 - {u^2})^3}, \; |u| \leqslant 1
Derived by smoothing the increments of the Breslow estimator, the kernel estimator (9) can be written as:
h_{0i}^{(k + 1)}(t|\mathit{\boldsymbol{X}}, \mathit{\boldsymbol{Z}}_i^{(k)}, \mathit{\boldsymbol{\beta }}_i^{(k)}, {b^{(k)}}) = \frac{1}{{{b^{(k)}}}}\sum\limits_{j = 1}^n {\int_0^\tau {K\left( {\frac{{t - u}}{{{b^{(k)}}}}} \right)\frac{{z_{ij}^{(k)}I(\bar Y(u) > 0)}}{{{S_{ni}}(u|\mathit{\boldsymbol{X}}, \mathit{\boldsymbol{Z}}_i^{(k)}, \mathit{\boldsymbol{\beta }}_i^{(k)})}}\, d{N_j}(u)} }, | (10) |
where \bar Y = \frac{1}{n}\sum\limits_{j = 1}^n {{Y_j}} and {S_{ni}}(u|\mathit{\boldsymbol{X}}, {\mathit{\boldsymbol{Z}}_i}, {\mathit{\boldsymbol{\beta }}_i}) = \sum\limits_{j = 1}^n {{z_{ij}}\exp (\mathit{\boldsymbol{X}}_j^T{\mathit{\boldsymbol{\beta }}_i}){Y_j}(u)} .
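A sketch of the smoothing in (10) follows: at each event time, the weighted Breslow increment is spread over a neighborhood by the biweight kernel. The toy example uses one degenerate component with fully observed data; all function names are ours:

```python
import numpy as np

def biweight(u):
    """Biweight kernel K(u) = 15/16 * (1 - u^2)^2 on |u| <= 1, else 0."""
    u = np.asarray(u, dtype=float)
    return np.where(np.abs(u) <= 1, 15 / 16 * (1 - u ** 2) ** 2, 0.0)

def smoothed_baseline_hazard(t_grid, times, delta, z, eta, b):
    """Kernel-smoothed baseline hazard for one component, in the spirit of
    Eq.(10): the Breslow increment z_j / S_n(t_j) at each event time is
    spread by the kernel with bandwidth b. times: observed times;
    delta: event indicators; z: posterior weights; eta: exp(X_j^T beta)."""
    order = np.argsort(times)
    times, delta, z, eta = times[order], delta[order], z[order], eta[order]
    h = np.zeros_like(t_grid, dtype=float)
    for j in range(len(times)):
        if delta[j] == 1:
            at_risk = times >= times[j]
            Sn = np.sum(z[at_risk] * eta[at_risk])   # weighted risk set
            h += biweight((t_grid - times[j]) / b) / b * z[j] / Sn
    return h

rng = np.random.default_rng(0)
times = rng.exponential(1.0, size=50)
delta = np.ones(50, dtype=int)
z = np.ones(50)        # single-component (degenerate) posterior weights
eta = np.ones(50)      # no covariate effect
t_grid = np.linspace(0.1, 2.0, 20)
h_hat = smoothed_baseline_hazard(t_grid, times, delta, z, eta, 0.5)
```

With constant weights and unit eta this reduces to kernel smoothing of the ordinary Breslow (Nelson-Aalen) increments.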
Horova et al. [20] and Patil [21] introduced the cross-validation method to select the bandwidth of a kernel estimator. Under our model, we define the cross-validation function for bandwidth b, written as CV(b), as:
CV(b) = \sum\limits_{i = 1}^g {\sum\limits_{j = 1}^n {z_{ij}^{(k)} \cdot {{\left[ {h_{0i}^{(k + 1)}({t_j}|\mathit{\boldsymbol{X}}, \mathit{\boldsymbol{Z}}_i^{(k)}, \mathit{\boldsymbol{\beta }}_i^{(k)}, {b^{(k)}})} \right]}^2}} } - 2\sum\limits_{i = 1}^g {\mathop {\sum \sum }\limits_{l \ne j} {\frac{1}{{{b^{(k)}}}}K\left( {\frac{{{t_l} - {t_j}}}{{{b^{(k)}}}}} \right)\frac{{z_{il}^{(k)}{\delta _l}}}{{\bar Y({t_l})}} \cdot \frac{{z_{ij}^{(k)}{\delta _j}}}{{\bar Y({t_j})}}} }. |
The selection of bandwidth on the (k+1)th iteration is given by:
{b^{(k + 1)}} = \mathop {\arg \min }\limits_{b \in {B_n}} CV(b), | (11) |
where {B_n} denotes the set of acceptable bandwidths.
The algorithm is as follows. We fix n and g and set initial values for the mixing probabilities {\mathit{\boldsymbol{p}}^{(0)}} (usually {{\rm{1}} \mathord{\left/ {\vphantom {{\rm{1}} g}} \right. } g} for each component), the regression coefficients {\mathit{\boldsymbol{\beta }} ^{(0)}} , the baseline hazard rates {\mathit{\boldsymbol{h}}_0}^{(0)} , the bandwidth {b^{{\rm{(0)}}}} = 0.5 , and a termination value \varepsilon \, {\rm{ > }}\, {\rm{0}} .
Set the initial counter \; l = 1 .
Step 1: Compute {\mathit{\boldsymbol{Z}}^{(l - {\rm{1}})}} with {\mathit{\boldsymbol{p}}^{(l - 1)}} , {\mathit{\boldsymbol{h}}_0}^{(l - 1)} and {\mathit{\boldsymbol{\beta }} ^{(l - 1)}} by (5);
Step 2: Update {\mathit{\boldsymbol{p}}^{{\rm{(}}l{\rm{)}}}} with {\mathit{\boldsymbol{Z}}^{(l - 1)}} by (6);
Step 3: Update \mathit{\boldsymbol{h}}_0^{(l )} with {\mathit{\boldsymbol{Z}}^{{\rm{(}}l - 1{\rm{)}}}} , {\mathit{\boldsymbol{\beta }} ^{{\rm{(}}l - 1{\rm{)}}}} and {b^{{\rm{(}}l - 1{\rm{)}}}} by (10);
Step 4: Update bandwidth {b^{{\rm{(}}l{\rm{)}}}} with {\mathit{\boldsymbol{Z}}^{(l - 1)}} , \mathit{\boldsymbol{h}}_0^{(l )} and {\mathit{\boldsymbol{\beta }} ^{(l - 1)}} by (11);
Step 5: Update {\mathit{\boldsymbol{\beta }} ^{{\rm{(}}l{\rm{)}}}} with {\mathit{\boldsymbol{Z}}^{(l - 1)}} , \mathit{\boldsymbol{h}}_0^{(l )} and {\mathit{\boldsymbol{\beta }} ^{(l - 1)}} by (8);
Step 6: IF \; {\left\| {\; {\mathit{\boldsymbol{p}}^{(l)}} - \; {\mathit{\boldsymbol{p}}^{(l - 1)}}} \right\|_{\rm{2}}} + {\left\| {\; \mathit{\boldsymbol{h}}_0^{(l )} - \; \mathit{\boldsymbol{h}}_0^{(l - 1)}} \right\|_{\rm{2}}} + {\left\| {\; {\mathit{\boldsymbol{\beta }} ^{(l)}} - \; {\mathit{\boldsymbol{\beta }} ^{(l - 1)}}} \right\|_{\rm{2}}} < \varepsilon , THEN stop;
ELSE let \; l = l{\rm{ + 1}}\; and GOTO Step 1.
Note that the superscript (.) represents the number of iterations, \mathit{\boldsymbol{h}}_0^{(0)} = {({\mathit{\boldsymbol{h}}_{{\rm{01}}}}^{(0)}, {\mathit{\boldsymbol{h}}_{{\rm{0}}2}}^{(0)}, ..., {\mathit{\boldsymbol{h}}_{{\rm{0}}g}}^{(0)})^T} is a g \times n matrix, where \mathit{\boldsymbol{h}}_{{\rm{0}}i}^{(0)} = {(h_{{\rm{0}}i}^{(0)}{\rm{(}}{t_1}{\rm{)}}, h_{{\rm{0}}i}^{(0)}{\rm{(}}{t_2}{\rm{)}}, ..., h_{{\rm{0}}i}^{(0)}{\rm{(}}{t_n}{\rm{)}})^T} , and each row is initialized by a constant vector.
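The iteration structure of Steps 1 to 6, including the Step 6 stopping rule, can be expressed as a generic skeleton. This is not the authors' code: the E-step and M-step arguments here are placeholders, and the toy updates below simply halve the distance to a fixed target so that the loop demonstrably converges:

```python
import numpy as np

def run_em(init, e_step, m_steps, eps=1e-6, max_iter=500):
    """Generic skeleton of the algorithm above. `init` is a dict of
    parameter arrays; `e_step` maps params -> posteriors Z (Step 1);
    `m_steps` maps (params, Z) -> updated params (Steps 2-5). Stops when
    the summed L2 changes in p, h0 and beta fall below eps (Step 6)."""
    params = {k: np.asarray(v, dtype=float) for k, v in init.items()}
    it = 0
    for it in range(1, max_iter + 1):
        Z = e_step(params)                      # Step 1
        new = m_steps(params, Z)                # Steps 2-5
        change = sum(np.linalg.norm(new[k] - params[k])
                     for k in ('p', 'h0', 'beta'))
        params = new
        if change < eps:                        # Step 6
            break
    return params, it

# Toy contraction standing in for the real updates: each "M-step"
# halves the distance to a fixed target, so the loop must converge.
target = {'p': np.array([0.7, 0.3]), 'h0': np.ones(5), 'beta': np.array([0.3])}
toy_e = lambda params: None
toy_m = lambda params, Z: {k: (v + target[k]) / 2 for k, v in params.items()}
params, it = run_em({'p': [0.5, 0.5], 'h0': np.zeros(5), 'beta': [0.0]},
                    toy_e, toy_m)
```

The real algorithm plugs (5), (6), (10), (11) and (8) into the two callbacks in that order.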
In traditional regression analysis, we select the best model as the one that minimizes the sum of squared residuals. Unlike the traditional regression model, however, the survival model does not meet the assumption that the standard deviation of the survival time is constant at each covariate. From Figure 2(a), we see that a survival time with a higher expectation has a higher standard deviation. Therefore, to select the best model we need to adjust for the standard deviation to avoid being dominated by data with large standard deviations. Moreover, if the model fits the data well, then each observed survival time will be close to the expectation of the component model that has the largest posterior probability corresponding to one's risk type.
Figure 2(b) illustrates that observation A is closer to the mean line (red line) of the component model corresponding to risk type 1, say model 1, than to the mean line (blue line) of model 2. From (5), the posterior probability of observation A corresponding to the first type of risk (red line) will be much larger than that of the second type of risk (blue line). Hence, to build validity indices for mixture models, we use the posterior probabilities as weights and define the mixture sum of standard absolute residuals (MsSAE) and mixture sum of standard squared residuals (MsSSE) as follows:
MsSAE = \sum\limits_{i = 1}^g {\sum\limits_{j = 1}^n {\frac{{{{\hat z}_{ij}}\left| {{t_j} - {{\hat E}_i}({t_j})} \right|}}{{\sqrt {\hat Va{r_i}({t_j})} }}} } \; ; MsSSE = \sum\limits_{i = 1}^g {\sum\limits_{j = 1}^n {\frac{{{{\hat z}_{ij}}{{\left[ {{t_j} - {{\hat E}_i}({t_j})} \right]}^2}}}{{\hat Va{r_i}({t_j})}}} } , |
where {\hat E_i}({t_j}) = \int\limits_0^\infty {\exp {{\left[{- {{\hat H}_{0i}}(t)} \right]}^{\exp \left({{x_j}^T{{\hat \beta }_i}} \right)}}\; dt} and \hat Va{r_i}({t_j}) = {\rm{2}}\int\limits_0^\infty {t \cdot \exp {{\left[{- {{\hat H}_{0i}}(t)} \right]}^{\exp \left({{x_j}^T{{\hat \beta }_i}} \right)}}dt} - {\hat E_i}{({t_j})^{\rm{2}}} . The squared distance is also considered because it detects an abnormal model more easily.
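Because {\hat E_i}({t_j}) and \hat Va{r_i}({t_j}) are integrals of the component survival function, they can be approximated numerically. A sketch using the trapezoidal rule, checked against an exponential baseline whose moments are known in closed form (the function name is ours):

```python
import numpy as np

def moments_from_survival(H0, x_beta, t_max=200.0, n_pts=20001):
    """E[T] = int_0^inf S(t) dt and Var[T] = 2 int_0^inf t*S(t) dt - E^2,
    with S(t) = exp(-H0(t))^exp(x_beta); trapezoidal rule on [0, t_max]."""
    t = np.linspace(0.0, t_max, n_pts)
    S = np.exp(-H0(t)) ** np.exp(x_beta)
    dt = np.diff(t)
    E = float(np.sum((S[1:] + S[:-1]) / 2 * dt))
    tS = t * S
    m2 = 2 * float(np.sum((tS[1:] + tS[:-1]) / 2 * dt))
    return E, m2 - E ** 2

# Check: exponential baseline H0(t) = lam*t with x_beta = 0 gives
# E[T] = 1/lam and Var[T] = 1/lam^2.
lam = 2.0
E, Var = moments_from_survival(lambda t: lam * t, 0.0)
```

In practice the upper limit t_max must be chosen large enough that the survival function is numerically zero beyond it.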
From Figure 2(c) we can see that the expectation (green line) of the survival time according to the third type of risk (ER3) is close to that (blue line) corresponding to the second type of risk (ER2). In order to penalize an overfitting model, i.e., one with too many components, we take the distance between the expectations of each survival time according to any two types of risk as the penalty. Define the average absolute separation \overline {ASep} , the average squared separation \overline {SSep} , the minimum absolute separation \min ASep and the minimum squared separation \min SSep as:
\overline {ASep} = \frac{{\rm{2}}}{{g(g - 1)}}\sum\limits_{i = 1}^g {\sum\limits_{l \gt i}^g {\sum\limits_{j = 1}^n {\left| {{{\hat E}_i}({t_j}) - {{\hat E}_l}({t_j})} \right|} } } ; \overline {SSep} = \frac{{\rm{2}}}{{g(g - 1)}}\sum\limits_{i = 1}^g {\sum\limits_{l \gt i}^g {\sum\limits_{j = 1}^n {{{\left[ {{{\hat E}_i}({t_j}) - {{\hat E}_l}({t_j})} \right]}^2}} } }; |
\min ASep = \mathop {\min }\limits_{i \ne l} \sum\limits_{j = 1}^n {\left| {{{\hat E}_i}({t_j}{\rm{)}} - {{\hat E}_l}({t_j})} \right|} ; \min SSep = \mathop {\min }\limits_{i \ne l} \sum\limits_{j = 1}^n {{{\left[ {{{\hat E}_i}({t_j}{\rm{)}} - {{\hat E}_l}({t_j})} \right]}^2}} . |
A good model will possess small mixture standard residuals and large separation of expectations. Hence, based on the above-mentioned functions of residuals and separation of means, we propose four validity indices V1 ~ V4 for selecting the number of model components under the mixture hazards regression model.
(V1). Absolute standard residuals and average separation {V_{aRaS}}
{V_{aRaS}} = \frac{{MsSAE}}{{\overline {ASep} }} = \frac{{\sum\nolimits_{i = 1}^g {\sum\nolimits_{j = 1}^n {{{\hat z}_{ij}}{{\left| {{t_j} - {{\hat E}_i}({t_j})} \right|} \mathord{\left/ {\vphantom {{\left| {{t_j} - {{\hat E}_i}({t_j})} \right|} {\sqrt {\hat Va{r_i}({t_j})} }}} \right. } {\sqrt {\hat Va{r_i}({t_j})} }}} } }}{{\frac{{\rm{2}}}{{g(g - 1)}}\sum\limits_{i = 1}^g {\sum\limits_{l{\rm{ \gt }}i}^g {\sum\limits_{j = 1}^n {\left| {{{\hat E}_i}({t_j}) - {{\hat E}_l}({t_j})} \right|} } } }} |
We find an optimal number g of types of risk by solving {\min _{2 \leqslant g \leqslant n - 1}}{V_{aRaS}} .
(V2). Squared standard residuals and average separation {V_{sRaS}}
{V_{sRaS}} = \frac{{MsSSE}}{{\overline {SSep} }} = \frac{{\sum\nolimits_{i = 1}^g {\sum\nolimits_{j = 1}^n {{{{{\hat z}_{ij}}{{\left[ {{t_j} - {{\hat E}_i}({t_j})} \right]}^2}} \mathord{\left/ {\vphantom {{{{\hat z}_{ij}}{{\left[ {{t_j} - {{\hat E}_i}({t_j})} \right]}^2}} {\hat Va{r_i}({t_j})}}} \right. } {\hat Va{r_i}({t_j})}}} } }}{{\frac{{\rm{2}}}{{g(g - 1)}}\sum\limits_{i = 1}^g {\sum\limits_{l{\rm{ \gt }}i}^g {\sum\limits_{j = 1}^n {{{\left[ {{{\hat E}_i}({t_j}) - {{\hat E}_l}({t_j})} \right]}^2}} } } }} |
We find an optimal number g of types of risk by solving {\min _{2 \leqslant g \leqslant n - 1}}{V_{sRaS}} .
(V3). Absolute standard residuals and minimum separation {V_{aRmS}}
{V_{aRmS}} = \frac{{MsSAE}}{{\min ASep}} = \frac{{\sum\nolimits_{i = 1}^g {\sum\nolimits_{j = 1}^n {{{\hat z}_{ij}}{{\left| {{t_j} - {{\hat E}_i}({t_j})} \right|} \mathord{\left/ {\vphantom {{\left| {{t_j} - {{\hat E}_i}({t_j})} \right|} {\sqrt {\hat Va{r_i}({t_j})} }}} \right. } {\sqrt {\hat Va{r_i}({t_j})} }}} } }}{{\mathop {\min }\limits_{i \ne l} \sum\nolimits_{j = 1}^n {\left| {{{\hat E}_i}({t_j}) - {{\hat E}_l}({t_j})} \right|} }} |
We find an optimal number g of types of risk by solving {\min _{2 \leqslant g \leqslant n - 1}}{V_{aRmS}} .
(V4). Squared standard residuals and minimum separation {V_{sRmS}}
{V_{sRmS}} = \frac{{MsSSE}}{{\min SSep}} = \frac{{\sum\nolimits_{i = 1}^g {\sum\nolimits_{j = 1}^n {{{\hat z}_{ij}}{{{{\left[ {{t_j} - {{\hat E}_i}({t_j})} \right]}^2}} \mathord{\left/ {\vphantom {{{{\left[ {{t_j} - {{\hat E}_i}({t_j})} \right]}^2}} {\hat Va{r_i}({t_j})}}} \right. } {\hat Va{r_i}({t_j})}}} } }}{{\mathop {\min }\limits_{i \ne l} \sum\nolimits_{j = 1}^n {{{\left[ {{{\hat E}_i}({t_j}) - {{\hat E}_l}({t_j})} \right]}^2}} }} |
We find an optimal number g of types of risk by solving {\min _{2 \leqslant g \leqslant n - 1}}{V_{sRmS}} .
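The four indices can be computed in one pass from the fitted component means, variances and posterior probabilities; a sketch with toy inputs (all names are ours):

```python
import numpy as np

def validity_indices(t, E_hat, Var_hat, z):
    """V1-V4 as defined in the text. t: (n,) observed times;
    E_hat, Var_hat: (g, n) component-wise E_i(t_j) and Var_i(t_j);
    z: (g, n) posterior probabilities."""
    g = E_hat.shape[0]
    MsSAE = np.sum(z * np.abs(t - E_hat) / np.sqrt(Var_hat))
    MsSSE = np.sum(z * (t - E_hat) ** 2 / Var_hat)
    a_seps = [np.sum(np.abs(E_hat[i] - E_hat[l]))
              for i in range(g) for l in range(i + 1, g)]
    s_seps = [np.sum((E_hat[i] - E_hat[l]) ** 2)
              for i in range(g) for l in range(i + 1, g)]
    n_pairs = g * (g - 1) / 2     # averaging gives the 2/(g(g-1)) factor
    return {'V_aRaS': MsSAE / (np.sum(a_seps) / n_pairs),
            'V_sRaS': MsSSE / (np.sum(s_seps) / n_pairs),
            'V_aRmS': MsSAE / min(a_seps),
            'V_sRmS': MsSSE / min(s_seps)}

# Toy inputs with g=2 components and n=3 subjects
t = np.array([1.0, 2.0, 3.0])
E_hat = np.array([[1.1, 2.2, 2.9], [2.0, 3.0, 4.0]])
Var_hat = np.ones((2, 3))
z = np.array([[0.9, 0.8, 0.7], [0.1, 0.2, 0.3]])
V = validity_indices(t, E_hat, Var_hat, z)
```

Note that with g = 2 there is only one pair of components, so the average and minimum separations coincide and V1 equals V3 (likewise V2 and V4); the indices differ only for g > 2.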
For the simulated data we consider four different models M1~M4. Under the mixture Cox proportional hazards model (2), the ith component hazard function is:
{h_i}(t\left| {{\mathit{\boldsymbol{X}}_j}, {\mathit{\boldsymbol{\beta }} _i}} \right.) = {h_{0i}}(t)\exp ({x_{j1}}{\beta _{i, 1, {\rm{1}}}} + ... + x_{j1}^k{\beta _{i, 1, k}} + ... + {x_{jd}}{\beta _{i, d, 1}} + ... + x_{jd}^k{\beta _{i, d, k}}),
where d is the number of covariates, k is the degree of the models, {\mathit{\boldsymbol{\beta }} _i} = {({\beta _{i, 1, {\rm{1}}}}, ..., {\beta _{i, 1, k}}, ..., {\beta _{i, d, 1}}, ..., {\beta _{i, d, k}})^T} is the vector of regression coefficients, and {h_{0i}}(t) is the ith component-baseline hazard function.
Consider two common distributions for the baseline hazard functions, Weibull and Gompertz; the ith component Weibull and Gompertz baselines are defined by {h_{0i}}(t) = {\lambda _i}\, {\rho _i}\, {t^{{\rho _i} - 1}} and {h_{0i}}(t) = {\lambda _i}\exp ({\rho _i}t) , respectively, where {\lambda _i} and {\rho _i} are the scale and shape parameters. Let \mathit{\boldsymbol{\lambda }} = {{\rm{(}}{\lambda _{\rm{1}}}{\rm{, }}...{\rm{, }}{\lambda _g}{\rm{)}}^T} , \mathit{\boldsymbol{\rho }} = {{\rm{(}}{\rho _{\rm{1}}}{\rm{, }}...{\rm{, }}{\rho _g}{\rm{)}}^T} , and \mathit{\boldsymbol{\beta }} = {({\mathit{\boldsymbol{\beta }} _{\rm{1}}}^T, {\mathit{\boldsymbol{\beta }}_2}^T, ..., {\mathit{\boldsymbol{\beta }} _g}^T)^T} . The covariates X = {({x_1}, {x_2}, ..., {x_n})^T} in all cases are generated independently from a uniform distribution U(- 4, 4) . The information for the four models is shown in Table 2, and the scatter plots of a sample dataset are presented in Figure 3.
Model | n1 | g2 | d3 | k4 | \mathit{\boldsymbol{p}} = \left[{\begin{array}{*{20}{c}} {{p_{\rm{1}}}} \\ \vdots \\ {{p_g}} \end{array}} \right] | BH5 | {\bf{ \pmb{\mathit{ λ}} }} = \left[{\begin{array}{*{20}{c}} {{\lambda _{\rm{1}}}} \\ \vdots \\ {{\lambda _g}} \end{array}} \right] | {\bf{ \pmb{\mathit{ ρ}} }} = \left[{\begin{array}{*{20}{c}} {{\rho _{\rm{1}}}} \\ \vdots \\ {{\rho _g}} \end{array}} \right] | {\bf{ \pmb{\mathit{ β}} }} = \left[{\begin{array}{*{20}{c}} {{{\bf{ \pmb{\mathit{ β}} }}_{\rm{1}}}^T} \\ \vdots \\ {{{\bf{ \pmb{\mathit{ β}} }}_g}^T} \end{array}} \right] | {U_i} 6 |
M1 | 200 | 2 | 1 | 1 | \left[{\begin{array}{*{20}{c}} {{\rm{0}}{\rm{.7}}} \\ {{\rm{0}}{\rm{.3}}} \end{array}} \right] | Weibull | \left[{\begin{array}{*{20}{c}} {{\rm{0}}{\rm{.005}}} \\ {{\rm{1}}{\rm{.5}}} \end{array}} \right] | \left[{\begin{array}{*{20}{c}} {{\rm{3}}{\rm{.0}}} \\ {{\rm{2}}{\rm{.0}}} \end{array}} \right] | \left[{\begin{array}{*{20}{c}} {{\rm{0}}{\rm{.3}}} \\ {{\rm{0}}{\rm{.5}}} \end{array}} \right] | \begin{gathered} {U_1}(5, 9) \\ {U_2}(2, 6) \\ \end{gathered} |
M2 | 200 | 2 | 1 | 2 | \left[{\begin{array}{*{20}{c}} {{\rm{0}}{\rm{.5}}} \\ {{\rm{0}}{\rm{.5}}} \end{array}} \right] | Gompertz | \left[{\begin{array}{*{20}{c}} {{\rm{0}}{\rm{.2}}} \\ {{\rm{0}}{\rm{.7}}} \end{array}} \right] | \left[{\begin{array}{*{20}{c}} {{\rm{1}}{\rm{.5}}} \\ {{\rm{2}}{\rm{.0}}} \end{array}} \right] | \left[{\begin{array}{*{20}{c}} {{\rm{0}}{\rm{.8}}} & {{\rm{0}}{\rm{.1}}} \\ { - {\rm{0}}{\rm{.6}}} & {{\rm{0}}{\rm{.1}}} \end{array}} \right] | \begin{gathered} {U_1}({\rm{4}}, {\rm{9}}) \\ {U_2}({\rm{4}}, {\rm{9}}) \\ \end{gathered} |
M3 | 400 | 2 | 2 | 1 | \left[{\begin{array}{*{20}{c}} {{\rm{0}}{\rm{.5}}} \\ {{\rm{0}}{\rm{.5}}} \end{array}} \right] | Weibull | \left[{\begin{array}{*{20}{c}} {{\rm{0}}{\rm{.003}}} \\ {{\rm{0}}{\rm{.002}}} \end{array}} \right] | \left[{\begin{array}{*{20}{c}} {{\rm{0}}{\rm{.5}}} \\ {{\rm{0}}{\rm{.7}}} \end{array}} \right] | \left[{\begin{array}{*{20}{c}} {{\rm{0}}{\rm{.8}}} & { - {\rm{0}}{\rm{.5}}} \\ { - 0.6} & {0.5} \end{array}} \right] | \begin{gathered} {U_{\rm{1}}}(12, 15) \\ {U_{\rm{2}}}(10, 13) \\ \end{gathered} |
M4 | 400 | 3 | 1 | 1 | \left[{\begin{array}{*{20}{c}} {{\rm{0}}{\rm{.35}}} \\ {{\rm{0}}{\rm{.30}}} \\ {{\rm{0}}{\rm{.35}}} \end{array}} \right] | Gompertz | \left[{\begin{array}{*{20}{c}} {{\rm{0}}{\rm{.0002}}} \\ {{\rm{0}}{\rm{.002}}} \\ {{\rm{0}}{\rm{.0003}}} \end{array}} \right] | \left[{\begin{array}{*{20}{c}} {{\rm{0}}{\rm{.7}}} \\ {{\rm{2}}{\rm{.0}}} \\ {{\rm{0}}{\rm{.8}}} \end{array}} \right] | \left[{\begin{array}{*{20}{c}} { - 0.8} \\ {{\rm{0}}{\rm{.2}}} \\ {{\rm{1}}{\rm{.0}}} \end{array}} \right] | \begin{gathered} {U_{\rm{1}}}(10, 15) \\ {U_{\rm{2}}}(4, 6) \\ {U_{\rm{3}}}(15, 20) \\ \end{gathered} |
1: sample size; 2: number of risk types; 3: number of covariates; 4: degree of models; 5: baseline hazard function; 6: censored times are generated from a uniform distribution U_{i}(a, b) for i=1, …, g. |
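Survival times under these models can be drawn by inverse-transform sampling. A sketch for an M1-style two-component Weibull setup; censoring from U_i(a, b) is omitted for brevity, and the function name and seed are ours:

```python
import numpy as np

def simulate_m1_like(n=200, p=(0.7, 0.3), lam=(0.005, 1.5),
                     rho=(3.0, 2.0), beta=(0.3, 0.5), seed=1):
    """Draws from a 2-component Weibull mixture hazard model (an M1-style
    setup): for component i, H0i(t) = lam_i * t^rho_i, so inverting
    S(t) = exp(-lam_i * t^rho_i * exp(x*beta_i)) = U gives
    T = (-log(U) / (lam_i * exp(x*beta_i)))^(1/rho_i)."""
    rng = np.random.default_rng(seed)
    x = rng.uniform(-4, 4, n)                    # covariates, as in Table 2
    comp = rng.choice(len(p), size=n, p=p)       # latent risk type per subject
    u = rng.uniform(size=n)
    lam_, rho_, beta_ = (np.asarray(a)[comp] for a in (lam, rho, beta))
    t = (-np.log(u) / (lam_ * np.exp(x * beta_))) ** (1 / rho_)
    return t, x, comp

t, x, comp = simulate_m1_like()
```

A Gompertz baseline is handled the same way by inverting its cumulative hazard {H_{0i}}(t) = ({\lambda _i}/{\rho _i})({e^{{\rho _i}t}} - 1) instead.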
We consider an EM-based semi-parametric mixture hazards model to analyze the simulated data and compare two methods of estimating the baseline hazard function. The first method, proposed by Ng and McLachlan [5], assumes the baseline hazard function is piecewise constant and estimates it by maximum likelihood estimation (MLE). The second method, introduced in this paper, estimates the baseline hazard rates with a kernel estimator using the biweight kernel.
In order to graphically demonstrate the results, we first show the results for a single simulation run in Table 3 and Figure 4. The correct rate (CR) in Table 3 is the percentage of individuals matched to their true attributable type of risk. According to the estimation results, we match each individual to the type of risk with the largest posterior probability. Thus, this correct rate is defined as:
CR = \frac{1}{n}\sum\limits_{j = 1}^n {\sum\limits_{i = 1}^g {I\left\{ {j \in risk(i) \cap {{\hat z}_{ij}} = \mathop {\max }\limits_i ({{\mathit{\boldsymbol{\hat Z}}}_j})} \right\}} } where { \mathit{\boldsymbol{\hat Z}}_j} = {({\hat z_{1j}}, {\hat z_{2j}}, ..., {\hat z_{gj}})^T}. |
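The correct rate can be computed directly from the matrix of posterior probabilities; a minimal sketch (names and toy values are ours):

```python
import numpy as np

def correct_rate(true_comp, z_hat):
    """CR: fraction of subjects whose true risk type attains the largest
    posterior probability. z_hat: (g, n) posteriors; true_comp: (n,)
    true component labels in 0..g-1."""
    assigned = np.argmax(z_hat, axis=0)   # risk type with largest posterior
    return float(np.mean(assigned == true_comp))

# Toy posteriors for g=2 risk types and n=4 subjects
z_hat = np.array([[0.9, 0.2, 0.6, 0.4],
                  [0.1, 0.8, 0.4, 0.6]])
true_comp = np.array([0, 1, 0, 0])
cr = correct_rate(true_comp, z_hat)       # 3 of 4 subjects matched
```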
 | | \mathbf{p} | \boldsymbol{\beta} | CR | MsSSE/n |
M1 | True1 | \left[\begin{array}{c} 0.7 \\ 0.3 \end{array}\right] | \left[\begin{array}{c} 0.3 \\ 0.5 \end{array}\right] | | |
 | Piecewise2 | \left[\begin{array}{c} 0.561 \\ 0.439 \end{array}\right] | \left[\begin{array}{c} 0.528 \\ 0.851 \end{array}\right] | 0.860 | 0.810 |
 | Kernel3, bw4=1.0 | \left[\begin{array}{c} 0.672 \\ 0.328 \end{array}\right] | \left[\begin{array}{c} 0.336 \\ 0.586 \end{array}\right] | 0.945 | 0.659 |
M2 | True | \left[\begin{array}{c} 0.5 \\ 0.5 \end{array}\right] | \left[\begin{array}{cc} 0.8 & 0.1 \\ -0.6 & 0.1 \end{array}\right] | | |
 | Piecewise | \left[\begin{array}{c} 0.641 \\ 0.958 \end{array}\right] | \left[\begin{array}{cc} 0.674 & 0.136 \\ -1.136 & 0.298 \end{array}\right] | 0.705 | 0.963 |
 | Kernel, bw=0.5 | \left[\begin{array}{c} 0.523 \\ 0.476 \end{array}\right] | \left[\begin{array}{cc} 0.738 & 0.078 \\ -0.762 & 0.146 \end{array}\right] | 0.855 | 0.910 |
M3 | True | \left[\begin{array}{c} 0.5 \\ 0.5 \end{array}\right] | \left[\begin{array}{cc} 0.8 & -0.5 \\ -0.6 & 0.5 \end{array}\right] | | |
 | Piecewise | \left[\begin{array}{c} 0.508 \\ 0.491 \end{array}\right] | \left[\begin{array}{cc} 0.993 & -0.562 \\ -0.562 & 0.608 \end{array}\right] | 0.838 | 1.240 |
 | Kernel, bw=0.4 | \left[\begin{array}{c} 0.478 \\ 0.522 \end{array}\right] | \left[\begin{array}{cc} 0.885 & -0.534 \\ -0.628 & 0.521 \end{array}\right] | 0.843 | 1.142 |
M4 | True | \left[\begin{array}{c} 0.35 \\ 0.30 \\ 0.35 \end{array}\right] | \left[\begin{array}{c} -0.8 \\ 0.2 \\ 1.0 \end{array}\right] | | |
 | Piecewise | \left[\begin{array}{c} 0.399 \\ 0.265 \\ 0.335 \end{array}\right] | \left[\begin{array}{c} -0.938 \\ 0.920 \\ 1.137 \end{array}\right] | 0.693 | 1.211 |
 | Kernel, bw=0.9 | \left[\begin{array}{c} 0.368 \\ 0.306 \\ 0.325 \end{array}\right] | \left[\begin{array}{c} -0.806 \\ 0.192 \\ 0.927 \end{array}\right] | 0.873 | 0.828 |
1: true parameters; 2: piecewise constant estimates; 3: kernel estimates; 4: bandwidth. |
Under M1 with the piecewise constant estimator, the estimated mixing probabilities, the CR in Table 3, and the expectation line in Figure 4(M1-1) show that some individuals whose true risk type is the 1st are misclassified into the 2nd type. As a result, the estimates of the regression coefficients in Table 3 and the cumulative baseline hazard rate in Figure 4(M1-2) are not close to the true model. Similarly, under M4, the expectation lines for the 1st and 2nd types of risk in Figure 4(M4-1) show that the piecewise constant estimator misclassifies some individuals between the 1st and 2nd types of risk, and the estimates of the regression coefficients in Table 3 and the cumulative baseline hazard rate in Figure 4(M4-2) deviate from the true model. Clearly, using the kernel procedure for baseline hazard estimation increases the CR compared with the piecewise constant procedure.
We next show the results for 1000 simulations in Table 4. The absolute relative bias (ARB) for parameter θ is defined by:
ARB(\theta) = \left| \frac{E(\hat{\theta}) - \theta}{E(\hat{\theta})} \right|.
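The definition above, and its average over the k parameters of a model as used in Table 4, can be sketched directly. Here `theta_hat_mean` stands for the Monte Carlo mean of the estimates over all simulation runs:

```python
def arb(theta_hat_mean, theta):
    """Absolute relative bias: |E(theta_hat) - theta| / |E(theta_hat)|,
    with E(theta_hat) approximated by the Monte Carlo mean of the estimates."""
    return abs((theta_hat_mean - theta) / theta_hat_mean)

def mean_arb(theta_hat_means, thetas):
    """Mean ARB over the k parameters of a model."""
    return sum(arb(m, t) for m, t in zip(theta_hat_means, thetas)) / len(thetas)
```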
 | | bias_{\mathbf{p}}3 | MSE_{\mathbf{p}}4 | bias_{\boldsymbol{\beta}}5 | MSE_{\boldsymbol{\beta}}6 | \overline{ARB} | \overline{CR} | \overline{MsSSE} |
M1 | Piecewise1 | \left[\begin{array}{c} -0.160 \\ 0.160 \end{array}\right] | \left[\begin{array}{c} 0.026 \\ 0.026 \end{array}\right] | \left[\begin{array}{c} 0.088 \\ 0.275 \end{array}\right] | \left[\begin{array}{c} 0.020 \\ 0.076 \end{array}\right] | 0.401 | 0.699 | 0.796 |
 | Kernel2 | \left[\begin{array}{c} -0.035 \\ 0.035 \end{array}\right] | \left[\begin{array}{c} 0.002 \\ 0.002 \end{array}\right] | \left[\begin{array}{c} -0.073 \\ -0.007 \end{array}\right] | \left[\begin{array}{c} 0.007 \\ 0.000 \end{array}\right] | 0.107 | 0.856 | 0.653 |
M2 | Piecewise | \left[\begin{array}{c} 0.132 \\ -0.132 \end{array}\right] | \left[\begin{array}{c} 0.017 \\ 0.017 \end{array}\right] | \left[\begin{array}{cc} -0.097 & 0.041 \\ -0.652 & 0.172 \end{array}\right] | \left[\begin{array}{cc} 0.010 & 0.001 \\ 0.429 & 0.029 \end{array}\right] | 0.646 | 0.680 | 1.329 |
 | Kernel | \left[\begin{array}{c} 0.089 \\ -0.089 \end{array}\right] | \left[\begin{array}{c} 0.008 \\ 0.008 \end{array}\right] | \left[\begin{array}{cc} -0.123 & 0.054 \\ -0.311 & 0.017 \end{array}\right] | \left[\begin{array}{cc} 0.018 & 0.006 \\ 0.124 & 0.000 \end{array}\right] | 0.292 | 0.774 | 1.009 |
M3 | Piecewise | \left[\begin{array}{c} 0.028 \\ -0.028 \end{array}\right] | \left[\begin{array}{c} 0.000 \\ 0.000 \end{array}\right] | \left[\begin{array}{cc} 0.167 & -0.091 \\ -0.079 & 0.046 \end{array}\right] | \left[\begin{array}{cc} 0.028 & 0.008 \\ 0.006 & 0.002 \end{array}\right] | 0.122 | 0.847 | 1.271 |
 | Kernel | \left[\begin{array}{c} -0.006 \\ 0.006 \end{array}\right] | \left[\begin{array}{c} 0.000 \\ 0.000 \end{array}\right] | \left[\begin{array}{cc} 0.033 & -0.020 \\ 0.069 & -0.051 \end{array}\right] | \left[\begin{array}{cc} 0.001 & 0.000 \\ 0.004 & 0.002 \end{array}\right] | 0.054 | 0.849 | 1.097 |
M4 | Piecewise | \left[\begin{array}{c} 0.043 \\ -0.055 \\ 0.012 \end{array}\right] | \left[\begin{array}{c} 0.001 \\ 0.003 \\ 0.000 \end{array}\right] | \left[\begin{array}{c} -0.003 \\ 0.791 \\ 0.251 \end{array}\right] | \left[\begin{array}{c} 0.002 \\ 0.627 \\ 0.063 \end{array}\right] | 0.766 | 0.646 | 0.737 |
 | Kernel | \left[\begin{array}{c} 0.018 \\ -0.042 \\ 0.023 \end{array}\right] | \left[\begin{array}{c} 0.000 \\ 0.001 \\ 0.000 \end{array}\right] | \left[\begin{array}{c} 0.032 \\ 0.071 \\ -0.014 \end{array}\right] | \left[\begin{array}{c} 0.002 \\ 0.009 \\ 0.000 \end{array}\right] | 0.112 | 0.799 | 0.565 |
1: piecewise constant estimates; 2: kernel estimates; 3: bias of \mathbf{p}; 4: mean square error (MSE) of \mathbf{p}; 5: bias of \boldsymbol{\beta}; 6: mean square error (MSE) of \boldsymbol{\beta}.
In Table 4 the mean absolute relative bias \overline{ARB} of a model with k parameters is defined by \overline{ARB} = \frac{1}{k}\sum\nolimits_{i = 1}^{k} ARB(\theta_i). Moreover, \overline{CR} and \overline{MsSSE} are the means of CR and MsSSE/n over the simulation runs. Table 4 shows that the \overline{ARB} and \overline{MsSSE} of the kernel estimate are smaller than those of the piecewise constant estimate. Moreover, the \overline{CR} of the kernel estimate is larger than that of the piecewise constant estimate in all cases considered. Thus, the model with the baseline hazard functions estimated by the kernel method fits the data better than the model with a piecewise constant baseline.
In Section 4.2, we fit the EM-based semi-parametric mixture hazards model to data simulated under models M1~M4, considering several possible numbers of risk types (i.e., model components), and use the kernel estimator with the biweight kernel to estimate the baseline hazard rates. We then use validity indices to select the optimal number of model components. The following six validity indices are compared with the proposed validity indices ( {V_{aRaS}} , {V_{sRaS}} , {V_{aRmS}} , and {V_{sRmS}} ).
1. Partition coefficient {V_{PC}} proposed by Bezdek [22].
2. Normalized partition coefficient {V_{NPC}} proposed by Dave [23].
3. Partition entropy {V_{PE}} proposed by Bezdek [24].
4. Normalized partition entropy {V_{NPE}} proposed by Dunn [25].
5. Akaike information criterion AIC.
6. Bayesian information criterion BIC.
It is well known that memberships play an important role in fuzzy clustering. Similarly, under the EM-based mixture model, the posterior probabilities play a role analogous to that of memberships. Therefore, we replace the memberships with posterior probabilities in the validity indices {V_{PC}} , {V_{NPC}} , {V_{PE}} , and {V_{NPE}} . Moreover, the formulas for AIC and BIC are computed by
AIC = - 2 \cdot l_c(\hat{\mathbf{p}}, \hat{\boldsymbol{\beta}}) + 2k ; \quad BIC = - 2 \cdot l_c(\hat{\mathbf{p}}, \hat{\boldsymbol{\beta}}) + k\log(n),
where l_c(\hat{\mathbf{p}}, \hat{\boldsymbol{\beta}}) is the complete-data log-likelihood (3) evaluated at the estimated parameters, and k is the number of estimated parameters.
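The posterior-probability versions of the partition indices, together with AIC and BIC, can be sketched as follows. The partition coefficient and partition entropy follow the classical Bezdek definitions with posteriors in place of memberships; the normalizations shown are common conventions and are assumptions here, since the exact normalizing forms are not reproduced in this section:

```python
import math

def partition_indices(post):
    """Partition coefficient V_PC and partition entropy V_PE computed from an
    n x g matrix of posterior probabilities, plus normalized versions
    (one common normalization for each; a sketch, not the paper's exact forms)."""
    n, g = len(post), len(post[0])
    vpc = sum(z * z for row in post for z in row) / n
    vpe = -sum(z * math.log(z) for row in post for z in row if z > 0) / n
    vnpc = (g * vpc - 1) / (g - 1)  # rescaled so hard partitions give 1
    vnpe = vpe / math.log(g)        # rescaled so uniform posteriors give 1
    return vpc, vnpc, vpe, vnpe

def aic_bic(loglik, k, n):
    """AIC and BIC from the complete-data log-likelihood l_c."""
    return -2 * loglik + 2 * k, -2 * loglik + k * math.log(n)
```

A perfectly crisp partition maximizes V_PC and minimizes V_PE, which is why larger V_PC and smaller V_PE indicate a better-separated component structure.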
In all, we consider ten indices ( {V_{PC}} , {V_{NPC}} , {V_{PE}} , {V_{NPE}} , AIC, BIC, {V_{aRaS}} , {V_{sRaS}} , {V_{aRmS}} , and {V_{sRmS}} ) to select the optimal number of model components. Table 5 reports, for each index, the proportion of 1000 simulation runs in which the correct number of model components is chosen. In each run, each of models M1~M4 is fitted with 2, 3, and 4 components separately. Note that we assume the number of model components is greater than one, as required by the proposed validity indices. The proportion of choosing the correct number of risk types by each index in Table 5 is defined as:
\frac{\#(\text{choose correct } g \text{ by index})}{\#(\text{simulations})}.
{V_{PC}} | {V_{NPC}} | {V_{PE}} | {V_{NPE}} | AIC | BIC | {V_{aRaS}} | {V_{sRaS}} | {V_{aRmS}} | {V_{sRmS}} | |
M1 | 0.962 | 0.880 | 0.976 | 0.894 | 0.964 | 0.950 | 0.896 | 0.896 | 0.984 | 0.992 |
M2 | 0.954 | 0.564 | 0.963 | 0.485 | 0.524 | 0.631 | 0.863 | 0.851 | 0.981 | 0.990 |
M3 | 1.000 | 0.798 | 1.000 | 0.868 | 0.998 | 0.998 | 0.994 | 0.998 | 1.000 | 1.000 |
M4 | 0.486 | 0.780 | 0.413 | 0.810 | 0.646 | 0.660 | 0.923 | 0.916 | 0.813 | 0.703 |
Table 5 shows that the proportions of choosing the correct g by the traditional indices {V_{PC}} , {V_{NPC}} , {V_{PE}} , {V_{NPE}} , AIC, and BIC are not consistent across models M1~M4: each of these indices performs poorly for at least one model (highlighted in the table). On the other hand, the proposed indices ( {V_{aRaS}} , {V_{sRaS}} , {V_{aRmS}} , and {V_{sRmS}} ) are consistent and achieve high proportions for every model; the only exception is {V_{sRmS}} under M4, whose proportion of 0.703 is slightly low but still higher than that of most traditional indices. Hence, the proposed validity indices are superior to the others in selecting the correct number of components.
As a practical illustration of the proposed EM-based semi-parametric mixture hazard model, we consider the survival times of 506 patients with prostate cancer who entered a clinical trial during 1967–1969. The patients were randomly allocated to different levels of treatment with the drug diethylstilbestrol (DES); the data were considered by Byar and Green [26] and published by Andrews and Herzberg [27]. Kay [28] analyzed a subset of the data using eight categorical covariates: drug treatment (RX: 0, 0.0 or 0.2 mg estrogen; 1, 1.0 or 5.0 mg estrogen); age group (AG: 0, < 75 years; 1, 75 to 79 years; 2, > 79 years); weight index (WT: 0, > 99 kg; 1, 80 to 99 kg; 2, < 80 kg); performance rating (PF: 0, normal; 1, limitation of activity); history of cardiovascular disease (HX: 0, no; 1, yes); serum haemoglobin (HG: 0, > 12 g/100 ml; 1, 9 to 12 g/100 ml; 2, < 9 g/100 ml); size of primary lesion (SZ: 0, < 30 cm2; 1, ≥ 30 cm2); and Gleason stage/grade category (SG: 0, ≤ 10; 1, > 10). Cheng et al. [3] classified this dataset into three types of risk: (1) death due to prostate cancer; (2) death due to cardiovascular disease (CVD); and (3) other causes.
We analyze the same dataset with the eight categorical variables (RX, AG, WT, PF, HX, HG, SZ, SG). There are 483 patients with complete information on these covariates, and the proportion of censored observations is 28.8%. We ignore the information about the risk factors and use the indices {V_{PC}} , {V_{NPC}} , {V_{PE}} , {V_{NPE}} , AIC, BIC, {V_{aRaS}} , {V_{sRaS}} , {V_{aRmS}} , and {V_{sRmS}} to select the optimal number of risk types. From Table 6, the number of risk types selected by {V_{aRaS}} , {V_{sRaS}} , {V_{aRmS}} , and {V_{sRmS}} is three, whereas the other indices select two. The number of model components selected by the proposed indices agrees with the previous study by Cheng et al. [3].
{V_{PC}} | {V_{NPC}} | {V_{PE}} | {V_{NPE}} | AIC | BIC | {V_{aRaS}} | {V_{sRaS}} | {V_{aRmS}} | {V_{sRmS}} | |
g = 2 | 0.7813 | 0.5626 | 0.3369 | 0.5720 | 4.1518 | 4.2989 | 0.5894 | 0.4437 | 0.5894 | 0.4437 |
g = 3 | 0.6684 | 0.5027 | 0.5260 | 0.6135 | 4.5012 | 4.7262 | 0.3783 | 0.1974 | 0.5016 | 0.2943 |
g = 4 | 0.5581 | 0.4109 | 0.7564 | 0.7075 | 4.7967 | 5.0996 | 0.4746 | 57.572 | 0.6123 | 98.534 |
Note: (1) g represents the number of risk types when estimating. (2) The optimal values of g according to each index are highlighted in bold. |
From existing medical experience and a previous study, we presume that these model components may agree with particular types of risk, and we can then decide whether there are significant relationships between the covariates and the survival times by using the Wald test. Based on the cause-specific hazard approach, Cheng et al. [3] found that treatment with a higher DES dosage (RX = 1) significantly reduces the risk of death due to prostate cancer. Table 7 shows that the DES dosage has a significant effect on time to death due to the 1st type of risk, and that the estimated regression coefficient of RX is negative. Byar and Green [26] found that patients with a large primary lesion (SZ = 1) or a high-grade tumor (SG = 1) are at greater risk of prostate cancer death. Table 7 shows that SZ and SG have a significant effect on time to death due to the 1st type of risk, and that the corresponding estimated regression coefficients are all positive. Accordingly, we presume the 1st type of risk relates to prostate cancer. Furthermore, based on the cause-specific hazard approach, Cheng et al. [3] found that treatment with a higher DES dosage (RX = 1) significantly increases the risk of death due to CVD. From Table 7, we see that the DES dosage has a significant effect on time to death due to the 2nd and 3rd types of risk, and that the estimated regression coefficient of RX is positive there.
1st type of risk | 2nd type of risk | 3rd type of risk | |||
\mathbf{p} | 0.2132 | 0.3930 | 0.3936 | | |
\boldsymbol{\beta} | RX | −0.0296* (0.1267) | 0.3546* (0.1414) | 0.7589* (0.1425) | |
AG | 0.3144*(0.1143) | 1.7445*(0.1041) | 1.8104*(0.1396) | ||
WT | −0.0817*(0.0916) | 1.7915*(0.0967) | −0.5555*(0.1290) | ||
PF | 1.4742*(0.2233) | 0.1244*(0.2527) | 1.6468*(0.3325) | ||
HX | 3.0027*(0.1176) | 1.2829*(0.1377) | −0.6092*(0.1486) | ||
HG | 0.8489*(0.1536) | 1.6074*(0.1669) | −5.2153*(0.7267) | ||
SZ | 0.8567*(0.2119) | 3.0334*(0.1998) | −3.2661*(0.4074) | ||
SG | 4.3184*(0.1010) | −0.3907*(0.1419) | −0.9933*(0.1560) | ||
Note: * denotes P-value < 0.05. |
We know that patients with a history of cardiovascular disease (HX = 1) have a higher probability of death due to CVD than patients without such a history. Table 7 shows that the estimated regression coefficient of HX is positive for the 2nd type of risk. Hence, we presume the 2nd type of risk may relate to CVD. There is no explicit relationship between the covariates and the survival times associated with the 3rd type of risk, so we only presume the 3rd type of risk may relate to other causes of death without specification. According to the significant relationships between covariates and survival times, we conclude that the 1st, 2nd, and 3rd types of risk estimated by the EM-based semi-parametric mixture hazard model correspond to prostate cancer, CVD, and other unspecified causes, respectively.
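The significance assessments above rest on the Wald test for individual coefficients. A minimal sketch, assuming the standard two-sided normal form (estimates and standard errors paired as in Table 7):

```python
import math

def wald_test(beta_hat, se):
    """Wald test for one regression coefficient: z = beta_hat / se and the
    two-sided p-value P(|Z| > |z|) under the standard normal, via erfc."""
    z = beta_hat / se
    p = math.erfc(abs(z) / math.sqrt(2))
    return z, p
```

For example, a z-statistic of 1.96 yields a p-value of about 0.05, the usual significance threshold marked by * in Table 7.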
This study introduces four new validity indices, {V_{aRaS}} , {V_{sRaS}} , {V_{aRmS}} , and {V_{sRmS}} , for deciding the number of model components when applying an EM-based Cox proportional hazards mixture model to competing-risks data. We incorporate the posterior probabilities and the sum of standardized residuals to construct the new validity indices. Moreover, our study sets up an extended kernel approach to estimate the baseline functions more smoothly and accurately. Extensive simulations show that the kernel procedure for baseline hazard estimation helps increase the correct rate of classifying individuals into their true attributable type of risk. Furthermore, the simulation results demonstrate that the proposed validity indices are consistent and select the optimal number of model components more often than the traditional competitors. Thus, the proposed indices are superior to traditional indices, including the most commonly used ones in statistics, AIC and BIC. We also apply the proposed method to a prostate cancer dataset to illustrate its practicability.
Applying the four new validity indices simultaneously gives the best chance of selecting the optimal number of model components. One concern is picking the best one among the proposed validity indices. The average separation versions ( {V_{aRaS}} , {V_{sRaS}} ) easily neutralize the effects of small and large distances among the expectations of the component models. On the other hand, whenever there is a small distance among the expectations of the component models, the minimum separation versions ( {V_{aRmS}} , {V_{sRmS}} ) capture the information about an overfitting model. In the analysis of the prostate cancer data, {V_{aRmS}} and {V_{sRmS}} are more sensitive than {V_{aRaS}} and {V_{sRaS}} in detecting overfitting models (i.e., the distances of the indices between overfitting and optimal models are much larger than those between underfitting and optimal models). Furthermore, since the simulation results show that {V_{sRmS}} performs slightly poorly on one model, we recommend employing {V_{aRmS}} if just one of the proposed validity indices is to be used.
In future work we may test the effectiveness of the proposed validity indices on statistical models other than mixture Cox proportional hazards regression models. We could also improve the efficiency of the proposed indices in determining the number of components of mixture models. Another issue is reducing the computational cost: for instance, the bandwidth of the kernel procedure for baseline hazard estimation is recalculated at each iteration, which consumes computation time. All these factors need further investigation and will be covered in our future research.
The authors thank the anonymous reviewers for their insightful comments and suggestions which have greatly improved this article. This work was partially supported by the Ministry of Science and Technology, Taiwan [Grant numbers MOST 108-2118-M-025-001-and MOST 108-2118-M-003-003-].
[1] |
Rastogi A, Zivcak M, Sytar O, et al. (2017) Impact of metal and metal oxide nanoparticles on plant: a critical review. Front Chem 5: 78. doi: 10.3389/fchem.2017.00078
![]() |
[2] |
Etheridge ML, Campbell SA, Erdman AG, et al. (2013) The big picture on nanomedicine: the state of investigational and approved nanomedicine products. Nanomed: Nanotechnol, Biol Med 9: 1-14. doi: 10.1016/j.nano.2012.05.013
![]() |
[3] |
Roco MC (2005) International perspective on government nanotechnology funding in 2005. J Nanopart Res 7: 707-712. doi: 10.1007/s11051-005-3141-5
![]() |
[4] |
Maynard AD, Aitken RJ, Butz T, et al. (2006) Safe handling of nanotechnology. Nature 444: 267-269. doi: 10.1038/444267a
![]() |
[5] |
Faraji M, Yamini Y, Rezaee M (2010) Magnetic nanoparticles: synthesis, stabilization, functionalization, characterization, and applications. J Iran Chem Soc 7: 1-37. doi: 10.1007/BF03245856
![]() |
[6] | Nidhin M, Indumathy R, Sreeram KJ, et al. (2008) Synthesis of iron oxide nanoparticles of narrow size distribution on polysaccharide templates. Bull Mater Sci 31: 93-96. |
[7] |
Xu Y, Qin Y, Palchoudhury S, et al. (2011) Water-soluble iron oxide nanoparticles with high stability and selective surface functionality. Langmuir 27: 8990-8997. doi: 10.1021/la201652h
![]() |
[8] | Scott N, Chen H (2012) Nanoscale science and engineering for agriculture and food systems. Ind Biotechnol 8: 340-343. |
[9] |
Rizwan M, Ali S, Ali B, et al. (2019) Zinc and iron oxide nanoparticles improved the plant growth and reduced the oxidative stress and cadmium concentration in wheat. Chemosphere 214: 269-277. doi: 10.1016/j.chemosphere.2018.09.120
![]() |
[10] | Zareii FD, Roozbahani A, Hosnamidi A (2014) Evaluation the effect of water stress and foliar application of Fe nanoparticles on yield, yield components and oil percentage of safflower (Carthamus tinctorious L.). Int J Adv Biol Biom Res 2: 1150-1159. |
[11] |
Vasconcelos MW, Grusak MA (2014) Morpho-physiological parameters affecting iron deficiency chlorosis in soybean (Glycine max L.). Plant Soil 374: 161-172. doi: 10.1007/s11104-013-1842-6
![]() |
[12] | Rui M, Ma C, Hao Y, et al. (2016) Iron oxide nanoparticles as a potential iron fertilizer for peanut (Arachis hypogaea). Front Plant Sci 7: 815. |
[13] | Kavian B. Negahdar, Ghaziani MVF (2014) The effect of iron nano-chelate and cycocel on some morphological and physiological characteristics, proliferation and enhancing the quality of Euphorbia pulcherrima Willd. Sci Pap Ser B Hortic 58: 337-342. |
[14] |
Bombin S, LeFebvre M, Sherwood J, et al. (2015) Developmental and reproductive effects of iron oxide nanoparticles in Arabidopsis thaliana. Int J Mol Sci 16: 24174-24193. doi: 10.3390/ijms161024174
![]() |
[15] |
Antisari LV, Carbone S, Gatti A, et al. (2015) Uptake and translocation of metals and nutrients in tomato grown in soil polluted with metal oxide (CeO2, Fe3O4, SnO2, TiO2) or metallic (Ag, Co, Ni) engineered nanoparticles. Environ Sci Pollut Res 22: 1841-1853. doi: 10.1007/s11356-014-3509-0
![]() |
[16] |
El-Temsah YS, Joner EJ (2012) Impact of Fe and Ag nanoparticles on seed germination and differences in bioavailability during exposure in aqueous suspension and soil. Environ Toxicol 27: 42-49. doi: 10.1002/tox.20610
![]() |
[17] |
Ren HX, Liu L, Liu C, et al. (2011) Physiological investigation of magnetic iron oxide nanoparticles towards Chinese mung bean. J Biomed Nanotechnol 7: 677-684. doi: 10.1166/jbn.2011.1338
![]() |
[18] |
Kim JH, Lee Y, Kim EJ. et al. (2014). Exposure of iron nanoparticles to Arabidopsis thaliana enhances root elongation by triggering cell wall loosening. Environ Sci Technol 48: 3477-3485. doi: 10.1021/es4043462
![]() |
[19] |
García A, Espinosa R, Delgado L, et al. (2011) Acute toxicity of cerium oxide, titanium oxide and iron oxide nanoparticles using standardized tests. Desalination 269: 136-141. doi: 10.1016/j.desal.2010.10.052
![]() |
[20] | Trujillo-Reyes J, Majumdar S, Botez CE, et al. (2014) Exposure studies of core-shell Fe/Fe3O4 and Cu/CuO NPs to lettuce (Lactuca sativa) plants: are they a potential physiological and nutritional hazard? J Hazard Mater 267: 255-263. |
[21] | Popova LV, Popkova EG, Dubova YI, et al. (2016) Financial mechanisms of nanotechnology development in developing countries. J Appl Econ Sci 11: 584-590. |
[22] | Bottero JY, Auffan M, Borschnek D, et al. (2015) Nanotechnology, global development in the frame of environmental risk forecasting. A necessity of interdisciplinary researches. C R Geosci 347: 35-42. |
[23] |
Ma X, Geisler-Lee J, Deng Y, et al. (2010) Interactions between engineered nanoparticles (ENPs) and plants: phytotoxicity, uptake and accumulation. Sci Total Environ 408: 3053-3061. doi: 10.1016/j.scitotenv.2010.03.031
![]() |
[24] |
Maxwell K, Johnson GN (2000) Chlorophyll fluorescence-a practical guide. J Exp Bot 51: 659-668. doi: 10.1093/jexbot/51.345.659
![]() |
[25] |
Hayat S, Hayat Q, Alyemeni MN, et al. (2012) Role of proline under changing environments: a review. Plant Signaling Behav 7: 1456-1466. doi: 10.4161/psb.21949
![]() |
[26] |
Zain N, Ismail M, Mahmood M, et al. (2014) Alleviation of water stress effects on MR220 rice by application of periodical water stress and potassium fertilization. Molecules 19: 1795-1819. doi: 10.3390/molecules19021795
![]() |
[27] |
Kong W, Liu F, Zhang C, et al. (2016) Non-destructive determination of Malondialdehyde (MDA) distribution in oilseed rape leaves by laboratory scale NIR hyperspectral imaging. Sci Rep 6: 35393. doi: 10.1038/srep35393
![]() |
[28] |
Ibrahim MH, Jaafar HZE (2012) Reduced photoinhibition under low irradiance enhanced Kacip fatimah (Labisia pumila Benth) secondary metabolites, phenyl alanine lyase and antioxidant activity. Int J Mol Sci 13: 5290-5306. doi: 10.3390/ijms13055290
![]() |
[29] | Ibrahim MH, Jaafar HZE, Rahmat A, et al. (2011) The relationship between phenolics and flavonoids production with total non-structural carbohydrate and photosynthetic rate in Labisia pumila Benth. Under high CO2 and nitrogen fertilization. Molecules 16: 162-174. |
[30] | Baskar V, Venkatesh R, Ramalingam S (2018) Flavonoids (antioxidants systems) in higher plants and their response to stresses, In: Gupta DK, Palma JM, Corpas FJ, Antioxidants and antioxidant enzymes in higher plants, New York: Springer, 253-268. |
[31] | Ibrahim MH, Jaafar HZE (2012) Primary, secondary metabolites, H2O2, malondialdehyde and photosynthetic responses of Orthosiphon stamineus Benth. To different irradiance levels. Molecules 17:1159-1176. |
[32] |
Bienfait HF, van den Briel W. Mesland-Mul NT (1985) Free space iron pools in roots: generation and mobilization. Plant Physiol 78: 596-600. doi: 10.1104/pp.78.3.596
![]() |
[33] | Corley RHV, Tinker PBH (2003) Mineral nutrition of oil palms, In: The Oil Palm, 4 Eds., Blackwell Science, 327-354. |
[34] | Iannone MF, Groppa MD, de Sousa ME, et al. (2016) Impact of magnetite iron oxide nanoparticles on wheat (Triticum aestivum L.) development: evaluation of oxidative damage. Environ Exp Bot 131: 77-88. |
[35] | Ghafariyan MH, Malakouti MJ, Dadpour MR, et al. (2013) Effects of magnetite nanoparticles on soybean chlorophyll. Environ Sci Technol 47: 10645-10652. |
[36] | Vasconcelos MW, Grusak MA (2014) Morpho-physiological parameters affecting iron deficiency chlorosis in soybean (Glycine max L.). Plant Soil 374 : 161-172. |
[37] | Mukherjee A, Peralta-Videa JR. Bandyopadhyay S, et al. (2014) Physiological effects of nanoparticulate ZnO in green peas (Pisum sativum L.) cultivated in soil. Metallomics 6: 132-138. |
[38] |
Ma J, Stiller J, Berkman PJ, et al. (2013) Sequence-based analysis of translocations and inversions in bread wheat (Triticum aestivum L.). PloS one 8: e79329. doi: 10.1371/journal.pone.0079329
![]() |
[39] |
Nair PMG, Chung IM (2014) Impact of copper oxide nanoparticles exposure on Arabidopsis thaliana growth, root system development, root lignificaion, and molecular level changes. Environ Sci Pollut Res 21: 12709-12722. doi: 10.1007/s11356-014-3210-3
![]() |
[40] |
Servin AD, Morales MI, Castillo-Michel H, et al. (2013) Synchrotron verification of TiO2 accumulation in cucumber fruit: a possible pathway of TiO2 nanoparticle transfer from soil into the food chain. Environ Sci Technol 47: 11592-11598. doi: 10.1021/es403368j
![]() |
[41] |
Jacob DL, Borchardt JD, Navaratnam L, et al. (2013) Uptake and translocation of Ti from nanoparticles in crops and wetland plants. Int J Phytorem 15: 142-153. doi: 10.1080/15226514.2012.683209
![]() |
[42] |
Ainsworth EA, Rogers A (2007) The response of photosynthesis and stomatal conductance to rising [CO2]: mechanisms and environmental interactions. Plant, Cell Environ 30: 258-270. doi: 10.1111/j.1365-3040.2007.01641.x
![]() |
[43] |
Schlüter U, Muschak M, Berger D, et al. (2003) Photosynthetic performance of an Arabidopsis mutant with elevated stomatal density (sdd1-1) under different light regimes. J Exp Bot 54: 867-874. doi: 10.1093/jxb/erg087
![]() |
[44] |
Xu Z, Zhou G (2008) Responses of leaf stomatal density to water status and its relationship with photosynthesis in a grass. J Exp Bot 59: 3317-3325. doi: 10.1093/jxb/ern185
![]() |
[45] |
Khan MIR, Iqbal N, Masood A, et al. (2013) Salicylic acid alleviates adverse effects of heat stress on photosynthesis through changes in proline production and ethylene formation. Plant Signaling Behav 8: e26374. doi: 10.4161/psb.26374
![]() |
[46] |
Urrego-Pereira YF, Martínez-Cob A, Fernández V, et al. (2013) Daytime sprinkler irrigation effects on net photosynthesis of maize and alfalfa. Agron J 105: 1515-1528. doi: 10.2134/agronj2013.0119
![]() |
[47] | Izad AI, Ibrahim MH, Abdullah CAC, et al. (2018) Growth, leaf gas exchange and secondary metabolites of Orthosiphon stamineus as affected by multiwall carbon nanotubes application. Annu Res Rev Biol 23: 1-13. |
[48] |
Hanson DT, Stutz SS, Boyer JS (2016) Why small fluxes matter: the case and approaches for improving measurements of photosynthesis and (photo) respiration. J Exp Bot 67: 3027-3039. doi: 10.1093/jxb/erw139
![]() |
[49] |
Farquhar GD, von Caemmerer S, Berry JA (1980) A biochemical model of photosynthetic CO2 assimilation in leaves of C3 species. Planta 149: 78-90. doi: 10.1007/BF00386231
![]() |
[50] | Sikuku PA, Netondo GW, Onyango JC, et al. (2010) Chlorophyll fluorescence, protein and chlorophyll content of three rainfed rice varieties under varying irrigation regimes. J Agric Biol Sci 5: 19-25. |
[51] | Strasser RJ, Stirbet AD (1998) Heterogeneity of photosystem II probed by the numerically simulated chlorophyll a fluorescence rise (O-J-I-P). Math Comput Simulat 48: 3-9. doi: 10.1016/S0378-4754(98)00150-5 |
[52] | Peterson RB, Havir EA (2003) Contrasting modes of regulation of PS II light utilization with changing irradiance in normal and psbS mutant leaves of Arabidopsis thaliana. Photosynth Res 75: 57-70. doi: 10.1023/A:1022458719949 |
[53] | Tezara W, Mitchell V, Driscoll SP, et al. (2002) Effects of water deficit and its interaction with CO2 supply on the biochemistry and physiology of photosynthesis in sunflower. J Exp Bot 53: 1781-1791. doi: 10.1093/jxb/erf021 |
[54] | Kalaji HM, Oukarroum A, Alexandrov V, et al. (2014) Identification of nutrient deficiency in maize and tomato plants by in vivo chlorophyll a fluorescence measurements. Plant Physiol Biochem 81: 16-25. doi: 10.1016/j.plaphy.2014.03.029 |
[55] | Msilini N, Zaghdoudi M, Govindachary S, et al. (2011) Inhibition of photosynthetic oxygen evolution and electron transfer from the quinone acceptor QA to QB by iron deficiency. Photosynth Res 107: 247-256. doi: 10.1007/s11120-011-9628-2 |
[56] | Yadavalli V, Neelam S, Rao ASVC, et al. (2012) Differential degradation of photosystem I subunits under iron deficiency in rice. J Plant Physiol 169: 753-759. doi: 10.1016/j.jplph.2012.02.008 |
[57] | Barhoumi N, Labiadh L, Oturan MA, et al. (2015) Electrochemical mineralization of the antibiotic levofloxacin by electro-Fenton-pyrite process. Chemosphere 141: 250-257. doi: 10.1016/j.chemosphere.2015.08.003 |
[58] | Kobayashi T, Nishizawa NK (2012) Iron uptake, translocation, and regulation in higher plants. Annu Rev Plant Biol 63: 131-152. |
[59] | Briat JF, Ravet K, Arnaud N, et al. (2009) New insights into ferritin synthesis and function highlight a link between iron homeostasis and oxidative stress in plants. Ann Bot 105: 811-822. |
[60] | Soliman AS, El-feky SA, Darwish E (2015) Alleviation of salt stress on Moringa peregrina using foliar application of nanofertilizers. J Hortic For 7: 36-47. doi: 10.5897/JHF2014.0379 |
[61] | Yoshiba Y, Kiyosue T, Katagiri T, et al. (1995) Correlation between the induction of a gene for Δ1-pyrroline-5-carboxylate synthetase and the accumulation of proline in Arabidopsis thaliana under osmotic stress. Plant J 7: 751-760. doi: 10.1046/j.1365-313X.1995.07050751.x |
[62] | Hayat S, Hayat Q, Alyemeni MN, et al. (2012) Role of proline under changing environments: a review. Plant Signaling Behav 7: 1456-1466. doi: 10.4161/psb.21949 |
[63] | Jaafar HZ, Ibrahim MH, Fakri M, et al. (2012) Impact of soil field water capacity on secondary metabolites, phenylalanine ammonia-lyase (PAL), malondialdehyde (MDA) and photosynthetic responses of Malaysian Kacip Fatimah (Labisia pumila Benth). Molecules 17: 7305-7322. doi: 10.3390/molecules17067305 |
[64] | Jones CG, Hartley SE (1999) A protein competition model of phenolic allocation. Oikos 86: 27-44. doi: 10.2307/3546567 |
[65] | Bharti AK, Khurana JP (2003) Molecular characterization of transparent testa (tt) mutants of Arabidopsis thaliana (ecotype Estland) impaired in flavonoid biosynthetic pathway. Plant Sci 165: 1321-1332. doi: 10.1016/S0168-9452(03)00344-3 |
[66] | Palmqvist NGM, Seisenbaeva GA, Svedlindh P, et al. (2017) Maghemite nanoparticles acts as nanozymes, improving growth and abiotic stress tolerance in Brassica napus. Nanoscale Res Lett 12: 631. doi: 10.1186/s11671-017-2404-2 |
[67] | Rui M, Ma C, Hao Y, et al. (2016) Iron oxide nanoparticles as a potential iron fertilizer for peanut (Arachis hypogaea). Front Plant Sci 7: 815. |
[68] | Ghasemzadeh A, Jaafar HZE, Rahmat A (2010) Antioxidant activities, total phenolics and flavonoids content in two varieties of Malaysia young ginger (Zingiber officinale Roscoe). Molecules 15: 4324-4333. doi: 10.3390/molecules15064324 |
[69] | Guo R, Yuan G, Wang Q (2011) Effect of sucrose and mannitol on the accumulation of health-promoting compounds and the activity of metabolic enzymes in broccoli sprouts. Sci Hortic 128: 159-165. doi: 10.1016/j.scienta.2011.01.014 |
[70] | Nguyen GN, Hailstones DL, Wilkes M, et al. (2010) Drought stress: role of carbohydrate metabolism in drought-induced male sterility in rice anthers. J Agron Crop Sci 196: 346-357. doi: 10.1111/j.1439-037X.2010.00423.x |
[71] | Akula R, Ravishankar GA (2011) Influence of abiotic stress signals on secondary metabolites in plants. Plant Signaling Behav 6: 1720-1731. doi: 10.4161/psb.6.11.17613 |
[72] | Kulbat K (2016) The role of phenolic compounds in plant resistance. Biotechnol Food Sci 80: 97-108. |
[73] | Lattanzio V, Lattanzio VMT, Cardinali A (2006) Role of phenolics in the resistance mechanisms of plants against fungal pathogens and insects. Phytochem: Adv in Res 661: 23-67. |
[74] | Ghorbanpour M, Hadian J (2015) Multi-walled carbon nanotubes stimulate callus induction, secondary metabolites biosynthesis and antioxidant capacity in medicinal plant Satureja khuzestanica grown in vitro. Carbon 94: 749-759. doi: 10.1016/j.carbon.2015.07.056 |
[75] | Moore MN (2006) Do nanoparticles present ecotoxicological risks for the health of the aquatic environment? Environ Int 32: 967-976. doi: 10.1016/j.envint.2006.06.014 |
[76] | Chinnamuthu CR, Boopathi PM (2009) Nanotechnology and agroecosystem. Madras Agric J 96: 17-31. |
[77] | Kanazawa K, Hashimoto T, Yoshida S, et al. (2012) Short photoirradiation induces flavonoid synthesis and increases its production in postharvest vegetables. J Agric Food Chem 60: 4359-4368. doi: 10.1021/jf300107s |
[78] | Xie Y, Xu D, Cui W, et al. (2012) Mutation of Arabidopsis HY1 causes UV-C hypersensitivity by impairing carotenoid and flavonoid biosynthesis and the down-regulation of antioxidant defence. J Exp Bot 63: 3869-3883. |
[79] | Kefeli VI, Kalevitch MV, Borsari B (2003) Phenolic cycle in plants and environment. J Cell Mol Biol 2: 13-18. |
[80] | Adamski JM, Peters JA, Danieloski R, et al. (2011) Excess iron-induced changes in the photosynthetic characteristics of sweet potato. J Plant Physiol 168: 2056-2062. |
[81] | Hänsch R, Mendel RR (2009) Physiological functions of mineral micronutrients (Cu, Zn, Mn, Fe, Ni, Mo, B, Cl). Curr Opin Plant Biol 12: 259-266. doi: 10.1016/j.pbi.2009.05.006 |
[82] | Briat JF, Curie C, Gaymard F (2007) Iron utilization and metabolism in plants. Curr Opin Plant Biol 10: 276-282. doi: 10.1016/j.pbi.2007.04.003 |
[83] | Kobayashi T, Nozoye T, Nishizawa NK (2019) Iron transport and its regulation in plants. Free Radical Biol Med 133: 11-20. doi: 10.1016/j.freeradbiomed.2018.10.439 |
[84] | Nenova VR (2009) Growth and photosynthesis of pea plants under different iron supply. Acta Physiol Plant 31: 385. doi: 10.1007/s11738-008-0247-2 |
[85] | Chatterjee C, Gopal R, Dube BK (2006) Impact of iron stress on biomass, yield, metabolism and quality of potato (Solanum tuberosum L.). Sci Hortic 108: 1-6. |
[86] | Xing W, Huang WM, Liu GH (2010) Effect of excess iron and copper on physiology of aquatic plant Spirodela polyrrhiza (L.) Schleid. Environ Toxicol 25: 103-112. |
[87] | Prasad MNV, Strzalka K (1999) Impact of heavy metals on photosynthesis. In: Prasad MNV, Hagemeyer J, Heavy Metal Stress in Plants: From Molecules to Ecosystems. Berlin-Heidelberg: Springer-Verlag, 117-138. |
[88] | Robello E, Galatro A, Puntarulo S (2007) Iron role in oxidative metabolism of soybean axes upon growth: effect of iron overload. Plant Sci 172: 939-947. doi: 10.1016/j.plantsci.2007.01.003 |
[89] | Fang WC, Wang JW, Lin CC, et al. (2001) Iron induction of lipid peroxidation and effects on antioxidative enzyme activities in rice leaves. Plant Growth Regul 35: 75-80. doi: 10.1023/A:1013879019368 |
[90] | Yin XL, Wang JX, Duan ZQ, et al. (2006) Study on the stomatal density and daily change rule of the wheat. Chin Agric Sci Bull 22: 237-242. |
1. | Yunfei Xu, Xianjun Wang, Huaizhi Yu (2024) Chapter 32, ISBN 978-981-97-1978-5, p. 361. doi: 10.1007/978-981-97-1979-2_32 |
Kernel function | K(u) |
Gaussian | K(u) = \frac{1}{\sqrt{2\pi}}\, e^{-\frac{1}{2}u^2}, \; -\infty < u < \infty |
Epanechnikov | K(u) = \frac{3}{4}(1 - u^2), \; \left| u \right| \leqslant 1 |
Biweight | K(u) = \frac{15}{16}(1 - u^2)^2, \; \left| u \right| \leqslant 1 |
Triweight | K(u) = \frac{35}{32}(1 - u^2)^3, \; \left| u \right| \leqslant 1 |
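The four kernels above are standard symmetric density kernels. As a quick reference, a minimal Python sketch (function names are my own) implementing them directly from the table:

```python
import math

def gaussian(u):
    # K(u) = (1/sqrt(2*pi)) * exp(-u^2/2), support: all real u
    return math.exp(-0.5 * u * u) / math.sqrt(2.0 * math.pi)

def epanechnikov(u):
    # K(u) = (3/4)(1 - u^2) on |u| <= 1, zero outside
    return 0.75 * (1.0 - u * u) if abs(u) <= 1.0 else 0.0

def biweight(u):
    # K(u) = (15/16)(1 - u^2)^2 on |u| <= 1, zero outside
    return (15.0 / 16.0) * (1.0 - u * u) ** 2 if abs(u) <= 1.0 else 0.0

def triweight(u):
    # K(u) = (35/32)(1 - u^2)^3 on |u| <= 1, zero outside
    return (35.0 / 32.0) * (1.0 - u * u) ** 3 if abs(u) <= 1.0 else 0.0
```

Each kernel is nonnegative and integrates to one over its support, so any of them can serve as the smoothing weight in a kernel estimator; only the bandwidth choice differs materially in practice.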
Model | n^1 | g^2 | d^3 | k^4 | \boldsymbol{p} = \begin{bmatrix} p_1 \\ \vdots \\ p_g \end{bmatrix} | BH^5 | \boldsymbol{\lambda} = \begin{bmatrix} \lambda_1 \\ \vdots \\ \lambda_g \end{bmatrix} | \boldsymbol{\rho} = \begin{bmatrix} \rho_1 \\ \vdots \\ \rho_g \end{bmatrix} | \boldsymbol{\beta} = \begin{bmatrix} \boldsymbol{\beta}_1^T \\ \vdots \\ \boldsymbol{\beta}_g^T \end{bmatrix} | U_i^6 |
M1 | 200 | 2 | 1 | 1 | \begin{bmatrix} 0.7 \\ 0.3 \end{bmatrix} | Weibull | \begin{bmatrix} 0.005 \\ 1.5 \end{bmatrix} | \begin{bmatrix} 3.0 \\ 2.0 \end{bmatrix} | \begin{bmatrix} 0.3 \\ 0.5 \end{bmatrix} | U_1(5, 9), U_2(2, 6) |
M2 | 200 | 2 | 1 | 2 | \begin{bmatrix} 0.5 \\ 0.5 \end{bmatrix} | Gompertz | \begin{bmatrix} 0.2 \\ 0.7 \end{bmatrix} | \begin{bmatrix} 1.5 \\ 2.0 \end{bmatrix} | \begin{bmatrix} 0.8 & 0.1 \\ -0.6 & 0.1 \end{bmatrix} | U_1(4, 9), U_2(4, 9) |
M3 | 400 | 2 | 2 | 1 | \begin{bmatrix} 0.5 \\ 0.5 \end{bmatrix} | Weibull | \begin{bmatrix} 0.003 \\ 0.002 \end{bmatrix} | \begin{bmatrix} 0.5 \\ 0.7 \end{bmatrix} | \begin{bmatrix} 0.8 & -0.5 \\ -0.6 & 0.5 \end{bmatrix} | U_1(12, 15), U_2(10, 13) |
M4 | 400 | 3 | 1 | 1 | \begin{bmatrix} 0.35 \\ 0.30 \\ 0.35 \end{bmatrix} | Gompertz | \begin{bmatrix} 0.0002 \\ 0.002 \\ 0.0003 \end{bmatrix} | \begin{bmatrix} 0.7 \\ 2.0 \\ 0.8 \end{bmatrix} | \begin{bmatrix} -0.8 \\ 0.2 \\ 1.0 \end{bmatrix} | U_1(10, 15), U_2(4, 6), U_3(15, 20) |
1: sample size; 2: number of risk types; 3: number of covariates; 4: degree of models; 5: baseline hazard function; 6: censored times are generated from a uniform distribution U_i(a, b) for i = 1, …, g. |
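For concreteness, the following is a sketch of how one replicate under a design like M1 could be generated. It assumes the Weibull baseline is parameterized so that S(t) = exp(−λt^ρ) and that the covariate acts multiplicatively on the hazard (proportional hazards); the function name, the uniform toy covariate, and the seed handling are my own choices, not necessarily the simulation study's exact mechanism.

```python
import math
import random

def simulate_m1(n=200, p=(0.7, 0.3), lam=(0.005, 1.5), rho=(3.0, 2.0),
                beta=(0.3, 0.5), cens=((5, 9), (2, 6)), seed=1):
    """Draw n (time, status, risk_type, x) records from a 2-type mixture.

    Risk type j is chosen with probability p[j]; given type j and a
    covariate x, the event time satisfies S(t) = exp(-lam[j] * t**rho[j]
    * exp(beta[j] * x)) and is right-censored at a U(a, b) draw.
    """
    rng = random.Random(seed)
    data = []
    for _ in range(n):
        j = 0 if rng.random() < p[0] else 1   # mixture component
        x = rng.random()                      # toy covariate in (0, 1)
        u = 1.0 - rng.random()                # uniform on (0, 1]
        # inverse-transform sampling: solve S(t) = u for t
        t = (-math.log(u) / (lam[j] * math.exp(beta[j] * x))) ** (1.0 / rho[j])
        c = rng.uniform(*cens[j])             # censoring time for type j
        data.append((min(t, c), int(t <= c), j + 1, x))
    return data
```

Repeating this over many seeds and re-estimating (p, β) on each replicate is what produces the empirical summaries reported in the tables below.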
\mathbf{p} | \boldsymbol{\beta} | CR | MsSSE/n | |
M1 | True^1 | \begin{bmatrix} 0.7 \\ 0.3 \end{bmatrix} | \begin{bmatrix} 0.3 \\ 0.5 \end{bmatrix} | | |
Piecewise^2 | \begin{bmatrix} 0.561 \\ 0.439 \end{bmatrix} | \begin{bmatrix} 0.528 \\ 0.851 \end{bmatrix} | 0.860 | 0.810 |
Kernel^3, bw^4 = 1.0 | \begin{bmatrix} 0.672 \\ 0.328 \end{bmatrix} | \begin{bmatrix} 0.336 \\ 0.586 \end{bmatrix} | 0.945 | 0.659 |
M2 | True | \begin{bmatrix} 0.5 \\ 0.5 \end{bmatrix} | \begin{bmatrix} 0.8 & 0.1 \\ -0.6 & 0.1 \end{bmatrix} | | |
Piecewise | \begin{bmatrix} 0.641 \\ 0.958 \end{bmatrix} | \begin{bmatrix} 0.674 & 0.136 \\ -1.136 & 0.298 \end{bmatrix} | 0.705 | 0.963 |
Kernel, bw = 0.5 | \begin{bmatrix} 0.523 \\ 0.476 \end{bmatrix} | \begin{bmatrix} 0.738 & 0.078 \\ -0.762 & 0.146 \end{bmatrix} | 0.855 | 0.910 |
M3 | True | \begin{bmatrix} 0.5 \\ 0.5 \end{bmatrix} | \begin{bmatrix} 0.8 & -0.5 \\ -0.6 & 0.5 \end{bmatrix} | | |
Piecewise | \begin{bmatrix} 0.508 \\ 0.491 \end{bmatrix} | \begin{bmatrix} 0.993 & -0.562 \\ -0.562 & 0.608 \end{bmatrix} | 0.838 | 1.240 |
Kernel, bw = 0.4 | \begin{bmatrix} 0.478 \\ 0.522 \end{bmatrix} | \begin{bmatrix} 0.885 & -0.534 \\ -0.628 & 0.521 \end{bmatrix} | 0.843 | 1.142 |
M4 | True | \begin{bmatrix} 0.35 \\ 0.30 \\ 0.35 \end{bmatrix} | \begin{bmatrix} -0.8 \\ 0.2 \\ 1.0 \end{bmatrix} | | |
Piecewise | \begin{bmatrix} 0.399 \\ 0.265 \\ 0.335 \end{bmatrix} | \begin{bmatrix} -0.938 \\ 0.920 \\ 1.137 \end{bmatrix} | 0.693 | 1.211 |
Kernel, bw = 0.9 | \begin{bmatrix} 0.368 \\ 0.306 \\ 0.325 \end{bmatrix} | \begin{bmatrix} -0.806 \\ 0.192 \\ 0.927 \end{bmatrix} | 0.873 | 0.828 |
1: true parameters; 2: piecewise constant estimates; 3: kernel estimates; 4: bandwidth. |
bias_{\mathbf{p}}^3 | MSE_{\mathbf{p}}^4 | bias_{\boldsymbol{\beta}}^5 | MSE_{\boldsymbol{\beta}}^6 | \overline{ARB} | \overline{CR} | \overline{MsSSE} | |
M1 | Piecewise^1 | \begin{bmatrix} -0.160 \\ 0.160 \end{bmatrix} | \begin{bmatrix} 0.026 \\ 0.026 \end{bmatrix} | \begin{bmatrix} 0.088 \\ 0.275 \end{bmatrix} | \begin{bmatrix} 0.020 \\ 0.076 \end{bmatrix} | 0.401 | 0.699 | 0.796 |
Kernel^2 | \begin{bmatrix} -0.035 \\ 0.035 \end{bmatrix} | \begin{bmatrix} 0.002 \\ 0.002 \end{bmatrix} | \begin{bmatrix} -0.073 \\ -0.007 \end{bmatrix} | \begin{bmatrix} 0.007 \\ 0.000 \end{bmatrix} | 0.107 | 0.856 | 0.653 |
M2 | Piecewise | \begin{bmatrix} 0.132 \\ -0.132 \end{bmatrix} | \begin{bmatrix} 0.017 \\ 0.017 \end{bmatrix} | \begin{bmatrix} -0.097 & 0.041 \\ -0.652 & 0.172 \end{bmatrix} | \begin{bmatrix} 0.010 & 0.001 \\ 0.429 & 0.029 \end{bmatrix} | 0.646 | 0.680 | 1.329 |
Kernel | \begin{bmatrix} 0.089 \\ -0.089 \end{bmatrix} | \begin{bmatrix} 0.008 \\ 0.008 \end{bmatrix} | \begin{bmatrix} -0.123 & 0.054 \\ -0.311 & 0.017 \end{bmatrix} | \begin{bmatrix} 0.018 & 0.006 \\ 0.124 & 0.000 \end{bmatrix} | 0.292 | 0.774 | 1.009 |
M3 | Piecewise | \begin{bmatrix} 0.028 \\ -0.028 \end{bmatrix} | \begin{bmatrix} 0.000 \\ 0.000 \end{bmatrix} | \begin{bmatrix} 0.167 & -0.091 \\ -0.079 & 0.046 \end{bmatrix} | \begin{bmatrix} 0.028 & 0.008 \\ 0.006 & 0.002 \end{bmatrix} | 0.122 | 0.847 | 1.271 |
Kernel | \begin{bmatrix} -0.006 \\ 0.006 \end{bmatrix} | \begin{bmatrix} 0.000 \\ 0.000 \end{bmatrix} | \begin{bmatrix} 0.033 & -0.020 \\ 0.069 & -0.051 \end{bmatrix} | \begin{bmatrix} 0.001 & 0.000 \\ 0.004 & 0.002 \end{bmatrix} | 0.054 | 0.849 | 1.097 |
M4 | Piecewise | \begin{bmatrix} 0.043 \\ -0.055 \\ 0.012 \end{bmatrix} | \begin{bmatrix} 0.001 \\ 0.003 \\ 0.000 \end{bmatrix} | \begin{bmatrix} -0.003 \\ 0.791 \\ 0.251 \end{bmatrix} | \begin{bmatrix} 0.002 \\ 0.627 \\ 0.063 \end{bmatrix} | 0.766 | 0.646 | 0.737 |
Kernel | \begin{bmatrix} 0.018 \\ -0.042 \\ 0.023 \end{bmatrix} | \begin{bmatrix} 0.000 \\ 0.001 \\ 0.000 \end{bmatrix} | \begin{bmatrix} 0.032 \\ 0.071 \\ -0.014 \end{bmatrix} | \begin{bmatrix} 0.002 \\ 0.009 \\ 0.000 \end{bmatrix} | 0.112 | 0.799 | 0.565 |
1: piecewise constant estimates; 2: kernel estimates; 3: bias of \mathbf{p}; 4: mean square error (MSE) of \mathbf{p}; 5: bias of \boldsymbol{\beta}; 6: mean square error (MSE) of \boldsymbol{\beta}. |
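The bias and MSE entries follow the usual Monte Carlo definitions (bias = average estimate minus truth; MSE = average squared error over replications). A minimal sketch, with illustrative names, of how they could be computed from a set of replicated estimates:

```python
def mc_bias_mse(estimates, truth):
    """Componentwise Monte Carlo bias and MSE over replications.

    estimates: list of per-replication parameter vectors (equal length),
    truth: the true parameter vector. Returns (bias, mse) lists with
    bias[k] = mean(est[k]) - truth[k] and
    mse[k]  = mean((est[k] - truth[k])**2).
    """
    r = len(estimates)
    bias = [sum(e[k] for e in estimates) / r - truth[k]
            for k in range(len(truth))]
    mse = [sum((e[k] - truth[k]) ** 2 for e in estimates) / r
           for k in range(len(truth))]
    return bias, mse
```

Note that MSE combines squared bias and variance, which is why the kernel estimator's smaller bias columns go hand in hand with its smaller MSE columns in the table above.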
{V_{PC}} | {V_{NPC}} | {V_{PE}} | {V_{NPE}} | AIC | BIC | {V_{aRaS}} | {V_{sRaS}} | {V_{aRmS}} | {V_{sRmS}} | |
M1 | 0.962 | 0.880 | 0.976 | 0.894 | 0.964 | 0.950 | 0.896 | 0.896 | 0.984 | 0.992 |
M2 | 0.954 | 0.564 | 0.963 | 0.485 | 0.524 | 0.631 | 0.863 | 0.851 | 0.981 | 0.990 |
M3 | 1.000 | 0.798 | 1.000 | 0.868 | 0.998 | 0.998 | 0.994 | 0.998 | 1.000 | 1.000 |
M4 | 0.486 | 0.780 | 0.413 | 0.810 | 0.646 | 0.660 | 0.923 | 0.916 | 0.813 | 0.703 |
{V_{PC}} | {V_{NPC}} | {V_{PE}} | {V_{NPE}} | AIC | BIC | {V_{aRaS}} | {V_{sRaS}} | {V_{aRmS}} | {V_{sRmS}} | |
g = 2 | 0.7813 | 0.5626 | 0.3369 | 0.5720 | 4.1518 | 4.2989 | 0.5894 | 0.4437 | 0.5894 | 0.4437 |
g = 3 | 0.6684 | 0.5027 | 0.5260 | 0.6135 | 4.5012 | 4.7262 | 0.3783 | 0.1974 | 0.5016 | 0.2943 |
g = 4 | 0.5581 | 0.4109 | 0.7564 | 0.7075 | 4.7967 | 5.0996 | 0.4746 | 57.572 | 0.6123 | 98.534 |
Note: (1) g represents the number of risk types when estimating. (2) The optimal values of g according to each index are highlighted in bold. |
1st type of risk | 2nd type of risk | 3rd type of risk | |||
\mathit{\boldsymbol{p}} | 0.2132 | 0.3930 | 0.3936 | ||
\mathit{\boldsymbol{\beta }} | RX | −0.0296*(0.1267) | 0.3546*(0.1414) | 0.7589*(0.1425) | |
AG | 0.3144*(0.1143) | 1.7445*(0.1041) | 1.8104*(0.1396) | ||
WT | −0.0817*(0.0916) | 1.7915*(0.0967) | −0.5555*(0.1290) | ||
PF | 1.4742*(0.2233) | 0.1244*(0.2527) | 1.6468*(0.3325) | ||
HX | 3.0027*(0.1176) | 1.2829*(0.1377) | −0.6092*(0.1486) | ||
HG | 0.8489*(0.1536) | 1.6074*(0.1669) | −5.2153*(0.7267) | ||
SZ | 0.8567*(0.2119) | 3.0334*(0.1998) | −3.2661*(0.4074) | ||
SG | 4.3184*(0.1010) | −0.3907*(0.1419) | −0.9933*(0.1560) | ||
Note: * denotes P-value < 0.05. |
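The stars in the table correspond to a two-sided test of each coefficient against zero. Assuming the parenthesized values are standard errors and the usual Wald z-test is intended (an assumption; the authors' exact test may differ), the p-value can be recovered as:

```python
import math

def wald_p_value(estimate, std_err):
    # two-sided normal (Wald) p-value for H0: coefficient = 0,
    # using 2*(1 - Phi(|z|)) = erfc(|z| / sqrt(2))
    z = estimate / std_err
    return math.erfc(abs(z) / math.sqrt(2.0))
```

For example, the AG coefficient under the 1st type of risk (0.3144 with standard error 0.1143) gives z ≈ 2.75 and p ≈ 0.006, comfortably below the 0.05 threshold.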