Dynamic multivariate quantile inactivity time and applications in investigation of a treatment effect

Mohamed Kayid; Mohamed Kayid

doi:10.3934/math.20241449

AIMS Mathematics

2024, Volume 9, Issue 11: 30000-30014. doi: 10.3934/math.20241449

Previous Article Next Article

Research article Special Issues

Dynamic multivariate quantile inactivity time and applications in investigation of a treatment effect

Mohamed Kayid ^,

Department of Statistics and Operations Research, College of Science, King Saud University, P.O. Box 2455, Riyadh 11451, Saudi Arabia

Received: 08 August 2024 Revised: 27 September 2024 Accepted: 15 October 2024 Published: 22 October 2024

To investigate potentially dependent lifetimes, it is necessary to extend the $\alpha$ -quantile inactivity time to bivariate and multivariate frameworks. To extend this measure to a dynamic multivariate framework, all possible trajectories at time $t$ are considered. The behavior of the extended $\alpha$ -quantile of inactivity time was investigated in relation to the corresponding multivariate hazard rate function. The $\alpha$ -quantile of the inactivity order is defined and discussed for the multivariate case. The difference between the two bivariate $\alpha$ -quantile functions of inactivity, which is an important measure for studying the effect of treatment on lifespan, was also investigated. This measure was used to analyze the effect of laser treatment on the delay of blindness. Two bootstrap approaches were implemented to construct confidence bounds for the difference measure.

Keywords:

Citation: Mohamed Kayid. Dynamic multivariate quantile inactivity time and applications in investigation of a treatment effect[J]. AIMS Mathematics, 2024, 9(11): 30000-30014. doi: 10.3934/math.20241449

Related Papers:

[1]	Wenxue Huang, Xiaofeng Li, Yuanyi Pan . Increase statistical reliability without losing predictive power by merging classes and adding variables. Big Data and Information Analytics, 2016, 1(4): 341-348. doi: 10.3934/bdia.2016014
[2]	Jian-Bing Zhang, Yi-Xin Sun, De-Chuan Zhan . Multiple-instance learning for text categorization based on semantic representation. Big Data and Information Analytics, 2017, 2(1): 69-75. doi: 10.3934/bdia.2017009
[3]	Dongyang Yang, Wei Xu . Statistical modeling on human microbiome sequencing data. Big Data and Information Analytics, 2019, 4(1): 1-12. doi: 10.3934/bdia.2019001
[4]	Wenxue Huang, Qitian Qiu . Forward Supervised Discretization for Multivariate with Categorical Responses. Big Data and Information Analytics, 2016, 1(2): 217-225. doi: 10.3934/bdia.2016005
[5]	Jinyuan Zhang, Aimin Zhou, Guixu Zhang, Hu Zhang . A clustering based mate selection for evolutionary optimization. Big Data and Information Analytics, 2017, 2(1): 77-85. doi: 10.3934/bdia.2017010
[6]	Ricky Fok, Agnieszka Lasek, Jiye Li, Aijun An . Modeling daily guest count prediction. Big Data and Information Analytics, 2016, 1(4): 299-308. doi: 10.3934/bdia.2016012
[7]	Minlong Lin, Ke Tang . Selective further learning of hybrid ensemble for class imbalanced increment learning. Big Data and Information Analytics, 2017, 2(1): 1-21. doi: 10.3934/bdia.2017005
[8]	David E. Bernholdt, Mark R. Cianciosa, David L. Green, Kody J.H. Law, Alexander Litvinenko, Jin M. Park . Comparing theory based and higher-order reduced models for fusion simulation data. Big Data and Information Analytics, 2018, 3(2): 41-53. doi: 10.3934/bdia.2018006
[9]	Cai-Tong Yue, Jing Liang, Bo-Fei Lang, Bo-Yang Qu . Two-hidden-layer extreme learning machine based wrist vein recognition system. Big Data and Information Analytics, 2017, 2(1): 59-68. doi: 10.3934/bdia.2017008
[10]	Yaru Cheng, Yuanjie Zheng . Frequency filtering prompt tuning for medical image semantic segmentation with missing modalities. Big Data and Information Analytics, 2024, 8(0): 109-128. doi: 10.3934/bdia.2024006

Abstract

1. Introduction

In high dimensional, large sample categorical data analysis, feature selection or dimension reduction is usually involved. Existing feature selection procedures either call upon the original variables or rely on linear models (or generalized linear models). Linear models are restrained by certain assumptions of multivariate distribution of data. Some categories in the original categorical explanatory variables may not be informative enough, redundant or even irrelevant with the response variable. Besides, a regular feature selection's statistical reliability may be jeopardized if it picks up variables with a large domains. So we propose a category-based probabilistic approach for feature selection.

One can refer to [10,8] for more introductions to various data types and algorithms in feature selection. The reliability was introduced by variance in [9,3] and by proportion of categories in [5]. The reliability measure used here was proposed in [6], denoted as $E(\mbox{Gini}(X|Y))$ , which measures variable( $X$ )'s conditional concentration on variable $Y$ . One straightforward reason for this measure is to believe that an explanatory variable is more statistically reliable when its domain is more concentrated with respect to the given response variable.

As in [6], we propose a category based feature selection method in this article to improve the statistical reliability and to increase the overall point-hit accuracy by merging or removing the less-informative or redundant categories in the categorical explanatory variables. Instead of doing as in [6], we first transform each original categorical explanatory variable into multiple dummy variables, then select the more informative ones by a stepwise forward feature selection approach, and then merge the unselected categories. The merging process in [6], on the other hand, is to find less informative categories within those pre-selected original explanatory variables and merge. Our proposed approach here can compare not only the categories within one explanatory variable one another, but also among the categories in other explanatory variables. Certain introductions and applications to dummy variable can be found in [1,2].

The rest of this article is organized as follows. Section 2 introduces the association measures and the reliability measures; Section 3 introduces the dummy variable approach, proves two theorems, and proposes the detailed feature selection steps; two experiments are conducted in Section 4; we briefly summarize results of this article in the last section.

2. Data based association and explanatory reliability

Assume we are given a data set with one categorical explanatory variable $X$ with domain $\mbox{Domain}(X) = \{1, 2, \ldots, n_X\}$ , and one response variable $Y$ with domain $\mbox{Domain}(Y) = \{1, 2, \ldots, n_Y\}$ . We show our schemes with two Goodman-Kruskal association measures, the GK-lambda and the GK-tau. More details of these associations and the feature selection issues can be found in [8,4]. Please note that these measures can be extended to a categorical multivariate case since the multiple values of a multivariate set can be regarded as "one-dimensional" nominal values.

2.1. Association measures: The GK-lambda and the GK-tau

The GK-lambda (denoted as $\lambda$ thereafter) is for modal (or optimal) prediction given as follows.

$\lambda = \frac{\sum_{x}\rho_{xm}-\rho_{\cdot m}}{1-\rho_{\cdot m}},$

where

$\rho_{\cdot m} = \max\limits_{y}\rho_{\cdot y} = \max\limits_{y}p(Y = y), \ \ \rho_{xm} = \max\limits_{y}\rho_{xy} = \max\limits_{y}p(X = x;Y = y).$

Please note that $p(\cdot)$ is the probability of an event. One can see that $\lambda$ is the relative decreasing rate of the error of predicting $Y$ with $X$ to that without $X$ when the modal prediction is applied; while the GK-tau (denoted as $\tau$ thereafter) is the counterpart for the proportional prediction as follows.

$\tau = \frac{\sum_{x}\sum_{y}{{\rho_{xy}}^2}/{\rho_{x\cdot}}-\sum_{y}{\rho_{\cdot y}}^2}{1-\sum_{y}{\rho_{\cdot y}}^2},$

where

$\rho_{x\cdot} = p(X = x).$

Both $\tau$ and $\lambda$ are used to measure the prediction errors: the first one aims to maximize the point-to-point accuracy and the second one maximize the point hit accuracy under the reasonable assumption that the predicted response variable and the actual response variable share the same distribution (in other words, both the training and predicted are identical in distributions). In practice, a feature selection needs to take care of both data-based association and statistical reliability of features selected. Sometimes, even the balance between model and proportional association has to be considered, cf. Huang and Pan [7].

2.2. Reliability measure: $E(\mbox{Gini}(X|Y))$

Given a categorical data set with two variables $X$ , $Y$ as above, Huang, Li and Pan [6] proposed the following statistical reliability measure of predicting $Y$ based on the information of $X$ .

$E(\mbox{Gini}(X|Y)) = 1-\sum\limits_{i = 1}^{n_X}\sum\limits_{j = 1}^{n_Y}p(X = i|Y = j)^2p(Y = j).$

Notice that

$0\leq E(\mbox{Gini}(X|Y))\leq 1-\frac{1}{|\mbox{Domain}(X)|}\leq 1-\frac{1}{|\mbox{Domain}(X, Y)|};$

and the smaller $E(\mbox{Gini}(X|Y))$ , the more reliable the explanatory information. The size of $\mbox{Domain}(X, Y)$ determines the upper bound of $E(\mbox{Gini}(X|Y))$ .

3. Extraction of informative categories

3.1. Dummy variables and two propositions

We transform $X$ into $n_X$ dummy variables, written as $X_1, X_2, \ldots, X_{n_X}$ , where $X_i$ takes value $1$ if $X = i$ ; otherwise $X_i$ takes value $0$ . The following proposition is to ensure that this transformation does not change the association of $Y$ with $X$ (when sample size is large enough).

Proposition 3.1.

$\tau(Y|X_1, X_2, \cdots, X_{n_X}) = \tau(Y|X).$

Proof. Since $E_p(Y)$ is a constant for a given data set, we have $\tau(Y|X)\propto \omega^{Y|X}$ , where

$\omega^{Y|X} = \sum\limits_{i, s}p(Y = s|X = i)^2p(X = i).$

Thus we only need to prove that

$\omega^{Y|X_1, X_2, \cdots, X_{n_X}} = \omega^{Y|X}$

Since

$\begin{align*} &\omega^{Y|X_1, X_2, \cdots, X_{n_X}}\\& = \sum\limits_{j = 1}^{n_Y}\sum\limits_{i_1 = 0}^{1} \sum\limits_{i_2 = 0}^{1}\cdots \sum\limits_{i_{n_X} = 0}^{1}\frac{p(X_1 = i_1, X_2 = i_1, \cdots, X_{n_X} = i_{n_X}, Y = j)^2}{p(X_1 = i_1, X_2 = i_2, \cdots, X_n = i_n)} \end{align*}$

and

$X_i = 1 \hbox{ when and only when } X_s = 0, \hbox{ for all } s\neq i, s = 1, 2, \cdots, n_X,$

we have

$p(X_1 = 0, X_2 = 0, \cdots, X_i = 1, \cdots, X_{n_X} = 0, Y = j) = p(X = i, Y = j),$

$j = 1, 2, \cdots, n_Y.$

$\begin{align*} \omega^{Y|X_1, X_2, \cdots, X_{n_X}} & = \sum\limits_{j = 1}^{n_Y}\sum\limits_{i = 1}^{n_{n_X}}\frac{p(X_1 = 0, X_2 = 0, \cdots, X_i = 1, \cdots, X_{n_X} = 0, Y = j)^2}{p(X_1 = 0, X_2 = 0, \cdots, X_i = 1, \cdots, X_n = 0)} \\ & = \sum\limits_{j = 1}^{n_Y}\sum\limits_{i = 1}^{n_X}\frac{p(X = i, Y = j)^2}{p(X = i)} = \omega^{Y|X}, \end{align*}$

that is,

$\omega^{Y|X_1, \cdots, X_{n_X}} = \omega^{Y|X}\Leftrightarrow \tau(Y|X_1, \cdots, X_{n_X}) = \tau(Y|X).$

It is of interest to introduce the following notion.

Definition 3.1. For a given categorical response variable $Y$ and an association measure $\beta^{Y|X}$ , a categorical explanatory variable $X$ is referred to as reduced if merging of any two classes of $X$ does not strictly decreases $\beta$ .

The next proposition tells that merging two categories in $X$ usually decreases (data-based) association.

Proposition 3.2. If two categories, $s, t$ in $X$ are merged into a new level, we have

$\tau(Y|X')\leq \tau(Y|X),$

where $X'$ is the merged new explanatory variable with only $n_X-1$ categories.

Proof. Notice that this proposition is equivalent to the following inequality.

$\omega^{Y|X'}\leq \omega^{Y|X}.$

Let

$\frac{p(X = s;Y = j)}{p(X = s)} = b_s, \frac{p(X = t;Y = j)}{p(X = t)} = b_t, \hbox{ for }Y = 1, 2, \cdots, n_Y.$

We have

$\begin{align*} &p(X = s;Y = j) = p(X = s)b_s, \\ &p(X = t;Y = j) = p(X = t)b_t, \text{ and }\\ &\omega^{Y|X'} = \sum\limits_{j = 1}^{n_Y}\sum\limits_{i\neq s, t}\frac{p(X = i;Y = j)^2}{p(X = i)}+\sum\limits_{j = 1}^{n_Y}\frac{p(X = m;Y = j)^2}{p(X = m)}, \end{align*}$

where $m$ is the new category merged from $s$ and $t$ , because

$\begin{align} \sum\limits_{j = 1}^{n_Y}\frac{p(X = m;Y = j)^2}{p(X = m)}& = \sum\limits_{j = 1}^{n_Y}\frac{(p(X = s;Y = j)+p(X = t;Y = j))^2}{p(X = s)+p(X = t)} \nonumber\\ & = \sum\limits_{j = 1}^{n_Y}\frac{(p(X = s)b_s+p(X = t)b_t)^2}{p(X = s)+p(X = t)}, \label{eq1} \end{align}$

(1)

and

$\sum\limits_{j = 1}^{n_Y}\left(\frac{p(X = s;Y = j)^2}{p(X = s)}+\frac{p(X = t;Y = j)^2}{p(X = t)}\right) = \sum\limits_{j = 1}^{n_Y}(b_s^2p(X = s)+b_t^2p(X = t)).$

(2)

Multiplying $p(X = s)+p(X = t)$ to two sides of (1) and of(2), we have

$\begin{align*} &(1) = \sum\limits_{j = 1}^{n_Y}(b_s^2p(X = s)^2+b_t^2p(X = t)^2+2b_sb_tp(X = s)p(X = t)), \\ &(2) = \sum\limits_{j = 1}^{n_Y}(b_s^2p(X = s)^2+b_t^2p(X = t)^2+(b_s^2+b_t^2)p(X = s)p(X = t)). \end{align*}$

Since $2b_sb_t\leq (b_s^2+b_t^2)$ , we have

$\omega^{Y|X'}\leq \omega^{Y|X}\Leftrightarrow \tau(Y|X')\leq \tau(Y|X);$

and the equality holds if and only if $b_s = b_t$ .

In actual high dimensional data analysis projects, there are usually some categories in some explanatory variables that can be merged such that the association degree decrease is ignorable while the merge raises significantly selected features' statistical reliability. This is especially the case when the data set is high dimensional and many explanatory variables have many categories. Two experiments are conducted in the next section to support this supposition by showing that with similar reliability, merging categories can significantly increase the statistical reliability while not reducing association degree significantly.

3.2. Feature selection process

A feature selection procedure usually follows a stepwise forward variable selection scheme, in which explanatory variables are selected one by one until certain pre-assigned threshold is hit. A reasonable threshold to stop the process is set by an acceptable association degree and statistical reliability. Specifically, for a given set of explanatory variables $X = \{X_1, X_2, \cdots, X_n\}$ , and a response variable $Y$ ,

1. identify a subset of explanatory variables, denoted as $D_1$ , that have the highest association degree among a set of unselected explanatory variables, denoted as $X \backslash D_0$ , by

$D_1 = \{X_{h}\in X|\tau(Y|\{X_{h}\}\cup D_0) = \max \limits_{X_i \in X \backslash D_0}\tau(Y|\{X_{i}\}\cup D_0)\}$

where $D_0$ is the set of previously selected explanatory variables;

2. select the one in $D_1$ with the highest reliability:

$X_{i_1} = \{X_{k}|E(\mbox{Gini}(\{X_{k}\}\cup D_0|Y)) = \min \limits_{X_{h}\in D_1}E(\mbox{Gini}(\{X_{h}\}\cup D_0|Y))\};$

3. define the new set of selected variables as follows.

$D_2 = \{X_{i_1}\}\cup D_0$

4. repeat the previous steps until the stopping criterion is met.

Thus the idea of this general feature selection process is to select a set of variables with the highest association degree and the reliability from the previous step, or with the association degree from the previous step and the highest statistical reliability. More detailed explanations and similar procedures can be found in [8].

3.3. Our category-based forward selection process

The category based version of the previous procedure is to transform all the original (non-binary categorical) explanatory variables into dummy variables before the general steps. The unselected categories are then merged into a new category in each original variable as described below.

1. Transform each original variable $X_i$ into $n_i$ dummy variables. Then there are totally $M = \sum_{i = 1}^n n_i$ binary variables, denoted as $X_{11}, \cdots, X_{1, n_1}, \cdots$ , $X_{n, n_n}$ .

2. Follow the steps in Section (3.2) to select $m$ out of $M$ binary variables, where $m = \sum_{i = 1}^{n_t}n'_i$ , $1 \leq n_t\leq n$ and $n'_i$ is the number of selected categories in $X_i$ .

3. Merge the remaining $n_i-n'_i$ categories in $X_i$ into one new category and denote the new variable by $X'_i$ . We then have the selected new variables as $X'_1, X'_2, \ldots, X'_{n_t}$ .

3.4. Complexity analysis

Notice that despite the genuine advantage of the category-based forward selection process, it has a higher time cost than the corresponding original variable based approach. It has to go through more loops to reach the same target given more features to be scanned. Generally, a complexity analysis needs to be related to a specifically designed and implemented algorithm. However, it is not this article's objective to discuss this subject in detail thus a brief discussion is carried out as follows.

Assume that the time cost for one variable set's association is a constant $C$ , and that there are $N$ independent variables in the data set with $m$ categories in each variable on average. Further assuming that the original variable based process stops at $M_1$ variables and the category-based one stops at $M_2$ binary variables, we have the time cost of an original variable based feature selection method as $O(\frac{M_1(2 N - M_1+1)}{2})$ and that of a category-based one as $O(\frac{M_2(2 N m - M_2+1)}{2})$ .

4. Experiments

The experiment's purpose is to evaluate the association and the reliability differences between the category based and the original variable based feature selection processes. The first experiment uses the mushroom data set from UCI Machine Learning Repository[13]. It has $8,124$ observations with $22$ categorical variables. The second one uses the data set from a 1996 survey of family expenditure by Canada [12], shortened as FAMEX96. It has $10,417$ instances with more than $250$ continuous and categorical variables.

4.1. Experiment on mushroom data

The mushroom's type is chosen as the response variable while the other 21 variables are the explanatory ones. We are going to compare the feature selection result by the original variables and that by the transformed dummy variables. Please note that $X_2$ in Table 1 is the set of selected original variables and $X'_2$ in Table 2 is the selected new variables with merged categories after the feature selection to the dummy transformation. We use $EG$ as a short for $E(\mbox{Gini}(X|Y))$ .

Table 1. Feature selection by the original variables.

Original Features	$\|\mbox{Domain}(X_2, Y)\|$	$\tau(Y\|X_2)$	$\lambda(Y\|X_2)$	$EG$
1	18	0.9429	0.9693	0.4797
2	46	0.9782	0.9877	0.7718
3	108	0.9907	0.9939	0.9076
4	192	1	1	0.9490

| Show Table

DownLoad: CSV

Table 2. Feature selection by the dummy variables.

Merged Features	$\|\mbox{Domain}(X'_2, Y)\|$	$\tau(Y\|X'_2)$	$\lambda(Y\|X'_2)$	$EG$
4	16	0.9445	0.9693	0.2098
4	24	0.9908	0.9939	0.2143
5	30	0.9962	0.9979	0.4669
6	38	1	1	0.6638

| Show Table

DownLoad: CSV

As described in Table 1, there can only be four variables by the feature selection through the original variables with the final association which reaches the maximum (data-based) association degree as $1$ and the reliability ( $EG(X|Y)$ ) is as poor as $0.9076$ .

The category-based feature selection always gives rise to remarkably better reliability ( $EG(X|Y)$ ) and higher associations ( $\lambda$ or $\tau$ ).

It is also see from these two tables that we have in both experiments a higher association by the categories than that by the original variables given the same reliability threshold: for an almost equal reliability, say, $EG\approx 0.46$ , the original variable based version gets a $\tau = 0.9429$ but the category version gets a $\tau = 0.9962$ . The same story holds for $\lambda$ .

4.2. Experiment on FAMEX96 data

In this experiment, variable HouseholdType is chosen as the response variable and all other $24$ categorical variables are considered as the explanatory variables. Following the similar pattern as in the previous experiment, we have the feature selection results by the original variables in Table 3 and by the dummy variables (or categories) in Table 4.

Table 3. Feature selection by the original variables.

OrigVarFeatures	$\|\mbox{Domain}(X_2, Y)\|$	$\tau(Y\|X_2)$	$\lambda(Y\|X_2)$	$EG$
1	66	0.3005	0.3444	0.8201
2	252	0.3948	0.4391	0.9046
3	1830	0.4383	0.4648	0.9833

| Show Table

DownLoad: CSV

Table 4. Feature selection by the dummy variables.

Merged Features	$\|\mbox{Domain}(X'_2, Y)\|$	$\tau(Y\|X'_2)$	$\lambda(Y\|X'_2)$	$EG$
2	24	0.3242	0.3934	0.5491
2	36	0.3573	0.4165	0.6242
2	48	0.3751	0.4234	0.6388
3	96	0.3901	0.4234	0.7035
4	186	0.4017	0.4269	0.7774
4	282	0.4121	0.4317	0.8066
5	558	0.4221	0.4548	0.8782
6	966	0.4314	0.4768	0.8968
7	1716	0.4436	0.4856	0.9135

| Show Table

DownLoad: CSV

One can see from these two tables that the category based approaches produces an association degree $\tau = 0.3242$ with a reliability measure of $0.5491$ , while the original variable approach gives association degree $\tau = 0.3005$ with a significantly lower reliability as $EG = 0.8201$ . When the association reaches $\tau = 0.3948$ in the original version, its reliability is as low as $0.9046$ . But the category version can still have about $20\%$ better reliability as $0.7035$ . Another perspective of this story is that the dummy method can give us more choices when we have to choose between the association and the reliability since the original method only gives three choices when the reliability reaches a unreliable $0.9833$ but the result from the category based approach offers us more options.

5. Conclusion

By transforming the categorical explanatory variables into their dummy forms and applying feature selection procedure to the transformed variables, we can select the informative categories and merge the less informative or redundant categories in the explanatory variables in order to increase the association and raise the reliability. $E(\mbox{Gini}(X|Y))$ measures the statistical reliability. Experiments are conducted to show that both the association and the reliability are significantly better with the category-based feature selection than with original variables. Of course, the cost of category based approach is higher than the original variable based one. So in the situations that the dimension is not too high, the time cost (or computing complexity) of this category based approach is acceptable.

References

[1]	D. C. Schmittlein, D. G. Morrison, The median residual lifetime: A characterization theorem and an application, Oper. Res., 29 (1981), 392–399. https://doi.org/10.1287/opre.29.2.392 doi: 10.1287/opre.29.2.392
[2]	R. E. Barlow, A. W. Marshall, F. Proschan, Properties of probability distributions with monotone hazard rate, Ann. Math. Statist., 34 (1963), 375–389. https://doi.org/10.1214/aoms/1177704147 doi: 10.1214/aoms/1177704147
[3]	H. W. Block, T. H. Savits, H. Singh, The reversed hazard rate function, Probab. Eng. Inf. Sci., 12 (1998), 69–90. https://doi.org/10.1017/S0269964800005064 doi: 10.1017/S0269964800005064
[4]	A. D. Crescenzo, Some results on the proportional reversed hazards model, Stat. Probab. Lett., 50 (2000), 313–321. https://doi.org/10.1016/S0167-7152(00)00127-9 doi: 10.1016/S0167-7152(00)00127-9
[5]	N. K. Chandra, D. Roy, Some results on reversed hazard rate, Probab. Eng. Inf. Sci., 15 (2001), 95–102. https://doi.org/10.1017/S0269964801151077 doi: 10.1017/S0269964801151077
[6]	M. S. Finkelstein, On the reversed hazard rate, Reliab. Eng. Syst. Saf., 78 (2002), 71–75. https://doi.org/10.1016/S0951-8320(02)00113-8 doi: 10.1016/S0951-8320(02)00113-8
[7]	C. Kundu, A. K. Nanda, T. Hu, A note on reversed hazard rate of order statistics and record values, J. Stat. Plan. Infer., 139 (2009), 1257–1265. https://doi.org/10.1016/j.jspi.2008.08.002 doi: 10.1016/j.jspi.2008.08.002
[8]	X. Li, D. Da, P. Zhao, On reversed hazard rate in general mixture models, Stat. Probab. Lett., 80 (2010), 654–661. https://doi.org/10.1016/j.spl.2009.12.023 doi: 10.1016/j.spl.2009.12.023
[9]	M. Burkschat, N. Torrado, On the reversed hazard rate of sequential order statistics, Stat. Probab. Lett., 85 (2014), 106–113. https://doi.org/10.1016/j.spl.2013.11.015 doi: 10.1016/j.spl.2013.11.015
[10]	M. Esna-Ashari, N. Balakrishnan, M. Alimohammadi, HR and RHR orderings of generalized order statistics, Metrika, 86 (2023), 131–148. https://doi.org/10.1007/s00184-022-00865-2 doi: 10.1007/s00184-022-00865-2
[11]	J. E. Contreras-Reyes, D. I. Gallardo, O. Kharazmi, Time-dependent residual Fisher information and distance for some special continuous distributions, Commun. Stat.-Simul. Comput., 53 (2022), 4331–4351. https://doi.org/10.1080/03610918.2022.2146136 doi: 10.1080/03610918.2022.2146136
[12]	N. Unnikrishnan, B. Vineshkumar, Reversed percentile residual life and related concepts, J. Korean Stat. Soc., 40 (2011), 85–92. https://doi.org/10.1016/j.jkss.2010.06.001 doi: 10.1016/j.jkss.2010.06.001
[13]	M. Mahdy, Further results involving percentile inactivity time order and its inference, Metron, 72 (2014), 269–282. https://doi.org/10.1007/s40300-013-0017-9 doi: 10.1007/s40300-013-0017-9
[14]	M. Shafaei Noughabi, Solving a functional equation and characterizing distributions by quantile past lifetime functions, Econ. Qual. Control, 31 (2016), 55–58. https://doi.org/10.1515/eqc-2015-0017 doi: 10.1515/eqc-2015-0017
[15]	M. Shafaei Noughabi, S. Izadkhah, . On the quantile past lifetime of the components of the parallel systems, Commun. Stat.-Theory Methods, 45 (2016), 2130–2136. https://doi.org/10.1080/03610926.2013.875573 doi: 10.1080/03610926.2013.875573
[16]	L. Balmert, J. H. Jeong, Nonparametric inference on quantile lost lifespan, Biometrics, 73 (2017), 252–259. https://doi.org/10.1111/biom.12555 doi: 10.1111/biom.12555
[17]	L. C. Balmert, R. Li, L. Peng, J. H. Jeong, Quantile regression on inactivity time, Stat. Methods Med. Res., 30 (2021), 1332–1346. https://doi.org/10.1177/0962280221995977 doi: 10.1177/0962280221995977
[18]	M. Kayid, Statistical inference of an α-quantile past lifetime function with applications, AIMS Mathematics, 9 (2024), 15346–15360. https://doi.org/10.3934/math.2024745 doi: 10.3934/math.2024745
[19]	A. P. Basu, Bivariate failure rate, J. Am. Stat. Assoc., 66 (1971), 103–104.
[20]	N. L. Johnson, S. Kotz, A vector multivariate hazard rate, J. Multivar. Anal., 5 (1975), 53–66. https://doi.org/10.1016/0047-259X(75)90055-X doi: 10.1016/0047-259X(75)90055-X
[21]	K. R. Nair, N. U. Nair, Bivariate mean residual life, IEEE Trans. Reliab., 38 (1989), 362–364. https://doi.org/10.1109/24.44183 doi: 10.1109/24.44183
[22]	M. Shaked, J. G. Shanthikumar, Dynamic multivariate mean residual life functions, J. Appl. Probab., 28 (1991), 613–629. https://doi.org/10.2307/3214496 doi: 10.2307/3214496
[23]	M. Shafaei Noughabi, M. Kayid, A. M. Abouammoh, Dynamic multivariate quantile residual life in reliability theory, Math. Probl. Eng., 2018 (2018), 1245656. https://doi.org/10.1155/2018/1245656 doi: 10.1155/2018/1245656
[24]	M. Shafaei Noughabi, M. Kayid, Bivariate quantile residual life: a characterization theorem and statistical properties, Stat. Pap., 60 (2019), 2001–2012. https://doi.org/10.1007/s00362-017-0905-9 doi: 10.1007/s00362-017-0905-9
[25]	M. Kayid, Multivariate mean inactivity time functions with reliability applications, Int. J. Reliab. Appl., 7 (2006), 127–140.
[26]	F. Buono, E. De Santis, M. Longobardi, S. Fabio, Multivariate reversed hazard rates and inactivity times of systems, Methodol. Comput. Appl. Probab., 24 (2022), 1987–2008. https://doi.org/10.1007/s11009-021-09905-2 doi: 10.1007/s11009-021-09905-2
[27]	R. B. Nelsen, Archimedean Copulas, In: An introduction to copulas, New York: Springer, 2006,109–155.
[28]	M. Shaked, J. G. Shanthikumar, Stochastic orders, New York: Springer, 2007. https://doi.org/10.1007/978-0-387-34675-5
[29]	M. Shaked, J. G. Shanthikumar, Multivariate stochastic orderings and positive dependence in reliability theory, Math. Oper. Res., 15 (1990), 545–552. https://doi.org/10.1287/moor.15.3.545 doi: 10.1287/moor.15.3.545
[30]	M. Shaked, J. G. Shanthikumar, Multivariate conditional hazard rate and mean residual life functions and their applications, In: Reliability and Decision Making, New York: Chapman, 1993,137–155.

Reader Comments

Your name:*

Email:*
© 2024 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)

通讯作者: 陈斌, bchen63@163.com

1.
沈阳化工大学材料科学与工程学院沈阳 110142

AIMS Mathematics

1.8 3.1

Metrics

Article views(664) PDF downloads(66) Cited by(0)

Preview PDF

Download XML

Export Citation

Article outline

Show full outline

Figures and Tables

Figures(2) / Tables(3)

AIMS Mathematics

Dynamic multivariate quantile inactivity time and applications in investigation of a treatment effect

Related Papers:

Abstract

1. Introduction

2. Data based association and explanatory reliability

2.1. Association measures: The GK-lambda and the GK-tau

2.2. Reliability measure: $E(\mbox{Gini}(X|Y))$

3. Extraction of informative categories

3.1. Dummy variables and two propositions

3.2. Feature selection process

3.3. Our category-based forward selection process

3.4. Complexity analysis

4. Experiments

4.1. Experiment on mushroom data

4.2. Experiment on FAMEX96 data

5. Conclusion

References

Reader Comments

通讯作者: 陈斌, bchen63@163.com

Metrics

Figures and Tables

Other Articles By Authors

Catalog

AIMS Mathematics

Dynamic multivariate quantile inactivity time and applications in investigation of a treatment effect

Related Papers:

Abstract

1. Introduction

2. Data based association and explanatory reliability

2.1. Association measures: The GK-lambda and the GK-tau

2.2. Reliability measure: E(Gini(X|Y))E(\mbox{Gini}(X|Y))

3. Extraction of informative categories

3.1. Dummy variables and two propositions

3.2. Feature selection process

3.3. Our category-based forward selection process

3.4. Complexity analysis

4. Experiments

4.1. Experiment on mushroom data

4.2. Experiment on FAMEX96 data

5. Conclusion

References

Reader Comments

通讯作者: 陈斌, bchen63@163.com

Metrics

Figures and Tables

Other Articles By Authors

Related pages

Tools

Export File

Citation

Format

Content

Catalog

2.2. Reliability measure: $E(\mbox{Gini}(X|Y))$