A nonparametric copula distribution framework for bivariate joint distribution analysis of flood characteristics for the Kelantan River basin in Malaysia

Shahid Latif; Firuza Mustafa; Shahid Latif; Firuza Mustafa

doi:10.3934/geosci.2020012

AIMS Geosciences

2020, Volume 6, Issue 2: 171-198. doi: 10.3934/geosci.2020012

Previous Article Next Article

Research article

A nonparametric copula distribution framework for bivariate joint distribution analysis of flood characteristics for the Kelantan River basin in Malaysia

Shahid Latif ^,,
Firuza Mustafa

Department of Geography, University of Malaya, Kuala Lumpur 50603, Malaysia

Received: 23 March 2020 Accepted: 11 May 2020 Published: 19 May 2020

The joint distribution analysis of multidimensional flood characteristics i.e., flood peak flow, volume and duration, often facilitates a comprehensive understanding in the hydrologic risk assessments. Copula-based methodology are frequently incorporated via parametric approach to model dependence structure of parametric based univariate marginal distributions. But, if the targeted copulas and univariate marginal distributions belongs to some specific parametric families, it might be problematic, if the underlying assumption are violated. Also, no universal rules and literatures are imposed to model any hydrologic vectors and their joint dependence structure through any fixed or pre-defined distributions. In this literature, a nonparametric copula simulation are incorporated and applied as a case study for 50 years annual maximum flood samples of the Kelantan River basin at the Gulliemard bridge station in Malaysia. In this study, a combination of both parametric and nonparametric marginal distribution separately conjoined by a nonparametric copulas framework, which is based on the Beta kernel function. The Beta kernel copula function are incorporated to estimate bivariate copula density which further used to derived joint cumulative density of flood peak-volume, volume-duration and peak-duration pairs and their associated joint as well as conditional return periods.

Keywords:

Citation: Shahid Latif, Firuza Mustafa. A nonparametric copula distribution framework for bivariate joint distribution analysis of flood characteristics for the Kelantan River basin in Malaysia[J]. AIMS Geosciences, 2020, 6(2): 171-198. doi: 10.3934/geosci.2020012

Related Papers:

[1]	Shahid Latif, Firuza Mustafa . Trivariate distribution modelling of flood characteristics using copula function—A case study for Kelantan River basin in Malaysia. AIMS Geosciences, 2020, 6(1): 92-130. doi: 10.3934/geosci.2020007
[2]	Margherita Bufalini, Farabollini Piero, Fuffa Emy, Materazzi Marco, Pambianchi Gilberto, Tromboni Michele . The significance of recent and short pluviometric time series for the assessment of flood hazard in the context of climate change: examples from some sample basins of the Adriatic Central Italy. AIMS Geosciences, 2019, 5(3): 568-590. doi: 10.3934/geosci.2019.3.568
[3]	Joan Rosselló-Geli, Miquel Grimalt-Gelabert . Flood spatial location in a Mediterranean coastal city: Ibiza (Balearic Islands) from 2000 to 2021. AIMS Geosciences, 2023, 9(2): 228-242. doi: 10.3934/geosci.2023013
[4]	Wenqing Liu . A study on the spatial and temporal distribution of habitation sites in the Amur River Basin and its relationship with geographical environments. AIMS Geosciences, 2024, 10(1): 172-195. doi: 10.3934/geosci.2024010
[5]	Jama Mohamed, Dahir Abdi Ali, Abdimalik Ali Warsame, Mukhtar Jibril Abdi, Eid Ibrahim Daud, Mohamed Mohamoud Abdilleh . Bayesian extreme value modelling of annual maximum monthly rainfall in Somalia from 1901 to 2022. AIMS Geosciences, 2024, 10(3): 598-622. doi: 10.3934/geosci.2024031
[6]	Elisabetta Genovese, Thomas Thaler . The benefits of flood mitigation strategies: effectiveness of integrated protection measures. AIMS Geosciences, 2020, 6(4): 459-472. doi: 10.3934/geosci.2020025
[7]	Ana Casado, Natalia C López . Comparison of synthetic unit hydrograph methods for flood assessment in a dryland, poorly gauged basin (Napostá Grande, Argentina). AIMS Geosciences, 2025, 11(1): 27-46. doi: 10.3934/geosci.2025003
[8]	Thompson Lennox, Velasco Aaron A., Kreinovich Vladik . A Multi-Objective Optimization Framework for Joint Inversion. AIMS Geosciences, 2016, 2(1): 63-87. doi: 10.3934/geosci.2016.1.63
[9]	Serin Değerli Şimşek, Ömer Faruk Çapar, Evren Turhan . Assessment of Hydrological Drought Index change over long period (1990–2020): The case of İskenderun Gönençay Stream, Türkiye. AIMS Geosciences, 2023, 9(3): 441-454. doi: 10.3934/geosci.2023024
[10]	Kimon Kardakaris, Dimitrios N Konispoliatis, Takvor H Soukissian . Theoretical evaluation of the power efficiency of a moored hybrid floating platform for wind and wave energy production in the Greek seas. AIMS Geosciences, 2023, 9(1): 153-183. doi: 10.3934/geosci.2023009

Abstract

1. Introduction

The Kelantan River basin in Malaysia is often subjecting to most severe monsoonal flooding and perceiving for increasing in term of their frequency and magnitude ^[1,2,3]. The expectation of the occurrence of catastrophic flooding has increased from once in every 50 years to 15 years from 2004 in the Kelantan region ^[1,3,4]. For example, the intense and prolonged precipitation in the year 2002 caused flooding of a total area of 1640 km² and affected the total of about 714,287 people or in the year December 2014, much heavy precipitation triggered the flood event in the several parts of the east coast of the Kelantan river basin and affected more than 200,000 people ^[3]. Hussain and Ismail (i.e., ^[5]), study revealed that the Gulliemard Bridge, Lebir and Galas gauge stations have highest in the flood frequency rather than Nenggiri station. Similarly, Nashwan et al. (i.e., ^[6]) literature pointed that the downstream area of this river basin is the highest risk of devastating flood events. The cause of frequent failure of hydrologic or flood defence infrastructure in Malaysia due to the impact of moderately severe of flood episodes might be attributed due to the lack of complete flood hydrograph or in other words, where only flood peak discharge samples often targeted in deriving the flood frequency curve during the structural development.

Flood is a multidimensional stochastic consequence usually characterized completely through its trivariate interdependence vectors i.e., flood peak flow, volume and duration of flood hydrograph ^[7,8]. Flood frequency analysis (FFA) is an approach of establishing the relationship between flood design quantiles and their frequency of occurrence or non-exceedance probabilities by fitting probability distribution functions (PDFs) ^[9]. Earlier efforts frequently incorporated the univariate FFA (i.e., ^[10]) but the necessity of estimating flood design hydrograph instead of design quantiles derived from single variable flood episodes motivated numerous literatures towards the joint probability analysis of flood characteristics using different traditional multivariate functions i.e., Krstanovic and Singh ^[11], Yue ^[12] and Escalante and Raynal ^[13]. Such distribution-based modelling approaches often surrounded with several statistical constraints and limitations such as, (1) each flood vectors must assume to have normal distributions or either transformed to have normal distributions; (2) if the number of variables are increased then mathematical formulation becomes more complex and complicated; (3) statistical parameter of univariate marginal structure is often employed to model their joint dependence structure; (4) limited space are available to justify joint dependence structure etc, ^[7,8,14]. Hence to over the above challenges, De Michele and Salvadori (i.e., ^[15]), firstly incorporated copulas for establishing the joint dependence structure between storm intensity and duration. The copulas function segregated the modelling of individual univariate vectors and their joint structure separately into two distinct stages, thus attributed higher flexibility in the selection of best-fitted marginal distributions not necessary from the same family of probability distributions and also their joint structure to capture a wider extent of mutual concurrency and also, preservation in their joint association ^[16,17]. Numerous literatures incorporated bivariate or few trivariate copulas distribution as a model risk for tackling different hydrological extremes issues such as, flood modelling ^[18,19], drought modelling ^[14] rainfall or storm modelling ^[20] or either modelling of hydro-climatic extremes ^[21] etc.

The multivariate FFA either with or without copulas has been applied frequently with parametric distributions where the parametric functions are often employed to modelled univariate marginal distributions and the parametric copulas function for establishing their joint dependence structure. But, the parametric functions always imposed an assumption that the random samples are coming from the known populations whose PDF are pre-defined i.e., the marginal distribution is assumed to follow some specific family of parametric functions ^[22]. In actual, no universal rules and studies are imposed to model any hydrologic vectors through any fixed or pre-defined distribution functions, which would follow different distributions and desire to model separately or in other words, the best-fitted marginal distributions not be from the same probability distribution family ^{[23,24,25,26]}. Also, according to Dooge ^[27], it is already pointed out that no amount of statistical refinement can overcome the consequences due to lack of prior distribution information of the observed random samples also, it would be quite difficult to approximate distribution tail beyond largest values under the parametric framework ^[28]. Therefore, in the past few decades, an attempt via kernel density estimators or KDE recognized as a much flexible and stable non-parametric data smoothing procedure to inference about the populations based on the finite observational samples and thus motivated in the field of hydrologic or flood frequency analysis and which often yielding a bonafide density function ^{[22,23,29,30]}. Nonparametric framework didn't require any prior distribution assumptions and will be directly retrieved from distributed series with a higher extent of flexibility in their univariate function as compared with parametric density estimators ^[23,31]. From the above review, it is also conclude that few work already performed the univariate nonparametric FFA i.e., Adamowski (i.e., ^[23]), Lall (i.e., ^[32]), but only limited literature focused over the establishment of copula-based methodologies under nonparametric or semiparametric framework i.e., Karmakar and Siminovic ^[22,33] and Reddy and Ganguli ^[9], performed bivariate flood modelling using a mixed marginal distribution from the parametric and nonparametric families of probability distributions. In both cases, the parametric copulas are employed to modelled the joint distribution of flood characteristics. On other side, the Chen and Huang (i.e., ^[34]) study demonstrated a bivariate kernel copula framework for handling the problem of boundary bias. Actually, if the targeted copulas and univariate marginal distributions belongs to some specific parametric families, it might be problematic, if the underlying assumption are violated. Therefore, these nonparametric distribution framework can ameliorate these modelling issues and can be able to produce a significant outcomes without assuming a particular form for the univariate marginal or multivariate copula distributions.

A study performed by Shahid and Firuza (i.e., ^[35]) over this river basin already demonstrated the modelling of trivariate joint distribution of flood characteristics by introducing the 3-D copula functions under the parametric settings and also pointed the importance of trivariate joint and conditional return periods. This literature introduced the concept and importance of nonparametric copula-based methodology in the establishment of multivariate FFA. In this study we demonstrated the efficacy of nonparametric copula distribution where both parametric and nonparametric marginal distribution functions are separately conjoined by a nonparametric copulas framework and applied as case study for the daily basis streamflow discharge records from period 1961–2016 for the Kelantan River Basin at the Gulliemard Bridge gauge station in Malaysia. This study pointed two different modelling approach for estimating the bivariate joint dependency. The first simulation approach comprises the modelling of nonparametric marginal distribution with a nonparametric copulas and the second approach consisting the modelling of parametric marginal distribution with a nonparametric copula.

2. Study area

The Gulliemard bridge station is located at the downstream of Kelantan river near the Kuala Kari region. The geographical location of this river basin is Lat 4°30′ N to 6°15′ N and Long 101°E to 102°45′ E. It is the longest river of Kelantan state, which originating from the Tahan mountain range to the South China Sea in the north-eastern part of Peninsular Malaysia. The river is about 248 km long with a drain area of 13,100 km² and which occupying more than 85% of the state of Kelantan. The estimated runoff is about 500 m³sec⁻¹ and the variations of annual precipitations for this region in between 0 mm (dry period)–1750 mm (wet or north-eastern monsoonal period) (i.e., ^[1]). The major land use of this area is agriculture (i.e., paddy, rubber and oil palm) for midstream and downstream and forest for upstream (i.e. near to Gua Musang). Few studies over this region such as Chan ^[4], Adnan and Atkinson ^[3], pointed that such extreme hydrologic consequences are mainly due to rapid human intervention from natural to land use activities in the form of deforestations or land clearance either for promoting the agricultural activities through palm oil and rubber plantations or either due to logging activities.

3. Proposed methodology and data analysis

3.1. Delineation of trivariate flood characteristics

Annual (Maximum) series or AM also called block (annual) maxima and Peak over Threshold (or POT) are the two holistic technique widely accepted in the extreme probability simulations ^[36]. In this study we adopted the AM approach to extract the trivariate flood characteristics. Figure 1, illustrates a typical flood hydrograph showing the delineation of trivariate flood characteristics. The flood peak flow, P, values are estimated using the maximum streamflow discharge records at an annual scale using Eq 1, which indicated that for each year there is only one flood episodes at the targeted site (refer to Figure 1) ^[9,37]. Referred to same Figure 1, the flood duration (D) samples are estimated by recognizing the time of rise and fall of the flood hydrograph (i.e., points at Q_is and Q_ie in the Figure 1) and volume (V) samples are obtained using Eqs 2 and 3 (i.e., ^[9,37]).

Figure 1. A typical hydrograph showing flood characteristic (Source: Adapted from Latif and Mustafa ^[35]).

Descriptive statistics	P (m³/sec)	V (m³)	D (days)
Sample Size	50	50	50
Range	19670	71558	57
Mean	6078	19122	19.04
Variance	2.15E+07	2.14E+08	117.75
Std. Deviation	4639	14623	10.851
Coef. of Variation	0.76324	0.76473	0.56993
Skewness (Pearson)	1.506	1.590	2.210
Kurtosis (Pearson)	1.883	2.864	6.252
Min	916.3	3182.3	7
25% (Q1)	2671.8	8668.5	12
50% (Median)	4961	15959	16
75% (Q3)	7711.7	24476	25
Max	20586	74740	64

Parametric functions	Probability density function (or PDF)	Remarks
Gamma (2P) & (3P)	${\rm{f}}\left({\rm{x}} \right) = {\rm{}}\frac{{{{\left({{\rm{x}} - {\rm{ \mathsf{ γ} }}} \right)}^{{\rm{ \mathsf{ α} }} - 1}}}}{{{{\rm{ \mathsf{ β} }}^{\rm{ \mathsf{ α} }}}{\rm{\Gamma }}\left({\rm{ \mathsf{ α} }} \right)}}{{\rm{e}}^{\frac{{ - \left({{\rm{x}} - {\rm{ \mathsf{ γ} }}} \right)}}{{\rm{ \mathsf{ β} }}}}}{\rm{}}\& f\left({\rm{x}} \right) = {\rm{}}\frac{{{{\rm{x}}^{{\rm{ \mathsf{ α} }} - 1}}}}{{{{\rm{ \mathsf{ β} }}^{\rm{ \mathsf{ α} }}}{\rm{\Gamma }}\left({\rm{ \mathsf{ α} }} \right)}}{{\rm{e}}^{\frac{{ - {\rm{x}}}}{{\rm{ \mathsf{ β} }}}}}{\rm{}}$	${\rm{ \mathsf{ α} }} > 0, \beta > 0, \gamma > 0$ - shape, scale and locations parameter; ${\rm{ \mathsf{ γ} }} \equiv 0{\rm{}}$ yield 2-parameter gamma structure
GEV(3P)	${\rm{f}}\left({\rm{x}} \right) = \frac{1}{{\rm{ \mathsf{ σ} }}}{{\rm{e}}^{ - {{\left({1 + {\rm{kz}}} \right)}^{ - 1/{\rm{k}}}}{{\left({1 + {\rm{kz}}} \right)}^{ - 1 - 1/{\rm{k\;\;\;}}}}}}{\rm{for\;k}}\; \ne 0{\rm{}}$ $\frac{1}{{\rm{ \mathsf{ σ} }}}{{\rm{e}}^{\left({ - 1 - {{\rm{e}}^{\left({ - {\rm{z}}} \right)}}} \right)}}{\rm{\;for\;k}} = 0$	${\rm{k}}\left({{\rm{shape}}} \right), {\rm{ \mathsf{ σ} }}\left({{\rm{scale}}} \right), {\rm{ \mathsf{ μ} }}\left({{\rm{location}}} \right)$ , such that, ${\rm{ \mathsf{ σ} }} > 0$ & ${\rm{z}} \equiv \left({\left({{\rm{x}} - {\rm{ \mathsf{ μ} }}} \right)} \right)/{\rm{ \mathsf{ σ} }}$ Domain: $1 + {\rm{k}}\left({{\rm{x}} - {\rm{ \mathsf{ μ} }}} \right)/{\rm{ \mathsf{ σ} \; for\; k}} \ne 0{\rm{}}\& - \infty < x < + \infty \; for\; k = 0$
Inv. Gaussian (2P)	${\rm{f}}\left({\rm{x}} \right) = \sqrt {\frac{{\rm{ \mathsf{ λ} }}}{{2{\rm{ \mathsf{ π} }}{{\rm{x}}^3}}}} {{\rm{e}}^{ - \frac{{{\rm{ \mathsf{ λ} }}{{\left({{\rm{x}} - {\rm{ \mathsf{ μ} }}} \right)}^2}}}{{2{{\rm{ \mathsf{ μ} }}^2}\left({\rm{x}} \right)}}}}{\rm{}}$	${\rm{ \mathsf{ λ} }} > 0, \mu > 0(continuous\; parameter, {\rm{ \mathsf{ γ} }}\left({{\rm{location}}\; {\rm{parameter}}} \right){\rm{}}$ for ${\rm{ \mathsf{ γ} }} < x < + \infty$
Johnson SB(4P)	${\rm{f}}\left({\rm{x}} \right) = {\rm{}}\frac{{\rm{ \mathsf{ δ} }}}{{{\rm{ \mathsf{ λ} }}\sqrt {2{\rm{ \mathsf{ π} }}} {\rm{z}}\left({1 - {\rm{z}}} \right)}}{{\rm{e}}^{ - 0.5{{\left({{\rm{ \mathsf{ γ} }} + {\rm{ \mathsf{ δ} }}\ln \frac{{\rm{z}}}{{1 - {\rm{z}}}}} \right)}^2}}}$	Domain: ${\rm{ \mathsf{ ξ} }} \le {\rm{x}} \le {\rm{ \mathsf{ ξ} }} + {\rm{ \mathsf{ λ} }}$ ${\rm{ \mathsf{ γ} }}, {\rm{ \mathsf{ δ} }} > 0\left({{\rm{shape}}} \right); {\rm{ \mathsf{ λ} }} > 0\left({{\rm{scale}}} \right); {\rm{ \mathsf{ ξ}\; location}}\; {\rm{parameter}})$
Log-Gamma (2P)	${\rm{f}}\left({\rm{x}} \right) = \frac{{{{\left({\ln {\rm{x}}} \right)}^{{\rm{ \mathsf{ α} }} - 1}}}}{{{\rm{x}}{{\rm{ \mathsf{ β} }}^{\rm{ \mathsf{ α} }}}{\rm{\Gamma }}\left({\rm{ \mathsf{ α} }} \right)}}{{\rm{e}}^{ - \left({\frac{{\ln {\rm{x}}}}{{\rm{ \mathsf{ β} }}}} \right)}}$	Domain: $0 < x < + \infty$ ${\rm{ \mathsf{ α} }} > 0, \beta > 0\left({shape\; parameter} \right)$
Log-Logistic (2P)	${\rm{f}}\left({\rm{x}} \right) = {\rm{}}\frac{{\rm{ \mathsf{ α} }}}{{\rm{ \mathsf{ β} }}}{\left({\frac{{\rm{x}}}{{\rm{ \mathsf{ β} }}}} \right)^{{\rm{ \mathsf{ α} }} - 1}}{\left({1 + {{\left({\frac{{\rm{x}}}{{\rm{ \mathsf{ β} }}}} \right)}^{\rm{ \mathsf{ α} }}}} \right)^{ - 2}}$	Domain: ${\rm{ \mathsf{ γ} }} < x < + \infty$ ${\rm{ \mathsf{ α} }} > 0\left({{\rm{shape}}} \right); {\rm{ \mathsf{ β} }} > 0\left({{\rm{scale}}} \right)$
Lognormal (2P)	${\rm{f}}\left({\rm{x}} \right) = {\rm{}}\frac{{{{\rm{e}}^{ - 0.5{{\left({\frac{{\ln \left({\rm{x}} \right) - {\rm{ \mathsf{ μ} }}}}{{\rm{ \mathsf{ σ} }}}} \right)}^2}}}}}{{\left({\rm{x}} \right){\rm{ \mathsf{ σ} }}\sqrt {2{\rm{ \mathsf{ π} }}} }}{\rm{}}$	${\rm{ \mathsf{ σ} }} > 0\left({{\rm{shape}}\; {\rm{parameter}}} \right);$ ${\rm{ \mathsf{ μ} }}\left({{\rm{scale}}\; {\rm{parameter}}} \right)$
Weibull (2P)	${\rm{f}}\left({\rm{x}} \right) = {\rm{}}\frac{{\rm{ \mathsf{ α} }}}{{\rm{ \mathsf{ β} }}}{\left({\frac{{\rm{x}}}{{\rm{ \mathsf{ β} }}}} \right)^{{\rm{ \mathsf{ α} }} - 1}}{{\rm{e}}^{ - {{\left({\frac{{\rm{x}}}{{\rm{ \mathsf{ β} }}}} \right)}^{\rm{ \mathsf{ α} }}}}}$	Domain: ${\rm{ \mathsf{ α} }} > 0\left({{\rm{shape}}} \right), {\rm{ \mathsf{ β} }} > 0\left({{\rm{scale}}} \right){\rm{}}$

Kernel function	K(x)
Epanechnikov	$= 0.75\left({1 - {{\rm{x}}^2}} \right), {\rm{}}\left\| {\rm{x}} \right\| \le 1$ =0 otherwise
Triangular	$= 1 - \left\| {\rm{x}} \right\|, {\rm{}}\left\| {\rm{x}} \right\| \le 1$ =0 otherwise
Bi-weight or Quartic	$= 0.9375{\left({1 - {{\rm{x}}^2}} \right)^2}, {\rm{}}\left\| {\rm{x}} \right\| \le 1$ =0 otherwise
Tri-weight	$= 1.09375{\left({1 - {{\rm{x}}^2}} \right)^3}, {\rm{}}\left\| {\rm{x}} \right\| \le 1$ =0 otherwise
Cosine	$= \frac{{\rm{ \mathsf{ π} }}}{4}{\rm{cos}}\left({{\rm{ \mathsf{ π} x}}/2} \right), {\rm{}}\left\| {\rm{x}} \right\| \le 1$ =0 otherwise

Functions	Peak			Volume			Duration
Functions	AIC	BIC	HQC	AIC	BIC	HQC	AIC	BIC	HQC
GEV(3P)	–374.335	–368.599	–372.15	–268.985	–263.249	–266.8	–336.32	–330.583	–334.135
Log-Gamma (2P)	–370.146	–366.322	–368.69	–359.914	–356.09	–358.46	–340.53	–336.709	–339.077
Log-Logistic (2P)	–360.392	–356.568	–358.94	–294.927	–291.103	–293.47	–321.32	–317.493	–319.861
Gamma (2P)	–335.861	–332.037	–334.4	–360.025	–356.201	–358.57	–260.55	–256.722	–259.089
Gamma (3P)	–216.301	–210.565	–214.12	–210.107	–204.371	–207.92	–343.62	–337.88	–341.438
Log-Normal (2P)	–379.344	–375.52	–377.89	–371.028	–367.204	–369.57	–327.46	–323.633	–326.001
Weibull (2P)	–329.681	–325.857	–328.23	–342.868	–339.044	–341.41	–292.91	–289.085	–291.453
Inv. Gaussian (2P)	–362.489	–358.665	–361.03	–344.722	–340.898	–343.27	–325.76	–321.938	–324.306
Johnson SB(4P)	–340.899	–333.251	–337.99	–381.821	–374.173	–378.91	–223.65	–216.006	–220.742
Notes: AIC stands for Akaike information criteria; BIC stands for Bayesian information criteria; HQIC or HQC stands for Hannan-Quinn information criteria.

Flood characteristics	F(X)	Error indices statistics		Information criteria statistics
Flood characteristics	F(X)	MSE (or Mean square error)	RMSE (or Root mean square error)	AIC (or Akaike information criteria)	BIC (or Bayesian information criteria)	HQC (or Hannan-Quinn Information criteria)
P	Epanechnikov	0.00038	0.01957	−391.37	−389.45	−390.64
	Bi-weight or quartic	0.00026	0.01620	−410.25	−408.34	−409.52
	Triweight	0.00022	0.01483	−419.07	−417.16	−418.34
	Triangular	0.00028	0.01686	−406.26	−404.35	−405.54
	Cosine	0.00032	0.01800	−399.98	−398.07	−399.25
V	Epanechnikov	0.00093	0.03060	−346.66	−344.75	−345.93
	Bi-weight or quartic	0.00018	0.01350	−428.44	−426.53	−427.71
	Triweight	0.00016	0.01287	−433.27	−431.36	−432.55
	Triangular	0.00020	0.01426	−423.01	−421.10	−422.29
	Cosine	0.00022	0.01514	−417.02	−415.11	−416.30
D	Epanechnikov	0.00059	0.02430	−369.69	−367.77	−368.96
	Bi-weight or quartic	0.00051	0.02265	−376.71	−374.80	−375.99
	Triweight	0.00048	0.02208	−379.27	−377.36	−378.54
	Triangular	0.00055	0.02357	−372.74	−370.83	−372.01
	Cosine	0.00062	0.02496	−367.03	−365.12	−366.30

P	V	D	T(P)	T(V)	T(D)	T_PV^OR	T_PD^OR	T_VD^OR	T_PV^AND	T_PD^AND	T_VD^AND
10436.8	17148	29	6.053	2.298	10.225	2.183	4.110	2.03	7.029	50.702	24.420
20586.4	43273.2	7	100	13.765	1.0279	12.548	1.027	1.025	338.580	104.788	14.260
11192.4	21994.2	30	8.118	3.347	12.610	3.079	5.270	2.805	10.294	78.495	46.272
9929.3	9667.4	56	5.402	1.402	33.333	1.387	4.815	1.386	5.643	134.531	46.571
7686.9	41309	19	4.031	9.445	2.456	3.533	1.763	2.094	14.111	11.360	28.132
5052.6	19073.8	64	2.131	2.769	100	1.822	2.109	2.720	3.552	190.669	281.210
18339.4	74740	16	21.428	100	1.980	18.361	1.886	1.959	453.389	46.333	224.181

P	V	D	${\rm{T}}\left({{\rm{P}}/{\rm{V}} \le {\rm{v}}} \right)$	${\rm{T}}\left({{\rm{V}}/{\rm{P}} \le {\rm{p}}} \right)$	${\rm{T}}\left({{\rm{P}}/{\rm{D}} \le {\rm{d}}} \right)$	${\rm{T}}\left({{\rm{D}}/{\rm{P}} \le {\rm{p}}} \right)$	${\rm{T}}\left({{\rm{V}}/{\rm{D}} \le {\rm{d}}} \right)$	${\rm{T}}\left({{\rm{D}}/{\rm{V}} \le {\rm{v}}} \right)$
10436.8	17148	29	24.647	2.850	6.202	10.692	2.289	9.937
20586.4	43273.2	7	131.605	14.205	59.429	1.027	10.776	1.027
11192.4	21994.2	30	26.938	4.349	8.337	13.173	3.322	12.156
9929.3	9667.4	56	36.244	1.521	5.459	36.110	1.402	33.669
7686.9	41309	19	5.047	21.478	3.705	2.356	8.429	2.406
5052.6	19073.8	64	3.403	6.669	2.133	111.611	2.768	99.146
18339.4	74740	16	22.266	122.310	19.738	1.972	89.385	1.978

[1]	Drainage and Irrigation Department Malaysia (2004) Annual flood report of DID for Peninsular Malaysia. DID: Kuala Lumpur. Available from: http://www.statistics.gov.my/eng/images/stories/files/journalDOSM/V104ArticleJamaliah.pdf.
[2]	Malaysian Meteorological Department (2007) Report on Heavy Rainfall that Caused Floods in Kelantan and Terengganu. MMD: Kuala Lumpur. Available from: https://reliefweb.int/sites/reliefweb.int/files/resources/EE19DAFDE99078B649257266001FED46-Full_Report.pdf.
[3]	Adnan NA, Atkinson PM (2011) Exploring the impact of climate and land use changes on streamflow trends in a monsoon catchment. Int J Clim 31: 815-831. doi: 10.1002/joc.2112
[4]	Chan NW (1997) Institutional arrangement of flood hazard management in Malaysia: an evaluation using criteria approach. Disasters 21: 206-222. doi: 10.1111/1467-7717.00057
[5]	Hussain STPR, Ismail H (2013) Flood frequency analysis of Kelantan River Basin, Malaysia. World Appl Sci J 28: 1989-1995.
[6]	Nashwan MS, Ismail T, Ahmed K (2018) Flood susceptibility assessment in Kelantan river basin using copula. Int J Eng Technol 7: 584-590. doi: 10.14419/ijet.v7i2.10447
[7]	Zhang L, Singh VP (2006) Bivariate flood frequency analysis using copula method. J Hydrol Eng 11: 150-164. doi: 10.1061/(ASCE)1084-0699(2006)11:2(150)
[8]	Zhang L (2005) Multivariate hydrological frequency analysis and risk mapping. Doctoral dissertation, Beijing Normal University.
[9]	Reddy MJ, Ganguli P (2012) Bivariate Flood Frequency Analysis of Upper Godavari River Flows Using Archimedean Copulas. Water Resour Manage 26: 3995-4018. doi: 10.1007/s11269-012-0124-z
[10]	Bobee B, Rasmussen PF (1994) Statistical analysis of annual flood series, In: Menon J (Ed.). Trend in Hydrology, 1. Council of Scientific Research Integration, India, 117-135.
[11]	Krstanovic PF, Singh VP (1987) A multivariate stochastic flood analysis using entropy. In: Singh VP (Ed.). Hydrologic Frequency Modelling, Reidel, Dordrecht, 515-539. doi: 10.1007/978-94-009-3953-0_37
[12]	Yue S (2000) The bivariate lognormal distribution to model a multivariate flood episode. Hydrol Process 14: 2575-2588. doi: 10.1002/1099-1085(20001015)14:14<2575::AID-HYP115>3.0.CO;2-L
[13]	Sandoval CE, Raynal-Villasenor J (2008) Trivariate generalized extreme value distribution in flood frequency analysis. Hydrol Sci J 53: 550-567. doi: 10.1623/hysj.53.3.550
[14]	Song S, Singh VP (2010) Metaelliptical copulas for drought frequency analysis of periodic hydrologic data. Stoch Environ Res Risk Assess 24: 425-444. doi: 10.1007/s00477-009-0331-1
[15]	De Michele C, Salvadori G (2003) A generalized Pareto intensity-duration model of storm rainfall exploiting 2-copulas. J Geophys Res 108: 4067. doi: 10.1029/2002JD002534
[16]	Saklar A (1959) Functions de repartition n dimensions et leurs marges. ublications de l'Institut Statistique de l'Université de Paris, 8: 229-231.
[17]	Nelsen RB (2006) An introduction to copulas. Springer, New York.
[18]	Salvadori G (2004) Bivariate return periods via-2 copulas. Stat Methodol 1:129-144. doi: 10.1016/j.stamet.2004.07.002
[19]	Salvadori G, De Michele C (2004) Frequency analysis via copulas: theoretical aspects and applications to hydrological events. Water Resour Res 40: W12511. doi: 10.1029/2004WR003133
[20]	Salvadori G, De Michele C (2006) Statistical characterization of temporal structure of storms. Adv Water Resour 29: 827-842. doi: 10.1016/j.advwatres.2005.07.013
[21]	Cong RG, Brady M (2011) The interdependence between Rainfall and Temperature: copula Analyses. Sci World J 2012: 405675.
[22]	Karmakar S, Simonovic SP (2008) Bivariate flood frequency analysis. Part 1: Determination of marginal by parametric and non-parametric techniques. J Flood Risk Manag 1: 190-200.
[23]	Adamowski K (1989) A monte Carlo comparison of parametric and nonparametric estimations of flood frequencies. J Hydrol 108: 295-308. doi: 10.1016/0022-1694(89)90290-4
[24]	Silverman BW (1986) Density Estimation for Statistics and Data Analysis, 1st edition. Chapman and Hall, London.
[25]	Kim KD, Heo JH (2002) Comparative study of flood quantiles estimation by nonparametric models. J Hydrol 260: 176-193. doi: 10.1016/S0022-1694(01)00613-8
[26]	Botev ZI, Grotowski JF, Kroese DP (2010) Kernel Density Estimation via Diffusion. Ann Stat 38: 2916-2957. doi: 10.1214/10-AOS799
[27]	Dooge JCE (1986) Looking for hydrologic laws. Water Resour Res 22: 46-58. doi: 10.1029/WR022i09Sp0046S
[28]	Bardsley WE (1988) Toward a General Procedure for Analysis of Extreme Random Events in the Earth Sciences. Math Geol 20: 513-528. doi: 10.1007/BF00890334
[29]	Lall U, Moon YI, Bosworth K (1993) kernel flood frequency estimators: Bandwidth selection and kernel choice. Water Resour Res 29: 1003-1015. doi: 10.1029/92WR02466
[30]	Santhosh D, Srinivas V (2013) Bivariate frequency analysis of flood using a diffusion kernel density estimators. Water Resour Res 49: 8328-8343. doi: 10.1002/2011WR010777
[31]	Moon YI, Lall U (1994) Kernel function estimator for flood frequency analysis. Water Resour Res 30: 3095-3103. doi: 10.1029/94WR01217
[32]	Lall U (1995) Nonparametric function estimation: recent hydrologic contributions, U.S. National Republic. International Union of Geodesy and Geophysics, 1991-1994. Rev Geophys 33: 1093-1099.
[33]	Karmakar S, Simonovic SP (2009) Bivariate flood frequency analysis. Part 2: A copula-based approach with mixed marginal distributions. J Flood Risk Manag 2: 32-44.
[34]	Chen SX, Huang TM (2007) Nonparametric estimation of copula functions for dependence modelling. Can J Stat 35: 265-282. doi: 10.1002/cjs.5550350205
[35]	Latif S, Mustafa F (2020) Trivariate distribution modelling of flood characteristics using copula function-A case study for Kelantan River basin in Malaysia. AIMS Geosci 6: 92-130. doi: 10.3934/geosci.2020007
[36]	Hosking JRM, Walis JR (1987) Parameter and quantile estimations for the generalized Pareto distributions. Technometrics 29: 339-349. doi: 10.1080/00401706.1987.10488243
[37]	Yue S, Rasmussen P (2002) Bivariate frequency analysis: discussion of some useful concepts in hydrological applications. Hydrol Process 16: 2881-2898. doi: 10.1002/hyp.1185
[38]	Rao AR, Hamed KH (2000) Flood frequency analysis. CRC Press, Boca Raton, Fla.
[39]	Rosenblatt M (1956) Remarks on some nonparametric estimates of a density function. Ann Math Stat 27: 832-837. doi: 10.1214/aoms/1177728190
[40]	Scott DW (1992) Multivariate Density estimation: Theory, Practice and Visualization. Wiley, New York.
[41]	Härdle W (1991) Smoothing Technique with Implementation in S. Springer, New York.
[42]	Kim KD, Heo JH (2002) Comparative study of flood quantiles estimation by nonparametric models. J Hydrol 260: 176-193. doi: 10.1016/S0022-1694(01)00613-8
[43]	Shabri A (2002) Nonparametric Kernel Estimation of Annual Maximum Stream Flow Quantiles, Matematika, 18: 99-107.
[44]	Miladinovic B (2008) Kernel density estimation of reliability with applications to extreme value distribution. Graduate Theses and Dissertations. Available from: https://scholarcommons.usf.edu/etd/408.
[45]	Azzalini A (1981) A note on the estimation of a distribution function and quantiles by a kernel method. Biometrika 68: 326-328. doi: 10.1093/biomet/68.1.326
[46]	Shiau JT (2006) Fitting drought duration and severity with two dimensional copulas. Water Resour Manag 20: 795-815. doi: 10.1007/s11269-005-9008-9
[47]	Harrell FE, Davis CE (1982) A new distribution-free quantile estimator. Biometrika 69: 635-640. doi: 10.1093/biomet/69.3.635
[48]	Brown BM, Chen SX (1999) Beta-bernstein smoothing for regression curves with compact support. Scand J Stat 26: 47-59. doi: 10.1111/1467-9469.00136
[49]	Chen SX (2000) Beta kernel estimators for density functions. Comput Stat Data Anal 31: 131-145. doi: 10.1016/S0167-9473(99)00010-9
[50]	Bounezmarni T, Rombouts JVK (2009) Nonparametric density estimation for positive time series. Comput Stat Data Anal 54: 245-261. doi: 10.1016/j.csda.2009.08.016
[51]	Charpentier A, Fermanian JD, Scaillet O (2006) The estimation of copulas: Theory and practice. In Rank J, editor. Copulas: From theory to application in finance. London: Risk Books, 35-64.
[52]	Kim TW, Valdés JB, Yoo C (2006) Nonparametric approach for bivariate drought characterisation using Palmer drought index. J Hydrol Eng 11: 134-143. doi: 10.1061/(ASCE)1084-0699(2006)11:2(134)
[53]	Kullback S, Leibler RA (1951) On information and sufficiency. Ann Math Stat 22: 79-86. doi: 10.1214/aoms/1177729694
[54]	Akaike H (1974) A new look at the statistical model identification. IEEE Trans Autom Control 19: 716-723. doi: 10.1109/TAC.1974.1100705
[55]	Schwarz GE (1978) Estimating the dimension of a model. Ann Stat 6: 461-464. doi: 10.1214/aos/1176344136
[56]	Hannan EJ, Quinn BG (1979) The Determination of the Order of an Autoregression. J R Stat Soc Ser B 41: 190-195.
[57]	Shiau JT (2003) Return period of bivariate distributed extreme hydrological events. Stoch Environ Res Risk Assess 17: 42-57. doi: 10.1007/s00477-003-0125-9
[58]	Brunner MI, Seibert J, Favre AC (2016) Bivariate return periods and their importance for flood peak and volume estimations. WIREs Water 3: 819-833. doi: 10.1002/wat2.1173

1.	Shahid Latif, Slobodan P. Simonovic, Trivariate Probabilistic Assessments of the Compound Flooding Events Using the 3-D Fully Nested Archimedean (FNA) Copula in the Semiparametric Distribution Setting, 2023, 0920-4741, 10.1007/s11269-023-03448-6
2.	Yadong Ji, Yi Li, Ning Yao, Asim Biswas, Xinguo Chen, Linchao Li, Alim Pulatov, Fenggui Liu, Multivariate global agricultural drought frequency analysis using kernel density estimation, 2022, 177, 09258574, 106550, 10.1016/j.ecoleng.2022.106550
3.	Shahid Latif, Slobodan P. Simonovic, Nonparametric Approach to Copula Estimation in Compounding The Joint Impact of Storm Surge and Rainfall Events in Coastal Flood Analysis, 2022, 36, 0920-4741, 5599, 10.1007/s11269-022-03321-y
4.	Shahid Latif, Slobodan P. Simonovic, Trivariate Joint Distribution Modelling of Compound Events Using the Nonparametric D-Vine Copula Developed Based on a Bernstein and Beta Kernel Copula Density Framework, 2022, 9, 2306-5338, 221, 10.3390/hydrology9120221
5.	Shahid Latif, Taha B.M.J. Ouarda, André St-Hilaire, Zina Souaissi, Shaik Rehana, A new nonparametric copula framework for the joint analysis of river water temperature and low flow characteristics for aquatic habitat risk assessment, 2024, 634, 00221694, 131079, 10.1016/j.jhydrol.2024.131079
6.	Mark Maimone, Sebastian Malter, Mahshid Ghanbari, A practical method for developing future joint probabilities of riverine and coastal flood risk in complex tidal river systems – a case study, 2025, 2040-2244, 10.2166/wcc.2025.653

Kernel function	K(x)
Parametric Functions	Peak (P)	Volume (V)	Durations (D)
Gamma (2P)	a = 1.7166, b = 3540.6	a = 1.71, b = 11183.0	a = 3.0786, b = 6.1845
Gamma(3P)	a = 1.2106, b =4290, g = 884.47	a = 1.0848, b = 14723.0, g = 3150.8	a = 1.4696, b = 8.3319, g = 6.7958
GEV(3P)	k = 0.22596, s = 2683.6, m = 3765.6	k = 0.20446, s = 8736.0, m = 11890.0	k = 0.20682, s = 6.0766, m = 13.987
Log-Gamma(2P)	a = 129.15, b = 0.06544	a = 164.32, b = 0.05839	a = 35.165, b = 0.08037
Log-Logistic (2P)	a = 2.2801, b = 4541.7	a = 2.2731, b = 14202.0	a = 3.6928, b = 16.426
Log-Normal (2P)	s = 0.7362, m = 8.4513	s = 0.74093, m = 9.5943	s = 0.47178, m = 2.826
Weibull (2P)	a = 1.599, b = 6398.7	a = 1.5993, b = 20008.0	a = 2.5437, b = 20.375
Inverse. Gaussian (2P)	l = 10434.0, m = 6078.0	l = 32699.0, m = 19122.0	l = 58.617, m = 19.04
Johnson SB (4P)	g = 1.5161, d = 0.74495 l = 27319.0, x = 1304.2	g = 2.2027, d = 1.0357, l = 1.3052E+5, x = 961.8	g = 2.5314, d = 0.92215, l = 118.81, x = 8.2791

	Peak		Volume		Duration
Functions	MSE	RMSE	MSE	RMSE	MSE	RMSE
GEV(3P)	0.00049	0.02229	0.00409	0.06394	0.00106	0.03261
Log-Gamma(2P)	0.00056	0.02372	0.00069	0.02627	0.0010172	0.031894
Log-Logistic(2P)	0.00068	0.02615	0.00253	0.05032	0.00149	0.03865
Gamma(2P)	0.00111	0.03341	0.00068	0.02624	0.005037	0.070973
Gamma(3P)	0.01173	0.10882	0.01327	0.11520	0.000918	0.030312
Log-Normal(2P)	0.00046	0.02163	0.00055	0.02351	0.001321	0.03635
Weibull(2P)	0.00126	0.03555	0.00097	0.03115	0.002637	0.05135
Inv. Gaussian(2P)	0.00066	0.02561	0.00094	0.03059	0.00137	0.03697
Johnson SB (4P)	0.00093	0.03053	0.00041	0.02028	0.00972	0.09861
Notes. MSE stands for Mean square error; RMSE stands for Root mean square error.

P	V	D	T(P)	T(V)	T(D)	T_PV^OR	T_PD^OR	T_VD^OR	T_PV^AND	T_PD^AND	T_VD^AND
10436.8	17148	29	7.243	2.329	6.969	2.224	3.861	1.944	8.487	44.294	17.102
20586.4	43273.2	7	45.206	13.394	1.003	11.100	1.003	1.002	149.285	45.491	13.457
11192.4	21994.2	30	8.460	3.219	7.736	2.981	4.359	2.478	10.712	55.281	27.481
9929.3	9667.4	56	6.513	1.424	131.578	1.408	6.268	1.420	6.871	625.302	187.388
7686.9	41309	19	3.995	11.715	2.550	3.604	1.802	2.221	17.180	11.439	36.619
5052.6	19073.8	64	2.180	2.649	322.580	1.812	2.172	2.635	3.514	627.012	911.610
18339.4	74740	16	31.434	149.925	1.928	26.730	1.870	1.914	932.964	62.583	324.412

P	V	D	${\rm{T}}\left({{\rm{P}}/{\rm{V}} \le {\rm{v}}} \right)$	${\rm{T}}\left({{\rm{V}}/{\rm{P}} \le {\rm{p}}} \right)$	${\rm{T}}\left({{\rm{P}}/{\rm{D}} \le {\rm{d}}} \right)$	${\rm{T}}\left({{\rm{D}}/{\rm{P}} \le {\rm{p}}} \right)$	${\rm{T}}\left({{\rm{V}}/{\rm{D}} \le {\rm{d}}} \right)$	${\rm{T}}\left({{\rm{D}}/{\rm{V}} \le {\rm{v}}} \right)$
10436.8	17148	29	28.195	2.766	7.417	7.128	2.309	6.712
20586.4	43273.2	7	60.001	14.388	23.458	1.003	9.303	1.003
11192.4	21994.2	30	27.747	4.059	8.697	7.931	3.175	7.423
9929.3	9667.4	56	37.203	1.521	6.531	141.059	1.424	131.699
7686.9	41309	19	4.762	27.611	3.733	2.461	10.472	2.507
5052.6	19073.8	64	3.574	5.824	2.180	359.637	2.648	310.782
18339.4	74740	16	32.313	172.947	30.400	1.926	134.173	1.926

AIMS Geosciences

A nonparametric copula distribution framework for bivariate joint distribution analysis of flood characteristics for the Kelantan River basin in Malaysia

Related Papers:

Abstract

1. Introduction

2. Study area

3. Proposed methodology and data analysis

3.1. Delineation of trivariate flood characteristics

3.2. Univariate analysis

3.2.1. Parametric marginal distribution analysis

3.2.2. Nonparametric estimates of flood marginal distributions

3.3. Concept of copula function

3.4. Simulation of Model type-1 (i.e., nonparametric marginal distribution with nonparametric copula)

3.5. Simulation of Model type-2 (parametric marginal distribution with nonparametric copula)

4. Results and discussions

4.1. Modelling of parametric and nonparametric marginal distributions

4.2. Analysis of Model type-1 and Model type-2

4.3. Estimation of return periods

4.3.1. Univariate return periods

4.3.2. Bivariate joint return periods

4.3.3. Conditional return periods

5. Conclusions

Acknowledgements

Conflict of interest

References

This article has been cited by:

Reader Comments

通讯作者: 陈斌, bchen63@163.com

Metrics

Figures and Tables

Other Articles By Authors

Related pages

Tools

Export File

Citation

Format

Content

Catalog

Abstract

1. Introduction

2. Study area

3. Proposed methodology and data analysis

3.1. Delineation of trivariate flood characteristics

3.2. Univariate analysis

3.2.1. Parametric marginal distribution analysis

3.2.2. Nonparametric estimates of flood marginal distributions

3.3. Concept of copula function

3.4. Simulation of Model type-1 (i.e., nonparametric marginal distribution with nonparametric copula)

3.5. Simulation of Model type-2 (parametric marginal distribution with nonparametric copula)

4. Results and discussions

4.1. Modelling of parametric and nonparametric marginal distributions

4.2. Analysis of Model type-1 and Model type-2

4.3. Estimation of return periods

4.3.1. Univariate return periods

4.3.2. Bivariate joint return periods

4.3.3. Conditional return periods

5. Conclusions

Acknowledgements

Conflict of interest

References