Research article

Assessment of Hydrological Drought Index change over long period (1990–2020): The case of İskenderun Gönençay Stream, Türkiye

  • Recently, due to changes in the global climate, there have been significant increases in flood and drought events. The changes in wet and dry periods can be examined by various methods using hydrometeorological data to analyze climate disasters. In this study, Gönençay Stream in the Asi River Basin was chosen as the study area, which contains abundant underground and surface water reserves in Türkiye. Within this region, not only are the agricultural activities intense, but also hydraulic structure applications such as dams and reservoirs draw attention. Previous studies stated that meteorological and agricultural droughts have started to be noticed in the basin. Therefore, temporal variation analyses can positively contribute to assessing possible hydrological droughts in the following years. In this context, wet and drought periods were determined using the Streamflow Drought Index method at 3, 6, 9, and 12-month time scales with monthly average flow data observed between 1990 and 2020. At the same time, the number and probabilities of drought categories on a 12-month time scale, the first expected transition times between classifications, and the expected residence times between categories were specified. The study revealed that the most severe dry period occurred between 2013 and 2014 and was classified as Extremely Drought. The wettest period was around 2018–2019 and was classified as Extremely Wet. The largest expected time residence among all classifications was calculated for the Extremely Drought category with nine months, means that if the Extremely Drought period ever occurs, it remains for approximately nine months. While the Moderately Drought period occurred within one month following the Extremely Drought duration, and a Near Normal Wet period was observed three months after the Extremely Wet period. The most seen drought category for monthly calculations was the Near Normal Wet category, and was seen over 200 times with a 52.8% probability. Considering the Gönençay region, it is possible that any Extreme drought classification eventually regresses to normal.

    Citation: Serin Değerli Şimşek, Ömer Faruk Çapar, Evren Turhan. Assessment of Hydrological Drought Index change over long period (1990–2020): The case of İskenderun Gönençay Stream, Türkiye[J]. AIMS Geosciences, 2023, 9(3): 441-454. doi: 10.3934/geosci.2023024

    Related Papers:

    [1] Patrick Seeling . Augmented Reality Device Operator Cognitive Strain Determination and Prediction. AIMS Electronics and Electrical Engineering, 2017, 1(1): 100-110. doi: 10.3934/ElectrEng.2017.1.100
    [2] Abdullah Yahya Abdullah Amer, Tamanna Siddiqu . A novel algorithm for sarcasm detection using supervised machine learning approach. AIMS Electronics and Electrical Engineering, 2022, 6(4): 345-369. doi: 10.3934/electreng.2022021
    [3] Suliana Ab-Ghani, Hamdan Daniyal, Abu Zaharin Ahmad, Norazila Jaalam, Norhafidzah Mohd Saad, Nur Huda Ramlan, Norhazilina Bahari . Adaptive online auto-tuning using Particle Swarm optimized PI controller with time-variant approach for high accuracy and speed in Dual Active Bridge converter. AIMS Electronics and Electrical Engineering, 2023, 7(2): 156-170. doi: 10.3934/electreng.2023009
    [4] Suriya Priya R Asaithambi, Sitalakshmi Venkatraman, Ramanathan Venkatraman . Proposed big data architecture for facial recognition using machine learning. AIMS Electronics and Electrical Engineering, 2021, 5(1): 68-92. doi: 10.3934/electreng.2021005
    [5] Edwin Collado, Euribiel Valdés, Antony García, Yessica Sáez . Design and implementation of a low-cost IoT-based agroclimatic monitoring system for greenhouses. AIMS Electronics and Electrical Engineering, 2021, 5(4): 251-283. doi: 10.3934/electreng.2021014
    [6] Folasade M. Dahunsi, Abayomi E. Olawumi, Daniel T. Ale, Oluwafemi A. Sarumi . A systematic review of data pre-processing methods and unsupervised mining methods used in profiling smart meter data. AIMS Electronics and Electrical Engineering, 2021, 5(4): 284-314. doi: 10.3934/electreng.2021015
    [7] Shaswat Chirantan, Bibhuti Bhusan Pati . Integration of predictive and computational intelligent techniques: A hybrid optimization mechanism for PMSM dynamics reinforcement. AIMS Electronics and Electrical Engineering, 2024, 8(2): 265-291. doi: 10.3934/electreng.2024012
    [8] Manisha Bangar, Prachi Chaudhary . A novel approach for the classification of diabetic maculopathy using discrete wavelet transforms and a support vector machine. AIMS Electronics and Electrical Engineering, 2023, 7(1): 1-13. doi: 10.3934/electreng.2023001
    [9] Hakkee Jung . Analytical models of threshold voltage and drain induced barrier lowering in junctionless cylindrical surrounding gate (JLCSG) MOSFET using stacked high-k oxide. AIMS Electronics and Electrical Engineering, 2022, 6(2): 108-123. doi: 10.3934/electreng.2022007
    [10] Dainan Zhang, Dehui Huang, Yanying Chen, Songquan Lin, Chuxuan Li . A lightweight IoT intrusion detection method based on two-stage feature selection and Bayesian optimization. AIMS Electronics and Electrical Engineering, 2025, 9(3): 359-389. doi: 10.3934/electreng.2025017
  • Recently, due to changes in the global climate, there have been significant increases in flood and drought events. The changes in wet and dry periods can be examined by various methods using hydrometeorological data to analyze climate disasters. In this study, Gönençay Stream in the Asi River Basin was chosen as the study area, which contains abundant underground and surface water reserves in Türkiye. Within this region, not only are the agricultural activities intense, but also hydraulic structure applications such as dams and reservoirs draw attention. Previous studies stated that meteorological and agricultural droughts have started to be noticed in the basin. Therefore, temporal variation analyses can positively contribute to assessing possible hydrological droughts in the following years. In this context, wet and drought periods were determined using the Streamflow Drought Index method at 3, 6, 9, and 12-month time scales with monthly average flow data observed between 1990 and 2020. At the same time, the number and probabilities of drought categories on a 12-month time scale, the first expected transition times between classifications, and the expected residence times between categories were specified. The study revealed that the most severe dry period occurred between 2013 and 2014 and was classified as Extremely Drought. The wettest period was around 2018–2019 and was classified as Extremely Wet. The largest expected time residence among all classifications was calculated for the Extremely Drought category with nine months, means that if the Extremely Drought period ever occurs, it remains for approximately nine months. While the Moderately Drought period occurred within one month following the Extremely Drought duration, and a Near Normal Wet period was observed three months after the Extremely Wet period. The most seen drought category for monthly calculations was the Near Normal Wet category, and was seen over 200 times with a 52.8% probability. Considering the Gönençay region, it is possible that any Extreme drought classification eventually regresses to normal.



    In todays and future technology trends, there is a lot of interest to expand smart devices that are compatible with Internet of Things (IoT) [1] based system design. Research work within this area aims to connect devices with most appropriate way towards the development of smart and safe living and smart city technologies [1,2]. In addition, innovative technological systems, as smart homes and smart grids, have enhanced automation limited duties to develop rapidly with the adaptation of new technologies to their applications [2]. The communication between home appliances of consumers is improved by installing advanced technology such as smart plug and smart meter [3,4]. The solution enables real-time monitoring and remote controlling capability of appliances on the consumer premises.

    IoT enabled smart grid is offering a reliable, high quality, flexible, and cost-effective energy supply. Advancement of information technology applications on electrical grid have been empowered real-time data acquisition and data analysis. These application are supported by a new kind of sensor technology for monitoring of the transmission control system [1,2,3] from distribution network to individual homes. Furthermore, combination of smart grid and smart meter allows proliferation of advanced demand-side management applications. Load profiling with smart meter indirectly can be also used for supporting services for senior citizens and their growing demand for living self-independently. However, the existing systems for health status monitoring based on profiling appliance’s power consumption is still immature. So far, health monitoring system is accomplished by either wearables or sensor-based systems [4,5,6,7]. There is a limited number of proposal from the research associated with smart meter data analyzing towards assisting personal health care system [8,9,10]. Smart meters and advance metering infrastructure, unlike IoT wearables, have capacity of analyzing a huge amount of data in real-time [4,5] for a number of houses and supporting a number of services and applications for homes. The goal of this work is to select an identification method for identity detection of appliances with the target to be cost-effective and accurate. This is based with the implementation of the classification of the energy load patterns. The collected energy measurements using smart meter and associated analysis can provide an abnormality detection scheme and to be usable for future eHealth monitoring application. The method is promising because it permits a cost-effective improvement that can empower an extensive modification on the eHealth system [9,10].

    This particular work targets on NIALM system for the appliances level of energy consumption that is based on a single-phase smart meter or socket connected for total agammaegated measurements. Utilization of NIALM has a competitive advantage because of its easy installation on the existing electrical system without performing structural modification. The identification appliances and their load consumption can be detected from central system and can be also controlled for minimizing energy costs and maximizing user's monitoring system at lower cost. The main contributions of this work are:

    ● Comparison of different classification algorithms and demonstration of suitability of k-NN as most suitable classifier for most of the appliances used in the household.

    ● Improvement of NIALM techniques to distinguish the signature of different appliances and their practical behavioral pattern problems based on the agammaegated measurement of the house hold.

    ● Employment of k-NN algorithm to evaluate and to show the performance for accuracy.

    NIALM approach delivers a cost effective solution to monitor appliances with a single metering unit. It also can provide an important information on the appliances during on their on/off status. The information can be beneficial for an activity monitoring system for elderly people, our main research that is not be discuss much in this paper. However, appliances identification is to be used for following up the real-time health status and for serving as a tool for elderly living independently without caregivers by staying in their own homes. This paper further focuses on the approach of using machine learning for NIALM classification, in order to detect or classify the appliances from the agammaegate power consumption with a high-level accuracy performance.

    The rest of the paper is organized as follows. Section 2 presents the background and related work of the appliances identification techniques and describes the k-NN algorithm. Section 3 discusses the NIALM system and the k-NN algorithm for NIALM classification. Experimental results and implementation analysis are presented in Section 4. Finally, Section 5 summarizes the key result and further steps for research in this work.

    ALM system has two categories: 1) Intrusive Load Monitoring (ILM), 2) Non-Intrusive Load Monitoring (NILM). In the next paragraphs, we described these two methods in brief:

    ILM method require a group of sensors attached to each appliance to show information about the appliance load measurement used for functions of appliances load monitoring and classification. The connections of sensors and each appliance through smart plugs [11] provide more accurate measurement than NILM system. The drawback of ILM method is its cost, which is considered to be very high due to the installation complicity of due to the multiple sensor configurations. In the research [12], ILM approach is used as middleware on Atlas and OSGi [12] platform which was implemented to detect a single appliance solution for reduction of costs, as shown in Figure 1. This method can identify real-time controlling and monitoring of household status by detecting each appliance connected to their own smart plug. The proposed framework of two ALM system types is as shown in the Figure 1.

    Figure 1.  NILM Frame work and ILM Framework [12].

    NILM approach identifies energy consumption of each appliance through a connection of a single meter that is attached to center of the house node of electrical system network. NILM concept was originally developed by George Hart during the 80s [13] and have been deployed for expansion of a non-intrusive system to monitor the power consumption of a specific appliance from the power load of the central control system of the electrical network. The method needs very high frequency (for PLAID dataset, 30 kHz) for data communication than ILM methods. It also requires computing a more complex algorithm for identification of the appliances. The process for appliance identification has following phases as described below:

    ● Data Acquisition and processing phase: It starts from the measurements of current, voltage and power consumption and de-noising the obtained measurements. These measurements are used for detection of the appliances by feature extraction. The electrical load measurement is taken at a certain frequency for best result of load pattern identification. Nowadays, smart meters are the more common tools for measuring load consumption. With smart meter, one can obtain pattern and even to distinguish features, starting from data acquisition phase of data collection.

    ● Appliances Feature Extraction phase. The feature is extracted from the signals associated with the voltage and current measurements of appliance and their corresponding current and voltage waveforms in order to compute reactive and active power states. Further, appliance signature for event detection can be extracted from on / off state change of appliances on the power consumption measurements. The identification of initial appliance state changes, and t power level analysis detects the state of the appliances from on to off or vice-versa. The load features can be extracted based on load signatures using various methods [14,15,16,17], in order to provide appliances signature information. Basically, the method [15] based on a numerical perspective meaning using statistical analysis methods in order to distinguish different appliances that are typically expressed in form of vectors and dimensions to determine the amount of features. Then, the selected features can increase the NILAM system classification capability. The real power (P) and reactive power (Q) of the appliances are developed by analyzing physical load characteristics of the turn-on operation of the transient energy and the traditional steady-state power features [16]. The feature extraction can be processed with different method as we described in previous section, in order to gain an accurate appliance dataset that can be used for the process of the next phase.

    ● Load recognition phase: The load recognition process matches the appliances load consumption features and with the features in the database that is obtained from the appliance operating event. The initial appliance state is important for understanding about appliance current-voltage pattern in order to choose an appropriate classification method from either supervised or unsupervised algorithms. The identification procedure is done through classification that is grouped with electrical appliances based on pattern of the current or Ⅴ-Ⅰ trajectory features. On this paper after comparing different machine learning methods, the classification we used is based a training data model. The identification / classification of the appliances is performed by using the k-nearest neighbor’s classifier (k- NN) algorithm.

    K-NN is one of ML algorithms that is used for solving classification and regression problems. The algorithm uses a supervised and instance-based learning algorithm that has been presented by [18], in the early 1950s. k-NN classifier usually used for pattern recognition, image recognition, data mining, text categorization, and predictive analysis. Researchers [18] have proposed k-NN classifier method that uses a memory area for training stage, and its classification space is at least the same as the training area. In that case, the processing time needed for classification steps of k-NN method can be greater than that of the training steps. The main computation of k-NN classifier algorithm is based by the distance calculation between the training dataset and the testing dataset of each element, and by concurrently categorizing instance of measurements. Through years, many researchers have developed and presented k-NN [13], on the NIALM analysis [18,19,20]. The k-NN classifier described on [19] followed the procedures for labeling of testing instances from a majority vote of nearest neighbors of each point and by recommending items of neighboring of unknown values. The labeling of the features followed the nearest neighbors in order to label the value of predictable features. On another research a Probabilistic k-NN (Pk-NN) [20] algorithm was used to compare temperature and humidity sensor datasets. The Pk-NN classifier performance was high and achieved accuracy was acceptable. The algorithm is considered as the easiest and the most common algorithm for processing and allocation of the stored data in the memory. It is also used with a comparison of a new database to perform accurate classification. Even from 19th, distance- weighted k-NN rule in [20] has been shown to motivate selection of the size of k-NN for improving the classification performance. Nowadays, k-NN have been demonstrated [21,22] to be an accurate, relevant and competent classifier method for text categorization, pattern recognition, image recognition.

    We propose that NIALM classification method can be used for monitoring household’s appliance power consumption. We used the PLAID [23] dataset as benchmark, and processing scheme of appliances classification as shown in Figure 2 below.

    Figure 2.  Steps for appliance classification.

    PLAID dataset [23] is an open publically available dataset that contains voltage and current measurements from 60 houses in the USA located at Pittsburgh, Pennsylvania and are collected at sampling frequency 30 kHz. The data contains the energy consumption of 11 kinds of appliances from 55 houses. The data was collected during the summer of 2013 and winter of 2014 and has 1074 sample datasets. The load consumption measurement of each appliance indicates the different instances and variable model of collection can be able to characterize each appliance type. On average 5 instances or measurements have performed for each appliance by considering each of the corresponding state of transition. Table 1, shows the summary of datasets [24] for each appliance. The measurement data are processed and extracted with an interval of a second to cover both the steady-state action and for some appliances with the available startup transient [25]. The PLAID dataset is used here as benchmark dataset for evaluating various approach to classification of various appliances in household. The identification of appliance power usage and pattern have been executed in diverse systems as explained in the paper [26] and based on event-based methodology for NIALM [27]. The collected data considered the measurements for the steady and transient states of the appliance. However, the transient event state is sampled at high frequency whereas the steady state is extracted from low frequency sampled of power measurements. The collected data preprocessed with different stages such as de-noising, filtering, harmonic component separation, signal synchronicity, etc. to be able to indicate an accurate and sensible information.

    Table 1.  Number of instances and appliances for each appliance type [24].
    Appliance type Number ofInstance Appliance type Number ofInstance
    Compact fluorescent lamp 175 Fridge 38
    Vacuum cleaner 38 Incandescent light bulb 114
    Hair dryer 156 Fan 115
    Microwave 139 Washing Machine 26
    Air conditioner 66 Heater 35
    Laptop 172 Total 1074

     | Show Table
    DownLoad: CSV

    NIALM is a methodology used to disagammaegate total power consumption into individual electrical appliance load. To measure the agammaegated signal, the appliances are activated one after another by monitoring the deactivation [26]. The features for the electrical signature is used from the publicly available PLAID data set are extracted by the combination of identifying the on or off state of and events of appliances and by event detection. The feature extraction can be processed from signature of appliances that are measured separately while each appliance is at on/off event state, i.e. the parameters are taken by measuring the load consumption of each appliances one at a time. The feature extraction is based on the Ⅴ-Ⅰ trajectory (voltage/current), and steady state measurements of devices before and after the event. It is also a crucial step to find a recognition mechanism for appliances during the transient threshold crossing point from on/off state or vise verse state. Overall, the data processing stage is responsible for delivering the filtered data to the event detection stage with the electrical parameters such as harmonic components, reactive power, and transient and active power waveforms. The recognition feature of the appliances is different upon event change of appliances state. For this work, as we stated earlier the experiments are done from NIALM, publicly available smart meter PLAID [24] dataset. Our aim is to perform analysis of the correlation between NIALM and the activity detection of consumers with accurate results. The process of identification is done according to the characteristics of the appliance operating states [26,27]. (1) two operating states as on/off; (lamp), (2) multistate (finite state machines) with multistate and a repeated operation; (washing machine), and (3) the infinite state that operates for longer time, day, week, operations; (mobile phones). The type of appliances according to their load consumption pattern is shown in Figure 3.

    Figure 3.  Different appliances types based on their energy consumption pattern [26].

    In all types of states, the signature of the load indicates a specific information for recognizing each appliance separately. In summary, appliance’s signature is categorized according to steady-state signatures and transient signatures of the appliances.

    The signature shows mode of the event from agammaegate power measurements. The measurements from the energy load of appliance are processed and extracted for analyzing appliance load behavior. We obtained the plot of Figure 4, from the given PLAID [24] current and voltage measurements for each appliance.

    Figure 4.  Appliances steady state signature (Ⅵ-trajectory, voltage, and current waveform).

    The signature of appliances is depending on the nonlinear variation state of operation of appliance state. It means that when the appliance is turned on, an instant flow of current appears, as is shown in Figure 5b, which describes transient event of the current pattern. In that case the amplitude of current will be much more than the level of steady-state current [20]. In addition, the onset happens with the change of patterns during the on switching of the selected appliances on transition of operating transition [27,28,29,30]. Such events for the appliances appears when the appliance change its power state, e.g. during transients of charging, boiling, rotating, etc. Normally, offset is caused by appliances switch off with a quick step to down to zero power or other similar way for cutting of power. As an example if the power is off immediately but washing machine motor is operating generator until mechanical stop. The parameters of these states indicate over or under shoots of power amplitude when turned on, and rise/fall time and settling time. For low power appliances, the power change will be visible at real power (P), reactive Power (Q) of the feature as shown as over lapping space as in Figure 5(b). The real power is the amount of energy consumed during operation and for the reactive load there is no any real power load. However, devices that have an electric motor when they turned on, produce a transient event and they have a non-sinusoidal current during operation as shown in Figure 5(a).

    Figure 5.  (a) Current waveform of a vacuum cleaner at transient state (b) Example of P-Q plane of the load distribution of the appliances [19,24].

    Applying feature extraction on the appliances signature of load consumption, can help to categorizes loads by distinguishing similar Ⅴ-Ⅰ trajectories of appliances. This is depending on the type of the appliance state and signature, as described in the section above on the appliance signature and features. Mainly, the appliance features are categorized according to steady state and transient state features including the non-traditional signatures. The steady state variation can also depend on power change between real and reactive power. The non-traditional signatures are often obtained from the mixture of traditional appliance features based on routine of the overall power consumption measurements. The recognition module spots on/off state change of the devices by distinguishing the load variations as described above. Additionally, it shows actions on the change of state (steady state or transient state) that is described as in subsections above. Appliance feature extraction and their state translations discussed in the previous literature [25,26,27] and we summarize this in Figure 6.

    Figure 6.  Classification of appliance features for energy disaggregation.

    The feature extraction thus is based on signature of appliances, as shown in Figure 6, which are classified by appliance steady and transient signatures. The transient describes the edge, trend, duration, and phase of the signature, while the steady state shown with current, the voltage and frequency components of appliances. However, as we described earlier our feature extraction method is based on the voltage and current readings obtained from the PLAID dataset which is used for appliances signature. The feature set includes the parameters such as peak power and current, and RMS current by considering steady state signatures of the appliances [26,27]. Figure 4 shows the voltage and current trajectory (Ⅴ-Ⅰ) of 11 appliances that are classified by steady and transient signatures of the appliances. The Ⅴ-Ⅰ trajectory can create a visible signature or pattern or appliance. We use the plot of the visual features for classification of the appliance. Equation 2, states the resultant wave of the period for voltage (ⅴ) and current (ⅰ) that were developed over time at a sampling frequency (fs) and grid frequency (fg) [28].

    Φp=[φ(p),φ(p+1),,φ(p+d1)]TRd,φ{i,v} (1)

    where d is the number of samples per period fs/fg), p is point at a time, φp, is the resultant wave.

    The PLAID dataset is obtained from readings of one smart meter per household for load consumption of the appliances. The feature has extracted and a combination of features to be used for NIALM and classification of appliances on daily bases. The recognized patterns of features and the classification can help to recognize each appliance based on the operation signature of specific state of the appliances during a definite period operation.

    The results of the appliances classification are depending on the valuable and effective processing method and by identifying features from the available PLAID smart meter dataset. The pattern can be recognized by the operating and estimated energy consumption of each type of the appliances. We verified the accuracy of classification by matching with the available set of features. Here the quantity of the input data is not a factor for the accuracy but it depends rather on the quality of the available information in terms of the feature extraction of the appliances [24,31]. For the load classification mainly machine learning algorithms and optimization and machine learning algorithms used the supervised and unsupervised methods for the classifications. In the study, as also shown below in Figure 7, we have investigated various classification algorithms that are used for features of the smart meter load measurement in order to find out suitable classification algorithms for the dataset. Among them [18] k-Nearest Neighbors (k-NN), Random Forest (RForest), Decision Tree (DTree), Gaussian naive Bayes (GNB) and logistic regression classifier (LGC), have been checked [18]. These multiple classifiers are applied to the dataset on the previous study [18]. The steps of presentation of these classifiers have been proved with various sets of the extracted input features from the measurements taken using smart meters. Some of the features that are used for identifying the appliances, for the classification purposes have both transient and steady state features, as we already mentioned in the previous sections. The binary Ⅵ-trajectory is used as a feature because of being the most capable feature extracted from the previous study achieved on PLAID [18]. This feature is created by plotting voltage compared to current of an instance by giving a binary a 0 or 1 value according the existing model of the trajectory.

    Figure 7.  k-NN classifier accuracy comparison on PLAID dataset.

    Figure 7, shows a plot on comparison of the different ML algorithm, for the purpose of identifying the accurate and suitable algorithm and the classification accuracy is changes with the sampling frequency. The result is based on the PLAID dataset as input and application software based with Python3 code representation and other necessary packages of R studio as illustrated in GitHub [24]. The k-NN and RF showed best performance among the other classifier method. However, and k-NN classifier suits the classification of active or reactive power input features. Even RF seems to be better in result at Figure 7, but we do not use it according to the analysis done by the previous work [32]. It was demonstrated that the k-NN achieved higher accuracy both for imbalanced and balanced datasets, similar to PLAID high frequency dataset, than RF. In addition, we choose for the appliance classification method based on the k-NN procedure since it shows the best result and the algorithm is broadly used as the classification algorithm with accurate implementation [33]. k-NN is simple and effective classification algorithms and we show on the confusion matrix [34] its performance evaluation. It shows that k-NN matrix has higher recognition accuracy for the NIALM methods. For the unbalanced dataset in Table 1, k-NN gives better performance by automatically detecting either linear or non-linear distributed data set. Hence, it tends to perform very well with many data points. In addition, the algorithm is used as a data-mining algorithm for feature extraction and pattern recognition including its classification of the features can be easily implement with low error rate.

    Figure 7, indicates the different algorithms classifications accuracy vs the sampling frequency (kHz) plot that are processed on the PLAID dataset measurement for classifying the household appliances.

    The dataset, using NILAM has classified the total power consumption of household into individual appliance consumption (fridge, air conditioner, etc.).

    Our method using k-NN for classification of events can be summarize as follows:

    ● Detection of events: Events are variations of total consumption, which caused by the change of state of an appliance. Different signal processing algorithms can detect events.

    ● Clustering of events: Events detected are further clustered, e.g. we try to group events that are likely to come from the same change of state of the appliances. The unsupervised machine learning in particular the k-NN algorithms is used at this stage.

    ● Modeling of appliances: With the clusters obtained above and time series analysis, appliance models are built. An appliance model regroups clusters to representing a change of state of the appliance. The supervised machine learning in particular k-Means algorithms is used at this stage.

    ● Tracking of appliances consumption: Once appliance models built, it is possible to track the behavior of each appliance from the load consumption of this appliance at each time.

    Neighbors-based classification is a type of the instance-based learning, which does not construct a general internal model, but simply stores instances of the training data [37]. Classification is computed from a majority vote of the nearest neighbors of each point. A query point is assigned to the data class, which has the most of representatives within the nearest neighbors of the particular point [37]. k-NN algorithm is the most common technique to implement the nearest neighbor classification in where k is an integer value. However, the optimal choice of the value of k is highly data-dependent [35]. Various matrices have been evaluated to determine the distance for categorizing query points in a training dataset [36,37,38].

    However, the k-NN computation of the algorithms is developed by using the distance measure between training dataset and the testing dataset of each element or means identify the k nearest neighbors of a test point. There are many ways to measure the distance between attributes in testing set and training set. The most significant ones is Euclidean equations is shown in equation (1) [39].

     Euclidean: D(x,y)=((x1y1)2+(x2y2)2+..+(x2y2)2)1/2 (2)

    From the formulas, for a definite of nearest number of k, and the unknown dataset x and a distance metric D, and k-NN checks all over the dataset and compute D among x and every training assignment. The new instance is classified by a common selecting its neighbors and by labeling and allocating to the same the class, the instants with maximum shared between their k nearest neighbors [39]. With k = 1, for instance, the case is just allocated to the class of its nearest neighbor. The nearest neighbor search is a challenging task in multidimensional data. The important factors in the nearest neighbor search (k-NN) are adapting the runtime with accuracy for practical and feasible huge datasets

    To summarize shortly the procedure for k-NN classification; it starts by processing the PLAID dataset and extracting its features based in the state of the appliances. The extracted dataset will be categorized into training and testing sets. The similarity will be calculated according to the closest test samples in the training set and by labeling of the new data point with a result of the nearest K data points in the training dataset. Later, the outcome of neighbors and closest neighbors are labeled for unknown data. In that case, the appliance classification based on the k-NN input by knowing the samples on the dataset output and the result, or the output will be on the majority label and we use the same common k-NN algorithm or characterize the NIALM methods [40]. The next sections will discuss the evaluation metrics of the classification of k-NN algorithm for the implementation.

    The PLAID dataset that is composed of 11 appliances (vacuum cleaner, microwave, compact fluorescent lamp, fan, laptop, fridge, hairdryer, heater, incandescent lightbulb, air conditioner, and washing machine) on them on-off state are considered for this model. We discard the washing machine from our evaluation because of the lack of data and due to the appliance has a variable current waveform which not constant during the process. The other issue was from the PLAID data was not possible to separate the testing dataset from the whole training dataset. However, for the purpose of evaluating the training and the testing was performed on the whole dataset and a higher tanning accuracy was obtained. Using our program, we divided the training and testing data proportion that is kept as 90%: 10% throughout the entire of this procedure.

    The VI-binary image is used as input to confirm the classification accuracy. The accuracy evaluation metrics is extensively applied for assessing the classification routine. It mainly helps for analyzing of misleading assumption mostly for unnecessary datasets [23]. The parameters for the confusion matrix [34] are obtained from True Positive (TP) and True Negative (TN) for calculating performance evaluation described on Table 2. The TP & FN are considered when the appliances are on and off respectively.

    Table 2.  Confusion matrix parameters.
    Predicted Positive Predicted Negative
    Actual Positive True Positive (TP) False negative (FN)
    Actual Negative False Positive (FP) True Negative (TN)

     | Show Table
    DownLoad: CSV

    The performance outcome which is calculated from true positive rate measurements are defined as sensitivity or recall parameters and the precision is the positive predicted value. Formula (7) shows the calculation for precision and recall value [34]. The evaluation criteria are distinguishing by performance evaluation indicator to consider the classification accuracy with the pointers of accuracy [34], error rate, G-mean and F-measure that are calculated as shown below:

    Accuracy=TP+TNTP+FP+TN+FN (4)
    Fmeasure=2xPrecisionxRecallPrecision+Recall (5)
    Gmean=Precision+Recall (6)

    The precision and the recall can be evaluated from the matrix as:

    Precision=TPTP+FN,Recall=TPTP++FN (7)

    The confusion matrix that displays the classification of each appliance for the k-NN algorithm is shown in Figure 8. For evaluation of k-NN the data set is divides as testing and training dataset. The 90% of the dataset are taken for training purpose and the rest 10% for testing. The k-NN algorithm has identified, or classified, the appliances from the NIALM dataset very well. However, it is not reliable for some of the appliances because of different reasons: the sample data used are insufficient, and lack of the algorithm ability in identifying the multi-state appliances and the appliances with similar power consumption profiles. However, the accuracies of the identification taken by our experiments are acceptable in case of almost all the appliances. For the appliances (vacuum cleaner, fluorescent lamp, AC, bulb, fan, microwave, hairdryer) the identification number of are mostly dispersed on the central sloping of the matrix. This is indicating when the rate of true positive is more, and the accuracy achieved with the k-NN algorithm is as shown high. From the confusion matrix, for some of the energy consumption of the appliances (laptop, fridge, heater), there is a false positive detection rate, which is high. This case happened for the reasons that are mentioned above and may be -also depend on selection of k values. The F-measure and the G-mean rate based on the performance of K-NN should be considered. However, the algorithm proposed in this paper has a high F1 value and an accuracy from 93% up to 100% is achieved in most cases with the implementation of k-NN algorithm. The confusion matrix as shown in Figure 8, gives more details on the appliances accuracy for the 1-NN configuration and Table 3 shows appliance performance evaluation by the overall F-measure and G-mean value of k-NN classifier.

    Figure 8.  The normalized confusions matrix k-NN classifier.
    Table 3.  Appliance’s F-measure and G-mean values from k-NN classifier.
    Appliance F-Measure G-mean
    Compact FL
    Laptop
    Microwave
    Hairdryer
    Vacuum Cleaner
    Fridge
    Heater
    Il bulb
    Fan
    Air-conditioned
    Total
    0.99
    0.72
    1
    1
    1
    0.68
    0.23
    0.97
    1
    1
    0.9
    1
    0.75
    1
    1
    1
    0.73
    0.21
    0.97
    1
    1
    0.94

     | Show Table
    DownLoad: CSV

    Table 4 and Figure 9 present the comparisons of our F-measure matrix result, with the previous result of Convolutional Neural Network (CNN) [41] that was based on the PLAID dataset on NIALM and classification of the appliances. The CNN classifies is better for recognizing of appliances from transient power state signal than k-NN.

    Table 4.  F-score comparisons of our result (k-NN) with CNN for appliances identification.
    Appliances K-NN (our result) CNN [41]
    CFL
    Laptop
    Microwave
    Hairdryer
    Vacuum
    Fridge
    Heater
    I bulb
    Fan
    AC
    99 %
    72%
    100%
    100%
    100%
    88%
    24%
    97%
    100%
    100%
    96%
    98%
    93%
    79%
    98%
    97%
    82%
    81%
    60%
    47%

     | Show Table
    DownLoad: CSV
    Figure 9.  Comparison of F-measure on k-NN and CNN algorithm for appliances identification results.

    On the other hand, F-measure and G-mean are the performance evaluation metrics for appliance classification. In Table 3 comparison of F- measure of our k-NN classifier and CNN shows that the k-NN algorithm have more accuracy than CNN for the PLAID dataset appliances classification. Figure 9 below shows how for most of the appliances identification (CFL, microwave, hair dryer, vacuum, fridge, Incandescent light bulb(Ibulb), fan and AC), the performance of classifier was extremely good with 100% classification accuracy for each appliances The CNN classifier performance for NIALM detection of appliances for transient power signal data was higher than KNN. However, for the majority of classification of the appliance k-NN was higher.

    NIALM based open dataset PLAID [23,24] shows a promising result for analyzing variations between current and voltage and detecting the energy consumption of each appliance from a single point agammaegated measurements. The extracted features from the voltage-current trajectory are implemented with the k-NN classifier in order to recognize the appliances. The accuracy matrix F-score average equals to 90%. The supervised learner based k-NN implementation has a better performance of accuracy than the rest of the compared machine learning algorithm to classify and to detect each appliance. The k-NN classifier shows that accuracy of the classification is depend on the size of the appliances training datasets. The paper shows the on/off state and event detection of the appliances with the deployment of k-NN classifier has acceptable accuracy for the identification of the appliances. The result of the appliances identification can show that NIALM, besides being an effective technique for advanced demand side management with appliances identification, can be also operated for ambient assisted living by providing indirectly information about the activity and health status for each household from the detected appliance usage. We can show this assumption further as the load consumption and the monitoring system we proposed can recognize the associated activities (cleaning, cooking, sleeping, etc.) of the user on daily basis. Therefore, the method can rely better on the information of the event of the activities and the duration of the appliance usage. In addition, we discovered the appliance energy consumption can be used to consumers monitoring systems with the implementation of machine learning. NIALM can identify the energy consumption of specific appliances with low cost power meter plug that are connected from the central control system.

    In this paper we presented an approach to classify appliances with NIALM techniques by distinguishing signature of different appliances and their behavioral pattern based on disagammaegated total energy consumption data to individual appliance specific energy load. The comparison of different classification algorithms have demonstrated that k-NN is most suitable classifier for most of appliances used in the household. Our analysis summarizes the identification performance can be done based on the operational states of the appliances. Finally, the implementation of the k-NN algorithm for PLAID dataset appliance classification has a potential scalability for detecting activity of the occupant and for monitoring their daily routines.

    For the future, there will be more discussion about the further steps on facilitating effective behavioral detection using NIALM. Further studies are needed to demonstrate the processing and identification events on the effective hardware implementation of NIALM with k-NN that can be more suitable for detecting transient state appliances based a smart meter dataset. Thus providing low cost scalability for system development.

    We would like to thank the Swedish Funding Agency, SIDA for funding and supported through iGrid Project and STINT through IoT4Health Projects at School of Electrical Engineering and Computer Science, Division of Electronics and Embedded system, Royal Institute of Technology (KTH).

    The authors declare that there is no conflict of interest in this paper.



    [1] McKee TB, Doesken NJ, Kleist J (1993) The relationship of drought frequency and duration to time scales. Proc 8th Conf Appl Climatol 17: 179–183.
    [2] Zhang N, Xia Z, Zhang S, et al. (2012) Temporal and spatial characteristics of precipitation and droughts in the upper reaches of the Yangtze River basin (China) in recent five decades. J Hydroinf 14: 221–235. https://doi.org/10.2166/hydro.2011.097 doi: 10.2166/hydro.2011.097
    [3] Łabędzki L, Bąk B (2014) Meteorological and agricultural drought indices used in drought monitoring in Poland: a review. Meteorology Hydrology and Water Management. Meteorol Hydrol Water Manage Res Oper Appl 2: 3–13.
    [4] Rahmat SN, Jayasuriya N, Bhuiyan M (2015) Assessing droughts using meteorological drought indices in Victoria, Australia. Hydrol Res 46: 463–476. https://doi.org/10.2166/nh.2014.105 doi: 10.2166/nh.2014.105
    [5] Oloruntade AJ, Mohammad TA, Ghazali AH, et al. (2017) Analysis of meteorological and hydrological droughts in the Niger-South Basin, Nigeria. Global Planet Change 155: 225–233. https://doi.org/10.1016/j.gloplacha.2017.05.002 doi: 10.1016/j.gloplacha.2017.05.002
    [6] Fung KF, Huang YF, Koo CH, et al. (2019) Standardized precipitation index (SPI) and standardized precipitation evapotranspiration index (SPEI) drought characteristic and trend analysis using the second-generation Canadian earth system model (CanESM2) outputs under representative concentration pathway (RCP) 8.5. Carpath J Earth Env 14: 399–408. https://doi.org/10.26471/cjees/2019/014/089 doi: 10.26471/cjees/2019/014/089
    [7] Sofiane K, Abdesselam M, Nekkache GA (2019) Long-term seasonal characterization and evolution of extreme drought and flooding variability in northwest Algeria. Meteorol Hydrol Water Manage 7: 63–71. https://doi.org/10.26491/mhwm/106101 doi: 10.26491/mhwm/106101
    [8] Bong CHJ, Richard J (2020) Drought and climate change assessment using standardized precipitation index (SPI) for Sarawak River Basin. J Water Clim Change 11: 956–965. https://doi.org/10.2166/wcc.2019.036 doi: 10.2166/wcc.2019.036
    [9] Fellag M, Achite M, Walega A (2021) Spatial-temporal characterization of meteorological drought using the Standardized precipitation index Case study in Algeria. Acta Sci Pol Formatio Circumiectus 20: 19–31. https://doi.org/10.15576/ASP.FC/2021.20.1.19 doi: 10.15576/ASP.FC/2021.20.1.19
    [10] Esit M, Yuce MI (2022) Comprehensive evaluation of trend analysis of extreme drought events in the Ceyhan River Basin, Turkey. Meteorol Hydrol Water Manage 10. https://doi.org/10.26491/mhwm/154573 doi: 10.26491/mhwm/154573
    [11] Minh HVT, Kumar P, Van Ty T, et al. (2022) Understanding Dry and Wet Conditions in the Vietnamese Mekong Delta Using Multiple Drought Indices: A Case Study in Ca Mau Province. Hydrology 9: 213. https://doi.org/10.3390/hydrology9120213 doi: 10.3390/hydrology9120213
    [12] Kebaili Bargaoui Z, Jemai S (2022) SPI-3 Analysis of Medjerda River Basin and Gamma Model Limits in Semi-Arid and Arid Contexts. Atmosphere 13: 2021. https://doi.org/10.3390/atmos13122021 doi: 10.3390/atmos13122021
    [13] Adnan RM, Dai HL, Kuriqi A, et al. (2023) Improving drought modeling based on new heuristic machine learning methods. Ain Shams Eng J 14: 102168. https://doi.org/10.1016/j.asej.2023.102168 doi: 10.1016/j.asej.2023.102168
    [14] Młyński D, Wałęga A, Kuriqi A (2021) Influence of meteorological drought on environmental flows in mountain catchments. Ecol Indic 133: 108460. https://doi.org/10.1016/j.ecolind.2021.108460 doi: 10.1016/j.ecolind.2021.108460
    [15] Adnan RM, Mostafa RR, Islam ARMT, et al. (2021) Improving drought modeling using hybrid random vector functional link methods. Water 13: 3379. https://doi.org/10.3390/w13233379 doi: 10.3390/w13233379
    [16] Amirataee B, Montaseri M (2017) The performance of SPI and PNPI in analyzing the spatial and temporal trend of dry and wet periods over Iran. Nat Hazards 86: 89–106. https://doi.org/10.1007/s11069-016-2675-4 doi: 10.1007/s11069-016-2675-4
    [17] Şişman E (2019) Su Talep Seviyesine Göre Kurak ve Sulak Dönem Analizi. Dicle Üniversitesi Mühendislik Fakültesi Mühendislik Dergisi 10: 301–310. (In Turkish). https://doi.org/10.24012/dumf.449592 doi: 10.24012/dumf.449592
    [18] Turhan E, Değerli S (2021) Analysis of Wet and Drought Periods based on Streamflow Data in the Fırtına Creek Sub-basin between 1965–2015 years. Black Sea J Sci 11: 277–288. https://doi.org/10.31466/kfbd.915979 doi: 10.31466/kfbd.915979
    [19] Moccia B, Mineo C, Ridolfi E, et al. (2022) SPI-Based Drought Classification in Italy: Influence of Different Probability Distribution Functions. Water 14: 3668. https://doi.org/10.3390/w14223668 doi: 10.3390/w14223668
    [20] Akar Ö, Oğuz İ, Yürekli K (2015) Comparison of Dry and Wet Periods with the Help of Some Drought Indexes in İkikara Watershed. J Agric Fac Gaziosmanpaşa Univ 32: 7–13.
    [21] Musonda B, Jing Y, Iyakaremye V, et al. (2020) Analysis of long-term variations of drought characteristics using standardized precipitation index over Zambia. Atmosphere 11: 1268. https://doi.org/10.3390/atmos11121268 doi: 10.3390/atmos11121268
    [22] Nalbantis I (2008) Evaluation of a hydrological drought index. Eur Water 23: 67–77.
    [23] Hong X, Guo S, Zhou Y, et al. (2015) Uncertainties in assessing hydrological drought using streamflow drought index for the upper Yangtze River basin. Stoch Environ Res Risk Assess 29: 1235–1247. https://doi.org/10.1007/s00477-014-0949-5 doi: 10.1007/s00477-014-0949-5
    [24] Gümüş V (2017) Hydrological Drought Analysis of Asi River Basin with Streamflow Drought Index. Gazi Univ Fen Blm Derg 5: 65–73.
    [25] Kubiak-Wójcicka K, Zeleňáková M, Purcz P, et al. (2019) The use of a Standardized Runoff Indicator for hydrological characterization of selected rivers of Poland and Slovakia. Rocznik Ochrona Środowiska 21: 167–183.
    [26] Yaltı S, Aksu H (2019) Drought Analysis of Iğdır Turkey. Turk J Agric Food Sci Technol 7: 2227–2232. https://doi.org/10.24925/turjaf.v7i12.2227-2232.3004 doi: 10.24925/turjaf.v7i12.2227-2232.3004
    [27] Ozkaya A, Zerberg Y (2019) A 40-year analysis of the hydrological drought index for the Tigris Basin, Turkey. Water 11: 657. https://doi.org/10.3390/w11040657 doi: 10.3390/w11040657
    [28] Turhan E, Duyan Çulha B, Değerli S (2022) Hydrological Evaluation of Streamflow Drought Index Method for Different Time Scales: A Case Study of Arsuz Plain, Turkey. J Nat Hazards Environ 8: 25–36. https://doi.org/10.21324/dacd.903655 doi: 10.21324/dacd.903655
    [29] Turhan E, Değerli S, Çatal EN (2022) Long-term hydrological drought analysis in agricultural irrigation area: The case of Dörtyol-Erzin Plain, Turkey. Curr Trends Nat Sci 11: 501–512. https://doi.org/10.47068/ctns.2022.v11i21.054 doi: 10.47068/ctns.2022.v11i21.054
    [30] Hasan HH, Razali SFM, Muhammad NS, et al. (2022) Modified Hydrological Drought Risk Assessment Based on Spatial and Temporal Approaches. Sustainability 14: 6337. https://doi.org/10.3390/su14106337 doi: 10.3390/su14106337
    [31] Dikici M (2020) Drought analysis with different indices for the Asi Basin (Turkey). Sci Rep 10: 20739. https://doi.org/10.1038/s41598-020-77827-z doi: 10.1038/s41598-020-77827-z
    [32] Dikici M, Aksel M (2021) Evaluation of two vegetation indices (NDVI and VCI) Over Asi Basin in Turkey. Teknik Dergi 32: 10995–11011. https://doi.org/10.18400/tekderg.590356 doi: 10.18400/tekderg.590356
    [33] Topçu E, Seçkin N, Haktanır NA (2022) Drought analyses of Eastern Mediterranean, Seyhan, Ceyhan, and Asi Basins by using aggregate drought index (ADI). Theor Appl Climatol 147: 909–924. https://doi.org/10.1007/s00704-021-03873-w doi: 10.1007/s00704-021-03873-w
    [34] Topçu E, Seçkin N (2022) Drought assessment using the reconnaissance drought index (RDI): A case study of Eastern Mediterranean, Seyhan, Ceyhan, and Asi basins of Turkey. J Eng Res 10. https://doi.org/10.36909/jer.12113 doi: 10.36909/jer.12113
    [35] The General Directorate of State Hydraulic Works, Türkiye (DSI), Annual Streamflow Observation Records (1986–2020). Head of Study and Planning Department, Ankara, 2015. Available from: https://www.dsi.gov.tr/Sayfa/Detay/744.
    [36] Hänsel S, Schucknecht A, Matschullat J (2016) The Modified Rainfall Anomaly Index (mRAI)—is this an alternative to the Standardised Precipitation Index (SPI) in evaluating future extreme precipitation characteristics. Theor Appl Climatol 123: 827–844. https://doi.org/10.1007/s00704-015-1389-y doi: 10.1007/s00704-015-1389-y
    [37] Mahmoudi P, Ghaemi A, Rigi A, et al. (2021) Recommendations for modifying the Standardized Precipitation Index (SPI) for drought monitoring in arid and semi-arid regions. Water Resour Manage 35: 3253–3275. https://doi.org/10.1007/s11269-021-02891-7 doi: 10.1007/s11269-021-02891-7
  • This article has been cited by:

    1. Qichun Zhang, Xuewu Dai, Special Issue “Uncertainties in large-scale networked control systems”, 2020, 4, 2578-1588, 345, 10.3934/ElectrEng.2020.4.345
    2. Amleset Kelati, Hossam Gaber, Juha Plosila, Hannu Tenhunen, 2020, Implementation of K-nearest Neighbor on Field Programmable Gate Arrays for Appliance Classification, 978-1-7281-9912-2, 51, 10.1109/SEGE49949.2020.9181975
    3. Yu Liu, Qianyun Shi, Yan Wang, Xin Zhao, Shan Gao, Xueliang Huang, An Enhanced Ensemble Approach for Non-Intrusive Energy Use Monitoring Based on Multidimensional Heterogeneity, 2021, 21, 1424-8220, 7750, 10.3390/s21227750
    4. Thi-Thu-Huong Le, Shinwook Heo, Howon Kim, Toward Load Identification Based on the Hilbert Transform and Sequence to Sequence Long Short-Term Memory, 2021, 12, 1949-3053, 3252, 10.1109/TSG.2021.3066570
    5. Lei Yan, Wei Tian, Jiayu Han, Zuy Li, Event-driven two-stage solution to non-intrusive load monitoring, 2022, 311, 03062619, 118627, 10.1016/j.apenergy.2022.118627
    6. Gyubaek Kim, Sanghyun Park, Pre-trained non-intrusive load monitoring model for recognizing activity of daily living, 2022, 0924-669X, 10.1007/s10489-022-04053-7
    7. Omar Al-Khadher, Azharudin Mukhtaruddin, Fakroul Ridzuan Hashim, Muhammad Mokhzaini Azizan, Hussin Mamat, Mohamed Mani, 2022, Comparison of non-intrusive load monitoring supervised methods using harmonics as feature, 978-1-6654-7087-2, 1, 10.1109/ICECET55527.2022.9872689
    8. Leonce Wehnelt TOKAM, Sanoussi S. OURO-DJOBO, Non-Intrusive Load Monitoring (NILM), Interests and Applications, 2023, 2714-4704, 1208, 10.46481/jnsps.2023.1208
    9. Zuoxin Wang, Xiaohu Zhao, AttG-BDGNets: Attention-Guided Bidirectional Dynamic Graph IndRNN for Non-Intrusive Load Monitoring, 2023, 14, 2078-2489, 383, 10.3390/info14070383
  • Reader Comments
  • © 2023 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)
通讯作者: 陈斌, bchen63@163.com
  • 1. 

    沈阳化工大学材料科学与工程学院 沈阳 110142

  1. 本站搜索
  2. 百度学术搜索
  3. 万方数据库搜索
  4. CNKI搜索

Metrics

Article views(1709) PDF downloads(183) Cited by(1)

Figures and Tables

Figures(8)  /  Tables(4)

/

DownLoad:  Full-Size Img  PowerPoint
Return
Return

Catalog