
Few-shot learning based on deep learning: A survey

  • In recent years, increasingly powerful computing devices have driven rapid progress in science and technology, and deep learning (DL) has built on this foundation to achieve success in many fields. The success of DL also relies on large-scale datasets, which expose models to a wide variety of images. The rich information in these images helps a model learn more about each category, improving its classification performance and generalization ability. In real application scenarios, however, most tasks cannot collect enough images for model training, which restricts the performance of the trained model. How to train a high-performance model with limited samples therefore becomes key. To address this problem, the few-shot learning (FSL) strategy was proposed, which aims to obtain a strong model from a small amount of data. FSL can thus show its advantages in real-world tasks where large amounts of training data cannot be obtained. In this review, we introduce DL-based FSL methods for image classification, divided into four categories: methods based on data augmentation, metric learning, meta-learning and the addition of other auxiliary tasks. First, we introduce classic and advanced FSL methods category by category. Second, we introduce datasets commonly used to evaluate FSL methods, along with the performance of some classical and advanced FSL methods on two common datasets. Finally, we discuss the current challenges and future prospects of this field.

    Citation: Wu Zeng, Zheng-ying Xiao. Few-shot learning based on deep learning: A survey[J]. Mathematical Biosciences and Engineering, 2024, 21(1): 679-711. doi: 10.3934/mbe.2024029




    In recent years, the rapid development of science and technology has driven the continuous iteration of computing devices. For example, central processing units (CPUs) and graphics processing units (GPUs) have grown more powerful with each generation, effectively promoting the progress of deep learning (DL) technology. At present, DL has achieved great success in many application fields, such as image classification [1], object detection [2], deepfakes [3], generative adversarial networks (GANs) [4], natural language processing (NLP) [5], short video event detection [6], video summarization [7], dense video captioning [8], action detection [9] and video object segmentation [10]. It has also seen considerable industrial application, for example in intelligent evaluation of gear surface degradation [11], cooperative fault identification of rotating machinery [12] and networks for bearing fault diagnosis [13]. There are three main reasons for the success of DL in these areas. First, computing devices (CPUs, GPUs) with powerful performance provide the computational support these tasks require. Second, network architectures with strong feature extraction capabilities, such as AlexNet [14], ResNet [15], MobileNet [16], ShuffleNet [17] and DenseNet [18], have made great achievements in many fields [19,20,21]. Finally, DL is supported by large datasets (for example, the ImageNet dataset [22] contains more than 14 million labeled images, and the COCO dataset [23] contains more than 2 million labeled instances). Most high-performance models are trained on large datasets. The more images each category in a dataset has, the more variation in pose, size ratio, etc. of the target object it covers, and the more information the model can learn from it. For example, consider an image classification task that contains the category "monkey." If this category has more than 10,000 images, they will likely capture the target object "monkey" at different angles, in different poses, against different backgrounds and at different size proportions, largely covering the poses a camera is likely to capture in practice. A model trained on such categories has a high probability of strong performance and generalization ability in practical applications. Probabilistically, the more images the training set contains, the higher the probability that an image presented to the model at test time resembles one in the training set, and the higher the model's recognition accuracy. In short, large datasets are largely the key to whether a model can achieve strong performance. Unfortunately, in actual application scenarios, the vast majority of tasks cannot collect so many images for model training. Moreover, even when more images can be collected, labeling them all is very time-consuming and laborious, and clearly not something individuals, small groups or small companies can afford.
Therefore, in real scenarios, most tasks can only collect limited (i.e., insufficient) image data for training the final model. When a dataset contains too few images, training traditional convolutional neural networks (CNNs) may yield a model with poor classification performance and generalization ability. Traditional training strategies and models thus struggle to meet the needs of most tasks under small-sample conditions, and better learning strategies are needed. Humans, by contrast, can learn key information about a target object from very few images. For example, a child who has never seen a monkey can learn its important characteristics from only one or a few photos, and can then quickly recognize a monkey upon seeing one in a zoo or elsewhere. Interestingly, this seems to be an innate human ability. Inspired by this, the few-shot learning (FSL) strategy [24] was proposed. As the name implies, FSL aims to learn the discriminative features of the target object from a small number of images. FSL methods still need strengthening with respect to understanding their internal mechanisms and addressing cross-domain learning. The goal of FSL is to acquire a learning ability similar to that of humans. FSL is applied in many fields, such as crop disease identification [25], food classification [26] and component life assessment [27]. Specifically, in DL-based FSL tasks, most strategies divide the available images into a training set, a validation set and a test set, with no overlap among them. In the training phase, most methods tune model parameters on the training and validation sets and then evaluate performance on the test set. Generally, the training set has the most categories, while the test set has fewer. To better explain the basic process of FSL, the basic framework of the FSL strategy is given in Figure 1.

    Figure 1.  The basic framework of FSL strategy.

    To optimize or alleviate particular problems of FSL, researchers have proposed many FSL methods. Wang et al. [28] regarded the core problem of FSL as the unreliability of empirical risk minimization and divided FSL strategies into three major categories: data, models and algorithms. Lu et al. [29] reviewed a long period of FSL research (from 2000 to 2019), mainly emphasizing meta-learning-based methods, and also introduced applications of FSL in fields such as computer vision and natural language processing. In a more recent survey (2023), Li et al. [30] focused on deep metric learning in FSL and divided metric learning into three groups: learning feature embeddings, learning class representations and learning distance measures. In this review, we mainly survey few-shot image classification methods, summarized in Figure 2. We divide them into four categories: FSL methods based on data augmentation, FSL methods based on metric learning, FSL methods based on meta-learning and FSL methods that add other auxiliary tasks. In addition, as shown in Figure 2, we further subdivide each major category so that readers can better understand the mechanisms and strategies each method mainly uses.

    Figure 2.  Various classic and advanced FSL methods.

    The rest of this review is arranged as follows: In Section 2, we introduce the basic definition and main classifications of FSL. In Section 3, FSL methods based on data augmentation are introduced. In Section 4, we introduce FSL methods based on metric learning in detail. In Section 5, FSL methods based on meta-learning are introduced. In Section 6, we introduce FSL methods based on other strategies. In Section 7, we summarize FSL methods at the current stage. In Section 8, the main datasets and evaluation metrics for few-shot image classification are introduced, along with the performance of some classical and advanced methods on several datasets. In Section 9, we discuss current achievements, challenges and future prospects. In Section 10, we conclude this review.

    In addition, in Figure 3, we summarize some mainstream and advanced methods in the FSL field in recent years, arranged along a publication timeline to convey the field's development. The methods listed are either highly representative (their ideas are cited and improved upon by many subsequent methods) or offer particularly strong performance.

    Figure 3.  Some mainstream and advanced FSL methods in recent years.

    To better introduce FSL, we first explain its basic definition in Section 2.1. Subsequently, FSL strategies based on data augmentation, metric learning and meta-learning are introduced in Sections 2.2, 2.3 and 2.4, respectively.

    Generally speaking, FSL tasks commonly adopt meta-learning-style training. Specifically, the datasets used for few-shot image classification are divided into a base set D_base and a novel set D_novel (note that these two sets share no overlapping images or classes). FSL tasks generally follow the N-way K-shot protocol. Each meta-task T contains a support set S and a query set Q, and the meta-task aims to classify the images in the query set given the support set. Specifically, in an N-way K-shot task, N classes are first sampled from the training set and denoted C, and then K support samples and q query samples are drawn from each of these N classes. The support set S, query set Q and meta-learning task T can be defined as:

    S = {(x_i, y_i) | y_i ∈ C, i = 1, 2, ..., N × K} (2.1)
    Q = {(x_i, y_i) | y_i ∈ C, i = 1, 2, ..., N × q} (2.2)
    T = {(S_i, Q_i)}_{i=1}^{m} (2.3)

    Here, x_i denotes an image sample and y_i its label, and m denotes the number of meta-tasks in T. To illustrate, a 2-way 2-shot FSL task is shown in Figure 4: two classes are randomly drawn from the dataset, and from each class two samples are taken for the support set and another two for the query set. A code sketch of this episodic sampling follows Figure 4.

    Figure 4.  The FSL task description of 2-way 2-shot.
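    To make the episodic protocol concrete, here is a minimal Python sketch of N-way K-shot episode sampling; the function name and the dict-based dataset layout are illustrative assumptions rather than part of any published method:

```python
import random

def sample_episode(data_by_class, n_way=5, k_shot=1, q_queries=15):
    """Sample one N-way K-shot meta-task (hypothetical helper).

    data_by_class: dict mapping each class label to a list of images.
    Returns support and query sets as lists of (image, label) pairs,
    matching Eqs. (2.1) and (2.2).
    """
    chosen = random.sample(list(data_by_class), n_way)  # draw the N classes C
    support, query = [], []
    for label, cls in enumerate(chosen):
        # Draw K + q distinct samples from this class without replacement.
        samples = random.sample(data_by_class[cls], k_shot + q_queries)
        support += [(img, label) for img in samples[:k_shot]]
        query += [(img, label) for img in samples[k_shot:]]
    return support, query
```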

    The name FSL itself conveys the core difficulty: training samples are scarce (i.e., insufficient). Generally speaking, in many DL tasks the most common response to such problems is data augmentation [31,32,33]. Image data augmentation can generate new samples that differ somewhat from the original image, expanding the sample pool and thus improving model performance [34,35]. Simple data augmentation methods include rotation, random cropping and translation. These methods are simple and practical, but the samples they generate are too similar to the original image, so their effectiveness is limited. To illustrate, a rough flowchart of how some methods generate samples through simple data augmentation strategies is shown in Figure 5; a short code sketch follows the figure.

    Figure 5.  Simple data augmentation strategies.
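    As a concrete illustration, the following torchvision sketch composes the simple augmentations just mentioned; the parameter values (rotation range, crop size, translation fraction) are illustrative assumptions:

```python
from torchvision import transforms

# Parameter values are illustrative; each pass produces a sample that
# differs only slightly from the original image.
simple_augment = transforms.Compose([
    transforms.RandomRotation(degrees=30),                     # rotation
    transforms.RandomResizedCrop(size=84, scale=(0.8, 1.0)),   # random cropping
    transforms.RandomAffine(degrees=0, translate=(0.1, 0.1)),  # translation
    transforms.ToTensor(),
])

# Applying the pipeline several times to one PIL image yields several
# new training samples: [simple_augment(img) for _ in range(4)]
```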

    In addition, GAN technology, popular in recent years [36,37], can generate realistic synthetic samples through an adversarial game between the generator and the discriminator. With researchers' continuous efforts, this family of methods has produced many high-performance variants that can generate samples with richer backgrounds. The approximate process of generating new samples with a GAN is shown in Figure 6, and a training-step sketch follows the figure. This type of method is also a powerful driver of progress in augmentation-based FSL. Overall, the data augmentation strategy uses augmentation techniques to create new samples, expanding one original image into many and extracting additional feature information from them, thereby playing an auxiliary role. In practice, however, although this type of method helps, its effectiveness is limited.

    Figure 6.  The approximate process of generating new samples using GANs method.
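    The following is a minimal PyTorch sketch of this adversarial game; the fully connected networks and dimensions are illustrative assumptions, since published few-shot GANs use much richer architectures:

```python
import torch
import torch.nn as nn

# Illustrative sizes; published few-shot GANs use convolutional networks.
noise_dim, feat_dim = 64, 256
G = nn.Sequential(nn.Linear(noise_dim, 128), nn.ReLU(), nn.Linear(128, feat_dim))
D = nn.Sequential(nn.Linear(feat_dim, 128), nn.ReLU(), nn.Linear(128, 1))
opt_g = torch.optim.Adam(G.parameters(), lr=2e-4)
opt_d = torch.optim.Adam(D.parameters(), lr=2e-4)
bce = nn.BCEWithLogitsLoss()

def gan_step(real):
    """One adversarial training step; real is a (batch, feat_dim) tensor."""
    batch = real.size(0)
    fake = G(torch.randn(batch, noise_dim))
    # Discriminator: score real samples toward 1 and generated ones toward 0.
    loss_d = (bce(D(real), torch.ones(batch, 1))
              + bce(D(fake.detach()), torch.zeros(batch, 1)))
    opt_d.zero_grad(); loss_d.backward(); opt_d.step()
    # Generator: try to fool the discriminator into scoring fakes as real.
    loss_g = bce(D(fake), torch.ones(batch, 1))
    opt_g.zero_grad(); loss_g.backward(); opt_g.step()
    return fake  # virtual samples that can augment the few-shot support set
```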

    For most FSL methods based on metric learning, the core idea is to measure the similarity between samples: the higher the similarity between two samples, the more likely they belong to the same category, and this serves as the criterion for classification. To illustrate this type of strategy, Figure 7 shows its basic architecture. As Figure 7 suggests, a feature extraction module first processes the input samples. The model then maps the feature vectors into an embedding space (the purpose of training is to pull samples of the same category closer together and push samples of different categories farther apart). Afterwards, the distance between unlabeled query-set samples and support-set samples is computed by the metric module (whose distance function can be cosine similarity, Euclidean distance, etc.), and each unlabeled sample is finally assigned to the support-set class nearest to it. A sketch of this pipeline follows Figure 7.

    Figure 7.  The basic model based on metric learning.
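    A minimal sketch of this generic pipeline (embed, measure, assign to the nearest support class), assuming cosine similarity as the metric and an arbitrary encoder network:

```python
import torch
import torch.nn.functional as F

def classify_by_metric(encoder, support_x, support_y, query_x):
    """Assign each query sample to the class of its nearest support sample.

    encoder: any feature extraction network returning (n, d) embeddings.
    support_y: (n_support,) tensor of integer class labels.
    """
    z_s = F.normalize(encoder(support_x), dim=-1)  # embed the support set
    z_q = F.normalize(encoder(query_x), dim=-1)    # embed the query set
    # Cosine similarity between every query and every support embedding;
    # Euclidean distance is an equally common choice of metric.
    sims = z_q @ z_s.t()           # (n_query, n_support)
    nearest = sims.argmax(dim=1)   # most similar support sample per query
    return support_y[nearest]      # predicted class labels
```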

    Generally speaking, meta-learning-based methods embody learning to learn, and their central idea is cross-task learning. The general idea is shown in Figure 8: for each task, the method provides learning parameters better suited to that task, thereby guiding the model to learn quickly on new tasks.

    Figure 8.  The basic model based on meta-learning.

    We explore the differences between methods through extensive research. We first classify data augmentation based FSL methods into two main categories, "input image enhancement" and "feature enhancement," according to whether additional auxiliary images are generated from the original input samples. Second, based on how much the generated image differs from the original (i.e., whether the generated sample differs significantly from the original image), we further divide "input image enhancement" into two categories: "simple image amplification" and "complex image synthesis."

    Chen et al. [38] proposed an image enhancement framework combined with meta-learning to enhance the diversity of input samples. To increase input-image diversity, they use artifacts, cropping and replacement between paired images (similar to the crop-and-paste strategy of CutMix [39]; see the sketch after this paragraph) and random noise to generate new samples. To avoid altering important content in the original image, the authors use probe images to preserve its important areas. Then, to maintain the linearity of the generated samples, a simple parameterization method generates new samples linearly. Experimental results indicate that the method achieved very competitive performance at the time. Later, Khodadadeh et al. [40] proposed UMTRA (unsupervised meta-learning with tasks constructed by random sampling and augmentation), an unsupervised meta-learning algorithm for FSL. To expand the learning samples during training, UMTRA uses random sampling and random augmentation to increase the number of samples. This method only requires labeled samples during the final feature learning and induction of the target object, and the number of labeled samples can be as few as one. Compared with traditional meta-learning algorithms, the number of labels required is greatly reduced.
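    For reference, here is a minimal sketch of a CutMix-style crop-and-replace operation [39]; the helper name and the Beta-distributed patch area follow the usual CutMix recipe and are not taken from the exact implementation of Chen et al. [38]:

```python
import torch

def cutmix_pair(img_a, img_b, alpha=1.0):
    """Paste a random patch of img_b into img_a (CutMix-style [39]).

    img_a, img_b: (C, H, W) tensors. The patch area follows the usual
    Beta-distributed ratio; the helper itself is hypothetical.
    """
    lam = torch.distributions.Beta(alpha, alpha).sample().item()
    _, h, w = img_a.shape
    cut_h, cut_w = int(h * (1 - lam) ** 0.5), int(w * (1 - lam) ** 0.5)
    cy, cx = torch.randint(h, (1,)).item(), torch.randint(w, (1,)).item()
    y1, y2 = max(cy - cut_h // 2, 0), min(cy + cut_h // 2, h)
    x1, x2 = max(cx - cut_w // 2, 0), min(cx + cut_w // 2, w)
    mixed = img_a.clone()
    mixed[:, y1:y2, x1:x2] = img_b[:, y1:y2, x1:x2]  # replaced region
    return mixed
```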

    Antoniou et al. [41] proposed the AAL (assume, augment and learn) method to address insufficient sample data in FSL tasks. Its core idea is to expand the support set through data augmentation to counter the shortage of support images. Specifically, a subset is first randomly generated from an unlabeled dataset. Data augmentation then expands the image samples in the subset, and clustering is performed on the regenerated support set to obtain a target set. Finally, a meta-learning FSL model can be trained. Qin et al. [42] developed ULDA (unsupervised few-shot learning via distribution shift-based data augmentation), a framework built on distribution shift and data augmentation strategies; its approximate architecture is shown in Figure 9. When applying augmentation, ULDA always attends to the distribution of samples in each learning subtask, in order to reduce the overfitting the model may suffer. Using only simple augmentations such as image rotation and random cropping, ULDA can positively affect the model's overall classification performance. Xu et al. [43] proposed the CUMCA method to distinguish the key categories in tasks. It first constructs embedding tasks based on clustering and data augmentation. To avoid shortcomings such as the poor diversity of images generated by simple augmentation, the authors adopted different strategies in the inner and outer loops of MAML (model-agnostic meta-learning) and provided theoretical analysis in support. Then, to further improve overall performance, they proposed a data augmentation method called Prior-Mixup, which outperforms simple flipping, cropping and rotation.

    Figure 9.  The architecture of the ULDA.

    Although the input image can be augmented by simple data augmentation methods, the effect is limited because the generated samples resemble the original. By contrast, methods based on GANs [44,45,46] and related models can generate images that differ substantially from the original, and some researchers have introduced them into FSL with good results.

    Mehrotra and Dukkipati [47] combined a residual network with a GAN to address problems encountered in one-shot learning. They also used a trainable distance measure in the one-shot task and achieved competitive results on multiple datasets at the time. Wang et al. [48] designed a model that uses a GAN to generate new samples. The model can create new virtual samples from existing ones, and the backgrounds of the generated samples differ considerably from the originals. Specifically, the model generates new samples by feeding noise and existing samples into the generator; the hallucination module is then embedded into the meta-learning task to improve model performance. In general, the backgrounds of images generated by this method are relatively complex and quite different from the original images. Zhang et al. [49] introduced MetaGAN, a GAN-based method. Slightly differently from the above methods, MetaGAN must not only distinguish samples of different categories but also judge the authenticity of incoming images. In other words, authenticity judgment can be regarded as an auxiliary task: if the model can accurately distinguish real samples from generated ones, it must have learned the fine-grained differences between these images. This helps the model better learn the content of the images and thus improves classifier performance.

    Schwartz et al. [50] proposed a sample synthesis method called Delta-encoder to address object recognition with few samples. It needs only a few samples and can synthesize new samples for unseen categories, which are then introduced into model training. The method extracts transferable intra-class deformations between pairs of similar samples and transfers them to new categories. Xian et al. [51] proposed F-VAEGAN-D2 by combining the advantages of GANs and VAEs, enabling any-shot learning. F-VAEGAN-D2 mainly synthesizes CNN image feature vectors from class features. The method also adds constraints to the feature generation model so that the learning model can synthesize the CNN-derived image feature vectors within category features. Moreover, the chosen discriminator can exploit samples not previously seen by the model to further improve performance. Experiments on multiple datasets show that this method has excellent FSL performance and generalization ability. Li et al. [52] proposed the AFHN (adversarial feature hallucination networks) method based on the conditional Wasserstein GAN (CWGAN) to address the FSL challenge. Compared with most GAN methods, the CWGAN improves the stability of training to some extent by improving the model's objective function. The authors further argue that simple data augmentation may not achieve the desired results, so AFHN introduces two novel regularizers (a classification regularizer and an anti-collapse regularizer) into the GAN to improve the quality of the generated virtual images, mainly reflected in the richness of image features.

    In addition to the above methods, Pahde et al. [53] used the idea of cross-modal hallucination to address the FSL problem. They used another modality to impose additional conditions and thereby generate more specific images. Specifically, text information is input as an additional control condition, and a new virtual image is then generated. Experimental results on the CUB dataset show that the method is highly robust.

    The two categories of methods above mostly generate extended samples from the original images, enhancing sample diversity by increasing the number of image samples. Another approach enhances diversity by augmenting the feature vectors of the input samples. Studies have shown that model performance can also be improved by expanding the feature representation of input samples.

    Dixit et al. [54] proposed the AGA (attribute guided augmentation) method to improve target detection in FSL. Specifically, a CNN feature extraction module first extracts features from the incoming image and maps them to a feature space, where the features are augmented so that their attributes meet expectations. Liu et al. [55] proposed the FATTEN (feature transfer network) method using an "encoder-decoder" strategy. Unlike methods that extract features from images directly, the predictor in FATTEN parameterizes the pose of input samples. An encoder then maps the appearance and pose of the target object, and a decoder finally generates the required feature vector of the object. Experimental results show that this method performs well on few-shot object recognition.

    Chen et al. [56] did not simply augment the input image, but enhanced the image's input feature vector through semantic augmentation. Specifically, their TriNet method first uses a multi-layer CNN to extract the basic features of the initial image. An encoder then projects the extracted feature vector into a semantic space for semantic augmentation, and a decoder projects it back to the original feature space. The method has been tested extensively on multiple FSL datasets, and the results show excellent performance. To generate richer images, Zhang et al. [57] separated the foreground and background of input images by introducing saliency detection. Salient regions can be regarded as important target objects in the image (the foreground), and non-salient regions as the background. A feature extraction network then extracts feature vectors for the foreground and background separately. Finally, foreground and background feature vectors are randomly recombined to generate new virtual samples, enhancing the diversity of feature information.

    From the brief introduction in Section 2.3, we can see that the quality of the feature information fed into the metric module largely determines the accuracy of the classification module. In the feature extraction phase, accurately capturing the important features of an image is therefore extremely important. An image contains many sources of interference (background clutter, varying proportions of the target object's size, different poses of the target object, etc.). If these interferences can be alleviated and the important information in the image extracted more accurately, better feature vectors can undoubtedly be obtained. Accordingly, we divide metric learning methods into two categories: "methods based on simple feature information acquisition" and "methods based on better feature information acquisition."

    One important aim of metric learning based FSL methods is to obtain more accurate information about the input samples, and accurately extracting their features is an essential step. In 2015, Koch et al. [58] proposed Siamese neural networks. The core strategy is to use a basic CNN to extract features of the input samples and map them into a feature space. The similarity between samples is then detected by a twin metric network architecture that shares the same parameters: paired sample feature vectors are fed into the metric network to obtain similarity scores. Note that in the one-shot task, each query-set sample must be tested pairwise to determine the final similarity between samples. Vinyals et al. [59] proposed matching networks to address problems encountered in FSL, also handling the one-shot setting. After feature extraction, the image samples are passed through a long short-term memory (LSTM) network into a low-dimensional feature space for similarity comparison. The method also applies an attention mechanism during similarity comparison, whose main function is to allocate different weights to regions of different importance, enabling a better calculation of inter-image similarity. In 2017, Snell et al. [60] proposed the classic prototypical networks (ProtoNet). The authors posit a prototype point for each input category; the similarity of each query-set sample to a class is then reflected by its distance to that class's prototype. The model first extracts features from the incoming image and maps them into an embedding space, where training pulls samples of the same category closer together and pushes samples of different categories farther apart. The prototype point of each category, also called its center point, is then computed (illustrated in Figure 10), and the distance between a query sample's feature vector and each class center is measured; the smaller the distance, the closer the categories. Finally, the sample is assigned to the nearest support-set category. This method attracted wide attention upon publication and achieved solid results. Its feature extraction stage is relatively simple, so its feature representation is limited, but its idea of prototype points is highly instructive, and much subsequent work builds on it. A sketch of this prototype-based classification follows Figure 10.

    Figure 10.  Schematic diagram of the rough strategy for prototype points in the ProtoNet method.
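    A minimal sketch of ProtoNet's prototype computation and distance-based classification [60], assuming squared Euclidean distance as in the original paper:

```python
import torch

def protonet_logits(encoder, support_x, support_y, query_x, n_way):
    """Prototype-based classification following Snell et al. [60].

    Each class prototype is the mean embedding of its K support samples;
    queries are scored by negative squared Euclidean distance, so the
    nearest prototype gives the predicted class.
    """
    z_s = encoder(support_x)              # (N*K, d) support embeddings
    z_q = encoder(query_x)                # (n_query, d) query embeddings
    prototypes = torch.stack(
        [z_s[support_y == c].mean(dim=0) for c in range(n_way)])  # (N, d)
    logits = -torch.cdist(z_q, prototypes).pow(2)  # distances act as logits
    return logits  # training minimizes cross-entropy on these logits
```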

    Later, Sung et al. [61] addressed the FSL task via an improved similarity measure. Unlike most strategies that compute similarity with a fixed metric function (cosine or Euclidean distance), their relation network uses a network architecture to learn a nonlinear similarity function and uses it to score the similarity between samples. This method achieved good classification accuracy on multiple datasets. Li et al. [62] considered local features in an image to be extremely important and proposed DN4 (deep nearest neighbor neural network), which focuses on describing local feature information. DN4 extracts features at each position of the input image and uses them as input to the subsequent classifier; a deep local feature module then describes these per-position features. Finally, an "image-to-class" metric learning strategy measures similarity between samples, and the resulting similarity scores determine the category of unknown samples. Liu et al. [63] proposed the TPN (transductive propagation network) method to improve the weak generalization of some FSL methods; its general architecture is shown in Figure 11. Like some metric learning methods, TPN first extracts features of the input samples and maps them into an embedding space. Then, from the class structure constructed by the support and query sets, it obtains the manifold structure of the sample categories. The method then iteratively propagates support-set labels to query-set samples over a graph structure, achieving label propagation and measuring the category similarity between query and support samples. Simon et al. [64] proposed a subspace-based FSL framework, DSN (deep subspace network). DSN projects the feature vectors from the feature extractor into a subspace, bringing data samples with similar feature vectors closer. It can capture complex relationships between samples and project them back to a low-dimensional space after dimensionality reduction. To adapt to tasks of different dimensionality, DSN can also adaptively shrink the dimension of the subspace. The authors further proposed DSN-MR (DSN mean refinement) by evenly refining the center points between classes.

    Figure 11.  The architecture of the TPN.

    Unlike ProtoNet, which simply averages the samples of each class to obtain the class center, the IMP (infinite mixture prototypes) method proposed by Allen et al. [65] treats prototypes in a finer-grained way. The method builds on ProtoNet's metric learning: IMP first adds an infinite mixture model strategy to ProtoNet and strategically weights the distribution of samples within categories, and then interpolates between nearest neighbors and prototype points to obtain better performance and robustness.

    Although most methods in Section 4.1 have achieved results, shortcomings remain. For example, their feature extraction stage cannot capture feature information about the input samples as finely as possible, yet obtaining more accurate features of the input samples is extremely important. In FSL, where labeled data are scarce, failing to extract the input data optimally is a real loss, so optimizing feature extraction is both necessary and important. Researchers have therefore proposed various methods to this end. Facing insufficient input samples, Xing et al. [66] proposed the AM3 (adaptive modality mixture mechanism) method. Building on ProtoNet, AM3 introduces the idea of cross-modality: whereas most methods extract features only from the visual information of input samples, AM3 additionally extracts semantic information from the image and designs a module to combine the two adaptively. More feature information is thus obtained from limited samples, enhancing feature expression. Experimental results show a larger performance improvement over the baseline. Li et al. [67] proposed the K-tuplet network to improve on ProtoNet in three respects. First, a more effective similarity measurement strategy is obtained by improving the loss function of the similarity measure. Second, an additional category embedding module provides an extra feature vector carrying category information; vectors of the same category can be mapped into the same embedding feature space, yielding a better feature representation of the input sample. Finally, dynamic matching networks adaptively adjust the similarity measure to the demands of different tasks, letting the model reach a better state. Yan et al. [68] proposed STANet (spatial task attention network), a semantically embedded dual-attention network model. By building an efficient dual-attention architecture, STANet uses one attention module to pick out important local features of a single image sample, letting the model better capture the sample's important representation, while another attention module processes the features of multiple images to find other images with feature information similar to the current one. STANet achieves better performance by obtaining a better feature representation of image samples: with Conv64 as the feature extraction backbone, it reaches 53.11% accuracy in the 5-way 1-shot setting on the MiniImageNet dataset. Li et al. [69] proposed a framework motivated by the observation that large background differences between images may hurt model performance. The framework finds important regions in the image for focused learning, and metric learning is also used as an auxiliary task to learn more discriminative feature information about the sample.

    Later, Gao et al. [70] designed instance-level and feature-level attention schemes to achieve a more refined representation of important sample information. With these two attention mechanisms, the model obtains feature information more helpful for classification, and the experimental results show the method's effectiveness. Oreshkin et al. [71] proposed TADAM (task dependent adaptive metric), which includes a module responsible for metric scaling: the distance scale in the metric function is adjusted adaptively to suit different tasks. The method also links task conditions to the feature extraction process through a task conditioning embedding module, better capturing and expressing the important features of the samples. TADAM significantly improves on the baseline. Li et al. [72] proposed the CTM (category traversal module) method, which integrates support-set categories by traversal; when a new learning task arrives, the most relevant parameters are found for prediction. The attention module in CTM captures the feature information most relevant to the current task, improving prediction accuracy. Yang et al. [73] proposed SEGA (semantic guided attention), an FSL method using semantic guidance and attention. To make the model attend to the most critical feature information in the image, SEGA uses an attention mechanism and a semantic guidance strategy to select input features more finely, so that important regions receive larger weight coefficients, suppressing the influence of noise on classification results. Hou et al. [74] combined a cross attention mechanism with the MAML idea: the attention mechanism searches for the target region in the image and treats it as the important region of the image sample, so that the model can obtain more discriminative and important features for learning.

    Generally speaking, within metric learning strategies, obtaining stronger semantic feature expression of samples often yields higher model performance. To support this claim, we compare methods from the "simple feature information acquisition" and "better feature information acquisition" subcategories under the 5-way 1-shot and 5-way 5-shot settings on the MiniImageNet dataset; the results are given in Table 1. Across the groups in the table, the "better feature information acquisition" methods outperform the "simple feature information acquisition" methods in most cases, further confirming that obtaining a more accurate feature expression of the samples is one of the keys to the success of metric learning methods.

    Table 1.  Comparison of two subcategory methods on the MiniImageNet dataset (%).
    Method                     5-way 1-shot    5-way 5-shot
    Matching Network [59]      43.56 ± 0.84    55.31 ± 0.73
    ProtoNet [60]              49.42 ± 0.78    68.20 ± 0.66
    Relation Network [61]      50.44 ± 0.82    65.32 ± 0.70
    DN4 [62]                   51.24 ± 0.74    71.02 ± 0.64
    IMP [65]                   49.60 ± 0.80    68.10 ± 0.80
    K-tuplet Network [67]      58.30 ± 0.84    72.37 ± 0.63
    STANet [68]                53.11 ± 0.60    67.16 ± 0.66


    Meta-learning based methods follow the idea of learning to learn. The meta-learning strategy first learns initial meta-knowledge from a large number of prior tasks. Through rapid knowledge transfer, it then guides a new task in learning how to learn and in quickly acquiring learning parameters appropriate to the current task.

    In 2016, Santoro et al. [75] proposed MANN, a meta-learning method based on memory-augmented neural networks, to solve the one-shot problem in FSL tasks. Finn et al. [76] proposed the MAML method, a meta-learning method independent of any specific model. Its core goal is to achieve good performance on new tasks with only slight adjustment of the model. Specifically, rather than focusing on task-specific training, MAML aims to make the model adapt quickly to new tasks and learn rapidly when facing tasks of different types. In general, compared with some traditional learning strategies, this method has stronger generalization and robustness. Building on the MAML algorithm, the Reptile method proposed by Nichol et al. [77] updates the meta-learning parameters using only first-order derivatives. Better initialization parameters can be obtained from these first-order updates and used in meta-learning to obtain better convergence. Also based on MAML, Antoniou et al. [78] proposed MAML++ to remedy some of MAML's deficiencies. In short, MAML++ addresses MAML's instability and the limited generality of its network architecture, making the method more generalizable and stable while also improving convergence speed. A sketch of the MAML update is given below.
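    To make the MAML idea [76] concrete, the following sketch performs one meta-update using torch.func (available in PyTorch 2.x); the single inner gradient step and the task tuple format are simplifying assumptions:

```python
import torch
from torch.func import functional_call

def maml_meta_step(model, loss_fn, tasks, inner_lr=0.01):
    """One MAML-style meta-update sketching the core idea of [76].

    tasks: iterable of (support_x, support_y, query_x, query_y) tuples.
    Accumulates the meta-gradient in model.parameters(); an outer
    optimizer step (e.g., Adam) should follow.
    """
    params = dict(model.named_parameters())
    meta_loss = 0.0
    for sx, sy, qx, qy in tasks:
        # Inner loop: one gradient step on the support set.
        support_loss = loss_fn(functional_call(model, params, (sx,)), sy)
        grads = torch.autograd.grad(support_loss, list(params.values()),
                                    create_graph=True)  # keep second-order terms
        adapted = {name: p - inner_lr * g
                   for (name, p), g in zip(params.items(), grads)}
        # Outer objective: adapted parameters must do well on the query set.
        meta_loss = meta_loss + loss_fn(functional_call(model, adapted, (qx,)), qy)
    meta_loss.backward()  # meta-gradient w.r.t. the shared initialization
    return float(meta_loss)
```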

    Ravi and Larochelle [79] proposed a relatively new method, centered on optimization, for the FSL problem. In short, to adapt quickly to a new task, the method first pre-trains on the task and optimizes parameters based on the difference between predictions and the training labels; an LSTM is added to better capture fine-grained features across different classes of samples. Gidaris et al. [80] improved model performance through an attention mechanism; the basic framework is shown in Figure 12. Traditional CNNs must allocate an independent weight vector to each category in the dataset during training and must recompute them for new tasks, which is cumbersome and increases the training burden. This paper instead applies an attention mechanism over the classification weights of the base categories, so that even on a new task the method can reuse previously learned experience and knowledge, strengthening the model's ability to learn quickly on new tasks.

    Figure 12.  The architecture of Ref [80].

    Sun et al. [81] proposed meta-transfer learning, whose core idea is to transfer meta-knowledge by exploiting the similarity between different tasks. The method first uses a prior model to learn this similarity, then uses a meta-learning loss function to predict and measure model performance across tasks; by minimizing this loss, the model can be transferred quickly to new tasks. Ye et al. [82] noted that aggregation should be order-invariant and added a set-to-set function to the FSL task. Embedding this function lets the model adapt its feature mapping across tasks and improves performance on the target task. Its core idea is to adaptively adjust the model's feature mapping in a new feature space according to the given task, so that feature points of same-class image samples lie closer together and samples of different classes lie farther apart. To make the model adapt to new tasks more quickly, Lee et al. [83] combined differentiable convex optimization with meta-learning. The method treats the model parameters and the task itself as variables and uses a differentiable convex model to describe the relationship between them, letting the model release its best performance on new tasks; computational efficiency is also optimized to avoid the cost of retraining a new model. Zhang et al. [84] proposed the meta-navigator method, whose core idea is to search for adaptation strategies using random search: a group of randomly sampled parameters is first trained and then optimized through update iterations. By searching for a set of good parameters, the model can quickly adapt to new tasks with better performance. Aimen et al. [85] proposed a new meta-learning based strategy. They first introduced a batch episodic training task to improve the optimization of the model's learning parameters. They further assumed that within the same training batch, different subtasks should carry different learning weights owing to their different difficulties; the task attention module in this method evaluates the importance of each subtask, giving more important tasks higher weighting coefficients and vice versa. From these task weights, the model learns which subtasks matter more and can better update its overall learning parameters. Extensive experiments on multiple datasets show the effectiveness of this learning strategy.

    In addition to the main strategies above, researchers have adopted other strategies in recent years to improve the performance of few-shot methods. Self-supervised learning (SSL) [86] is a learning paradigm that has become popular in recent years. It is widely pursued because of its distinctive approach: learning without labels or data annotation, which saves considerable human and financial resources. Specifically, SSL can serve as an auxiliary task that mines supervisory information from large-scale unlabeled data. Training the network on such constructed supervisory tasks lets the model learn much additional information useful to downstream tasks and transfer it to them. In SSL tasks, no data annotation is needed, because the labels are derived by the model from the incoming source data. Self-supervision has now been added to a variety of downstream tasks with solid results. The main difference among existing SSL tasks lies in the design of the auxiliary task; common choices include predicting 2D rotations of images [87], image inpainting [88] and contrastive learning tasks [89]. Accordingly, in recent years many researchers have added SSL tasks to FSL and achieved good results.

    Zhang et al. [90] proposed the IEPT (instance-level and episode-level pretext task) method to improve FSL classification performance. IEPT first expands each input sample, turning a single image into four samples: the original image and its 90-, 180- and 270-degree rotations. The authors then add a self-supervised auxiliary task that predicts the rotation applied to the image: if the model can accurately predict how the image was transformed, it has learned the subtle changes within the image. The feature information thus learned is transferred to the FSL task, helping the model learn more accurate, detailed features of the samples and improving performance. A sketch of such a rotation pretext task is given below.
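    A minimal sketch of building such a rotation-prediction batch (this is the generic pretext task, not the full IEPT pipeline; the auxiliary head and the loss weight lambda_aux are assumed components):

```python
import torch
import torch.nn.functional as F

def rotation_pretext_batch(images):
    """Build a 4-way rotation-prediction batch for the auxiliary task.

    images: (B, C, H, W) tensor. Returns the rotated images and labels
    0..3 for rotations of 0/90/180/270 degrees.
    """
    rotated = torch.cat([torch.rot90(images, k, dims=(2, 3)) for k in range(4)])
    labels = torch.arange(4).repeat_interleave(images.size(0))
    return rotated, labels

# Assumed usage: rotation_head is an extra linear classifier on top of the
# shared encoder, and lambda_aux weights the auxiliary loss.
# rotated, labels = rotation_pretext_batch(images)
# aux_loss = F.cross_entropy(rotation_head(encoder(rotated)), labels)
# total_loss = fsl_loss + lambda_aux * aux_loss
```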

    Beyond the methods above, other learning tasks can also be added to FSL as auxiliary tasks. Contrastive learning (CL) learns by comparing the differences between positive and negative samples: the model receives two kinds of input, pulls positive samples closer and pushes negative samples farther apart, and thereby learns detailed, relevant characteristics of the data. Luo et al. [91] obtained better FSL performance by adding CL tasks to FSL. Specifically, to let the model better capture subtle differences between image samples of different classes, different views of the same image are treated as positive samples and views of different images as negative samples; the model is then trained by minimizing the contrastive loss over positive and negative pairs. The experimental results show that this helps distinguish subtle changes and can effectively improve the classification performance of the FSL model. Lee and Yoo [92] used CL to strengthen the feature extractor in the model: supervised contrastive learning in the pre-training phase enhances the model's generalization, after which a meta-learning loss function optimizes the prediction probabilities between categories so that the model can better organize inter-class characteristics. Yang et al. [93] also added CL to the FSL task, optimizing parameters during training by minimizing the classifier's contrastive loss together with the original loss function; they further proposed a contrastive loss that minimizes the distance between similar samples and maximizes the distance between dissimilar ones. In addition, Lu et al. [94] proposed replacing the supervised strategy with an unsupervised one in the pre-training phase of FSL tasks. An unsupervised pre-trained model captures more information about sample representations, and experiments show that adding unsupervised tasks in pre-training can improve model performance. A minimal sketch of a contrastive objective follows.
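    As an illustration of the contrastive objective, here is a minimal InfoNCE-style loss over two views of the same batch; it sketches the general CL principle rather than the exact loss of any method cited above:

```python
import torch
import torch.nn.functional as F

def info_nce_loss(z1, z2, temperature=0.1):
    """InfoNCE-style contrastive loss over two views of the same batch.

    z1, z2: (B, d) embeddings of two augmented views of the same B images.
    Matching rows are positive pairs; every other pair in the batch acts
    as a negative, pushing different images apart in embedding space.
    """
    z1, z2 = F.normalize(z1, dim=-1), F.normalize(z2, dim=-1)
    logits = z1 @ z2.t() / temperature   # (B, B) cosine similarity matrix
    targets = torch.arange(z1.size(0))   # positives lie on the diagonal
    return F.cross_entropy(logits, targets)
```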

    Through the above introduction, we can find that adding some useful auxiliary tasks to FSL can improve the performance of the FSL model.

    In this review, existing FSL methods are divided into four main categories: data augmentation based FSL methods, metric learning based FSL methods, meta-learning based FSL methods and methods adding other auxiliary tasks. According to their main strategies, we divide data augmentation based FSL methods into input image enhancement and feature enhancement methods; considering the complexity of image synthesis, input image enhancement is further subdivided into simple image amplification and complex image synthesis. Within metric learning based FSL, we distinguish methods based on simple feature acquisition from methods based on better feature extraction, according to how finely features are extracted. These methods have contributed greatly to the progress of FSL, but each has advantages and disadvantages; Table 2 gives a general analysis.

    Table 2.  Advantages and disadvantages of the different categories of methods.
    Category | Advantage | Disadvantage
    Data augmentation (input image enhancement: simple image amplification, complex image synthesis; feature enhancement) | Additional feature information can be obtained by expanding the sample data. | It may also introduce noise and have a negative impact.
    Metric learning (simple / better feature information acquisition) | A complex problem is transformed into a simple feature metric problem, and performance can be improved by optimizing the feature representation. | When image samples are too few (e.g., only one), measurement deviation may occur.
    Meta-learning | With just fine-tuning, the model can quickly adapt to new tasks. | The model has high complexity and is relatively difficult to train in practice.
    Adding other auxiliary tasks | Model performance can be improved by adding auxiliary tasks to FSL. | It relies heavily on the design of auxiliary tasks and increases computational consumption.


    From Table 2, we can see that although FSL has achieved good performance, it also has some shortcomings; each of the four categories involves trade-offs. Methods based on data enhancement can achieve better performance by expanding the original samples, but the expansion can introduce noise, which may negatively affect the model. Methods based on metric learning turn a complex classification problem into a simpler distance-measurement problem and can further improve performance by optimizing the feature representation; however, when the sample information is insufficient (for example, in the 5-way 1-shot case), measurement deviation may occur, because an individual sample cannot represent the actual distribution of its class. FSL based on meta-learning can quickly adapt the model to new tasks through better learning, but in some cases its parameters are difficult to tune, and it needs more data and computational power. Methods based on adding other tasks can obtain better performance by transferring knowledge learned from auxiliary objectives; however, they depend heavily on the design of the auxiliary tasks, which is itself a difficult problem, and the extra tasks increase the performance requirements of the model and bring additional computational burden.
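
    To illustrate how metric learning turns classification into distance measurement, the following is a minimal sketch of nearest-prototype classification in the spirit of prototypical networks [60]; the use of class-mean prototypes and squared Euclidean distance follows that paper, but the function signature and tensor layout are our own assumptions.

        import torch

        def prototype_classify(support, support_labels, query, n_way):
            # support: [N*K, D] embedded support images (N classes, K shots)
            # support_labels: [N*K] class indices in [0, n_way)
            # query: [Q, D] embedded query images
            # A class prototype is the mean embedding of its support samples.
            prototypes = torch.stack(
                [support[support_labels == c].mean(dim=0) for c in range(n_way)]
            )                                            # [N, D]
            dists = torch.cdist(query, prototypes) ** 2  # [Q, N] squared distances
            return -dists                                # closer prototype => larger logit

    With only one support sample per class (the 1-shot case), each prototype collapses to that single embedding, which is exactly the measurement-deviation risk noted in Table 2.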

    In this section, we introduce in detail the datasets often used in the FSL classification task: Omniglot [95], CIFAR-FS [96], CUB [97], Stanford Dogs [98], MiniImageNet [59], TieredImageNet [99] and Stanford Cars.

    (1) Omniglot. This dataset consists of handwritten characters from 50 different languages. Specifically, it contains 1623 character classes, each with 20 samples written by 20 different people, so as to ensure some variation within the same character. In experiments, 1200 character classes are usually used for the training set and the remaining 423 classes for the validation set. If the amount of data is considered insufficient, the dataset can also be enlarged by rotating each image by 90,180 and 270 degrees (a sketch of this expansion follows the dataset list).

    (2) CIFAR-FS. The images in this dataset are the same as those in CIFAR-100, but they are repartitioned for FSL. Specifically, the dataset contains 60,000 images of size 32 × 32 pixels in 100 categories, of which 64 categories are used for the training set, 16 for the validation set and 20 for the test set.

    (3) CUB. CUB, also known as CUB-200-2011, contains images of 200 bird species: 11,788 images in total, or about 59 images per species on average. In this dataset, 130 categories are used for the training set, 20 for the validation set and 50 for the test set.

    (4) Stanford Dogs. This dataset is a subset of the large-scale ImageNet dataset. It contains 120 categories of dog images, with an average of 171 images per category and 20,580 images in total. In experiments, it is usually divided into 70 categories for the training set, 20 for the validation set and 30 for the test set.

    (5) MiniImageNet. Following a division style similar to CIFAR-FS, this dataset has 60,000 images in 100 categories with 600 images per category, including 64 categories in the training set, 16 in the validation set and 20 in the test set. The difference is that the images in this dataset have a much higher resolution than those in CIFAR-FS (84 × 84 versus 32 × 32 pixels).

    (6) TieredImageNet. TieredImageNet has 34 major categories and 608 minor categories. In normal FSL tasks, 608 categories are generally used, and each category has an average of 1282 images. The data set is divided as follows: The training set has 351 classes, the validation set has 97 classes, and the test set has 160 classes.

    (7) Stanford Cars. This dataset contains 16,185 images in 196 categories. For training, it is usually divided into 130 categories for the training set, 17 for the validation set and 49 for the test set.
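
    Returning to the rotation-based expansion mentioned for Omniglot in (1), the snippet below follows the common FSL convention of treating each rotated copy as a new character class, quadrupling the class count. This is a hedged sketch: the helper name and data layout are our own assumptions.

        import torchvision.transforms.functional as TF

        def rotated_class_variants(images, base_label, n_classes):
            # images: list of [1, H, W] tensors for one character class.
            # Each rotation angle yields a new class with its own label.
            variants = []
            for k, angle in enumerate((0, 90, 180, 270)):
                new_label = base_label + k * n_classes   # unique label per rotation
                variants += [(TF.rotate(img, angle), new_label) for img in images]
            return variants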

    To give a fuller picture of these datasets, we show example images from some of them in Figure 13. From the above introduction, we can see that these datasets cover a rich variety of categories and can, to a certain extent, test and evaluate the performance of the models proposed by researchers. However, the datasets used for FSL model testing still have some shortcomings. First, they do not cover enough scenarios, whereas the situations encountered in real applications are often more complex. Second, the images in existing datasets are generally clear and of high quality; in practice, due to uneven sampling equipment, collected images may be of low photographic quality (blurred or corrupted by noise), and when data are scarce we may be reluctant to discard them. Existing datasets do not fully account for these situations. Finally, most existing datasets have a relatively balanced data distribution, whereas samples collected in practice may be severely imbalanced across categories. In order to enrich the data in these benchmarks, better test the performance of various methods in different scenarios (including accuracy and generalization ability) and achieve more comprehensive performance testing, future work on creating new datasets could improve on the following points: (1) Increase the diversity of image backgrounds by collecting images of the target object in as many different backgrounds as possible, so as to increase the diversity of images within each category. (2) To reflect the uneven image quality of real scenes, add a certain proportion of noisy images to each category, so as to better test the anti-interference ability of the model. (3) In view of the uneven sample distribution across categories that may occur in real scenes, artificially vary the distribution ratio of samples among categories to better test the generalization ability of the trained model. A minimal sketch of points (2) and (3) is given after Figure 13.

    Figure 13.  Examples of images from some datasets.
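
    As a rough sketch of how points (2) and (3) above could be realized on an existing evaluation split, the snippet below adds Gaussian noise to a fraction of each class's images and artificially imbalances the class sizes; all names and default values are our own assumptions, not an established benchmark protocol.

        import random
        import torch

        def perturb_split(images_by_class, noise_frac=0.2, noise_std=0.1,
                          imbalance=(1.0, 0.1)):
            # images_by_class: {class_id: list of [C, H, W] float tensors in [0, 1]}
            out = {}
            n = len(images_by_class)
            for i, (c, imgs) in enumerate(sorted(images_by_class.items())):
                # Thin class i to a keep-ratio interpolated between the endpoints,
                # producing a long-tail-style imbalance across classes (point (3)).
                keep = imbalance[0] + (imbalance[1] - imbalance[0]) * i / max(n - 1, 1)
                imgs = random.sample(imgs, max(1, int(len(imgs) * keep)))
                # Corrupt a fraction of the kept images with Gaussian noise (point (2)).
                out[c] = [(img + noise_std * torch.randn_like(img)).clamp(0, 1)
                          if random.random() < noise_frac else img
                          for img in imgs]
            return out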

    In this section, to demonstrate the performance of some classic and advanced methods, we report their results in Table 3 on the MiniImageNet and TieredImageNet datasets, which are commonly used to test FSL models. For the FSL task settings, we selected the commonly used 5-way 1-shot and 5-way 5-shot configurations, and the evaluation metric is the Top-1 average accuracy commonly used in FSL classification. From the table, we can see that the more support samples per category, the stronger the final model: for the same method, the 5-way 5-shot accuracy is always better than the 5-way 1-shot accuracy.
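
    For reference, the 5-way 1-shot and 5-way 5-shot settings used in Table 3 are built from episodes sampled roughly as follows. This is a minimal sketch; the function name and the 15-query default are our own assumptions, though both follow common practice.

        import random
        from collections import defaultdict

        def sample_episode(dataset, n_way=5, k_shot=1, q_queries=15):
            # dataset: iterable of (image, class_id) pairs from one split
            # (e.g., the 20 test classes of MiniImageNet).
            by_class = defaultdict(list)
            for img, c in dataset:
                by_class[c].append(img)

            classes = random.sample(list(by_class), n_way)   # choose N classes
            support, query = [], []
            for label, c in enumerate(classes):              # re-index labels 0..N-1
                imgs = random.sample(by_class[c], k_shot + q_queries)
                support += [(img, label) for img in imgs[:k_shot]]
                query += [(img, label) for img in imgs[k_shot:]]
            return support, query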

    Table 3. Performance (%) of multiple methods on the MiniImageNet and TieredImageNet datasets.
    Method | Backbone | MiniImageNet 5-way 1-shot | MiniImageNet 5-way 5-shot | TieredImageNet 5-way 1-shot | TieredImageNet 5-way 5-shot
    Matching Network [59] (NIPS' 2016) | Conv4 | 43.56 ± 0.84 | 55.31 ± 0.73 | - | -
    MAML [76] (ICML' 2017) | Conv4 | 48.70 ± 1.84 | 63.10 ± 0.92 | 51.64 ± 1.81 | 70.30 ± 1.75
    MAML++ [78] (arXiv) | Conv4 | 52.15 ± 0.26 | 68.32 ± 0.44 | - | -
    ProtoNet [60] (NIPS' 2017) | Conv4 | 49.42 ± 0.78 | 68.20 ± 0.66 | 53.31 ± 0.89 | 72.69 ± 0.74
    Reptile [77] (arXiv) | Conv4 | 47.07 ± 0.26 | 62.74 ± 0.37 | - | -
    Relation Network [61] (CVPR' 2018) | Conv4 | 50.44 ± 0.82 | 65.32 ± 0.70 | 54.48 ± 0.93 | 71.32 ± 0.78
    DSN [64] (CVPR' 2020) | Conv4 | 51.78 ± 0.96 | 68.99 ± 0.69 | - | -
    DN4 [62] (CVPR' 2019) | Conv4 | 51.24 ± 0.74 | 71.02 ± 0.64 | - | -
    IMP [65] (ICML' 2019) | Conv4 | 49.60 ± 0.80 | 68.10 ± 0.80 | 53.63 ± 0.51 | 71.89 ± 0.44
    K-tuplet Network [67] (Neurocomputing' 2020) | Conv4 | 58.30 ± 0.84 | 72.37 ± 0.63 | - | -
    STANet [68] (AAAI' 2019) | Conv4 | 53.11 ± 0.60 | 67.16 ± 0.66 | - | -
    MetaOptNet [83] (CVPR' 2019) | ResNet-12 | 62.64 ± 0.61 | 78.63 ± 0.46 | 65.99 ± 0.72 | 81.56 ± 0.53
    Meta Navigator [84] (ICCV' 2021) | ResNet-12 | 65.91 ± 0.83 | 82.66 ± 0.55 | 73.52 ± 0.88 | 85.34 ± 0.62
    DSN [64] (CVPR' 2020) | ResNet-12 | 62.64 ± 0.66 | 78.83 ± 0.45 | 66.22 ± 0.75 | 82.79 ± 0.48
    CMS [100] (WACV' 2021) | ResNet-12 | 66.64 ± 0.28 | 83.63 ± 0.18 | 73.48 ± 0.31 | 87.66 ± 0.20
    "-" indicates that no result was reported.


    At present, the FSL method has made considerable achievements. However, there are still some challenges/difficulties to overcome. In this section, we will introduce the achievements, challenges and future prospects of the FSL method in detail.

    In recent years, the FSL method has made good achievements, which can be summarized as follows:

    (1) Transfer learning: The prior knowledge learned on a large-scale dataset is transferred to new FSL tasks, and the model quickly adapts to the new tasks through this existing knowledge.

    (2) Meta-learning: By making the model learn how to adapt to new tasks, the model can achieve high classification performance with only a small amount of iterative training (a minimal adaptation sketch is given after this list).

    (3) Extended learning: In the process of implementing FSL tasks, the performance of FSL can be improved by integrating other tasks (such as adding SSL as an auxiliary task).

    (4) Data enhancement: Because the FSL task lacks training data, the input samples are enhanced, and the classification performance of the model is improved by expanding the amount of input data.

    (5) Adding attention mechanisms: Many methods improve the performance of the FSL model by adding an attention mechanism, thereby improving the acquisition of feature information about important target objects in the limited available images.

    Through the benefits of the above methods, the FSL method has achieved good results in most public data sets and has been applied well in some actual application scenarios.
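
    The meta-learning entry above can be made concrete with a functional-style sketch of the MAML [76] inner loop, in which a few gradient steps on the support set adapt meta-learned initial weights to a new task. This assumes PyTorch >= 2.0 for torch.func.functional_call, and the hyperparameter defaults are our own assumptions rather than the authors' exact settings.

        import torch

        def maml_inner_adapt(model, loss_fn, support_x, support_y,
                             inner_lr=0.01, inner_steps=5):
            # Clone the meta-parameters so the originals stay untouched
            # for the outer (meta) update.
            params = {name: p.clone() for name, p in model.named_parameters()}
            for _ in range(inner_steps):
                preds = torch.func.functional_call(model, params, (support_x,))
                loss = loss_fn(preds, support_y)
                grads = torch.autograd.grad(loss, list(params.values()),
                                            create_graph=True)  # keep graph for meta-update
                params = {name: p - inner_lr * g
                          for (name, p), g in zip(params.items(), grads)}
            return params  # task-adapted parameters, evaluated on the query set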

    Although the FSL method has achieved good results and made progress in some areas, there are still some shortcomings, which can be summarized as follows:

    (1) Lack of interpretability: At present, it is difficult to explain how a DL-based FSL method selects the parameters the model itself considers important, even though these parameters play a key role in the model. Improving the interpretability of FSL methods would help researchers study them better.

    (2) The mismatch between pre-training and actual tasks: Pre-trained weights/models can speed up training, so that a new model needs only a few iterations to reach high performance. However, in practice, large-scale pre-training datasets that match the actual task are often difficult to obtain (precisely because we assume only a small amount of data is available to learn from). The possible lack of suitable large-scale datasets for pre-training is therefore a problem to be solved.

    (3) Noise in transfer learning: Transfer learning is mostly accomplished by transferring existing knowledge. A problem arises when prior knowledge is transferred to a new task: the transferred knowledge may not all be positive for the new task, and some parts may be negative. Learning this negative content in the new model may reduce its performance.

    (4) Dependence on large datasets: Some transfer-learning-based methods depend heavily on large datasets. In the pre-training stage, the more sample categories there are and the better their quality, the more beneficial it is to subsequent model learning. However, this also deepens the dependence on large datasets; if such a large dataset cannot be found in practice, the performance of the model may suffer. This is another factor that transfer learning strategies should consider.

    (5) The gap between prior knowledge and the current task: In strategies that rely on transfer learning, how well the prior knowledge matches the current task affects the performance of the final trained model. If the prior knowledge differs greatly from the current task, the final model may fail to achieve the expected performance.

    Although FSL has made good achievements at this stage, there are still some difficulties and challenges. For future work, we have the following outlook:

    (1) Dig deeper into the information in the data itself: The reason humans can learn about a target object from a small number of samples is essentially that they learn more accurate details about it. The same is true of FSL. In follow-up work, we hope to introduce attention mechanisms that can find the most important positions in a small number of samples and give the corresponding feature vectors larger weight coefficients (a toy sketch is given after this list).

    (2) Propose better test datasets: As discussed in Section 8.1 of this review, we hope to introduce better datasets to more thoroughly test the classification performance, generalization performance and robustness of proposed FSL methods.

    (3) Propose better model architectures: Although current methods achieve good classification performance in FSL, the generalization performance and robustness of most FSL methods may still be insufficient for most practical situations. To meet the demand for generalization in future practical tasks, we hope better model architectures will be proposed.

    (4) Combine more excellent learning strategies: As Section 6 of this review shows, in the FSL task the model can obtain better performance by combining learning mechanisms from other fields (such as the SSL and contrastive learning tasks in Section 6). It can be seen that FSL improves when combined with strong learning strategies. In future research, we hope to combine strategies from other fields (such as reinforcement learning) with FSL to make it even stronger.

    (5) Enhance the generalization performance of the model: In many cases, a natural doubt arises: FSL methods that perform well on public datasets may be unsatisfactory on other real-world datasets, which hinders the actual deployment of some FSL methods. In future research, we hope more generalizable methods will be proposed that adapt to as many situations as possible.

    (6) Combine multimodal data: In future research, FSL can incorporate multimodal ideas, such as text, voice and other information. By promoting the combination of FSL with other fields, FSL ideas can solve problems in more domains.
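
    As a toy illustration of outlook point (1), the module below scores every spatial position of a convolutional feature map and aggregates the positions with learned weights, so that more informative positions contribute more to the final feature vector. The module name and design are our own assumptions, not a method from the surveyed literature.

        import torch
        import torch.nn as nn

        class AttentionPool(nn.Module):
            def __init__(self, dim):
                super().__init__()
                self.score = nn.Conv2d(dim, 1, kernel_size=1)  # per-position importance

            def forward(self, feat):            # feat: [B, D, H, W]
                b, d, h, w = feat.shape
                weights = torch.softmax(self.score(feat).view(b, -1), dim=1)  # [B, H*W]
                feat = feat.view(b, d, -1)                                    # [B, D, H*W]
                return torch.einsum('bdn,bn->bd', feat, weights)  # weighted sum over positions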

    The success of traditional CNN models is inseparable from the support of large amounts of data. In order to train models with good performance when data are insufficient, researchers proposed FSL methods, which aim to obtain strong learning ability from a small number of samples. FSL is undeniably a challenging task, but through the efforts of many researchers, many powerful methods have been proposed to address it. In general, this paper divided most FSL methods into four categories, namely, methods based on data enhancement, metric learning, meta-learning and the addition of other tasks, and introduced some classical and advanced methods in detail. On this basis, we introduced the overall advantages and disadvantages of these four categories of methods. Then, we described the datasets often used in FSL classification tasks, introduced their basic parameters and showed the performance of some classical and advanced methods on the commonly used MiniImageNet and TieredImageNet datasets, so that readers can understand how these methods perform. Finally, we summarized the achievements in the field of FSL, the challenges it currently faces and its possible future directions.

    The authors declare they have not used artificial intelligence (AI) tools in the creation of this article.

    This research was funded by the Putian Science and Technology Project (2023SZ3001PTXY18).

    The authors declare there is no conflict of interest.



    [1] H. E. Kim, A. Cosa-Linan, N. Santhanam, M. Jannesari, M. E. Maros, T. Ganslandt, Transfer learning for medical image classification: A literature review, BMC Med. Imaging, 22 (2022), 69. https://doi.org/10.1186/s12880-022-00793-7 doi: 10.1186/s12880-022-00793-7
    [2] Z. X. Zou, K. Y. Chen, Z. W. Shi, Y. H. Guo, J. P. Ye, Object detection in 20 years: A survey, Proc. IEEE, 111 (2023), 257–276. https://doi.org/10.1109/JPROC.2023.3238524 doi: 10.1109/JPROC.2023.3238524
    [3] H. Q. Zhao, W. B. Zhou, D. D. Chen, T. Y. Wei, N. H. Yu, Multi-attentional deepfake detection, in 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), IEEE 8 (2021), 2185–2194. https://doi.org/10.1109/CVPR46437.2021.00222
    [4] I. Goodfellow, J. Pouget-Abadie, M. Mirza, B. Xu, D. Warde-Farley, S. Ozair, et al., Generative adversarial nets, in Advances in Neural Information Processing Systems, 27 (2014), 1–9.
    [5] B. Pandey, D. K. Pandey, B. P. Mishra, W. Rhmann, A comprehensive survey of deep learning in the field of medical imaging and medical natural language processing: Challenges and research directions, J. King Saud Univ. Comput. Inf. Sci., 34 (2022), 5083–5099. https://doi.org/10.1016/j.jksuci.2021.01.007 doi: 10.1016/j.jksuci.2021.01.007
    [6] P. Li, X. H. Xu, Recurrent compressed convolutional networks for short video event detection, IEEE Access, 8 (2020), 114162–114171. https://doi.org/10.1109/ACCESS.2020.3003939
    [7] P. Li, Q. H. Ye, L. M. Zhang, L.Yuan, X. H. Xu, L. Shao, Exploring global diverse attention via pairwise temporal relation for video summarization, Pattern Recogn., 111 (2021), 107677. https://doi.org/10.1016/j.patcog.2020.107677 doi: 10.1016/j.patcog.2020.107677
    [8] P. Li, P. Zhang, T. Wang, H. X. Xiao, Time–frequency recurrent transformer with diversity constraint for dense video captioning, Inform. Process. Manag., 60 (2023), 103204. https://doi.org/10.1016/j.ipm.2022.103204 doi: 10.1016/j.ipm.2022.103204
    [9] P. Li, J. C. Cao, L. Yuan, Q. H. Ye, X. H. Xu, Truncated attention-aware proposal networks with multi-scale dilation for temporal action detection, Pattern Recogn., 142 (2023), 109684. https://doi.org/10.1016/j.patcog.2023.109684 doi: 10.1016/j.patcog.2023.109684
    [10] P. Li, Y. Zhang, L. Yuan, H. X. Xiao, B. B. Lin, X. H. Xu, Efficient long-short temporal attention network for unsupervised video object segmentation, Pattern Recogn., 146 (2024), 110078. https://doi.org/10.1016/j.patcog.2023.110078 doi: 10.1016/j.patcog.2023.110078
    [11] K. Feng, J. C. Ji, Y. C. Zhang, Q. Ni, Z. Liu, M. Beer, Digital twin-driven intelligent assessment of gear surface degradation, Mechan. Syst. Signal Process., 186 (2023), 109896. https://doi.org/10.1016/j.ymssp.2022.109896 doi: 10.1016/j.ymssp.2022.109896
    [12] Y. D. Xu, K. Feng, X. A. Yan, R. Q. Yan, Q. Ni, B. B. Sun, et al., CFCNN: A novel convolutional fusion framework for collaborative fault identification of rotating machinery, Inform. Fusion, 95 (2023), 1–16. https://doi.org/10.1016/j.inffus.2023.02.012 doi: 10.1016/j.inffus.2023.02.012
    [13] K. Feng, Y. D. Xu, Y. L. Wang, S. Li, Q. B. Jiang, B. B. Sun, et al., Digital twin enabled domain adversarial graph networks for bearing fault diagnosis, IEEE Transactions on Industrial Cyber-Physical Systems, 1 (2023), 113–122. https://doi.org/10.1109/TICPS.2023.3298879
    [14] O. Russakovsky, J. Deng, H. Su, J. Krause, S. Satheesh, S. Ma, et al., ImageNet large scale visual recognition challenge, Int J Comput Vis, 115 (2015), 211–252. https://doi.org/10.1007/s11263-015-0816-y doi: 10.1007/s11263-015-0816-y
    [15] K. M. He, X. Y. Zhang, S. Q. Ren, J. Sun, Deep residual learning for image recognition, in 2016 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), IEEE, (2016), 770–778. https://doi.org/10.1109/CVPR.2016.90
    [16] A. G. Howard, M. L. Zhu, B. Chen, D. Kalenichenko, W. J. Wang, T. Weyand, et al., MobileNets: Efficient convolutional neural networks for mobile vision applications, preprint, arXiv: 1704.04861.
    [17] X. Y. Zhang, X. Y. Zhou, M. X. Lin, J. Sun, ShuffleNet: An extremely efficient convolutional neural network for mobile devices, in 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), IEEE, (2018), 6848–6856. https://doi.org/10.1109/CVPR.2018.00716
    [18] G. Huang, Z. Liu, L. V. D. Maaten, K. Q. Weinberger, Densely connected convolutional networks, in 2017 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), IEEE, (2017), 2261–2269. https://doi.org/10.1109/CVPR.2017.243
    [19] W. H. Yu, M. Luo, P. Zhou, C. Y. Si, Y. C. Zhou, X. C. Wang, et al., MetaFormer is actually what you need for vision, in 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), IEEE, (2022), 10809–10819. https://doi.org/10.1109/CVPR52688.2022.01055
    [20] Y. P. Chen, X. Y. Dai, D. D. Chen, M. C. Liu, X. Dong, L. Yuan, et al., Mobile-former: Bridging mobilenet and transforme, in 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), IEEE, (2022), 5270–5279. https://doi.org/10.1109/CVPR52688.2022.00520
    [21] Y. T. Vuong, Q. M. Bui, H. Nguyen, T. Nguyen, V. Tran, X. Phan, et al., SM-BERT-CR: A deep learning approach for case law retrieval with supporting model, Artif. Intell. Law, 31 (2023), 601–628. https://doi.org/10.1007/s10506-022-09319-6 doi: 10.1007/s10506-022-09319-6
    [22] J. Deng, W. Dong, R. Socher, L. J. Li, K. Li, F. F. Li, ImageNet: A large-scale hierarchical image database, in 2009 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), IEEE, (2009), 248–255. https://doi.org/10.1109/CVPR.2009.5206848
    [23] T. Y. Lin, M. Maire, S. Belongie, J. Hays, P. Perona, D. Ramanan, et al., Microsoft COCO: Common objects in context, in 2014 European conference computer vision (ECCV), (2014), 740–755. https://doi.org/10.1007/978-3-319-10602-1_48
    [24] J. C. Yang, X. L. Guo, Y. Li, F. Marinello, S. Ercisli, Z. Zhang, A survey of few-shot learning in smart agriculture: developments, applications and challenges, Plant Methods., 18 (2022), 28. https://doi.org/10.1186/s13007-022-00866-2 doi: 10.1186/s13007-022-00866-2
    [25] J. D. Chen, J. X. Chen, D.F. Zhang, Y. D. Sun, Y. A. Nanehkaran, Using deep transfer learning for image-based plant disease identification, Comput. Electron. Agri., 173 (2020), 105393. https://doi.org/10.1016/j.compag.2020.105393 doi: 10.1016/j.compag.2020.105393
    [26] S. Q. Jiang, W. Q. Min, Y. Q. Lyu, L. H. Liu, Few-shot food recognition via multi-view representation learning, ACM Transact. Multi. Comput. Commun. Appl., 16 (2020), 1–20. https://doi.org/10.1145/3391624 doi: 10.1145/3391624
    [27] J. Yang, X. M. Wang, Z. P. Luo, Few-shot remaining useful life prediction based on meta-learning with deep sparse kernel network, Inform. Sci., 653 (2024), 119795. https://doi.org/10.1016/j.ins.2023.119795 doi: 10.1016/j.ins.2023.119795
    [28] Y. Q. Wang, Q. M. Yao, J. T. Kwok, L. M. Ni, Generalizing from a few examples: A survey on few-shot learning, ACM Comput. Surveys, 53 (2020), 1–34. https://doi.org/10.1145/3386252 doi: 10.1145/3386252
    [29] J. Lu, P. H. Gong, J. P. Ye, C. H. Zhang, Learning from very few samples: A survey, preprint, arXiv: 2009.02653.
    [30] X. X. Li, X. C. Yang, Z. Y. Ma, J. H. Xue, Deep metric learning for few-shot image classification: A Review of recent developments, Pattern Recogn., 138 (2023), 109381. https://doi.org/10.1016/j.patcog.2023.109381 doi: 10.1016/j.patcog.2023.109381
    [31] A. Dabouei, S. Soleymani, F. Taherkhani, N. M. Nasrabadi, SuperMix: Supervising the mixing data augmentation, in 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), IEEE, (2021), 13789–13798. https://doi.org/10.1109/CVPR46437.2021.01358
    [32] M. Hong, J. Choi, G. Kim, StyleMix: Separating content and style for enhanced data augmentation, in 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), IEEE, (2021), 14857–14865. https://doi.org/10.1109/CVPR46437.2021.01462
    [33] N. E. Khalifa, M. Loey, S. Mirjalili, A comprehensive survey of recent trends in deep learning for digital images augmentation, Artif. Intell. Rev., 55 (2022), 2351–2377. https://doi.org/10.1007/s10462-021-10066-4 doi: 10.1007/s10462-021-10066-4
    [34] E. D. Cubuk, B. Zoph, D. Mané, V. Vasudevan, Q. V. Le, AutoAugment: Learning augmentation strategies from data, in 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), IEEE, (2019), 113–123. https://doi.org/10.1109/CVPR.2019.00020
    [35] T. DeVries, G. W. Taylor, Improved regularization of convolutional neural networks with cutout, preprint, arXiv: 1708.04552.
    [36] J. Y. Zhu, T. Park, P. Isola, A. A. Efros, Unpaired image-to-image translation using cycle-consistent adversarial networks, in 2017 IEEE International Conference on Computer Vision (ICCV), IEEE, (2017), 2242–2251. https://doi.org/10.1109/ICCV.2017.244
    [37] T. Karras, T. Aila, S. Laine, J. Lehtinen, Progressive growing of GANs for improved quality, stability and variation, preprint, arXiv: 1710.10196.
    [38] Z. T. Chen, Y. W. Fu, Y. X. Wang, L. Ma, W. Liu, M. Hebert, Image deformation meta-networks for one-Shot learning, in 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), IEEE, (2019), 8672–8681. https://doi.org/10.1109/CVPR.2019.00888
    [39] S. Yun, D. Han, S. J. Oh, S. Chun, J. Choe, Y. Yoo, CutMix: Regularization strategy to train strong classifiers with localizable features, in 2019 IEEE/CVF International Conference on Computer Vision (ICCV), IEEE, (2019), 6022–6031. https://doi.org/10.1109/ICCV.2019.00612
    [40] S. Khodadadeh, L. Boloni, M. Shah, Unsupervised meta-learning for few-shot image classification, in 2019 Advances in Neural Information Processing Systems (NIPS), (2019).
    [41] A. Antoniou, A. Storkey, Assume, augment and learn: Unsupervised few-shot meta-learning via random labels and data augmentation, preprint, arXiv: 1902.09884.
    [42] T. X. Qin, W. B. Li, Y. H. Shi, Y. Gao, Diversity helps: Unsupervised few-shot learning via distribution shift-based data augmentation, preprint, arXiv: 2004.05805.
    [43] H. Xu, J. X. Wang, H. Li, D. Q. Ouyang, J. Shao, Unsupervised meta-learning for few-shot learning, Pattern Recogn., 116 (2021), 107951. https://doi.org/10.1016/j.patcog.2021.107951 doi: 10.1016/j.patcog.2021.107951
    [44] M. Tao, H. Tang, F. Wu, X. Y. Jing, B. K. Bao, C. S. Xu, DF-GAN: A simple and effective baseline for text-to-image synthesis, in 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), IEEE, (2022), 16494–16504. https://doi.org/10.1109/CVPR52688.2022.01602
    [45] W. T. Liao, K. Hu, M. Y. Yang, B. Rosenhahn, Text to image generation with semantic-spatial aware GAN, in 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), IEEE, (2022), 18166–18175. https://doi.org/10.1109/CVPR52688.2022.01765
    [46] X. T. Wu, H. B. Zhao, L. L. Zheng, S. H. Ding, X. Li, Adma-GAN: Attribute-driven memory augmented GANs for text-to-image generation, in Proceedings of the 30th ACM International Conference on Multimedia, ACM, (2022), 1593–1602. https://doi.org/10.1145/3503161.3547821
    [47] A. Mehrotra, A. Dukkipati, Generative adversarial residual pairwise networks for one shot learning, preprint, arXiv: 1703.08033.
    [48] Y. X. Wang, R. Girshick, M. Hebert, B. Hariharan, Low-shot learning from imaginary data, in 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), IEEE, (2018), 7278–7286. https://doi.org/10.1109/CVPR.2018.00760
    [49] R. X. Zhang, T. Che, Z. Ghahramani, Y. Bengio, Y. Q. Song, MetaGAN: An adversarial approach to few-Shot learning, in 2018 Advances in Neural Information Processing Systems (NIPS), (2018).
    [50] E. Schwartz, L. Karlinsky, J. Shtok, S. Harary, M. Marder, A. Kumar, et al., Delta-encoder: an effective sample synthesis method for few-shot object recognition, in 2018 Advances in Neural Information Processing Systems (NIPS), (2018).
    [51] Y. Q. Xian, S. Sharma, B. Schiele, Z. Akata, F-VAEGAN-D2: A Feature generating framework for any-shot learning, in 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), IEEE, (2019), 10267–102765. https://doi.org/10.1109/CVPR.2019.01052
    [52] K. Li, Y. L. Zhang, K. P. Li, Y. Fu, Adversarial feature hallucination networks for few-shot learning, in 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), IEEE, (2020), 13467–13476. https://doi.org/10.1109/CVPR42600.2020.01348
    [53] F. Pahde, P. Jähnichen, T. Klein, M. Nabi, Cross-modal hallucination for few-shot fine-grained recognition, preprint, arXiv: 1806.05147.
    [54] M. Dixit, R. Kwitt, M. Niethammer, N. Vasconcelos, AGA: Attribute-guided augmentation, in 2017 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), IEEE, (2017), 3328–3336. https://doi.org/10.1109/CVPR.2017.355
    [55] B. Liu, X. D. Wang, M. Dixit, R. Kwitt, N. Vasconcelos, Feature space transfer for data augmentation, in 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), IEEE, (2018), 9090–9098. https://doi.org/10.1109/CVPR.2018.00947
    [56] Z. T. Chen, Y. W. Fu, Y. D. Zhang, Y. G. Jiang, X. Y. Xue, L. Sigal, Multi-level semantic feature augmentation in few-shot learning, preprint, arXiv: 1804.05298.
    [57] H. G. Zhang, J. Zhang, P. Koniusz, Few-shot learning via saliency-guided hallucination of samples, in 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), IEEE, (2019), 2765–2774. https://doi.org/10.1109/CVPR.2019.00288
    [58] G. Koch, R. Zemel, R. Salakhutdinov, Siamese neural networks for one-shot image recognition, in 2015 International Conference on Machine Leaning (ICML), (2015).
    [59] O. Vinyals, C. Blundell, T. Lillicrap, K. Kavukcuoglu, D. Wierstra, Matching networks for one shot learning, in 2016 Advances in Neural Information Processing Systems (NIPS), (2016).
    [60] J. Snell, K. Swersky, R. Zemel, Prototypical networks for few-shot learning, in 2017 Advances in Neural Information Processing Systems (NIPS), (2017).
    [61] F. Sung, Y. X. Yang, L. Zhang, T. Xiang, P. H. S. Torr, T. M. Hospedales, Learning to compare: Relation network for few-shot learning, in 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), IEEE, (2018), 1199–1208. https://doi.org/10.1109/CVPR.2018.00131
    [62] W. B. Li, L. Wang, J. L. Xu, J. Huo, Y. Gao, J. B. Luo, Revisiting local descriptor based image-to-class measure for few-shot learning, in 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), IEEE, (2019), 7253–7260. https://doi.org/10.1109/CVPR.2019.00743
    [63] Y. B. Liu, J. H. Lee, M. Park, S. Kim, E. Yang, S. J. Hwang, et al., Learning to propagate labels: Transductive propagation network for few-shot learning, preprint, arXiv: 1805.10002.
    [64] C. Simon, P. Koniusz, R. Nock, M. Harandi, Adaptive Subspaces for Few-Shot Learning, in 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), IEEE, (2020), 4135–4144. https://doi.org/10.1109/CVPR42600.2020.00419
    [65] K. Allen, E. Shelhamer, H. Shin, J. Tenenbaum, Infinite mixture prototypes for few-shot learning, in 2019 International Conference on Machine Leaning (ICML), (2019), 232–241.
    [66] C. Xing, N. Rostamzadeh, B. Oreshkin, P. O. O. Pinheiro, Adaptive cross-modal few-shot learning, in 2019 Advances in Neural Information Processing Systems (NIPS), (2019).
    [67] X. M. Li, L. Q. Yu, C. W. Fu, M. Fang, P.-A. Heng, Revisiting metric learning for few-shot image classification, Neurocomputing, 406 (2020), 49–58. https://doi.org/10.1016/j.neucom.2020.04.040 doi: 10.1016/j.neucom.2020.04.040
    [68] S. P. Yan, S. Y. Zhang, X. M. He, A dual attention network with semantic embedding for few-shot learning, in 2019 Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), (2019), 9079–9086. https://doi.org/10.1609/aaai.v33i01.33019079
    [69] P. Li, G. P. Zhao, X. H. Xu, Coarse-to-fine few-shot classification with deep metric learning, Inform. Sci., 610 (2022), 592–604. https://doi.org/10.1016/j.ins.2022.08.048 doi: 10.1016/j.ins.2022.08.048
    [70] T. Y. Gao, X. Han, Z. Y. Liu, M. S. Sun, Hybrid attention-based prototypical networks for noisy few-shot relation classification, in 2019 Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), (2019), 6407–6414. https://doi.org/10.1609/aaai.v33i01.33016407
    [71] B. Oreshkin, P. R. López, A. Lacoste, TADAM: Task dependent adaptive metric for improved few-shot learning, in 2018 Advances in Neural Information Processing Systems (NIPS), (2018).
    [72] H. Y. Li, D. Eigen, S. Dodge, M. Zeiler, X. G. Wang, Finding task-relevant features for few-shot learning by category traversal, in 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), IEEE, (2019), 1–10. https://doi.org/10.1109/CVPR.2019.00009
    [73] F. Y. Yang, R. P. Wang, X. L. Chen, SEGA: Semantic guided attention on visual prototype for few-shot learning, in 2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), IEEE, (2022), 1586–1596. https://doi.org/10.1109/WACV51458.2022.00165
    [74] R. B. Hou, H. Chang, B. P. Ma, S. G. Shan, X. L. Chen, Cross attention network for few-shot classification, in 2019 Advances in Neural Information Processing Systems (NIPS), (2019).
    [75] A. Santoro, S. Bartunov, M. Botvinick, D. Wierstra, T. Lillicrap, One-shot learning with memory-augmented neural networks, preprint, arXiv: 1605.06065.
    [76] C. Finn, P. Abbeel, S. Levine, Model-agnostic meta-learning for fast adaptation of deep networks, in 2017 International Conference on Machine Leaning (ICML), (2017), 1126–1135.
    [77] A. Nichol, J. Achiam, J. Schulman, On first-order meta-learning algorithms, preprint, arXiv: 1803.02999.
    [78] A. Antoniou, H. Edwards, A. Storkey, How to train your MAML, preprint, arXiv: 1810.09502.
    [79] S. Ravi, H. Larochelle, Optimization as a model for few-shot learning, in 2017 International Conference on Learning Representations (ICLR), (2017).
    [80] S. Gidaris, N. Komodakis, Dynamic few-shot visual learning without forgetting, in 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), IEEE, (2018), 4367–4375. https://doi.org/10.1109/CVPR.2018.00459
    [81] Q. R. Sun, Y. Y. Liu, T. S. Chua, B. Schiele, Meta-transfer learning for few-shot learning, in 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), IEEE, (2019), 403–412. https://doi.org/10.1109/CVPR.2019.00049
    [82] H. J. Ye, H. X. Hu, D. C. Zhan, F. Sha, Few-shot learning via embedding adaptation with set-to-set functions, in 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), IEEE, (2020), 8805–8814. https://doi.org/10.1109/CVPR42600.2020.00883
    [83] K. Lee, S. Maji, A. Ravichandran, S. Soatto, Meta-learning with differentiable convex optimization, in 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), IEEE, (2019), 10649–10657. https://doi.org/10.1109/CVPR.2019.01091
    [84] C. Zhang, H. H. Ding, G. S. Lin, R. B. Li, C. H. Wang, C. H. Shen, Meta navigator: Search for a Good Adaptation Policy for Few-shot Learning, in 2021 IEEE/CVF International Conference on Computer Vision (ICCV), IEEE, (2021), 9415–9424. https://doi.org/10.1109/ICCV48922.2021.00930
    [85] A. Aimen, S. Sidheekh, N. C. Krishnan, Task attended meta-learning for few-shot learning, preprint, arXiv: 2106.10642.
    [86] R. Krishnan, P. Rajpurkar, E. J. Topol, Self-supervised learning in medicine and healthcare, Nat. Biomed. Eng., 6 (2022), 1346–1352. https://doi.org/10.1038/s41551-022-00914-1 doi: 10.1038/s41551-022-00914-1
    [87] S. Gidaris, P. Singh, N. Komodakis, Unsupervised representation learning by predicting image rotations, preprint, arXiv: 1803.07728.
    [88] W. X. Wang, J. Li, H. Ji, Self-supervised deep image restoration via adaptive stochastic gradient langevin dynamics, in 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), IEEE, (2022), 1979–1988. https://doi.org/10.1109/CVPR52688.2022.00203
    [89] H. Q. Wang, X. Guo, Z. H. Deng, Y. Lu, Rethinking minimal sufficient representation in contrastive learning, in 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), IEEE, (2022), 16020–16029. https://doi.org/10.1109/CVPR52688.2022.01557
    [90] M. L. Zhang, J. H. Zhang, Z. W. Lu, T. Xiang, M. Y. Ding, S. F. Huang, IEPT: Instance-level and episode-level pretext tasks for few-shot learning, in 2021 International Conference on Learning Representations (ICLR), (2021).
    [91] X. Luo, Y. X. Chen, L. J. Wen, L. L. Pan, Z. L. Xu, Boosting few-shot classification with view-learnable contrastive learning, in 2021 IEEE International Conference on Multimedia and Expo (ICME), IEEE, (2021), 1–6. https://doi.org/10.1109/ICME51207.2021.9428444
    [92] T. Lee, S. Yoo, Augmenting few-shot learning with supervised contrastive learning, IEEE Access, 9 (2021), 61466–61474. https://doi.org/10.1109/ACCESS.2021.3074525 doi: 10.1109/ACCESS.2021.3074525
    [93] Z. Y. Yang, J. H. Wang, Y. Y. Zhu, Few-shot classification with contrastive learning, in 2022 European conference computer vision (ECCV), (2022), 293–309. https://doi.org/10.1007/978-3-031-20044-1_17
    [94] Y. N. Lu, L. J. Wen, J. Z. Liu, Self-supervision can be a good few-shot learner, in 2022 European conference computer vision (ECCV), (2022), 740–758. https://doi.org/10.1007/978-3-031-19800-7_43
    [95] S. Fort, Gaussian prototypical networks for few-shot learning on omniglot, preprint, arXiv: 1708.02735.
    [96] L. Bertinetto, J. F. Henriques, P. H. S. Torr, A. Vedaldi, Meta-learning with differentiable closed-form solvers, preprint, arXiv: 1805.08136.
    [97] C. Wah, S. Branson, P. Welinder, P. Perona, S. Belongie, The Caltech-UCSD Birds-200-2011 dataset: Technical report CNS-TR-2011-001, (2011), 1–8.
    [98] A. Khosla, N. Jayadevaprakash, B. P. Yao, F. F. Li, Novel dataset for fine-grained image categorization: Stanford Dogs, in CVPR Workshop on Fine-Grained Visual Categorization, (2011).
    [99] M. Y. Ren, E. Triantafillou, S. Ravi, J. Snell, K. Swersky, J. B. Tenenbaum, et al., Meta-learning for semi-supervised few-shot classification, preprint, arXiv: 1803.00676.
    [100] G. Liu, L. L. Zhao, W. Li, D. S. Guo, X. Z. Fang, Class-wise metric scaling for improved few-shot classification, in 2021 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), IEEE, (2021), 586–595. https://doi.org/10.1109/WACV48630.2021.00063
    © 2024 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0).