
The Task Force on Climate-related Financial Disclosures (TCFD) provides an industry-led framework for the calculation of investment portfolios' carbon footprint metrics aimed to assess their carbon risk exposure. This is the framework that financial institutions (including fund managers) in the European Union should consider when disclosing their sustainability reports according to the EU's Corporate Sustainability Reporting Directive (CSRD). However, currently, global fund data providers either do not publicly offer such metrics (e.g., Morningstar) or do so partly by considering only investees' Scope 1 and 2 carbon emissions in their calculation (e.g., MSCI). In this paper, we analyse how informative the TCFD's fund metrics computed from investees' emissions along their full value chain (including Scope 1, Scope 2, and Scope 3 greenhouse gas (GHG) emissions) are for investors specifically committed to climate change. To that end, we collected reported emissions by Spanish equity funds' investees (from the sustainability reports issued by every investee) as our primary data source and employ a hybrid environmentally extended multiregional input–output model (hybrid EE-MRIO) to fill in the missing data, mainly for Scope 3. We show that disregard for Scope 3 emissions leads to a wrong identification of funds with low/medium-low exposure to carbon risk. The evaluation of funds' risk-adjusted financial performance further indicates that funds with medium-low exposure to carbon risk outperform funds more exposed to carbon emissions. Finally, we find that funds' WACI (weighted average carbon intensity) including Scope 1, 2, and 3 upstream emissions allows for a deeper screening of funds by carbon risk exposure than the Morningstar portfolio carbon risk score.
Citation: Luis Antonio López, Raquel López. What can green investors learn from funds' value chain carbon footprint? Evidence from Spain[J]. Green Finance, 2025, 7(2): 332-362. doi: 10.3934/GF.2025012
[1] | Yufeng Qian . Exploration of machine algorithms based on deep learning model and feature extraction. Mathematical Biosciences and Engineering, 2021, 18(6): 7602-7618. doi: 10.3934/mbe.2021376 |
[2] | Yu Li, Meilong Zhu, Guangmin Sun, Jiayang Chen, Xiaorong Zhu, Jinkui Yang . Weakly supervised training for eye fundus lesion segmentation in patients with diabetic retinopathy. Mathematical Biosciences and Engineering, 2022, 19(5): 5293-5311. doi: 10.3934/mbe.2022248 |
[3] | Keying Du, Liuyang Fang, Jie Chen, Dongdong Chen, Hua Lai . CTFusion: CNN-transformer-based self-supervised learning for infrared and visible image fusion. Mathematical Biosciences and Engineering, 2024, 21(7): 6710-6730. doi: 10.3934/mbe.2024294 |
[4] | Qiao Pan, Chen Huang, Dehua Chen . A method based on multi-standard active learning to recognize entities in electronic medical record. Mathematical Biosciences and Engineering, 2021, 18(2): 1000-1021. doi: 10.3934/mbe.2021054 |
[5] | Yue Li, Hongmei Jin, Zhanli Li . A weakly supervised learning-based segmentation network for dental diseases. Mathematical Biosciences and Engineering, 2023, 20(2): 2039-2060. doi: 10.3934/mbe.2023094 |
[6] | Xiaobo Zhang, Donghai Zhai, Yan Yang, Yiling Zhang, Chunlin Wang . A novel semi-supervised multi-view clustering framework for screening Parkinson's disease. Mathematical Biosciences and Engineering, 2020, 17(4): 3395-3411. doi: 10.3934/mbe.2020192 |
[7] | Yanghan Ou, Siqin Sun, Haitao Gan, Ran Zhou, Zhi Yang . An improved self-supervised learning for EEG classification. Mathematical Biosciences and Engineering, 2022, 19(7): 6907-6922. doi: 10.3934/mbe.2022325 |
[8] | Zhanhong Qiu, Weiyan Gan, Zhi Yang, Ran Zhou, Haitao Gan . Dual uncertainty-guided multi-model pseudo-label learning for semi-supervised medical image segmentation. Mathematical Biosciences and Engineering, 2024, 21(2): 2212-2232. doi: 10.3934/mbe.2024097 |
[9] | Ruoqi Zhang, Xiaoming Huang, Qiang Zhu . Weakly supervised salient object detection via image category annotation. Mathematical Biosciences and Engineering, 2023, 20(12): 21359-21381. doi: 10.3934/mbe.2023945 |
[10] | Jingyao Liu, Qinghe Feng, Yu Miao, Wei He, Weili Shi, Zhengang Jiang . COVID-19 disease identification network based on weakly supervised feature selection. Mathematical Biosciences and Engineering, 2023, 20(5): 9327-9348. doi: 10.3934/mbe.2023409 |
The Task Force on Climate-related Financial Disclosures (TCFD) provides an industry-led framework for the calculation of investment portfolios' carbon footprint metrics aimed to assess their carbon risk exposure. This is the framework that financial institutions (including fund managers) in the European Union should consider when disclosing their sustainability reports according to the EU's Corporate Sustainability Reporting Directive (CSRD). However, currently, global fund data providers either do not publicly offer such metrics (e.g., Morningstar) or do so partly by considering only investees' Scope 1 and 2 carbon emissions in their calculation (e.g., MSCI). In this paper, we analyse how informative the TCFD's fund metrics computed from investees' emissions along their full value chain (including Scope 1, Scope 2, and Scope 3 greenhouse gas (GHG) emissions) are for investors specifically committed to climate change. To that end, we collected reported emissions by Spanish equity funds' investees (from the sustainability reports issued by every investee) as our primary data source and employ a hybrid environmentally extended multiregional input–output model (hybrid EE-MRIO) to fill in the missing data, mainly for Scope 3. We show that disregard for Scope 3 emissions leads to a wrong identification of funds with low/medium-low exposure to carbon risk. The evaluation of funds' risk-adjusted financial performance further indicates that funds with medium-low exposure to carbon risk outperform funds more exposed to carbon emissions. Finally, we find that funds' WACI (weighted average carbon intensity) including Scope 1, 2, and 3 upstream emissions allows for a deeper screening of funds by carbon risk exposure than the Morningstar portfolio carbon risk score.
Supervised learning is an important branch of machine learning. In supervised multi-classification problems, each sample is assigned a label which indicates the category it belongs to [1]. Supervised learning is effective when there are enough samples with high quality labels. However, it is expensive and time-consuming to build datasets with a multitude of accurate labels. To solve this problem, researchers have proposed a series of weakly supervised learning (WSL) methods, which aim to train models with partial, incomplete or inaccurate supervised information, such as noise-label learning [2,3,4,5], semi-supervised learning [6,7,8,9], partial-label learning [10,11,12], positive-confidence learning [13], unlabeled-unlabeled learning [14] and others.
In this paper, we consider another WSLframework called complementary label learning (CLL). We show the difference between complemtary labels and true labels in Figure 1. Compared to an ordinary label, a complementary label indicates the class that the sample does not belong to. Obviously, it is easier and less costly to collect these complementary labels. For example, in some very specialized domains, the expert knowledge is very expensive. If complementary labels are used for annotation, we need to only determine the extent of the label space and then use common sense to determine which category is wrong. It is much simpler and faster to determine which class a sample does not belong to than it belongs to. Besides, CLL can also protect data privacy in some sensitive fields like medical and financial records because we no longer need to disclose the true information of the data. This not only protects data privacy and security, but also makes it easier to collect data in these areas.
The framework of CLL was first proposed by Ishida et al. [15]. They proved that the unbiased risk estimator (URE) only from complementary labels is equivalent to the ordinary classification risk when the loss function satisfies certain conditions. In URE, the loss function must be nonconvex and symmetric which leads to certain limitations. To overcome this limitation, Yu et al. [16] made cross-entropy loss usable in CLL by constructing a complementary label transition matrix, and they also considered that different labels had different probability of being selected as a complementary label. Then, Ishida et al. [17] expanded URE and proposed a CLL framework adapted to more general loss functions. This framework still has an unbiased estimator of the regular classification risk, but it works for all loss functions. Chou et al. [18] optimized URE from gradient estimation, and proposed that using surrogate complementary loss (SCL) to obtain unbiased risk estimation, which effectively alleviated the problem of overfitting in URE. Liu et al. [19] applied common losses such as categorical cross entropy (CCE), mean square error (MSE) and mean absolute error (MAE) to CLL. Ishiguro et al. [20] conducted a study on the problem that complementary labels may be affected by label noise. To mitigate its adverse effects, they selected losses with noise robustness which satisfied weighted symmetric condition or a more relaxed condition. Recently, Zhang et al. [21] broadened the setting of complementary label datasets and discussed the case that the datasets contained a large number of complementary labels and a small number of true labels at the same time. They proposed an adversarial complementary label learning network, named Clarinet. Clarinet consists of two deep neural networks, one to classify complementary labels and true labels, and the other to learn from complementary labels.
Previous studies on CLL always focus on rewriting the classification risk under the ordinary label distribution to the risk under the complementary label distribution and exploring the use of more loss functions [15,16,17,18,19]. These rewriting risk techniques prove the consistency relationship between the risk of complementary label classification and the risk of supervised classification. This enables the classifier to perform accurate classification using only the complementary labels. However, in this process, only complementary labels are involved in the risk calculation, and the information contained in them is extremely limited, which results in consistently lower performance of CLL compared to supervised learning. Therefore, we aim to enhance the supervision information of the complementary labels to further improve the performance of CLL. In this paper, we propose a two-step complementary label enhancement framework based on knowledge distillation (KDCL). It consists of the following components: 1) a teacher model trained on complementary label dataset to generate soft labels which contain more supervision information as label distribution; 2) a student model trained on the same dataset to learn from both soft labels and complementary labels; 3) a final loss function to integrate loss from soft labels and complementary labels and update parameters of the student model. We use three CLL loss functions to conduct experiments on several benchmark datasets, and compare the accuracy of the student model before and after enhancement by KDCL. The experimental results show that KDCL can effectively improve the performance of CLL.
Supposing that the input sample is a d-dimensional vector x∈Rd with class labels y∈{1,2,...,K}, where K stands for K classes in the dataset. Giving a training set D={(xi,yi)}Ni=1 with N samples, all of which independently follow the same distribution p(x,y). The goal of learning from true labels is to learn a mapping relation f(x) from the sample space Rd to the label space {1,2,...,K} and f(x) is also called a classifier. We want f(x) to minimize the multi-class classification risk:
R(f)=Ep(x,y)∼D[L(f(x),y)], | (1) |
where L(f(x),y) is multi-class loss function, f(x) is usually obtained by the following equation:
f(x)=argmaxy∈1,2,…,Kgy(x), | (2) |
where g(x):Rd→RK. In deep neural networks, g(x) is the prediction distribution of the output from the last fully connected layer.
In general, distribution p(x,y) is unknown. We can use the sample mean to approximate the classification risk in Eq (1). R(f) is empirically estimated as ˆR(f):
ˆR(f)=1Nn∑i=1L(f(xi),yi), | (3) |
where N is the number of training data and i is the i-th sample.
In CLL, each sample x is assigned only one complementary label ˉy. Therefore, the dataset is switched from D={(xi,yi)}Ni=1 to ˉD={(xi,ˉyi)}Ni=1, where ˉy∈{1,2,...,K}∖{y} and D≠ˉD. ˉD independently follow an unknown distribution ˉp(x,ˉy). If all complementary labels are selected in an unbiased way, which means that they have the same probability of being chosen, ˉp(x,ˉy) can be presented as:
ˉp(x,ˉy)=1K−1∑y≠ˉyp(x,y). | (4) |
Supposing that ˉL(f(x),ˉy) is complementary loss function, we can obtain similar multi-class risk as Eq (1) in distribution ˉp(x,ˉy):
ˉR(f)=Eˉp(x,ˉy)∼ˉD[ˉL(f(x),ˉy)]. | (5) |
To our best knowledge, Ishida et al. [15] are the first to prove that the difference between Eq (1) and Eq (5) is constant when the loss function ˉL satisfies certain conditions and this constant M only depends on the number of categories K:
R(f)=(K−1)Eˉp(x,ˉy)∼ˉD[ˉL(f(x),ˉy)]+M=(K−1)ˉR(f)+M. | (6) |
All coefficients are constant when the loss function satisfies the condition. So it is possible to learn from complementary labels by minimizing R(f) in Eq (6). Then, they rewrite one-versus-all (OVA) loss LOVA and pairwise-comparison (PC) loss LPC in ordinary multi-class classification as ˉLOVA and ˉLPC in CLL:
ˉLOVA(g(x),ˉy)=1K−1∑y≠ˉyl(gy(x))+l(−gˉy(x)),ˉLPC(g(x),ˉy)=∑y≠ˉyl(gy(x)−gˉy(x)), | (7) |
where l(z):R→R is a binary loss and it must be nonconvex and symmetric, such as sigmoid loss. g(x) is the same as Eq (2) and gy(x) is the y-th element of g(x). Finally, the unbiased risk estimator of R(f) can be obtained by sample mean:
ˆR(f)(K−1)NN∑n=1ˉL(f(xn),ˉyn)+M. | (8) |
Although it is feasible to learn a classifier that minimizes Eq (8) from complementary labels, the restriction on the loss function limits the application of URE. Yu et al. [16] analyze the relationship between ordinary and complementary labels in terms of conditional probability:
P(ˉy=j|x)=∑i≠jP(ˉy=j|y=i)P(y=i|x), | (9) |
where ∀i,j∈{1,2,…,K}. When all complementary labels are selected in an unbiased way, P(ˉy|y) can be expressed as a transition matrix Q:
Q=[0⋯1K−1⋮⋱⋮1K−11K−10]K×K, | (10) |
where each element in Q represents P(ˉy=j|y=i). Since the true label and the complementary label of the sample are mutually-exclusive, that is P(ˉy=j|y=i)=0. Therefore, the entries on the diagonal of the matrix are 0.
Combining Eqs (5), (9) and (10), we can rewrite ˉR(f) as:
ˉR(f)=Eˉp(x,ˉy)[LCE(QTg(x),ˉy)], | (11) |
where LCE is cross-entropy loss which is widely used in deep learning. The classification risk ˉR(f) in Eq (8) is also consistent with the ordinary classification risk R(f) [16].
In image classification, outputs from the last fully connected layer of a deep neural network contain the predicted probability distribution of all classes after the Softmax function. Comparing with a single logical label, the outputs carry more information. Hinton et al. [22] define the outputs as soft labels and propose a knowledge distillation framework. We draw on the idea of knowledge distillation and hope to improve the performance of CLL by enhancing complementary labels through soft labels.
In the framework of knowledge distillation, Hinton et al. [22] modify the Softmax function and they introduce the parameter T to control the smoothness of soft labels. The ordinary Softmax function can be expressed as follows:
y'i=exp(yi)∑jexp(yj), | (12) |
where y'i is the predicted probability of the i-th class, exp(⋅) is the exponential function and yi is the predicted output of the classification network for the ith class. The Softmax function combines the prediction outputs of the model for all classes, and uses the exponential function to normalize the output values in the interval [0, 1].
The rewritten Softmax function is as follows:
y'i=exp(yi╱T)∑jexp(yi╱T). | (13) |
We present a comparison of the smoothness of soft labels for different T in Figure 2. As T gradually increases, soft labels will become smoother. Actually, T regulates the degree to the attention to the negative labels. The higher T, the more attention is paid to negative labels. T is an adjustable hyperparameter during training.
For one sample, soft labels not only clarify its correct category, but also contain the correlation between other labels. More abundant information is carried in soft labels than the complementary label. If we add an extra term to the ordinary supplementary label classification loss and introduce soft labels as additional supervision information, CLL will perform better than using only complementary labels. Of course, we need a model with high accuracy to produce soft labels, which will make the soft labels more credible. This model is also trained by complementary labels.
Taking advantage of this property, we propose KDCL, a complementary label learning framework based on knowledge distillation. The overall structure is shown in Figure 3.
KDCL is a two-stage training framework consisting of a more complex teacher model with higher accuracy and a simpler student model with lower accuracy. First, the teacher model is trained with complementary labels on the dataset and predicts all samples in the training set. The prediction results are normalized by the Softmax function with T=t(t>1) to generate soft labels Stea. Second, the student model is trained and its outputs are processed in two ways, one to produce the soft prediction results Sstu with T=t(t>1), and the other to output ordinary prediction results Pstu with T=1. Then, the KL divergence between Stea and Sstu is calculated, and the complementary label loss between Pstu and the complementary labels is calculated at the same time. The two losses are weighted to obtain the final distillation loss. Finally, parameters of the student model will be updated by the final loss.
In KDCL, the final loss consists of Kullback-Leible (KL) loss and complementary loss. On the one hand, the student model needs to learn knowledge from the teacher model to improve its ability. On the other hand, the teacher model is not completely correct, and the student model also needs to learn by itself to reduce the influence of the teacher model’s errors on the learning process. It is better to consider both of them.
The final distillation loss consists of two parts and it can be expressed as follows:
LKDCL=αLKL+LCL, | (14) |
where LKL denotes the KL divergence and LCL denotes the complementary loss. Given the probability distributions pt from the teacher model and ps from the student model, their KL divergence can be expressed as follows:
LKL(pt,ps)=∑i−ptilogpsipti, | (15) |
where i denotes the i-th element in tensor pt or ps.
We select three complementary losses for KDCL. They are the PC loss proposed by Ishida et al. [15], FWD loss proposed by Yu et al. [16] and SCL-NL loss proposed by Chou et al. [18]. Supposing that ps is the probability distribution for sample x from the student model and ˉy is the complementary label of x, these complementary losses are shown in Eqs (16)–(18).
ˉLPC(ps,ˉy)=K−1n∑y≠ˉy(psy−psˉy)−K×(K−1)2+K−1, | (16) |
ˉLFWD(ps,ˉy)=−∑iˉyi×log(QT×psi), | (17) |
ˉLSCL−NL(ps,ˉy)=∑iˉyi×(−log(1−psˉy)), | (18) |
where K denotes the number of categories of the dataset, and QT denotes the transpose of Q which is a K×K square matrix with all entries 1/(K−1) except the diagonal.
With parameters pt,psandˉy, the final loss can be expressed in more detail as follows:
LKD−PC(pt,ps,ˉy)=αLKL(pt,ps)+ˉLPC(ps,ˉy) | (19) |
LKD−FWD(pt,ps,ˉy)=αLKL(pt,ps)+ˉLFWD(ps,ˉy) | (20) |
LKD−SCL(pt,ps,ˉy)=αLKL(pt,ps)+ˉLSCL−NL(ps,ˉy) | (21) |
α is the weighting factor, which is used to control the degree of influence of soft labels on the overall classification loss. The values of α will be determined in the experiment.
We evaluate and compare the student models optimized by KDCL with the same models only trained by complementary labels on four public image classification datasets. Three complementary label losses including PC loss [15], FWD loss [16] and SCL-NL loss [18], are used as loss functions for training the models. All the experiments are carried out on a server with a 15 vCPU Intel(R) Xeon(R) Platinum 8358P CPU @ 2.60GHz, 80 GB RAM and one RTX 3090 GPU with 24 GB memory.
Four benchmark image classification datasets, including MNIST, Fashion-MNIST(F-MNIST), Kuzushiji-MNIST(K-MNIST) and CIFAR10, are used to verify the effectiveness of KDCL.
MNIST: consists of 60,000 28 × 28 pixel grayscale images for training and 10,000 images for testing, with a total of 10 categories representing numbers between 0 and 9.
F-MNIST: is an alternative dataset to MNIST and consists of 10 categories, 60,000 training images and 10,000 test images, each with a size of 28 × 28 pixels.
K-MNIST: is a dataset derived from 10 Japanese ancient characters widely used between the mid-Heian period and early modern Japan, which is an extension of the MNIST dataset. K-MNIST contains a total of 74,000 gray-scale images of 28 × 28 pixels in 10 categories.
CIFAR10: consists of 60,000 32 × 32 color images, 50,000 of which are used as the training set and 10,000 as the test set. Each category contains 6000 images.
Following the settings in [15,17,18], we use an unbiased way to select complementary labels for samples in all datasets. Besides, we apply two different sets of teacher-student networks to these datasets. Specifically, for MNIST, F-MNIST and K-MNIST, we chose Lenet-5 [23] as the teacher model and MLP [24] with 500 hidden neurons as the student model. Because these datasets are relatively simple, simple networks can work well. For CIFAR10 dataset, since color images are more difficult to be classified, we need deeper CNN to extract features. We choose DenseNet-121 [25] as the teacher model and ResNet-18 [26] as the student model.
In the setting of training details, for MNIST, F-MNIST and K-MNIST, we train Lenet-5 and MLP with 120 epochs and use SGD as the optimizer with a momentum 0.9 and a weight decay of 0.0001. The initial learning rate is 0.1 and it is halved every 30 epochs. The batch size is set to 128. For CIFAR10 dataset, we train DenseNet-121 and ResNet-18 with 80 epochs and use SGD as the optimizer with a momentum 0.9 and a weight decay of 0.0005. The learning rate is from {1e-1, 1e-2, 5e-3, 1e-3, 5e-4, 1e-4} and it is divided by 10 every 30 epochs.
In Figure 4, we make a parameter sensitivity analysis of the distillation temperature T in Eq (13) and the soft label weighting factor α in Eqs (19)–(21).
We first explore the influence of different distillation temperature T. As we can see, when T=1, which means directly using the probability distribution output by the teacher model as soft labels without softening, KDCL exhibits the worst accuracy. This is because when the temperature is low, there is a significant difference in soft labels between positive and negative classes, making it difficult for the student model to learn effectively. As T gradually increases, the soft labels become more and more smooth, and student model can easily learn the knowledge in soft labels, and the accuracy is gradually improved. When T≥80, the gap between positive and negative classes in soft labels is extremely small, as well as the influence of negative classes is too large, which leads to the accuracy no longer increasing, or even decreasing.
Then, we further investigate the optimal value of soft label weighting factor α. We follow the setting in Hinton et al. [22], and set α in the range of 0 to 1. On the same dataset, the change of α does not have a great impact on the accuracy of KDCL. This indicates that the KDCL model parameter optimization process is not sensitive to the hyperparameter α. Nevertheless, the model still achieves higher accuracy when α=0.5.
Based on the above analysis, we will set T=80, α=0.5 in subsequent experiments.
We show the accuracy for all models with three complementary label losses before and after being optimized by KDCL on four datasets. The results are presented in Table 1.
Dataset | MNIST | F-MNIST | K-MNIST | CIFAR-10 | ||||||||
Model | Lenet-5 | MLP | KDCL-MLP | Lenet-5 | MLP | KDCL-MLP | Lenet-5 | MLP | KDCL-MLP | Lenet-5 | MLP | KDCL-MLP |
PC | 89.94% | 83.78% | 86.10% | 77.22% | 76.67% | 77.42% | 67.77% | 60.52% | 60.34% | 38.31% | 32.74% | 33.37% |
FWD | 85.35% | 83.67% | 84.61% | 85.35% | 83.67% | 84.61% | 86.85% | 70.86% | 75.41% | 60.74% | 44.93% | 46.65% |
SCL-NL | 98.18% | 92.06% | 94.33% | 85.93% | 83.69% | 84.66% | 86.85% | 70.59% | 75.25% | 61.64% | 40.46% | 45.98% |
In Table 1, we show the experimental results of KDCL, where we compare the performance of the student model optimized by KDCL with that trained only with complementary labels across different losses and datasets. On MNIST, which is a relatively simple and easy dataset, all methods can achieve high accuracies. With the help of KDCL, we improve the accuracy of MLP from 83.78% to 86.10% with PC loss, 92.07% to 94.32% with FWD loss and 92.06% to 94.33% with SCL-NL loss. SCL-NL loss performs better among three loss functions. Besides, after being enhanced by KDCL, the accuracy of KDCL-MLP falls between the accuracy of MLP model and Lenet-5. On F-MNIST, which is more complex than MNIST, all methods have a slight decrease. Our KDCL achives 77.42% with PC loss, 84.61% with FWD loss and 84.66% with SCL-NL loss. On K-MNIST, which is more complex than F-MNIST, when using PC loss, our method does not significantly improve the accuracy of MLP, but we improve 4.55% with FWD loss and 4.66% with SCL-NL loss. On CIFAR-10, which is the most complex among the four datasets, there is a significant drop in accuracies. Nevertheless, the student model can still be optimized by KDCL, demonstrating its robustness and effectiveness across different datasets.
We show the testing process of all models in Figure 5.
In Figure 5, we present the convergence speed of all models in our experiments. The results show that the student model distilled by KDCL converges faster than that trained only with complementary labels. This indicates that the model can learn the features of the images more accurately and efficiently when utilizing both soft labels and complementary labels.
Additionally, we observe that the PC loss exhibits a decrease in accuracy on more challenging datasets, particularly on CIFAR10. This is because the PC loss uses the Sigmoid function as the normalization function, which can lead to negative values in the loss calculation and prevent the model from finding better parameters when updating. This phenomenon becomes more pronounced on the CIFAR10 dataset, where a peak appears. However, KDCL can alleviate this phenomenon and shift the peak to a later epoch. This demonstrates the effectiveness of KDCL in addressing the limitations of existing CLL methods and improving the performance of complementary label learning.
In this study, we established a knowledge distillation training framework for CLL, called KDCL. As stated in the introduction, the supervision information in complementary labels is easily missed. The proposed framework employed a deep CNN model with higher accuracy to soften complementary labels to soft labels. Both soft labels and origion complementary labels are used to train the classification model. After the optimization of KDCL, compared to just using the normal CLL methods, the accuracy has been improved by 0.5–4.5%.
The main limitation lies in multiple aspects. First, KDCL’s performance could be influenced by the choice of teacher-student models and CLL algorithms. Our experiments utilize specific combinations of models and algorithms, and the results may vary with different configurations. By choosing better CNN networks and more excellent CLL algorithms, KDCL can achieve better performance on more difficult datasets. Another drawback of the proposed scheme is time cost. Due to the two-stage training framework of KDCL, which involves training a high-accuracy teacher model using complementary labels, the overall training time cost of KDCL is relatively high. Training a high-accuracy model typically takes a considerable amount of time, which poses a challenge to the efficiency of KDCL. In addition, KDCL is only tested on public datasets, and the data distribution is relatively uniform. In the future, we also consider expanding the application scope of KDCL to use dynamically imbalanced data for CLL, or to combine with hybrid deep learning models [27,28,29].
In this paper, we give the first attempt to leverage the knowledge distillation training framework in CLL. To enhance the supervised information present in complementary labels, which are often overlooked in existing CLL methods, we propose a complementary label enhancement framework based on knowledge distillation, called KDCL. Specifically, KDCL consists of a teacher model and a student model. By adopting knowledge distillation techniques, the teacher model transfers its softened knowledge to the student model. The student model then learns from both soft labels and complementary labels to improve its classification performance. The experimental results on four benchmark datasets show that KDCL can improve the classification accuracy of CLL, and maintain robustness and effectiveness on difficult datasets.
The authors declare they have not used Artificial Intelligence (AI) tools in the creation of this article.
This work was supported by the National Natural Science Foundation of China (No. 61976217, 62306320), the Natural Science Foundation of Jiangsu Province (No. BK20231063), the Fundamental Research Funds of Central Universities (No. 2019XKQYMS87), Science and Technology Planning Project of Xuzhou (No. KC21193).
All authors declare that they have no conflicts of interest.
[1] |
Alvarez S, Tobarra MA, Zafrilla JE (2019) Corporate and Product Carbon Footprint under Compound Hybrid Analysis: Application to a Spanish Timber Company. J Ind Ecol 23: 496–507. https://doi.org/10.1111/jiec.12759 doi: 10.1111/jiec.12759
![]() |
[2] |
Battiston S, Mandel A, Monasterolo I, et al. (2017) A climate stress-test of the financial system. Nat Clim Chang 7: 283–288. https://doi.org/10.1038/nclimate3255 doi: 10.1038/nclimate3255
![]() |
[3] |
Boermans MA, Galema R (2019) Are pension funds actively decarbonizing their portfolios? Ecol Econ 161: 50–60. https://doi.org/https://doi.org/10.1016/j.ecolecon.2019.03.008 doi: 10.1016/j.ecolecon.2019.03.008
![]() |
[4] | CDP (2022) CDP Technical Note: Relevance of Scope 3 Categories by Sector. Available from: https://cdn.cdp.net/cdp-production/cms/guidance_docs/pdfs/000/003/504/original/CDP-technical-note-scope-3-relevance-by-sector.pdf?1649687608. |
[5] |
Dang THN, Nguyen CP, Lee GS, et al. (2023) Measuring the energy-related uncertainty index. Energy Econ 124: 106817. https://doi.org/10.1016/j.eneco.2023.106817 doi: 10.1016/j.eneco.2023.106817
![]() |
[6] | ECB (European Central Bank) (2023) Climate-related financial disclosures of the Eurosystem's corporate sector holdings for monetary policy purposes. Frankfurt am Main, Germany. Available from: https://www.ecb.europa.eu/pub/pdf/other/ecb.climate_related_financial_disclosures_eurosystem_corporate_sector_holdings_monetary_policy_purposes2023~9eae8df8d9.en.pdf. |
[7] | European Union (2019) Regulation (EU) 2019/2088 of the European Parliament and of the Council of 27 November 2019 on sustainability‐related disclosures in the financial services sector. |
[8] | European Union (2023) Commission Delegated Regulation (EU) 2023/2772 of 31 July 2023 supplementing Directive 2013/34/EU of the European Parliament and of the Council as regards sustainability reporting standards. |
[9] |
Fama EF, French KR (1992) The cross‐section of expected stock returns. J Financ 47: 427–465. https://doi.org/10.2307/2329112 doi: 10.2307/2329112
![]() |
[10] |
Fang D, Holmen M, Mavruk T (2021) Meeting new peers: The effects of Morningstar category reassignment on fund flows and star ratings. Int Rev Financ Anal 77: 101842. https://doi.org/10.1016/j.irfa.2021.101842 doi: 10.1016/j.irfa.2021.101842
![]() |
[11] |
Fang F, Parida S (2022) Sustainable mutual fund performance and flow in the recent years through the COVID-19 pandemic. Int Rev Financ Anal 84: 102387. https://doi.org/https://doi.org/10.1016/j.irfa.2022.102387 doi: 10.1016/j.irfa.2022.102387
![]() |
[12] | Fernández-Rosillo B, Koblents E, Morales A (2023) Micro-database for sustainability (ESG) indicators developed at the Banco de España (2022). Notas Estadísticas/Banco de España 17. |
[13] |
Fooladi IJ, Hebb G (2023) Drivers of differences in performance of ESG-focused funds relative to their underlying benchmarks. Global Financ J 56: 100745. https://doi.org/10.1016/j.gfj.2022.100745 doi: 10.1016/j.gfj.2022.100745
![]() |
[14] |
Gantchev N, Giannetti M, Li R (2024) Sustainability or performance? Ratings and fund managers' incentives. J Financ Econ 155: 103831. https://doi.org/https://doi.org/10.1016/j.jfineco.2024.103831 doi: 10.1016/j.jfineco.2024.103831
![]() |
[15] | GHGProtocol (2004) GHG Protocol Corporate Accounting and Reporting Standard. Available from: https://ghgprotocol.org/sites/default/files/standards/ghg-protocol-revised.pdf. |
[16] | GHGProtocol (2013) GHG Protocol Corporate Value Chain (Scope 3) Accounting and Reporting Standard. Supplement to the GHG Protocol Corporate Accounting and Reporting Standard. Available from: https://ghgprotocol.org/sites/default/files/standards/Corporate-Value-Chain-Accounting-Reporing-Standard_041613_2.pdf. |
[17] | Hale J (2018) Measuring transition risk in fund portfolios-The Morningstar® Portfolio Carbon Risk Score. Available from: https://www.morningstar.com/content/dam/marketing/shared/Company/LandingPages/CarbonRisk/Carbon_Risk_Paper.pdf?cid = EMQ_. |
[18] |
Hertwich EG, Wood R (2018) The growing importance of scope 3 greenhouse gas emissions from industry. Environ Res Lett 13: 104013. https://doi.org/10.1088/1748-9326/aae19a doi: 10.1088/1748-9326/aae19a
![]() |
[19] | IPCC (Intergovernmental Panel on Climate Change) (2018) Global Warming of 1.5℃. An IPCC Special Report on the impacts of global warming of 1.5℃ above pre-industrial levels and related global greenhouse gas emission pathways, in the context of strengthening the global response to the threat of climate change, sustainable development, and efforts to eradicate poverty. Available from: https://www.ipcc.ch/sr15/. |
[20] | IPCC (Intergovernmental Panel on Climate Change) (2022) Climate Change 2022. Mitigation of Climate Change. Working Group Ⅲ Contribution to the Sixth Assessment Report of the Intergovernmental Panel on Climate Change. Available from: https://www.ipcc.ch/report/ar6/wg3/. |
[21] |
Janicka M, Sajnóg A (2023) The Quality of Environmental Data Disclosure. The case of Public Companies from the European Union Countries. Eur J Sustain Dev 12. https://doi.org/10.14207/ejsd.2023.v12n4p149 doi: 10.14207/ejsd.2023.v12n4p149
![]() |
[22] |
Jensen MC (1968) The performance of mutual funds in the period 1945–1964. J Financ 23: 389–416. https://doi.org/10.1111/j.1540-6261.1968.tb00815.x doi: 10.1111/j.1540-6261.1968.tb00815.x
![]() |
[23] |
Ji X, Zhang Y, Mirza N, et al. (2021a) The impact of carbon neutrality on the investment performance: Evidence from the equity mutual funds in BRICS. J Environ Manage 297: 113228. https://doi.org/https://doi.org/10.1016/j.jenvman.2021.113228 doi: 10.1016/j.jenvman.2021.113228
![]() |
[24] |
Ji X, Chen X, Mirza N, et al. (2021b) Sustainable energy goals and investment premium: Evidence from renewable and conventional equity mutual funds in the Euro zone. Resour Policy 74: 102387. https://doi.org/10.1016/j.resourpol.2021.102387 doi: 10.1016/j.resourpol.2021.102387
![]() |
[25] |
Jindal A, Hishikar S, Shrimali G (2024) Examining the portfolio carbon footprint of Indian banking system. J Clean Prod 459: 142393. https://doi.org/10.1016/j.jclepro.2024.142393 doi: 10.1016/j.jclepro.2024.142393
![]() |
[26] | Kutlukaya M, Fornino M, Shirono K, et al. (2023) Carbon Footprint of Bank Loans—A Measure of Transition Risks for the Financial Sector. In: Agnolucci P, Ananthakrishnan P, Astolfi R, et al., Data for a Greener World: A Guide for Practitioners and Policymakers, Washington: International Monetary Fund, 113–139. https://doi.org/10.5089/9798400217296.071 |
[27] |
Ledoit O, Wolf M (2008) Robust performance hypothesis testing with the Sharpe ratio. J Empir Financ 15: 850–859. https://doi.org/10.1016/j.jempfin.2008.03.002 doi: 10.1016/j.jempfin.2008.03.002
![]() |
[28] |
López LA, Arce G, Cadarso MÁ, et al. (2023) The global dissemination to multinationals of the carbon emissions ruling on Shell. Struct Chang Econ Dyn 65: 406–416. https://doi.org/https://doi.org/10.1016/j.strueco.2023.03.015 doi: 10.1016/j.strueco.2023.03.015
![]() |
[29] |
Malik A, McBain D, Wiedmann TO, et al. (2019) Advancements in Input-Output Models and Indicators for Consumption-Based Accounting. J Ind Ecol 23: 300–312. https://doi.org/10.1111/jiec.12771 doi: 10.1111/jiec.12771
![]() |
[30] |
Mirza N, Abbas Rizvi SK, Saba I, et al. (2022) The resilience of Islamic equity funds during COVID-19: Evidence from risk adjusted performance, investment styles and volatility timing. Int Rev Econ Financ 77: 276–295. https://doi.org/https://doi.org/10.1016/j.iref.2021.09.019 doi: 10.1016/j.iref.2021.09.019
![]() |
[31] |
Naqvi B, Mirza N, Rizvi SKA, et al. (2021). Is there a green fund premium? Evidence from twenty seven emerging markets. Glob Financ J 50: 100656. https://doi.org/10.1016/j.gfj.2021.100656 doi: 10.1016/j.gfj.2021.100656
![]() |
[32] |
Nguyen Q, Diaz-Rainey I, Kitto A, et al. (2023) Scope 3 emissions: Data quality and machine learning prediction accuracy. PLOS Clim 2: e0000208. https://doi.org/10.1371/journal.pclm.0000208 doi: 10.1371/journal.pclm.0000208
![]() |
[33] |
Oosterhaven J, Stelder D (2002) Net Multipliers Avoid Exaggerating Impacts: With A Bi-Regional Illustration for the Dutch Transportation Sector. J Reg Sci 42: 533–543. https://doi.org/10.1111/1467-9787.00270 doi: 10.1111/1467-9787.00270
![]() |
[34] |
Ortiz M, López LA, Cadarso MÁ (2020) EU carbon emissions by multinational enterprises under control-based accounting. Resour Conserv Recycl 163: 105104. https://doi.org/https://doi.org/10.1016/j.resconrec.2020.105104 doi: 10.1016/j.resconrec.2020.105104
![]() |
[35] | PCAF (Partnership for Carbon Accounting Financials) (2020) The Global GHG Accounting and Reporting Standard Part A: Financed Emissions. Second Edition. Available from: https://carbonaccountingfinancials.com/files/downloads/PCAF-Global-GHG-Standard-2020.pdf. |
[36] | PCAF (Partnership for Carbon Accounting Financials) (2022) The Global GHG Accounting and Reporting Standard Part A: Financed Emissions. Second Edition. Available from: https://carbonaccountingfinancials.com/files/downloads/PCAF-Global-GHG-Standard.pdf. |
[37] | Peters G, Andrew RM, Karstensen J (2016) Global environmental footprints. A guide to estimating, interpreting and using consumption-based accounts of resource use and environmental impacts. Copenhagen, Denmark. https://doi.org/10.6027/TN2016-532 |
[38] |
Popescu IS, Gibon T, Hitaj C, et al. (2023) Are SRI funds financing carbon emissions? An input-output life cycle assessment of investment funds. Ecol Econ 212: 107918. https://doi.org/https://doi.org/10.1016/j.ecolecon.2023.107918 doi: 10.1016/j.ecolecon.2023.107918
![]() |
[39] |
Popescu IS, Schaubroeck T, Gibon T, et al. (2024) Investment funds are responsible for substantial environmental and social impacts. Commun Earth Environ 5: 355. https://doi:10.1038/s43247-024-01479-4 doi: 10.1038/s43247-024-01479-4
![]() |
[40] |
Reboredo JC, Quintela M, Otero LA (2017) Do investors pay a premium for going green? Evidence from alternative energy mutual funds. Renew Sustain Energy Rev 73: 512–520. https://doi.org/10.1016/j.rser.2017.01.158 doi: 10.1016/j.rser.2017.01.158
![]() |
[41] |
Rodrigues JFD, Moran D, Wood R, et al. (2018) Uncertainty of Consumption-Based Carbon Accounts. Environ Sci Technol 52: 7577–7586. https://doi:10.1021/acs.est.8b00632 doi: 10.1021/acs.est.8b00632
![]() |
[42] |
Stadler K, Wood R, Bulavskaya T, et al. (2018) EXIOBASE 3: Developing a time series of detailed environmentally extended multi‐regional input‐output tables. J Ind Ecol 22: 502–515. https://doi.org/10.1111/jiec.12715 doi: 10.1111/jiec.12715
![]() |
[43] |
Steen MJ, Moussawi T, Gjolberg O (2020) Is there a relationship between Morningstar's ESG ratings and mutual fund performance. J Sustain Financ Invest 10: 349–370. https://doi.org/10.1080/20430795.2019.1700065 doi: 10.1080/20430795.2019.1700065
![]() |
[44] |
Suh S, Lenzen M, Treloar GJ, et al. (2004) System boundary selection in life-cycle inventories using hybrid approaches. Environ Sci Technol 38: 657–664. https://doi.org/10.1021/es0263745 doi: 10.1021/es0263745
![]() |
[45] | TCFD (Task Force on Climate-Related Financial Disclosures) (2021) Task Force on Climate-related Financial Disclosures. Implementing the Recommendations of the Task Force on Climate-related Financial Disclosures. Available from: https://assets.bbhub.io/company/sites/60/2021/07/2021-TCFD-Implementing_Guidance.pdf. |
[46] | TCFD (Task Force on Climate-Related Financial Disclosures) (2023) Task Force on Climate-related Financial Disclosures 2023 Status Report. Available from: https://assets.bbhub.io/company/sites/60/2023/09/2023-Status-Report.pdf. |
[47] | Teubler J, Kühlert M (2020) Financial carbon footprint: calculating banks' scope 3 emissions of assets and loans. ECEE Industrial Summer Study Proceedings. Available from: https://epub.wupperinst.org/frontdoor/deliver/index/docId/7587/file/7587_Teubler.pdf. |
[48] |
Wagner M, Margaritis D (2017) All about fun(ds) in emerging markets? The case of equity mutual funds. Emerg Mark Rev 33: 62–78. https://doi.org/10.1016/j.ememar.2017.08.004 doi: 10.1016/j.ememar.2017.08.004
![]() |
[49] |
Wang J, Guan D, Zhang Z, et al. (2023) Carbon footprints of the equity portfolios of Chinese fund firms. Commun Earth Environ 4. https://doi.org/10.1038/s43247-023-00926-y doi: 10.1038/s43247-023-00926-y
![]() |
[50] |
Wei P, Li Y, Zhang Y (2023) Corporate green bonds and carbon performance: An economic input–output life cycle assessment model-based analysis. Bus Strategy Environ 32: 2736–2754. https://doi.org/https://doi.org/10.1002/bse.3267 doi: 10.1002/bse.3267
![]() |
[51] |
Zhang Z, Li J, Guan D (2023) Value chain carbon footprints of Chinese listed companies. Nat Commun 14: 2794. https://doi.org/10.1038/s41467-023-38479-5 doi: 10.1038/s41467-023-38479-5
![]() |
[52] |
Yamakawa A, Peters GP (2009) USING TIME-SERIES TO MEASURE UNCERTAINTY IN ENVIRONMENTAL INPUT–OUTPUT ANALYSIS. Econ Syst Res 21: 337–362. https://doi.org/10.1080/09535310903444766 doi: 10.1080/09535310903444766
![]() |
![]() |
![]() |
Dataset | MNIST | F-MNIST | K-MNIST | CIFAR-10 | ||||||||
Model | Lenet-5 | MLP | KDCL-MLP | Lenet-5 | MLP | KDCL-MLP | Lenet-5 | MLP | KDCL-MLP | Lenet-5 | MLP | KDCL-MLP |
PC | 89.94% | 83.78% | 86.10% | 77.22% | 76.67% | 77.42% | 67.77% | 60.52% | 60.34% | 38.31% | 32.74% | 33.37% |
FWD | 85.35% | 83.67% | 84.61% | 85.35% | 83.67% | 84.61% | 86.85% | 70.86% | 75.41% | 60.74% | 44.93% | 46.65% |
SCL-NL | 98.18% | 92.06% | 94.33% | 85.93% | 83.69% | 84.66% | 86.85% | 70.59% | 75.25% | 61.64% | 40.46% | 45.98% |
Dataset | MNIST | F-MNIST | K-MNIST | CIFAR-10 | ||||||||
Model | Lenet-5 | MLP | KDCL-MLP | Lenet-5 | MLP | KDCL-MLP | Lenet-5 | MLP | KDCL-MLP | Lenet-5 | MLP | KDCL-MLP |
PC | 89.94% | 83.78% | 86.10% | 77.22% | 76.67% | 77.42% | 67.77% | 60.52% | 60.34% | 38.31% | 32.74% | 33.37% |
FWD | 85.35% | 83.67% | 84.61% | 85.35% | 83.67% | 84.61% | 86.85% | 70.86% | 75.41% | 60.74% | 44.93% | 46.65% |
SCL-NL | 98.18% | 92.06% | 94.33% | 85.93% | 83.69% | 84.66% | 86.85% | 70.59% | 75.25% | 61.64% | 40.46% | 45.98% |