This paper examines the extent to which standardized financial statement disclosures can anticipate both overall environmental, social, and governance (ESG) ratings and their evolution over time for listed firms. Utilizing a longitudinal dataset of 851 companies spanning 11 years, we systematically extract 279 International Financial Reporting Standards (IFRS)-compliant accounts and generate an extensive set of financial ratios to capture the static and dynamic features of corporate performance. Through an empirical comparison of traditional machine learning, gradient boosting (LightGBM, XGBoost), and neural network methods (multilayer perceptrons, convolutional neural networks, and TabNet), the analysis finds that boosting algorithms deliver consistently superior accuracy in ESG prediction tasks involving high-dimensional tabular data. Notably, the overall ESG composite score exhibits the highest level of predictability from financial information, while environmental ratings remain more elusive. Further investigation reveals that balance sheet variables most strongly explain absolute ESG levels, whereas cash flow metrics are pivotal in predicting annual changes in the scores. These findings indicate that both corporate financial structures and resource flows encode substantial amounts of information that is relevant to sustainability evaluations. By linking financial accounting theory with ESG analytics, this study provides a rigorous, data-driven framework that offers practical insights for researchers, policy-makers, and market participants seeking to enhance the reliability and timeliness of ESG evaluations using objective financial data.
Citation: Kahyun Lee. Bridging financial disclosures and ESG ratings: A data-driven predictive framework[J]. Quantitative Finance and Economics, 2026, 10(1): 86-107. doi: 10.3934/QFE.2026005
This paper examines the extent to which standardized financial statement disclosures can anticipate both overall environmental, social, and governance (ESG) ratings and their evolution over time for listed firms. Utilizing a longitudinal dataset of 851 companies spanning 11 years, we systematically extract 279 International Financial Reporting Standards (IFRS)-compliant accounts and generate an extensive set of financial ratios to capture the static and dynamic features of corporate performance. Through an empirical comparison of traditional machine learning, gradient boosting (LightGBM, XGBoost), and neural network methods (multilayer perceptrons, convolutional neural networks, and TabNet), the analysis finds that boosting algorithms deliver consistently superior accuracy in ESG prediction tasks involving high-dimensional tabular data. Notably, the overall ESG composite score exhibits the highest level of predictability from financial information, while environmental ratings remain more elusive. Further investigation reveals that balance sheet variables most strongly explain absolute ESG levels, whereas cash flow metrics are pivotal in predicting annual changes in the scores. These findings indicate that both corporate financial structures and resource flows encode substantial amounts of information that is relevant to sustainability evaluations. By linking financial accounting theory with ESG analytics, this study provides a rigorous, data-driven framework that offers practical insights for researchers, policy-makers, and market participants seeking to enhance the reliability and timeliness of ESG evaluations using objective financial data.
| [1] | Arik SO, Pfister T (2020) TabNet: attentive interpretable tabular learning. arXiv preprint arXiv:1908.07442. https://doi.org/10.48550/arXiv.1908.07442 |
| [2] |
Berg F, Kölbel JF, Rigobon R (2022) Aggregate confusion: the divergence of ESG ratings. Rev Financ 26: 1315–1344. https://doi.org/10.1093/rof/rfac033 doi: 10.1093/rof/rfac033
|
| [3] |
Carrasco PO, Vilchez VF (2022) Sending corporate social responsibility signals: What organizational characteristics must be met? Rev Bras Gest Neg 24: 92–111. https://doi.org/10.7819/rbgn.v24i1.4146 doi: 10.7819/rbgn.v24i1.4146
|
| [4] |
Cicchiello AF, Cotugno M, Monferrà S, et al. (2022) Which are the factors influencing green bonds issuance? Evidence from the European bonds market. Financ Res Lett 50: 103190. https://doi.org/10.1016/j.frl.2022.103190 doi: 10.1016/j.frl.2022.103190
|
| [5] |
Cini F, Ferrari A (2025) Towards the estimation of ESG ratings: A machine learning approach using balance sheet ratios. Res Int Bus Financ 73. https://doi.org/10.1016/j.ribaf.2024.102653 doi: 10.1016/j.ribaf.2024.102653
|
| [6] |
Cui J, Jo H, Na H (2018) Does corporate social responsibility affect information asymmetry? J Bus Ethics 148: 549–572. https://doi.org/10.1007/s10551-015-3003-8 doi: 10.1007/s10551-015-3003-8
|
| [7] |
D'Amato V, D'Ecclesia R, Levantesi S (2021) Fundamental ratios as predictors of ESG scores: a machine learning approach. Decis Econ Financ 12: 1087–1110. https://doi.org/10.1007/s10203-021-00364-5 doi: 10.1007/s10203-021-00364-5
|
| [8] |
D'Amato V, D'Ecclesia R, Levantesi S (2022) ESG score prediction through random forest algorithm. Comput Manage Sci 19: 347–373. https://doi.org/10.1007/s10287-021-00419-3 doi: 10.1007/s10287-021-00419-3
|
| [9] |
Derwall J, Guenster N, Bauer R, et al. (2005) The eco-efficiency premium puzzle. Financ Anal J 61: 51–63. https://doi.org/10.2469/faj.v61.n2.2716 doi: 10.2469/faj.v61.n2.2716
|
| [10] |
Eccles RG, Ioannou I, Serafeim G (2014) The impact of corporate sustainability on organizational processes and performance. Manage Sci 60: 2835–2857. https://doi.org/10.1287/mnsc.2014.1984 doi: 10.1287/mnsc.2014.1984
|
| [11] | Fabijańska A, Wołczek P, Sikacz H (2025) Can machine learning bring ESG ratings closer to small and medium-sized enterprises? Oecon Copernic. https://doi.org/10.24136/oc.3162 |
| [12] | Freeman RE (2010) Strategic management: A stakeholder approach. Cambridge University Press. |
| [13] |
Friede G, Busch T, Bassen A (2015) ESG and financial performance: aggregated evidence from more than 2000 empirical studies. J Sustain Financ Invest 5: 210–233. https://doi.org/10.1080/20430795.2015.1118917 doi: 10.1080/20430795.2015.1118917
|
| [14] | Grinsztajn L, Oyallon E, Varoquaux G (2022) Why do tree-based models still outperform deep learning on tabular data? arXiv preprint arXiv: 2207.08815. https://doi.org/10.48550/arXiv.2207.08815 |
| [15] | Hong H, Kacperczyk M (2009) The price of sin: the effects of social norms on markets. J Financ Econ. https://doi.org/10.1016/j.jfineco.2008.09.001 |
| [16] | Jeffrey H (2010) Sustainability Accounting Standards Board (SASB), In: World Scientific Encyclopedia of Climate Change, 37–41. |
| [17] | Kumar S (2023) A review ESG performance as a measure of stakeholders theory. Acad Market Stud J 27. |
| [18] | Lundberg SM, Lee S-I (2017) A unified approach to interpreting model predictions. Adv Neural Inf Process Syst 30. |
| [19] |
Margolis JD, Walsh JP (2003) Misery loves companies: rethinking social initiatives by business. Admin Sci Q 48: 268–305. https://doi.org/10.2307/3556659 doi: 10.2307/3556659
|
| [20] |
Orlitzky M, Schmidt FL, Rynes SL (2003) Corporate social and financial performance: a meta-analysis. Organ Stud 24: 403–441. https://doi.org/10.1177/0170840603024003910 doi: 10.1177/0170840603024003910
|
| [21] |
O'Shea K, Nash R (2015) An introduction to convolutional neural networks. arXiv preprint arXiv:1511.08458. https://doi.org/10.48550/arXiv.1511.08458 doi: 10.48550/arXiv.1511.08458
|
| [22] | Raza H, Muhammad AK, Mazliham MS, et al. (2022) Applying artificial intelligence techniques for predicting the environment, social, and governance (ESG) pillar score based on balance sheet and income statement data: A case of non-financial companies of USA, UK, and Germany. Front Environ Sci 10. https://doi.org/10.3389/fenvs.2022.975487 |
| [23] |
Sariyer G, Mangla SK, Chowdhury S, et al. (2024) Predictive and prescriptive analytics for ESG performance evaluation: A case of Fortune 500 companies. J Bus Res 181: 114742. https://doi.org/10.1016/j.jbusres.2024.114742 doi: 10.1016/j.jbusres.2024.114742
|
| [24] |
Seow RYC (2025a) Transforming ESG analytics with machine learning: a systematic literature review using TCCM framework. Corp Soc Responsib Environ Manag 32: 1–20. https://doi.org/10.1002/csr.70089 doi: 10.1002/csr.70089
|
| [25] |
Seow RYC (2025b) Clarifying CSR and ESG: Causes of conflation, consequences, and pathways to conceptual clarity. J Environ Manage 394: 127468. https://doi.org/10.1016/j.jenvman.2025.127468 doi: 10.1016/j.jenvman.2025.127468
|
| [26] | Shapley LS (1953) A value for n-person games. In: Contributions to the Theory of Games II, Princeton University Press, 307–317. |
| [27] |
Spence M (1973) Job market signaling. Q J Econ 87: 355–374. https://doi.org/10.2307/1882010 doi: 10.2307/1882010
|
| [28] |
van Zanten JA (2025) Measuring Companies' Environmental and Social Impacts: An analysis of ESG Ratings and SDG Scores. Organ Environ 38: 403–439. https://doi.org/10.1177/10860266251326895 doi: 10.1177/10860266251326895
|
| [29] | Wang J, Tang J, Guo K (2022) Green Bond Index Prediction Based on CEEMDAN-LSTM. Front Energy Res https://doi.org/10.3389/fenrg.2021.793413 |
| [30] |
Weber O, Koellner T, Habegger D, et al. (2008) The relation between the GRI indicators and the financial performance of firms. Prog Ind Ecol Int J 5: 236–254. https://doi.org/10.1504/PIE.2008.019127 doi: 10.1504/PIE.2008.019127
|
| [31] | Whelan T, Atz U, Holt TV, et al. (2021) ESG and financial performance: uncovering the relationship by aggregating evidence from 1,000 plus studies published between 2015–2020. NYU Stern Center Sustainable Bus. |
QFE-10-01-005-s001.pdf |
![]() |