Time-generative AI-enabled temporal fusion transformer model for efficient air pollution sensor calibration in IIoT edge environments

Shagufta Henna; Mohammad Amjath; Asif Yar; Shagufta Henna; Mohammad Amjath; Asif Yar

doi:10.3934/environsci.2025024

AIMS Environmental Science

2025, Volume 12, Issue 3: 526-556. doi: 10.3934/environsci.2025024

Previous Article Next Article

Research Special Issues

Time-generative AI-enabled temporal fusion transformer model for efficient air pollution sensor calibration in IIoT edge environments

Department of Computing, Atlantic Technological University, Donegal, Ireland

Received: 30 December 2024 Revised: 06 May 2025 Accepted: 20 May 2025 Published: 30 May 2025

The deployment of real-time sensor calibration models for air pollution monitoring on resource-constrained Industrial Internet of Things (IIoT) edge devices presents significant challenges due to the computational complexity and memory requirements of deep learning models. This paper addressed these challenges by proposing a time-series-generative approach that integrated model quantization, generative artificial intelligence (AI), and temporal deep learning architectures to ensure efficient deployment. Specifically, we introduced a TimeGAN-augmented temporal fusion transformer (TFT) model optimized for edge devices. By leveraging model quantization, the approach reduces the memory footprint and computational demands of the model without compromising calibration accuracy. Furthermore, the integration of generative adversarial networks (GANs) enhances the robustness of the model by generating high-quality synthetic time-series data, compensating for sparse or noisy sensor readings. This ability to generate synthetic data mirrors the real sensor trends, ensuring reliable model performance even in data-limited environments. A comprehensive evaluation of the proposed model, comparing its performance against both float and quantized versions, demonstrates the effectiveness of the TimeGAN-augmented quantized TFT. This model achieves a significant 88% reduction in size (from 800.04 KB to 97.34 KB) while maintaining excellent predictive performance, evidenced by a mean squared error (MSE) of 0.3212 and a mean absolute error (MAE) of 0.4375. Additionally, the TimeGAN-augmented Float TFT model emerges as a strong contender for real-time applications, offering an optimal balance between inference speed and accuracy, with a rapid inference time of 23.4 ms, making it ideal for real-time pollution monitoring.

Keywords:

Citation: Shagufta Henna, Mohammad Amjath, Asif Yar. Time-generative AI-enabled temporal fusion transformer model for efficient air pollution sensor calibration in IIoT edge environments[J]. AIMS Environmental Science, 2025, 12(3): 526-556. doi: 10.3934/environsci.2025024

Related Papers:

[1]	Xiaobo Li, Xinggang Ye, Meirong Qian . An embedded machine learning system method for air pollution monitoring and control. AIMS Environmental Science, 2025, 12(4): 576-593. doi: 10.3934/environsci.2025026
[2]	Suprava Ranjan Laha, Binod Kumar Pattanayak, Saumendra Pattnaik . Advancement of Environmental Monitoring System Using IoT and Sensor: A Comprehensive Analysis. AIMS Environmental Science, 2022, 9(6): 771-800. doi: 10.3934/environsci.2022044
[3]	K. Wayne Forsythe, Cameron Hare, Amy J. Buckland, Richard R. Shaker, Joseph M. Aversa, Stephen J. Swales, Michael W. MacDonald . Assessing fine particulate matter concentrations and trends in southern Ontario, Canada, 2003–2012. AIMS Environmental Science, 2018, 5(1): 35-46. doi: 10.3934/environsci.2018.1.35
[4]	Winai Meesang, Erawan Baothong, Aphichat Srichat, Sawai Mattapha, Wiwat Kaensa, Pathomsorn Juthakanok, Wipaporn Kitisriworaphan, Kanda Saosoong . Effectiveness of the genus Riccia (Marchantiophyta: Ricciaceae) as a biofilter for particulate matter adsorption from air pollution. AIMS Environmental Science, 2023, 10(1): 157-177. doi: 10.3934/environsci.2023009
[5]	Sreenivasulu Kutala, Harshavardhan Awari, Sangeetha Velu, Arun Anthonisamy, Naga Jyothi Bathula, Syed Inthiyaz . Hybrid deep learning-based air pollution prediction and index classification using an optimization algorithm. AIMS Environmental Science, 2024, 11(4): 551-575. doi: 10.3934/environsci.2024027
[6]	Rakibul Hasan, Rabeya Khatoon, Jahanara Akter, Nur Mohammad, Md Kamruzzaman, Atia Shahana, Sanchita Saha . AI-Driven greenhouse gas monitoring: enhancing accuracy, efficiency, and real-time emissions tracking. AIMS Environmental Science, 2025, 12(3): 495-525. doi: 10.3934/environsci.2025023
[7]	Suwimon Kanchanasuta, Sirapong Sooktawee, Natthaya Bunplod, Aduldech Patpai, Nirun Piemyai, Ratchatawan Ketwang . Analysis of short-term air quality monitoring data in a coastal area. AIMS Environmental Science, 2021, 8(6): 517-531. doi: 10.3934/environsci.2021033
[8]	Muhammad Rendana, Wan Mohd Razi Idris, Sahibin Abdul Rahim . Clustering analysis of PM2.5 concentrations in the South Sumatra Province, Indonesia, using the Merra-2 Satellite Application and Hierarchical Cluster Method. AIMS Environmental Science, 2022, 9(6): 754-770. doi: 10.3934/environsci.2022043
[9]	Getachew A. Adnew, Carl Meusinger, Nicolai Bork, Michael Gallus, Mildrid Kyte, Vitalijs Rodins, Thomas Rosenørn, Matthew S. Johnson . Gas-phase advanced oxidation as an integrated air pollution control technique. AIMS Environmental Science, 2016, 3(1): 141-158. doi: 10.3934/environsci.2016.1.141
[10]	Dimitrios Kotzias . Built environment and indoor air quality: The case of volatile organic compounds. AIMS Environmental Science, 2021, 8(2): 135-147. doi: 10.3934/environsci.2021010

Abstract

1. Introduction

Air quality monitoring has become a critical global concern due to its significant environmental and health implications, with polluted air affecting both communities and industries alike ^[1]. Traditional air pollution monitoring methods, such as stationary reference stations, have provided accurate measurements but are limited by high costs, restricted coverage, and an inability to capture spatiotemporal variations in air quality, which fluctuate rapidly across different locations and times ^[2].

To overcome these limitations, low-cost wireless sensor networks (WSNs) have been developed as an affordable and scalable solution for real-time air pollution monitoring. These sensors can be deployed in large numbers across urban and industrial environments, providing improved coverage and richer data. However, the accuracy of data collected by these low-cost sensors can be compromised due to factors such as signal interference, environmental noise, and calibration errors ^[3,4].

Sensor calibration has thus become a key strategy to ensure the reliability and accuracy of data from low-cost sensors. Calibration techniques align the raw measurements from these sensors with the true values observed by high-precision reference stations ^[5]. Typically, calibration involves both pre-deployment adjustments to fine-tune the sensors and post-deployment maintenance to ensure sustained accuracy over time. Given the challenges inherent in manual calibration, automatic techniques such as blind and non-blind calibration methods have been developed to improve the reliability of sensor networks for large-scale deployments ^[6].

The performance of low-cost sensors in WSNs is strongly influenced by sensor type, environmental variability, and deployment conditions. For instance, metal-oxide sensors, though affordable and compact, are particularly susceptible to environmental cross-sensitivities, such as fluctuations in temperature and humidity. These sensors may also suffer from signal drift and aging effects over time. Deployment in uncontrolled outdoor environments further exacerbates these issues, introducing noise from factors such as ambient particulate matter, precipitation, or inconsistent maintenance. These challenges can lead to measurement biases and reduced sensor reliability ^[7].

As air pollution monitoring expands, there is growing demand for real-time sensor calibration in Industrial Internet of Things (IIoT) environments, where timely, accurate data is crucial for informed decision-making ^[8]. Traditional sensor calibration methods, reliant on centralized data processing, are increasingly impractical due to the massive data volume and need for rapid processing. Edge computing offers a solution by processing data closer to the source, allowing real-time calibration on edge devices. This reduces the need for large data transfers, minimizes latency, and enhances privacy and security, thus improving air pollution monitoring ^[9].

However, deploying calibration algorithms on resource-constrained edge devices like IIoT sensors or smartphones is challenging due to limitations in memory, processing power, and energy. Research is focused on optimizing deep learning models for edge deployment. Idrissi et al.^[10] proposed lightweight models for intrusion detection in IoT, while Singh et al. ^[9] developed a hybrid framework to improve real-time security in mobile edge computing, addressing the need for efficient model deployment in resource-limited environments.

Yandouzi et al. demonstrated the value of edge computing for real-time applications by using drones and lightweight deep learning models for forest fire detection ^[11]. Similarly, Abusitta et al. showed how deep learning enhances anomaly detection in IoT systems, improving system reliability ^[12]. Further work by Aversano et al. ^[13] and Ahmad et al. ^[14] emphasized optimizing models for edge efficiency. Konaite et al. ^[15] and Sharma et al. ^[16] expanded these findings to various domains, confirming the broader potential of deep learning on edge devices. To overcome edge device constraints, techniques such as model quantization, weight pruning, and knowledge distillation have been employed to reduce model size and computational load without significantly sacrificing performance ^[17,18,19]. Among these, quantization is especially impactful—reducing weight and activation precision lowers memory usage, speeds up inference, and decreases energy consumption, making it ideal for real-time sensor calibration on edge devices ^[17]. Although quantization may introduce slight accuracy loss, careful tuning can preserve model performance. Tools like TensorFlow Lite enable the deployment of such optimized models on resource-limited devices, supporting accurate real-time calibration ^[20]. However, temporal models like recurrent neural networks (RNNs) and long short-term memory (LSTM), while effective at modeling time-series data, remain challenging to deploy on edge devices due to their high computational and memory demands ^[21].

A further challenge in IIoT applications is the issue of insufficient or poor-quality data, which can undermine the accuracy of sensor calibration. Generative AI, especially generative adversarial networks (GANs), offers a promising solution by generating synthetic time-series data that mimics real-world sensor readings. This enables data augmentation when real-world sensor data is sparse, noisy, or of low quality ^[22]. By augmenting training datasets with synthetic data, GANs enhance the robustness of calibration models, enabling them to make accurate adjustments even in the presence of unreliable or incomplete data.

This work proposes a TimeGAN-augmented temporal fusion transformer (TFT) model optimized for IIoT edge deployment through quantization. The proposed approach is compared against other temporal models, using both float and quantized versions of the model for calibration tasks. By pushing intelligence to the edge, these temporal models enhance the efficiency of environmental monitoring, making air quality monitoring systems more sustainable and responsive. Notably, the TimeGAN-augmented Float TFT strikes a balance between inference speed and accuracy, achieving an inference time of just 23.4 ms, making it ideal for real-time decision-making in pollution control.

The paper is structured as follows: Section 2 reviews related work on sensor calibration, time-series models, and edge deployment optimization techniques. Section 3 details the materials, datasets, and methods, including the TimeGAN-augmented TFT model and quantization strategies for IIoT edge deployment. Section 4 presents results, evaluating TimeGAN-generated data and comparing the proposed model's performance with other temporal models in terms of accuracy, inference time, and resource efficiency. Finally, Section 5 concludes with a summary of findings and future work directions.

2. Related works

Air quality prediction is crucial for addressing environmental and public health concerns, especially in IIoT and edge computing contexts. Various studies have explored machine learning (ML) techniques and edge devices to enhance air quality monitoring systems, addressing challenges like limited resources, sparse data, and real-time prediction needs.

In the study by Sun C et al.^[21], the authors propose a hybrid methodology combining multi-factor LSTM, deep reinforcement learning (DRL), and optimal stopping theory (OST) for efficient task offloading in edge computing. Data is acquired from multiple monitoring stations, followed by preprocessing steps like normalization and feature selection using the Boruta algorithm ^[23]. While the methodology is robust, challenges arise from the variability and sparsity of data across stations, which can impact data quality. Additionally, while OST-based K-best selection aids task distribution, network stability concerns in remote or congested areas limit its efficiency. Furthermore, while the LSTM model with attention mechanisms effectively captures temporal dependencies and addresses missing data, its computational demands pose scalability issues, particularly on resource-constrained edge devices.

Moursi et al. introduced an IoT-enabled system for real-time PM2.5 concentration prediction, combining edge devices and cloud computing ^[24]. Using a nonlinear auto regression with eXogenous input (NARX) framework, the system integrates past PM2.5 data with meteorological inputs to predict the next hour's air quality. While effective for short-term pollution events, the one-hour forecasting horizon limits its ability to capture long-term trends, which are crucial for sustainable air quality management. Additionally, performance tests on a PC and Raspberry Pi highlight the ongoing challenge of balancing computational demands with real-time constraints on edge devices.

A group of researchers developed a cost-effective AI-IoT system for air quality monitoring aimed at individuals with respiratory problems ^[25]. The system uses low-cost wireless sensor nodes based on ESP32 microcontrollers and ZPHS01B air quality modules to measure pollutants and environmental factors. While the system is cost-efficient and accessible, the trade-off in sensor accuracy remains a significant challenge. The LSTM model used for 24-hour forecasting performs well with time-series data but faces computational challenges for deployment on resource-constrained IoT devices. The dataset, covering just two months, may limit the model's ability to account for seasonal variations or extreme pollution events. Furthermore, LSTM's computational demands, especially for long-term forecasting or large datasets, impose burdens on edge devices, potentially hindering real-time predictions. Transformer-based models ^[26], known for their ability to handle long-range dependencies in time-series data, could offer a more promising solution. These models could be optimized for edge deployment using techniques such as model pruning or knowledge distillation, which would reduce computational requirements.

Gong et al. proposed a hybrid predictive maintenance model combining convolutional neural networks for spatial feature extraction and LSTM networks for sequential data analysis, aimed at enhancing predictive maintenance for wind turbines ^[27]. While effective, the model's high computational demands hinder its deployment in resource-constrained edge environments. Similarly, Aggarwal et al. developed a hybrid P-LSTM model for urban air quality forecasting, integrating LSTM with particle swarm optimization (PSO) to optimize hyperparameters ^[28]. Despite its improved performance, the combined complexity of LSTM and PSO results in increased computational burden, limiting its feasibility for real-time urban air quality prediction on edge devices.

Wardana et al. optimized a hybrid CNN-LSTM model for edge-based air quality forecasting, using post-training quantization techniques to reduce model size and improve execution on low-power devices like Raspberry Pi boards ^[29]. While this optimization enabled efficient operation on resource-constrained devices, the trade-off between model size reduction and accuracy became apparent, particularly with full integer quantization, which resulted in some loss of predictive precision. Although quantization improved execution speed and reduced model size, it raised concerns about sacrificing accuracy for real-time air quality predictions.

In a similar vein, Hu et al. introduced FedDeep, a federated deep learning approach for multi-urban PM2.5 forecasting, which utilizes edge computing to process local data and reduce cloud dependency ^[30]. By incorporating a novel gating fusion layer to adapt weather data for PM2.5 predictions, FedDeep enhances accuracy and alleviates computational pressure on cloud servers. However, while the federated architecture offers advantages in data privacy and reduced cloud reliance, it introduces complexity in coordination and communication between edge servers and the cloud, which can challenge scalability and operational efficiency.

Koziel et al. proposed a machine learning-based calibration method for low-cost nitrogen dioxide sensors using neural network (NN) surrogates, specifically multi-layer perceptrons, to predict correlation coefficients from environmental parameters ^[31]. Although the approach demonstrated high calibration efficiency with a correlation coefficient over 0.9 and RMSE below 3.2 $\mathrm{\mu g/m^3}$ , its complexity, requiring multiple sensors and advanced techniques, limits scalability, especially in resource-limited or large-scale deployments.

Yu et al. introduced AirNet, a dual encoder architecture that maps the calibration task into a sequence-to-point format, utilizing data from both mobile and static stations ^[32]. While it outperformed several baseline approaches in reducing forecasting errors and improving accuracy, the increased complexity, incorporating gated recurrent units (GRU), convolutional layers, and a social-based guidance mechanism, raises concerns about computational intensity. Extended training times and high computational requirements make it unsuitable for real-time applications or resource-constrained environments.

In the study by Yar A et al. ^[33], the authors explored the use of graph convolutional betworks (GCN) and extreme learning machines (ELM) for self-calibration in large-scale WSNs. GCN achieved 95% accuracy by capturing spatial and temporal features, while ELM, with faster learning times (109 seconds), offered lower accuracy (70%). This illustrates the trade-off between accuracy and computational efficiency in environmental monitoring. Schmitz et al. compared multiple linear regression and random forest (RF) models for calibrating low-cost metal oxide gas-phase sensors ^[34]. Both models achieved $\text{R}^2$ values above 0.8, but RF outperformed linear regression by capturing non-linear relationships. However, RF's increased complexity poses computational challenges, particularly for real-time or resource-constrained applications. Wang et al. applied RF to enhance calibration in low-cost air quality monitoring systems, achieving $\text{R}^2$ values between 0.70 and 0.99 for multiple pollutants ^[35]. While the model demonstrates high accuracy, its computational intensity makes it impractical for large-scale, real-time deployment, especially in resource-limited environments. In the study by Rahardja U et al. ^[36], the authors proposed AIKU, a transfer learning and meta-learning-based approach for calibrating low-cost PM2.5 sensors. AIKU outperformed traditional methods by enabling rapid adaptation to new sensor locations with minimal training data. However, it remains computationally demanding, with longer training times than simpler models.

A different approach, GenCast, a probabilistic weather forecasting model, significantly outperformed traditional ensemble systems, generating sharper, more realistic weather trajectories in about 8 minutes^[37]. However, GenCast's high computational demands raise scalability concerns, particularly for real-time forecasting in resource-limited settings, emphasizing the need for models that balance high predictive accuracy with computational efficiency. Li et al. introduced the scalable ensemble envelope diffusion sampler (SEEDS), a deep generative model for generating large weather forecast ensembles quickly ^[38]. While SEEDS outperformed operational ensemble forecasts in reliability and skill, its significant computational requirements limit its applicability in environments with limited resources.

The reviewed literature emphasizes advancements in ML for low-cost air quality monitoring and weather forecasting, highlighting the trade-offs between prediction accuracy and computational efficiency. Techniques such as neural networks ^[31] and transfer learning ^[36] deliver high accuracy but are computationally demanding, which limits their deployment in real-time, resource-constrained environments. Models like GCN ^[33] and generative AI systems ^[37] show promise, but their substantial computational requirements hinder their practical use in edge applications within the IIoT. Furthermore, many generative models struggle with reconstruction loss, affecting their generalization ability in dynamic and noisy environments. These challenges emphasize the need for lightweight, scalable models that strike a balance between accuracy and operational feasibility, particularly for edge-based IIoT air quality systems that demand real-time processing and low power consumption.

3. Materials and method

This section presents the proposed approach for calibrating air pollution sensor networks, which includes the TimeGAN TFT model and its deployment on edge devices for the IIoT.

3.1. Time generative adversarial network (TimeGAN)

In this section, we apply the TimeGAN to the task of generating synthetic air pollution data for sensor calibration. The TimeGAN was selected due to the temporal nature of sensor readings, as it uniquely combines adversarial training with explicit temporal supervision. This enables the model to preserve both the time-dependent structure and the statistical properties of multivariate sensor data. Unlike conventional GANs, the TimeGAN captures long-term dependencies through recurrent architectures, ensuring the generation of realistic and temporally coherent sequences, an essential requirement for accurate calibration. As a result, it is particularly effective in replicating the dynamic behavior of environmental data, outperforming other generative models in terms of both fidelity and downstream utility.

Figure 1 illustrates the architecture of the TimeGAN model, which consists of several key components that work together to model and replicate the complex patterns observed in time series data, especially those associated with fluctuations in air quality measurements. The core of the model consists of multiple modules, each designed to handle a specific aspect of the data generation task. The Embedder module, built using an LSTM network, processes the input time series and transforms it into a latent space representation, capturing the temporal dependencies within the data. This embedded representation is then passed to the Generator, which also uses LSTM layers to generate synthetic latent sequences. The Generator's goal is to learn the underlying distribution of the input data and generate new time series sequences that closely resemble the real data. The Recovery module, another LSTM-based component, is responsible for transforming the generated latent representations back into the original feature space, producing synthetic time series data that mirrors the original measurements ^[39]. Finally, the Discriminator module evaluates the generated sequences, distinguishing between real and synthetic data by learning to identify which sequences belong to the original dataset. During the training process, the model is optimized through a combination of reconstruction loss, adversarial loss, and other specialized losses. The reconstruction loss ensures that the generated time series are as similar as possible to the original data, while the adversarial loss helps the model distinguish real from fake data through the interaction between the Generator and the Discriminator. This interplay encourages the Generator to improve its data generation capabilities and produce sequences that are increasingly difficult for the Discriminator to differentiate from real data. These losses are minimized over several epochs, gradually refining the model's ability to generate synthetic time series that are indistinguishable from real sensor data. The algorithm for training the TimeGAN, detailed in Algorithm 1, outlines the step-by-step process of data preprocessing, model initialization, and training. The algorithm emphasizes the iterative nature of the training process, where the model is optimized over multiple epochs. After training, the Generator and Recovery modules work together to produce synthetic time series that are then denormalized to match the scale of the original data. This approach preserves the crucial temporal and structural characteristics of the original air pollution measurements, ensuring that the generated data can be used for accurate sensor calibration and effective air quality monitoring.

Figure 1. The TimeGAN architecture for generating synthetic air pollution data for sensor calibration. The model consists of an Embedder that encodes time series into a latent space, a Generator that creates synthetic sequences, a Recovery module that reconstructs the generated data, and a Discriminator that distinguishes real from synthetic data.

Feature	Count	Mean	Std	Min	25%	50%
Sensor_O3	1241	222.59	158.87	20.83	60.69	209.26
Temp	1241	25.74	6.34	12.33	21.00	24.97
RelHum	1241	40.02	13.78	23.43	29.63	33.20

Feature	DTW Distance	Interpretation
Sensor_O3	7.3850	Moderate temporal deviation
Temp	6.3153	Relatively well-aligned
RelHum	7.4060	Most divergent temporal pattern
Overall FID-like Score	0.1941	High distributional similarity

Temporal Models	Inference Time (ms)
TimeGAN+GRUAttention	1340
TimeGAN+Light GRUAttention	647
TimeGAN+Quantized GRUAttention	29.1
TimeGAN+Transformer	235
TimeGAN+Light Transformer	24
TimeGAN+Quantized Transformer	125
TimeGAN+TFT	384
TimeGAN+Float TFT	23.4
TimeGAN+Quantized TFT	178

Model	Limitation
CNN ^[44]	Captures local spatial patterns; lacks temporal modeling across long horizons; prone to overfitting on small datasets
Temporal CNN (TCN) ^[45]	Improves temporal range but struggles with complex long-term dependencies under limited receptive fields
Dilated CNN ^[46]	Expands temporal receptive field but can introduce gridding artifacts, harming fine-grained temporal modeling
VAE ^[47]	Global latent focus; loses temporal granularity in sequential reconstruction; overfits when not regularized properly
Sequential VAE (SVAE) ^[48]	Introduces time dependencies but still underperforms on complex non-stationary temporal patterns
Conditional VAE (CVAE) ^[49]	Captures conditional structure but struggles with modeling long-term temporal correlations; may overfit when conditioned on insufficient data

[1]	Yar A, Henna S, McAfee M, et al. (2023) Air Pollution Monitoring Using Online Recurrent Extreme Learning Machine. 31st Irish Conference on Artificial Intelligence and Cognitive Science (AICS) 2023: 1–6. https://doi.org/10.1109/AICS60730.2023.10470534 doi: 10.1109/AICS60730.2023.10470534
[2]	Alsamrai O, Redel-Macias MD, Pinzi S, et al. (2024) A systematic review for indoor and outdoor air pollution monitoring systems based on Internet of Things. Sustainability 16: 4353. https://doi.org/10.3390/su16114353 doi: 10.3390/su16114353
[3]	Castell N, Dauge FR, Schneider P, et al. (2017) Can commercial low-cost sensor platforms contribute to air quality monitoring and exposure estimates? Environ Int 99: 293-302. https://doi.org/10.1016/j.envint.2016.12.007 doi: 10.1016/j.envint.2016.12.007
[4]	Wang Z, Zhang Z, Wang Z, et al. (2024) A novel intelligent indoor fire and combustibles detection method based on multi-channel transfer learning strategy with acoustic signals. Process Safety and Environmental Protection 189: 1217–1225. https://doi.org/10.1016/j.psep.2024.06.020 doi: 10.1016/j.psep.2024.06.020
[5]	Henna S, Yar A, Saheed K, et al. (2023) Wireless Sensor Networks Calibration using Attention-based Gated Recurrent Units for Air Pollution Monitoring. IEEE International Conference on Big Data (BigData) 2023: 3779–3784. https://doi.org/10.1109/BigData59044.2023.10386318 doi: 10.1109/BigData59044.2023.10386318
[6]	Maag B, Zhou Z, Saukh O, et al. (2017) SCAN: Multi-Hop Calibration for Mobile Sensor Arrays. ACM Trans Sens Netw 1. https://doi.org/10.1145/3090084 doi: 10.1145/3090084
[7]	Feng H, Xu C, Jin B, et al. (2024) A Deployment Optimization for Wireless Sensor Networks Based on Stacked Auto Encoder and Probabilistic Neural Network. Digital Communications and Networks https://doi.org/10.1016/j.dcan.2024.06.003 doi: 10.1016/j.dcan.2024.06.003
[8]	Yar A, Henna S, McAfee M, et al. (2023) Extreme Learning Machines for Calibration and Prediction in Wireless Sensor Networks: Advancing Environmental Monitoring Efficiency. 31st Irish Conference on Artificial Intelligence and Cognitive Science (AICS) 2023: 1–4. https://doi.org/10.1109/AICS60730.2023.10470795 doi: 10.1109/AICS60730.2023.10470795
[9]	Singh A, Chatterjee K, Satapathy SC (2022) An edge based hybrid intrusion detection framework for mobile edge computing. Complex Intell Syst 8: 3719-3746. https://doi.org/10.1007/s40747-021-00498-4 doi: 10.1007/s40747-021-00498-4
[10]	Idrissi I, Azizi M, Moussaoui O (2022) A Lightweight Optimized Deep Learning-based Host-Intrusion Detection System Deployed on the Edge for IoT. International Journal of Computing and Digital Systems 11: 209-216. https://doi.org/10.12785/ijcds/110117 doi: 10.12785/ijcds/110117
[11]	Yandouzi M, Grari M, Idrissi I, et al. (2022) Review on forest fires detection and prediction using deep learning and drones. Journal of Theoretical and Applied Information Technology 100: 4565-4576.
[12]	Abusitta A, Carvalho GHS, Wahab OA, et al. (2022) Deep learning-enabled anomaly detection for IoT systems. Internet Things 21: 100656. https://api.semanticscholar.org/CorpusID: 253431423
[13]	Aversano L, Bernardi ML, Cimitile M, et al. (2021) Effective anomaly detection using deep learning in IoT systems. Wireless Communications and Mobile Computing 2021: 9054336. https://doi.org/10.1155/2021/9054336 doi: 10.1155/2021/9054336
[14]	Ahmad Z, Shahid Khan A, Nisar K, et al. (2021) Anomaly Detection Using Deep Neural Network for IoT Architecture. Applied Sciences 11: 7050. https://doi.org/10.3390/app11157050 doi: 10.3390/app11157050
[15]	Konaite M, Owolawi PA, Mapayi T, et al. (2021) Smart hat for the blind with real-time object detection using Raspberry Pi and TensorFlow Lite. Proceedings of the International Conference on Artificial Intelligence and its Applications ISBN 9781450385756. https://doi.org/10.1145/3487923.3487929
[16]	Sharma K, Eskicioglu R (2022) Deep learning-based ECG classification on Raspberry Pi using a TensorFlow Lite model based on PTB-XL dataset. International Journal of Artificial Intelligence & Applications 13: 55-66. https://doi.org/10.5121/ijaia.2022.1340455 doi: 10.5121/ijaia.2022.1340455
[17]	Rokh B, Azarpeyvand A, Khanteymoori A (2023) A comprehensive survey on model quantization for deep neural networks in image classification. ACM Computing Surveys 14. https://doi.org/10.1145/3623402 doi: 10.1145/3623402
[18]	Phuong M, Lampert C (2019) Towards understanding knowledge distillation. Proceedings of the 36th International Conference on Machine Learning 97: 5142–5151. PMLR. https://proceedings.mlr.press/v97/phuong19a.html
[19]	Yang L, He Z, Fan D (2020) Harmonious coexistence of structured weight pruning and ternarization for deep neural networks. Proceedings of the AAAI Conference on Artificial Intelligence 34: 6623–6630. https://doi.org/10.1609/aaai.v34i04.6138 doi: 10.1609/aaai.v34i04.6138
[20]	TensorFlow Lite. Ultralytics, 2024. Available from: https://developers.googleblog.com/en/tensorflow-lite-is-now-litert
[21]	Sun C, Li J, Sulaiman R, et al. (2023) Air Quality Prediction and Multi-Task Offloading based on Deep Learning Methods in Edge Computing. Journal of Grid Computing 21. https://doi.org/10.1007/s10723-023-09671-0 doi: 10.1007/s10723-023-09671-0
[22]	Yoon J, Jarrett D, van der Schaar M (2019) Time-series Generative Adversarial Networks. In: Neural Information Processing Systems, 2019. https://proceedings.neurips.cc/paper_files/paper/2019/file/c9efe5f26cd17ba6216bbe2a7d26d490-Paper.pdf
[23]	Kursa MB, Jankowski A, Rudnicki WR (2010) Boruta – A System for Feature Selection. Fundamenta Informaticae 101: 271–285. https://doi.org/10.3233/FI-2010-288 doi: 10.3233/FI-2010-288
[24]	Moursi AS, El-Fishawy N, Djahel S, et al. (2021) An IoT enabled system for enhanced air quality monitoring and prediction on the edge. Complex and Intelligent Systems 7: 2923–2947. https://doi.org/10.1007/s40747-021-00476-w doi: 10.1007/s40747-021-00476-w
[25]	Felici-Castell S, Segura-Garcia J, Perez-Solano JJ, et al. (2023) AI-IoT low-cost pollution-monitoring sensor network to assist citizens with respiratory problems. Sensors 23. https://doi.org/10.3390/s23239585 doi: 10.3390/s23239585
[26]	Li S, Jin X, Xuan Y, et al. (2019) Enhancing the locality and breaking the memory bottleneck of Transformer on time series forecasting. arXiv abs/1907.00235. https://api.semanticscholar.org/CorpusID: 195766887
[27]	Gong L, Chen Y (2024) Machine learning-enhanced IoT and wireless sensor networks for predictive analysis and maintenance in wind turbine systems. Int J Intell Netw 5: 133-144. https://doi.org/10.1016/j.ijin.2024.02.002 doi: 10.1016/j.ijin.2024.02.002
[28]	Aggarwal A, Toshniwal D (2021) A hybrid deep learning framework for urban air quality forecasting. J Clean Prod 329: Article ID 129660. https://doi.org/10.1016/j.jclepro.2021.129660 doi: 10.1016/j.jclepro.2021.129660
[29]	Wardana INK, Gardner JW, Fahmy SA (2021) Optimising deep learning at the edge for accurate hourly air quality prediction. Sensors (Switzerland) 21: 1-28. https://doi.org/10.3390/s21041064 doi: 10.3390/s21041064
[30]	Hu Y, Cao N, Guo W, et al. (2024) FedDeep: A federated deep learning network for edge-assisted multi-urban PM2.5 forecasting. Appl Sci (Switzerland) 14: Article ID 1979. https://doi.org/10.3390/app14051979 doi: 10.3390/app14051979
[31]	Koziel S, Pietrenko-Dabrowska A, Wojcikowski M, et al. (2024) High-performance machine-learning-based calibration of low-cost nitrogen dioxide sensor using environmental parameter differentials and global data scaling. Sci Rep 14: 26120. https://doi.org/10.1038/s41598-024-77214-y doi: 10.1038/s41598-024-77214-y
[32]	Yu H, Li Q, Geng YA, et al. (2020) AirNet: A calibration model for low-cost air monitoring sensors using dual sequence encoder networks. AAAI 34: Article ID 5464. https://doi.org/10.1609/aaai.v34i01.5464 doi: 10.1609/aaai.v34i01.5464
[33]	Yar A, Henna S, McAfee M, et al. (2024) Accelerating deep learning for self-calibration in large-scale uncontrolled wireless sensor networks for environmental monitoring. Proc 35th Irish Syst Signals Conf (ISSC 2024), Article ID 10603082. https://doi.org/10.1109/ISSC61953.2024.10603082
[34]	Schmitz S, Towers S, Villena G, et al. (2021) Unravelling a black box: An open-source methodology for the field calibration of small air quality sensors. Atmos Meas Tech 14: 7221-7241. https://doi.org/10.5194/amt-14-7221-2021 doi: 10.5194/amt-14-7221-2021
[35]	Wang G, Yu C, Guo K, et al. (2024) Research of low-cost air quality monitoring models with different machine learning algorithms. Atmos Meas Tech 17: 181-196. https://doi.org/10.5194/amt-17-181-2024 doi: 10.5194/amt-17-181-2024
[36]	Rahardja U, Aini Q, Manongga D, et al.(2023) Enhancing machine learning with low-cost PM 2.5 air quality sensor calibration using image processing. APTSI Trans Manag (ATM) 7: 194-202. https://doi.org/10.33050/atm.v7i3.2062 doi: 10.33050/atm.v7i3.2062
[37]	Price I, Sanchez-Gonzalez A, Alet F, et al. (2024) Probabilistic weather forecasting with machine learning. Nature https://doi.org/10.1038/s41586-024-08252-9 doi: 10.1038/s41586-024-08252-9
[38]	Li L, Carver R, Lopez-Gomez I, et al. (2023) SEEDS: Emulation of weather forecast ensembles with diffusion models. arXiv abs/2306.14066. https://api.semanticscholar.org/CorpusID: 259252403
[39]	Wang Z, Chen C, Zeng Y, et al. (2023) Where did I come from? Origin attribution of AI-generated images. Advances in Neural Information Processing Systems 36: 74478–74500. https://proceedings.neurips.cc/paper_files/paper/2023/file/ebb4c188fafe7da089b41a9f615ad84d-Paper-Conference.pdf
[40]	Tan K, Chen J, Wang D (2019) Gated Residual Networks with Dilated Convolutions for Monaural Speech Enhancement. IEEE/ACM Trans Audio Speech Lang Process 27: 189–198. https://doi.org/10.1109/TASLP.2018.2876171 doi: 10.1109/TASLP.2018.2876171
[41]	Barcelo-Ordinas JM, Ferrer-Cid P, Garcia-Vidal J, et al. (2021) H2020 project CAPTOR dataset: Raw data collected by low-cost MOX ozone sensors in a real air pollution monitoring network. Data in Brief 36: 107127. https://doi.org/10.1016/j.dib.2021.107127 doi: 10.1016/j.dib.2021.107127
[42]	Barcelo-Ordinas JM, Ferrer-Cid P, Garcia-Vidal J, et al. (2019) Distributed multi-scale calibration of low-cost ozone sensors in wireless sensor networks. Sensors 19:2503. https://doi.org/10.3390/s19112503 doi: 10.3390/s19112503
[43]	Gonsek A, Jeschke M, Rönnau S, et al. (2021) From Paths to Routes: A Method for Path Classification. Frontiers in Behavioral Neuroscience 14:610560. 10.3389/fnbeh.2020.610560 doi: 10.3389/fnbeh.2020.610560
[44]	Bai S, Kolter JZ, Koltun V (2018) An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling. International Conference on Machine Learning (ICML).
[45]	Lea C, Flynn MD, Vidal R, et al. (2017) Temporal Convolutional Networks for Action Segmentation and Detection. IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
[46]	Oord AV, Dieleman S, Zen H, et al. (2016) WaveNet: A Generative Model for Raw Audio. arXiv preprint arXiv: 1609.03499
[47]	Chung J, Kastner K, Dinh L, et al. (2015) Recurrent Latent Variable Models for Sequential Data. Advances in Neural Information Processing Systems (NeurIPS).
[48]	Fraccaro M, Sønderby SK, Paquet U, et al. (2016) Sequential Neural Models with Stochastic Layers. Advances in Neural Information Processing Systems (NeurIPS).
[49]	Walker J, Doersch C, Gupta A, et al. (2016) The Uncertainty in Action: Unsupervised Action Prediction with Variational Autoencoders. European Conference on Computer Vision (ECCV).

Algorithm 1 TimeGAN for Time Series Data Generation
Input: Time series data $D$ , Sequence length $L$ , Samples $N$ , Epochs $\mathcal{E}$ , Batch size $B$ , Hidden dimension $H$ , Latent dimension $Z$ , Learning rate $\alpha$
Output: Synthetic time series data $S$
1: Preprocessing:
2: Normalize $D$ and create sequences of length $L$
3: Model Initialization:
4: Initialize models: Embedder $E$ , Recovery $R$ , Generator $G$ , Discriminator $D$
5: Define loss functions $\mathcal{L}_{\text{MSE}}$ and $\mathcal{L}_{\text{BCE}}$
6: Training:
7: for $epoch = 1$ to $\mathcal{E}$ do
8: for each batch of size $B$ from $D$ do
9: Step 1: Reconstruction Loss
10: $H \gets E(\text{batch})$ $\triangleright$ Encode batch
11: $\hat{B} \gets R(H)$ $\triangleright$ Reconstruct batch
12: Compute $\mathcal{L}_{\text{recon}} = \mathcal{L}_{\text{MSE}}(\hat{B}, \text{batch})$
13: Update $E$ and $R$ using $\mathcal{L}_{\text{recon}}$
14: Step 2: Generator Loss
15: $Z \sim \mathcal{N}(0, 1)$ $\triangleright$ Generate latent noise
16: $H' \gets G(Z)$ $\triangleright$ Generate hidden states
17: $\hat{B}' \gets R(H')$ $\triangleright$ Recover synthetic data
18: Compute $\mathcal{L}_{\text{gen}} = \mathcal{L}_{\text{MSE}}(\hat{B}', \text{batch})$
19: Update $G$ using $\mathcal{L}_{\text{gen}}$
20: Step 3: Discriminator Loss
21: Compute $\mathcal{L}_{\text{disc}}$ using $H$ and $H'$
22: Update $D$ using $\mathcal{L}_{\text{disc}}$
23: end for
24: end for
25: Synthetic Data Generation:
26: Sample $Z \sim \mathcal{N}(0, 1)$
27: Compute $H' \gets G(Z)$ and $S \gets R(H')$
28: Denormalize $S$ to the original scale
29: return $S$

Algorithm 2 TimeGAN-Augmented Temporal Fusion Transformer
Input: $X_{\text{real}}, Y_{\text{real}}, X_{\text{synthetic}}, Y_{\text{synthetic}}$
Output: Trained model and predictions on test data
1: Normalize and reshape $X_{\text{real}}, X_{\text{synthetic}}$
2: $X_{\text{combined}} \gets \text{concat}(X_{\text{real}}, X_{\text{synthetic}})$
3: $X_{\text{input}} \gets \text{DefineInputLayer}((L, F))$
4: $vsn \gets \text{LayerNormalization}(\text{Dense}(\text{activation = 'relu'})(X_{\text{combined}}))$
5: $grn \gets \text{LayerNormalization}(\text{Dense}(\text{activation = 'relu'})(vsn))$
6: $grn \gets \text{Dropout}(0.2)(grn)$
7: $attn \gets \text{LayerNormalization}(\text{MultiHeadAttention}(grn))$
8: $attn \gets \text{Dropout}(0.1)(attn)$
9: $lstm \gets \text{LSTM}(attn)$
10: $lstm \gets \text{Dropout}(0.2)(lstm)$
11: $ffn \gets \text{Dense}(\text{activation = 'relu'})(lstm)$
12: $ffn \gets \text{Dropout}(0.2)(ffn)$
13: $output \gets \text{Dense}(1)(ffn)$
14: Loss $\gets$ MAE, Optimizer $\gets$ Adam
15: Train for $\mathcal{E}$ epochs on $(X_{\text{train}}, Y_{\text{train}})$ , validate on $(X_{\text{val}}, Y_{\text{val}})$
16: Return: Trained model and predictions $Y_{\text{pred}}$

Algorithm 3 Deployment of the TFT Model for IIoT Edge Devices
Input: Trained TFT Model, Test Data ( $X_{\text{test}}$ )
Output: Inference from the deployed model
1: $model \gets \text{convert_to_edge_format}(TFT\_model)$ $\triangleright$ Convert model for edge deployment
2: $model \gets \text{optimize_for_edge}(model)$ $\triangleright$ Optimize for resource constraints
3: $interpreter \gets \text{initialize_interpreter}(model)$ $\triangleright$ Initialize model interpreter
4: $input\_details, output_details \gets \text{get_io_details}(interpreter)$ $\triangleright$ Get input/output details
5: $sample \gets \text{select_sample}(X_{\text{test}})$ $\triangleright$ Select a sample from test data
6: $reshaped\_data \gets \text{prepare_data}(sample)$ $\triangleright$ Reshape and type-cast data for input
7: $interpreter \gets \text{set_input}(interpreter, reshaped\_data)$ $\triangleright$ Feed input to model
8: $predictions \gets \text{run_inference}(interpreter)$ $\triangleright$ Run inference to get predictions
9: Return: $predictions$ $\triangleright$ Return the model's predictions

Algorithm 4 Post-Training Quantization of TimeGAN-Augmented TFT
Input: Trained TimeGAN Model, Test Data ( $\texttt{X_test}$ )
Output: Quantized model predictions
1: Step 1: Model Conversion and Quantization
2: $tflite\_float\_model \gets \text{convert_to_tflite}(timegan\_model)$ $\triangleright$ Convert model to TFLite.
3: $tflite\_quantized\_model \gets \text{apply_quantization}(tflite\_float\_model)$ $\triangleright$ Post-training quantization
4: $\texttt{save_model("timegan_quantized.tflite")}$ $\triangleright$ Save quantized model.
5: $quantized\_model\_size \gets \text{get_model_size}(tflite\_quantized\_model)$ $\triangleright$ Get quantized model.
6: Step 2: Load Model and Allocate Memory
7: $interpreter\_quantized \gets \text{load_model}("timegan\_quantized.tflite")$ $\triangleright$ Load quantized model.
8: $interpreter\_quantized.\text{allocate_tensors}()$ $\triangleright$ Allocate model tensors.
9: Step 3: Prepare Input Data
10: $reshaped\_data \gets \text{reshape_data}(X\_test)$ $\triangleright$ Reshape input data.
11: Step 4: Inference on Quantized Model
12: $predictions \gets \text{run_inference}(interpreter\_quantized, reshaped\_data)$ $\triangleright$ Run inference.
13: Step 5: Return Results
14: $\texttt{return predictions}$ $\triangleright$ Return model predictions.

AIMS Environmental Science

Time-generative AI-enabled temporal fusion transformer model for efficient air pollution sensor calibration in IIoT edge environments

Related Papers:

Abstract

1. Introduction

2. Related works

3. Materials and method

3.1. Time generative adversarial network (TimeGAN)

3.2. Temporal fusion transformer model

3.2.1. Gated residual networks

3.2.2. Variable selection network

3.2.3. Multi-head attention mechanism in TFT

3.2.4. Static covariate encoders

3.2.5. Temporal fusion decoder

3.2.6. Self-attention and feed-forward processing

3.3. Deployment of the TimeGAN temporal fusion transformer (TimeGAN TFT) model for air pollution monitoring in the IIoT

3.3.1. Model conversion and optimization for edge devices

3.3.2. Post-training quantization for model compression

3.3.3. Inference pipeline on edge devices

4. Performance evaluations

4.1. Dataset description

4.2. Results and analysis

4.2.1. TimeGAN for data synthesis: analysis and evaluation

4.2.2. TimeGAN-generated data for downstream calibration regression

4.3. TimeGAN-augmented models for IIoT edge deployment

5. Conclusion

Use of AI tools declaration

Author Contributions

Conflict of interest

Appendix

References

Reader Comments

通讯作者: 陈斌, bchen63@163.com

Metrics

Figures and Tables

Other Articles By Authors

Related pages

Tools

Export File

Citation

Format

Content

Catalog

Abstract

1. Introduction

2. Related works

3. Materials and method

3.1. Time generative adversarial network (TimeGAN)

3.2. Temporal fusion transformer model

3.2.1. Gated residual networks

3.2.2. Variable selection network

3.2.3. Multi-head attention mechanism in TFT

3.2.4. Static covariate encoders

3.2.5. Temporal fusion decoder

3.2.6. Self-attention and feed-forward processing

3.3. Deployment of the TimeGAN temporal fusion transformer (TimeGAN TFT) model for air pollution monitoring in the IIoT

3.3.1. Model conversion and optimization for edge devices

3.3.2. Post-training quantization for model compression

3.3.3. Inference pipeline on edge devices

4. Performance evaluations

4.1. Dataset description

4.2. Results and analysis

4.2.1. TimeGAN for data synthesis: analysis and evaluation

4.2.2. TimeGAN-generated data for downstream calibration regression

4.3. TimeGAN-augmented models for IIoT edge deployment

5. Conclusion

Use of AI tools declaration

Author Contributions

Conflict of interest

Appendix

References