
Using kilovoltage cone-beam computed tomography (kV-CBCT) and heavy-atom radiosensitizers in image-guided radiotherapy (IGRT) can provide numerous benefits, such as image contrast enhancement during radiation dose delivery. However, the increased use of kV-CBCT for daily imaging inevitably deposits a certain amount of radiation dose in the patient, especially when nanoparticle radiosensitizers are involved. In this study, we used Monte Carlo simulation to evaluate the imaging dose escalation due to nanoparticle addition, varying the nanoparticle material, nanoparticle concentration, and photon beam energy. A phantom was used to determine the relationships between the imaging dose enhancement ratio (IDER) and different concentrations (3–40 mg/ml) of gold (Au), platinum (Pt), iodine (I), silver (Ag), and iron oxide (Fe2O3) nanoparticles under 120–140 kVp photon beams from the CBCT. Gold and platinum nanoparticles at 40 mg/ml concentration were found to have the highest IDER (∼1.6) under the 120 kVp photon beam. This nanoparticle addition resulted in a 0.63% increase of imaging dose based on a typical prescription of 200 cGy per fraction in radiotherapy, which is within the standard uncertainty of ±5% in radiation dose delivery. This study shows that incorporating higher-concentration nanoparticles under lower photon beam energies can increase the imaging dose. The results help clarify the implications of incorporating heavy-atom nanoparticles in IGRT systems.
Citation: Dewmini Mututantri-Bastiyange, James C. L. Chow. Imaging dose of cone-beam computed tomography in nanoparticle-enhanced image-guided radiotherapy: A Monte Carlo phantom study[J]. AIMS Bioengineering, 2020, 7(1): 1-11. doi: 10.3934/bioeng.2020001
N6,2′-O-dimethyladenosine (m6Am) is a significant RNA modification that plays a vital role in regulating various cellular processes, including gene expression, RNA stability, and the general integrity of RNA metabolism. This modification occurs at the 5′ untranslated region (UTR) of messenger RNA (mRNA), influencing key RNA functions such as capping, translation initiation, and RNA decay [1]. m6Am has been shown to affect the interaction of RNA molecules with RNA-binding proteins, modulating critical processes like RNA splicing, transport, and stability. These modifications help regulate gene expression in response to cellular conditions and environmental cues, making them essential for maintaining cellular homeostasis. The dynamic and reversible nature of m6Am modifications is crucial for regulating mRNA's fate and ensuring the proper functioning of the translation machinery [2]. The m6Am modification has gained attention due to its potential implications in disease pathogenesis and cellular dysfunction. It is linked to various biological processes, such as cell growth, differentiation, stress responses, and RNA surveillance mechanisms. Its role in regulating mRNA stability suggests that m6Am could modulate gene expression in response to stress or environmental changes, making it an essential factor in cellular adaptation and survival [3],[4]. Moreover, alterations in m6Am modification patterns have been associated with several diseases, including cancer, neurological disorders, and metabolic conditions, highlighting its significance in both health and disease. Given this importance, accurate identification of m6Am sites within RNA sequences is essential for advancing the understanding of gene regulation and the molecular mechanisms that govern disease progression [5].
The ability to detect m6Am modifications opens new avenues for therapeutic interventions, enabling the development of targeted strategies for diseases that involve aberrant RNA modifications. As a result, computational methods that allow efficient and precise detection of m6Am sites are critical for advancing research in RNA biology and molecular medicine.
Advancements in computational biology have produced several learning-based tools for predicting RNA modifications, particularly m6Am. For example, Song et al. [6] introduced MultiRM, an attention-based multi-label neural network capable of predicting 12 RNA modifications simultaneously. Using an attention mechanism, MultiRM identifies modification sites and interprets key sequence contexts, revealing strong associations between different RNA modifications. The model achieves 71.13% accuracy with an MCC of 0.427 and an AUC of 0.805 on sequence-based RNA modification prediction. Jiang et al. [7] proposed m6AmPred, using the eXtreme gradient boosting with dart (XGBDart) algorithm and EIIP-PseEIIP encoding for feature representation; m6AmPred achieved 73.10% accuracy with an MCC of 0.462 and an AUC of 0.820 on cross-validation. Similarly, Luo et al. [8] developed DLm6Am, an ensemble deep learning framework combining one-hot encoding, nucleotide chemical property (NCP), and nucleotide density (ND) for feature extraction. DLm6Am integrates CNN, BiLSTM, and multi-head attention modules, outperforming tools like m6AmPred and MultiRM with 79.55% accuracy, 81.71% sensitivity, 77.40% specificity, an MCC of 0.591, and an AUC of 0.863 on independent testing data. Recently, Jia et al. [9] proposed EMDL_m6Am, a stacking ensemble model employing one-hot encoding and integrating DenseNet, a dilated convolutional neural network (DCNN), and a deep multiscale residual network (MSRN) for feature extraction. EMDL_m6Am achieved 80.98% accuracy, 82.25% sensitivity, 79.72% specificity, an MCC of 0.619, and an AUC of 0.823 on training data, with comparable independent-testing performance (80.98% accuracy, AUC of 0.8211). Despite these advances, existing methods struggle with limited encoding schemes, inefficient feature selection, and reliance on single deep learning frameworks, leading to suboptimal performance and high computational costs.
In addition, the lack of explainability in current models hinders interpretation and limits further improvements in the accuracy and robustness of m6Am site prediction techniques.
Based on the aforementioned considerations, in this study, we propose Deep-m6Am, a novel deep learning (DL) model designed to accurately identify m6Am sites in RNA sequences. The model integrates multiple feature extraction techniques, including pseudo single nucleotide composition (PseSNC), pseudo dinucleotide composition (PseDNC), and pseudo trinucleotide composition (PseTNC), to capture complex sequence patterns essential for precise prediction. A SHAP (SHapley Additive exPlanations)-based feature selection mechanism is incorporated to enhance computational efficiency and eliminate irrelevant or redundant features, ensuring that only the most informative features contribute to the model's predictions. The Deep-m6Am framework addresses the limitations of single-model approaches by leveraging a multilayer deep neural network (DNN) classifier, improving robustness and generalizability. The model's performance was rigorously evaluated using 5-fold cross-validation and independent testing. The Deep-m6Am demonstrates state-of-the-art results across multiple evaluation metrics, including accuracy, sensitivity, specificity, AUC, and MCC, outperforming existing models and traditional ML algorithms. Integrating cutting-edge feature extraction, selection, and deep learning methodologies, Deep-m6Am provides a powerful and interpretable tool for predicting RNA modifications. This advancement significantly contributes to RNA biology by offering more profound insights into RNA modifications and their roles in disease mechanisms, opening promising avenues for further research into RNA modification patterns. Therefore, Deep-m6Am is a robust computational framework for addressing key challenges in RNA modification analysis, as illustrated in Figure 1.
The rest of the paper is organized as follows: Section 2 presents material and methods, Section 3 illustrates performance metrics and evaluation, Section 4 provides experimental results and analysis, and the work is concluded in Section 5.
A valid and reliable benchmark dataset is essential for designing a powerful and robust computational model. In this study, we utilized the same benchmark datasets employed by Jia et al. [9]. These sites were regarded as highly confident, providing a solid foundation for accurate and reliable model development. The training dataset was constructed as depicted in Eq. 1:

T1 = T1+ ∪ T1−    (1)

where T1 represents the total RNA sequences in the training dataset, T1+ denotes the positive (m6Am-containing) samples, and T1− the negative (non-m6Am) samples. The independent dataset was constructed analogously, as shown in Eq. 2:

T2 = T2+ ∪ T2−    (2)

where T2 represents the total RNA sequences in the independent dataset, with T2+ and T2− its positive and negative samples, respectively. Table 1 summarizes the datasets.
| Dataset | Number of samples | Positive samples | Negative samples |
| --- | --- | --- | --- |
| Cross-validation | 3548 | 1774 | 1774 |
| Training dataset | 2838 | 1419 | 1419 |
| Independent dataset | 710 | 355 | 355 |
Several techniques have been developed to convert DNA, protein, and RNA sequences into discrete mathematical models while preserving the nucleotides' salient features and structural integrity. These methods ensure that biological sequences are accurately described in numerical formats, enabling computational analysis without losing critical sequence-specific information. Accordingly, several bioinformatics approaches have been developed that transform RNA sequences into statistical representations while preserving the uniqueness and inherent patterns of the sequences [10]–[13]. Following the second rule of Chou's 5-step guidelines, several feature extraction techniques have been implemented in this paper to improve the representation of RNA sequences. These techniques belong to the pseudo K-tuple nucleotide composition (PseKNC) family, comprising PseSNC (K = 1), PseDNC (K = 2), and PseTNC (K = 3), and are explained in detail in the next section. The PseKNC approach represents RNA sequences as feature vectors by encoding their composition and sequence-order patterns. Rather than retaining the full sequence-order information, it captures essential features that reveal similarities between RNA samples. By transforming the sequences into structured mathematical representations, PseKNC facilitates efficient computational analysis while preserving key biological characteristics of the RNA [14]. Consider an RNA sequence R with N nucleotides, represented in Eq. 3:

R = N1 N2 N3 … NN    (3)

where N represents the number of nucleotides in the RNA sequence (i.e., its length) and each Ni ∈ {A (adenine), C (cytosine), G (guanine), U (uracil)} is the nucleotide at position i. Eq. 3 can be expressed in the general form of PseKNC as Eq. 4:

R = [ϕ1, ϕ2, …, ϕz]T    (4)

where T denotes vector transposition, z = 4^K + λ is the dimension of the feature vector, and ϕy is the y-th component of the RNA sequence's feature vector, computed using Eq. 5:

ϕy = fy / (Σ_{i=1}^{4^K} f_i + w Σ_{j=1}^{λ} θ_j),            1 ≤ y ≤ 4^K
ϕy = w θ_{y−4^K} / (Σ_{i=1}^{4^K} f_i + w Σ_{j=1}^{λ} θ_j),   4^K < y ≤ 4^K + λ    (5)

where fy is the normalized occurrence frequency of the y-th K-tuple, θj represents the j-th tier (rank) correlation factor reflecting the sequence-order correlation between the most contiguous K-tuple nucleotides, λ is the total number of correlation ranks, and w is a weight factor. This paper uses the PseKNC technique to convert the given sequences into discrete feature vectors while retaining sequence-order information. By assigning different values to K (i.e., K = 1, 2, 3) in Eq. 4, three distinct modes of PseKNC were obtained: PseSNC (K = 1), PseDNC (K = 2), and PseTNC (K = 3).
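As an illustration, the composition part of PseKNC (the first 4^K components in Eq. 5, with the correlation terms θj and weight w omitted) can be sketched in a few lines of Python. This is a simplified sketch of our own; the function name and toy sequence are illustrative, not from the paper:

```python
from itertools import product

def kmer_composition(seq, k):
    """Normalized K-tuple nucleotide frequencies: the first 4**k
    components of the PseKNC vector (Eq. 5). The lambda correlation
    terms (theta_j) and weight w are omitted for brevity."""
    kmers = [''.join(p) for p in product('ACGU', repeat=k)]
    counts = dict.fromkeys(kmers, 0)
    for i in range(len(seq) - k + 1):
        counts[seq[i:i + k]] += 1
    total = max(len(seq) - k + 1, 1)
    return [counts[km] / total for km in kmers]

# PseSNC-style (K = 1): 4 features; PseDNC (K = 2): 16; PseTNC (K = 3): 64
vec1 = kmer_composition('ACGUACGU', 1)  # two of each base -> [0.25, 0.25, 0.25, 0.25]
vec2 = kmer_composition('ACGUACGU', 2)  # 16 dinucleotide frequencies
```

Concatenating the K = 1, 2, 3 vectors yields the 4 + 16 + 64 = 84-dimensional hybrid vector described in Table 2.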
This study used three distinct feature extraction methods to encode RNA sequences into discrete feature vectors, as summarized in Table 2. These features include PseSNC, PseDNC, and PseTNC, which integrate pseudo, composition, and transitional probability features to improve the differentiation and interpretation of nucleotide sequences [17]–[19]. All individual features were incorporated to construct a comprehensive hybrid feature vector by capturing diverse sequence-derived attributes. Machine learning models leveraging hybrid features benefit from combining multiple extraction techniques, enhancing predictive performance by effectively capturing complex data patterns. This approach remains a widely adopted strategy in bioinformatics and genomics for improving model interpretability and accuracy.
| Feature extraction method | Features |
| --- | --- |
| Pseudo single nucleotide composition (PseSNC) | 4 |
| Pseudo dinucleotide composition (PseDNC) | 16 |
| Pseudo trinucleotide composition (PseTNC) | 64 |
| Hybrid features | 84 |
Feature selection is critical in developing models to improve overall performance and computational efficiency. It involves identifying and retaining the most informative features while eliminating irrelevant or redundant ones, which can introduce noise and reduce prediction accuracy. This study employs SHAP (SHapley Additive exPlanations) as a robust feature selection technique. SHAP leverages cooperative game theory to quantify the contribution of each feature to the model's predictions, ensuring that only the most significant features are retained [20]. This approach reduces the dataset's dimensionality and enhances the model's interpretability by providing insights into the importance of individual features. By integrating SHAP into the Deep-m6Am framework, the model achieves optimized computational efficiency and improved generalization, enabling more accurate and reliable identification of m6Am sites in RNA sequences. This feature selection strategy is pivotal in addressing the challenges of high-dimensional data and ensuring the model's robustness and scalability. The SHAP value of a feature can be expressed as in Eq. 11:

φi = Σ_{S ⊆ N∖{i}} [ |S|! (|N| − |S| − 1)! / |N|! ] (f(S ∪ {i}) − f(S))    (11)

where φi represents the SHAP value for feature i, N is the set of all features, S is a subset of features excluding i, f(S) is the model's prediction given only the features in S, and f(S ∪ {i}) is the prediction once feature i is added; the weighted average of these marginal contributions over all subsets gives the importance of feature i.
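Eq. 11 can be evaluated exactly for a small number of features by enumerating every subset S. The sketch below (our own toy additive model, not the paper's DNN) illustrates the quantity that the SHAP library approximates at scale:

```python
import math
from itertools import combinations

def shapley_values(f, n_features):
    """Exact Shapley values per Eq. 11, enumerating all subsets S of the
    remaining features. Feasible only for small n; SHAP uses efficient
    approximations of this same quantity for real models."""
    phi = []
    for i in range(n_features):
        others = [j for j in range(n_features) if j != i]
        value = 0.0
        for r in range(len(others) + 1):
            for S in combinations(others, r):
                S = frozenset(S)
                # |S|! (|N|-|S|-1)! / |N|!  -- the subset weight in Eq. 11
                weight = (math.factorial(len(S))
                          * math.factorial(n_features - len(S) - 1)
                          / math.factorial(n_features))
                value += weight * (f(S | {i}) - f(S))
        phi.append(value)
    return phi

# Toy additive model: the prediction is the sum of the present features'
# contributions, so each feature's Shapley value equals its own contribution.
contrib = {0: 0.5, 1: -0.2, 2: 0.9}
phi = shapley_values(lambda S: sum(contrib[j] for j in S), 3)
```

Features whose |φi| falls below a chosen threshold can then be dropped, which is the selection step applied to the 84-dimensional hybrid vector here.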
The network topology of a deep neural network (DNN), a machine learning algorithm inspired by the human brain, includes an input layer, an output layer, and multiple hidden layers. The mechanism of neuron transmission and the activation function in a DNN are shown in Figure 2. Unlike traditional processing techniques, DNNs can self-learn, automatically acquiring pertinent features from raw or unstructured data. Domains in which DNNs have been successfully applied include speech recognition, natural language processing (NLP), bioengineering, and imaging [21].
The proposed architecture utilizes fully connected layers to locate m6Am sites in RNA sequences. The input layer comprises 42 nodes linked to a first hidden layer of 32 nodes through weighted connections. A second hidden layer with 16 nodes processes the outputs of the first, followed by a third and final hidden layer with 8 nodes. Each hidden layer employs the rectified linear unit (ReLU) activation function, enabling the model to capture nonlinear relationships and complex patterns [22]. The output layer uses the sigmoid activation function for binary classification, distinguishing m6Am from non-m6Am sites in RNA sequences.
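A minimal sketch of this 42-32-16-8-1 topology as a NumPy forward pass, assuming Xavier-initialized, untrained weights; this is our illustration of the described architecture, not the authors' implementation:

```python
import numpy as np

rng = np.random.default_rng(12345)

def relu(x):
    return np.maximum(0.0, x)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

# Layer sizes described in the text: 42 input nodes -> 32 -> 16 -> 8 -> 1 output
sizes = [42, 32, 16, 8, 1]
# Xavier (Glorot) initialization, matching Table 3
params = [(rng.normal(0.0, np.sqrt(2.0 / (m + n)), size=(m, n)), np.zeros(n))
          for m, n in zip(sizes[:-1], sizes[1:])]

def forward(x):
    """ReLU in the hidden layers, sigmoid at the output for binary
    m6Am / non-m6Am classification."""
    for layer, (W, b) in enumerate(params):
        x = x @ W + b
        x = sigmoid(x) if layer == len(params) - 1 else relu(x)
    return x

probs = forward(rng.normal(size=(5, 42)))  # 5 feature vectors -> 5 probabilities
```

In practice the weights would be learned with the Adam optimizer and the regularization settings listed in Table 3.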
The Deep-m6Am performance is rigorously evaluated using key metrics, including accuracy (ACC), sensitivity (SN), specificity (SP), Matthews correlation coefficient (MCC), and area under the curve (AUC) [23]. SN measures the model's ability to accurately identify true m6Am sites, while SP evaluates its capacity to predict negative cases correctly. ACC reflects the overall correctness of predictions, MCC provides a balanced classification performance assessment, and AUC highlights the model's ability to distinguish between positive and negative instances. These metrics comprehensively evaluate the model's predictive power, ensuring its reliability and effectiveness in identifying m6Am sites.
These metrics are defined as:

SN = T+ / (T+ + F−)
SP = T− / (T− + F+)
ACC = (T+ + T−) / (T+ + T− + F+ + F−)
MCC = (T+ T− − F+ F−) / √((T+ + F+)(T+ + F−)(T− + F+)(T− + F−))

where T+ symbolizes true positives, F+ false positives, T− true negatives, and F− false negatives.
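These definitions translate directly into code; the confusion-matrix counts below are hypothetical, chosen only to exercise the formulas:

```python
import math

def classification_metrics(tp, fp, tn, fn):
    """ACC, SN, SP, and MCC from confusion-matrix counts."""
    sn = tp / (tp + fn)                       # sensitivity (recall)
    sp = tn / (tn + fp)                       # specificity
    acc = (tp + tn) / (tp + fp + tn + fn)     # overall accuracy
    mcc = ((tp * tn - fp * fn)
           / math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn)))
    return acc, sn, sp, mcc

# Hypothetical counts on a balanced test set of 710 sequences
acc, sn, sp, mcc = classification_metrics(tp=293, fp=62, tn=293, fn=62)
```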
In this section, we analyze the hyperparameters of the Deep-m6Am model to optimize its performance. The key hyperparameters considered include the learning rate (LR), batch size, number of layers, neurons per layer, and dropout rate. A dropout rate of 0.5 and L2 regularization (0.001) are applied to prevent overfitting, while Xavier initialization ensures a stable weight distribution. The model is trained using the Adam optimizer with a learning rate of 0.01 and a momentum of 0.9 to accelerate convergence. Training is conducted for 100 epochs, using ReLU activation functions in the hidden layers and softmax activation in the output layer. A grid search was employed to assess the model's performance under various hyperparameter combinations, focusing on those that most strongly influence DNN performance: the activation function, learning rate, and number of iterations. Table 3 presents the optimal hyperparameters for Deep-m6Am.
| Parameter | Optimal value |
| --- | --- |
| Dropout rate | 0.5 |
| Weight initialization function | Xavier |
| Seed | 12345L |
| Number of hidden layers | 3 |
| Optimizer | Adam, SGD |
| L2 regularization | 0.001 |
| Epochs | 100 |
| Learning rate | 0.01 |
| Batch size | 16 |
| Activation functions | ReLU, Softmax |
| Momentum | 0.9 |
In this section, we conduct the performance analysis of the proposed Deep-m6Am model. We conducted experiments to examine the effects of LR. Table 4 presents a detailed comparison of performance metrics across different learning rates and shows how the chosen learning rate significantly impacts the model's effectiveness and reliability.
| LR | ACC (%) | SN (%) | SP (%) | MCC |
| --- | --- | --- | --- | --- |
| 0.01 | 83.43 | 82.64 | 84.22 | 0.669 |
| 0.02 | 80.05 | 79.71 | 80.38 | 0.601 |
| 0.03 | 79.43 | 80.27 | 78.58 | 0.589 |
| 0.04 | 78.86 | 78.58 | 79.14 | 0.577 |
| 0.05 | 78.70 | 75.00 | 82.40 | 0.672 |
As shown in Table 4, the Deep-m6Am model achieves optimal performance with a learning rate of 0.01, attaining the highest accuracy (ACC) of 83.43%, sensitivity (SN) of 82.64%, specificity (SP) of 84.22%, and MCC of 0.669. However, as the learning rate increases, the model's performance declines, highlighting that excessively higher learning rates negatively influence overall metrics.
Furthermore, in Figure 3, we analyze the fluctuation of the model's performance across different dropout rates, offering guidance for optimizing this hyperparameter. Proper tuning is crucial for balancing generalization and overfitting prevention, ensuring a robust and reliable model. Figure 3 shows that the model achieves optimal performance at a dropout rate of 0.5, with the highest ACC (83.43%) and MCC (0.669). Performance improves as the dropout rate increases from 0.1 to 0.5, highlighting 0.5 as the most effective rate for balancing generalization and accuracy.
Moreover, we analyze the effect of varying batch sizes on model performance, comparing outcomes across different sizes to identify the optimal configuration. Figure 4 illustrates the impact of batch size, showing a decline in performance as the batch size increases from 16 to 256. The model achieves optimal performance at a batch size of 16, with the highest ACC (83.43%) and MCC (0.669). As the batch size increases, performance gradually decreases, emphasizing the importance of tuning this hyperparameter.
Evaluating the robustness of statistical learning models is essential, and this is typically achieved through validation techniques such as jackknife, k-fold cross-validation, and subsampling. Among these methods, k-fold cross-validation is particularly effective for objectively assessing model performance by dividing the dataset into multiple test sets. This approach ensures a thorough evaluation of the model's generalizability and reliability. Table 5 presents a performance comparison of the proposed Deep-m6Am model using various feature extraction techniques, including individual, hybrid, and SHAP-based feature selection methods.
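A minimal sketch of the k-fold splitting procedure used for this evaluation (our own illustration, not the authors' code):

```python
import numpy as np

def kfold_indices(n_samples, k=5, seed=42):
    """Shuffle sample indices and yield (train, test) index pairs; each of
    the k folds serves exactly once as the held-out test set."""
    rng = np.random.default_rng(seed)
    folds = np.array_split(rng.permutation(n_samples), k)
    for i in range(k):
        test = folds[i]
        train = np.concatenate([folds[j] for j in range(k) if j != i])
        yield train, test

# e.g. 3548 sequences (Table 1) split into 5 folds of ~710 samples each
splits = list(kfold_indices(3548, k=5))
```

Averaging the metrics over the k test folds gives the cross-validation figures reported in Tables 4-6.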
| Method | ACC (%) | SN (%) | SP (%) | MCC |
| --- | --- | --- | --- | --- |
| PseSNC | 71.70 | 66.55 | 76.40 | 0.570 |
| PseDNC | 77.73 | 76.89 | 78.58 | 0.555 |
| PseTNC | 79.43 | 80.27 | 78.58 | 0.589 |
| Hybrid features | 80.83 | 80.27 | 81.40 | 0.617 |
| Hybrid features after SHAP | 83.43 | 82.64 | 84.22 | 0.669 |
Table 5 highlights the varying predictive performance of individual features, with PseSNC, PseDNC, and PseTNC achieving ACCs of 71.70%, 77.73%, and 79.43%, respectively. The hybrid feature approach significantly improves classification, reaching an ACC of 80.83%. Further enhancement through SHAP-based feature selection optimizes feature importance, achieving the highest ACC (83.43%) and MCC (0.669). These results underscore the effectiveness of hybrid features in capturing complex patterns and the role of SHAP in refining feature selection for improved model performance.
In this section, we provide an analysis of the DNN model in comparison to well-known machine learning algorithms such as K-nearest neighbor (KNN), random forest (RF), decision tree (DT), naive Bayes (NB), and support vector machine (SVM) [16],[24]–[26]. Table 6 illustrates the importance of evaluating model performance across different classifiers. We employed a 5-fold cross-validation scheme to ensure a reliable and unbiased performance assessment.
| Classifier | ACC (%) | SN (%) | SP (%) | MCC |
| --- | --- | --- | --- | --- |
| RF | 68.72 | 66.91 | 70.53 | 0.667 |
| DT | 71.80 | 70.22 | 73.38 | 0.706 |
| KNN | 77.15 | 75.62 | 78.68 | 0.741 |
| NB | 79.99 | 78.35 | 81.63 | 0.712 |
| SVM | 82.53 | 81.96 | 83.09 | 0.651 |
| Deep-m6Am | 83.43 | 82.64 | 84.22 | 0.669 |
Table 6 shows that Deep-m6Am outperforms the other ML algorithms, achieving the highest ACC (83.43%) and MCC (0.669). SVM follows with an ACC of 82.53%, while NB and KNN achieve 79.99% and 77.15%, respectively. DT (71.80%) and RF (68.72%) perform lower. These results highlight Deep-m6Am as the most effective model for m6Am site identification. To analyze further, we evaluated the proposed model using the area under the ROC curve (AUC), as shown in Figure 5. Figure 5 shows that the proposed model achieved an AUC of 0.853, indicating excellent performance compared with widely used ML algorithms.
Furthermore, Table 7 evaluates the various ML algorithms on an independent dataset to assess generalizability and robustness. From Table 7, the proposed Deep-m6Am model demonstrated superior performance among the ML classifiers, achieving an ACC of 82.86% with an MCC of 0.657. The SVM classifier had an ACC of 81.64% with an MCC of 0.632. In contrast, KNN achieved an ACC of 75.45%, while NB, DT, and RF reached 77.29%, 68.88%, and 66.22%, respectively. This analysis identifies Deep-m6Am as the top-performing model, showcasing its superiority in handling dataset complexities and ensuring reliable and accurate m6Am site prediction.
| Classifier | ACC (%) | SN (%) | SP (%) | MCC |
| --- | --- | --- | --- | --- |
| RF | 66.22 | 64.50 | 68.00 | 0.325 |
| DT | 68.88 | 67.10 | 70.66 | 0.378 |
| KNN | 75.45 | 73.20 | 77.70 | 0.509 |
| NB | 77.29 | 75.20 | 79.38 | 0.545 |
| SVM | 81.64 | 79.50 | 83.78 | 0.632 |
| Deep-m6Am | 82.86 | 83.65 | 82.07 | 0.657 |
In this section, we conduct a detailed comparative analysis of the proposed Deep-m6Am predictor against several state-of-the-art predictors, including MultiRM [6], m6AmPred [7], DLm6Am [8], and EMDL_m6Am [9]. This comparison is shown in Table 8.
| Predictor | ACC (%) | SN (%) | SP (%) | MCC |
| --- | --- | --- | --- | --- |
| MultiRM [6] | 71.13 | 78.59 | 63.66 | 0.427 |
| m6AmPred [7] | 73.10 | 72.11 | 74.08 | 0.462 |
| DLm6Am [8] | 79.55 | 81.71 | 77.40 | 0.591 |
| EMDL_m6Am [9] | 80.98 | 82.25 | 79.72 | 0.619 |
| Deep-m6Am | 82.86 | 83.65 | 82.07 | 0.657 |
From Table 8, the MultiRM achieved an ACC of 71.13% and MCC of 0.427, while m6AmPred had an ACC of 73.10% and MCC of 0.462. DLm6Am demonstrated an ACC of 79.55% and MCC of 0.591, and EMDL_m6Am obtained an ACC of 80.98% and MCC of 0.619. In comparison, the proposed Deep-m6Am outperformed all these models, achieving the highest ACC of 82.86% and MCC of 0.657. These results highlight the superior predictive accuracy and robustness of Deep-m6Am for m6Am site identification, making it the most effective model among the evaluated predictors.
The biological function of N6,2′-O-dimethyladenosine (m6Am) in RNA sequences underscores its critical role in regulating post-transcriptional processes, RNA stability, and translation. This study introduced the Deep-m6Am model, which employs a hybrid feature extraction approach, SHAP (SHapley Additive exPlanations)-based feature selection, and a DNN classifier to precisely identify m6Am sites within RNA sequences. In 5-fold cross-validation against popular ML methods, Deep-m6Am demonstrated clear advantages, yielding more precise m6Am site predictions. Furthermore, the proposed model showed superior performance metrics, achieving an average accuracy of 82.86% compared to existing models. These results underscore the potential of Deep-m6Am as a reliable and efficient tool for advancing RNA modification analysis.
Future research could expand Deep-m6Am to analyze other RNA modifications and integrate multi-omics data for enhanced predictive accuracy. Exploring its role in disease-specific studies could advance precision medicine. Optimizing computational efficiency through transfer learning, hyperparameter optimization, and parallel programming will improve the model's scalability and applicability in RNA biology and medical research [27].
[1] Zubizarreta E, Van Dyk J, Lievens Y (2017) Analysis of global radiotherapy needs and costs by geographic region and income level. Clin Oncol 29: 84-92. doi: 10.1016/j.clon.2016.11.011
[2] He C, Chow JCL (2016) Gold nanoparticle DNA damage in radiotherapy: A Monte Carlo study. AIMS Bioeng 3: 352-361. doi: 10.3934/bioeng.2016.3.352
[3] Berbeco RI, Korideck H, Ngwa W, et al. (2012) DNA damage enhancement from gold nanoparticles for clinical MV photon beams. Radiat Res 178: 604-608. doi: 10.1667/RR3001.1
[4] Boudaïffa B, Cloutier P, Hunting D, et al. (2000) Resonant formation of DNA strand breaks by low-energy (3 to 20 eV) electrons. Science 287: 1658-1660. doi: 10.1126/science.287.5458.1658
[5] Killoran JH, Kooy HM, Gladstone DJ, et al. (1997) A numerical simulation of organ motion and daily setup uncertainties: implications for radiation therapy. Int J Radiat Oncol Biol Phys 37: 213-221. doi: 10.1016/S0360-3016(96)00477-4
[6] Jaffray DA, Siewerdsen JH, Wong JW, et al. (2002) Flat-panel cone-beam computed tomography for image-guided radiation therapy. Int J Radiat Oncol Biol Phys 53: 1337-1349. doi: 10.1016/S0360-3016(02)02884-5
[7] Groh BA, Siewerdsen JH, Drake DG, et al. (2002) A performance comparison of flat-panel imager-based MV and kV cone-beam CT. Med Phys 29: 967-975. doi: 10.1118/1.1477234
[8] Jaffray DA (2012) Image-guided radiotherapy: from current concept to future perspectives. Nat Rev Clin Oncol 9: 688-699. doi: 10.1038/nrclinonc.2012.194
[9] Chen GTY, Sharp GC, Mori S (2009) A review of image-guided radiotherapy. Radiol Phys Tech 2: 1-12. doi: 10.1007/s12194-008-0045-y
[10] van der Meer S, Bloemen-van Gurp E, Hermans J, et al. (2013) Critical assessment of intramodality 3D ultrasound imaging for prostate IGRT compared to fiducial markers. Med Phys 40: 071707. doi: 10.1118/1.4808359
[11] Boda-Heggemann J, Lohr F, Wenz F, et al. (2011) kV cone-beam CT-based IGRT. Strahlenther Onkol 187: 284-291. doi: 10.1007/s00066-011-2236-4
[12] Abdulle A, Chow JCL (2019) Contrast enhancement for portal imaging in nanoparticle-enhanced radiotherapy: A Monte Carlo phantom evaluation using flattening-filter-free photon beams. Nanomaterials 9: 920. doi: 10.3390/nano9070920
[13] Nagesha DK, Tada DB, Stambaugh CKK, et al. (2010) Radiosensitizer-eluting nanocoatings on gold fiducials for biological in-situ image-guided radiotherapy (BIS-IGRT). Phys Med Biol 55: 6039. doi: 10.1088/0031-9155/55/20/001
[14] Chow JCL (2018) Recent progress in Monte Carlo simulation on gold nanoparticle radiosensitization. AIMS Biophys 5: 231-244. doi: 10.3934/biophy.2018.4.231
[15] Chow JCL (2017) Application of nanoparticle materials in radiation therapy. Handbook of Ecomaterials. Switzerland: Springer, 1-21.
[16] Chow JCL (2018) Monte Carlo nanodosimetry in gold nanoparticle-enhanced radiotherapy. Recent Advancements and Applications in Dosimetry. New York: Nova Science Publishers.
[17] Reuveni T, Motiei M, Romman Z, et al. (2011) Targeted gold nanoparticles enable molecular CT imaging of cancer: an in vivo study. Int J Nanomed 6: 2859-2864.
[18] Hepel M, Stobiecka M (2012) Detection of oxidative stress biomarkers using functional gold nanoparticles. Fine Particles in Medicine and Pharmacy. Boston: Springer, 241-281. doi: 10.1007/978-1-4614-0379-1_9
[19] Stobiecka M, Ratajczak K, Jakiela S (2019) Toward early cancer detection: Focus on biosensing systems and biosensors for an anti-apoptotic protein survivin and survivin mRNA. Biosens Bioelectron 137: 58-71. doi: 10.1016/j.bios.2019.04.060
[20] Leung MKK, Chow JCL, Chithrani BD, et al. (2011) Irradiation of gold nanoparticles by x-rays: Monte Carlo simulation of dose enhancements and the spatial properties of the secondary electrons production. Med Phys 38: 624-631. doi: 10.1118/1.3539623
[21] Bertelsen A, Schytte T, Bentzen SM, et al. (2011) Radiation dose response of normal lung assessed by cone beam CT: a potential tool for biologically adaptive radiation therapy. Radiother Oncol 100: 351-355. doi: 10.1016/j.radonc.2011.08.012
[22] Stock M, Palm A, Altendorfer A, et al. (2012) IGRT induced dose burden for a variety of imaging protocols at two different anatomical sites. Radiother Oncol 102: 355-363. doi: 10.1016/j.radonc.2011.10.005
[23] Chow JCL, Leung MKK, Islam MK, et al. (2008) Evaluation of the effect of patient dose from cone beam computed tomography on prostate IMRT using Monte Carlo simulation. Med Phys 35: 52-60. doi: 10.1118/1.2815625
[24] Chow JCL (2009) Cone-beam CT dosimetry for the positional variation in isocenter: A Monte Carlo study. Med Phys 36: 3512-3520. doi: 10.1118/1.3166934
![]() |
[25] |
Jia X, Yan H, Gu X, et al. (2012) Fast Monte Carlo simulation for patient-specific CT/CBCT imaging dose calculation. Phys Med Biol 57: 577. doi: 10.1088/0031-9155/57/3/577
![]() |
[26] | Kawrakow I (2001) The EGSnrc code system, Monte Carlo simulation of electron and photon transport. NRCC Report Pirs-701. |
[27] | Rogers DWO, Walters B, Kawrakow I (2009) BEAMnrc Users Manual. NRCC Report Pirs-509. |
[28] | Walters B, Kawrakow I, Rogers DWO (2005) DOSXYZnrc Users Manual. NRCC Report Pirs-794. |
[29] |
Ding GX, Coffey CW (2010) Beam characteristics and radiation output of a kilovoltage cone-beam CT. Phys Med Biol 55: 5231-5248. doi: 10.1088/0031-9155/55/17/022
![]() |
[30] |
Albayedh F, Chow JCL (2018) Monte Carlo simulation on the imaging contrast enhancement in nanoparticle-enhanced radiotherapy. J Med Phys 43: 195-199. doi: 10.4103/jmp.JMP_141_17
![]() |
[31] |
Chow JCL (2016) Photon and electron interactions with gold nanoparticles: A Monte Carlo study on gold nanoparticle-enhanced radiotherapy. Nanobiomater Med Imaging 8: 45-70. doi: 10.1016/B978-0-323-41736-5.00002-9
![]() |
[32] |
Alaei P, Spezi E (2015) Imaging dose from cone beam computed tomography in radiation therapy. Phys Med 31: 647-658. doi: 10.1016/j.ejmp.2015.06.003
![]() |
[33] |
Brahme A (1988) Accuracy requirements and quality assurance of external beam therapy with photons and electrons. Acta Oncol 27: 1-76. doi: 10.3109/02841868809105001
![]() |
[34] |
Schuemann J, Berbeco R, Chithrani DB, et al. (2016) Roadmap to clinical use of gold nanoparticles for radiation sensitization. Int J Radiation Oncol Biol Phys 94: 189-205. doi: 10.1016/j.ijrobp.2015.09.032
![]() |
[35] |
Hainfeld JF, Slatkin DN, Focella TM, et al. (2006) Gold nanoparticles: a new X-ray contrast agent. Br J Radiol 79: 248-253. doi: 10.1259/bjr/13169882
![]() |
[36] |
Casanelli B, Santibáñez M, Valente M (2020) Particle size effect on fluorescence emission for Au-infused soft tissues. Rad Phys Chem 167: 108302. doi: 10.1016/j.radphyschem.2019.04.052
![]() |
| Dataset | Number of samples | Positive samples | Negative samples |
| --- | --- | --- | --- |
| Cross validation | 3548 | 1774 | 1774 |
| Training dataset | 2838 | 1419 | 1419 |
| Independent dataset | 710 | 355 | 355 |
| Feature extraction method | Features |
| --- | --- |
| Pseudo single nucleotide composition (PseSNC) | 4 |
| Pseudo dinucleotide composition (PseDNC) | 16 |
| Pseudo trinucleotide composition (PseTNC) | 64 |
| Hybrid features | 84 |
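The feature dimensionalities follow from the size of the nucleotide alphabet: 4^1, 4^2, and 4^3 composition terms, concatenated into an 84-dimensional hybrid vector. As a minimal sketch of the composition part only (the "pseudo" components of PseSNC/PseDNC/PseTNC, which add sequence-order correlation terms, are omitted here), the k-mer frequencies can be computed as:

```python
from itertools import product

def kmer_composition(seq, k):
    """Normalized k-mer frequency vector over the RNA alphabet (4**k dims)."""
    kmers = ["".join(p) for p in product("ACGU", repeat=k)]
    counts = {km: 0 for km in kmers}
    for i in range(len(seq) - k + 1):
        window = seq[i:i + k]
        if window in counts:  # skip windows with ambiguous bases
            counts[window] += 1
    total = max(len(seq) - k + 1, 1)
    return [counts[km] / total for km in kmers]

def hybrid_features(seq):
    """Concatenate 1-, 2-, and 3-mer compositions: 4 + 16 + 64 = 84 features."""
    return (kmer_composition(seq, 1)
            + kmer_composition(seq, 2)
            + kmer_composition(seq, 3))
```

For example, `hybrid_features("ACGUACGUACGU")` yields an 84-element vector whose first four entries are the mononucleotide frequencies.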
| Parameter | Optimal value |
| --- | --- |
| Dropout rate | 0.5 |
| Weight initialization function | Xavier |
| Seed | 12345L |
| Dropout | 0.001 |
| Number of hidden layers | 3 |
| Optimizer | Adam, SGD |
| L2 regularization | 0.001 |
| Epochs | 100 |
| Learning rate | 0.01 |
| Batch size | 16 |
| Activation functions | ReLU, Softmax |
| Momentum | 0.9 |
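The table describes a fully connected network with three hidden layers, Xavier-initialized weights, ReLU activations, and a softmax output. As a rough NumPy illustration of that forward pass (the hidden-layer widths are assumptions for the sketch, not taken from the table, and training is not shown):

```python
import numpy as np

rng = np.random.default_rng(12345)  # the table fixes a seed for reproducibility

def relu(x):
    return np.maximum(0.0, x)

def softmax(x):
    e = np.exp(x - x.max(axis=-1, keepdims=True))  # shift for numerical stability
    return e / e.sum(axis=-1, keepdims=True)

def xavier(n_in, n_out):
    """Xavier/Glorot uniform initialization, as listed in the table."""
    limit = np.sqrt(6.0 / (n_in + n_out))
    return rng.uniform(-limit, limit, size=(n_in, n_out))

# 84 hybrid features in, three hidden layers (widths assumed), 2-class softmax out
sizes = [84, 64, 32, 16, 2]
weights = [xavier(a, b) for a, b in zip(sizes[:-1], sizes[1:])]
biases = [np.zeros(b) for b in sizes[1:]]

def forward(x):
    for w, b in zip(weights[:-1], biases[:-1]):
        x = relu(x @ w + b)
    return softmax(x @ weights[-1] + biases[-1])

probs = forward(rng.random((1, 84)))  # one row of class probabilities
```

The softmax row sums to one, giving the positive/negative class probabilities that the decision threshold is applied to.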
| LR | ACC (%) | SN (%) | SP (%) | MCC |
| --- | --- | --- | --- | --- |
| 0.01 | 83.43 | 82.64 | 84.22 | 0.669 |
| 0.02 | 80.05 | 79.71 | 80.38 | 0.601 |
| 0.03 | 79.43 | 80.27 | 78.58 | 0.589 |
| 0.04 | 78.86 | 78.58 | 79.14 | 0.577 |
| 0.05 | 78.70 | 75.00 | 82.40 | 0.672 |
| Method | ACC (%) | SN (%) | SP (%) | MCC |
| --- | --- | --- | --- | --- |
| PseSNC | 71.70 | 66.55 | 76.40 | 0.570 |
| PseDNC | 77.73 | 76.89 | 78.58 | 0.555 |
| PseTNC | 79.43 | 80.27 | 78.58 | 0.589 |
| Hybrid features | 80.83 | 80.27 | 81.40 | 0.617 |
| Hybrid features after SHAP | 83.43 | 82.64 | 84.22 | 0.669 |
| Classifier | ACC (%) | SN (%) | SP (%) | MCC |
| --- | --- | --- | --- | --- |
| RF | 68.72 | 66.91 | 70.53 | 0.667 |
| DT | 71.80 | 70.22 | 73.38 | 0.706 |
| KNN | 77.15 | 75.62 | 78.68 | 0.741 |
| NB | 79.99 | 78.35 | 81.63 | 0.712 |
| SVM | 82.53 | 81.96 | 83.09 | 0.651 |
| Deep-m6Am | 83.43 | 82.64 | 84.22 | 0.669 |
| Classifier | ACC (%) | SN (%) | SP (%) | MCC |
| --- | --- | --- | --- | --- |
| RF | 66.22 | 64.50 | 68.00 | 0.325 |
| DT | 68.88 | 67.10 | 70.66 | 0.378 |
| KNN | 75.45 | 73.20 | 77.70 | 0.509 |
| NB | 77.29 | 75.20 | 79.38 | 0.545 |
| SVM | 81.64 | 79.50 | 83.78 | 0.632 |
| Deep-m6Am | 82.86 | 83.65 | 82.07 | 0.657 |
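The tables report accuracy (ACC), sensitivity (SN), specificity (SP), and the Matthews correlation coefficient (MCC). These are the standard binary confusion-matrix metrics, which can be sketched as:

```python
import math

def binary_metrics(tp, tn, fp, fn):
    """ACC, SN, SP, and MCC from the counts of a binary confusion matrix."""
    acc = (tp + tn) / (tp + tn + fp + fn)   # overall fraction correct
    sn = tp / (tp + fn)                     # sensitivity: recall on positives
    sp = tn / (tn + fp)                     # specificity: recall on negatives
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return acc, sn, sp, mcc
```

For instance, `binary_metrics(50, 40, 10, 0)` gives ACC 0.90, SN 1.00, SP 0.80; unlike accuracy, MCC collapses toward 0 whenever one class is systematically misclassified, which is why it accompanies ACC in the tables.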
| Predictor | ACC (%) | SN (%) | SP (%) | MCC |
| --- | --- | --- | --- | --- |
| MultiRM [6] | 71.13 | 78.59 | 63.66 | 0.427 |
| m6AmPred [7] | 73.10 | 72.11 | 74.08 | 0.462 |
| DLm6Am [8] | 79.55 | 81.71 | 77.40 | 0.591 |
| EMDL_m6Am [9] | 80.98 | 82.25 | 79.72 | 0.619 |
| Deep-m6Am | 82.86 | 83.65 | 82.07 | 0.657 |