Research article

Glycation and secondary conformational changes of human serum albumin: study of the FTIR spectroscopic curve-fitting technique

  • The aim of this study was attempted to investigate both the glycation kinetics and protein secondary conformational changes of human serum albumin (HSA) after the reaction with ribose. The browning and fluorescence determinations as well as Fourier transform infrared (FTIR) microspectroscopy with a curve-fitting technique were applied. Various concentrations of ribose were incubated over a 12-week period at 37 ± 0.5 oC under dark conditions. The results clearly shows that the glycation occurred in HSA-ribose reaction mixtures was markedly increased with the amount of ribose used and incubation time, leading to marked alterations of protein conformation of HSA after FTIR determination.
    In addition, the browning intensity of reaction solutions were colored from light to deep brown, as determined by optical observation. The increase in fluorescence intensity from HSA–ribose mixtures seemed to occur more quickly than browning, suggesting that the fluorescence products were produced earlier on in the process than compounds causing browning. Moreover, the predominant α-helical composition of HSA decreased with an increase in ribose concentration and incubation time, whereas total β-structure and random coil composition increased, as determined by curve-fitted FTIR microspectroscopy analysis. We also found that the peak intensity ratios at 1044 cm−1/1542 cm−1 markedly decreased prior to 4 weeks of incubation, then almost plateaued, implying that the consumption of ribose in the glycation reaction might have been accelerated over the first 4 weeks of incubation, and gradually decreased. This study first evidences that two unique IR peaks at 1710 cm−1 [carbonyl groups of irreversible products produced by the reaction and deposition of advanced glycation end products (AGEs)] and 1621 cm−1 (aggregated HSA molecules) were clearly observed from the curve-fitted FTIR spectra of HSA-ribose mixtures over the course of incubation time. This study clearly suggests that FTIR spectroscopic curve-fitting technique may be easily used to allow determining the marked changes in the secondary conformational structure and protein aggregation of HSA during ribosylation as well as the production of AGEs.

    Citation: Yu-Ting Huang, Hui-Fen Liao, Shun-Li Wang, Shan-Yang Lin. Glycation and secondary conformational changes of human serum albumin: study of the FTIR spectroscopic curve-fitting technique[J]. AIMS Biophysics, 2016, 3(2): 247-260. doi: 10.3934/biophy.2016.2.247

    Related Papers:

    [1] Eunha Shim . Optimal strategies of social distancing and vaccination against seasonal influenza. Mathematical Biosciences and Engineering, 2013, 10(5&6): 1615-1634. doi: 10.3934/mbe.2013.10.1615
    [2] Pannathon Kreabkhontho, Watchara Teparos, Thitiya Theparod . Potential for eliminating COVID-19 in Thailand through third-dose vaccination: A modeling approach. Mathematical Biosciences and Engineering, 2024, 21(8): 6807-6828. doi: 10.3934/mbe.2024298
    [3] Giulia Luebben, Gilberto González-Parra, Bishop Cervantes . Study of optimal vaccination strategies for early COVID-19 pandemic using an age-structured mathematical model: A case study of the USA. Mathematical Biosciences and Engineering, 2023, 20(6): 10828-10865. doi: 10.3934/mbe.2023481
    [4] Eunha Shim . Prioritization of delayed vaccination for pandemic influenza. Mathematical Biosciences and Engineering, 2011, 8(1): 95-112. doi: 10.3934/mbe.2011.8.95
    [5] Rinaldo M. Colombo, Mauro Garavello . Optimizing vaccination strategies in an age structured SIR model. Mathematical Biosciences and Engineering, 2020, 17(2): 1074-1089. doi: 10.3934/mbe.2020057
    [6] Holly Gaff, Elsa Schaefer . Optimal control applied to vaccination and treatment strategies for various epidemiological models. Mathematical Biosciences and Engineering, 2009, 6(3): 469-492. doi: 10.3934/mbe.2009.6.469
    [7] Tetsuro Kobayashi, Hiroshi Nishiura . Prioritizing COVID-19 vaccination. Part 2: Real-time comparison between single-dose and double-dose in Japan. Mathematical Biosciences and Engineering, 2022, 19(7): 7410-7424. doi: 10.3934/mbe.2022350
    [8] Diána H. Knipl, Gergely Röst . Modelling the strategies for age specific vaccination scheduling during influenza pandemic outbreaks. Mathematical Biosciences and Engineering, 2011, 8(1): 123-139. doi: 10.3934/mbe.2011.8.123
    [9] Ayako Suzuki, Hiroshi Nishiura . Transmission dynamics of varicella before, during and after the COVID-19 pandemic in Japan: a modelling study. Mathematical Biosciences and Engineering, 2022, 19(6): 5998-6012. doi: 10.3934/mbe.2022280
    [10] Toshikazu Kuniya, Taisuke Nakata, Daisuke Fujii . Optimal vaccine allocation strategy: Theory and application to the early stage of COVID-19 in Japan. Mathematical Biosciences and Engineering, 2024, 21(6): 6359-6371. doi: 10.3934/mbe.2024277
  • The aim of this study was attempted to investigate both the glycation kinetics and protein secondary conformational changes of human serum albumin (HSA) after the reaction with ribose. The browning and fluorescence determinations as well as Fourier transform infrared (FTIR) microspectroscopy with a curve-fitting technique were applied. Various concentrations of ribose were incubated over a 12-week period at 37 ± 0.5 oC under dark conditions. The results clearly shows that the glycation occurred in HSA-ribose reaction mixtures was markedly increased with the amount of ribose used and incubation time, leading to marked alterations of protein conformation of HSA after FTIR determination.
    In addition, the browning intensity of reaction solutions were colored from light to deep brown, as determined by optical observation. The increase in fluorescence intensity from HSA–ribose mixtures seemed to occur more quickly than browning, suggesting that the fluorescence products were produced earlier on in the process than compounds causing browning. Moreover, the predominant α-helical composition of HSA decreased with an increase in ribose concentration and incubation time, whereas total β-structure and random coil composition increased, as determined by curve-fitted FTIR microspectroscopy analysis. We also found that the peak intensity ratios at 1044 cm−1/1542 cm−1 markedly decreased prior to 4 weeks of incubation, then almost plateaued, implying that the consumption of ribose in the glycation reaction might have been accelerated over the first 4 weeks of incubation, and gradually decreased. This study first evidences that two unique IR peaks at 1710 cm−1 [carbonyl groups of irreversible products produced by the reaction and deposition of advanced glycation end products (AGEs)] and 1621 cm−1 (aggregated HSA molecules) were clearly observed from the curve-fitted FTIR spectra of HSA-ribose mixtures over the course of incubation time. This study clearly suggests that FTIR spectroscopic curve-fitting technique may be easily used to allow determining the marked changes in the secondary conformational structure and protein aggregation of HSA during ribosylation as well as the production of AGEs.


    Attention deficit hyperactivity disorder (ADHD) is a common neurodevelopmental disorder that is diagnosed in childhood. It is characterized by impulsivity, inattention, and hyperactivity [1], which have a detrimental impact on children's learning, emotions, and social relationships. In recent years, pattern recognition has been applied to neurological disease diagnosis, and significant progress has been made in ADHD diagnosis through the develpopment of various classification approaches that utilize machine learning (ML) and deep learning (DL) techniques [2,3]. Nowadays, improving diagnosis accuracy remains a practical challenge for current ADHD classification methods, primarily due to complex factors such as limited data size and noise disturbance in sampled data. Furthermore, the identification of discriminative biomarkers for ADHD represents an essential application requirement. These biomarkers can serve as keys to uncover the mechanisms underlying ADHD and facilitate more convenient and accurate diagnosis and treatment. Accomplishing this task requires achieving satisfactory accuracy and effectively addressing the challenge of interpreting features obtained through the use of learning algorithms.

    As we know, the type of biosignal is crucial in ADHD classification and biomarker detection. Suitable biosignals can extremely improve ADHD classification accuracy and yield highly credible biomarker results. Here, the magnetic resonance imaging (MRI) technique provides plenty of metrics, such as cortical thickness, gray matter probability, regional homogeneity (ReHo), amplitude of low-frequency fluctuation (ALFF), and functional connectivity (FC), to elucidate the brain status of patients. Voxel-level metrics describe the detailed brain state at a high spatial resolution. For example, complementary features can be extracted from voxel-level functional MRI (fMRI) and structural MRI (sMRI) data to characterize differences among subjects [4], while binary voxel-level ALFF feature maps are employed and input to an attention-based DL classification network [5]. However, the extraction of features from voxel-level data has significant problems, as learned features often suffer from noise interference due to the limited consideration of relationships among these voxel-level data. As a result, the performance of voxel-based ADHD classification methods is poor, greatly reducing the reliability of biomarker detection in these studies. Region-level metrics can be obtained by matching the original resolution fMRI data with a certain brain template, which preserves the functional similarity of the voxels within the region and effectively elevates the classification accuracy, while providing better interpretability for ADHD disease. Therefore, region-level metrics are preferred. For example, region-level FC has been frequently utilized for accurate ADHD classification and biomarker analysis [6,7,8]. However, FC is a numerical measure that assesses the correlation between different brain regions. Consequently, when utilized, the identified biomarkers are dispersed in practice [7], making it difficult to in focus on specific brain regions and undermining the interpretability of the obtained outcomes. Studies have shown that spontaneous low-frequency ($ < $ 0.08 Hz) fluctuations are highly synchronized between the motor cortices, while ALFFs imply spontaneous neuronal activity within the region [9]. Therefore, region-level ALFF has better functional aggregation, making it more suitable for the detection of biological markers of ADHD. We employed region-level ALFF for ADHD classification and biomarker detection in this study, where abnormal ALFF values serve as indicators of changes in ADHD-related brain regions.

    The effectivity of biomarker detection also depends on the detection approaches. As far as we know, there exist two major ways to elucidate ADHD biomarkers. 1) One way is statistical analysis, which compares the differences between ADHD group and control group data. Here, the characteristics associated with credible brain region differences were found by various statistical methods including the volumes of hippocampus and amygdala[10] and the activity intensity of the cerebellum area[9]. Unfortunately, these statistical conclusions have limited capacity in ADHD individual diagnosis, since the ADHD condition is different for each subject. 2) The other way is to employ feature selection or learning strategies during the ADHD classification procedure. For example, high-score FC vectors are determined using by a specific feature ranking method[6], and the learned convolution kernels are mapped to three-dimensional space via reconstruction techniques to determine the location of highlighted brain regions[11]. But, it is highly challenging to design a satisfactory classification method that is also capable of reliable biomarker detection. On one hand, existing classification methods with ML are good at employing feature selection to find typical features from the input raw data through the various effective algorithms, such as a support vector machine with recursive feature elimination (SVM-RFE) [12], the least absolute shrinkage and selection operator (LASSO) [13,14], and the elastic net [15]. However, ML-based methods frequently use linear classifiers for ADHD prediction, and they cannot fully describe the complex relationship between features and ADHD disease. Consequently, it has low accuracy [16] and does not provide reliable biomarkers. On the other hand, DL methods exhibit remarkable classification performance, as their flexible feature learning capability facilitates the learning of potential ADHD-related features. For example, three-dimensional [4] and four-dimensional [17] convolutional neural networks (CNNs) have been directly applied to sMRI and fMRI medical image data, exploring the time and spatial patterns of ADHD features in MRI data. Several attention mechanisms have been adopted to adaptively emphasize the learned discriminative features and thus improve the classification performance [18,19]. In addition, autoencoder (AE) networks have also been widely used in ADHD classification. A deep variational AE (DVAE) network and its advanced version, the spatiotemporal attention AE (STAAE) network, have yielded impressive classification results with an accuracy of over 93% [20,21]. However, in these cases, the learned features from DL-based methods are viewed as high-level features, which have poor interpretability and cannot meet the biomarker detection requirement. In summary, both methods for biomarker detection have limitations. However, it is an interesting idea to combine the two, as it would introduce the prior knowledge obtained from statistical analysis into the classification method to guide the classification and enhance the interpretability of related derived biomarkers.

    Attracted by the superior performance of the AENet-based binary hypothesis testing (BHT) framework, we have employed this model to elucidate the related biomarkers from regional ALFF data. The main contributions of the study are described as follows:

    1) We use prior knowledge to guide the classification procedure. Since ADHD children suffer in executive functions, cognitive and emotional control, we applied ALFF to the data on the 50 related brain regions in the limbic system and cerebellum. Moreover, using the attributes of ALFFs as a basis, we modified the existing AENet to be more suitable for feature learning on ALFF data. Considering that the ALFF data on limited brain regions cause the AENet to exhibit unstable feature learning and thus degenerate the classification performance, we have also introduced an ensemble learning strategy to enhance the accuracy. As a result, we achieved remarkable results on multiple datasets with an average accuracy of 93.3%.

    2) Regarding high-level classification performance, biomarker detection was effectively carried out. Several potential biomarkers were identified from the selected features by implementing an SVM-RFE algorithm in the BHT framework. Then, we employed a two-sample t-test between groups and correlation analysis based on the ADHD symptom scores to verify the rationality of these biomarkers. We found that brain regions, including the thalamus, hippocampus, and cerebellar lobule Ⅹ, were most discriminative in our experiments, which is in line with the existing statistical analysis reports. It further demonstrates that the biomarkers obtained via classification and statistical analyses exhibit consistency in the limbic system and cerebellum.

    The limbic system involves a group of regions in the paleocortex that support various functions related to emotion regulation and motivation [22]. In detail, it mainly consists of the amygdala, hippocampus, striatum, and thalamus, which are known to be implicated in ADHD. Several studies have confirmed significant statistical differences between ADHD and healthy control (HC) subjects in the limbic system. For example, structural integrity is impaired in brain regions such as the thalamus [23], caudate nucleus [24], and amygdala [25,26] in children with ADHD. Additionally, abnormal volumes have been observed in these above regions, as well as the hippocampus [27,28]. Moreover, FC analysis has revealed significant variations in neural connections involving the thalamus and hippocampus regions [29,30].

    Initially believed to be primarily involved in motor learning and coordination, the cerebellum is now recognized to play a significant role in cognition and emotion, thus making a vital contribution to the pathophysiology of ADHD [31]. Numerous studies have shown that the cerebellum undergoes the most noticeable structural and functional changes for ADHD [32]. Children with ADHD also exhibit a significant decrease in the ALFF signal in the bilateral cerebellum [9], accompanied by impaired structural integrity [33,34].

    In summary, statistical differences associated with ADHD have been found in these brain regions, which means that these findings highlight the potential of the limbic system and cerebellum to serve as a source of biomarkers for ADHD detection and assist in its classification process. Hence, using this prior knowledge, the limbic system and cerebellum can be used as input brain regions. In this way, highly correlated input data can be obtained, and the interpretability of the intermediate features generated via the classification method is also enhanced.

    To the best of our knowledge, two commonly employed classification frameworks exist in the field of ADHD diagnosis: the training-test framework and the hypothesis-test framework [7]. The training-test framework dominates ADHD classification, involving the learning of features from training data their comparison with those of test data to predict labels. However, this strategy proves inadequate for small-sized datasets, leading to unsatisfactory performance. These learned training-data features fail to fully encompass the characteristics of test data, resulting in a significant hindrance to accuracy improvement due to the limitation of sample number[7]. In contrast, the hypothesis-test framework follows a semi-supervised approach, where we have successfully designed a BHT framework for ADHD classification [35,36,37]. The core idea behind the BHT framework is to incorporate the test data, along with an assumed label, into the feature learning process for the training data. In scenarios in which the assumed label is incorrect, this feature learning process becomes disrupted because the test data introduces noise-like elements that are not present in the training data. As a result of comparing the training features learned under different assumptions, predicting the labels of the test data becomes easier. Therefore, the BHT framework is suitable for research on the ADHD-200 dataset due to its anti-noise ability and ability to overcome the limitation of sample size.

    Figure 1 depicts the flowchart of the existing BHT framework, which utilizes FC as the raw data input (more details can be found in reference [7]). The test data were initially categorized as either HC ($ \mathcal{H}_{0} $) or ADHD ($ \mathcal{H}_{1} $) data. For feature selection, the SVM-RFE approach is employed to choose the N most characteristic connections for both the training and test data, wherein only the connections from the training data are retained to form two typical feature sets, $ X^{\mathcal{H}_{0} } $ and $ X^{\mathcal{H}_{1} } $. Subsequently, a feature extraction process is performed by using a modified AE network to process these feature sets and generate the corresponding high-level feature sets, $ \tilde{X} ^{\mathcal{H}_{0} } $ and $ \tilde{X} ^{\mathcal{H}_{1} } $. During the ADHD decision step, the inter-class and intra-class variability scores, $ D^{\mathcal{H}_{0} } $ and $ D^{\mathcal{H}_{1} } $, are computed for the high-level feature sets and compared. The assumption with a smaller variability score is identified as the true hypothesis, $ \tilde{\mathcal{H} } _{true} $, with the corresponding label assigned to the test data.

    Figure 1.  Flowchart of BHT framework. $ \mathcal{H}_{0} $ and $ \mathcal{H}_{1} $ correspond to different assumptions. Typical feature sets ($ X^{\mathcal{H}_{0} } $ and $ X^{\mathcal{H}_{1} } $) and high-level feature sets ($ \tilde{X} ^{\mathcal{H}_{0} } $ and $ \tilde{X} ^{\mathcal{H}_{1} } $) are obtained in the feature selection and feature extraction stages, respectively. Then, the variability scores ($ D^{\mathcal{H}_{0} } $ and $ D^{\mathcal{H}_{1} } $) are calculated in the decision stage for ADHD and the correct hypothesis ($ \tilde{\mathcal{H} } _{true} $) is selected based on the score.

    It should be noted that this FC-based model suffers from two problems in biomarker detection. Firstly, during feature selection, a total of 4005 connections are used, resulting in scattered characteristic connections. This severely undermines the meaningfulness of the identified biomarkers, as we believe that biomarkers should be localized in specific regions. Second, no prior knowledge is considered during feature selection. As a result, this purely data-driven framework may introduce bias and compromise the reliability of the biomarkers. To overcome these problems, we propose using a limited number of regional ALFFs in the limbic system and cerebellum for biomarker detection.

    All the data used in this study were taken from the Athena pipeline in the ADHD-200 preprocessed dataset of the (http://preprocessed-connectomes-project.org/adhd200/), which contains data of seven sites from the ADHD-200 competition dataset. Note that the University of Pittsburgh and Washington University in sT. Louis sites only contain HC samples, while there is relatively little research on the Oregon Health & Science University sites. Thus, we ultimately chose the remaining four sites for the experiment, including New York University Medical Center (NYU), Peking University (PU, comprising three PU sub-sites), Kennedy Krieger Institute (KKI), and NeuroIMAGE (NI). Detailed information on the samples from these four datasets is recorded in Table 1.

    Table 1.  Summary of several ADHD-200 sites.
    Site Age Female/Male Control ADHD Total
    NYU 7–18 76/140 98 118 216
    PU 8–17 52/142 116 78 194
    KKI 8–13 37/46 61 22 83
    NI 11–22 17/31 23 25 48

     | Show Table
    DownLoad: CSV

    We utilized a meticulous approach to extract regional ALFFs from the downloaded time course values of blood-oxygen-level-dependent (BOLD) signals recorded from the subjects in our study. To derive the voxel-based ALFFs, we implemented several standard operations, including filtering (band-pass 0.009–0.08 Hz) to eliminate noise, performing a fast Fourier transform, and calculating the power spectrum proportionally. Subsequently, we employed the automated anatomical labeling (AAL-116) atlas to divide the brain into distinct regions. Finally, regional ALFF values were obtained by averaging the voxel-based ALFF values within the regions of interest (ROIs), where a total of 50 ROIs are provided in Table 2 with their brain indices. But please note that there might be a slight deviation for the limbic system, which contains the entire cingulate gyrus including the anterior, middle, and posterior cingulate gyri.

    Table 2.  Used brain regions in limbic system and cerebellum.
    Region name Region index Region name Region index
    Olfactory cortex 21–22 Amygdala 41–42
    Insula 29–30 Caudate 71–72
    Anterior cingulate 31–32 Putamen 73–74
    Middle cingulate 33–34 Pallidum 75–76
    Posterior cingulate 35–36 Thalamus 77–78
    Hippocampus 37–38 Cerebellum 91–108
    Parahippocampal 39–40 Vermis 109–116

     | Show Table
    DownLoad: CSV

    We introduce the ALFF-based BHT framework for ADHD classification, wherein regional ALFFs replace FC data as the input raw data. Similar to the FC-based framework, we focus on enhancing the feature extraction step within the utilized AE network. The architecture of our ALFF-based AE network is illustrated in Figure 2. Within this diagram, the typical features (i.e., characteristic ALFF) of the training data are shown to undergo encoding via an encoder subnetwork to derive their high-level representations. These representations are then passed through a decoder subnetwork for reconstruction. Simultaneously, a classification subnetwork supervises the labeling of high-level features acquired through the encoder subnetwork. This strategy aims to retain category information related to the training data within the high-level features.

    Figure 2.  Structure of ALFF-based AE network. The proposed AE network includes encoding, decoding, and classification subnetworks. At the same time, the details of ResNet block in the classification subnetwork are also given.

    In contrast to the FC-based AE network [7], we have introduced several targeted modifications to our network architecture, making it specifically tailored to harness the attributes of ALFF. Notably, we have incorporated a subsequent ReLU into the dense layer within the decoding subnetwork. This adjustment ensures that the reconstructed ALFF maintains non-negative values. Furthermore, in practical implementation, our AE network employs the ALFF-based selected features with an output dimension (i.e., the output dimension of SVM-RFE) that has been reduced to 25, as opposed to the 50 dimensions pf the FC-based features. This conscious reduction mitigates the risk of overfitting that might arise when directly inputting these features into the FC-based AE network. Consequently, we have optimized the architecture of the classification subnetwork. Presently, the classification subnetwork employs just two residual network (ResNet) blocks, designed to process an input vector with dimensions of 8 $ \times $ 1. This departure from the 20 $ \times $ 1 input used in the FC-based AE network results in the generation of more efficacious high-level features. Lastly, we have meticulously detailed the parameters for each dense unit within our AE network in Table 3, note that the size of each dense layer is determined by grid search. The loss functions applied for the reconstructing of the selected features and prediction of the corresponding labels closely align with those employed in a previous study [7], ensuring methodological consistency.

    Table 3.  Size of dense layers.
    Layer size*
    Dense 1 (25, 10)
    Dense 2 (10, 25)
    Dense 3 (10, 8)
    Dense 4 (8, 8)
    Dense 5 (8, 2)
    * The parameters (a, b) describe a dense unit with input size of a and output size of b.

     | Show Table
    DownLoad: CSV

    Although adjusting the AE network to accommodate ALFF data represents a positive step, a prominent challenge persists. It centers around the inherent instability of the AE network, which directly impacts the final prediction outcomes. Throughout network training, the initial configuration of learned parameters wields substantial influence. Optimal initial values possess the capacity to expedite the training process and guide the network toward a state of stability. However, when handling a small-sized dataset, the effect of these initial values is disproportionately amplified, thereby compromising the network's robustness and introducing an element of uncertainty. In practical implementation, our ALFF-based AE network is susceptible to this uncertainty. Even when holding the ALFF input constant for both training and test data, the resulting ADHD prediction outcomes may exhibit minor variances with a low probability. This uncertainty significantly impedes the pursuit of refined ADHD classification accuracy, consequently undermining the efficacy of biomarker detection endeavors.

    In this study, we have employed ensemble learning as a powerful tool to address this challenge. Ensemble learning stands as a classical strategy that is renowned for mitigating issues related to data imbalance, model robustness, and uncertainty estimation. Its basic idea is to integrate several weak classifiers and build a fortified classifier, which engenders heightened reliability within the classification outcomes. The existing examples involve the collection of classifiers from the fields of ML and DL for application in various medical diagnostic scenarios, such as Parkinson's disease classification [38], anticancer peptide prediction [39], and virulence factor detection [40]. This strategy was also successfully utilized for ADHD diagnosis, where solid decision-making can be realized based on multimodal data [41].

    The proposed ensemble classification method is illustrated in Figure 3. For a given test data, the process involves generating the typical features sets ($ X^{\mathcal{H}_{0} } $ and $ X^{\mathcal{H}_{1} } $) of training data based on opposite test label assumptions in the feature selection step. Then, multiple pairs of AE networks are utilized, each initialized with random parameters, to acquire multiple pairs of high-level feature sets, $ \tilde{X} ^{\mathcal{H}_{0} } $ and $ \tilde{X} ^{\mathcal{H}_{1} } $. Subsequently, pairwise high-level features sets are compared to determine the true hypothesis and corresponding hypothesis labels ($ L_{true} $), which compares the intra-class and inter-class distances ratios of each feature set. The hypothesis label is considered as the prediction result of a baseline classifier. Finally, a hard voting strategy is then applied to these hypothesis labels. By assessing the frequency of label values (0 for ADHD and 1 for HC), class labels of the data are assigned higher frequency label values.

    Figure 3.  ALFF-based BHT framework. Different from the existing BHT framework, the voting strategy is applied to the output label ($ L_{true} $) of the ADHD decision stage, which improves the reliability of results. The ensemble label obtained by voting is regarded as the final prediction label of test data.

    As described for the BHT framework in Section 2.2, typical feature sets were selected by the SVM-RFE algorithm in the feature selection stage. Meanwhile, reliability weights $ W_{ji} $ of typical feature sets can be obtained during this process. Specifically, the SVM-RFE algorithm trains a linear SVM in one iteration to obtain a weight vector that fits to the input feature set. Then, the square of the weight vector is used as the criterion for judging the usefulness of features, and the feature with the smallest-valued criterion is removed. After removing a certain number of features over multiple iterations, a typical feature set with the expected dimensions is obtained. The square of the fitted linear SVM weight vector on this feature set is considered to comprise the reliability weights of the features.

    Reliability weights not only measure the contribution of features to classification they also serve as a criterion for subsequently extracting brain biomarkers. However, there are differences in the values and ranks of reliability weights at different sites. Moreover, under the binary hypothesis framework, the typical feature subset generated under the correct hypothesis is more valuable. Therefore, the weighted average of the feature reliability weights were designed to be from the correct hypotheses for the four sites, where the weighted value is the product of the number of people and the classification accuracy for the site. The weighted average result is seen as a more comprehensive measure to evaluate each feature, and it is called the feature score $ S_{i} $ for each ROI. The feature score can be defined as follows:

    $ Si=4j=0Accj×Nj×Wji4j=0Accj×Nj,
    $
    (3.1)

    where $ Acc_{j} $ and $ N_{j} $ respectively denote the classification accuracy after ensemble learning and data size of the j-th dataset, $ W_{ji} $ represents the reliability weight for the i-th ROI of the j-th dataset. This feature score provides enough convenience for finding ADHD biomarkers.

    The classification performance of our BHT framework with ALFF-based AE network is presented in Table 4, where the accuracies with and without ensemble learning are provided. Specifically, each site was subjected to 50 leave-one-out cross-validation (LOOCV) trials, and the average accuracy without ensemble learning strategies was obtained. The results with ensemble learning were derived from the average of 1000 hard voting trials, where each trial entailed randomly selecting seven hypothesis labels ($ L_{true} $) from the 50 LOOCV trials. During the AE network learning, an Adam optimizer is utilized to optimize the whole network. Hyperparameters of each site were determined through grid search, including the learning rate, training epoch, and the rate of dropout. After a certain number of epochs, the training loss converges and becomes stable. In addition, ablation experiments were designed to verify the role of components of the ALFF-based AE network, as well as under the BHT framework. 1) Default-classifier network: the classifier subnetwork was hidden, only the complete AE network was retained, and the entire network was trained by using the reconstruction loss. 2) Default-decoder network: the decoder subnetwork was hidden, the encoder degenerated into a dense layer, and the whole network was trained by using the cross-entropy loss. Later, ADHD decisions were made based on the encoder output features of the default-classifier network and the output features of the first dense layer of the default-decoder network, respectively. The results of ablation experiments are shown in Table 5, which shows that the hyperparameter and experimental details were identical to those of the ALFF-based AE network. The source code is available at https://github.com/BiolabHHU/ALFF-based-BHT.

    Table 4.  Classification performance for various datasets.
    Site Accuracy (%) Sensitivity (%) Specificity (%) AUROC AUPRC MCC F1
    Without ensemble learning
    NYU 91.94 90.59 93.55 0.9207 0.8913 0.8396 0.9239
    PU 80.32 78.87 81.29 0.8008 0.7984 0.5994 0.7697
    KKI 88.19 87.45 88.46 0.8796 0.8817 0.7445 0.8256
    NI 87.38 88.40 86.26 0.8733 0.8296 0.7488 0.8786
    Average 86.96 86.33 87.39 0.8686 0.8503 0.7331 0.8495
    With ensemble learning
    NYU 96.20 96.88 95.38 0.9616 0.9400 0.9250 0.9660
    PU 88.79 96.67 83.50 0.8982 0.9074 0.7801 0.8719
    KKI 95.05 98.90 93.66 0.9656 0.9815 0.9033 0.9276
    NI 93.23 98.32 87.70 0.9302 0.9174 0.8667 0.9374
    Average 93.32 97.69 90.06 0.9389 0.9366 0.8688 0.9257

     | Show Table
    DownLoad: CSV
    Table 5.  Comparison of classification accuracy of the three networks.
    Network NYU (%) PU (%) KKI (%) NI (%) Average (%)
    Without ensemble learning
    Default-classifier network 69.00 53.69 42.82 67.67 58.29
    Default-classifier network 89.83 80.26 86.84 86.79 85.93
    ALFF-based AE network 91.94 80.32 88.19 87.38 86.86
    With ensemble learning
    Default-classifier network 75.01 52.38 35.51 75.49 59.60
    Default-classifier network 95.14 85.45 91.29 92.39 91.07
    ALFF-based AE network 96.20 88.79 95.05 93.23 93.32

     | Show Table
    DownLoad: CSV

    In Table 4, one can see that an evident distinction emerges in the classification performance when comparing the results in the absence of ensemble learning to those in its presence, and it is manifested in the differences in metrics, including accuracy, sensitivity, specificity, area under the receiver operating characteristic curve (AUROC), area under the precision-recall curve (AUPRC), F1 score (F1), and apply capitalization correlation coefficient (MCC). The absence of ensemble learning resulted in notably inferior performance, as indicated by the average accuracy of 87.0%. Notably, when confronted with the PU dataset, the accuracy decreased to an unsatisfactory 80.3%. This decline is attributed to a high prevalence of comorbidity disorders among the ADHD children within this dataset, affecting 44 out of 78 cases. Conversely, the application of ensemble learning resulted in a marked enhancement in the accuracy metrics. The average accuracy significantly increased, reaching 93.3%. This positive impact is particularly evident in the case of the PU dataset, where accuracy improved dramatically from the aforementioned 80.3% to 88.8%. This result effectively substantiates the ability of ensemble learning to augment the potency of weaker classifiers and make them robust. Moreover, Table 4 also presents the results of measurements of the area under the curve. Specifically, the average AUROC and AUPRC values were both around 0.93, underscoring the excellent performance of the ALFF-based BHT framework on the ADHD-200 dataset. Remarkably, this heightened level of accuracy was achieved through the utilization of a mere 25 selected features from 50 regional ALFFs. This outcome serves to reinforce the presumption that our approach facilitates the biomarker detection endeavors. Additionally, we provide the receiver operating characteristic (ROC) curves for the used datasets in Figure 4, which shows that the curve for the KKI site had the largest area among these datasets.

    Figure 4.  ROC curves for various data sites.

    To further validate the utilization of ensemble learning, we constructed a box-and-whisker plot to show the accuracy distribution, as shown in Figure 5. It is evident that the adoption of ensemble learning leads to an enhancement in the average accuracy. While this learning strategy may not entirely eliminate the fluctuations in accuracy owing to the inherent uncertainty of the AE network, can be ascertained from Table 4, there was a substantial reduction in the standard deviation of accuracy. Especially, the enhancement of accuracy are obviously disclosed on the PU dataset. These findings prove the effectiveness of ensemble learning as a tool to enhance the robustness of our ALFF-based BHT framework.

    Figure 5.  Accuracy distribution with and without ensemble learning, presented in the form of a box-and-whisker plot. Each dataset is tagged with its accuracy's mean and the standard variation (SD) value. The absence of ensemble learning is colored with black, whereas the application of ensemble learning is in blue.

    Table 5 shows a comparison of the classification accuracy of the three networks. The accuracy of the default-classifier network was only 42–76%, which may have been caused by the lack of information learned from the label based on the guidance of the cross-entropy loss; thus, the extracted high-level features had no inter-class discrimination. The classification accuracy of the default-decoder network was 1–4% lower that of proposed ALFF-based AE network, indicating that the lack of reconstruction loss may reduce the representation ability of high-level features.

    We tested our method against other state-of-the-art methods the results are presented in Table 6. The compared methods include ML-based methods such as R-Relief [42], L1BioSVM [43], and Fusion fMRI [44], as well as DL-based methods such as the 3D CNN [4], DeepfMRI [45], CDAE [46], DVAE [20], STAAE [21] and data augmentation [47]. In addition, our previous work under the BH framework is also included in this comparison. These methods are referred to as SP-BH [37] and SP-$ l_{2, 1} $-BH [36], which use subspace learning and $ l_{2, 1} $-norm subspace learning for ADHD classification, respectively.

    Table 6.  Accuracy comparison between our method and state-of-the-art methods.
    NYU(%) PU(%) KKI(%) NI(%) Average(%) Material Rawfeatures Selectedfeatures* Biomarkerdetection
    ML
    Fusion fMRI (2018) 52.7 - 86.7 72.9 70.8 FC 4005 - No
    L1BioSVM (2018) - 81.1 81.3 - 81.2 FC 6670 - No
    R-Relief (2019) 70.7 68.6 81.8 76.0 74.3 fALFF 31 $ \times $ 37 $ \times $ 31 - No
    DL
    3D CNN (2017) 70.5 63.0 - 72.8 68.8 MRI image 90 $ \times $ 117 $ \times $ 100 1024 No
    DeepfMRI (2020) 73.1 62.7 - 67.9 67.9 BOLD 90 32 No
    CDAE (2021) 73.2 70.6 81.7 79.0 76.1 MRI image 60 $ \times $ 72$ \times $ 60 5 $ \times $ 6 $ \times $ 5 No
    DVAE (2021) 62.4 67.0 78.1 68.8 69.1 BOLD 28,546 80 No
    STAAE (2022) 93.5 92.7 90.4 91.7 92.1 BOLD 28,546 100 Yes
    Data augmentation (2023) 75.6 76.5 76.0 - 76.0 FC 13,456 1856 No
    BHT framework
    SP-BH (2019) 96.2 95.8 86.7 91.6 92.6 FC 4005 100 No
    SP-$ l_{2, 1} $-BH (2020) 99.5 96.3 100 95.8 97.9 FC 4005 50 No
    AENet (2022) 99.8 99.6 99.8 99.3 99.6 FC 4005 50 Yes
    Our 96.2 88.8 95.1 93.3 93.3 ALFF 50 25 Yes
    * The number of feature selected in the ML method is not fixed. And the DL method selects the learned high-level features, while the BHT method selects typical features from the raw features.

     | Show Table
    DownLoad: CSV

    From Table 6, it is observed that the existing ML methods had the lowest accuracies, stemming from their limited exploration of potential features for classification. Meanwhile, most examined DL methods are confined to the traditional training-test framework, which causes the learned features from the training data to inadequately representing the features of the test data. Among these methods, the STAAE approach stands out because of its remarkable performance achievements and a commendable biomarker detection capability. Its success benefits from the integration of an attention module that captures temporal patterns from fMRI spatiotemporal features. Based on its accurate classification, STAAE extends its application by projecting these temporal patterns onto corresponding brain regions, effectively fulfilling the biomarker detection task. The listed methods within the BHT framework all exhibited impressive classification performance, facilitated by the utilization of test data information (without seeing its label). Among these methods, AENet leverages DL to effectively represent high-level features, making it superior to the SP-BH, SP-$ l_{2, 1} $-BH approaches. As an FC-based BHT method, AENet incorporates a great number of connections, consequently leading to a widespread selection of features across the entire brain. This, however, renders the identified biomarkers indiscriminate. In contrast, our ALFF-based method achieved an average accuracy of 93.3%, even though it was slightly inferior to that of AENet. A distinctive trait lies in the substantial reduction of both the used raw and selected features. This implies that the features used and selected by our algorithm carry more significance as potential ADHD biomarkers. Moreover, the selected features were tightly localized within the limbic system and cerebellum, rendering our ALFF-based method exceedingly interpretable.

    Table 7 presents the top 10 brain regions obtained with the highest feature scores $ S_{i} $; their locations are visualized in Figure 6. These identified regions are considered to be potential ADHD biomarkers. To substantiate our findings, we conducted a correlation analysis between the ALFF data from these regions and the symptom scores provided by the ADHD-200 dataset; the results are shown in Table 7. Since the NI site did not give symptom assessment data, we performed correlation analysis for the PU, KKI and NYU sites. Notably, only individuals with ADHD-simplex were included in these analyses to mitigate the influence of comorbid disorders. In Table 7, it is evident that the majority of detected biomarkers had a robust correlation with symptom scores from the PU dataset, demonstrating significant values above 0.2, under the 95% confidence interval. This finding underscores the potential of our identified biomarkers to capture ADHD-related characteristics. However, an intriguing observation arose from the results obtained for the other two datasets. In this case, the correlation between biomarkers and symptoms was notably weaker, especially for the NYU site, as reflected by the higher P values over 0.05. This discrepancy can be attributed to the utilization of distinct symptom assessment measures across these datasets. To elaborate, although three datasets were derive from symptom measures based on questionnaires administered to parents and teachers, the specific scales employed differ. The PU dataset employs the ADHD Rating Scale-Ⅳ (ADHD-RS), whereas the KKI and NYU datasets utilize Conners' Parent Rating Scale-Revised, Long version (CPRS-LV). Despite the shared origin in parental and teacher reports, we posit that ADHD-RS offers advantages in terms of its ability to abnormalities within the limbic system and cerebellum. This assertion underscores the potential for ADHD-RS to provide insights into these specific regions.

    Table 7.  Detected biomarkers and their symptom score correlations for the PU, KKI and NYU datasets.
    Brian region Ranking order PU (ADHD-RS) KKI (CPRS-LV) NYU (CPRS-LV)
    Name Abbrev Corr P-value Corr P-value Corr P-value
    Middle cingulate gyrus (R) DCG.R 1 0.235 0.007 0.358 0.003 -0.035 0.660
    Cerebellum Ⅸ (R) CRBL9.R 2 0.161 0.065 0.155 0.213 0.038 0.630
    Amygdala (R) AMYG.R 3 0.198 0.023 0.057 0.648 -0.005 0.953
    Thalamus (L) THA.L 4 0.245 0.005 0.150 0.229 0.044 0.572
    Cerebellum Ⅹ (R) CRBL10.R 5 0.244 0.005 0.241 0.051 0.021 0.787
    Cerebellum Crus Ⅱ (L) CRBL Crus2.L 6 0.245 0.005 0.259 0.036 0.128 0.102
    Cerebellum Crus Ⅱ (R) CRBL Crus2.R 7 0.214 0.014 0.236 0.057 0.123 0.115
    Thalamus (R) THA.R 8 0.270 0.002 0.184 0.139 0.043 0.580
    Hippocampus (L) HIP.L 9 0.274 0.001 0.273 0.027 0.021 0.786
    Caudate (R) CAU.L 10 0.235 0.007 0.215 0.083 0.034 0.667

     | Show Table
    DownLoad: CSV
    Figure 6.  Visualization results for the top 10 regions. Here, the larger the node diameter, the higher region feature score achieved.

    Figure 7 supplements our analysis and visualizes the results of correlation analysis for ADHD-RS scales, encompassing regions the left hippocampal gyrus, left and right thalamus, and left cerebellum Crus Ⅱ gyri, which are the four regions with the highest correlation coefficients. It further supports that these regions exhibit significant relevance to ADHD, thereby warranting consideration as potential biomarkers.

    Figure 7.  Relationship between ALFF value and symptom score for ADHD-RS on the PU dataset. ADHD and HC subjects are colored with red and green, respectively.

    We shall present some biological explanations for our biomarkers. Table 7 presents the findings, highlighting significant associations within regions such as the amygdala, caudate nucleus, hippocampus, and thalamus gyri. These regions are well-established in their role in shaping and comprehending human emotions, as extensively documented in the existing literature [48,49]. Notably, ADHD patients have consistently exhibited reduced amygdala and hippocampal volumes in anatomical control experiments. Moreover, developmental delays and degenerative changes within these regions have been unveiled through lifespan exploratory modeling [10]. Concurrently, the caudate nucleus has emerged as a consistent focal point, with volumetric differences consistently noted in ADHD research [50,51,52]. Particularly, a volume asymmetry analysis demonstrated noteworthy links between caudate asymmetry and cumulative severity ratings of inattentive behaviors in ADHD-afflicted children [24]. As we delve into the thalamus, its pivotal role in information transmission, regulation, and participation in cognitive and behavioral processes such as attention, emotion, and motor control cannot be understated. Morphological aberrations in the thalamus [23] have been discovered and strongly correlated with ADHD symptom scores [53], further substantiating our identified biomarkers within the limbic system.

    Furthermore, our biomarkers extend to the cerebellum, particularly in the right inferior posterior lobe of the cerebellar hemisphere, encompassing cerebellum 9, 10, and Crus Ⅱ. A longitudinal case-control study revealed that ADHD participants with worse clinical outcomes exhibited a gradual decrease in overall cerebellar volume, primarily caused by abnormalities in the posterior inferior cerebellar hemisphere [54]. Moreover, a smaller volume in the cerebellar lobule Ⅹ has been found in children [55,56]. Our findings align harmoniously with these reports. The lateral hemisphere of the cerebellum has emerged as a pivotal player in executive functions [57,58], spatial cognition, and language processing [59]. Consequently, the identified abnormalities within the posterior inferior cerebellar hemisphere may indeed contribute to the executive function deficits that constitute a prominent symptomatology of ADHD.

    Overall, the aforementioned evidence firmly supports the presence of our delineated ADHD biomarkers within the limbic system and cerebellum. Here, the right middle cingulate gyrus is also established as a biomarker, but there is also a consensus that it is part of the salience network. Furthermore, when it comes to the cognitive function and emotional processing of ADHD patients, the middle cingulate gyrus is considered to play an important role [60,61]. Some existing reports have confirmed that the right middle cingulate gyrus is abnormal in ADHD patients. For example, a sex difference study on the factors related to the symptoms of common mental disorders in adolescence found that more prominent symptoms of hyperactivity/inattention were associated with lower grey matter volume of the bilateral anterior and midcingulum among boys [62]. Another study showed that compared with normal children, the increased degree centrality values for the right middle cingulate gyrus indicated differences in functional network connectivity for ADHD children [63].

    Our research has two limitations. First of all, biomarker detection only involves the linear relationship between features and does not take into account the nonlinear components. Our attempt to validate the discrimination of these biomarkers through a two-sample t-test experiment did not yield statistically significant results. This indicates no inter-group differences in the ALFF features of a single brain region. However, the aggregate of brain region features still contributes to the construction of excellent high-level feature space, which is due to the powerful feature extraction ability of DL. Unfortunately, the contribution here is difficult to quantify, as it involves the amount of classification information held by the brain region features and the details of non-linear fitting performed by the neural network. This issue limits further exploration of biomarkers; thus, in the future, we will attempt to explore this limitation by using other brain region features such as the voxel-based morphometry and ReHo, as well as introduce the interpretability theory of neural networks. In addition, our algorithm achieved good classification performance on limited brain regions of the limbic system and cerebellum, indicating the importance of these brain regions for ADHD. Moreover, we conducted biomarker detection, which revealed abnormal brain regions that are consistent with previously reported anatomical abnormalities. However, it may be insufficient to explore brain regions related to the biological mechanisms of ADHD solely through ALFF data. In future studies, we will consider multi-modal features for to characterize brain regions.

    This study primarily combined existing statistical prior knowledge and pattern recognition methods to elucidate critical ADHD biomarkers. Specifically, the ALFF in the limbic system and cerebellum, which is highly related to ADHD in statistical analysis, was used as input for classification, contributing to reduce the range of biomarker detection. The BHT framework and ensemble learning methods were employed to ensure high accuracy in the classification results. Consequently, we performed a highly credible biomarker detection task that achieved an average accuracy of 93% on the ADHD-200 datasets. Several brain regions such as the thalamus, hippocampus, amygdala, and cerebellum Ⅸ were extracted from the results of the SVM-RFE algorithm as biomarkers. We validated them by analyzing the correlation among symptom scores. Moreover, these findings extend previous findings and align with the existing reports on the neurobiological contributions to ADHD, which demonstrates the effectiveness of our method.

    The authors declare they have not used Artificial Intelligence (AI) tools in the creation of this article.

    Ying Chen: Conceptualization, Methodology, Writing–original draft; Lele Wang: Conceptualization, Software, Writing–original draft; Zhixin Li: Validation, Writing–review & editing; Yibin Tang: Software, Methodology, Writing–original draft; Zhan Huan: Supervision.

    This work was supported in part by the National Natural Science Foundation of China under Grants: 62201093.

    The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

    [1] Vanhooren V,Navarrete Santos A,Voutetakis K, et al. (2015) Protein modification and maintenance systems as biomarkers of ageing. Mech Ageing Dev 151: 71–84. doi: 10.1016/j.mad.2015.03.009
    [2] Uribarri J, Woodruff S, Goodman S, et al. (2010) Advanced glycation end products in foods and a practical guide to their reduction in the diet. J Am Diet Assoc 110: 911–916.e12 doi: 10.1016/j.jada.2010.03.018
    [3] Visentin S,Medana C,Barge A, et al. (2010) Microwave-assisted Maillard reactions for the preparation of advanced glycation end products (AGEs). Org Biomol Chem 8: 2473–2477. doi: 10.1039/c000789g
    [4] Horvat S,Jakas A (2014) Peptide and amino acid glycation: new insights into the Maillard reaction. J Pept Sci 10: 119–137.
    [5] Zhang Q,Ames JM,Smith RD, et al. (2009) A perspective on the Maillard reaction and the analysis of protein glycation by mass spectrometry: probing the pathogenesis of chronic disease. J Proteome Res 8: 754–769. doi: 10.1021/pr800858h
    [6] Dar B, Dar M, Bashir S, et al. (2015) Glycosylated hemoglobin (HbA1c): A biomarker of anti-aging. Int J Biol Med Res 6: 5084–5086.
    [7] Sebeková K,Somoza V (2007) Dietary advanced glycation endproducts (AGEs) and their health effects--PRO. Mol Nutr Food Res 51: 1079–1084. doi: 10.1002/mnfr.200700035
    [8] Arasteh A,Farahi S,Habibi-Rezaei M, et al. (2014) Glycated albumin: an overview of the in vitro models of an in vivo potential disease marker. J Diabetes Metab Disord 13: 49.
    [9] Uribarri J,del Castillo MD,de la Maza MP, et al. (2015) Dietary advanced glycation end products and their role in health and disease. Adv Nutr 6: 461–473. doi: 10.3945/an.115.008433
    [10] Takahashi M (2014) Glycation of Proteins. In Glycoscience: Biology and Medicine, Endo T, Seeberger PH, Hart GW, Wong CH, Taniguchi N, eds., Springer Japan, pp. 1339–1345
    [11] Nursten HE (2005) The Maillard Reaction: Chemistry, Biochemistry, and Implications. RSC.
    [12] Laroque D, Inisan C, Berger C, et al. (2008) Kinetic study on the Maillard reaction: Consideration of sugar reactivity. Food Chem 111: 1032–1042 doi: 10.1016/j.foodchem.2008.05.033
    [13] Sattarahmady N,Moosavi-Movahedi AA,Habibi-Rezaei M, et al. (2008) Detergency effects of nanofibrillar amyloid formation on glycation of human serum albumin. Carbohydr Res 343: 2229–2234.
    [14] Monnier VM (1990) Nonenzymatic glycosylation, the Maillard reaction and the aging process. J Gerontol 45: B105–111. doi: 10.1093/geronj/45.4.B105
    [15] Wei Y,Han CS,Zhou J, et al. (2012) D-ribose in glycation and protein aggregation. Biochim Biophys Acta 1820: 488–494. doi: 10.1016/j.bbagen.2012.01.005
    [16] Monnier VM, Cerami A (1981) Nonenzymatic browning in vivo: possible process for aging of long-lived proteins. Science 211: 491–493. doi: 10.1126/science.6779377
    [17] Han C,Lu Y,Wei Y, et al. (2011) D-ribose induces cellular protein glycation and impairs mouse spatial cognition. PLoS ONE 6:e24623. doi: 10.1371/journal.pone.0024623
    [18] Syrový I (1994) Glycation of albumin: reaction with glucose, fructose, galactose, ribose or glyceraldehyde measured using four methods. J Biochem Biophys Methods 28: 115–121. doi: 10.1016/0165-022X(94)90025-6
    [19] Kong FL,Cheng W,Chen J, et al. (2011) D-Ribose glycates β(2)-microglobulin to form aggregates with high cytotoxicity through a ROS-mediated pathway. Chem Biol Interact 194: 69–78. doi: 10.1016/j.cbi.2011.08.003
    [20] Khan MS,Dwivedi S,Priyadarshini M, et al. (2013) Ribosylation of bovine serum albumin induces ROS accumulation and cell death in cancer line (MCF-7). Eur Biophys J 42: 811–818.
    [21] Iannuzzi C,Maritato R,Irace G, et al. (2013) Glycation accelerates fibrillization of the amyloidogenic W7FW14F apomyoglobin. PLoS ONE 8: e80768. doi: 10.1371/journal.pone.0080768
    [22] Adrover M,Mariño L,Sanchis P, et al. (2014) Mechanistic insights in glycation-induced protein aggregation. Biomacromolecules 15: 3449–3462. doi: 10.1021/bm501077j
    [23] Liu J, Ru Q, Ding Y (2012) Glycation a promising method for food protein modification: Physicochemical properties and structure, a review. Food Res Int 49: 170–183.
    [24] Wei Y, Chen L, Chen J, et al. (2009) Rapid glycation with D-ribose induces globular amyloid-like aggregations of BSA with high cytotoxicity to SH-SY5Y cells. BMC Cell Biol 10: 10.
    [25] Kragh-Hansen U,Chuang VT,Otagiri M (2002) Practical aspects of the ligand-binding and enzymatic properties of human serum albumin. Biol Pharm Bull 25: 695–704.
    [26] Santra MK,Banerjee A,Krishnakumar SS, et al. (2004) Multiple-probe analysis of folding and unfolding pathways of human serum albumin. Evidence for a framework mechanism of folding. Eur J Biochem 271: 1789–1797.
    [27] Anguizola J,Matsuda R,Barnaby OS, et al. (2013) Review: Glycation of human serum albumin. Clin Chim Acta 2013; 425: 64–76.
    [28] Singha Roy A,Ghosh P,Dasgupta S (2015) Glycation of human serum albumin alters its binding efficacy towards the dietary polyphenols: A comparative approach. J Biomol Struct Dyn Oct 7:1–46. [In press].
    [29] Peters T (1996) All about Albumin. Biochemistry, Genetics, and Medical Applications. Academic Press, San Diego, CA.
    [30] Khan MW,Rasheed Z,Khan WA, et al. (2007) Biochemical, biophysical, and thermodynamic analysis of in vitro glycated human serum albumin. Biochemistry (Mosc) 72: 146–152. doi: 10.1134/S0006297907020034
    [31] Yang F,Zhang Y,Liang H (2014) Interactive association of drugs binding to human serum albumin. Int J Mol Sci 15: 3580–3595.
    [32] Lin SY,Wei YS,Li MJ,et al (2004). Effect of ethanol or/and captopril on the secondary structure of human serum albumin before and after protein binding. Eur J Pharm Biopharm 57: 457–464. doi: 10.1016/j.ejpb.2004.02.005
    [33] Lin SY, Wei YS, Li MJ (2004) Ethanol or/and captopril-induced precipitation and secondary conformational changes of human serum albumin. Spectrochim Acta A 60: 3107–3111. doi: 10.1016/j.saa.2004.03.001
    [34] Li MJ,Lin SY (2005) Vibrational spectroscopic studies on the disulfide formation and secondary conformational changes of captopril-HSA mixture after UV-B irradiation. Photochem Photobiol 81: 1404–1410. doi: 10.1562/2005-04-25-RN-497
    [35] Sadowska-Bartosz I,Galiniak S,Bartosz G (2014) Kinetics of glycoxidation of bovine serum albumin by glucose, fructose and ribose and its prevention by food components. Molecules 19: 18828–18849. doi: 10.3390/molecules191118828
    [36] Kosaraju SL,Weerakkody R,Augustin MA (2010) Chitosan-glucose conjugates: influence of extent of Maillard reaction on antioxidant properties. J Agric Food Chem 58: 12449–12455. doi: 10.1021/jf103484z
    [37] Ajandouz EH, Tchiakpe LS, Ore FD, et al. (2001) Effects of pH on caramelization and Maillard reaction kinetics in fructose-lysine model systems. J Food Sci 66: 926–931.
    [38] Monacelli F, Storace D, D’Arrigo C, et al. (2013)Structural alterations of human serum albumin caused by glycative and oxidative stressors revealed by circular dichroism analysis. Int J Mol Sci 14: 10694–10709.
    [39] Lee TH,Cheng WT,Lin SY (2010) Thermal stability and conformational structure of salmon calcitonin in the solid and liquid states. Biopolymers 93: 200–207. doi: 10.1002/bip.21323
    [40] Ledesma-Osuna AI, Ramos-Clamont G, Vazquez-Moreno L (2008) Characterization of bovine serum albumin glycated with glucose, galactose and lactose. Acta Biochim Pol 55: 491–497.
    [41] Sompong W,Meeprom A,Cheng H, et al. (2013) A comparative study of ferulic acid on different monosaccharide-mediated protein glycation and oxidative damage in bovine serum albumin. Molecules 18: 13886–13903. doi: 10.3390/molecules181113886
    [42] Wu CH, Huang SM, Lin JA, et al. (2011) Inhibition of advanced glycation endproduct formation by foodstuffs. Food Funct 2: 224–234.
    [43] Kato Y, Matsuda T, Kato N, et al. (1989) Maillard reaction of disaccharides with protein: suppressive effect of nonreducing end pyranoside groups on browning and protein polymerization. J Agric Food Chem 37: 1077–1081. doi: 10.1021/jf00088a057
    [44] Suárez G,Rajaram R,Oronsky AL, et al.(1989). Nonenzymatic glycation of bovine serum albumin by fructose (fructation). Comparison with the Maillard reaction initiated by glucose. J Biol Chem 264: 3674–3679.
    [45] McPherson JD,Shilton BH,Walton DJ (1988) Role of fructose in glycation and cross-linking of proteins. Biochemistry 27: 1901–1907.
    [46] Siddiqui AA,Sohail A,Bhat SA, et al. (2015). Non-enzymatic glycation of almond cystatin leads to conformational changes and altered activity. Protein Pept Lett 22: 449–459.
    [47] Awasthi S,Murugan NA,Saraswathi NT (2015) Advanced glycation end products modulate structure and drug binding properties of albumin. Mol Pharmaceutics 12: 3312–3322. doi: 10.1021/acs.molpharmaceut.5b00318
    [48] Bouma B,Kroon-Batenburg LM,Wu YP, et al. (2003) Glycation induces formation of amyloid cross-beta structure in albumin. J Biol Chem 278: 41810–41819. doi: 10.1074/jbc.M303925200
    [49] Khajehpour M,Dashnau JL,Vanderkooi JM (2006) Infrared spectroscopy used to evaluate glycosylation of proteins. Anal Biochem 348: 40–48. doi: 10.1016/j.ab.2005.10.009
    [50] GhoshMoulick R,Bhattacharya J,Roy S, et al. (2007). Compensatory secondary structure alterations in protein glycation. Biochim Biophys Acta 1774: 233–242. doi: 10.1016/j.bbapap.2006.11.018
    [51] Yang H,Yang S,Kong J, et al. (2015). Obtaining information about protein secondary structures in aqueous solution using Fourier transform IR spectroscopy. Nat Protoc 10: 382–396. doi: 10.1038/nprot.2015.024
    [52] Roy R,Boskey A,Bonassar LJ (2010) Processing of type I collagen gels using nonenzymatic glycation. J Biomed Mater Res A 93: 843–851.
    [53] Haris PI (2013) Probing protein-protein interaction in biomembranes using Fourier transform infrared spectroscopy. Biochim Biophys Acta 1828: 2265–2271.
    [54] Neault JF, Tajmir-Riahi HA (1998) Interaction of cisplatin with human serum albumin. Drug binding mode and protein secondary structure, Biochim. Biophys Acta 1384: 153–159.
    [55] Bramanti E, Benedetti E (1996) Determination of the secondary structure of isomeric forms of human serum albumin by a particular frequency deconvolution procedure applied to Fourier transform IR analysis. Biopolymers 38: 639–653.
    [56] Zsila F (2013) Subdomain IB is the third major drug binding region of human serum albumin: toward the three-sites model. Mol Pharmaceutics 10: 1668–1682. doi: 10.1021/mp400027q
    [57] Awasthi S,Murugan NA,Saraswathi NT (2015) Advanced glycation end products modulate structure and drug binding properties of albumin. Mol Pharmaceutics 12: 3312–3322.
    [58] Khan TA, Saleemuddin M, Naeem A (2011) Partially folded glycated state of human serum albumin tends to aggregate. Int J Pept Res Ther 17: 271–279. doi: 10.1007/s10989-011-9267-7
    [59] Oliveira LM,Lages A,Gomes RA, et al. (2011) Insulin glycation by methylglyoxal results in native-like aggregation and inhibition of fibril formation. BMC Biochem 5; 12:41. doi: 10.1186/1471-2091-12-41
    [60] Lin SY,Chu HL,Wei YS (2002) Pressure-induced transformation of alpha-helix to beta-sheet in the secondary structures of amyloid beta (1–40) peptide exacerbated by temperature. J Biomol Struct Dyn 19: 619–625.
    [61] Ding F,Borreguero JM,Buldyrey SV, et al. (2003) Mechanism for the alpha-helix to beta-hairpin transition. Proteins 53: 220–228. doi: 10.1002/prot.10468
    [62] Garip S,Yapici E,Ozek NS, et al. (2010) Evaluation and discrimination of simvastatin-induced structural alterations in proteins of different rat tissues by FTIR spectroscopy and neural network analysis. Analyst 135: 3233–3241. doi: 10.1039/c0an00540a
    [63] Yano K,Ohoshima S,Shimizu Y, et al. (1996) Evaluation of glycogen level in human lung carcinoma tissues by an infrared spectroscopic method. Cancer Lett 110: 29–34.
    [64] Podshyvalov A,Sahu RK,Mark S, et al. (2005) Distinction of cervical cancer biopsies by use of infrared microspectroscopy and probabilistic neural networks. Appl Opt 44: 3725–3734. doi: 10.1364/AO.44.003725
    [65] Colagar AH,Chaichi MJ,Khadjvand T (2011) Fourier transform infrared microspectroscopy as a diagnostic tool for distinguishing between normal and malignant human gastric tissue. J Biosci 36: 669–677.
    [66] Nagai R, Shirakawa J, Fujiwara Y, et al. (2014) Detection of AGEs as markers for carbohydrate metabolism and protein denaturation. J Clin Biochem Nutr 55: 1–6. doi: 10.3164/jcbn.13-112
    [67] Rondeau P,Bourdon E (2011) The glycation of albumin: structural and functional impacts. Biochimie 93: 645–658.
    [68] Basta G,Schmidt AM,De Caterina R (2004) Advanced glycation end products and vascular inflammation: implications for accelerated atherosclerosis in diabetes. Cardiovasc Res 63: 582–592. doi: 10.1016/j.cardiores.2004.05.001
    [69] Shivu B,Seshadri S,Li J, et al. (2013) Distinct β-sheet structure in protein aggregates determined by ATR-FTIR spectroscopy. Biochemistry 52: 5176–5183. doi: 10.1021/bi400625v
    [70] Natalello A,Doglia SM (2015) Insoluble protein assemblies characterized by fourier transform infrared spectroscopy. Methods Mol Biol 1258: 347–369. doi: 10.1007/978-1-4939-2205-5_20
    [71] Clark AH,Saunderson DH,Suggett A (1981) Infrared and laser-Raman spectroscopic studies of thermally-induced globular protein gels. Int J Pept Protein Res 17: 353–364.
    [72] Ruggeri FS,Longo G,Faggiano S, et al. (2015) Infrared nanospectroscopy characterization of oligomeric and fibrillar aggregates during amyloid formation. Nat Commun 6: 7831.
    [73] Miller LM,Bourassa MW,Smith RJ (2013) FTIR spectroscopic imaging of protein aggregation in living cells. Biochim Biophys Acta 1828: 2339–2346.
  • Reader Comments
  • © 2016 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)
通讯作者: 陈斌, bchen63@163.com
  • 1. 

    沈阳化工大学材料科学与工程学院 沈阳 110142

  1. 本站搜索
  2. 百度学术搜索
  3. 万方数据库搜索
  4. CNKI搜索

Metrics

Article views(9158) PDF downloads(1678) Cited by(19)

Figures and Tables

Figures(7)

/

DownLoad:  Full-Size Img  PowerPoint
Return
Return

Catalog