Esophageal squamous cell carcinoma (ESCC) is a malignant tumor of the digestive system in the esophageal squamous epithelium. Many studies have linked esophageal cancer (EC) to the imbalance of oral microecology. In this work, different machine learning (ML) models including Random Forest (RF), Gaussian mixture model (GMM), K-nearest neighbor (KNN), logistic regression (LR), support vector machine (SVM) and extreme gradient boosting (XGBoost) based on Genetic Algorithm (GA) optimization was developed to predict the relationship between salivary flora and ESCC by combining the relative abundance data of Bacteroides, Firmicutes, Proteobacteria, Fusobacteria and Actinobacteria in the saliva of patients with ESCC and healthy control. The results showed that the XGBoost model without parameter optimization performed best on the entire dataset for ESCC diagnosis by cross-validation (Accuracy = 73.50%). Accuracy and the other evaluation indicators, including Precision, Recall, F1-score and the area under curve (AUC) of the receiver operating characteristic (ROC), revealed XGBoost optimized by the GA (GA-XGBoost) achieved the best outcome on the testing set (Accuracy = 89.88%, Precision = 89.43%, Recall = 90.75%, F1-score = 90.09%, AUC = 0.97). The predictive ability of GA-XGBoost was validated in phylum-level salivary microbiota data from ESCC patients and controls in an external cohort. The results obtained in this validation (Accuracy = 70.60%, Precision = 46.00%, Recall = 90.55%, F1-score = 61.01%) illustrate the reliability of the predictive performance of the model. The feature importance rankings obtained by XGBoost indicate that Bacteroides and Actinobacteria are the two most important factors in predicting ESCC. Based on these results, GA-XGBoost can predict and diagnose ESCC according to the relative abundance of salivary flora, providing an effective tool for the non-invasive prediction of esophageal malignancies.
Citation: Yunxiang Meng, Qihong Duan, Kai Jiao, Jiang Xue. A screened predictive model for esophageal squamous cell carcinoma based on salivary flora data[J]. Mathematical Biosciences and Engineering, 2023, 20(10): 18368-18385. doi: 10.3934/mbe.2023816
[1] | Feng Li, Mingfeng Jiang, Hongzeng Xu, Yi Chen, Feng Chen, Wei Nie, Li Wang . Data governance and Gensini score automatic calculation for coronary angiography with deep-learning-based natural language extraction. Mathematical Biosciences and Engineering, 2024, 21(3): 4085-4103. doi: 10.3934/mbe.2024180 |
[2] | Kah Phooi Seng, Fenglu Ge, Li-minn Ang . Mathematical modeling and mining real-world Big education datasets with application to curriculum mapping. Mathematical Biosciences and Engineering, 2021, 18(4): 4450-4460. doi: 10.3934/mbe.2021225 |
[3] | Liangjing Shao, Benshuang Chen, Ziqun Zhang, Zhen Zhang, Xinrong Chen . Artificial intelligence generated content (AIGC) in medicine: A narrative review. Mathematical Biosciences and Engineering, 2024, 21(1): 1672-1711. doi: 10.3934/mbe.2024073 |
[4] | Tinghuai Ma, Hongmei Wang, Yuwei Zhao, Yuan Tian, Najla Al-Nabhan . Topic-based automatic summarization algorithm for Chinese short text. Mathematical Biosciences and Engineering, 2020, 17(4): 3582-3600. doi: 10.3934/mbe.2020202 |
[5] | Yunyun Sun, Peng Li, Zhaohui Jiang, Sujun Hu . Feature fusion and clustering for key frame extraction. Mathematical Biosciences and Engineering, 2021, 18(6): 9294-9311. doi: 10.3934/mbe.2021457 |
[6] | Amsa Shabbir, Aqsa Rasheed, Huma Shehraz, Aliya Saleem, Bushra Zafar, Muhammad Sajid, Nouman Ali, Saadat Hanif Dar, Tehmina Shehryar . Detection of glaucoma using retinal fundus images: A comprehensive review. Mathematical Biosciences and Engineering, 2021, 18(3): 2033-2076. doi: 10.3934/mbe.2021106 |
[7] | Xiaoxuan Pei, Kewen Li, Yongming Li . A survey of adaptive optimal control theory. Mathematical Biosciences and Engineering, 2022, 19(12): 12058-12072. doi: 10.3934/mbe.2022561 |
[8] | Keyue Yan, Tengyue Li, João Alexandre Lobo Marques, Juntao Gao, Simon James Fong . A review on multimodal machine learning in medical diagnostics. Mathematical Biosciences and Engineering, 2023, 20(5): 8708-8726. doi: 10.3934/mbe.2023382 |
[9] | Jose Guadalupe Beltran-Hernandez, Jose Ruiz-Pinales, Pedro Lopez-Rodriguez, Jose Luis Lopez-Ramirez, Juan Gabriel Avina-Cervantes . Multi-Stroke handwriting character recognition based on sEMG using convolutional-recurrent neural networks. Mathematical Biosciences and Engineering, 2020, 17(5): 5432-5448. doi: 10.3934/mbe.2020293 |
[10] | Yuanyao Lu, Kexin Li . Research on lip recognition algorithm based on MobileNet + attention-GRU. Mathematical Biosciences and Engineering, 2022, 19(12): 13526-13540. doi: 10.3934/mbe.2022631 |
Esophageal squamous cell carcinoma (ESCC) is a malignant tumor of the digestive system in the esophageal squamous epithelium. Many studies have linked esophageal cancer (EC) to the imbalance of oral microecology. In this work, different machine learning (ML) models including Random Forest (RF), Gaussian mixture model (GMM), K-nearest neighbor (KNN), logistic regression (LR), support vector machine (SVM) and extreme gradient boosting (XGBoost) based on Genetic Algorithm (GA) optimization was developed to predict the relationship between salivary flora and ESCC by combining the relative abundance data of Bacteroides, Firmicutes, Proteobacteria, Fusobacteria and Actinobacteria in the saliva of patients with ESCC and healthy control. The results showed that the XGBoost model without parameter optimization performed best on the entire dataset for ESCC diagnosis by cross-validation (Accuracy = 73.50%). Accuracy and the other evaluation indicators, including Precision, Recall, F1-score and the area under curve (AUC) of the receiver operating characteristic (ROC), revealed XGBoost optimized by the GA (GA-XGBoost) achieved the best outcome on the testing set (Accuracy = 89.88%, Precision = 89.43%, Recall = 90.75%, F1-score = 90.09%, AUC = 0.97). The predictive ability of GA-XGBoost was validated in phylum-level salivary microbiota data from ESCC patients and controls in an external cohort. The results obtained in this validation (Accuracy = 70.60%, Precision = 46.00%, Recall = 90.55%, F1-score = 61.01%) illustrate the reliability of the predictive performance of the model. The feature importance rankings obtained by XGBoost indicate that Bacteroides and Actinobacteria are the two most important factors in predicting ESCC. Based on these results, GA-XGBoost can predict and diagnose ESCC according to the relative abundance of salivary flora, providing an effective tool for the non-invasive prediction of esophageal malignancies.
Consider the mixing of two populations of hosts epidemiologically different with respect to the infection and transmission of a pathogen. What would be the outbreak outcome (e.g., in terms of attack rate) for each host population as a result of mixing in comparison to the situation with zero mixing? To address this question one would need to define what is meant by epidemiologically different and how mixing takes place.
To proceed, let's consider situations where mixing of epidemiologically different populations of hosts occurs. Such situations involve generalist (as opposed to specialist) pathogens capable of infecting multiple hosts and of being transmitted by multiple hosts [33]. Many of such pathogens cause zoonoses such as influenza, sleeping sickness, rabies, Lyme or West Nile, to cite a few [33]. In this paper, we focus on a specific example of a multi-host pathogen, the highly pathogenic avian influenza virus (HPAI) H5N1 -a virus considered as a potential pandemic threat by the scientific community.
The avian influenza virus can infect many hosts: wildfowl and domestic bird species, with occasional spill-over to mammals (including humans); the severity degree of the disease being species dependent: highly lethal (swans, chicken), few deaths (Common Pochards, humans), and asymptomatic (Mallards). Following the re-emergence of the highly pathogenic strain of H5N1 in China 2005 [6,7,28], a series of outbreaks spread throughout Western Europe, including France in 2006 [13,16,20]. The ensuing epizootics showed a need for adapted surveillance programs and a better understanding of the epidemiology of HPAI H5N1 [18]. In this context, this study is part of the French national project for assessing the risk of exposure of domestic birds and poultry farms to avian influenza viruses following introduction by wild birds; although human activities and commercial exchanges are also main sources for introduction of avian influenza [15,17,27,30].
The motivation for this study stems from the 2006 HPAI H5N1 outbreak that took place in France, in the Dombes wetlands. The area is one of the two main routes used by birds migrating across France, and an important stopover, breeding and wintering site for many wild waterfowl species. The outbreak was of minor size and affected mainly wild Anatidae bird species [13,16,20]: Common Pochards (Aythya ferina) and Mute Swans (Cygnus olor). Although the environmental conditions were conducive to the spread of the virus in the Dombes' ecosystem [31,34], it was suggested that the heterogeneity in the response to H5N1 viral infection of different bird species was a possible explanation for the reduced size of the outbreak [13]. Some studies have shown that averaging together different groups of a population, can only lead to a decrease (or no change) observed in the global reproduction number, compared to when no group structure of the population is considered [1]. Ref. [2] pointed out that the variance in the mixing rate between populations can have a substantial effect one the outbreak outcome. Other studies show that for multi-host pathogens, increasing host or species diversity may lead to either reduction or enhancement of the disease risk [12,24]. Therefore, addressing the question posed in the beginning of this section would provide insights and allow advances in the understanding of how avian influenza may spread in such ecosystems.
Our aim in this paper is to use a SIR compartmental model to investigate the effect of host heterogeneity on the disease outbreak in a multi-host population system. More precisely, we study how the outbreak outcome for each constituent population of hosts is affected in a multi-host population system with mixing in comparison with the single-host situation where individual populations are not mixed. The remainder of the paper is as follows. First, the key parameters and response functions characterizing the outbreak outcome are defined and determined for a single-host system in Section 2, and next the defined parameters are used to define the epidemiological heterogeneity in Section 3. Second, Section 4 is devoted to studying how the outbreak outcome in a multi-host population system is changed, due to mixing of epidemiologically heterogeneous hosts, compared to the outbreak outcomes in a single-host situation. Finally, the paper ends with the application of the results in the context of the Dombes area and concluding remarks in Section 5.
In this section we define the key characteristic parameters of the interacting population-pathogen system and the response function characterizing the outbreak outcome for such a system. To this end, consider a single species or single-host system in which the dynamics of an infection induced by a pathogen can be described within the framework of the compartmental susceptible-infected-recovered (SIR) model ([25]) in which susceptible individuals,
At any time
{dSdt=−λS[2ex]dIdt=λS−αI[2ex]dRdt=xαI[2ex] | (1) |
where
In writing Eq.(1) we have used the homogeneously mixing hypothesis and considered that the transmission of infection is frequency-dependent (i.e. the force infection is proportional to the inverse of the population size) like for the true mass-action kinetics [8]. For
The above SIR model is characterized by two (non independent) quantities: the generation time
1The derivation in Ref. [3] goes as follow. Consider a single infected individual applying a constant force of infection
R0=βN0β+αN0, | (2) |
where
To define a response function characterizing the outbreak outcome of the SIR model, we consider the following two indicators:
• the reduced persistence or extinction time,
• the attack rate,
To investigate
Bearing the distributions of
When
On the other hand, consider the probability
Thus, it follows from what precedes, that the mean attack rate
A=F(R0,g,x);R0=F−1[A(g,x)], | (3) |
where
Within the epidemiological framework as described in the Section 2, a host population interacting with a pathogen can be canonically characterized by two key parameters (or two dimensions): the basic reproduction number,
Hh=n∑i=1fih2i(n∑i=1fihi)2−1;hi=R0,g. | (4) |
It follows that a population of
For a single-host population,
Hh=y(z−1zy+1)2withy=f2f1andz=h2h1 | (5) |
where
Note that different demographic fractions
Now, we consider a heterogeneous system (in the sense of Section 3) constituted of
To proceed, consider
{dSidt=−λiSidIidt=λiSi−αiIidRidt=xiαiIi | (6) |
where
Assuming a hypothesis of homogeneous mixing of individuals for both within populations of hosts of the same kind (intra) and between host populations of different kind (inter), the elements of the matrix of contact probabilities can be written as,
{pii(t)=1Ni(t)[1−n∑j=1;j≠iϕijNj(t)Mi(t)]pij(t)=ϕijMi(t);Mi(t)=n∑j=1[1−δϕij,0]Nj(t) | (7) |
where
For the transmission of avian influenza viruses of interest here, we assume that infectious individuals of any kind are efficient sources of virus excretion such that the transmission of the infection to uninfected individuals only depends on the infection susceptibility of the receiver. That is to say that the infection transmission rate
λi(t)=[fiN0R0,ifiN0−R0,i]αi∑j=1pij(t)Ij(t)withR0,i=βifiN0βi+αifiN0, | (8) |
where
To go further and for the sake of simplicity, we specialize to the case of
For the mixing between
General considerations on the outbreak outcome can be drawn from the
{K1,1=(R0,1f1N0f1N0−R0,1)[1−ϕf2];K1,2=(R0,1f1N0f1N0−R0,1)(α1α2)ϕf1K2,1=(R0,2f2N0f2N0−R0,2)(α2α1)ϕf2;K2,2=(R0,2f2N0f2N0−R0,2)(1−ϕf1) | (9) |
In this approach,
R0=12[K2,2+K1,1+√(K2,2−K1,1)2+4(K2,1K1,2)]. | (10) |
Because of the term
• For a fixed nonzero heterogeneity
Rm=(f1N0f1N0−R0,1)R0,1f1+(f2N0f2N0−R0,2)R0,2f2. | (11) |
The decreasing of
• For a fixed nonzero mixing
- for any fixed ratio of reproductive numbers
- for fixed demography
The
To investigate the effects of mixing on individual outbreak outcomes at the level of each subsystem, we have run SIR stochastic simulations in a two-host system (see Appendix A) with a total population of size,
Figure 8 illustrates the cumulative distribution (cdf) of the attack rates for each host in the system and for the whole system. The cdf of the whole system is broad and close to that of the most abundant population host
Because of mixing, the mean attack rate
ηi=F−1i(Ai)F−1i(A0,i)=Reqv,iR0,i, | (12) |
where we have used the relation in Eq.(3) (see Section 2) to define the equivalent basic reproduction number as,
Several combinations of
heterogeneity | outbreak response | |
host 1 | host 2 | |
dilution | dilution | |
amplification | dilution | |
no effect | no effect | |
dilution | dilution | |
no effect | amplification |
• three kinds of behaviors for each host population are possible depending on the mixing and heterogeneity parameters: dilution, no effect or amplification behaviors. As shown in Table 1, the interaction between two heterogenous hosts, with at least a
• the extent to which a subsystem undergoes dilution or amplification is a function of demographic and mixing parameters with a possible transition from dilution via no effect to the amplification behaviors (and vice versa), when varying the individual
• as the proportion of recovered
Figures 9 and 10 illustrate some of the situations presented in Table 1. Figure 9 shows the coexistence of two-phase behaviors (dilution effect for a subpopulation and amplification effect for the other one), where the
The aims of this work were to define the epidemiological host heterogeneity and investigate the effect of host heterogeneity on the disease outbreak outcomes for each host in a multi-host population system, given prior knowledge of the disease epidemiology for each host population in the zero mixing situation. In other words, what is the impact of a multi-host system on the outbreak response of individual host populations involved?
We have shown that a single-host system can be canonically parametrized using two quantities, the basic reproductive number
• Heterogeneity index
• Interaction matrix: which takes into account both epidemic and demographic characteristics to structure how different hosts interact with each other. By interactions we mean that hosts have an epidemic and a demographic role in the transmission and spreading of the infection. For the two-host case presented in this analysis, the control parameter for the interaction matrix reduces to a single assortative mixing index
As minimal definition and necessary conditions, we state that the epidemiological host heterogeneity occurs in a system of epidemiologically interacting populations where each host population is characterized by a different epidemic response function. There is no host heterogeneity in the absence of interactions between populations or when interacting populations have all identical epidemic response functions.
Regarding the impacts of host heterogeneity on the outbreak outcomes, we found that they are twofold in the case of the infection transmission depending on the receiver infection susceptibility: i) -outbreak dampening, i.e., the outbreak in the heterogeneous multi-host system is always smaller than the summation of outbreaks for individual subsystems taken separately, and ii) -as summarized in Table 1, three kinds of outbreak outcomes are possible for the individual subsystem depending on the mixing and heterogeneity parameters: dilution, no effect or amplification behaviors where the outbreak responses in the multi-host system are lower, similar or higher than in the single host system, respectively, with the magnitude depending both on
Previous works, [14], have shown that, in the case of preferential mixing, like in this study (though with a different mixing pattern), the disease can invade the population when any subgroup is self-sufficient for the disease transmission (i.e.,
The previous works were largely focused on the impacts that heterogeneity may have on the global
The situation of the HPAI H5N1 outbreak in mid-February 2006 in the Dombes, France, can be analyzed within the framework of the afore outlined approach. As mentioned in the Introduction section, although the environmental conditions were conducive to the spread of the virus in the Dombes' ecosystem [31,34], the outbreak was of minor size, mainly affecting Common Pochards (Aythya ferina) and Mute Swans (Cygnus olor) [13,16,20]. It was suggested that the host heterogeneity in the response to H5N1 viral infection of different bird species was a possible explanation for the reduced size of the outbreak [13].
During the outbreak period, the situation in the Dombes was that Swans, Common Pochards and Mallards were found well mixed with a census of
To conclude, we have depicted a framework for defining the epidemiological host heterogeneity and assessing its impacts on outbreak outcomes in terms of epidemic response functions for host populations in interaction. The approach was illustrated for the case of frequency-dependent direct transmission where the infection transmission depends on the receiver infection susceptibility, (i.e.,
Stochastic simulations for the SIR model were generated using the stochastic discrete time version of the system of equations in Eq.(6), in which
{(Si,Ii,Ri)→(Si−1,Ii+1,Ri)at rate λi(t)Si[2ex](Si,Ii,Ri)→(Si,Ii−1,Ri+1)at rate αiIi with probability xi[2ex](Si,Ii,Ri)→(Si,Ii−1,Ri)at rate αiIi with probability 1−xi | (13) |
describing the transition from susceptible to infected following a Poisson process of parameter
• Single-host system: The subscript
λ(t)=pβI=[N0R0N0−R0]α×I(t)N(t), | (14) |
where
• Two-hosts system:
λi(t)=[fiN0R0,ifiN0−R0,i]αi∑j=1pij(t)Ij(t), | (15) |
where
When all infected individuals recover from infection, i.e.,
A=1−exp{−(R0N0−R0)[I(0)+AS(0)]}. | (16) |
For
A=(I(0)N0)×u+1×(1−u), | (17) |
where
u=tanh(c×e−bR0) | (18) |
where the constants
R0=F−1(A)=−1bln{−12cln[I(0)−AN0I(0)−(2−A)N0]}. | (19) |
AM is a PhD student supported by a grant from the Ministry of Education and Research of France through the Ecole Doctorale Ingénierie pour la Santé, la Cognition et l'Environnement (EDISCE) of Grenoble Alpes University.We are grateful to M. Artois for fruitful discussions. This work has benefited from the support of the Ministry of Agriculture and fisheries under the Project Cas DAR 7074.
[1] |
M. Arnold, I. Soerjomataram, J. Ferlay, D. Forman, Global incidence of oesophageal cancer by histological subtype in 2012, Gut, 64 (2015), 381–387. https://doi.org/10.1136/gutjnl-2014-308124 doi: 10.1136/gutjnl-2014-308124
![]() |
[2] |
E. J. Snider, G. Compres, D. E. Freedberg, H. Khiabanian, Y. R. Nobel, S. Stump, et al., Alterations to the Esophageal Microbiome Associated with Progression from Barrett's Esophagus to Esophageal Adenocarcinoma, Cancer Epidem. Biomar. Prev., 28 (2019), 1687–1693. https://doi.org/10.1158/1055-9965.EPI-19-0008 doi: 10.1158/1055-9965.EPI-19-0008
![]() |
[3] |
J. Zhao, Y. T. He, R. S. Zheng, S. W. Zhang, W. Q. Chen, Analysis of esophageal cancer time trends in China, 1989–2008, Asian Pac. J. Cancer Prev., 13 (2012), 4613–4617. https://doi.org/10.7314/apjcp.2012.13.9.4613 doi: 10.7314/apjcp.2012.13.9.4613
![]() |
[4] |
A. Q. Liu, E. Vogtmann, D. T. Shao, C. C. Abnet, H. Y. Dou, Y. Qin, et al., A Comparison of Biopsy and Mucosal Swab Specimens for Examining the Microbiota of Upper Gastrointestinal Carcinoma, Cancer Epidem. Biomar. Prev., 28 (2019), 2030–2037. https://doi.org/10.1158/1055-9965.EPI-18-1210 doi: 10.1158/1055-9965.EPI-18-1210
![]() |
[5] |
R. Lozano, M. Naghavi, K. Foreman, S. Lim, K. Shibuya, V. Aboyans, et al., Global and regional mortality from 235 causes of death for 20 age groups in 1990 and 2010: a systematic analysis for the Global Burden of Disease Study 2010, Lancet, 380 (2012), 2095–2128. https://doi.org/10.1016/S0140-6736(12)61728-0 doi: 10.1016/S0140-6736(12)61728-0
![]() |
[6] |
C. C. Abnet, M. Arnold, W. Q. Wei, Epidemiology of esophageal squamous cell carcinoma, Gastroenterol., 154 (2018), 360–373. https://doi.org/10.1053/j.gastro.2017.08.023 doi: 10.1053/j.gastro.2017.08.023
![]() |
[7] |
J. Lagergren, E. Smyth, D. Cunningham, P. Lagergren, Oesophageal cancer, Lancet, 390 (2017), 2383–2396. https://doi.org/10.1016/S0140-6736(17)31462-9 doi: 10.1016/S0140-6736(17)31462-9
![]() |
[8] |
C. C. Abnet, Y. L. Qiao, S. D. Mark, Z. W. Dong, P. R. Taylor, S. M. Dawsey, Prospective study of tooth loss and incident esophageal and gastric cancers in China, Cancer Causes Control, 12 (2001), 847–854. https://doi.org/10.1023/a:1012290009545 doi: 10.1023/a:1012290009545
![]() |
[9] |
N. A. Dar, F. Islami, G. A. Bhat, I. A. Shah, M. A. Makhdoomi, B. Iqbal, et al., Poor oral hygiene and risk of esophageal squamous cell carcinoma in Kashmir, Br. J. Cancer, 109 (2013), 1367–1372. https://doi.org/10.1038/bjc.2013.437 doi: 10.1038/bjc.2013.437
![]() |
[10] |
N. Guha, P. Boffetta, V. Wünsch Filho, J. Eluf Neto, O. Shangina, D. Zaridze, et al., Oral health and risk of squamous cell carcinoma of the head and neck and esophagus: results of two multicentric case-control studies, Am. J. Epidemiol., 166 (2007), 1159–1173. https://doi.org/10.1093/aje/kwm193 doi: 10.1093/aje/kwm193
![]() |
[11] |
S. Kageyama, T. Takeshita, M. Furuta, M. Tomioka, M. Asakawa, S. Suma, et al., Relationships of variations in the tongue microbiota and pneumonia mortality in nursing home residents, J. Gerontol. A, 73 (2018), 1097–1102. https://doi.org/10.1093/gerona/glx205 doi: 10.1093/gerona/glx205
![]() |
[12] |
K. E. Kholy, R. J. Genco, T. E. Dyke, Oral infections and cardiovascular disease, Trends Endocrin. Met., 26 (2015), 315–321. https://doi.org/10.1016/j.tem.2015.03.001 doi: 10.1016/j.tem.2015.03.001
![]() |
[13] |
E. Zaura, B. W. Brandt, A. Prodan, M. J. Teixeira de Mattos, S. Imangaliyev, J. Kool, et al., On the ecosystemic network of saliva in healthy young adults, ISME J., 11 (2017), 1218–1231. https://doi.org/10.1038/ismej.2016.199 doi: 10.1038/ismej.2016.199
![]() |
[14] |
R. Vasapolli, K. Schütte, C. Schulz, M. Vital, D. Schomburg, D. H. Pieper, et al., Analysis of transcriptionally active bacteria throughout the gastrointestinal tract of healthy individuals, Gastroenterology, 157 (2019), 1081–1092. https://doi.org/10.1053/j.gastro.2019.05.068 doi: 10.1053/j.gastro.2019.05.068
![]() |
[15] |
X. Cao, Intestinal inflammation induced by oral bacteria, Science, 358 (2017), 308–309. https://doi.org/10.1126/science.aap9298 doi: 10.1126/science.aap9298
![]() |
[16] |
B. Corning, A. P. Copland, J. W. Frye, The esophageal microbiome in health and disease, Curr. Gastroenterol. Rep., 20 (2018), 1–7. https://doi.org/10.1007/s11894-018-0642-9 doi: 10.1007/s11894-018-0642-9
![]() |
[17] |
X. Chen, B. Winckler, M. Lu, H. Cheng, Z. Yuan, Y. Yang, et al., Oral microbiota and risk for esophageal squamous cell carcinoma in a high-risk area of China, PloS One, 10 (2015), e0143603. https://doi.org/10.1371/journal.pone.0143603 doi: 10.1371/journal.pone.0143603
![]() |
[18] |
Z. Li, L. Dou, Y. Zhang, S. He, D. Zhao, C. Hao, et al., Characterization of the oral and esophageal microbiota in esophageal precancerous lesions and squamous cell carcinoma, Front. Cell. Infect. Microbiol., 11 (2021), 714162. https://doi.org/10.3389/fcimb.2021.714162 doi: 10.3389/fcimb.2021.714162
![]() |
[19] |
H. Li, Z. Lou, H. Zhang, N. Huang, D. Li, C. Luo, et al., Characteristics of oral microbiota in patients with esophageal cancer in China, BioMed Res. Int., 2021 (2021), 2259093. https://doi.org/10.1155/2021/2259093 doi: 10.1155/2021/2259093
![]() |
[20] |
M. Kawasaki, Y. Ikeda, E. Ikeda, M. Takahashi, D. Tanaka, Y. Nakajima, et al., Oral infectious bacteria in dental plaque and saliva as risk factors in patients with esophageal cancer, Cancer, 127 (2021), 512–519. https://doi.org/10.1002/cncr.33316 doi: 10.1002/cncr.33316
![]() |
[21] |
Q. Zhao, T. Yang, Y. Yan, Y. Zhang, Z. Li, Y. Wang, et al., Alterations of Oral microbiota in Chinese patients with esophageal cancer, Front. Cell. Infect. Microbiol., 10 (2020), 541144. https://doi.org/10.3389/fcimb.2020.541144 doi: 10.3389/fcimb.2020.541144
![]() |
[22] |
Q. Wang, Y. Rao, X. Guo, N. Liu, S. Liu, P. Wen, et al., Oral microbiome in patients with oesophageal squamous cell carcinoma, Sci. Rep., 9 (2019), 19055. https://doi.org/10.1038/s41598-019-55667-w doi: 10.1038/s41598-019-55667-w
![]() |
[23] |
F. Liu, M. Liu, Y. Liu, C. Guo, Y. Zhou, F. Li, et al., Oral microbiome and risk of malignant esophageal lesions in a high-risk area of China: A nested case-control study, Chinese J. Cancer Res., 32 (2020), 742–754. https://doi.org/10.21147/j.issn.1000-9604.2020.06.07 doi: 10.21147/j.issn.1000-9604.2020.06.07
![]() |
[24] |
B. A. Peters, J. Wu, Z. Pei, L. Yang, M. P. Purdue, N. D. Freedman, et al., Oral microbiome composition reflects prospective risk for esophageal cancers, Cancer Res., 77 (2017), 6777–6787. https://doi.org/10.1158/0008-5472.CAN-17-1296 doi: 10.1158/0008-5472.CAN-17-1296
![]() |
[25] | W. Lv, Identification of the Microbial Composition of the Patients with Esophageal Squamous Cell Carcinoma and Analysis of the Differences in Microbial Composition from Healthy Subjects, Master thesis, Hebei Medical University in Shijiazhuang, 2021. https://doi.org/10.27111/d.cnki.ghyku.2021.000887 |
[26] | D. Shao, The Characteristic of Microbial Communities of Oral Cavity, Esophagus and Cardia of Population in High-Risk Regions of Esophageal Cancer in China, Ph.D thesis, Peking Union Medical College in Beijing, 2021. https://doi.org/10.27648/d.cnki.gzxhu.2021.000407 |
[27] | Y. Lu, Microbiota of the Tumor Tissue and Saliva in Patients with Esophageal Cancer, Ph.D thesis, Peking Union Medical College in Beijing, 2021. |
[28] |
K. Kourou, T. P. Exarchos, K. P. Exarchos, M. V. Karamouzis, D. I. Fotiadis, Machine learning applications in cancer prognosis and prediction, Comput. Struct. Biotechnol. J., 13 (2014), 8–17. https://doi.org/10.1016/j.csbj.2014.11.005 doi: 10.1016/j.csbj.2014.11.005
![]() |
[29] |
R. Mofidi, C. Deans, M. D. Duff, A. C. de Beaux, S. Paterson Brown, Prediction of survival from carcinoma of oesophagus and oesophago-gastric junction following surgical resection using an artificial neural network, Eur. J. Surg. Oncol., 32 (2006), 533–539. https://doi.org/10.1016/j.ejso.2006.02.020 doi: 10.1016/j.ejso.2006.02.020
![]() |
[30] |
Y. Hayashida, K. Honda, Y. Osaka, T. Hara, T. Umaki, A. Tsuchida, et al., Possible prediction of chemoradiosensitivity of esophageal cancer by serum protein profiling, Clin. Cancer Res., 11 (2005), 8042–8047. https://doi.org/10.1158/1078-0432.CCR-05-0656 doi: 10.1158/1078-0432.CCR-05-0656
![]() |
[31] |
Z. Xun, Q. Zhang, T. Xu, N. Chen, F. Chen, Dysbiosis and ecotypes of the salivary microbiome associated with inflammatory bowel diseases and the assistance in diagnosis of diseases using oral bacterial profiles, Front. Microbiol., 9 (2018), 1136. https://doi.org/10.3389/fmicb.2018.01136 doi: 10.3389/fmicb.2018.01136
![]() |
[32] |
L. A. David, A. C. Materna, J. Friedman, M. I. Campos-Baptista, M. C. Blackburn, A. Perrotta, et al., Host lifestyle affects human microbiota on daily timescales, Genome Biol., 15 (2014), R89. https://doi.org/10.1186/gb-2014-15-7-r89 doi: 10.1186/gb-2014-15-7-r89
![]() |
[33] | J. Wei, Analysis of Oral Salivary Microbiota in Patients with Esophageal Squamous Cell Carcinoma and its Clinical Significance, Master thesis, Southern Medical University in Canton, 2020. https://doi.org/10.27003/d.cnki.gojyu.2020.000723 |
[34] | Z. Zhu, Study on Risk Factors, Serum Biomarkers, and Salivary Microbiota of Upper Gastrointestinal Cancers, Ph.D thesis, Peking Union Medical College in Beijing, 2021. https://doi.org/10.27648/d.cnki.gzxhu.2021.000132 |
[35] |
X. Wan, W. Wang, J. Liu, T. Tong, Estimating the sample mean and standard deviation from the sample size, median, range and/or interquartile range, BMC Med. Res. Methodol., 14 (2014), 1–13. https://doi.org/10.1186/1471-2288-14-135 doi: 10.1186/1471-2288-14-135
![]() |
[36] | F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion, O. Grisel, et al., Scikit-learn: machine learning in Python, preprint, arXiv: 1201.0490. |
[37] |
D. Krstajic, L. J. Buturovic, D. E. Leahy, S. Thomas, Cross-validation pitfalls when selecting and assessing regression and classification models, J. Cheminform., 6 (2014), 1–15. https://doi.org/10.1186/1758-2946-6-10 doi: 10.1186/1758-2946-6-10
![]() |
[38] |
G. Biau, E. Scornet, A Random Forest Guided Tour, Test, 25 (2016), 197–227. https://doi.org/10.1007/s11749-016-0481-7 doi: 10.1007/s11749-016-0481-7
![]() |
[39] | F. Najar, S. Bourouis, N. Bouguila, S. Belghith, A comparison between different Gaussian-based mixture models, in 2017 IEEE/ACS 14th International Conference on Computer Systems and Applications (AICCSA), (2017), 704–708. https://doi.org/10.1109/AICCSA.2017.108 |
[40] | I. Saini, D. Singh, A. Khosla, Delineation of ECG wave components using K-nearest neighbor (KNN) algorithm: ECG wave delineation using KNN, in 2013 10th International Conference on Information Technology: New Generations, (2013), 712–717. https://doi.org/10.1109/ITNG.2013.76 |
[41] | K. He, C. He, Housing price analysis using linear regression and logistic regression: A comprehensive explanation using melbourne real estate data, in 2021 IEEE International Conference on Computing (ICOCO), (2021), 241–246. https://doi.org/10.1109/ICOCO53166.2021.9673533 |
[42] |
A. P. Bradley, The use of the area under the ROC curve in the evaluation of machine learning algorithms, Pattern Recogn., 30 (1997), 1145–1159. https://doi.org/10.1016/s0031-3203(96)00142-2 doi: 10.1016/s0031-3203(96)00142-2
![]() |
[43] |
B. Pan, Application of XGBoost algorithm in hourly PM2.5 concentration prediction, IOP Conf. Ser.: Earth Environ. Sci., 113 (2018), 012127. https://doi.org/10.1088/1755-1315/113/1/012127 doi: 10.1088/1755-1315/113/1/012127
![]() |
[44] |
M. Moghtadaei, M. R. Golpayegani, F. Almasganj, A. Etemadi, M. R. Akbari, R. Malekzadeh, Predicting the risk of squamous dysplasia and esophageal squamous cell carcinoma using minimum classification error method, Comput. Biol. Med., 45 (2014), 51–57. https://doi.org/10.1016/j.compbiomed.2013.11.011 doi: 10.1016/j.compbiomed.2013.11.011
![]() |
1. | Ying Liu, Haozhu Wang, Huixue Zhou, Mingchen Li, Yu Hou, Sicheng Zhou, Fang Wang, Rama Hoetzlein, Rui Zhang, A review of reinforcement learning for natural language processing and applications in healthcare, 2024, 31, 1067-5027, 2379, 10.1093/jamia/ocae215 | |
2. | Utkarsh Dixit, Sonam Gupta, Arun Kumar Yadav, Divakar Yadav, 2023, Recent Advances in DL-based Text Summarization: A Systematic Review, 979-8-3503-9926-4, 391, 10.1109/ICACITE57410.2023.10183122 | |
3. | Ankit Raj, Mehul Raj, N. Umasankari, D. Geethanjali, 2024, Document-Based Text Summarization using T5 small and gTTS, 979-8-3503-6482-8, 1, 10.1109/ADICS58448.2024.10533605 | |
4. | Yongsik Kim, Su-Youn Hong, Sungjin Park, Huy Kang Kim, Reinforcement Learning-Based Generative Security Framework for Host Intrusion Detection, 2025, 13, 2169-3536, 15346, 10.1109/ACCESS.2025.3532353 |
heterogeneity | outbreak response | |
host 1 | host 2 | |
dilution | dilution | |
amplification | dilution | |
no effect | no effect | |
dilution | dilution | |
no effect | amplification |