Research article Special Issues

Mathematical modeling of tumor growth: the MCF-7 breast cancer cell line

  • Breast cancer is the second most commonly diagnosed cancer in women worldwide. MCF-7 cell line is an extensively studied human breast cancer cell line. This cell line expresses estrogen receptors, and the growth of MCF-7 cells is hormone dependent. In this study, a mathematical model, which governs MCF-7 cell growth with interaction among tumor cells, estradiol, natural killer (NK) cells, cytotoxic T lymphocytes (CTLs) or CD8+ T cells, and white blood cells (WBCs), is proposed. Experimental data are used to determine functional forms and parameter values. Breast tumor growth is then studied using the mathematical model. The results obtained from numerical simulation are compared with those from clinical and experimental studies. The system has three coexisting stable equilibria representing the tumor free state, a microscopic tumor, and a large tumor. Numerical simulation shows that an immune system is able to eliminate or control a tumor with a restricted initial size. A healthy immune system is able to effectively eliminate a small tumor or produces long-term dormancy. An immune system with WBC count at the low parts of the normal ranges or with temporary low NK cell count is able to eliminate a smaller tumor. The cytotoxicity of CTLs plays an important role in immune surveillance. The association between the circulating estradiol level and cancer risk is not significant.

    Citation: Hsiu-Chuan Wei. Mathematical modeling of tumor growth: the MCF-7 breast cancer cell line[J]. Mathematical Biosciences and Engineering, 2019, 16(6): 6512-6535. doi: 10.3934/mbe.2019325

    Related Papers:

    [1] Hao Yuan, Qiang Chen, Hongbing Li, Die Zeng, Tianwen Wu, Yuning Wang, Wei Zhang . Improved beluga whale optimization algorithm based cluster routing in wireless sensor networks. Mathematical Biosciences and Engineering, 2024, 21(3): 4587-4625. doi: 10.3934/mbe.2024202
    [2] Romulus Breban, Ian McGowan, Chad Topaz, Elissa J. Schwartz, Peter Anton, Sally Blower . Modeling the potential impact of rectal microbicides to reduce HIV transmission in bathhouses. Mathematical Biosciences and Engineering, 2006, 3(3): 459-466. doi: 10.3934/mbe.2006.3.459
    [3] Georgi Kapitanov . A double age-structured model of the co-infection of tuberculosis and HIV. Mathematical Biosciences and Engineering, 2015, 12(1): 23-40. doi: 10.3934/mbe.2015.12.23
    [4] Churni Gupta, Necibe Tuncer, Maia Martcheva . A network immuno-epidemiological model of HIV and opioid epidemics. Mathematical Biosciences and Engineering, 2023, 20(2): 4040-4068. doi: 10.3934/mbe.2023189
    [5] Nawei Chen, Shenglong Chen, Xiaoyu Li, Zhiming Li . Modelling and analysis of the HIV/AIDS epidemic with fast and slow asymptomatic infections in China from 2008 to 2021. Mathematical Biosciences and Engineering, 2023, 20(12): 20770-20794. doi: 10.3934/mbe.2023919
    [6] Aditya S. Khanna, Dobromir T. Dimitrov, Steven M. Goodreau . What can mathematical models tell us about the relationship between circular migrations and HIV transmission dynamics?. Mathematical Biosciences and Engineering, 2014, 11(5): 1065-1090. doi: 10.3934/mbe.2014.11.1065
    [7] Lih-Ing W. Roeger, Z. Feng, Carlos Castillo-Chávez . Modeling TB and HIV co-infections. Mathematical Biosciences and Engineering, 2009, 6(4): 815-837. doi: 10.3934/mbe.2009.6.815
    [8] Yicang Zhou, Yiming Shao, Yuhua Ruan, Jianqing Xu, Zhien Ma, Changlin Mei, Jianhong Wu . Modeling and prediction of HIV in China: transmission rates structured by infection ages. Mathematical Biosciences and Engineering, 2008, 5(2): 403-418. doi: 10.3934/mbe.2008.5.403
    [9] Wenshuang Li, Shaojian Cai, Xuanpei Zhai, Jianming Ou, Kuicheng Zheng, Fengying Wei, Xuerong Mao . Transmission dynamics of symptom-dependent HIV/AIDS models. Mathematical Biosciences and Engineering, 2024, 21(2): 1819-1843. doi: 10.3934/mbe.2024079
    [10] Nariyuki Nakagiri, Hiroki Yokoi, Yukio Sakisaka, Kei-ichi Tainaka, Kazunori Sato . Influence of network structure on infectious disease control. Mathematical Biosciences and Engineering, 2025, 22(4): 943-961. doi: 10.3934/mbe.2025034
  • Breast cancer is the second most commonly diagnosed cancer in women worldwide. MCF-7 cell line is an extensively studied human breast cancer cell line. This cell line expresses estrogen receptors, and the growth of MCF-7 cells is hormone dependent. In this study, a mathematical model, which governs MCF-7 cell growth with interaction among tumor cells, estradiol, natural killer (NK) cells, cytotoxic T lymphocytes (CTLs) or CD8+ T cells, and white blood cells (WBCs), is proposed. Experimental data are used to determine functional forms and parameter values. Breast tumor growth is then studied using the mathematical model. The results obtained from numerical simulation are compared with those from clinical and experimental studies. The system has three coexisting stable equilibria representing the tumor free state, a microscopic tumor, and a large tumor. Numerical simulation shows that an immune system is able to eliminate or control a tumor with a restricted initial size. A healthy immune system is able to effectively eliminate a small tumor or produces long-term dormancy. An immune system with WBC count at the low parts of the normal ranges or with temporary low NK cell count is able to eliminate a smaller tumor. The cytotoxicity of CTLs plays an important role in immune surveillance. The association between the circulating estradiol level and cancer risk is not significant.


    In the United States, an estimated 1.15 to 1.2 million people above the age of 13 years were living with a diagnosed or undiagnosed human immunodeficiency virus (HIV) infection [1] and 37,515 persons received diagnoses of HIV infection in 2018 [2]. In addition to the disease burden, the economic burden of HIV is high, with estimated lifetime treatment costs of $250,000 to $400,000 per person [3]. However, new advancements in HIV testing, treatment for HIV-infected persons, and pre-exposure prophylaxis (PrEP) for uninfected persons at elevated risk of infection can prevent transmission. Testing allows for early diagnosis and thus treatment initiation, consistent HIV treatment helps achieve and maintain viral suppression and can result in effectively no risk of sexual transmission, and PrEP provides a preventative option for uninfected persons at an elevated risk of HIV acquisition [4,5]. The challenge is to identify the most effective strategy for allocation of resources to at-risk populations to ensure HIV prevention.

    Simulation models have played a key role in identifying populations at-risk for HIV [6] and evaluating population-specific interventions to inform implementation of focused intervention programs [7,8]. One approach has been to stratify the national population into different groups based on demographic and behavioral factors, evaluate the risk of transmission for each group [9,10,11,12], and evaluate the impact of alternative intervention strategies specific to each group [13,14,15,16]. Common population stratifications have included combinations of transmission risk group (heterosexuals, men who have sex with men (MSM), and persons who inject drugs (PWID)), age group, race/ethnicity, and HIV care continuum stage of infected persons [17]. US National HIV Surveillance System (NHSS) [4] data indicate significant differences in HIV diagnosis, care, and treatment across these population groups, suggestive of differences in risk across these groups and thus the need for more focused interventions.

    Recent studies using HIV nucleotide sequence data, routinely collected as part of NHSS, have indicated that molecular sequence analysis can identify networks among which HIV transmission is occurring rapidly (i.e., substantially higher than average transmission rates) [18,19]. These findings identify a new method for identifying networks of persons at increased risk of HIV infection and could allow for developing a more focused and effective approach to intervention by ensuring that prevention resources effectively reach those in need. Molecular sequence analysis identifies clusters, or groups of HIV infections that are genetically very similar. Because HIV evolves rapidly, very similar HIV nucleotide sequences indicate that HIV transmission is occurring rapidly in a common network [19]. While an average of 3.5 HIV transmission events are estimated to occur per 100 persons living with HIV per year in the United States, analysis of nucleotide sequence data collected through the NHSS identified rapidly growing clusters with an average of 44 transmission events per 100 persons per year [19]. Clusters are thus indicative of where transmission of HIV is rapidly occurring and where prevention resources would be most useful in curbing new infections.

    The response to HIV clusters and outbreaks is a key part of one of the four strategic pillars of the U.S. Ending the HIV Epidemic initiative [20]. The goal of the strategic plan is to reduce new infections by 75% by 2025 and 90% by 2030. The initiative includes 4 strategic pillars to reach this goal, namely: 1) to diagnose all PWH as early as possible, 2) to treat the infection rapidly and effectively to achieve sustained viral suppression [5], 3) to prevent new HIV transmissions by using proven interventions, including pre-exposure prophylaxis [21,22,23,24,25] and syringe services programs [25,26,27,28,29,30], and 4) to respond quickly to potential HIV outbreaks to get needed prevention and treatment services to people who need them. Cluster detection is a key to identify potential outbreaks rapidly to guide response efforts, including implementing interventions for early diagnosis and treatment of HIV-infected persons and prevention options for those in associated networks who are at high risk of infection.

    A simulation model that accurately replicates HIV transmission networks in the United States and its cluster dynamics is a key tool to evaluate and guide cluster-based prevention strategies. We present a new version of the Progression and Transmission of HIV (PATH 4.0) simulation model, constructed using a newly developed agent-based evolving network modeling (ABENM) simulation technique, and a new network generation algorithm, evolving contact network algorithm (ECNA) [31]. The earlier version of PATH [6] simulated only infected persons as agents and modeled all characteristics including demographic, sexual behavior, and partnerships as features of the agents without explicitly generating the contact network. PATH 4.0 uses the disease progression module from the earlier version of PATH [6], and reconstructs the full computational structure using ABENM and ECNA (creating the hybrid compartmental and ABNM structure), adding new modules for simulating the functionalities related to partnership formations, demographics, and sexual behavior and maintain the overall network statistics. The new simulation technique and new modules are essential for accurately simulating HIV transmission networks in the United States, and thus, for the analysis of cluster detection and response strategies. Current simulation techniques, and thus previous versions of PATH that were built using these methods [6], are infeasible to use for this application, for reasons which we will discuss in the next few paragraphs.

    There are two major types of dynamic simulation techniques used in national-level HIV modeling: compartmental modeling (also known as differential equations modeling) and agent-based network modeling (ABNM) [32]. Compartmental modeling splits the population into groups (or compartments) that represent the different states of a disease, e.g., susceptible, infected, and removed, and use a system of differential equations to simulate the rates of change for transitioning between these compartments. These models assume random mixing between people in a group, making them not suitable for representation of sexual and needle sharing contact networks that are known to follow a non-random scale-free network structure [33], where the distribution of the number of contacts per person follows a power-law distribution [34]. ABNM simulates infected and susceptible persons at the individual-level and the interactions leading to HIV transmissions through a contact network structure [35], which is ideal for modeling non-random network structures [34]. However, due to low prevalence of HIV in the United States, it is computationally challenging to apply ABNM to simulate HIV transmissions at the national level.

    Taking data from the year 2015 as an example, the computational challenge can be described as follows. The estimated overall prevalence of HIV among persons aged 13 years or older in the United States averaged about 419 per 100,000 persons nationally and ranged from an average of 64 to 852 per 100,000 persons by area of residence except for District of Columbia, which had an estimated average of 3,018 per 100,000 persons [36]. Clusters identified through molecular analyses of recently diagnosed infections (recent defined as cases diagnosed over the past 3-year period preceding the year of analyses, who in 2015 constituted about 10% of all PWH), range from 2 to 49 persons per cluster [19]. These cluster sizes shed light on the size of the underlying contact network structures to simulate. As ABNM are scaled versions of the population, simulating a population of 100,000 persons representative of the U.S. population would result in 419 HIV-infected persons, and 42 persons with recent diagnosis. These small samples of HIV-infected persons are insufficient to generate the clusters, in numbers and sizes, observed in molecular data. Further, the small samples create challenges in modeling heterogeneity by age, gender, and transmission risk-group, which are key categories for HIV because of differences in a disease's epidemiologic features [36,37,38]. As there is considerable contact mixing between these groups [39], they cannot be modeled separately. As ABNMs track interactions among N individuals, where N is the population size in the simulation, computation times are in the order of O(N2), and thus, increasing the value of N is also not a suitable solution. These issues make ABNM insufficient to use for low prevalence diseases such as HIV.

    To overcome the challenges with the application of current simulation techniques for HIV cluster detection, we developed the PATH 4.0 model using a new stochastic simulation technique ABENM. When using the ABNM simulation technique, the full contact network, i.e., the network of all infected and susceptible persons, is initially generated using a network generation algorithm [35], and disease transmissions are simulated on this network over time. As sexual contact networks are known to follow scale-free network structures [40,41], a preferential attachment algorithm would be first used for generation of scale-free networks [35] in ABNM. Contrary to ABNM, the main concept of ABENM is to simulate only infected persons and their immediate contacts at the individual level as agents of the simulation, and to model all other susceptible persons using a compartmental modeling structure. As new persons become infected, their immediate contacts are added as agents (transitioning from the compartmental portion of the model to the network portion of the model), thus evolving the contact network. The key challenge is determining 'who', i.e., the degree (number of contacts), risk group, age, and geographical location of the person, to be added as the immediate contacts of the newly infected node. These characteristics of the infected persons and their contacts are known to be correlated, i.e., persons are more likely to have sexual partners who are similar to them, say of similar age or have similar degree (number of partners) [39]. Correlation in number of partners is a key mathematical feature known as degree correlation between neighboring nodes in scale-free network structure [42], where degree is the number of links (contacts) of a node (person) in the network [31]. In previous work, we developed the ECNA, which uses a neural network model to predict degree correlations, i.e., determine, based on the degree of the newly infected person, the degree of each of their immediate contacts [31]. The ECNA can be used as a network generation algorithm for generation of scale-free networks in ABENM. However, our previous work only modeled hypothetical networks and diseases using simplified data assumptions.

    We developed PATH 4.0 model using ABENM and ECNA. Specifically, expanding on the concepts of ECNA, we developed a new network generation model (HIV-ECNA) for simulating HIV transmissions. For simulating the progression of infection along the HIV disease and care continuum stages for HIV-infected persons, we adopted the disease and care continuum progression model from PATH 2.0 [6]. We implemented PATH 4.0 for prediction of sexually transmitted cases of HIV in the United States over the period 2006 to 2017, among heterosexual female (HET female), heterosexual male (HET male), and men who have sex with men (MSM). We did not model HIV transmissions through injection drug use. We validated the model by comparing epidemic predictions from PATH 4.0 with data from the U.S. NHSS for years 2010 to 2017.

    In this paper, we present the development and implementation of PATH 4.0 to the United States. To demonstrate the ability of PATH 4.0 to generate clusters similar to those detected in molecular data, we compare clusters extracted from the PATH 4.0 transmission network to those identified by nucleotide sequencing of HIV genetic data collected from HIV-infected persons by the NHSS [19]. We used a previously developed cluster generation algorithm for extraction of clusters from PATH 4.0 [43].

    In this section, we discuss the structure and methods of PATH 4.0 and its implementation for simulating the HIV epidemic in the United States for the period 2006 to 2017. This section is structured as follows: in Section 2.1, we discuss the overall computational structure of PATH 4.0; in Section 2.2, we discuss the four main modules of PATH 4.0, namely, a compartmental module for simulating susceptible persons, a Bernoulli transmission module for simulating new infections, the HIV-ECNA network generation module for generating sexual partnership networks of newly infected persons, and a disease progression module for simulating HIV-related events for HIV-infected persons; and in Section 2.3, we discuss the implementation of PATH 4.0 for simulating HIV in the United States for the period 2006 to 2017, and provide an overview of the full simulation model. While we present only a limited version of the mathematical concepts and data assumptions and sources here, further details for each section can be found in the corresponding sections of the Appendix. All mathematical notations used in the section are summarized in Table 1. PATH 4.0 was computationally coded in the Netlogo 6.1.0 [44] software, an open-source programmable modeling environment for agent-based modeling with network features.

    Table 1.  Table of Notations.
    Notation Description
    t Simulation time-step.
    A The number of age-groups.
    R The number of risk-groups.
    D The number of degree bins.
    G The number of pseudo-geographic jurisdictions.
    a; r; d; g Used when referring to an age-group, risk-group, degree-bin, and pseudo-geographic jurisdiction, respectively.
    St[a, r, d, g] An array of size A×R×D×G representing the number of susceptible persons in the model, in age-group a, risk group r, degree-bin d, and pseudo-geographic jurisdiction g, at time t.
    N A set of nodes, each representing an infected person or a susceptible sexual partner.
    E A set of edges representing sexual partnerships between nodes.
    Gt(N,E) A dynamic graph with N a set of nodes and E a set of edges, at time t.
    Qt The number of nodes in graph G, at time t.
    Ct[i,j] A static adjacency matrix of size Qt ×Qt, with static element Ct[i,j]=1 if i and j are sexual partners anytime during their lifetime and Ct[i,j]=0 otherwise.
    Vt[i,j] A dynamic adjacency matrix of size Qt×Qt, with element Vt[i,j]=1 if i and j are sexual partners during month t and Vt[i,j]=0 otherwise.
    e={i,j} An edge in graph Gt representing a sexual partnership between nodes i and j
    ¯t({i,j}) The partnership initiation time; represents the simulation month for partnership initiation.
    t_({i,j}) The partnership termination time; represents the simulation month for partnership termination.
    {ai,aj} The age of nodes i and j at the time of their partnership initiation.
    {a_i,a_j} The age of nodes i and j at the time of their partnership termination.
    at,j Age-group of node j at time t.
    at,j Age of node j at time t.
    dj Degree-bin corresponding to the number of lifetime partners of node j.
    dj The actual number of lifetime sexual partners of node j.
    ˆdt,j The number of lifetime sexual partners of person j who are already added as nodes in graph G at time t. For infected nodes dj=ˆdt,j for susceptible nodes in G, djˆdt,j
    Lt,j A partnership distribution matrix of size A×2, where Lt,j[a,1] is the number of partnerships that initiate at age-group a, and Lt,j[a,2] is the number of partnerships that are yet to be assigned. For infected nodes, Lt,j[a,2]=0,a
    ht,j Infection status of node j at time t.
    mt,j Deceased status of node j at time t.
    rj Risk-group of person j.
    st,j Care continuum or disease stage of person j at time t.
    pt,j Infectiousness or risk of transmission per act for person j at time t.
    ϵ Condom effectiveness.
    st,j The number of sex acts per month for person j at time t.
    ct,j The proportion of acts condom protected of person j at time t.
    F1(u) The inverse Bernoulli distribution that takes values 1 with probability u and 0 with probability 1u.
    Dk Random variable for degree of node k.
    Pr(Dk=dk|Dl=dl) Conditional probability distribution for Dk.
    Pr(Dk=dk) Marginal probability distribution for Dk.
    m Minimum degree of the network.
    λrl Scale-free network parameter corresponding to the risk-group of node l.
    L[a,d] A matrix of size A×D, representing the proportion of partnerships that initiate at age-group a for persons in degree-bin d.

     | Show Table
    DownLoad: CSV

    In this section we present the overall computational structure of PATH to help describe how the ABENM simulation technique combines the features of ABNM and compartmental simulation techniques, but we first briefly describe the main features of this computational structure. Only HIV-infected persons and their immediate contacts (both susceptible and infected contacts) are modeled at the individual-level as agents or nodes in a network (as in an ABNM), and all other susceptible persons are modeled at the population-level (as in a compartmental model). Therefore, at any time point in the simulation, all infected persons are nodes in the network, and all contacts an infected person would have over their lifetime (the contacts may be infected or susceptible) are also nodes in the network. Over time, as new persons become infected, they are added to the network, but at any "current" time point of the simulation, only persons who are currently infected and their contacts (all partners the infected person would have over their lifetime) are nodes in the network. An infected person is connected to each of their lifetime partners through an edge (link), and thus an edge (or link) represents a partnership. Thus, if a node is infected, the model is set up to ensure that their current degree (defined as the number of persons they are linked to in the network at that time point in the simulation) is equal to their actual degree (defined as the number of partners the infected person will have over their lifetime). However, the links are set up such that each partnership (link) is activated and deactivated over time based on when the partnership initiated and dissolved, through the use of edge features (similar to assigning features such as age to a node, we can assign features to an edge) to keep track of partnership initiation and termination times. As the only susceptible persons who are tracked as nodes in the network are those who are contacts of an infected person, the current degree of a susceptible node is less than or equal to their actual degree as they are only connected to their infected contacts. Note that it is possible that the susceptible contacts of a newly infected node are already in the network as contacts of other infected persons, but links between two partners are generated only when at least one of them become infected (the methodological process to achieve this is the core of the newly developed HIV-ECNA network generation model and is discussed in Section 2.2.2—in this section we only describe the general computational structure of PATH). All susceptible persons who are not contacts of a currently infected person are in the compartmental model. We will discuss in Section 2.2. the process of determining if a susceptible person in the compartmental model would become a contact of an infected node in the network, and the modeling of their transitioning from the compartmental model to the network.

    We next mathematically present the computational structure of PATH 4.0.

    Following the compartmental modeling structure, we use a four-dimensional array to keep track of the number of susceptible persons (who are not contacts of a currently infected person), specifically, let,

    St = an array of size A×R×D×G, where A is the number of age-groups, R is the number of risk-groups, D is the number of degree-bins (degree is the number of contacts per person and degrees are grouped into bins analogous to age grouped into age-groups), and G is the number of pseudo-geographic jurisdictions, to model heterogeneity in contact mixing as persons are more likely to form partnerships with persons in the same geographic area (here we only model 'pseudo'-geographic jurisdictions, i.e., we assigned persons to a randomly chosen jurisdiction for purposes of introducing heterogeneity in contact selection, to more realistically represent network structure formations, but we did not explicitly model geographic features of the epidemic as the focus of this paper is on national agammaegated estimates) [45] then

    St[a,r,d,g] is the number of susceptible persons in age-group a, risk group r, degree-bin d, and pseudo-geographic jurisdiction g, at time t.

    Following the agent-based network modeling structure, we use a dynamic graph Gt(N,E) to track HIV-infected persons and their immediate contacts (these contacts may be infected or susceptible), where N is a set of nodes, each node representing an infected person or a susceptible sexual partner, and E(Gt) is a set of undirected edges, an edge {i,j} representing a sexual partnership between nodes i and j. The number of nodes in the graph Qt=|N(Gt)| and the number of edges |E(Gt)| are dynamically changing over time t.

    The graph Gt(N,E) has the following features:

    Static adjacency matrix: Ct of time-variant size Qt×Qt, with static elements Ct[i,j]=1 if i and j are sexual partners anytime during their lifetime and Ct[i,j]=0 otherwise, and

    Dynamic adjacency matrix: Vt of time-variant size Qt×Qt, with element Vt[i,j]=1 if i and j are in a partnership during month t and Vt[i,j]=0 otherwise.

    Each edge {i,j}ϵE has the following features (similar to nodes having features of say age, sex, etc., edges can also have features):

    Partnership initiation time:¯t({i,j}) representing the simulation month for when the partnership initiated,

    Partnership termination time: t_({i,j}) representing the simulation month when the partnership terminated,

    Partnership initiation age: {ai,aj} representing the age of nodes iandj, at the time of partnership initiation, and

    Partnership termination age: {a_i,a_j} representing the age of nodes iandj, at the time of partnership termination.

    Each node jϵN(Gt) has the following features:

    Actual degree:dj representing the actual number of lifetime sexual partners of node j,

    Current degree: ˆdt,j representing the number of lifetime sexual partners of person j who are already added as nodes in Gt(N,E); if node j is infected ˆdt,j=dj, if node j is susceptible ˆdt,jdj, and thus dynamically changing with time t,

    Partnership distribution matrix: Lt,j of size A×2, where A is the number of age-groups, Lt,j[¯a,1] is the number of partnerships that node j initiates in age group ¯a, and Lt,j[¯a,2] is the number of partnerships that are yet to be assigned; the sub-script t are to indicate that the values of column 2 of Lt,j can change over time, specifically, Lt,j[,2] is a column of zeros if the node is infected as all their partnerships are already assigned, and greater than or equal to zero if the node is susceptible (when the susceptible person is added as a contact of a different infected person one of the rows is decremented, and when the susceptible person become infected all rows of column 2 are decremented to zero as their partners are found and added - the HIV-ECNA was specifically developed for determining when and how to assign these partnerships, and thus generating the network, which is discussed in Section 2.2.3.),

    Infection status: ht,j=1 if node j is an HIV-infected node and ht,j=0 otherwise,

    Deceased status: mt,j=1 if node j is alive and 0 otherwise,

    Age: at,j taking an integer value representative of the age of node j,

    Pseudo-geographic jurisdiction: gj taking an integer value representative of the pseudo-geographic location of node j,

    Risk group: rj taking one of the following values, representative of risk-group of node j, rj{heterosexual female, heterosexual male, MSM}, and

    Care continuum and disease stage: st,j taking one of the following values, 0 (not infected), 1 (infected, acute HIV stage, and undiagnosed), 2 (non-acute HIV, and undiagnosed), 3 (diagnosed and not in care), 4 (in care not on antiretroviral therapy (ART) treatment), 5 (on ART no viral load suppression (VLS)), or 6 (on ART with VLS).

    The main relationships between different components of the graph Gt(N,E) are the following.

    Between partnership initiation ¯t({i,j}) and termination t_({i,j}) times and static and dynamic adjacency matrices (Ct and Vt):

    Ct[i,j]={1if{i,j}ϵE0otherwise, i.e., if {i,j} are partners at some point during their life, this will have a value of 1,

    Vt[i,j]={1 if ˉt({i,j})tt_({i,j})0otherwise, i.e., if {i,j} are partners at time t this will have a value of 1, and thus, Ct[i,j]Vt[i,j].

    Between actual degree dj, current degrees ˆdt,j, and static adjacency matrix Ct:

    ˆdt,j{=dj if node j is infected dj if node j is susceptible , i.e., if a node is infected, they are linked to all partners they will have (actual degree) over their lifetime and thus dj=ˆdt,j, and if a node is susceptible, they are only linked to their infected partners and thus djˆdt,j, and

    ˆdt,j=i=1:QtCt[i,j], i.e., Ct keeps track of their current degree.

    Between actual degree dj and partnership distribution matrix Lt,j:

    ¯a=1:ALt,j[¯a,1]=dj, at any t, i.e., as Lt,j[¯a,1] tracks number of partnerships that initiate at age-group ¯a, when summed over all ¯a it should add to the actual degree dj for all nodes whether infected or susceptible, and

    ˉa=1:ALt,j[ˉa,2]={djˆdt,j if node j is susceptible 0 if node j is infected , i.e., as Lt,j[¯a,2] tracks number of partnerships that initiate at age-group ¯a and are yet to be generated, ¯a=1:ALt,j[¯a,2] would be zero if the node is infected because all partnerships of an infected node are already connected in the network, and would be equal to the number of partners yet to be assigned if the node is susceptible. (Assigning partnerships and all other features related to the network are part of the newly developed HIV-ECNA network generation algorithm, discussed later).

    This section presented the computational structure of the model, specifically the compartmental modeling structure, the network structure, and the features of the nodes and edges in the network. A visual representation of the computational structure is presented in Figure 1(A). The next section describes the methods (modules) used in modeling these features, and Figure 1(B) provides an overview of the modules.

    Figure 1.  Schematic overview of the computational structure of ABENM (A) and simulation steps (B).

    At every time-step (monthly) of the simulation, the model runs four different modules: a compartmental module for simulating susceptible persons, a Bernoulli transmission module for simulating new infections, the HIV-ECNA network generation module for generating sexual partnership networks of newly infected persons, and a disease progression module for simulating HIV-related events for HIV-infected persons. We discuss each module below.

    Every time-step (monthly) of the simulation, this module updates the demographic features of susceptible persons tracked through the array St. Specifically, it simulates births, aging, and deaths as follows

    St[a,r,d,g]=St1[a,r,d,g]+dS[a,r,d,g]dtt
    dS[a,r,d,g]dt={Br,dμaSt1[a,r,d,g];ifa=theyoungestagegroup1317(μa+1|a|)St1[a,r,d,g]+(1|a1|)St1[a1,r,d,g];otherwise

    where,

    Br,d is the number of persons of risk group r and degree-bin d annually aging into the youngest age-group a = 13-17 years, we assumed equal birth rate for all pseudo-geographic areas,

    μa is the annual natural mortality in age-group a.

    |a| is the age-group interval size of age-group a, and thus 1|a| is the rate of aging out.

    t is 1/12 to represent the modeling of monthly time-steps.

    In the simulation, we estimate Br,d as the number of persons who age into age-group 13-17 years multiplied by the proportion of persons of risk-group r (r {heterosexual male, heterosexual female, MSM}), and further multiplied by the proportion of persons with degree-bin d (where d{1,2,D}). Values for the proportions in risk-group and degree-bin are specific to the population simulated and are discussed in the Appendix for application to the US population. We use log-2 binning for degree, i.e., persons in degree-bin d are those with lifetime number of partners (d) in 2d1<d2d. As the number of lifetime sexual partners follows a power-law distribution [34], i.e., Pr(d=k)kλ, where λ is the scale-free parameter of the distribution, following the characteristic feature of power-law distributions, it would mean that a large number of persons have lower degrees and only a few persons have a very high degree. This creates issues when using uniform binning. For example, persons with large number of partners might typically report rounded numbers, e.g., 50,100, instead of 48 or 98 partners, so using uniform bins of say width 5 or 10 would create spikes at rounded values and zero around it. Therefore, as commonly done, for degree-bins we use log-2 binning, i.e., persons in degree-bin d are those with lifetime number of partners (d) in 2d1<d2d,d{1,2,D}, which would create bins of narrower intervals for smaller degree and wider intervals for larger degree. Applying the power-law distribution, we calculate the proportion of persons in degree-bin d as [2dk=2d1+1kλ(2Dk=20+1kλ)1], using the value of λ specific to the population simulated as discussed in the Appendix.

    Every time-step (monthly) of the simulation, this module determines if a susceptible node l in graph Gt(N,E) becomes infected using a Bernoulli transmission equation. Note, as susceptible persons in the compartmental model array St are not connected to an infected person, their chance of infection is zero. Further note that, susceptible persons can move from compartmental model to the network upon becoming partners of the infected person, modeled using the HIV-ECNA algorithm discussed in the next section, which would then expose them to the infection.

    Specifically, for nodes in the network with HIV infection status ht1,l=0 (denoting susceptible) the Bernoulli transmission equation is used to estimate the updated value ht,l as follows.

    ht,l=F1(1Qtj=1(1αjϵ)st,j.ct,j(1αj)st,j.(1ct,j)), where,

    αj=Vt[l,j].ht,j.mt,j.pt,j, where Vt[l,j],ht,j,mt,j are the elements of the graph described in Section 2.1, and pt,j is the probability of transmission per act modeled as a function of disease and care stage st,j and risk group rj of the infected node j; we will have a value of αj=pt,j if j is a contact of l (i.e., Vt[l,j]=1), is infected (i.e., ht,j=1), and is alive (i.e., mt,j=1), and αj=0 otherwise,

    ϵ = condom effectiveness,

    st,j = number of sex acts per month with node j, modeled as a function of age, risk group, and number of partners of node j,

    ct,j = proportion of acts with node j that is condom protected, modeled as a function of age, risk group, and number of partners of node j,

    F1(u)= an inverse Bernoulli distribution that takes a value of 1 with probability u and value of 0 with probability 1u.

    If node l becomes infected, i.e., if the above equations yield a value of 1 (ht,l=1), we set its HIV stage as 1 (i.e., set st,l=1 to denote the first stage of HIV, which is acute and undiagnosed).

    Every time-step t, this module also determines and updates any changes in sexual behavior of infected nodes. Specifically, it updates the following values. For every partnership (k,j), its active/inactive status by checking if the current time-step is within the partnership initiation and termination time, as Vt[k,j]={1 if ˉt({k,j})tt_({k,j}),indicatingitisactive0otherwise,indicatingitisinactive. For every infected node j, the number of sex acts per month as a random draw from age-group and risk-group specific uniform distribution, corresponding to age at,j and risk group rj of node j. For every infected node j, the number of sex acts per partner st,j as st,j= numberofsexactspermonthfornodejk=1:QtVt[k,j]. For every infected node j, condom use ct,j as a function of number of active partners (k=1:QtVt[k,j]) and disease stage st,j (specifically, diagnosed status of HIV, with higher condom use if aware, i.e., st,j>2. For every infected node j, the infectiousness pt,j as a function of risk group rj and stage st,j of node j. Data assumptions for the above behavioral parameters are presented in Section S3 of the Appendix, specific to the US.

    This module controls the overall network dynamics of partnerships between nodes. It controls partnership formation and dissolution over time and age, modeled through a combination of the static (Ct) and dynamic (Vt) adjacency matrices, with Ct keeping track of all partnerships over the lifetime and Vt keeping track of only those that are active at that specific time. It controls the dynamics between age-group and risk-group mixing between partnerships. It also controls the transitioning of susceptible persons from the compartmental model St to the network Gt(N,E) as they become partners of newly infected persons. These network dynamics are modeled through the simulation of HIV-ECNA, which we discuss next.

    At the beginning of every time-step t+1, this module applies the HIV-ECNA for each node l that was newly infected at the end of the previous time-step t. As l was a susceptible person up until time-step t, it had links with only their infected partners, and thus their current degree (ˆdt,l) was less than or equal to their actual degree (dl). Therefore, the main functionality of this algorithm is to generate the contact network for each newly infected node l, i.e., determine that dlˆdt,l partners are yet to be assigned, determine who those persons are (including the degree-bin corresponding to their number of lifetime partners, current age-group, risk-group, and pseudo-geographic jurisdiction) and, if they are not already part of the graph Gt(N,E), add them to Gt+1(N,E) and remove them from St+1.

    The steps of the HIV-ECNA are as follows.

    For every newly infected node l,

    1. Determine the number of new partnerships (edges) to generate as actual degree minus current degree (i.e., dlˆdt,l). Note, these new partnerships would all be with susceptible persons.

    2. For each new susceptible partner node, say k, determine node features, specifically, its number of lifetime partners degree-bin dk, risk-group rk, and current age-group at,k, pseudo-geographic jurisdiction gk, and the partnership distribution matrix Lt,k.

    3. For each new edge (partnership) between l and k, say {l,k}, determine its edge features, specifically, the partnership initiation age {al,ak}, initiation time ¯t({l,k}), termination age {a_1,a_k}, and termination time t_({l,k}).

    4. Determine who each new partner k is by a uniform random draw from all who are eligible, i.e., all persons who are eligible have an equal chance of selection. All susceptible agents in the graph Gt(N,E) and susceptible non-agents in the compartmental model array St with node and edge features matching that in steps 2 and 3 above, can be eligible

    a. A susceptible agent, say j, is eligible if its current age at,jat,k, its risk group rj=rk, its actual degree djdk, its pseudo-geographic jurisdiction gj=gk, its degree minus current degree djˆdt,j>0, and the number of unassigned contacts in age-group a, corresponding to activation age ak is greater than zero, i.e., Lt,j[a,2]>0,aka.

    b. All susceptible non-agents in element St[ak,rk,dk,gk] are eligible.

    Therefore, the probability that the person picked is a susceptible agent is the number of eligible agents divided by the number of eligible agents and non-agents, and thus, as the network grows, the chance of selection from within the network increases.

    5. For each new partner k (determined in previous step) and partnership {l,k}, update their corresponding features in Gt(N,E) and St, i.e., update all elements of the computational structure described under Section 2.1, generating an updated graph Gt+1(N,E) and an updated array St+1

    a. If the new partner k is already a susceptible node in graph Gt(N,E) update the graph features corresponding to nodes l and k and add a new edge {l,k}.

    b. If the new partner k is an element of the compartmental array St, add a new nodek to the graph, update the graph features corresponding to nodes l and k, add a new edge {l,k}, and decrement the value of St[ak,rk,dk,gk] by 1 (thus transitioning the susceptible person from compartmental model to the network). Assign actual degree dk through random selection from the degree-bin dk and assign current age asat,k=ak(¯t({l,k})t), where t is the current time-step

    Below, we briefly discuss the methods for determining the features in steps 2 and 3 and provide further details in the Appendix Section S4.

    Determining degree-bin dk for each new partner k

    As described in Step 1, suppose it is determined that a new susceptible person (say k) should be added as the partner of the newly infected node l. A key aspect of the HIV-ECNA is determining the features of k, including its degree-bin (the bin corresponding to its number of lifetime partners) dk. For a node l, the degree-bin of a partner dk is not independent of its degree-bin dl because of degree correlations between node neighbors, a key feature of scale-free networks [42]. Generally speaking, this means that the degree-bin dk should be determined using some probability distribution that is dependent on the degree-bin of the infected neighbor dl, that is, degree dk is conditional on dl. Mathematically representing this, Pr(Dk=dk|Dl=dl)Pr(Dk=dk), and thus, dk cannot be directly drawn from its scale-free network probability mass function Pr(Dk=dk) 2dki=2dk1+1iλ but should be determined using a conditional probability distribution Pr(Dk=dk|Dl=dl); where Dk is the random variable for degree-bin of node k.

    While the literature presents an analytical method for estimation of Pr(Dk=dk|Dl=dl) for general static scale-free networks [42], our previous work showed that this method is not suitable in the context of simulating an epidemic in a dynamically evolving contagion network [31]. Specifically, in general static scale-free networks, the full network is available so the degree of all node neighbors are available, and thus Pr(Dk=dk|Dl=dl) is an expectation over all possible values of dl i.e., an average over "all" node neighbors. However, as we only simulate infected nodes and their immediate contacts in the network, in our context, only values corresponding to infected node neighbors are used. As it is more likely that nodes with higher degree get infected first, the value of Pr(Dk=dk|Dl=dl) when l=allnodeneighbors is different compared to when l=infectednodeneighbors [34]. And further, Pr(Dk=dk|Dl=dl) is likely to change over time as the epidemic spreads and the percent of the population that is infected changes. Our previous work developed a neural network model for the prediction of Pr(Dk=dk|Dl=dl) using as independent variables, dl, dk, the minimum degree of the network m, the percent of the population that is infected, and the scale-free network parameter λrl corresponding to risk group of node l (rl). We used a similar method here. Further details of the training of the neural network and data assumptions and sources for the US are presented in the Appendix Section S4.1.

    Determining risk group rk and pseudo-geographic jurisdiction gk for each new partner k

    For any node l, the risk group of partner k (rk) was determined based on a risk group mixing probability matrix, which is a square matrix of probabilities, with rows and columns representing heterosexual male, heterosexual female, and MSM, and each element, say (rl,rk), representing the probability a person in risk group rl partners with a person in risk group rk. The pseudo-geographic jurisdiction (gk) was determined based on a pseudo-geographic mixing probability matrix. Data assumptions and sources for risk group mixing and pseudo-geographic mixing are presented in the Appendix Section S6.5

    Determining the partnership distribution matrix Lt,kfor each new partner k

    Suppose dk is the actual degree (number of lifetime partners) of a newly added susceptible node k. We need to determine at what age of k will each partnership initiate. As these data are not directly available, we estimated it using Markov process-based simulation and optimization methods applied to data from behavioral surveys that reported the number of lifetime partners by persons' age at the time of survey. More specifically, we can describe the parameter of interest and its estimation process as follows. Suppose there is a matrix L of size A×D,with A the number of age-groups and D the number of degree-bins, and element L[a,d] representing the proportion of partnerships that initiate at age-group a for persons in degree-bin d, with each column of L adding to 1, for all d{1,2,D}. Then, for any node k with actual degree dkϵd, we can calculate the partnership distribution matrix, i.e., the number of partnerships that initiate at age a, as Lt,k[a,1]=L[a,d].dk.

    Direct data for L would be a longitudinal survey over the duration of life of an individual, where the individual reports the number of partnerships they initiated at every age point of their life. Such surveys, however, are unavailable. Therefore, we estimated L using survey data on the reported number of partners up to that time by persons of different age-groups (see Appendix Section 4.2). Note that, these survey data only represent the number of partners up to the current age of the surveyed individual. Thus, the degree-bin d each person would belong to is unknown as d represents the number of partners the person would eventually have over their full lifetime. The age at which each partnership initiated is also unknown. Therefore, we estimated L[a,d],a,d, by mathematically piecing together data from persons of different age-groups to mimic a longitudinal survey. We used a two-step process in this estimation of L. In step 1, we solved for the probabilities of initiating a new partnership when a person ages from age-group a1to a, by formulating it as the transition probability matrix of a Markov chain and solving for it using an optimization model, calibrating to the survey data. In step 2, we used the transition probability matrix from step 1 to simulate partnership changes over the lifetime of a person, starting from age-group a=1toage-group a=A, repeating for 10,000 people, grouping together all persons who end in the same degree-bin d at the last age-group A, and for each degree-bin d, calculating the average proportion of partnerships that initiated in age-group a, a. We discuss both steps in detail in the Appendix Section S4.2.1.

    Determining partnership initiation and termination age and time for edges {l,k} with each partner k

    Determining the expected partnership initiation age {al,ak}, initiation time ¯t({l,k}), termination age {a_1,a_k}, and termination time t_({l,k}) of each edge {l,k} between the newly infected nodel and each of its partner k is equivalent to assigning values of these variables to each of the k partners over the duration of life of the infected node l. The optimal solution are the values that maintain the probability distribution for age-mixing between partners, and maintain the partnership distribution matrices (Lt,land Lt,k,k) of the newly infected node l and each partner k, i.e., for any node i, the number of edges initiating at age-group a should be equal to Lt,i[a,1], estimated in the previous section. We formulated this problem as an optimization model and developed a heuristic solution algorithm. The details of the formulation and the heuristic solution algorithm are presented in Appendix Section S4.3. The current age of partner k (at,k) is then determined as at,k=ak(¯t({l,k})t), and at,k, such that, at,kat,kis set as its current age-group.

    The disease progression module in PATH 4.0 is similar to that in PATH 2.0 [6]. At every time-unit of the simulation, this module updates the individual-level demographic and disease dynamics for every HIV-infected person, including aging, HIV-related and natural mortality, HIV disease progression, and changes in diagnosis, care, and treatment status.

    Updating disease progression includes updating HIV-specific parameters such as CD4 cell count, viral load, opportunistic infection (OI) incidence, and onset of acquired immune deficiency syndrome (AIDS), using previously validated disease progression methods in PATH 2.0. These HIV-specific parameters are updated as a function of care and treatment status and ART regimen. For persons on ART treatment, it also simulates changes in ART regimen over time by simulating viral load rebound. We provide an overview of the disease progression methods and data assumptions and sources in the Appendix Section S5.1–S5.3.

    Updating changes in diagnosis, care, and treatment status includes generating events of testing, initiation of treatment, and dropping-out or re-entry into treatment by calibrating to match surveillance estimates for the distribution of PWH by care continuum stages (unaware, aware not in care, and on ART treatment with viral load suppression) corresponding to the population and year being simulated. The details of the calibration method are presented in the Appendix Section S5.4.

    In application of the proposed ABENM to HIV, we first generated an initial population that is representative of people living with HIV (PWH) in the United States in 2006. We achieved this through a methodology that includes two sequential dry runs of the simulation (the concepts of dry runs for model initialization are discussed in more detail in the Appendix S6.2). The first dry run initializes a network of HIV-infected persons and immediate contacts that is replicative of the contact network among PWH, i.e., matching the correlations in degree, age, and risk group between partners. The second dry run initializes the model with epidemic and demographic features, such as disease stage, care continuum stage, age, and risk group, that are representative of the distributions of these features in PWH in the United States in 2006. To create a representation of HIV in the United States, data were taken from several studies, demographical, sexual behavioral, clinical, and HIV care and treatment behavioral studies, originating from data collected as part of multiple large national surveillance and survey systems in the United States, and other small studies. The surveillance and survey systems include the National HIV Surveillance System (NHSS), the Medical Monitoring Project (MMP), the HIV Outpatient Study (HOPS), the American Community Survey (ACS), the National HIV Behavioral Surveillance (NHBS), the National Survey for Family Growth (NSFG), and the National Survey for Sexual Health and Behavior (NSSHB) [46,47,48,49,50,51,52]. The specific data sources are detailed in the Appendix, in S2 and S6 for demographics, in S3 and S4 for sexual behavior, and in S5 for clinical and HIV care and treatment behaviors.

    After initialization of the model to 2006, we ran the simulation, from 2006 to 2017 in monthly-time steps, by calibrating to the care continuum distribution of PWH in the year being simulated, taking estimates from NHSS. We present the data assumptions and sources in the Appendix Section S6. An overview of the steps of the full simulation model, including the dry runs, is presented in Appendix Section S7.

    To validate the epidemic predictions from the model, for the period 2010 to 2017, we compared simulated annual estimates of relevant HIV parameters, including total prevalence, diagnosed prevalence, annual incidence, and annual diagnoses, distributed by risk group and age, with surveillance data [53,54,55,56,57,58,59,60]. We define total prevalence as the number of people living with diagnosed or undiagnosed HIV, diagnosed prevalence as the number of people living with diagnosed HIV, annual incidence as the number of new infections in that year, and annual diagnoses as the number of new diagnoses in that year. Risk groups include heterosexual females, heterosexual males, and men who have sex with men (MSM) infected with HIV through sexual transmission. Specifically, we compared model and surveillance estimates for the following features:

    1.Distribution of overall disease burden by risk group (Figure 2), calculating for each risk group,

    Figure 2.  Distributions of total prevalence, diagnosed prevalence, annual incidence, and annual diagnoses by risk group; total prevalence = the number of people living with diagnosed or undiagnosed HIV, diagnosed prevalence = the number of people with diagnosed HIV, annual incidence = the number of new infections in that year, and annual diagnosis = the number of new diagnosis in that year.

    a. total prevalence in risk group divided by overall total prevalence

    b. diagnosed prevalence in risk group divided by overall diagnosed prevalence

    c. annual incidence in risk group divided by overall annual incidence

    d. annual diagnoses in risk group divided by overall annual diagnoses

    2. Measures of epidemic growth within each risk group (Figure 3) as,

    Figure 3.  Comparing model estimates with surveillance for annual incidence by total prevalence and annual diagnosis by diagnosed prevalence for each risk group; total prevalence = the number of people living with diagnosed or undiagnosed HIV, diagnosed prevalence = the number of people with diagnosed HIV, annual incidence = the number of new infections in that year, and annual diagnosis = the number of new diagnosis in that year.

    a. annual incidence in risk group divided by total prevalence in risk group

    b. annual diagnoses in risk group divided by diagnosed prevalence in risk group

    3.For each risk-group, measures of epidemic growth within each age-group and distribution of disease burden by age (heterosexual females in Figure 4, heterosexual males in Figure 5, and MSM in Figure 6) as,

    Figure 4.  Annual incidence by total prevalence, annual diagnoses by diagnosed prevalence, and distributions of annual incidence and annual diagnoses by age among heterosexual females. Note: charts with no surveillance points are those for which data are available for MSM but not heterosexuals, but we are reporting simulated estimates for completeness.
    Figure 5.  Annual incidence by total prevalence, annual diagnoses by diagnosed prevalence, and distributions of annual incidence and annual diagnosis by age among heterosexual males. Note: charts with no surveillance points are those for which data are available for MSM but not heterosexuals, but we are reporting simulated estimates for completeness.
    Figure 6.  Annual incidence by total prevalence, annual diagnoses by diagnosed prevalence, and distributions of annual incidence and annual diagnosis by age among MSM (men who have sex with men).

    a. annual incidence in age-group divided by total prevalence in age-group

    b. annual diagnoses in age-group divided by diagnosed prevalence in age-group

    c. annual incidence in age-group divided by total annual incidence in risk group

    d. annual diagnoses in age-group divided by total annual diagnoses in risk group

    We generated 100 runs of the simulation and present box plots marking the minimum, 1st quartile, 2nd quartile, 3rd quartile, and maximum, of the 100 runs for each of the above features along with the corresponding values calculated using surveillance estimates [36].

    The surveillance estimates fall within the range of model estimates in most cases. In a few cases, such as in the ratio of new infections to total prevalence for MSM (Figure 3) and distribution of diagnosed cases by age for MSM (Figure 6), the surveillance estimates were outside the range of model estimates in some years. However, we believe the overall results are acceptable considering that these metrics are an outcome of multiple interacting events, including those related to sexual behavior, HIV-related care behavior, and disease progression, each event simulated at the individual-level using age, and risk group specific parameters. Specifically, sexual behavioral events include partnership formation and dissolution over time varying by age and risk group, age-group and risk-group mixing between partnerships, and changes in sexual exposures and condom use varying by age, risk group, and number of partners. HIV-related care events include HIV testing and diagnosis, linkage to care and treatment, and dropping-out of and re-entry into care and treatment. Disease progression events include changes in CD4 cell counts, viral load, OI and AIDS incidence, and mortality, each influenced by time of diagnosis and initiation of treatment, and changes in treatment regimen over time. Therefore, considering that the resulting overall population-level epidemic features presented in Figures 2 through 6 are an outcome of interactions between the above multiple events modeled at the individual-level, and that the model results are close to the surveillance estimates on most of these epidemic features (features not used in model calibration), we believe the model provides an acceptable replication of the epidemic. Further, as the surveillance estimates for incidence and prevalence are nationally agammaegated average estimates which also have a range of uncertainty associated with them, we did not want to risk overfitting.

    To test the ability of PATH 4.0 to generate clusters similar to those detected through analysis of nucleotide sequence data reported to NHSS, we compared clusters extracted from the PATH 4.0 transmission network to those identified through molecular cluster analysis of NHSS data. During 2013–2017, 27 HIV surveillance jurisdictions in the United States reported nucleotide sequences from routine clinical drug resistance tests to the NHSS; these jurisdictions reported 70% of US HIV diagnoses in 2015, To identify molecular clusters in NHSS data, we analyzed data reported through December 2017 for HIV infections diagnosed during January 1, 2015–December 31, 2017. Clusters were identified using methods described previously [19]: in brief, we included partial pol (protease and reverse transcriptase) sequences that were ≥500 nucleotides in length and removed sequences identified as potential contaminant. Sequences were analyzed using a local installation of HIV-TRACE (HIV TRAnsmission Cluster Engine [61], www.hivtrace.org) following methods previously described [19]. Clusters were defined as connected components using a genetic distance threshold of 0.5%. Sequence data were available for 48% of persons with HIV infection diagnosed during 2015–2017 in the participating jurisdictions.

    To replicate molecular cluster detection in the simulation, we applied a cluster generation algorithm (previously developed [43]) at the end of each simulation run, corresponding to the end of the year 2017, for identification of clusters among persons with HIV diagnosed between 2015 and 2017 [19]. The algorithm identifies clusters using time as a proxy for genetic distance. Specifically, we approximated genetic distance between any two nodes as the sum of the difference between each node's time of diagnosis and time of infection of the node that commonly connects them in a transmission network. Assuming that the viral sequence will change approximately 1%, on average, over a 10-year interval, we replicated a genetic distance threshold of 0.5% in the model by defining clusters as any group of nodes connected in a single component within a 60-month time interval. Further, we classified clusters that include at least one person with HIV diagnosed in the most recent year (2017) as priority clusters. Because the completeness of sequence data (the proportion of all diagnosed infections for which a sequence is available) will impact cluster detection, we replicated incomplete data in the simulation: to replicate 48% sequence completeness in NHSS. For each run, we randomly selected 52% of infections diagnosed during 2015 to 2017 and excluded them in the generation of clusters.

    We compared simulated clusters with molecular clusters, including the distribution of diagnosed cases by cluster type and cluster size and the distribution of number of clusters of varying size (Figure 7). We see in the figure that NHSS results are within the range of model results for each of these metrics. The proportion of persons with diagnosed infection that are part of a cluster as well as the distribution of persons by size of cluster in the model is consistent with NHSS results (plot on top in Figure 7). The distribution of clusters by size in the model is also consistent with the NHSS results (bottom plot in Figure 7). These results thus validate the methods in the model leading to the formation of the contact network structure and transmissions. Further, the model estimates also match well for the proportion of diagnosed cases in priority clusters, i.e., clusters with higher number of cases diagnosed in recent years, suggesting that the model generated events of diagnosis and care are similar to that observed in surveillance data (plot on top in Figure 7).

    Figure 7.  Distributions of number of persons diagnosed in years 2015, 2016, or 2017, by cluster type and size (top) and number of clusters by cluster size (bottom).

    This paper presents a newly developed simulation method, ABENM with ECNA, for simulation of infectious disease epidemic projections for diseases with low prevalence, i.e., diseases with a small ratio of the size of infected population to size of susceptible population. We used ABENM for developing the PATH 4.0, a model for simulating HIV epidemic projections, and applied it for simulating HIV in the United States. PATH 4.0 is a comprehensive stochastic simulation model that simulates multiple interacting HIV-related events at the individual-level, including those related to sexual behavior, HIV-related care behavior, and disease progression.

    Model estimates from PATH 4.0 on multiple surveillance outcomes related to both HIV disease projections and transmission network dynamics were comparable with surveillance data, thus validating the newly proposed ABENM simulation method and ECNA for network generation. Further, the fact that the clusters generated using the model compare well with those identified using HIV molecular sequence data supports the potential use of this model for analysis of cluster-based interventions.

    However, there are limitations to this model. PATH 4.0 only simulated sexual transmissions of HIV, and thus, clusters formed through needle sharing contacts are not included in our results. About 9% of new HIV diagnoses are among PWID [62]. Future work could expand PATH 4.0 to include PWID for analyses of interventions specific to mode of transmission. We did not simulate the use of pre-exposure prophylaxis (PrEP) among uninfected persons, which can reduce the transmission rate. PrEP coverage is a recently introduced metric in NHSS reporting, available for year starting 2017 (about 12.5% in 2017) [63] and thus, analyses of clusters for future years should consider adding PrEP into the simulation. This can be done through adding a PrEP status to every susceptible node (as those who are a contact of an infected node are agents in the simulation) and incorporating the effectiveness of that into the Bernoulli transmission equation. We only modeled hypothetical jurisdictions to generate network dynamics, specifically to generate sub-networks with small amounts of mixing across sub-networks, as it is more likely that partnerships form between people within a certain geographic proximity. The jurisdictions are hypothetical in that we did not simulate jurisdiction-specific demographics or epidemic features. Despite these limitations, we believe the model presented here can help study questions specific to inform HIV cluster-based interventions, and the methods presented here could be utilized for construction of more sub-group specific models, such as by geographical jurisdiction or mode of transmission.

    In summary, in this paper, we present the newly developed ABENM simulation method for simulation of diseases with low prevalence and its application to the development of the PATH 4.0 model. We propose PATH 4.0 as a tool for studying questions related to understanding the dynamics of cluster growth and its eventual application to identification of suitable cluster detection and intervention strategies. Cluster detection is a key component of the U.S. Ending the HIV Epidemic national strategic plan. While surveillance is critical for the detection of clusters, a model in conjunction with surveillance can be used to refine cluster detection methods, better understand factors associated with cluster growth, and assess interventions to inform effective response strategies for prevention. As surveillance data are only available for cases that are diagnosed and reported, a model is a critical tool for understanding the true size of the clusters and assess key questions, such as the relative contributions of clusters to onward transmissions. As per our knowledge, this is the first model to have successfully replicated cluster features similar to that observed in molecular analysis of NHSS data. Thus, PATH 4.0 serves as a novel tool for assessing intervention strategies for cluster detection and response. Moreover, the fact that this model more closely approximates true HIV transmission dynamics, including clusters of transmission occurring as a result of the scale-free network and nonrandom mixing, indicates that it would be a stronger mechanism, than an agent-based model alone, for making inferences about the benefits of various approaches to deploying HIV prevention interventions even in non-cluster response settings.

    The successful replication of HIV disease patterns and cluster formation supports exploring the use of ABENM and ECNA in other areas. ABENM is specifically suited for diseases where simulation of contact networks is an essential component for accurate epidemic projections, and where the diseases have low prevalence that makes current network modeling methods challenging to use. In addition to HIV, other infectious diseases that fall under this category include tuberculosis, and chronic hepatitis B and C, or reemerging disease outbreaks such as SARS (severe acute respiratory syndrome), MERS (Middle East respiratory syndrome), and Ebola disease. In these cases, infection spreads through close contact between people, making the modeling of contact networks essential. These diseases also have high mortality and economic burdens, such that, in the case of remerging disease outbreaks that spread quickly, halting the disease in the very early stages of the outbreak (i.e., when the prevalence is low) is key for effective control.

    The findings and conclusions in this article are those of the authors and do not necessarily represent the views of the Centers for Disease Control and Prevention.

    SS and CG were funded by a grant from the National Institute of Allergy and Infectious Diseases of the National Institutes of Health under Award Number R01AI127236. The funding agreement ensured the authors' independence in designing the study, interpreting the data, writing, and publishing the report.

    We would like to acknowledge Dr. Timothy Green from the Centers for Disease Control and Prevention for his inputs in manuscript preparation.

    All authors declare no conflicts of interest in this paper.



    [1] D. R. Jutagir, B. B. Blomberg, C. S. Carver, et al., Social well-being is associated with less pro-inflammatory and pro-metastatic leukocyte gene expression in women after surgery for breast cancer, Breast Cancer Res. Treat., 165 (2017), 169–180.
    [2] S. Katkuri and M. Gorantla, Awareness about breast cancer among women aged 15 years and above in urban slums: a cross sectional study, Int. J. Community Med. Public Health, 5 (2018), 929–932.
    [3] A. Pawlik, M. Slomi´ nska-Wojewódzka and A. Herman-Antosiewicz, Sensitization of estrogen receptor-positive breast cancer cell lines to 4-hydroxytamoxifen by isothiocyanates present in cruciferous plants, Eur. J. Nutr., 55 (2016), 1165–1180.
    [4] D. L. Holliday and V. Speirs, Choosing the right cell line for breast cancer research, Breast Cancer Res., 13 (2011), 215–215.
    [5] R. L. Sutherland, R. E. Hall and I. W. Taylor, Cell proliferation kinetics of MCF-7 human mammary carcinoma cells in culture and effects of tamoxifen on exponentially growing and plateau-phase cells, Cancer Res., 43 (1983), 3998–4006.
    [6] B. S. Katzenellenbogen, K. L. Kendra, M. J. Norman, et al., Proliferation, hormonal responsiveness, and estrogen receptor content of MCF-7 human breast cancer cells grown in the short-term and long-term absence of estrogens, Cancer Res., 47(1987), 4355–4360.
    [7] A. Maton, Human biology and health, 1st edition, Prentice Hall, New Jersey, 1997.
    [8] L. V. Rao, B. A. Ekberg, D. Connor, et al., Evaluation of a new point of care automated complete blood count (CBC) analyzer in various clinical settings, Clin. Chim. Acta., 389 (2008), 120–125.
    [9] S. Bernard, E. Abdelsamad, P. Johnson, et al., Pediatric leukemia: diagnosis to treatment a review, J. Cancer Clin. Trials, 2(2017), 131.
    [10] A. Shankar, J. J. Wang, E. Rochtchina, et al., Association between circulating white blood cell count and cancer mortality: a population-based cohort study, Arch. Intern. Med., 166 (2006), 188–194.
    [11] K. Kim, J. Lee, N. J. Heo, et al., Differential white blood cell count and all-cause mortality in the Korean elderly, Exp. Gerontol., 48 (2013), 103–108.
    [12] C. Ruggiero, E. J. Metter, A. Cherubini, et al., White blood cell count and mortality in the Baltimore Longitudinal Study of Aging, J. Am. Coll. Cardiol., 49 (2007), 1841–1850.
    [13] D. S. Bell and J. H. O'Keefe, White cell count, mortality, and metabolic syndrome in the Baltimore longitudinal study of aging, J. Am. Coll. Cardiol., 50(2007), 1810.
    [14] G. D. Friedman and B. H. Fireman, The leukocyte count and cancer mortality, Am. J. Epidemiol., 133 (1991), 376–380.
    [15] M. H. Andersen, D. Schrama, P. thor Straten, et al., Cytotoxic T cells, J. Invest. Dermatol., 126 (2006), 32–41.
    [16] J. Folkman and R. Kalluri, Cancer without disease, Nature, 427 (2004), 787.
    [17] T. Fehm, V. Mueller, R. Marches, et al., Tumor cell dormancy: implications for the biology and treatment of breast cancer, Apmis, 116 (2008), 742–753.
    [18] N. Almog, Molecular mechanisms underlying tumor dormancy, Cancer Lett., 294 (2010), 139–146.
    [19] A. Friedman, Cancer as multifaceted disease, Math. Model. Nat. Pheno., 7(2012), 3–28.
    [20] R. Eftimie, J. L. Bramson and D. J. Earn, Interactions between the immune system and cancer: a brief review of non-spatial mathematical models, Bull. Math. Biol., 73 (2011), 2–32.
    [21] S. Banerjee and R. R. Sarkar, Delay-induced model for tumor–immune interaction and control of malignant tumor growth, Biosystems, 91 (2008), 268–288.
    [22] H. Moore and N. K. Li, A mathematical model for chronic myelogenous leukemia (CML) and T cell interaction, J. Theor. Biol., 227 (2004), 513–523.
    [23] L. Anderson, S. Jang and J. Yu, Qualitative behavior of systems of tumor-CD4+-cytokine interactions with treatments, Math. Method. Appl. Sci., 38 (2015), 4330–4344.
    [24] A. d'Onofrio, Metamodeling tumor–immune system interaction, tumor evasion and immunotherapy, Math. Comput. Model., 47 (2008), 614–637.
    [25] A. Khar, Mechanisms involved in natural killer cell mediated target cell death leading to spontaneous tumour regression, J. Biosci., 22 (1997), 23–31.
    [26] T. Boon and P. van der Bruggen, Human tumor antigens recognized by T lymphocytes, J. Exp. Med., 183 (1996), 725–729.
    [27] D. Kirschner and J. C. Panetta, Modeling immunotherapy of the tumor–immune interaction, J. Math. Biol., 37 (1998), 235–252.
    [28] L. G. de Pillis, W. Gu and A. E. Radunskaya, Mixed immunotherapy and chemotherapy of tumors: modeling, applications and biological interpretations, J. Theor. Biol., 238 (2006), 841–862.
    [29] H. P. de Vladar and J. A. González, Dynamic response of cancer under the influence of immunological activity and therapy, J. Theor. Biol., 227 (2004), 335–348.
    [30] U. Fory´ s, J. Waniewski and P. Zhivkov, Anti-tumor immunity and tumor anti-immunity in a mathematical model of tumor immunotherapy, J. Biol. Syst., 14 (2006), 13–30.
    [31] R. W. De Boer, J. M. Karemaker and J. Strackee, Relationships between short-term blood-pressure fluctuations and heart-rate variability in resting subjects I: a spectral analysis approach, Med. Biol. Eng. Comput., 23 (1985), 352–358.
    [32] A. Cappuccio, M. Elishmereni and Z. Agur, Cancer immunotherapy by interleukin-21: potential treatment strategies evaluated in a mathematical model, Cancer Res., 66 (2006), 7293–7300.
    [33] N. Kronik, Y. Kogan, V. Vainstein, et al., Improving alloreactive CTL immunotherapy for malignant gliomas using a simulation model of their interactive dynamics, Cancer Immunol. Immunother., 57 (2008), 425–439.
    [34] A. M. Jarrett, J. M. Bloom, W. Godfrey, et al., Mathematical modelling of trastuzumab-induced immune response in an in vivo murine model of HER2+ breast cancer, Math. Med. Biol., (2018), dqy014.
    [35] K. Annan, M. Nagel and H. A. Brock, A mathematical model of breast cancer and mediated immune system interactions, J. Math. Syst. Sci., 2(2012), 430–446.
    [36] R. Roe-Dale, D. Isaacson and M. Kupferschmid, A mathematical model of breast cancer treatment with CMF and doxorubicin, Bull. Math. Biol., 73 (2011), 585–608.
    [37] B. Ribba, N. H. Holford, P. Magni, et al., A review of mixed-effects models of tumor growth and effects of anticancer drug treatment used in population analysis, CPT Pharmacometrics Syst. Pharmacol., 3(2014), 1–10.
    [38] R. Bhat and C. Watzl, Serial killing of tumor cells by human natural killer cells–enhancement by therapeutic antibodies, PloS One, 2 (2007), e326.
    [39] T. Sutlu and E. Alici, Natural killer cell-based immunotherapy in cancer: current insights and future prospects, J. Intern. Med., 266 (2009), 154–181.
    [40] T. R. Stravitz, T. Lisman, V. A. Luketic, et al., Minimal effects of acute liver injury/acute liver failure on hemostasis as assessed by thromboelastography, J. Hepatol., 56 (2012), 129–136.
    [41] Y. Zhang, D. L. Wallace, C. M. De Lara, et al., In vivo kinetics of human natural killer cells: the effects of ageing and acute and chronic viral infection, Immunotherapy, 121(2007), 258–265.
    [42] P. Wilding, L. J. Kricka, J. Cheng, et al., Integrated cell isolation and polymerase chain reaction analysis using silicon microfilter chambers, Anal. Biochem., 257(1998), 95–100.
    [43] V. Pascal, N. Schleinitz, C. Brunet, et al., Comparative analysis of NK cell subset distribution in normal and lymphoproliferative disease of granular lymphocyte conditions, Eur. J. Immunol., 34(2004), 2930–2940.
    [44] L. de Pillis, T. Caldwell, E. Sarapata, et al., Mathematical modeling of the regulatory T cell effects on renal cell carcinoma treatment, Discrete Continuous Dyn. Syst. Ser. B, 18(2013), 915–943.
    [45] T. D. To, A. T. T. Truong, A. T. Nguyen, et al., Filtration of circulating tumour cells MCF-7 in whole blood using non-modified and modified silicon nitride microsieves, Int. J. Nanotechnol., 15(2018), 39–52.
    [46] C. Chen, Y. Chen, D. Yao, et al., Centrifugalfilter device for detection of rare cells with immuno-binding, IEEE T. Nanobiosci., 14(2015), 864–869.
    [47] P. Dua, V. Dua and E. N. Pistikopoulos, Optimal delivery of chemotherapeutic agents in cancer, Comput. Chem. Eng., 32(2008), 99–107.
    [48] A. G. López, J. M. Seoane and M. A. Sanjuán, A validated mathematical model of tumor growth including tumor–host interaction, cell-mediated immune response and chemotherap, Bull. Math. Biol., 76(2014), 2884–2906.
    [49] K. Liao, X. Bai and A. Friedman, The role of CD200–CD200R in tumor immune evasion, J. Theor. Biol., 328(2013), 65–76.
    [50] M. C. Martins, A. M. A. Rocha, M. F. P. Costa, et al., Comparing immune-tumor growth models with drug therapy using optimal control, AIP Conf. Proc., 1738(2016), 300005.
    [51] M. Fernandez, M. Zhou and L. Soto-Ortiz, A computational assessment of the robustness of cancer treatments with respect to immune response strength, tumor size and resistance, Int. J. Tumor Ther., 7(2018), 1–26.
    [52] D. F. Tough and J. Sprent, Life span of naive and memory T cells, Stem Cells, 13(1995), 242–249.
    [53] C. M. Rollings, L. V. Sinclair, H. J. M. Brady, et al., Interleukin-2 shapes the cytotoxic T cell proteome and immune environment–sensing programs, Sci. Signal., 11(2018), eaap8112.
    [54] E. M. Janssen, E. E. Lemmens, T. Wolfe, et al., CD4+ T cells are required for secondary expansion and memory in CD8+ T lymphocytes, Nature, 421(2003), 852.
    [55] I. Gruber, N. Landenberger, A. Staebler, et al., Relationship between circulating tumor cells and peripheral T-cells in patients with primary breast cancer, Anticancer Res., 33(2013), 2233–2238.
    [56] D. Homann, L. Teyton and M. B. Oldstone, Differential regulation of antiviral T-cell immunity results in stable CD8+ but declining CD4+ T-cell memory, Nat. Med., 7(2001), 913–919.
    [57] R. J. De Boer, D. Homann and A. S. Perelson, Different dynamics of CD4+ and CD8+ T cell responses during and after acute lymphocytic choriomeningitis virus infection, J. Immunol., 171(2003), 3928–3935.
    [58] G. T. Skalski and J. F. Gilliam, Functional responses with predator interference: viable alternatives to the Holling type II model, Ecology, 82(2001), 3083–3092.
    [59] H. Nawata, M. T. Chong, D. Bronzert, et al., Estradiol-independent growth of a subline of MCF-7 human breast cancer cells in culture, J. Biol. Chem., 256(1981), 6895–6902.
    [60] R. Clarke, N. Brünner, B. S. Katzenellenbogen, et al., Progression of human breast cancer cells from hormone-dependent to hormone-independent growth both in vitro and in vivo, Proc. Natl. Acad. Sci. USA, 86(1989), 3649–3653.
    [61] N. T. Telang, G. Li, M. Katdare, et al., The nutritional herb Epimedium grandiflorum inhibits the growth in a model for the Luminal A molecular subtype of breast cancer, Oncol. Lett., 13 (2017), 2477–2482.
    [62] T. A. Caragine, M. Imai, A. B. Frey, et al., Expression of rat complement control protein Crry on tumor cells inhibits rat natural killer cell–mediated cytotoxicity, Blood, 100 (2002), 3304–3310.
    [63] M. R. Müller, F. Grünebach, A. Nencioni, et al., Transfection of dendritic cells with RNA induces CD4-and CD8-mediated T cell immunity against breast carcinomas and reveals the immunodominance of presented T cell epitopes, J. Immunol., 170 (2003), 5892–5896.
    [64] J. Chen, E. Hui, T. Ip, et al., Dietary flaxseed enhances the inhibitory effect of tamoxifen on the growth of estrogen-dependent human breast cancer (mcf-7) in nude mice, Clin. Cancer Res., 10(2004), 7703–7711.
    [65] P. V. Sivakumar, R. Garcia, K. S. Waggie, et al., Comparison of vascular leak syndrome in mice treated with IL21 or IL2, Comparative Med., 63 (2013), 13–21.
    [66] C.Wu, T.Motosha, H.A.Abdel-Rahman, etal., Freeandprotein-boundplasmaestradiol-17during the menstrual cycle, J. Clin. Endocrinol. Metab., 43(1976), 436–445.
    [67] E. Nikolopoulou, L. R. Johnson, D. Harris, et al., Tumour-immune dynamics with an immune checkpoint inhibitor, Lett. Biomath., 5(2018), S137–S159.
    [68] P. Vacca, E. Munari, N. Tumino, et al., Human natural killer cells and other innate lymphoid cells in cancer: friends or foes? Immunol. Lett., 201(2018), 14–19.
    [69] A. Cerwenka, J. Kopitz, P. Schirmacher, et al., HMGB1: the metabolic weapon in the arsenal of NK cells, Mol. Cell. Oncol., 3(2016), e1175538.
    [70] I. J. Fidler, Metastasis: quantitative analysis of distribution and fate of tumor emboli labeled with 125i-5-iodo-2'-deoxyuridine, J. Natl. Cancer Inst., 4(1970), 773–782.
    [71] G. G. Page and S. Ben-Eliyahu, A role for NK cells in greater susceptibility of young rats to metastatic formation, Dev. Comp. Immunol., 23(1999), 87–96.
    [72] O. E. Franco, A. K. Shaw, D. W. Strand, et al., Cancer associated fibroblasts in cancer pathogenesis, Semin. Cell Dev. Bio., 21(2010), 33–39.
    [73] G. P. Dunn, A. T. Bruce, H. Ikeda, et al., Cancer immunoediting: from immunosurveillance to tumor escape, Nat. Immunol., 3(2002), 991.
    [74] D. Mittal, M. M. Gubin, R. D. Schreiber, et al., New insights into cancer immunoediting and its three component phases-elimination, equilibrium and escape, Curr. Opin. Immunol., 27(2014), 16–25.
    [75] J. Nowak, P. Juszczynski and K. Warzocha, The role of major histocompatibility complex polymorphisms in the incidence and outcome of non-Hodgkin lymphoma, Curr. Immunol. Rev., 5(2009), 300–310.
    [76] J. G. B. Alvarez, M. González-Cao, N. Karachaliou, et al., Advances in immunotherapy for treatment of lung cancer, Cancer Biol. Med., 12(2015), 209–222.
    [77] M. E. Dudley and S. A. Rosenberg, Adoptive-cell-transfer therapy for the treatment of patients with cancer, Nat. Rev. Cancer, 3(2003), 666–675.
    [78] M. Su, C. Huang and A. Dai, Immune checkpoint inhibitors: therapeutic tools for breast cancer, Asian Pac. J. Cancer Prev., 17 (2016), 905–910.
    [79] M. Ebbo, L. Gérard, S. Carpentier, et al., Low circulating natural killer cell counts are associated with severe disease in patients with common variable immunodeficiency, EBioMedicine., 6(2016), 222–230.
    [80] S. H. Jee, J. Y. Park, H. Kim, et al., White blood cell count and risk for all-cause, cardiovascular, and cancer mortality in a cohort of Koreans, Am. J. Epidemiol., 162 (2005), 1062–1069.
    [81] W. B. Kannel, K. Anderson and T. W. Wilson, White blood cell count and cardiovascular disease: insights from the Framingham Study, Jama, 267 (1992), 1253–1256.
    [82] K. L. Margolis, J. E. Manson, P. Greenland, et al., Leukocyte count as a predictor of cardiovascular events and mortality in postmenopausal women: the Women's Health Initiative Observational Study, Arch. Intern. Med., 165(2005), 500–508.
    [83] B. K. Duffy, H. S. Gurm, V. Rajagopal, et al., Usefulness of an elevated neutrophil to lymphocyte ratio in predicting long-term mortality after percutaneous coronary intervention, Am. J. Cardiol., 97 (2006), 993–996.
    [84] B. D. Horne, J. L. Anderson, J. M. John, et al., Which white blood cell subtypes predict increased cardiovascular risk? J. Am. Coll. Cardiol., 45(2005), 1638–1643.
    [85] R. N. O. Cobucci, H. Saconato, P. H. Lima, et al., Comparative incidence of cancer in HIV-AIDS patients and transplant recipients, Cancer Epidemiol., 36(2012), e69–e73.
    [86] C. Bodelon, M. M. Madeleine, L. F. Voigt, et al., Is the incidence of invasive vulvar cancer increasing in the United States? Cancer Causes Control, 20(2009), 1779–1782.
    [87] M. R. Shurin, Cancer as an immune-mediated disease, Immunotargets Ther., 1 (2012), 1–6.
    [88] J. S. Lawrence, Leukopenia: its mechanism and therapy, J. Chronic. Dis., 6(1957), 351–364.
    [89] P. Venigalla, B. Motwani, A. Nallari, et al., A patient on hydroxyurea for sickle cell disease who developed an opportunistic infection, Blood, 100(2002), 363–364.
    [90] M. Iwamuro, S. Tanaka, A. Bessho, et al., Two cases of primary small cell carcinoma of the stomach, Acta. Med. Okayama, 63(2009), 293–298.
    [91] A. O. O. Chan, I. O. L. Ng, C. M. Lam, et al., Cholestatic jaundice caused by sequential carbimazole and propylthiouracil treatment for thyrotoxicosis, Hong Kong Med. J., 9(2003), 377–380.
    [92] J. H. Goodchild and M. Glick, A different approach to medical risk assessment, Endod. Topics, 4(2003), 1–8.
    [93] S. E. Hankinson, Endogenous hormones and risk of breast cancer in postmenopausal women, Breast Dis., 24(2006), 3–15.
    [94] R. Kaaks, S. Rinaldi, T. J. Key, et al., Postmenopausal serum androgens, oestrogens and breast cancer risk: the European prospective investigation into cancer and nutrition, Endocr. Relat. Cancer, 12(2005), 1071–1082.
    [95] S. A. Missmer, A. H. Eliassen, R. L. Barbieri, et al., Endogenous estrogen, androgen, and progesterone concentrations and breast cancer risk among postmenopausal women, J. Natl. Cancer Inst., 96(2004), 1856–1865.
    [96] A. A. Arslan, R. E. Shore, Y. Afanasyeva, et al., Circulating estrogen metabolites and risk for breast cancer in premenopausal women, Cancer Epidemiol. Biomarkers Prev., 18(2009), 2273–2279.
    [97] S. B. Brown and S. E. Hankinson, Endogenous estrogens and the risk of breast, endometrial, and ovarian cancers, Steroids, 99(2015), 8–10.
    [98] L. C. Houghton, D. Ganmaa, P. S. Rosenberg, et al., Associations of breast cancer risk factors with premenopausal sex hormones in women with very low breast cancer risk, Int. J. Environ. Res. Public Health, 13(2016), 1066.
    [99] R. Kaaks, K. Tikk, D. Sookthai, et al., Premenopausal serum sex hormone levels in relation to breast cancer risk, overall and by hormone receptor status-results from the EPIC cohortk, Int. J. cancer, 134(2014), 1947–1957.
    [100] A. Diefenbach, E. R. Jensen, A. M. Jamieson, et al., Rae1 and H60 ligands of the NKG2D receptor stimulate tumour immunity, Nature, 413(2001), 165.
    [101] A. Iannello and D. H. Raulet, Cold Spring Harbor symposia on quantitative biology, 1st edition, Cold Spring Harbor Laboratory Press, New York, 2013.
    [102] M. B. Pampena and E. M. Levy, Natural killer cells as helper cells in dendritic cell cancer vaccines, Front. Immunol., 6 (2015), 1–8.
    [103] E. Vivier, S. Ugolini, D. Blaise, et al., Targeting natural killer cells and natural killer T cells in cancer, Nat. Rev. Immunol., 12(2012), 239–252.
    [104] G. Liu, X. Fan, Y. Cai, et al., Efficacy of dendritic cell-based immunotherapy produced from cord blood in vitro and in a humanized NSG mouse cancer model, Immunotherapy, 11(2019), 599–616.
    [105] M. Schnekenburger, M. Dicato and M. F. Diederich, Anticancer potential of naturally occurring immunoepigenetic modulators: A promising avenue? Cancer, 125(2019), 1612–1628.
    [106] X. Feng, L. Lu, K. Wang, et al., Low expression of CD80 predicts for poor prognosis in patients with gastric adenocarcinoma, Future Oncol., 15 (2019), 473–483.
    [107] X. Lai and A. Friedman, Combination therapy of cancer with cancer vaccine and immune checkpoint inhibitors: a mathematical model, PloS One, 12 (2017), e0178479.
  • This article has been cited by:

    1. Chaitra Gopalappa, Hari Balasubramanian, Peter J. Haas, A new mixed agent-based network and compartmental simulation framework for joint modeling of related infectious diseases- application to sexually transmitted infections, 2023, 8, 24680427, 84, 10.1016/j.idm.2022.12.003
    2. Malú Grave, Alex Viguerie, Gabriel F. Barros, Alessandro Reali, Roberto F.S. Andrade, Alvaro L.G.A. Coutinho, Modeling nonlocal behavior in epidemics via a reaction–diffusion system incorporating population movement along a network, 2022, 401, 00457825, 115541, 10.1016/j.cma.2022.115541
    3. Matthew Eden, Rebecca Castonguay, Buyannemekh Munkhbat, Hari Balasubramanian, Chaitra Gopalappa, Agent-based evolving network modeling: a new simulation method for modeling low prevalence infectious diseases, 2021, 24, 1386-9620, 623, 10.1007/s10729-021-09558-0
    4. Anne Marie France, Nivedha Panneer, Paul G. Farnham, Alexandra M. Oster, Alex Viguerie, Chaitra Gopalappa, Simulation of Full HIV Cluster Networks in a Nationally Representative Model Indicates Intervention Opportunities, 2024, 95, 1525-4135, 355, 10.1097/QAI.0000000000003367
    5. Xinmeng Zhao, Chaitra Gopalappa, Olatunji Matthew Kolawole, Joint modeling HIV and HPV using a new hybrid agent-based network and compartmental simulation technique, 2023, 18, 1932-6203, e0288141, 10.1371/journal.pone.0288141
    6. Amir Khosheghbal, Peter J. Haas, Chaitra Gopalappa, Mechanistic modeling of social conditions in disease-prediction simulations via copulas and probabilistic graphical models: HIV case study, 2024, 1386-9620, 10.1007/s10729-024-09694-3
  • Reader Comments
  • © 2019 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)
通讯作者: 陈斌, bchen63@163.com
  • 1. 

    沈阳化工大学材料科学与工程学院 沈阳 110142

  1. 本站搜索
  2. 百度学术搜索
  3. 万方数据库搜索
  4. CNKI搜索

Metrics

Article views(9502) PDF downloads(1569) Cited by(25)

Figures and Tables

Figures(10)  /  Tables(1)

Other Articles By Authors

/

DownLoad:  Full-Size Img  PowerPoint
Return
Return

Catalog