Research article Special Issues

Modeling stochastic gene expression: From Markov to non-Markov models

  • Gene expression is an inherently noisy process due to low copy numbers of mRNA or protein. The involved reaction events may happen in a Markov fashion but also in a non-Markov manner, depending on waiting-time distributions for the occurrence of reaction events. In recent years, many mechanistic models of stochastic gene expression have been developed to forecast fluctuations in mRNA or protein levels. Here we discus commonalities between these models as well as their extensions from Markov to non-Markov models, focusing on the contributions of noisy sources to the protein level. We derive a useful formula for the protein noise quantified by the ratio of the variance over the squared mean. This formula, expressed in terms of the frequencies of the probabilistic events, can be used in the fast evaluation of fluctuations in the protein abundance. Although the detail of the formula may vary from gene to gene, it highlights sources of the protein noise, which can be decomposed into two parts: spontaneous fluctuations resulting from the birth and death of the protein and forced fluctuations originated from switching between the promoter states.

    Citation: Zhenquan Zhang, Junhao Liang, Zihao Wang, Jiajun Zhang, Tianshou Zhou. Modeling stochastic gene expression: From Markov to non-Markov models[J]. Mathematical Biosciences and Engineering, 2020, 17(5): 5304-5325. doi: 10.3934/mbe.2020287

    Related Papers:

    [1] Alexander P. Moscalets, Leonid I. Nazarov, Mikhail V. Tamm . Towards a robust algorithm to determine topological domains from colocalization data. AIMS Biophysics, 2015, 2(4): 503-516. doi: 10.3934/biophy.2015.4.503
    [2] Vladimir B. Teif, Andrey G. Cherstvy . Chromatin and epigenetics: current biophysical views. AIMS Biophysics, 2016, 3(1): 88-98. doi: 10.3934/biophy.2016.1.88
    [3] Théo Lebeaupin, Hafida Sellou, Gyula Timinszky, Sébastien Huet . Chromatin dynamics at DNA breaks: what, how and why?. AIMS Biophysics, 2015, 2(4): 458-475. doi: 10.3934/biophy.2015.4.458
    [4] Larisa I. Fedoreyeva, Boris F. Vanyushin, Ekaterina N. Baranova . Peptide AEDL alters chromatin conformation via histone binding. AIMS Biophysics, 2020, 7(1): 1-16. doi: 10.3934/biophy.2020001
    [5] Andrea Bianchi, Chiara Lanzuolo . Into the chromatin world: Role of nuclear architecture in epigenome regulation. AIMS Biophysics, 2015, 2(4): 585-612. doi: 10.3934/biophy.2015.4.585
    [6] Thomas Schubert, Gernot Längst . Studying epigenetic interactions using MicroScale Thermophoresis (MST). AIMS Biophysics, 2015, 2(3): 370-380. doi: 10.3934/biophy.2015.3.370
    [7] E.N. Baranova, R.M. Sarimov, A.A. Gulevich . Stress induced «railway for pre-ribosome export» structure as a new model for studying eukaryote ribosome biogenesis. AIMS Biophysics, 2019, 6(2): 47-67. doi: 10.3934/biophy.2019.2.47
    [8] Edward N Trifonov . Columnar structure of SV40 minichromosome. AIMS Biophysics, 2015, 2(3): 274-283. doi: 10.3934/biophy.2015.3.274
    [9] Michael-Christian Mörl, Tilo Zülske, Robert Schöpflin, Gero Wedemann . Data formats for modelling the spatial structure of chromatin based on experimental positions of nucleosomes. AIMS Biophysics, 2019, 6(3): 83-98. doi: 10.3934/biophy.2019.3.83
    [10] Klemen Bohinc, Leo Lue . On the electrostatics of DNA in chromatin. AIMS Biophysics, 2016, 3(1): 75-87. doi: 10.3934/biophy.2016.1.75
  • Gene expression is an inherently noisy process due to low copy numbers of mRNA or protein. The involved reaction events may happen in a Markov fashion but also in a non-Markov manner, depending on waiting-time distributions for the occurrence of reaction events. In recent years, many mechanistic models of stochastic gene expression have been developed to forecast fluctuations in mRNA or protein levels. Here we discus commonalities between these models as well as their extensions from Markov to non-Markov models, focusing on the contributions of noisy sources to the protein level. We derive a useful formula for the protein noise quantified by the ratio of the variance over the squared mean. This formula, expressed in terms of the frequencies of the probabilistic events, can be used in the fast evaluation of fluctuations in the protein abundance. Although the detail of the formula may vary from gene to gene, it highlights sources of the protein noise, which can be decomposed into two parts: spontaneous fluctuations resulting from the birth and death of the protein and forced fluctuations originated from switching between the promoter states.


    1. Introduction

    Chromatin is the scene of a vast set of epigenetic modifications such as histone marks and DNA methylation. The epigenomic state of chromatin is locally defined along the genome, and this epigenomic state contributes to transcription regulation by modulating regulatory sequence accessibility, transcription factor binding, and the propensity of other genetic processes [1]. Recent initiatives successfully determined complete epigenomes for various human cell lines [2,3,4]. The consensual understanding of the functional role of epigenomic domains is that, in the course of development, [5,6,7], genomic loci are provided with various combinations of histone marks so as to selectively enable tissue-specific profiles of gene expression.

    In parallel with the characterization of epigenomic domains, Chromosome Conformation Capture techniques (3C and HiC [8,9]) showed that chromosomes exhibit complex spatial architectures. In human cells, chromosomes are spatially segregated in territories which are themselves composed of topologically associating domains (TAD) at the kilobase-to-megabase scale [10,11]. TADs are consecutive sequences (from a few dozen to hundreds of kilobase pairs) of chromatin that preferentially form contacts with each other, and do not colocalize with sequences from other loci. Chromatin topology is linked to transcription control, for instance by selectively enabling regulatory sequence interaction with their target locus [12,13,14], by synchronizing expressions of genes located on the same TAD, thus acting as regulons [15], or by selectively localizing genes to transcription factories [16].

    Although experimental data accumulates regarding both epigenomes and chromosome architecture thanks to advances in molecular biology and bioinformatics, a concise, clear-cut understanding of how epigenetic marks contribute to gene expression regulation through chromatin spatial reorganization remains elusive. The vast combinatoric complexity of epigenetic marks led to simplified models of epigenomic states where epigenetic marks are combined into a reduced set of so-called chromatin “colors” [4,17,18]. In this view, the genome is composed of a succession of blocks with various colors, with a typical size of a few hundred kilobase pairs. This also revealed that TADs and epigenomic color domains seem to roughly overlap, suggesting that epigenomic domains not only modulate transcription by acting as signalling “tags”, but also contribute to specify the spatial architecture of chromatin.

    The modelling community basing the research on polymer physics seized this perspective to propose physically-driven models of chromosome conformation that would explain how epigenomic domains determine the architecture of the genome. These modelling studies are all based on copolymer physics principles, since chromosomes are copolymers and not just homopolymers, precisely because of epigenetic marking [19]. In this respect, there are two main modelling approaches: the String & Binders Switch (SBS) models, where chromosomes are flexible polymer combined with molecular actors that specifically bind chromatin loci together to form loops and other structures [20,21] ; and the interacting copolymer models in which monomers of chromatin have specific interaction energies according to their epigenomic state [22,23]. In [23], Jost et al. noticed that the overlap between TADs and epigenomic domains was strongly akin to block copolymers, and therefore introduced an interacting 2-color block copolymer model which provided a surprisingly rich conformational space and fairly reproduced contact maps observed experimentally using HiC assays. This suggests that the wealth of genetic expression patterns observed in human cell types is made possible by the richness of the chromatin conformational space, where combinations of TADs can be selectively opened/closed and/or expressed based on their conformation.

    In parallel, we demonstrated recently that homogeneous polymers of different sizes would undergo a coil-globule transition at different temperatures [24]: for a given temperature, small polymers tend to adopt a coil conformation at equilibrium whereas larger polymers will adopt a globule conformation. In this study, we aim at demonstrating that this size-dependent coil-globule transition also affects the conformation of epigenomic domains and TADs. We show that the expression for the probability density of polymer conformations at equilibrium we proposed in [24], which takes into account the polymer size and its interaction energy parameters, also correctly describes the conformations observed when simulating a block copolymer with various block sizes. For a 2-color block copolymer, the conformation of epigenomic blocks is size-dependent and obeys the same coil-globule transition as homoplymers. Our main conclusion is that not only the strength of the interaction of epigenomic domains (both intra-domain and inter-domain interactions) affects chromatin conformations, but also the size of epigenomic domains physically regulates gene expression. This opens a new perspective in which simply changing the span of epigenomic domains (i.e. by spreading / erasing epigenomic marks) is sufficient to produce a vast set of selective chromatin folding patterns, hence a vast set of gene expression patterns.

    2. Material and Methods

    We present in this section the theoretical model and computational method we used for the finitesize scaling analysis of the conformational space of epigenomic domains. We explain the order parameters we retained for the theoretical study of the finite-size coil-globule transition for a homogeneous (single color) polymer, and the expected behaviors when varying the polymer size. We briefly introduce the block copolymer model akin to the one presented in [23] that we used for modelling chromatin domains. For the sake of simplicity, we chose a 2-color model (domains of A and domains of B). We then present the Langevin dynamics simulation software used for our numerical results.

    2.1. Chromatin as a block copolymer

    Our framework uses the same fundamental assumptions as presented in [23] : a chromatin fiber is modelled as a bead-spring chain (see fig.1). Each bead is assigned one color (either A or B). The color ci of monomer i determines its interaction energy Uij with monomer j of color cj as follows:

    Uij={0,if cicj or rij>rmaxUS(1ea(rijr0))2,if ci=cj and rij<rmax
    (1)

    Figure 1. Copolymer model. (A) Snapshot of a simulation showing an intermediate coilglobule conformation. monomers of type A are in red, monomers of type B are in green. (B) Principles of the bead-spring copolymer simulation model. Beads are linked by harmonic springs with stiffness ks. Two monomers of type A (respectively B) have an interaction energy UA (respectively UB). Monomers of different colors do not interact except by volume exclusion. (C) The monomer sequences of simulated copolymers were chosen so that they are composed of 3 tandem repeats of a sub-sequence (ANABNB) with NA + NB = 60.

    The interaction model is a cut-off Morse potential whose depth was set to Us, with maximum range rmax, potential well width a and minimal energy distance r0 (see table 1). Us is the specific interaction energy between monomers of the same color, with US = UA (respectively UB) if ci = cj = A (respectively B) in Eq.1. We neglect non-specific interactions between monomers. The specific interaction energy represents the effect of epigenomic marks on chromatin monomer physico-chemical properties and more generally on their propensity to bind through other, possibly unknown, molecular mechanisms (insulator binding, loop-mediating proteins, etc.). Each bead represents 10kb of chromatin. The stiffness of the connecting springs is set to @k_s = \frac{3k_{B}T}{b^2}@, where kB is Boltzmann’s constant, T the temperature and b is both the effective Kuhn length of the polymer and the spring relaxed extension.

    Epigenomic domains are introduced by setting the color of monomers along the polymer in the following way : an alternance of NA monomers of type A and NB monomers of type B, repeated NR times (denoted (ANABNB)NR ). Unless otherwise mentioned, NA+NB = 60 and NR = 3 for all simulations. To simulate a homopolymer, we set NA = N; NB = 0; NR = 1. We will call N the polymer size of which the conformation is observed, so that for homopolymers, N is the total polymer size, and for block copolymers, N is the size of the observed epigenomic domain (NA or NB respectively). The premise of the model is therefore similar to [23], except that we use a global-scheme Langevin dynamics engine (detailed further) which allows us to efficiently sample the conformational space, and we vary the domain sizes as well as the interaction energy to elucidate the size dependence of conformations.

    2.2. Finite-size scaling of the coil-globule transition

    A flexible homopolymer composed of N identical monomers can adopt a conformation at equilibrium that is either coil (self avoiding random walk) or globule (compact random walk) [25]. The polymer can transit from a coil to a globule as the temperature decreases. The typical physical situation where such a transition is observed occurs for a polymer in a bad solvent : as the temperature decreases, the interaction energy between monomers favors a more compact, dense conformation. In our study, the temperature is kept fixed at T = 300K and the interaction energy US is varied, which achieves the same effect. The same transition from coil to globule is therefore expected as the interaction energy increases at fixed temperature.

    The conformation of the chain can be characterized by the probability density of the radius of gyration r at equilibrium:

    r=1NNi=1(riRG)2
    (2)

    where ri is the position of the ith monomer and RG is the position of the center of mass of the polymer. At a given chain size N, the larger r, the more relaxed the conformation, and the smaller r, the more globular, the denser the conformation. A polymer of size N can have one of three principal types of conformations, each having its specific scaling : globules for which < r > @\propto N^{1/d}@ (d is the spatial dimension), coils for which < r > @\propto N^\nu@ (@\nu@ is the Flory exponent), and stretchs for which < r > @\propto N@. These scalings are however only valid in the thermodynamic limit (@N \to \infty@). In previous works [26,24], we proposed an expression for the finite-size Boltzmann-Gibbs distribution of the chain conformations by combining these three scalings into a single distribution by means of finitesize scaling analysis:

    PN(ˆt)=ˆCNˆtceN(a1ˆt+a2ˆt2)ea3(Nˆt)q
    (3)

    The coefficients ai depend on the monomer interaction energy US and the temperature T and are related to the global shape of the distribution.

    The complete derivation of Eq.3 is available in [26]. For the sake of clarity and completeness, we give here the basic elements of this derivation. First, Eq.3 is the probability density of the order parameter @\hat{t}@, which is a proxy for the polymer conformation derived from the radius of gyration r and the density @\rho = N/r^{3}@ :

    ˆt=(N3/2r3)5/4
    (4)

    Hence, the larger @\hat{t}@, the denser the conformation. Then, the two main classes of conformations in the thermodynamic limit (coils and globules) appear in Eq.3 as follows. The globule limit of the distribution is found in the factor :

    PG(ˆt)=eN(a1ˆt+a2ˆt2)
    (5)

    while the coil limit of the distribution is found in the factor :

    PC(ˆt)=ea3(Nˆt)q
    (6)

    with @q=\frac{\nu - 1/3}{1-\nu} \sim 0.617@ [27] and @\nu \sim 0.588@ is the Flory exponent. The final contribution to the whole distribution normalization is the factor @\hat{t}^c\hat{\mathcal{C}}^{(\beta)}_{N}@, where the exponent c ~ -1.13 arises from the cardinality of the space of self-avoiding walks, and @{\mathcal{C}}^{(\beta)}_{N}@ N is a normalization constant depending on the polymer size and the interaction energy US .

    Eq.3 for @P_N(\hat{t})@ is relevant to polymers with both excluded-volume and attractive interactions between monomers. This expression predicts that, at a given temperature, small polymers adopt a coil conformation while large polymers adopt a globule conformation, as observed in simulations [28,27,29] and experimentally [30]. In this work, we demonstrate that it also correctly describes the conformation of individual blocks of a copolymer. One notable result of our previous finite-size scaling analysis is that not only the average < @\hat{t}@ > scales differently with respect to the size (i.e. the radius of gyration), but the whole shape of the distribution changes due to finite-size effects [24]. This is shown in Eq.3 as @-N(a_{1}\hat{t} + a_{2}\hat{t}^{2})@ becomes dominant for large N compared to @-a_{3}(N\hat{t})^{-q}@. For a polymer in a coil conformation (high temperature or low attractive interactions), the distribution typically exhibits an exponential peak at @\hat{t} \sim 0@ whereas for a globule conformation (low temperature or high attractive interactions), the distribution exhibits a bell shape around a central value @\hat{t} \gg 0@. At a given temperature T near the so called @\theta@-point (or at intermediate attractive interactions), coil and globule conformations coexist.

    We have checked the validity of Eq.3 for homopolymers by fitting it to empirical distributions of @\hat{t}@ obtained from simulations, for various N and various interaction energies. Then we showed that empirical distributions of @\hat{t}@ taken from the conformations of epigenomic domains of simulated copolymers also satisfy Eq.3, demonstrating that each epigenomic domain undergoes its own size-dependent coil-globule transition when its size changes and its interaction energy remains fixed.

    2.3. Langevin Dynamics simulation

    We used a software library developped in-house, CGMDODE [31], to build a simulator of copolymer dynamics. The simulator builds a chain of N beads from a specified sequence of the form (ANABNB)NR . We specify the physical parameters of each bead (mass, radius, ...) as well as the specific interaction energy based on its color. For simplicity and to reduce the parameter space, we used the same physical properties for the two colors except for the interaction energy (see parameter table 1). Each bead represents 10kb of chromatin (i.e. ~ 60 nucleosomes). Domains spanned 5 to 55 beads in size, corresponding to actual chromatin sizes of 50 to 550 kb, i.e. the typical range of observed human epigenomic domain sizes. We used the built-in physics engine collision detection algorithm to enforce volume exclusion. A-A (respectively B-B) interactions were implemented by applying a force on each pair of monomers derived according to Eq.1.

    We implemented the global Langevin dynamics scheme [32] into our simulator, as recently developped for physics engines [33,34]. The global Langevin dynamics allows for a very fast convergence of conformations towards their equilibrium distribution as well as an efficient sampling of the conformational space at equilibrium, while preserving the accuracy of the ensemble statistics of dynamical properties [33].

    Table 1. Langevin dynamics simualtions parameters for beads of 10kb.
    descriptionnotationvalue
    thermostat coupling frequency@\gamma@@10^8\text{s}^{-1}@
    timestep@\Delta t@@5\times10^{-9}@s
    total simulation time@t_{sim}@@5\times10^{-2}@
    Morse potential max. range@r_{max}@@7.5 \times 10^{-7}@m
    Morse potential width factor@a@@10^{7}\text{m}^{-1}@
    Morse potential energy minimum distance@r_0@@6 \times 10^{-8}@m
    bead mass@m_{b}@@2.2 \times10^{-20}@kg
    bead radius @r_{b}@@6 \times 10^{-8}@m
    Kuhn length@b@@1.5 \times 10^{-7}@m
    TemperatureT300K
     | Show Table
    DownLoad: CSV

    3. Results

    We simulated trajectories of a homopolymer to check Eq.3, then simulated 2-color block copolymers with various domain sizes and interaction energies. We computed the values of @\hat{t}@ from the trajectories for each simulation parameter set. Unless otherwise mentioned, we generated at least 12 independent runs for each simulation parameter set (20 for block copolymers), for a total simulation time of 5 × 10-2s with a timestep of 5 × 10-9s (a total of 107 timesteps). We observed that conformations reached equilibrium after 10-3s of simulation time. We exploited one trajectory frame every 10 000 frames, starting after 2 × 10-3s of simulation time, to ensure that conformations were at equilibrium. The frame sampling ensures that sampled conformations are independent of each other, as we checked that the auto-correlation function of the snapshot radius of gyration r decreased exponentially with a characteristic time ~ 10-5s (approximately 5 000 timesteps). We computed @\hat{t}@ for every frame sampled (a single value based on the total polymer conformation for homopolymers, or 3 independent values based on each of the 3 individual block conformations for block copolymers). This yielded, for every condition, at least 8 000 uncorrelated conformations for homopolymers, and at least 40 000 for block copolymer domains. We fitted Eq.3 on the obtained empirical probability densities. Fits were obtained using the Levenberg-Marquardt algorithm for non-linear regression implemented in the package minpack.lm of the R software.

    3.1. Size-dependent conformations of a homopolymer

    Empirical probability densities of conformations for a homopolymer (N monomers of the same color) are shown in fig.2, for different polymer sizes N and attractive interaction energies US (given in absolute value). At low attractive energy (fig.2-A), all polymer sizes yield a peaked distribution at @\hat{t}@ close to 0, characteristic of coil conformations. When attractive interactions increase, the distribution progressively slides towards larger @\hat{t}@ values, and long polymers undergo the transition to globule conformations sooner - i.e. at weaker attractive interactions - than short polymers (fig.2-B and -C). Globule densities stabilize around a central @\hat{t}@ value that depends on the strength of the attractive interaction US and the polymer size N. Eq.3 is in very good agreement with simulation data for all N and US (fig.2 solid lines).

    Figure 2. Size-dependent coil-globule transition for a homopolymer. Empirical probability density for the order parameter @\hat{t}@ (symbols) of the conformation of a homopolymer obtained from simulations, at fixed temperature. N refers to the total size of the polymer. Solid lines are fits for PN(@\hat{t}@) using Eq 3.

    3.2. Size-dependent conformations for epigenomic domains

    We simulated block copolymers (see fig.1-C for block sequences) with NA, NB between 5 and 55. We observed a similar size-dependent coil-globule transition for epigenomic domains of simulated block copolymers, as shown by probability densities of @\hat{t}@ at fixed energies and various domain sizes (fig.3), or at fixed domain size and varying energies (fig.3). In fig.3-A (US = 0.1kBT), attractive interactions are too low and all domains are in a coil conformation. When attractive strength increases, large domains undergo a transition towards globule conformations, with larger domains transiting first. It appears in fig.3-B at US = 0:2kBT: domains of N=55 monomers form globules (bell-shaped curve on the right of the x-axis) whereas smaller domains keep a coil conformation (peaked curve on the left of the x-axis), and in fig.3-C, domains of sizes between 30 and 55 have transited, domains of size 20 have mixed coil-globule conformations, and domains of size 10 are still mainly coils and just beginning to transit into globules. Eq.3 for PN(@\hat{t}@) was fitted on the densities obtained from block copolymer domain conformations with very good agreement (solid lines fig.3).

    Figure 3. Size-dependent coil-globule transition of blocks in a copolymer. Empirical probability densities for the order parameter @\hat{t}@ (symbols) of the conformation of an epigenomic block obtained from simulations, at fixed temperature. N here refers to the size of the block. Solid lines are fits for PN(@\hat{t}@) using Eq.3.

    Figure 4. Block copolymer conformation samples. Snapshots of simulated conformations for a block copolymer of sequence (A20B40)3. Left and middle : attractive interactions Us = 0.225kBT. Small domains (green) adopt a coil conformation while large domains (red) adopt a globule conformation. Size-dependent folding produces a vast set of conformation combination possibilities, such as string of globules (left) and central globule with extruding loops (middle). Right : example of microphase separation conformation (two separated globules) for a block copolymer (A20B40)3, with attractive interactions US = 0.25kBT.

    The main conclusions of this study are, for a given attractive interaction strength US :

    · (i) if the temperature is high (or US is low), then the scaling of @\hat{t}@ is the same regardless of N : the average radius of gyration of short domains and long domains has the same scaling in N (@\sim N^{\nu}@ for coils, ~ N1/3 for globules).

    · (ii) if the temperature is near @\theta@ (@=\theta(N \to \infty)@) (or attraction strength is close to @1/\theta(N \to \infty)@), then the scaling becomes size-dependent : for a polymer of size N, if T > @\theta(N)@ then the polymer is a coil, whereas if T < @\theta(N)@, the polymer is a globule.

    This holds for homopolymers as well as for blocks of a copolymer taken individually, and therefore for epigenomic domains. Our simulations for chromatin showed that a change of interaction energy of 0.1 kbT is enough to change the scaling for domains of 550 kb (fig.3-A to B, N=55). In terms of size, at US = 0.2kT, going from domains of size N = 30 to N = 55 was enough to complete the transition from coil to globule. This suggests that small changes in domain sizes may have huge consequences in TADs and epigenomic domain foldings.

    4. Discussion

    This work was set in the context of a simple model of chromatin as a block copolymer, stemmed from the observation that finite polymers undergo a size-dependent coil-globule transition. Using Langevin dynamics simulations, we showed that, at fixed temperature, a polymer with attractive and excluded-volume interactions adopts a conformation that depends on its size: large polymers will have a globule conformation and small ones a coil conformation. We demonstrated that this still holds when considering blocks of a 2-color copolymer taken individually: the conformations of the blocks depend on their size. We provided an expression for the probability density of size-dependent conformations and verified its extremely good agreement with simulation results for both homopolymers and block copolymer domains.

    Our model uses the same mechanical properties on both simulated epigenomic colors, as only their respective specific interaction energies were modulated. Yet, it is quite possible that mechanical properties of the chromatin fiber might also be affected by epigenetic marks. However experimental data are still lacking in mammalian cells concerning the persistence length of the chromatin fiber - the elusive 30nm fiber- in euchromatin and heterochromatin, note to mention that actual structures of the chromatin fiber are still currently debated [19,35]. Additionnally, the compaction rates (in bp per nm) of chromatin fibers in euchromatin and heterochromatin are most probably very different. When the structural and mechanical properties of various chromatin types will be better characterized experimentally, they could be implemented in a more refined version of our model.

    5. Conclusion

    The selective folding of domains depending on their size observed demonstrates that two epigenomic domains of the same epigenomic color taken anywhere on the genome will have different conformations if their sequence lengths are different, and notably, that the size difference is not required to be huge for a dramatic change in domain folding. This opens new perspectives for linking epigenomes and chromosome conformation data, as we showed that the span (in terms of sequence) of an epigenomic domain directly influences its folding. This size-dependent folding could link the molecular processes that write / erase epigenetic marks on the chromatin [36] during development and changes in chromosome architecture. During development, by removing the barriers and/or the biochemical cues that delimit domains, epigenetic marks are spread and/or erased by dynamical processes [37,38], leading to modified domain sizes, and thus alternative domain folding. Alternative folding then leads to specific gene expression patterns, as chromosome spatial organization contributes to transcription regulation. Hence, this simple scenario offers new perspectives on how cells can produce a large set of chromosome architectures using a very limited set of epigenomic states. This is why, in future chromatin modelling studies, the effect of the sizes of epigenomic domains should not be overlooked. For instance, combined statistical analysis of conformation and epigenomic data aimed at determining the interaction energy of chromatin colors will fail if the model does not take into account the principle presented in this study. Finally, single-cell HiC experiments [39] should provide invaluable insights regarding the importance of this effect, and its role in gene expression stochasticity and robustness.

    Acknowledgments

    We gratefully acknowledge the support of NVIDIA which provided computing resources used for this study. This work was funded by the French Agence Nationale de la Recherche, Grant No. ANR- 13-BSV5-0010-03 and by the French Institut National du Cancer, Grant No. INCa 5960.

    Conflict of Interest

    The authors declare no conflict of interest.



    [1] A. Sanchez, S. Choubey, J. Kondev, Stochastic models of transcription: From single molecules to single cells, Methods, 62 (2013), 13-25.
    [2] J. Zhang, T. Zhou, Promoter-mediated transcriptional dynamics, Biophys. J., 106 (2014), 479-488.
    [3] T. Zhou, J. Zhang, Analytical results for a multistate gene model, SIAM J. Appl. Math., 72 (2012), 789-818.
    [4] L. Cai, N. Friedman, X. S. Xie, Stochastic protein expression in individual cells at the single molecule level, Nature, 440 (2006), 358-362.
    [5] T. Liu, J. Zhang, T. Zhou, Effect of interaction between chromatin loops on cell-to-cell variability in gene expression, PLoS Comput. Biol., 12 (2016), e1004917.
    [6] D. R. Rigney, W. C. Schieve, Stochastic model of linear, continuous protein-synthesis in bacterial populations, J. Theor. Biol., 69 (1977), 761-766.
    [7] O. G. Berg, A model for the statistical fluctuations of protein numbers in a microbial population, J. Theor. Biol., 71 (1978), 587-603.
    [8] P. K. Tapaswi, R. K. Roychoudhury, T. Prasad, A stochastic model of gene activation and RNA synthesis during embryogenesis, Sankhyā Indian J. Statist. Ser. B, 49 (1987), 51-67.
    [9] J. Peccoud, B. Ycart, Markovian modeling of gene-product synthesis, Theor. Popul. Biol., 48 (1995), 222-234.
    [10] D. R. Rigney, Stochastic model of constitutive protein levels in growing and dividing bacterial cells, J. Theor. Biol., 76 (4), 453-480.
    [11] D. R. Rigney, Stochastic models of cellular variability. In Kinetic Logic A Boolean Approach to the Analysis of Complex Regulatory Systems, Springer, Berlin, Heidelberg, 1979.
    [12] T. B. Kepler, T. C. Elston, Stochasticity in transcriptional regulation: origins, consequences, and mathematical representations, Biophys. J., 81 (2001), 3116-3136.
    [13] M. Thattai, A. Van Oudenaarden, Intrinsic noise in gene regulatory networks, Proc. Natl. Acad. Sci. U.S.A., 98 (2001), 8614-8619.
    [14] P. S. Swain, M. B. Elowitz, E. D. Siggia, Intrinsic and extrinsic contributions to stochasticity in gene expression, Proc. Natl. Acad. Sci. U.S.A., 99 (2002), 12795-12800.
    [15] M. Sasai, P. G. Wolynes, Stochastic gene expression as a many-body problem, Proc. Natl. Acad. Sci. U.S.A., 100 (2003), 2374-2379.
    [16] T. Jia, R. V. Kulkarni, Intrinsic noise in stochastic models of gene expression with molecular memory and bursting, Phys. Rev. Lett., 106 (2011), 058102.
    [17] Z. Cao, R. Grima, Linear mapping approximation of gene regulatory networks with stochastic dynamics, Nat. Commun., 9 (2018), 1-15.
    [18] C. V. Harper, B. Finkenstädt, D. J. Woodcock, S. Friedrichsen, S. Semprini, L. Ashall, et al., Dynamic analysis of stochastic transcription cycles, PLoS Biol., 9 (2011), e1000607.
    [19] M. R. Green, Eukaryotic transcription activation: Right on target, Mol. Cell, 18 (2005), 399-402.
    [20] J. Paulsson, Models of stochastic gene expression, Phys. Life Rev., 2 (2005), 157-175.
    [21] G. Hornung, R. Bar-Ziv, D. Rosin, N. Tokuriki, D. S. Tawfik, M. Oren, et al., Noise-mean relationship in mutated promoters, Genom. Res., 22 (2012), 2409-2417.
    [22] Q. Li, G. Barkess, H. Qian, Chromatin looping and the probability of transcription, Trend. Genet., 22 (2006), 197-202.
    [23] C. W. Gardiner, Handbook of Stochastic Methods for Physics, Chemistry and the Natural Sciences, Springer, Berlin, Heidelberg, 2004.
    [24] J. D. Jordan, E. M. Landau, R. Iyengar, Signaling networks: The origins of cellular multitasking, Cell, 103 (2000), 193-200.
    [25] L. Bintu, J. Yong, Y. E. Antebi, K. McCue, Y. Kazuki, N. Uno, et al., Dynamics of epigenetic regulation at the single-cell level, Science, 351 (2016), 720-724.
    [26] C. W. Gardiner, Stochastic Methods: a handbook for the natural and social sciences, Springer, New York, 2009.
    [27] N. G. Van Kampen, Stochastic Processes in Physics and Chemistry, North-Holland, Amsterdam, 2007.
    [28] E. Pardoux, Markov Processes and Applications: Algorithms, Networks, Genome and Finance, vol 796, John Wiley & Sons, New York, 2008.
    [29] H. Andersson, T. Britton, Stochastic epidemic models and their statistical analysis, vol. 151, Springer Science & Business Media, 2012.
    [30] M. Salathé, M. Kazandjieva, J. W. Lee, P. Levis, M. W. Feldman, J. H. Jones, A high-resolution human contact network for infectious disease transmission, Proc. Natl. Acad. Sci. U.S.A., 107 (2010), 22020-22025.
    [31] A. Corral, Long-term clustering, scaling, and universality in the temporal occurrence of earthquakes, Phys. Rev. Lett., 92 (2004), 108501.
    [32] P. S. Stumpf, R. C. Smith, M. Lenz, A. Schuppert, F. J. Müller, A. Babtie, et al., Stem cell differentiation as a non-Markov stochastic process, Cell Syst., 5 (2017), 268-282.
    [33] D. M. Suter, N. Molina, D. Gatfield, K. Schneider, U. Schibler, F. Naef, Mammalian genes are transcribed with widely different bursting kinetics, Science, 332 (2011), 472-474.
    [34] T. Guérin, O. Bénichou, R. Voituriez, Non-Markovian polymer reaction kinetics, Nat. Chem., 4 (2012), 568-573.
    [35] A. L. Barabasi, The origin of bursts and heavy tails in human dynamics, Nature, 435 (2005), 207-211.
    [36] J. M. Pedraza, J. Paulsson, Effects of molecular memory and bursting on fluctuations in gene expression, Science, 319 (2008), 339-343.
    [37] G. Srinivasan, D. M. Tartakovsky, B. A. Robinson, A. B. Aceves, Quantification of uncertainty in geochemical reactions, Water Resour. Res., 43 (2007), W12415.
    [38] S. Condamin, O. Bénichou, V. Tejedor, R. Voituriez, J. Klafter, First-passage times in complex scale-invariant media, Nature, 450 (2007), 77-80.
    [39] G. Guigas, M. Weiss, Sampling the cell with anomalous diffusion—the discovery of slowness, Biophys. J., 94 (2008), 90-94.
    [40] Y. Meroz, I. M. Sokolov, J. Klafter, Distribution of first-passage times to specific targets on compactly explored fractal structures, Phys. Rev. E, 83 (2011), 020104.
    [41] M. Dentz, A. Russian, P. Gouze, Self-averaging and ergodicity of subdiffusion in quenched random media, Phys. Rev. E, 93 (2016), 010101.
    [42] A. A. Ovchinnikov, Y. B. Zeldovich, Role of density fluctuations in bimolecular reaction kinetics, Chem. Phys., 28 (1978), 215-218.
    [43] M. Dobrzyński, F. J. Bruggeman, Elongation dynamics shape bursty transcription and translation, Proc. Natl. Acad. Sci. U.S.A., 106 (2009), 2583-2588.
    [44] D. R. Larson, D. Zenklusen, B. Wu, J. A. Chao, R. H. Singer, Real-time observation of transcription initiation and elongation on an endogenous yeast gene, Science, 332 (2011), 475-478.
    [45] S. Yunger, L. Rosenfeld, Y. Garini, Y. Shav-Tal, Single-allele analysis of transcription kinetics in living mammalian cells, Nat. Methods, 7 (2010), 631-633.
    [46] I. Golding, J. Paulsson, S. M. Zawilski, E. C. Cox, Real-time kinetics of gene activity in individual bacteria, Cell, 123 (2005), 1025-1036.
    [47] T. Muramoto, D. Cannon, M. Gierliński, A. Corrigan, G. J. Barton, J. R. Chubb, Live imaging of nascent RNA dynamics reveals distinct types of transcriptional pulse regulation, Proc. Natl. Acad. Sci. U.S.A., 109 (2012), 7350-7355.
    [48] A. Raj, C. S. Peskin, D. Tranchina, D. Y. Vargas, S. Tyagi, Stochastic mRNA synthesis in mammalian cells, PLoS Biol., 4 (2006), e309.
    [49] D. G. Spiller, C. D. Wood, D. A. Rand, M. R. White, Measurement of single-cell dynamics, Nature, 465 (2010), 736-745.
    [50] A. Eldar, M. B. Elowitz, Functional roles for noise in genetic circuits, Nature, 467 (2010), 167-173.
    [51] B. Zoller, D. Nicolas, N. Molina, F. Naef, Structure of silent transcription intervals and noise characteristics of mammalian genes, Mol. Syst. Biol., 11 (2015), 823.
    [52] T. R. Sokolowski, T. Erdmann, P. R. Ten Wolde, Mutual repression enhances the steepness and precision of gene expression boundaries, PLoS Comput. Biol., 8 (2012), e1002654.
    [53] J. Paulsson, Summing up the noise in gene networks, Nature, 427 (2001), 415-418.
    [54] D. R. Larson, What do expression dynamics tell us about the mechanism of transcription?, Curr. Opin. Gen. Dev., 21 (2011), 591-599.
    [55] V. Shahrezaei, P. S. Swain, Analytical distributions for stochastic gene expression, Proc. Natl. Acad. Sci. U.S.A., 105 (2008), 17256-17261.
    [56] N. Kumar, A. Singh, R. V. Kulkarni, Transcriptional bursting in gene expression: analytical results for general stochastic models, PLoS Comput. Biol., 11 (2015), e1004292.
    [57] Z. Wang, Z. Zhang, T. Zhou, Exact distributions for stochastic models of gene expression with arbitrary regulation, Sci. China Math., 63 (2020), 485-500.
    [58] P. Liu, Z. Yuan, L. Huang, T. Zhou, Roles of factorial noise in inducing bimodal gene expression, Phys. Rev. E, 91 (2015), 062706.
    [59] J. Zhang, Q. Nie, T. Zhou, A moment-convergence method for stochastic analysis of biochemical reaction networks, J. Chem. Phys., 144 (2016), 194109.
    [60] A. B. O. Daalhuis, Confluent hypergeometric functions, NIST Handb. Math. Funct., 2010.
    [61] T. Aquino, M. Dentz, Chemical continuous time random walks, Phys. Rev. Lett., 119 (2017), 230601.
    [62] N. Masuda, M. A. Porter, R. Lambiotte, Random walks and diffusion on networks, Phys. Rep., 716 (2017), 1-58.
    [63] R. Kutner, J. Masoliver, The continuous time random walk, still trendy: fifty-year history, state of art and outlook, Eur. Phys. J. B, 90 (2017), 50.
    [64] L. Liu, B. R. K. Kashyap, J. G. C. Templeton, On the GIX/G/∞ system, J. Appl. Prob., 27 (1990), 671-683.
    [65] A. R. Stinchcombe, C. S. Peskin, D. Tranchina, Population density approach for discrete mRNA distributions in generalized switching models for stochastic gene expression, Phys. Rev. E, 85 (2012), 061919.
    [66] N. Masuda, L. E. Rocha, A Gillespie algorithm for non-Markovian stochastic processes, SIAM Rev., 60 (2018), 95-115.
    [67] C. Deneke, R. Lipowsky, A. Valleriani, Complex degradation processes lead to non-exponential decay patterns and age-dependent decay rates of messenger RNA, PloS One, 8 (2013), e55442.
    [68] B. C. Arnold, Majorization: Here, there and everywhere, Statist. Sci., 22 (2007), 407-413.
    [69] A. David, S. Larry, The least variable phase type distribution is Erlang, Stochastic Models, 3 (1987), 467-473.
    [70] J. Zhang, T. Zhou, Markovian approaches to modeling intracellular reaction processes with molecular memory, Proc. Natl. Acad. Sci. U.S.A., 116 (2019), 23542-23550.
    [71] H. Qiu, B. Zhang, T. Zhou, Analytical results for a generalized model of bursty gene expression with molecular memory, Phys. Rev. E, 100 (2019), 012128.
    [72] A. Coulon, C. C. Chow, R. H. Singer, D. R. Larson, Eukaryotic transcriptional dynamics: From single molecules to cell populations, Nat. Rev. Genet., 14 (2013), 572-584.
    [73] W. J. Blake, M. Kærn, C. R. Cantor, J. J. Collins, Noise in eukaryotic gene expression, Nature, 422 (2003), 633-637.
    [74] J. M. Raser, E. K. O'Shea, Control of stochasticity in eukaryotic gene expression, Science, 304 (2004), 1811-1814.
    [75] N. Friedman, L. Cai, X. S. Xie, Linking stochastic dynamics to population distribution: an analytical framework of gene expression, Phys. Rev. Lett., 97 (2006), 168302.
    [76] A. M. Kringstein, F. M. Rossi, A. Hofmann, H. M. Blau, Graded transcriptional response to different concentrations of a single transactivator, Proc. Natl. Acad. Sci. U.S.A., 95 (1998), 13670-13675.
    [77] J. Stewart-Ornstein, C. Nelson, J. DeRisi, J. S. Weissman, H. El-Samad, Msn2 coordinates a stoichiometric gene expression program, Curr. Biol., 23 (2013), 2336-2345.
    [78] J. Paulsson, M. Ehrenberg, Noise in a minimal regulatory network: plasmid copy number control, Quart. Rev. Biophys., 34 (2001), 1-59.
    [79] M. B. Elowitz, A. J. Levine, E. D. Siggia, P. S. Swain, Stochastic gene expression in a single cell, Science, 297 (2002), 1183-1186.
  • This article has been cited by:

    1. Martina Pannuzzo, Bruno A. C. Horta, Carmelo La Rosa, Paolo Decuzzi, Predicting the Miscibility and Rigidity of Poly(lactic-co-glycolic acid)/Polyethylene Glycol Blends via Molecular Dynamics Simulations, 2020, 53, 0024-9297, 3643, 10.1021/acs.macromol.0c00110
    2. Vladimir B. Teif, Andrey G. Cherstvy, Chromatin and epigenetics: current biophysical views, 2016, 3, 2377-9098, 88, 10.3934/biophy.2016.1.88
    3. N. Haddad, D. Jost, C. Vaillant, Perspectives: using polymer modeling to understand the formation and function of nuclear compartments, 2017, 25, 0967-3849, 35, 10.1007/s10577-016-9548-2
    4. Ruggero Cortini, Maria Barbi, Bertrand R. Caré, Christophe Lavelle, Annick Lesne, Julien Mozziconacci, Jean-Marc Victor, The physics of epigenetics, 2016, 88, 0034-6861, 10.1103/RevModPhys.88.025002
    5. Pauline Bacle, Marie Jardat, Virginie Marry, Guillaume Mériguet, Guillaume Batôt, Vincent Dahirel, Coarse-Grained Models of Aqueous Solutions of Polyelectrolytes: Significance of Explicit Charges, 2020, 124, 1520-6106, 288, 10.1021/acs.jpcb.9b09725
    6. Juan D Olarte-Plata, Noelle Haddad, Cédric Vaillant, Daniel Jost, The folding landscape of the epigenome, 2016, 13, 1478-3975, 026001, 10.1088/1478-3975/13/2/026001
    7. Rui Zhou, Yi Qin Gao, Polymer models for the mechanisms of chromatin 3D folding: review and perspective, 2020, 22, 1463-9076, 20189, 10.1039/D0CP01877E
    8. Antony Lesage, Vincent Dahirel, Jean-Marc Victor, Maria Barbi, Polymer coil–globule phase transition is a universal folding principle of Drosophila epigenetic domains, 2019, 12, 1756-8935, 10.1186/s13072-019-0269-6
    9. Daniel Jost, Cédric Vaillant, Peter Meister, Coupling 1D modifications and 3D nuclear organization: data, models and function, 2017, 44, 09550674, 20, 10.1016/j.ceb.2016.12.001
    10. Hossein Salari, Geneviève Fourel, Daniel Jost, Transcription regulates the spatio-temporal dynamics of genes through micro-compartmentalization, 2024, 15, 2041-1723, 10.1038/s41467-024-49727-7
    11. Amith Z. Abdulla, Maxime M. C. Tortora, Cédric Vaillant, Daniel Jost, Topological Constraints and Finite-Size Effects in Quantitative Polymer Models of Chromatin Organization, 2023, 56, 0024-9297, 8697, 10.1021/acs.macromol.3c01182
    12. R. Tiani, M. Jardat, V. Dahirel, Phase transitions in chromatin: Mesoscopic and mean-field approaches, 2025, 162, 0021-9606, 10.1063/5.0236019
  • Reader Comments
  • © 2020 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)
通讯作者: 陈斌, bchen63@163.com
  • 1. 

    沈阳化工大学材料科学与工程学院 沈阳 110142

  1. 本站搜索
  2. 百度学术搜索
  3. 万方数据库搜索
  4. CNKI搜索

Metrics

Article views(6081) PDF downloads(262) Cited by(4)

Article outline

Figures and Tables

Figures(2)

/

DownLoad:  Full-Size Img  PowerPoint
Return
Return

Catalog