Loading [MathJax]/jax/element/mml/optable/MathOperators.js
Research article

The generalization ability of logistic regression with Markov sampling

  • In the case of non-independent and identically distributed samples, we propose a new ueMC algorithm based on uniformly ergodic Markov samples, and study the generalization ability, the learning rate and convergence of the algorithm. We develop the ueMC algorithm to generate samples from given datasets, and present the numerical results for benchmark datasets. The numerical simulation shows that the logistic regression model with Markov sampling has better generalization ability on large training samples, and its performance is also better than that of classical machine learning algorithms, such as random forest and Adaboost.

    Citation: Zhiyong Qian, Wangsen Xiao, Shulan Hu. The generalization ability of logistic regression with Markov sampling[J]. Electronic Research Archive, 2023, 31(9): 5250-5266. doi: 10.3934/era.2023267

    Related Papers:

    [1] José M. Mazón . The total variation flow in metric graphs. Mathematics in Engineering, 2023, 5(1): 1-38. doi: 10.3934/mine.2023009
    [2] Filippo Gazzola, Gianmarco Sperone . Remarks on radial symmetry and monotonicity for solutions of semilinear higher order elliptic equations. Mathematics in Engineering, 2022, 4(5): 1-24. doi: 10.3934/mine.2022040
    [3] Rupert L. Frank, Tobias König, Hynek Kovařík . Energy asymptotics in the Brezis–Nirenberg problem: The higher-dimensional case. Mathematics in Engineering, 2020, 2(1): 119-140. doi: 10.3934/mine.2020007
    [4] Qiang Du, Tadele Mengesha, Xiaochuan Tian . $ L^{p} $ compactness criteria with an application to variational convergence of some nonlocal energy functionals. Mathematics in Engineering, 2023, 5(6): 1-31. doi: 10.3934/mine.2023097
    [5] Giuseppe Procopio, Massimiliano Giona . Bitensorial formulation of the singularity method for Stokes flows. Mathematics in Engineering, 2023, 5(2): 1-34. doi: 10.3934/mine.2023046
    [6] Márcio Batista, Giovanni Molica Bisci, Henrique de Lima . Spacelike translating solitons of the mean curvature flow in Lorentzian product spaces with density. Mathematics in Engineering, 2023, 5(3): 1-18. doi: 10.3934/mine.2023054
    [7] Juan-Carlos Felipe-Navarro, Tomás Sanz-Perela . Semilinear integro-differential equations, Ⅱ: one-dimensional and saddle-shaped solutions to the Allen-Cahn equation. Mathematics in Engineering, 2021, 3(5): 1-36. doi: 10.3934/mine.2021037
    [8] Konstantinos T. Gkikas . Nonlinear nonlocal equations involving subcritical or power nonlinearities and measure data. Mathematics in Engineering, 2024, 6(1): 45-80. doi: 10.3934/mine.2024003
    [9] Yoshihisa Kaga, Shinya Okabe . A remark on the first p-buckling eigenvalue with an adhesive constraint. Mathematics in Engineering, 2021, 3(4): 1-15. doi: 10.3934/mine.2021035
    [10] Nguyen Cong Phuc, Igor E. Verbitsky . Uniqueness of entire solutions to quasilinear equations of $ p $-Laplace type. Mathematics in Engineering, 2023, 5(3): 1-33. doi: 10.3934/mine.2023068
  • In the case of non-independent and identically distributed samples, we propose a new ueMC algorithm based on uniformly ergodic Markov samples, and study the generalization ability, the learning rate and convergence of the algorithm. We develop the ueMC algorithm to generate samples from given datasets, and present the numerical results for benchmark datasets. The numerical simulation shows that the logistic regression model with Markov sampling has better generalization ability on large training samples, and its performance is also better than that of classical machine learning algorithms, such as random forest and Adaboost.



    Dedicated to Professor Neil Trudinger on the occasion of his 80th birthday.

    We consider the fourth-order total variation flow equation in Rn of the form

    ut=Δdivu|u|. (1.1)

    We aim to give explicit description of its solutions emanating from piecewise constant radial data. However, it turns out that the definition of a solution is itself non-trivial since Δ does not have a bounded inverse on L2(Rn). Our first goal is thus to provide a rigorous definition of a solution. Our second goal is to find explicit formula for the solution to (1.1) when the initial datum u(0,x)=u0(x) is the characteristic function of a ball or an annulus. In other words,

    u0=a01BR0oru0=a01AR10R00a0R,

    where 1K is the characteristic function of a set KRn, i.e.,

    1K(x)={1,xK0,xRnK.

    Here BR denotes the open ball of radius R centered at 0Rn and AR1R0 denotes the annulus defined by AR1R0=BR1¯BR0. Our major concern is whether or not the solution remains a characteristic function throughout the evolution. For example, in the case u0=a01BR0, whether or not the solution u of (1.1) is of the form

    u(t,x)=a(t)1BR(t)

    with a function a=a(t). In other words, we are asking whether the speed ut on the ball BR(t) and on its complement are constant in the spatial variable. As in the second-order problem [3] (see also [17]), this leads to the notion of calibrability of a set. In the case of the second-order problem ut=div(u/|u|), a ball and its complement are always calibrable and R(t)R0, i.e., the ball does not expand nor shrink [3]. In our problem, R(t) may be non-constant.

    We first note that the definition of a solution itself is non-trivial. The fourth-order total variation flow has been mainly studied in the periodic setting [12,14] or in a bounded domain with some boundary conditions [15]. Formally, it is a gradient flow of the total variation functional

    TV(u):=Ω|u|

    with respect to the inner product

    (u,v)1=Ωu(Δ)1v

    when Ω is a domain in Rn or a flat torus Tn. In the periodic setting, i.e., Ω=Tn as in [12,14], it is interpreted as a gradient flow in H1av which is the dual space of H1av, the space of average-free H1 functions equipped with the inner product

    (u,v)1=Ωuv.

    For the homogeneous Dirichlet boundary condition with bounded Ω, H1av is replaced by D1, the dual space of D10=D10(Ω), which is the completion of Cc(Ω) in the norm associated with the inner product (u,v)1; here Cc(Ω) denotes the space of all smooth functions compactly supported in Ω. By the Poincaré inequality, both H1av and D10(Ω) can be regarded as subspaces of L2(Ω). However, if Ω equals Rn, the situation is more involved. If n3, D10(Rn) is continuously and densely embedded in L2(Rn), where p=np/(np) so that 2=2n/(n2), by the Sobolev inequality. In fact,

    D10(Rn)=D1(Rn)L2(Rn),D1(Rn)={uL1loc(Rn)|uL2(Rn)}

    see e.g., [11]. On the other hand, if n2, D10 is isometrically identified with the quotient space ˙D1(Rn):=D1(Rn)/R, when D1(Rn) is equipped with inner product (u,v)1 [11]. Thus, we need to be careful when n2 because an element of D10(Rn) is determined only up to a constant. In any case, D10(Rn) is a Hilbert space with the scalar product

    (u,v)D10(Rn)=Rnuv.

    Therefore, we can identify D10(Rn) with its dual space by means of the isometry

    Δ:u(u,)D10(Rn).

    On the other hand, let us define a subspace ˜D1(Rn)D10(Rn) by

    ˜D1(Rn)={wRnuw:uCc(Rn)}if n3,
    ˜D1(Rn)={wRnuw:uCc,av(Rn)}if n=1 or n=2,

    where

    Cc,av(Rn)={uCc(Rn):Rnu=0}.

    Then the closure D1(Rn) of ˜D1(Rn) coincides with D10(Rn) [11]. Note that the restriction to Cc,av(Rn) in the definition of ˜D1(Rn) in n=1,2 is necessary for the functionals to be well-posed on D1(Rn)/R. In any case, since (by definition) the space of test functions D(Rn) is continuously and densely embedded in D10(Rn), we also have a continuous embedding D1(Rn)=D10(Rn)D(Rn). Throughout the paper, we will often drop (Rn) in the notation for spaces of functions on Rn, e.g., D1=D1(Rn).

    In the first step, we give a rigorous definition of the total variation functional TV on D1. Then we calculate the subdifferential of TV in D1 space. Since it is a homogeneous functional, we are able to apply a duality method [3] to characterize the subdifferential, provided that TV is well approximated by nice functions in D1. We know that Cc,av(Rn) is dense in D1 for n2; see e.g., [11]. However, it is not immediately clear whether TV is simultaneously approximable. Fortunately, it turns out that for any wD1, there is a sequence wkCc,av(Rn) which converges to w in D1 and TV(wk)TV(w) as k. This approximation part is relatively involved since we have to use an efficient cut-off function. Using the approximation, we are able to characterize the subdifferential D1TV of TV in D1 by adapting the argument in [3]. Namely, we have

    D1TV(u)={v=ΔdivZ|ZL(Rn), |Z|1, u,divZ=TV(u)},

    where ,  denotes the canonical paring of D1 and D10. A vector field Z corresponding to an element of the subdifferential is often called a Cahn-Hoffman vector field. The equation (1.1) should be interpreted as the gradient flow of TV in D1, i.e.,

    utD1TV(u), (1.2)

    and its unique solvability for any initial datum u0D1 is guaranteed by the classical theory of maximal monotone operators ([6,24]). By our characterization of the subdifferential, we are able to give a more explicit definition of a solution which is consistent with that proposed in [15]. Namely, for u0D1 with TV(u0)<, a function uC([0,T[,D1) is a solution to (1.2) with u(0)=u0 if and only if there exists ZL(]0,T[×Rn) satisfying divZL2(0,T;D10(Rn)) such that

    ut=ΔdivZinD1(Rn) (1.3)
    |Z(t,x)|1for a.e. xRn (1.4)
    u,divZ=TV(u) (1.5)

    for a.e. t]0,T[. This is convenient for calculating explicit solutions.

    Unfortunately, in n2, for a compactly supported square integrable function u0, we know that u0D1 if and only if u0 is average-free, i.e., Rnu0=0 (see Lemma 17). Thus, the characteristic function of any bounded, measurable set of positive measure does not belong to D1. We have to extend a class of initial data u0 such that u0=ψ+w0 with w0D1 while ψ is a fixed compactly supported L2 function. We consider a gradient flow utD1TV(u) in the affine space ψ+D1. Since D1 is a directional partial derivative in the direction of D1, it is more convenient to consider solutions to evolutionary variational inequality

    12ddtu(t)g2D1TV(g)TV(u(t))for a.e. t>0 (1.6)

    for any gψ+D1 [2]. In the case ψ=0, it is easy to show that the evolutionary variational inequality is equivalent to (1.2). Indeed, by definition of the subdifferential, (1.2) is equivalent to

    (ut,gu(t))D1TV(g)TV(u(t))

    for any gD1. The left-hand side equals (d/dt)(ug2/2). Thus, the equivalence follows if ψ=0. From now on we assume that Rnψ0.

    It is easy to check that there is at most one solution to the evolutionary variational inequality (1.6). The solution u is constructed by solving

    wtD1TV(w+ψ)withw(0)=w0=u0ψ

    and setting u=w+ψ. Characterization of the (directional) subdifferential is more involved since wTV(w+ψ) is no more positively one-homogeneous. We identify the one dimensional space {cψ|cR} with R and consider the Hilbert space E1 defined as the orthogonal sum D1R. We calculate the subdifferential by the duality method since TV is now positively one-homogeneous on E1. We then project this subdifferential onto D1 to get a characterization of a (directional) subdifferential D1TV. We end up with a characterization of solution to (1.2) similar to (1.3)–(1.5), with (1.5) adjusted in a suitable way. If we also denote E10=D1 in n2, E10=D10, E1=D1 in n3 and

    u,vE={u,v if n3,w,[v]+cψv, where u=w+cψ, wD1 if n2, (1.7)

    for uE1, vE10, we end up with the following definition of solution.

    Definition 1. Assume that u0E1. We say that uC([0,[,E1) with utL2loc(]0,[,D1) is a solution to (1.1) with initial datum u0 if there exists ZL(]0,[×Rn) with divZ(t,)E10 for a.e. t>0 such that

    ut=ΔdivZinD1(Rn) (1.8)
    |Z(t,x)|1fora.e.xRn (1.9)
    u,divZE=TV(u) (1.10)

    for a.e. t>0.

    and associated well-posedness result.

    Theorem 2. Let u0E1. There exists a unique solution to (1.1) with initial datum u0.

    Our next problem is whether or not the speed of a characteristic function of a set is spatially constant inside and outside of the set. By the general theory ([6,24]), the speed is determined by the minimal section (canonical restriction) 0D1TV of D1TV. In other words, 0D1TV(u)=v0 minimizes vD1 for vD1TV(u), i.e.,

    0D1TV(u):=argmin{vD1|vD1TV(u)}.

    To motivate the notion of calibrability, we consider a smooth function u such that

    ¯U={xRn|u(x)=0},

    where U is a smooth open set. Outside ¯U, we assume that u0. To fix the idea, we assume that U has negative signature (orientation) in the sense that u<0 outside ¯U. By our specification of u, we see that

    0D1TV(u)=argmin{divZD10||Z|1 in U, Z=u/|u| in ¯Uc, divZD10}.

    Since divZ is locally integrable, Zν does not jump across U, where ν is the exterior unit normal of U. In this case,

    Zν=Zu/|u|=1on U. (1.11)

    Since divZ does not have a singular part, divZ does not jump across U. In this case,

    divZ=divνon U. (1.12)

    However, v=ΔdivZ may have a non-zero singular part concentrated on U even if v=v0, i.e., v is the minimizer. This phenomenon is observed in [12,20,21] in a one-dimensional periodic setting. Different from the second-order problem, this causes expansion or shrinking of the ball when the solution u is of the form u(t,x)=a(t)1BR(t)(x). If u>0 outside U, the minus in (1.11) and (1.12) should be replaced by the plus.

    If 0D1TV(u) is constant on BR(t) and (¯BR(t))c, this property is preserved under the evolution, which leads us to definition of calibrability. Note that the value of 0D1TV(u) on U is determined by U and its signature does not depend on particular value of u. We say that U (with negative signature) is calibrable if there exists Z0L(U,Rn) such that divZ0L2(U,Rn), Z0 satisfies (1.11), (1.12), |Z0|1 a.e. in U and ΔdivZ0 is constant on U. We call any such Z0 a calibration for U.

    Recall that in the case of the second-order problem, we say that U (with negative signature) is calibrable if there exists ˜Z0L(U,Rn) satisfying (1.11), |˜Z0|1 a.e. in U and div˜Z0 is a constant function on U. This is formally equivalent to calibrability in [3,4]. It can be shown that ˜Z0 is a calibration for U if and only if

    ˜Z0argmin{divZL2(U)|z satisfies (1.11) and |Z|1 a.e. in U}

    and div˜Z0 is a constant function on U, which is the definition of calibrability in [17].

    Going back to our fourth-order problem, if Z0 is a calibration for U, then w=divZ0 must satisfy

    Δw=λin U (1.13)
    w=divνon U (1.14)

    with some constant λ. If U is bounded, λ is determined by (1.11) since

    UwdLn=UdivZ1dLn=UZ1νdHn1=Hn1(U), (1.15)

    where Hn1 denotes the n1 dimensional Hausdorff and Ln denotes the Lebesgue measure in Rn. Using this fact, in Section 5 we prove that if Z0 is a calibration for a bounded U, then

    Z0argmin{divZL2(U)|Z satisfies (1.11), (1.12) and |Z|1 a.e. in U}. (1.16)

    Moreover, we obtain an "explicit" formula for the constant λ in terms of the Saint-Venant problem in U.

    In the radially symmetric setting, it is not difficult to show that Z0 in (1.16) can be chosen in the form z(|x|)x|x|. Indeed, if Z0 is belongs to the set of minimizers (1.16), then its rotational average ¯Z0 belongs to (1.16) as well, because averaging preserves (1.11), (1.12) and the inequality |Z|1. Since the angular part of ¯Z0 does not contribute to the divergence, it is possible to delete this part (Lemma 31). We thus conclude that there is an element of (1.16) of form Z(x)=z(|x|)x|x|. Thus, the Eq (1.13) can be written as the third-order ODE of the form

    r1n(rn1(r1n(rn1z)))=λ (1.17)

    since divZ=r1n(rn1z). If U is BR with negative signature, condition (1.11) implies

    z(R)=1. (1.18)

    Since divZ=z+(n1)z/r, condition (1.12) implies that

    z(R)=0. (1.19)

    Solving (1.17) under the assumption that z is smooth near zero under conditions (1.18), (1.19), we eventually get a unique solution (1.17)–(1.19) of the form

    z(r)=12(rR)332rR,λ=n(n+2)R3

    for all n1. It is easy to see that Z(x)=z(|x|)x|x| satisfies the constraint |Z|1 in BR. We conclude that all balls are calibrable. More careful argument is necessary, but we are able to discuss calibrability of an annulus as well as a complement of a ball.

    Theorem 3. (i) All balls are calibrable for all n1.

    (ii) All complement of balls are calibrable except n=2.

    (iii) If n=2, all complement of balls are not calibrable.

    (iv) All annuli (with definite signature) are calibrable except in n=2.

    (v) For n=2, there is Q>1 such that an annulus (with definite signature) is calibrable if and only if the ratio of the exterior radius over the interior radius is smaller than or equal to Q. In other words, AR1R0 is calibrable if and only if R1/R0Q.

    Theorem 3(v) is consistent with (iii) since R1 implies AR1R0 converges to ¯BR0c, a complement of the closure of the ball BR0. Note that in the case of an annulus, there is a possibility we take a signature which is different on the exterior boundary BR1 and the interior boundary BR0. We also study such indefinite cases.

    We now calculate an explicit solution of (1.1) starting from u0=a01BR0. We first discuss the case n2. Since a ball and its complement is calibrable, the solution is of the form

    u(t,x)=a(t)1BR(t). (1.20)

    We take the (radial) calibration Zin in BR(t) and Zout in Rn¯BR(t) and set

    Z(x,t)={Zin(x),xBR(t)Zout(x),xRn¯BR(t).

    Here Zout(x)=zout(|x|)x|x| can be calculated as

    zout(r)=n12(rR)3+n32(rR)1n

    while, as we already discussed, zin for Zin(x)=zin(|x|)x|x| is of the form

    zin(r)=12(rR)332rR.

    This Z satisfies (1.9) and (1.10), and moreover divZD10 for any t>0. Moreover, divZ is continuous across BR(t). However, divZ may jump across BR(t). Actually,

    ΔdivZ=λ1BR(t)+ν(divZindivZout)δBR(t),

    where δΓ(φ)=ΓφdHn1 or for a hypersurface and is the exterior unit normal of , i.e., . Here . By a direct calculation, the quantity . Since , by

    we conclude that

    Since

    an explicit form of a solution is given as

    As noticed earlier, in the case , the complement of the disk is not calibrable. If is a radially strictly decreasing function outside , we expect for . In [15], it is proposed that a solution to (1.1) must satisfy

    Since and , this implies

    (1.21)

    In the case , so is a Cahn-Hoffman vector field.

    If we start with with for , the expected form of a solution is

    (1.22)

    where

    (1.23)

    Analyzing this ODE system, we can deduce qualitative properties of the solution. Summing up our results yields.

    Theorem 4. Let with .

    If , then the solution to (1.1) with initial datum is of the form

    and for . (The time is called the extinction time.) Moreover, is decreasing and as .

    (i) is increasing and as for .

    (ii) for .

    (iii) is decreasing and as for .

    If , then the solution is not a characteristic function for . It is of the form (1.21) and moves by (1.23). In particular, there is no extinction time, is increasing and is decreasing. Moreover, and as . The gap is always positive.

    If , then the solution is of the form for . Moreover, is increasing and is decreasing with and as .

    We note that the infinite extinction time observed in is related to the fact that is not an element of the affine space where the flow lives if . In [14], finite time extinction for solution to (1.1) is proved in a periodic setting for average zero initial data when the space dimension Our result is unrelated to their result because we consider (1.1) in .

    The formula (1.21) does not give a solution to (1.1) when since does not belong to . In the case , this formula is consistent with our definition. If we consider strictly radially decreasing for and for , then does not belong to the domain of for . In other words, there is no Cahn-Hoffman vector field.

    These results contrast with the second-order total variation flow

    In the second-order problem, a ball and an annulus are always calibrable with their complements, see e.g., [3] or [17, Section 5]. Furthermore, is a locally integrable function without singular part for . Thus, for example, the solution starting from () must be with , where is the Cheeger ratio, i.e., . In particular, the extinction time equals .

    We conclude this paper by deriving a system of ODEs prescribing the solution in the case when the initial datum is a piecewise constant, radially symmetric function, which we call a stack. To be precise, we say that is a stack if it is of the form

    , . In particular, we obtain

    Theorem 5. Let and let be a stack. If is the solution to (1.1), then is a stack for .

    In the case , this result is no longer true, as evidenced by Theorem 4. However the solution can still be prescribed by a finite system of ODEs.

    A total variation flow type equation

    (1.24)

    was introduced by [31] to describe the height of crystal surface moved by relaxation dynamics below the roughening temperature, where . For this equation, characterization of the subdifferential of the corresponding energy was given by Y. Kashima in periodic setting [20,21] and under Dirichlet condition on a bounded domain [21]. The speed of a facet (a flat part of the graph) is calculated for in [20] and for a ball with the Dirichlet condition under radial symmetry [21]. Different from the second-order problem, the speed of a facet is determined not only by the shape of facet. Also it has been already observed in [20], that the minimal section may have a delta part although the behavior of the corresponding solution was not studied there. A numerical computation was given in [23]. The Eq (1.24) was derived as a continuum limit of models describing motion of steps on crystal surface as discussed in [27], where numerical simulation was given; see also [22].

    In [7], a crystalline diffusion flow was proposed and calculated numerically. In a special case, it is of the form , where is a piecewise linear convex function, when the curve is given as the graph of a function. This equation was analyzed in [13] in a class of piecewise linear (in space) solutions.

    Fourth-order equations of type (1.1) were proposed for image denoising as an improvement over the second-order total variation flow. For example, the equation

    where is an original image which is given and , corresponds to the Osher-Solé-Vese model [28]. The well-posedness of this equation was proved by using the Galerkin method by [8].

    For (1.1), an extinction time estimate was given in [14] for in the periodic setting. It was extended to the Dirichlet problem in a bounded domain by [15]. In the review paper [12], it was proved that the solution of (1.1) in may become discontinuous instantaneously even if the initial datum is Lipschitz continuous, because the speed may have a delta part.

    There are a few numerical studies for (1.1) in the periodic setting. A duality-based numerical scheme which applies the forward-backward splitting has been proposed in [16]. A split Bregman method was adjusted to (1.1) and also (1.24) in [18]. In these methods, the singularity of the equation at is not regularized. However, all above studies deal with either periodic, Dirichlet or Neumann boundary condition for a bounded domain. It has never been rigorously studied in , although in [15] there are some preliminary calculations for radial solution in .

    This paper is organized as follows. In Section 2, we discuss basic properties of the total variation on , notably we show strict density of . In Section 3, we give a rigorous definition of a solution to (1.1) and obtain a verifiable characterization of solutions. In Section 4, we extend the results of the previous section to include initial data with non-zero average in . In Section 5, we introduce the notion of calibrability. In Section 6, we discuss calibrability of rotationally symmetric sets in . In Section 7, we study solutions emanating from piecewise constant, radially symmetric data.

    In this section, we give a rigorous definition of the total variation on and relate it to the usual total variation defined on . The main tool that we use here as well as in the following section is an approximation lemma, which for a given produces a sequence of nice functions that converges to in and .

    Let us denote

    We define by

    Let us compare this definition with the usual total variation, which we denote here by , defined by

    First of all, as in the case , we easily check that is lower semicontinuous with respect to the weak-* (and, a fortiori, strong) convergence in . Indeed, if in ,

    In fact, we have

    Lemma 6. We have , and so with and coinciding on . In particular, if , . If ,

    The proof of this fact is a consequence of the lemma below and we postpone it.

    Lemma 7. For any there exists a sequence such that

    and

    To prove it, we will use a special choice of cut-off function and associated variant of the Sobolev-Poincaré inequality. For , let us denote by the element of minimal norm in among those that satisfy if , if . It is an easy exercise to show that for

    In either case,

    (2.1)

    Lemma 8. If and , then for all , there holds

    (2.2)

    with and

    (2.3)

    with a different .

    Proof. Let . Following the proof of the standard Poincaré inequality by contradiction using Rellich-Kondrachov theorem, we obtain

    Applying the Sobolev inequality in to the function , we upgrade this to

    (2.4)

    Next, let for a given . We observe that

    and so, by a change of variables ,

    Applying the same change of variables to both sides of (2.4) we conclude the proof of (2.2).

    The proof of (2.3) is a matter of direct calculation.

    Let us now return to the proof of the approximation lemma.

    Proof of Lemma 7. Given , let

    Equivalently, for ,

    Denoting ,

    We estimate the second term on the r. h. s. using the Poincaré inequality from Lemma 8, taking into account that the support of the integrand is contained in , where ,

    Thus,

    and so

    (2.5)

    Next we estimate . Due to lower semicontinuity of , we can assume without loss of generality that . First, we note that ([29,30]) and

    Thus, for any ,

    In particular, this implies that for and . By Lemma 8,

    (2.6)

    If , together with lower semicontinuity of , this yields

    Taking into account (2.5), by a diagonal procedure we can select sequences , such that satisfies both requirements in the assertion. On the other hand, if or , (2.6) only implies uniform boundedness of .

    Let now . Since have compact support, uniform boundedness of implies uniform bound on in by the Sobolev inequality. As converges to in , we have . This allows us to improve (2.6):

    (2.7)

    The r. h. s. of (2.7) converges to as and we conclude as before.

    Next, consider . In this case, finiteness of implies that and there exist such that

    Now, let be the element of minimal norm in under constraints

    (Clearly, is a continuous, piecewise affine function.) We have

    On the other hand, since are compactly supported and coincides as distribution with a locally integrable function, we can calculate

    so . Therefore, we can estimate

    (2.8)

    Since we have shown that as , the averages on the r. h. s. converge to and we conclude as before.

    As a first application of the approximation lemma, we demonstrate Lemma 6 announced before.

    Proof of Lemma 6. Let and let be the sequence provided by Lemma 7. Let first . Since is uniformly bounded in , by the Sobolev embedding is uniformly bounded in . Therefore, .

    In case , since are compactly supported and uniformly bounded in , is uniformly bounded in . From these two bounds it follows that and that is a finite signed measure on . In particular, has essential limits at . Reasoning as in the final part of the proof of Lemma 7, we show that these limits vanish.

    For a gradient flow of a convex functional, there is a general theory initiated by Y. Kmura [24] and developed by H. Brézis [6] and others. It is summarized as follows.

    Proposition 9 ([6]). Let be a real Hilbert space. Let be a lower semicontinuous, convex functional on with values in . Assume that is dense in . Then, for any , there exists a unique solution which is absolutely continuous in (for any ) satisfying

    (3.1)

    Moreover,

    If , then is allowed. In particular, .

    As in [2], this solution satisfies the evolutionary variational inequality

    for any . Indeed, by definition, is equivalent to saying

    for any . The evolutionary variational inequality is not only an equivalent formulation of the gradient flow , but also apply to a gradient flow of a metric space by replacing by distance between and ; see [2] for the theory.

    To be able to actually find solutions to (3.1), we need to characterize the subdifferential of the total variation in the space . The basic idea of the proof is a duality argument, which has been carried out in the case of subdifferentials. In the case of setting, the idea goes back to the unpublished note of F. Alter and a detailed proof is given in [3]. Let be a functional on a real Hilbert space equipped with an inner product . The main idea is to characterize the subdifferential by the polar of which is defined by

    where . We first recall a lemma [3, Lemma 1.7].

    Lemma 10. Let be convex. Assume that is positively one-homogeneous, i.e.,

    for all , . Then, if and only if and .

    Remark 11. By general theory of convex functionals, we know that

    if is a non-negative, lower semicontinuous, convex, positively one-homogeneous functional [3, Proposition 1.6].

    This property is essential for the proof of

    Theorem 12. Let by defined by

    Then .

    Remark 13. (i) By definition, is a convex, lower semi-continuous, positively one-homogeneous function, so .

    (ii) if , the infimum is attained. Theorem 12 together with Lemma 10 implies the following characterization of the subdifferential of .

    Theorem 14. An element belongs to if and only if there is with such that

    (i)

    (ii)

    (iii) .

    Proof. By Lemma 10 and Theorem 12,

    The property together with Remark 13(ii) implies (i), (ii) and .

    It is not difficult to check the converse.

    Proof of Theorem 12. The inequality :

    We take with . By Remark 13(ii), there is with with such that . By Lemma 7, there is such that , in . We observe that

    Sending , we conclude that

    By definition of , this implies .

    The inequality :

    By definition,

    Since

    we observe that

    This implies that .

    Now that the subdifferential of in is calculated, we are able to justify an explicit definition of a solution proposed in [15].

    Theorem 15. Assume that . Then is a solution of with in the sense of Proposition 9 if and only if there exists satisfying

    such that for a.e. there holds

    and

    (If , is allowed.)

    The Theorem essentially follows from Theorem 14. We only need to justify that a Cahn-Hoffman vector field defined separately for every time instance by Theorem 14 can be chosen to be jointly measurable, i.e., . As in the second-order case [3, section 2.4], this can be done by recalling that Bochner functions can be well approximated by piecewise constant functions. Since our situation is slightly different, let us include the argument for completeness. We begin with a lemma which is a version of [3, Lemma A.8] suited to our needs.

    Lemma 16. Let , where is a Banach space and let have Lebesgue measure . Then for each there exists a countable family of disjoint closed intervals , , such that is a Lebesgue point for , ,

    and

    Proof. Referring e.g., to [6, p. 140], -a.e. point is a Lebesgue point for , i.e.,

    Let be the set of Lebesgue points of contained in and let us take

    Using Besicovitch covering theorem [9, Corollary 1 on p. 35] with and (for example) , we obtain a candidate for the family . We check that indeed

    Proof of Theorem 15. Applying Lemma 16 to , for each we obtain a partition of (up to a set of Lebesgue measure ) into disjoint closed intervals , such that

    (3.2)

    and

    By (3.2) and Theorem 14, for there exist such that

    Further denoting

    we have

    We immediately deduce that there exists with a.e. and a subsequence such that converges to weakly-* in . Moreover, for any ,

    at least along a subsequence. Since on the other hand

    we infer that and in particular in . Finally, we observe that in . Moreover, since is non-increasing, we have in . Therefore, for any ,

    Unfortunately, in the case , the characteristic function of a set of positive measure is not in since . We shall define a new space containing as follows. We take a function with compact support such that . We introduce a vector space

    This space is independent of the choice of . Indeed, let be compactly supported and (). An element can be rewritten as

    The next lemma implies since . We then conclude that .

    Lemma 17. Assume that . A compactly supported function belongs to if and only if .

    Proof. If , then

    where stands for the element of whose representatives are as well as .

    Now suppose that a compactly supported function satisfies . Given a , by the Poincaré inequality, we have for any , independently of the representative ,

    In particular, . Taking into account this and the assumption , we see that the linear functional

    (4.1)

    on is well defined. Moreover, if is large enough that ,

    Thus, the functional defined by (4.1) is bounded, i.e., .

    Since is independent of the choice of , we suppress and denote this space by . In case , we will use notation . We also denote if , if . For , , we denote

    (4.2)

    where , , , . As before, we check that the value of does not depend on the choice of this decomposition.

    We recall that if , and come with a Hilbert space structure. We also define inner products on , in case by

    for , , (). This gives an orthogonal decomposition

    We note that although the values of those products may depend on the choice of , the topologies they induce on , do not. Formula (4.2) associates to any a continuous linear functional on . The resulting mapping is an isometric isomorphism between and the continuous dual to .

    We extend onto by defining

    As usual, we check that is a convex, weakly-* (and strongly) lower semicontinuous functional. In particular, for a fixed , the functional is convex and lower semicontinuous on . We next give a definition of a solution of in the space . It turns out the idea of evolutionary variational inequality is very convenient since it is a flow in an affine space for some .

    Definition 18. Assume that . We say that is a solution to

    (4.3)

    in the sense of EVI (evolutionary variational inequality) with initial datum if

    (i) is absolutely continuous on (for any ) with values in and continuous up to zero with and

    (ii) satisfies the evolutionary variational inequality, i.e.,

    holds for a.e. , provided that is such that .

    Theorem 19. For any , there exists a unique solution of (4.3) in the sense of EVI. Moreover, if is the solution to (4.3) in the sense of EVI with for , then

    (4.4)

    provided that .

    Proof. Uniqueness follows from contractivity (4.4), which is established by a standard reasoning [2]. We give a short proof for the reader's convenience and for completeness. Let () be a solution to (4.3) in the sense of EVI with initial datum such that . Since are solutions, we also have for and so , . Thus, EVI yields

    for a.e. . We conclude that is non-increasing, in particular (4.4) holds.

    The existence is a bit more involved. For with , we consider the gradient flow of the form

    (4.5)

    Applying Proposition 9, there is a unique solution to (4.5) for . This solution satisfies the evolutionary variational inequality

    for any . Setting , , we end up with

    Since can be taken arbitrary such that , this shows that is the solution of (4.3) in the sense of EVI; condition (i) follows easily from Proposition 9.

    It is non-trivial to characterize the subdifferential . For this purpose, we introduce a mapping which plays a role analogous to in .

    Lemma 20. Let . The mapping defined by

    is an isometric isomorphism.

    Proof. It is clear that and is linear. For given with , , there is such that . Since a representative of is determined up to an additive constant, there is a unique representative such that

    Thus, the mapping is surjective. If , then so . Thus is a constant. Since , this constant must be zero, so . Thus, is injective. Recalling our definitions of inner products on , , it is easy to check that is an isometry.

    We have a characterization of the polar of in as in Theorem 12.

    Theorem 21. Let . Let be given by

    for . Then .

    Admitting this fact, we are able to give a characterization of the subdifferential.

    Theorem 22. Let . An element belongs to if and only if there is with such that

    (i)

    (ii)

    (iii)

    Proof of Theorem 22. The proof parallels that of Theorem 14. By Lemma 10 and Theorem 21

    The properties (i), (ii) together with are equivalent to . Since

    (4.6)

    the Euler equation is equivalent to (iii).

    Proof of Theorem 21. The proof parallels that of Theorem 12. We first prove that

    for . This implies . The estimate formally follows from the identity (4.6). Indeed, by (4.6), we see

    If is in , then, by this formula, we obtain

    By approximation, as in the proof of Theorem 12, we conclude the desired estimate.

    The other inequality follows from . The proof of is parallel to that of Theorem 12 by replacing by and the inner product by the inner product, respectively, if one notes the identity (4.6). Since is an isometry, is lower semicontinuous, and we conclude that by Remark 11.

    We have to be careful, since the gradient flow

    does not correspond to the total variation flow . By Theorem 22(iii), if . Thus the gradient flow is formally of the form

    To recover the original total variation flow, we consider "partial" subdifferential in the direction of . Let be the orthogonal projection from to . Then, by definition,

    The equation

    is now formally of the form

    since is time-independent. Here is a precise statement.

    Theorem 23. Let . Consider the functional on for a fixed and . Then, for . In particular, an element belongs to if and only if there is with such that

    (i) ,

    (ii) ,

    (iii) .

    (In case , by we understand .) This characterization is important to calculate the solution of for explicitly. In fact, we can recover the characterization of a solution in the sense of EVI analogous to the one in Theorem 15, amounting to Theorem 2.

    Proof of Theorem 2. This is almost immediate. However, like in the case of Theorem 15, we need to justify that the vector field can be chosen to be jointly measurable with respect to . We proceed as in the proof of Theorem 15. The difference is, now we need to pass to the limit with

    but convergence of in gives only . To deal with this problem, let us choose and let be such that for . Then we also have for , and

    Testing with and using weak-* convergence of in we get

    at which point we conclude as in the proof of Theorem 15.

    We are interested in sets where the speed of solution is spatially constant. The speed is given as minus the minimal section of the subdifferential, i.e.,

    Since is closed and convex, is uniquely determined if . Since we have characterized the subdifferential, we end up with

    Although the minimizer is unique, the corresponding may not be unique. Let be a smooth open set in . We consider a smooth function such that

    and . Such a closed set is often called a facet. Assume further that outside . Let be a vector field satisfying for . It is easy to see that outside the facet ,

    by . Since , we see that

    Since is locally integrable, the normal trace is well-defined from inside as an element of [3] and it must agree with that from outside, i.e.,

    (5.1)

    where is the exterior unit normal of and

    Since is in , its trace from inside and outside must agree, i.e.,

    Note that is the sum of all principal curvatures, equal to times the (inward) mean curvature, of at .

    Let be a minimizer corresponding to . Since the value outside is always the same, we consider its restriction on and still denote by . Then, is a minimizer of

    (5.2)

    Although so that , the quantity may jump across . Thus may contain singular part which is a driving force to move the facet boundary "horizontally" during its evolution under the fourth-order total variation equation as observed in the previous section and earlier in [12]. In the second-order problem, the speed does not contain any singular part so the jump discontinuity does not move.

    We are interested in a situation where is constant over . In the spirit of [25], we call any continuous function a signature for .

    Definition 24. Let be a smooth open set in with signature . We say that is (-)calibrable (with signature ) if there exists satisfying the constraint

    (5.3)

    with boundary conditions

    (5.4)

    with the property that

    (5.5)

    We call any such a (-)calibration for (with signature ).

    From the definition of calibration, we easily deduce

    Proposition 25. Let be a smooth bounded domain in . Assume that is a calibration for with signature . Then, is a solution to the Saint-Venant problem

    with the constraint

    (5.8)

    where is some constant.

    Appealing to this relation between calibrability and the Saint-Venant problem, we can prove the following

    Theorem 26. Let be a smooth bounded domain in . Suppose that is a calibration for with signature . Then is a minimizer of (5.2).

    Proof. We first note that must satisfy (5.8). We consider the minimization problem for

    under the Dirichlet condition (5.7) and the constraint (5.8). Since the problem is strictly convex, there is a unique minimizer in . By Lagrange's multiplier method, must satisfy (5.6) because of the constraint (5.8). (Actually, a weak solution of (5.6) is a smooth solution of (5.6), (5.7) by the standard regularity theory of linear elliptic partial differential equations [19, Chapter 6].) As we see in Lemma 27 below, the constant is uniquely determined by (5.6) and (5.8). For , there always exists such that

    (5.9)

    Indeed, let be a solution of the Neumann problem

    Such a solution always exists since satisfies the compatibility condition (5.8) and it is smooth up to ; see e.g., [11,19]. If we set , then satisfies the desired property (5.9). Thus, the minimum of the Dirichlet energy under the constraint (5.8) agrees with

    Since and , this shows that is a minimizer of (5.2).

    Lemma 27. Let be a smooth bounded domain in . Let solve

    for , . This solution can be written as

    where solves the Saint-Venant problem

    (5.10)
    (5.11)

    and is the harmonic extension of to . In particular, if is given then is uniquely determined by

    (5.12)

    since in .

    Proof. The decomposition is rather clear. The property in follows from the maximum principle [19].

    From Lemma 27 we also deduce the following formula for vertical speed of calibrable facets in terms of the solution to the Saint-Venant problem.

    Proposition 28. Let be a smooth bounded domain in and . If is a calibration for with signature satisfying

    then

    where is the solution to the Saint-Venant problem (5.10).

    Proof. We recall (5.12) and calculate

    We now compare the definition of calibrability for the second-order problem.

    Definition 29. Let be a smooth open set in with signature . We say that is (-)calibrable if there is satisfying the constraint in and the boundary condition with the property that is a constant over .

    This definition is slightly weaker than the calibrability used in [26], where is a solution of the total variation flow in with some function of ; see also [3]. This requires that is constant not only on but also . Our definition follows from that of [5].

    Definition 30. We say that a Lebesgue measurable subset (defined up to a set of measure zero) of is a generalized annulus if is non-empty, open, connected and rotationally symmetric, i.e., invariant under the linear action of on .

    It is easy to see that any generalized annulus is a ball, an annulus, the complement of a ball or the whole space . In other words, any generalized annulus is of form

    In this section we will settle the question which generalized annuli are calibrable.

    Lemma 31. Let be a generalized annulus. Suppose that is calibrable with signature . Then there exists a calibration for of form .

    Proof. Let be any calibration for . Let be the Haar measure on . We define as the average

    It is an exercise in vector calculus to check that satisfies boundary conditions (5.4) and that is a constant (equal to ) on . By convexity, . Thus is a calibration for . By definition, it is invariant under rotations, i.e.,

    for , . In the case this already shows that is in the desired form. In higher dimensions, we consider the orthogonal decomposition

    Both and are invariant under rotations. In particular, for any given , the restriction of to is an invariant tangent vector field on . Note that any such vector field is smooth. If , it follows by the hedgehog uncombability theorem [10, Proposition 7.15] that . If , any vector field invariant on is in particular invariant on a sphere containing any given point in , so the same conclusion follows. Thus, we have

    By rotational invariance, we have , which concludes the proof.

    We are left with the case in which there exists a one-dimensional space of invariant tangent fields on spanned by . Thus, we have

    We calculate

    Thus, we can disregard and choose as our calibration, since it satisfies conditions (5.3)–(5.5) (recall that and are orthogonal). As before, we see that

    Let be a generalized annulus. By Lemma 31, if is calibrable, then there exists a calibration for of form . It follows from (5.10) that needs to satisfy the ODE

    (6.1)

    The general solution to this ODE is

    (6.2)

    where if and

    (6.3)

    where if . We will now try to find a calibration for by solving a suitable boundary value problem for (6.1).

    Let . To focus attention, we choose on . In this case, boundary conditions (5.4) lead to

    (6.4)

    If , in order to satisfy the requirements and , we need to restrict to in (6.2). We make the same choice also in case , as it leads to the right result. Then, applying (6.4) in (6.2) or (6.3), we obtain a system of two affine equations for two unknowns . We solve it obtaining

    (6.5)
    (6.6)

    We check that satisfies on , so is a calibration for . Thus, all balls are calibrable in any dimension.

    Let . For consistency with the previous case, we choose on . In this case, boundary conditions (5.4) also lead to

    (6.7)

    Let us first assume that . In order to satisfy the requirement , we need to restrict to in (6.2). Again, applying (6.4) in (6.2) leads to a system of two affine equations for two unknowns . We solve it obtaining

    (6.8)

    Again, we easily check that satisfies on , so is a calibration for .

    In the omitted cases , requirement implies in (6.2). If , there exists of such form satisfying (6.7): , consistently with (6.8). On the other hand, if , applying (6.7) to (6.2) with leads to a contradiction.

    Summing up, all complements of balls are calibrable if . On the other hand, if all complements of balls turn out not to be calibrable.

    Let now , . In this case has two connected components, so there exist two distinct choices of signature: constant and non-constant. Let us first consider the former. To focus attention, we choose . Then, boundary conditions (5.4) take form

    (6.9)

    Applying (6.9) to (6.2) or (6.3) leads to a system of four affine equations with four unknowns. In the case , the solution is

    (6.10)

    This can be rewritten in a form emphasizing homogeneity:

    (6.11)

    where we denoted . We can further simplify it to

    (6.12)

    We need to check whether condition is satisfied. We calculate

    (6.13)

    Using the form (6.12), we can check that , for all . Therefore, has at most one zero on the half-line . Consequently, has at most one zero, so has at most one inflection point. Taking into account (6.9), cannot have a local extremum on . Thus, on and is a valid calibration.

    In the case , the solution is

    which can be rewritten (again, denoting ) as

    (6.14)

    As before, we calculate the second derivative of :

    The polynomial has at most positive roots, and so does . By (6.9), at least one of them belongs to . Furthermore, since for , is positive for large values of . Taking into account these observations, we deduce that on if and only if (compare Figure 1). This inequality is equivalent to

    Figure 1.  Plots of for an annulus with constant signature for two different values of in case .

    We compute

    (6.15)

    We observe that has exactly one zero on , and if and only if . Therefore, is a valid calibration for with signature if and only if . By (6.15) it is evident that . Numerical computation using Wolfram Mathematica shows that . Thus, with constant signature is calibrable if and only if . This concludes the proof of Theorem 3.

    Now, let us consider non-constant signature. We assume that on and on . This choice leads to

    (6.16)

    If , the solution to the resulting affine system is

    (6.17)

    which we rewrite as

    (6.18)

    We note that in case the solution reduces to

    while in case it reduces to

    In both of these cases is constant and we have . On the other hand, if , we can check that for . Recalling (6.13), we observe that has at most two zeros on the positive half-line and for large values of . On the other hand, by (6.16), if has local extrema on , it needs to have at least inflection points. We deduce from these conditions that has exactly one local maximum and no local minima, and therefore on (compare Figure 2). It remains to check whether on . Let now

    Figure 2.  Plots of for an annulus with non-constant signature for two values of , with , .

    Then, by (6.1), (6.16), is a solution to the second-order elliptic problem

    where

    Since , we have for . By the classical weak maximum principle [19, Theorem 3.1.],

    Now, if has a local maximum at , then , so . Consequently,

    so on . Thus, if , all annuli with non-constant signature are calibrable.

    We move to the case . Now, the solution to the affine system for coefficients of is

    or equivalently

    We can check that in this case for . By the same argument as in the previous case, we show that in (compare Figure 2), so it does not define a valid calibration. Thus, in the case all annuli with non-constant signature are not calibrable.

    In this section, our goal is to provide explicit description of solutions to (1.1) emanating from the characteristic function of a ball

    (7.1)

    In the case of second-order total variation flow, the solutions with initial datum (7.1) are known to be of form

    with finite extinction time, i.e., there exists such that for . In the fourth order case, based on the treatment of case in [12], we would expect the solutions to have the form

    (7.2)

    at least until an extinction time beyond which . This intuition turns out to be correct in every dimension except .

    Let first . As we have checked in Section 6, in this case both balls and complements of balls are calibrable. Thus, as long as the solution is of form (7.2) in time instance , we expect a valid Cahn-Hoffman vector field to be given by

    (7.3)

    where is the calibration w constructed for a ball and is the calibration we constructed for the complement of that ball, recall:

    (7.4)
    (7.5)

    We further calculate:

    It is straightforward to check that . Next, we deduce

    (7.6)

    Then, using the identity , we obtain (recall notation (7.2))

    Summing up, evolution of initial datum (7.1) is given by (7.2) with , satisfying

    (7.7)

    This system can be explicitly solved by noticing that

    and therefore

    along trajectories. The solution is

    (7.8)

    We note that the solution satisfies

    along trajectories. (This "first integral" could also have been used to solve the system (7.7).) Let us point out a few observations concerning the solutions (compare Figure 3):

    Figure 3.  Plots of the solution emanating from the characteristic function of the unit ball as a function of for chosen values of .

    ● the extinction time is equal to ,

    ● if , is increasing and as ,

    ● if , is constant,

    ● in higher dimensions, is decreasing and as .

    In the case we were able to exhibit a calibration for the ball , but not for its complement. Another possible ansatz on the Cahn-Hoffman vector field of form (7.3) is one where is the calibration we constructed for and is the choice considered in [15]:

    (7.9)

    We calculate

    (7.10)

    hence

    (7.11)

    If , this would lead to being radially strictly increasing for positive and large values of , which would be at odds with our choice of . In fact, if , for any of this form. However, in smaller dimensions this ansatz remains a viable option. If , it leads to the same solution as before. On the other hand, if , we obtain a solution which is not of form (7.2). Instead, we are led to assume

    (7.12)

    We have:

    and, recalling (7.12),

    Thus, we arrive at ODE system

    (7.13)

    This system is not autonomous, but it can be integrated by noticing that along trajectories

    and so

    This implies, first of all, that

    for all and the form of solution (7.12) is preserved as long as the solution does not vanish. Furthermore, we can rewrite the system (7.13) in decoupled form

    (7.14)

    These equations can be explicitly integrated:

    We observe that the solutions exist globally and

    In particular stays in the form (7.12) for all .

    Finally we consider . In this case, both ansätze considered before lead to the same solution:

    (7.15)

    which coincides with (7.5). Repeating the calculations following (7.5), we obtain a solution of form (7.2) satisfying (7.8), i.e.,

    (7.16)

    Note that now, as opposed to the case , the coefficient multiplying is positive. Like in , the extinction time is infinite and we have

    This concludes the proof of Theorem 4.

    Using the calibrations we constructed for generalized annuli, we will now derive a system of ODEs locally prescribing the solution emanating from any piecewise constant, radially symmetric datum (a stack).

    Definition 32. Let . We say that is a stack if there exists a number and sequences , with such that

    Suppose first that , in which case all connected components of level sets of any stack are calibrable. Let be a stack

    (7.17)

    where for , . We expect that if is the solution emanating from , then is a stack of form

    (7.18)

    with for all , and that , for small . We construct a Cahn-Hoffman vector field for by pasting together calibrations for , , with suitable choice of signatures. We have

    (7.19)

    We denote

    The values of are functions of . Assuming that are regular enough and , are small enough, we have

    whence

    Further, for , we denote by the value of which is constant since is a calibration. Then, we can write down the system of ODEs for and :

    (7.20)

    Let denote given by (6.17) if or by (6.10) if , with and in place of and . Then, we have

    (7.21)

    We observe that in a neighborhood of any initial datum , , , , the r. h. s. of (7.20) is regular in , , so locally the system has a unique solution. Unique solvability fails when a time instance is reached such that , or . In such case is again a stack with a smaller , and we can restart our procedure. This concludes the proof of Theorem 5.

    Next we deal with the remaining case of dimension . In this case, our attempt to obtain a radial calibration failed for complements of balls and for some annuli. Again, let be a stack of form (7.17). For , let . We assume the following ansatz on the solution and the associated field for small :

    (7.22)
    (7.23)

    We complete the definition of a Cahn-Hoffman field consistent with (7.22), (7.23) by pasting the calibrations with suitable choice of signatures into the gaps left in (7.23). This leads to

    Moreover, and

    for , where are the one sided limits as . The values of are functions of . Reasoning as in the case , the evolution of is governed by equations

    (7.24)

    The values are either prescribed by ODEs

    (7.25)

    with functions of in calibrable regions where , or explicitly determined by in bending regions. It is important to note that in the case , the functions do not depend on . Thus, one can first solve a part of the system (7.24), (7.25) for the outer annuli, then calculate in the bending region (without knowing a priori its inner boundary) and move on to solving innermore parts of (7.24), (7.25). This way, finding the solution is indeed again reduced to solving a system of ODEs. We include Figure 4 illustrating the evolution of stacks on the example of the characteristic function of an annulus.

    Figure 4.  Plots of the solution emanating from the characteristic function of annulus as a function of for chosen values of with , .

    In the case of thick annuli in , the qualitative behavior resulting from this procedure is rather intricate and may be surprising. To showcase this, we include Figures 5 and 6 depicting the evolution emanating from the characteristic function of a thick annulus.

    Figure 5.  Plots of the solution emanating from the characteristic function of annulus with , in as a function of for , , , .
    Figure 6.  Plot of the Cahn-Hoffman vector field for the characteristic function of annulus with , in as a function of .

    Let us explain the evolution in a few words. The inner part of the initial facet corresponding to the annulus instantaneously bends downwards. Meanwhile, the outer boundary of the facet expands outwards, at relatively low speed (practically invisible in the picture). Since the ratio of outer to inner radius of the facet is constant, this means that the whole facet slowly moves outwards. The combined effect of this and the bending results in the very steep (but continuous!) part of the graph between the facet and the bending part. At the same time, the facet corresponding to the inside ball also expands outwards, gradually consuming the bending part. In the final pictured time instance, the whole bending part has disappeared and the solution is a stack again.

    The authors declare they have not used Artificial Intelligence (AI) tools in the creation of this article.

    The authors would like to thank the anonymous reviewers for their helpful comments.

    The first author of the work was partly supported by the Japan Society for the Promotion of Science (JSPS) through the grants Kakenhi: No. 19H00639, No. 18H05323, No. 17H01091, and by Arithmer Inc. and Daikin Industries, Ltd. through collaborative grants.

    The second author of the work was partly supported by JSPS through the grant Kakenhi No. 18H05323.

    This work was created during the last author's JSPS Postdoctoral Research Fellowship at the University of Tokyo. The last author was partly supported by the Kakenhi Grant-in-Aid No. 21F20811.

    The authors declare no conflict of interest.



    [1] A. Bayaga, Multinomial logistic regression: Usage and application in risk analysis, J. Appl. Quant. Methods, 5 (2010), 288–297.
    [2] A. Selmoune, Z. Liu, J. Lee, To pay or not to pay? Understanding public acceptance of congestion pricing: A case study of Nanjing, Electron. Res. Arch, 30 (2022), 4136–4156. https://doi.org/10.3934/era.2022209 doi: 10.3934/era.2022209
    [3] Z. Ahmad, Z. Almaspoor, F. Khan, S. E. Alhazmi, M. El-Morshedy, O. Y. Ababneh, et al., On fitting and forecasting the log-returns of cryptocurrency exchange rates using a new logistic model and machine learning algorithms, AIMS Math., 7 (2022), 18031–18049. https://doi.org/10.3934/math.2022993 doi: 10.3934/math.2022993
    [4] N. Dwarika, Asset pricing models in South Africa: A comparative of regression analysis and the Bayesian approach, Data Sci. Financ. Econ., 3 (2023), 55–75. https://doi.org/10.3934/DSFE.2023004 doi: 10.3934/DSFE.2023004
    [5] D. McAllester, Generalization bounds and consistency, Predicting Struct. Data, 2007. https://doi.org/10.7551/mitpress/7443.003.0015
    [6] N. Kordzakhia, G. D. Mishra, L. Reiersølmoen, Robust estimation in the logistic regression model, J. Stat. Plan. Infer., 98 (2001), 211–223. https://doi.org/10.1016/S0378-3758(00)00312-8 doi: 10.1016/S0378-3758(00)00312-8
    [7] M. Rashid, Inference on Logistic Regression Models, Ph.D thesis, Bowling Green State University, 2008.
    [8] D. Dai, D. Wang, A generalized Liu-type estimator for logistic partial linear regression model with multicollinearity, AIMS Math., 8 (2023), 11851–11874. https://doi.org/10.3934/math.2023600 doi: 10.3934/math.2023600
    [9] Z. Wang, Z. Wang, B. Fu, Learning restricted bayesian network classifiers with mixed non-i.i.d. sampling, in 2010 IEEE International Conference on Data Mining Workshops, (2010), 899–904. https://doi.org/10.1109/ICDMW.2010.199
    [10] H. Sun, Q. Wu, Least square regression with indefinite kernels and coefficient regularization, Appl. Comput. Harmon A, 30 (2011), 96–109 https://doi.org/10.1016/j.acha.2010.04.001 doi: 10.1016/j.acha.2010.04.001
    [11] H. Sun, Q. Guo, Coefficient regularized regression with non-iid sampling, Int. J. Comput. Math., 88 (2011), 3113–3124. https://doi.org/10.1080/00207160.2011.587511 doi: 10.1080/00207160.2011.587511
    [12] X. Chu, H. Sun, Regularized least square regression with unbounded and dependent sampling, Abstr. Appl. Anal., 2013 (2013), 900–914. https://doi.org/10.1155/2013/139318. doi: 10.1155/2013/139318
    [13] Z. C. Guo, L. Shi, Learning with coefficient-based regularization and l1-penalty, Adv. Comput. Math., 39 (2013), 493–510. https://doi.org/10.1007/s10444-012-9288-6 doi: 10.1007/s10444-012-9288-6
    [14] B. Jiang, Q. Sun, J. Q. Fan, Bernstein's inequality for general Markov chains, preprint, arXiv: 1805.10721.
    [15] D. S. Modha, E. Masry, Minimum complexity regression estimation with weakly dependent observations, IEEE Trans. Inf. Theory, 42 (1996), 2133–2145. https://doi.org/10.1109/18.556602 doi: 10.1109/18.556602
    [16] F. Merlevède, M. Peligrad, E. Rio, Bernstein inequality and moderate deviations under strong mixing conditions, Inst. Math. Stat. (IMS) Collect., 2009 (2009), 273–292. https://doi.org/10.1214/09-IMSCOLL518 doi: 10.1214/09-IMSCOLL518
    [17] J. Q. Fan, B. Jiang, Q. Sun, Hoeffding's lemma for Markov Chains and its applications to statistical learning, preprint, arXiv: 1802.00211.
    [18] P. J. M. Laarhoven, E. H. L. Aarts, Simulated Annealing: Theory and Applications, Springer, Dordrecht, 1987.
    [19] J. Thongkam, G. Xu, Y. Zhang, et.al., Support vector machine for outlier detection in breast cancer survivability prediction, in Asia-Pacific Web Conference, Springer, (2008), 99–109. https://doi.org/10.1007/978-3-540-89376-9_10
    [20] A. L. B. Miranda, L. P. F. Garcia, A. C. P. L. F. Carvalho, A. C. Lorena, Use of classification algorithms in noise detection and elimination, in International Conference on Hybrid Artificial Intelligence Systems, Springer, (2009), 417–424. https://doi.org/10.1007/978-3-642-02319-4_50
    [21] J. Xu, Y. Y. Tang, B. Zou, Z. Xu, L. Li, Y. Lu, et al., The generalization ability of SVM classification based on Markov sampling, IEEE Trans. Cybern., 45 (2014), 1169–1179. https://doi.org/10.1109/TCYB.2014.2346536 doi: 10.1109/TCYB.2014.2346536
    [22] J. D. Head, M. C. Zerner, A Broyden—Fletcher—Goldfarb—Shanno optimization procedure for molecular geometries, Chem. Phys. Lett., 122 (1985), 264–270. https://doi.org/10.1016/0009-2614(85)80574-1 doi: 10.1016/0009-2614(85)80574-1
    [23] M. Vidyasagar, Learning and Generalization: With Applications to Neural Networks, Springer, London, 2003.
    [24] S. P. Meyn, R. L. Tweedie, Markov Chains and Stochastic Stability, Springer, Berlin, 2012.
    [25] P. Doukhan, Mixing: Properties and Examples, Springer, Berlin, 2012.
    [26] P. Zhang, N. Riedel, Discriminant analysis: A unified approach, in Fifth IEEE International Conference on Data Mining (ICDM'05), 2005. https://doi.org/10.1109/ICDM.2005.51
    [27] V. N. Vapnik, An overview of statistical learning theory, IEEE T. Neur. Net. Lear., 10 (1999), 988–999. https://doi.org/10.1109/72.788640 doi: 10.1109/72.788640
    [28] F. Cucker, S. Smale, Best choices for regularization parameters in learning theory: On the bias-variance problem, Found. Comput. Math., 2 (2002), 413–428. https://doi.org/10.1007/s102080010030 doi: 10.1007/s102080010030
    [29] G. Stempfel, L. Ralaivola, Learning SVMs from sloppily labeled data, in Lecture Notes in Computer Science, Springer, 2009. http://dx.doi.org/10.1007/978-3-642-04274-4_91
    [30] M. P. Qian, G. L. Gong, Applied random processes, Peking University Press, Beijing, 1998.
    [31] W. K. Hastings, Monte Carlo sampling methods using Markov chains and their applications, Biometrika, 57 (1970), 97–109. https://doi.org/10.1093/biomet/57.1.97 doi: 10.1093/biomet/57.1.97
    [32] S. Geman S, D. Geman, Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images, IEEE Trans. Pattern Anal. Mach. Intell., 6 (1984), 721–741. https://doi.org/10.1109/TPAMI.1984.4767596 doi: 10.1109/TPAMI.1984.4767596
  • This article has been cited by:

    1. Julie Clutterbuck, Jiakun Liu, Preface to the Special Issue: Nonlinear PDEs and geometric analysis – Dedicated to Neil Trudinger on the occasion of his 80th birthday, 2023, 5, 2640-3501, 1, 10.3934/mine.2023095
  • Reader Comments
  • © 2023 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)
通讯作者: 陈斌, bchen63@163.com
  • 1. 

    沈阳化工大学材料科学与工程学院 沈阳 110142

  1. 本站搜索
  2. 百度学术搜索
  3. 万方数据库搜索
  4. CNKI搜索

Metrics

Article views(1750) PDF downloads(81) Cited by(0)

Figures and Tables

Figures(8)  /  Tables(5)

Other Articles By Authors

/

DownLoad:  Full-Size Img  PowerPoint
Return
Return

Catalog