contents

Transcription

contents
S⃗eMA JOURNAL
NUMBER 54
April 2011
contents
On the curve straightening flow of inextensible, open, planar curves., by D.B.
Oelz . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
5
On the Modeling of Crowd Dynamics: An Overview and Research Perspectives,
by N. Bellomo, C. Bianca, V. Coscia . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25
A brief survey of the discontinuous Galerkin method for the Boltzmann-Poisson
equations, by Y. Cheng, I.M. Gamba, A. Majorana, C.-W. Shu . . . . . . . . . . . . . 47
On the numerical computation of mountain pass solutions to some perturbed
semi-linear elliptic problem, by L. Montoro . . . . . . . . . . . . . . . . . . . . . . . . . . . 65
Executive Committee of S⃗eMA
President: Pablo Pedregal Tercero
Vicepresident: Rosa Marı́a Donat Beneito
Secretary: Julio Moro Carreño
EC Members: Lluis Alseda i Soler, Sergio Amat Plata, Rafael Bru Garcı́a
Inmaculada Higueras Sanz, Carlos Parés Madroñal, Luis Vega González.
S⃗eMA Journal
Editor-in-Chief: Enrique Fernández-Cara
Editorial Board: Grégoire Allaire, Carme Calderer, Carlos Conca, Amadeu
Delshams, Martin J. Gander, Francisco Guillén-González, Vivette Girault, Arieh
Iserles, José M. Mazón, Pablo Pedregal, Ireneo Peral, Benoı̂t Perthame, Alfio
Quarteroni, Chi-Wang Shu, Daniel B. Szyld, Luis Vega, Enrique Zuazua.
Editorial Staff: Sergio Amat, Carlos Angosto, Sonia Busquier, Marı́a J. Moncayo,
Alberto Murillo.
Web page of S⃗eMA
http://www.sema.org.es/
e-mail
[email protected]
Dirección Editorial: Dpto. de Matemática Aplicada y Estadı́stica. Univ. Politécnica de Cartagena. Paseo de
Alfonso XIII, 52, 30203 Cartagena (Murcia) Spain. [email protected]
ISSN 1575-9822.
AS-1442-2002.
3
S⃗eMA Journal
no 54(2011), 5–24
ON THE CURVE STRAIGHTENING FLOW OF INEXTENSIBLE, OPEN,
PLANAR CURVES.
D.B. OELZ
Faculty of Mathematics, Vienna University
[email protected]
Abstract
We consider the curve straightening flow of inextensible, open, planar curves
generated by the Kirchhoff bending energy. It can be considered as a model
for the motion of elastic, inextensible rods in a high friction regime. We derive
governing equations, namely a semilinear fourth order parabolic equation for the
indicatrix and a second order elliptic equation for the Lagrange multiplier. We
prove existence and regularity of solutions, compute the energy dissipation, prove
its coercivity and conclude convergence to equilibrium, namely to a straight curve,
at an exponential rate.
1
Introduction
We consider the L2 -gradient flow generated by the Kirchhoff bending energy (cp.
[8, 5])
∫
1 1 ′′ 2
E[z] :=
|z | ds
(1)
2 0
on the manifold
A := {z ∈ H 2 ([0, 1], R2 ) : |z ′ | ≡ 1}
Received: January 14, 2010. Accepted: July 14, 2010.
This work has been started during a visit of the author at the Département de mathématiques et
applications (DMA) of the ENS rue d’Ulm Paris within the DEASE project (Marie Curie Early Stage
Training multi Site (mEST) of the EU, MEST-CT-2005-021122). Furthermore it has been supported by
the Vienna Science and Technology Fund (WWTF) through the project ”How do cells move? Mathematical
modelling of cytoskeletal dynamics and cell migration” of C. Schmeiser and V. Small.
The author would like to thank B. Perthame for suggesting the problem and for fruitful discussions and C.
Schmeiser for helpful suggestions.
5
6
D.B. Oelz
of free, open, planar curves of length 1 which are parametrised by their arclength. The
L2 -gradient flow is described by the system (cp. Appendix A)

∂t z + z ′′′′ − (λ1 z ′ )′ = 0 ,


 ′′

 z |s=0,1 = 0 ,
z ′′′ − λ1 z ′ |s=0,1 = 0 ,
(2)

′

|z
|
=
1
,



z(t = 0, .) = zI (.) ,
where λ1 = λ1 (t, s) ∈ R is a Lagrange multiplier function determined by the
constraint on the arclength
|z ′ | ≡ 1
(3)
and zI ∈ A represents the initial datum of the evolution. Here and throughout the
paper ′ denotes derivatives with respect to the arclength.
The system (2) can be considered as a model for the motion of elastic, inextensible
rods in a high friction regime and has appeared in the modelling of Actin-filaments in
biological cells (cp. [19], [18]). Currently the mathematical modelling of biopolymers
and biopolymer networks is a field of high scientific interest and elastic rod models
have recently also been used for the modelling of the DNA (cp [2, 3]). The primary
motivation for this study is to obtain qualitative results on the behaviour of these
models which have the potential to give insight into the behaviour of the respective
biological systems.
Continuing the discussion of the system (2), observe that the centre of mass,
∫1
z ds, is a conserved quantity of solutions to the problem (2). Furthermore the energy
0
of the initial datum is given by
∫
1 1 ′′ 2
EI :=
(4)
|z | ds .
2 0 I
Specialising Theorem 8 and Lemma 4 in [17] implies (see also Appendix A for more
details):
(
)
Theorem 1 Let zI ∈ A, then there exist T > 0 and z ∈ L∞ (0, T ); H 2 (0, 1) ∩
(
)
(
)
C 0,1/8 [0, T ]; C 1 ([0, 1]) ∩ H 1 (0, T ); L2 (0, 1) and λ1 ∈ L2 ((0, T ); M(0, 1)),
satisfying z(t = 0, .) = zI , |z ′ | ≡ 1 and
∫ T∫ 1
[z ′′ · v ′′ + ∂t z · v + λ1 z ′ · v ′ ] ds dt = 0 ,
0
0
for every smooth v : [0, T ] × [0, 1] → R2 .
In the theorem above we use the notation M(0, 1) for the space of Radon measures on
the interval (0, 1). The fact that the existence Theorem 1 is only local in time is due to
the fact that in the more complex setting of [17] additional geometric assumptions on
z are required which cannot be guaranteed for all times. In the present special case (2),
however, these assumptions are redundant and the proof of local existence of the above
theorem can be extended. Here and in the rest of the paper we abbreviate the notation
of function spaces writing the subscripts t for function spaces on {t ∈ [0, ∞)} and the
subscript s for function spaces on {s ∈ [0, 1]}.
7
On the curve straightening flow of inextensible, open, planar curves.
0, 1
0, 1
1
2
Corollary 2 Let zI ∈ A, then there are z ∈ Ht,loc
L2s ∩ Ct 2 L2s ∩ Ct 8 Cs1 ∩ L∞
t Hs
2
and λ1 ∈ Lloc,t Ms such that the pair (z, λ1 ) is a weak solution of (2) satisfying
∫ ∞∫ 1
[z ′′ · v ′′ + ∂t z · v + λ1 z ′ · v ′ ] ds dt
(5)
0
0
for all v ∈ Cc∞ (R+ , Cs∞ ), z(t = 0, .) = zI and
√ the constraint (3) in a pointwise sense.
It holds that ∂t z ∈ L2t L2s with ∥∂t z∥L2t L2s ≤ 2 EI .
Proof. A sketch of the proof, which is presented in detail in [17], can be found in
Appendix A. The crucial difference between the present setting and the more complex
problem in [17] is that the inequality (48) holds without additional terms. Hence the
infinite sum for n = 1, 2, ... is bounded and yields a telescoping series which in the
limit as the step size converges to zero becomes (49), the estimate on ∥∂t z∥L2t L2s . In the present study we show that the problem (2) is equivalent to the system

′
∂t ω + ω ′′′′ − ω ′2 ω ′′ − (ω ′ λ) − ω ′ λ′ = 0 ,


 −λ′′ + ω ′2 λ = ω ′′′ ω ′ + (ω ′′ ω ′ )′ ,
ω ′ , ω ′′ , λs=0,1 = 0 ,



ω(t = 0, .) = ωI (.) .
(6)
where λ = λ1 + ω ′2 and ω = ω(t, s) ∈ R represents the ”indicatrix” of the curve z
(e.g. see [14]) so that
z ′ = (cos(ω), sin(ω)) .
(7)
The reconstruction of the curve z from the indicatrix ω has to be done in such a way,
∫1
that the centre of mass of the initial datum z̄I := 0 zI ds ∈ R2 is conserved,
)
)
∫ 1 ∫ s̃ (
∫ s(
cos(ω(t, s̄))
cos(ω(t, s̄))
z(t, s) = z̄I −
ds̄ ds̃ +
ds̄ .
sin(ω(t, s̄))
sin(ω(t, s̄))
0
0
0
The curvature energy (1), in a abuse of notation, can be written as
∫
1 1 ′ 2
E[ω] :=
(ω ) ds ,
2 0
(8)
such that E[z] = E[ω] if the constraint (3) is satisfied.
In this work we prove
Theorem 3 The solution according to Corollary 2 gives a distributional sense to the
system (6), i.e. for ω being the indicatrix of z and λ = λ1 + ω ′2 it holds that
∫ ∞∫ 1
[
]
−ω∂t ψ − ω ′′′ ψ ′ − ω ′′ (ω ′ )2 ψ − λ′ ω ′ ψ + λ ω ′ ψ ′ ds dt = 0 and (9)
0
∫
0
0
∞
∫
1
[ ′′′ ′
]
−ω ω ϕ + ω ′′ ω ′ ϕ′ + λ′ ϕ′ + λ (ω ′ )2 ϕ ds dt = 0 .
(10)
0
1
1
1
for all ψ ∈ H0,t
L2s ∩ L2t H0,s
and ϕ ∈ L2t H0,s
and λ, ω ′ , ω ′′ 0,1 = 0 a.e. on R+ .
0,1/8 0
Cs
Furthermore It holds that ω ∈ Ct
2
2 2
2 1
with ω ′ ∈ L∞
t Ls ∩ Lt Hs and λ ∈ Lt Hs .
8
D.B. Oelz
The regularity results are better than in the preliminary result Corollary 2, most notably
we get L2 -regularity up to the third spatial derivative of the indicatrix, which partly
corresponds to the fourth spatial derivative of the curve z, and we are able to prove that
the Lagrange multiplier is a function λ ∈ L2t Hs1 . Furthermore we prove:
Theorem 4 Let zI ∈ A, let (z, λ1 ) be a solution of problem (2) according to
Corollary 2 and let (ω, λ) be the corresponding solution to (6) according to Theorem 3,
then the energy dissipation is given by
d
E = −D ,
dt
(11)
in the sense that (11) holds weakly in time. The curvature energy is given by (1) and
(8) respectively and the energy dissipation D too can, in an abuse of notation, be
alternatively written in terms of z or ω,
D = D[z] = D[ω] with
∫ 1
2
D[z] :=
|z ′′′′ − (λ1 z ′ )′ | ds and
0
∫
1
D[ω] :=
(12)
[
]
2
2
(ω ′ ω ′′ + λ′ ) + (ω ′′′ − ω ′ λ) ds .
0
Finally we prove coercivity of (12) with respect to the curvature energy (a Poincaré
type inequality) obtaining the exponential decay of the energy.
Theorem 5 (Poincaré-type) Under the assumptions of Theorem 4, let the energy of
the initial datum be given by (4), then it holds that
E ≤ EI exp(−2π 4 t) ,
where again the curvature energy is alternatively given by (1) and (8).
This result and the regularity statements in Theorem 3 are finally used to prove the
convergence of solutions towards straight lines.
Theorem 6 Under the assumptions of Theorem 4, let the energy of the initial datum
be given by (4), then there is ω∞ ∈ R such that
1/2
∥ω(t, .) − ω∞ ∥L2s ≤ C1 exp(−π 4 t) EI
where C1 :=
√ (1
)
2 π +1 +
3
2π 2
+
3
4
and C2 :=
3/2
+ C2 exp(−3π 4 t) EI
√
9 2+3
2π 2
,
+ 34 .
Curve straightening flows have been investigated intensively since the 1980s. The
paper [11] deals with global in time existence of solutions of the curve straightening
flow of closed curves and with the stability of stationary limit shapes (elasticae). In
[12] this work is generalised to curves in arbitrary Riemannian manifolds and in [10]
and [15] the authors deal with categorising the elasticae. In [14] the development of
self-intersections was investigated and in [16] the focus was on finding a Riemannian
structure that ensures that the curve-straightening flow preserves certain symmetries.
On the curve straightening flow of inextensible, open, planar curves.
9
In more recent works, [21] and [22], the viewpoint shifted more towards PDEmethods. In [21] a gradient flow acting directly on the indicatrix was investigated and
in [22] the L2 -gradient flow of the square curvature functional was investigated in the
space of smooth closed curves and a fourth order semilinear parabolic equation was
derived as a consequence of the indicatrix representation. The author proves global
in time existence and a convergence result that involves the asymptotic exponential
convergence rate due to the spectrum of the linearised model (cp. [7]). In [9] the
evolution of closed, inextensible curves in R3 under the curve straightening flow with
the same curvature functional as in the present study was considered, but without
making use of the indicatrix representation. In the same way in [20] the motion of
elastic planar closed curves under the additional constraint of area-preservation was
considered. Finally in [13] and in [6] efficient ways to compute the evolution towards
the stationary points called elasticae were investigated.
In contrast to the above mentioned studies, the present one considers open curves as
opposed to closed ones. Although the indicatrix representation was also used in earlier
works ([11, 14, 13, 21, 22]) we consider as new the approach pursued in this paper: to
derive a system of scalar valued equations for the indicatrix and the Lagrange multiplier
and to use it in order to compute, respectively represent the energy dissipation. Doing
so we are able to prove coercivity and, finally, convergence to a straight line at an
exponential rate. These results are derived from the non-linear dynamics and therefore
reveal part of the nature of problem (2), (6) respectively. Most notably the coercivity of
(12) with respect to the square curvature energy is global and leads beyond asymptotic
results which one can get by linearisation at the steady state.
The paper is organised as follows:
In Section 2 we derive formally the system (6) and prove the existence theorem 3.
In Section 3 we show the formal derivation of the energy dissipation and prove the
Theorems 4-6.
Finally, in Section 4, we demonstrate a sequence of numerically computed
snapshots of the evolution under consideration and discuss the observed exponential
convergence rate of the curvature energy.
The paper finishes with an appendix which includes technical computations.
2
Indicatrix formulation
As a consequence of the definition of the indicatrix ω such that (7) holds the curvature
of a curve z ∈ A can be written as
ω ′ = z ′⊥ · z ′′ ,
z∈A.
(13)
Here and in the sequel the superscript ⊥ denotes the rotation of a vector by 90o to the
left, (x, y)⊥ = (−y, x). Frequently we will make use of the relations
z ′′ = ω ′ z ′⊥
and
z ′′⊥ = −ω ′ z ′ ,
(14)
which holds whenever |z ′ | ≡ 1 and which implies the equality of (1) and (8) in this
case. Additionally the initial energy (4) can also be written in terms the indicatrix ωI
∫1
of zI , EI = 12 0 (ωI′ )2 ds .
10
D.B. Oelz
We devote the rest of this section to clarifying the relation between the system
(2) and the equation which we will derive directly from the indicatrix based energy
functional (8).
To this end we extend the definition of the indicatrix to non-admissible curves
z ∈ H 2 ((0, 1); R2 ) with non-singular parametrisation such that |z ′ (s)| > 0 for all
0 ≤ s ≤ 1. Requiring z ′ /|z ′ | = (cos(ω), sin(ω)) we obtain
ω′ =
z ′⊥ · z ′′
.
|z ′ |2
(15)
We use various formulations of the bending energy of such a curve. On the one hand
we rely on the straightforward formulation of the Kirchhoff bending energy (1). On
the other hand we define
)2
)2
∫ (
∫ (
1 1 z ′⊥ · z ′′
1 1
ω′
Ep [z] :=
ds =
ds ,
(16)
2 0
|z ′ |p
2 0
|z ′ |p−2
∫1
which is a generalisation of the square curvature functional E5/2 [z] = 0 (z ′⊥ ·
z ′′ /|z ′ |3 )2 dz that is invariant with respect to reparametrisations of z (see Appendix B).
The functional Ep represents, for every p ∈ R, a notion of bending energy, which
coincides with the Kirchhoff energy (1) and (8) respectively whenever the constraint
(3) is satisfied,
E[z] = E[ω] = Ep [z] if |z ′ | ≡ 1 .
(17)
The energy functional (16) coupled to the constraint (3) defines a gradient flow,
which, in its weak formulation, reads
)
∫ ∞ (∫ 1
∫ 1
∂t z · v ds + δEp [z]v +
λp z ′ · v ′ ds dt = 0 ,
(18)
0
0
0
Cc∞ (R+ , Cs∞ )
for all v ∈
with the Lagrange multiplier function λp = λp (t, s) ∈ R to
enforce the constraint (3) (see Appendix A for the derivation of the variational equation
and Appendix B for the computation of the variation δEp [z]). Its solution coincides
with the solution of the system (2), whereas the strong formulation of (18) is given by

(
)′ ((
) )′
∂t z + ω ′′z ′⊥ + (p − 2)ω ′2 − λp z ′ = 0 ,



 ′ ′′
ω , ω , λp =0,
(19)
s=0,1

′

|z
|
≡
1
,


z(t = 0, .) = zI (.) .
Due to (17) both systems, (2) and (19), describe the same evolution of z and the
existence result Corollary 2 applies to the system (19). However, since the variation
of the functional (16) in non-admissible directions depends on the exponent p, the
Lagrange multipliers λp are different for different p and (19) implies that
λp1 − λp2 = (p1 − p2 )ω ′2
(20)
for different values of p. Furthermore in equation (51) of the Appendix B we show
that non-admissible variations of (1) and (16) coincide when p = 1 which justifies
11
On the curve straightening flow of inextensible, open, planar curves.
the notation λ1 for the Lagrange-multiplier function in system (2). Summarizing we
obtain
Lemma 7 Let zI ∈ A, let (z, λ1 ) be a weak solution to problem (2) according to
0,1/8 0
2
Cs with ω ′ ∈ L∞
Corollary 2 and let λp = λ1 + (p − 1)ω ′2 , then there is ω ∈ Ct
t Ls
such that ω is an indicatrix of z and (z, ω, λp ) constitute a weak solution of (19)
satisfying (18) for all v ∈ Cc∞ (R+ , Cs∞ ).
Proof. Furthermore the regularity of z ′ according to Corollary 2 allows to identify
0,1/8 0
ω ∈ Ct
Cs up to an additive constant being a multiple of 2π. The rest of the
statement is then an consequence of the discussion above, most notably (20).
The case p = 2 has an especially simple structure, since the forces generated by
the curvature and by the Lagrange multiplier are orthogonal in some sense. This is
reflected by the identities
δE2
· δz = −
δz
∫
1
ω ′′ z ′⊥ · δz ′ ds and
0
∫
δE L
· δz =
δz
1
λz ′ · δz ′ ds ,
(21)
0
∫1
where E L [z] := 0 λ(s)(|z ′ |2 − 1)/2 ds is a potential term such that the effect of the
Lagrange multiplier can be considered as its variation. For this reason we choose the
notation
λ := λ2 .
(22)
It holds that
Lemma 8 (Regularity, a-priori estimates) Let (z, ω, λ) bet a weak solution of (19) in
′ ′′
2 2
the case p = 2 in the sense of Lemma 7, then it holds that ω ′′ ∈ L2t L∞
s , ω ω ∈ Lt Ls ,
2 ∞
′
2 2
′′′
2 2
′
2 2
λ ∈ Lt Ls , ω λ ∈ Lt Ls , ω ∈ Lt Ls and λ ∈ Lt Ls . It also holds that
∫
0
∞
∫
1
0
[
]
2
2
2
(ω ′′′ − λω ′ ) + (λ′ + ω ′ ω ′′ ) ds dt = ∥∂t z∥L2 L2s .
t
Proof. First note that the primitive
∫
0
∞
∫
s
0
2
∫
∂t z ds̃ dt ≤
∞
∞
(∫
0
∫s
0
∂t z ds̃ ∈ L2t L∞
s since
)2
1
|∂t z| ds
∫
∞
dt ≤
0
∫
0
1
|∂t z|2 ds dt < 2 EI
0
by (49). We go back to the integrated version of (19) for p = 2
( ′′ ′⊥ )
 ∫s
− (λz ′ ) = 0 ,
 0 ∂t z ds + ω z
′
ω s=0,1 = 0 ,

z(t = 0, .) = zI (.) .
and obtain
′′
ω = −z
′⊥
∫
·
s
∂t z ds and
0
′
∫
λ=z ·
s
∂t z ds ,
0
(23)
12
D.B. Oelz
2 ∞
which implies ω ′′ ∈ L2t L∞
s and λ ∈ Lt Ls . Moreover we find that
∫
∞
0
∫
1
∫
′ 2
∞
(λω ) ds dt ≤
0
0
∫
2
∥λ∥L∞
s
1
(ω ′ ) ds dt ≤
0
∫ ∞
2
≤ 2 EI
∥λ∥L∞
dt < (2 EI )2 . (24)
s
2
0
′
′′
This computation and an analogous one for ω ω imply
ω ′ λ ∈ L2t L2s
ω ′ ω ′′ ∈ L2t L2s .
and
(25)
We write the weak formulation (18) of problem (19) for p = 2 after two integrations
by parts,
∫
∞
0
∫
1
(
0
)
(
)′
′
∂t z + ω ′′ z ′⊥ − (λz ′ ) · v ds dt+
∫ ∞
[ ′′ ′⊥
]1
+
−ω z · v + λz ′ · v + ω ′ z ′⊥ · v ′ 0 dt = 0 , (26)
0
for all v ∈ Cc∞ (R+ , Cs∞ ).
The uniform estimates we obtained allows to
set v = z ′⊥ ϕ for a testfunction ϕ ∈ D(([0, ∞) × [0, 1]) and to obtain
]
∫∞∫1[
(∂t z)z ′⊥ ϕ + ω ′′′ ϕ − λω ′ ϕ ds dt = 0. Specialising (26) for v =
0
0
z ′ ψ with ψ ∈ D([0, ∞) × [0, 1]) we conclude by an analogous computation
∫∞∫1
[(∂t z)z ′ ψ − ω ′ ω ′′ ψ + λψ] ds dt = 0. Due to (49) and (25) this implies
0
0
z ′⊥ · ∂t z + ω ′′′ − λω ′ = 0 a.e.
and z ′ · ∂t z − ω ′ ω ′′ − λ′ = 0 a.e.,
(27)
hence ω ′′′ ∈ L2t L2s and λ′ ∈ L2t L2s .
Finally (27) implies
∫
∞
0
∫
1
[
(ω ′′′ − λω ′ ) + (λ′ + ω ′ ω ′′ )
2
0
∫
∞
∫
=
0
0
1
[(
z ′⊥ · ∂t z
2
)2
]
ds dt =
+ (z ′ · ∂t z)
2
]
2
ds dt = ∥∂t z∥L2 L2s . (28)
t
Next we will derive the governing equations for the quantities ω(t, s) and λ(t, s).
Taking the derivative of the evolution equation in (19) and explicitly evaluating all the
derivatives we infer
(
∂t z ′ + (ω ′′′′ + 3(p − 2)ω ′2 ω ′′ )z ′⊥ + (ω ′′′ + (p − 2)ω ′3 )(−1)ω ′ z ′ +
)
+ (2p − 5)(ω ′′ ω ′ )′ z ′ + (2p − 5)ω ′′ ω ′ ω ′ z ′⊥ −
− (λ′′p z ′ + (λp ω ′ )′ z ′⊥ + λ′p ω ′ z ′⊥ + λp ω ′ (−1)ω ′ z ′ ) = 0 . (29)
On the curve straightening flow of inextensible, open, planar curves.
Multiplying by z ′⊥ and z ′ respectively we get
( ′′
)

′′′′
′2 ′′
′ ′
 ∂t ω + ω + (5p − 11) ω ω − ω λp + 2ω(λp = 0 , )
−(ω ′′′ + (p − 2)ω ′3 )ω ′ + (2p − 5)(ω ′′ ω ′ )′ − λ′′p − ω ′2 λp = 0 ,
 ω ′ , ω ′′ , λ p s=0,1 = 0 .
13
(30)
where ∂t ω = z ′⊥ · ∂t z ′ . Observe that λ′′p (0) = λ′′p (1) = 0 as a consequence of the
second equation and the boundary conditions. Let us compare the system for some
“canonical” choices for p. In the case p = 1 we obtain

′′′′
′2 ′′
′′
′ ′
 ∂t ω + ω − 6 ω ω − (ω λ1(+ 2ω λ1 ) =)0 ,
′′′
′3
′
′′ ′ ′
′′
−(ω − ω )ω − 3(ω ω ) − λ1 − ω ′2 λ1 = 0 ,
(31)
 ω ′ , ω ′′ , λ 1 s=0,1 = 0 .
As a consequence of the fact that the variations of E and E1 coincide by (51), the elliptic
equation for λ1 is the one that was found by Koiso ([9]).
In the case p = 2, using the notation (22), we obtain the system (6) and prove the
Theorem 3.
Proof of Theorem 3. We start with a weak solution of (19) satisfying (26) for all
v ∈ Cc∞ (R+ , Cs∞ ) in
the case p = 2 according to Lemma 7. The boundary integrals
imply that λ, ω ′ , ω ′′ 0,1 = 0 a.e. on R+ .
We choose a regularising sequence ηk with supp ηk ⊂ B(0, 1/k) ⊂ R2 and set
v = ηk (t̃ − t, s̃ − s) for (t̃, s̃) ∈ Uk with Uk := [1/k, ∞) × (1/k, 1 − 1/k) obtaining
(
)′
∂t zk + ω ′′ z ′⊥ ∗ ηk − (λz ′ )′ ∗ ηk = 0 ,
)′
(
where zk := z ∗ ηk . We omit the tilde and integrate against − φzk′ + |z1′ |2 ψzk′⊥
k
with φ, ψ ∈ D(R+ × [0, 1]) and k large enough such that Uk covers the support of
both testfunctions,
(
)′
∫∫ [
1
−∂t zk · zk′⊥ ′ 2 ψ −
|zk |
Uk
)′ ]
) (
((
)
′′ ′⊥ ′
′ ′
′⊥ 1
(32)
− ω z
∗ ηk − (λz ) ∗ ηk · zk ′ 2 ψ
ds dt = 0 ,
|zk |
∫∫
((
)
)′
′
−∂t zk · (zk′ φ)′ − ω ′′ z ′⊥ ∗ ηk − (λz ′ )′ ∗ ηk · (zk′ φ) ds dt = 0 .
Uk
Making use of the regularity results in Corollary 2 it holds that zk′ → z ′ and, as a
consequence, also ωk → ω, uniformly on (compact subsets of
) R+ × [0, 1], where ωk
is the indicatrix of zk satisfying zk′ /|zk′ | = cos(ωk ), sin(ωk ) . Using these results we
perform integrations by parts with respect to s and t with the expressions in (32) that
involve ∂t zk and pass to the limit, obtaining
∫∫
∫∫
−∂t zk · (zk′ φ)′ ds dt =
∂t zk′ · zk′ φ ds dt =
Uk
Uk
∫∫
1
=
− |zk′ |2 ∂t φ ds dt → 0 as k → ∞
2
Uk
14
D.B. Oelz
and
∫∫
)′
(
∫∫
1
′⊥ 1
∂t zk′ · zk′⊥ ′ 2 ψ ds dt =
−∂t zk · zk ′ 2 ψ ds dt =
|z |
|zk |
U
Uk
∫ ∫k
∫ ∫k
=
∂t ωk ψ ds dt =
−ωk ∂t ψ ds dt →
Uk
Uk
∞
∫
→
0
∫
1
−ω∂t ψ ds dt as
k→∞,
0
2
where we used that ∂t ωk = ∂t zk′ ·zk′⊥ |z1′ |2 . Making use of the the fact that z ′′ ∈ L∞
t Ls
k
by Corollary 2 and ω ′′′ , λ′ in L2t L2s by Lemma 8 we now pass to the limit k → ∞ in
the remaining expressions of (32) and conclude
∫
∞
∫ 1[
−ω∂t ψ−
(
) (
)]
− ω ′′′ z ′⊥ − λ′ z ′ − ω ′′ ω ′ z ′ − λω ′ z ′⊥ · −z ′ ω ′ ψ + z ′⊥ ψ ′ ds dt = 0 ,
∫ ∞∫ 1
(
) (
)
− ω ′′′ z ′⊥ − λ′ z ′ − ω ′′ ω ′ z ′ − λω ′ z ′⊥ · z ′⊥ ω ′ φ + z ′ φ′ ds dt = 0 .
0
0
0
0
This implies (9), (10) where every term is well defined by Lemma 8 and where we
allow for ψ, ϕ in function spaces in which testfunctions are densily contained.
An interesting structural observation concerning the problem (6) is the following.
Remark 1 Let (ω, λ) be a solution of (6), then, at least on a formal level, the
evolution, represented by ∂t ω, and the Langrange mulitplier satisfy the orthogonality
condition
∫
1
∂t ωλ ds = 0 ,
0
which we obtain integrating the first equation in (6) against λ,
∫
∫
1
∂t ωλ ds =
0
1
(
)
ω ′′′ λ′ + ω ′′ ω ′2 λ + λ′ ω ′ λ − λω ′ λ′ ds dt =
0
∫
∞
∫
=
0
1
(
)
ω ′′′ λ′ + ω ′′ ω ′2 λ ds ,
(33)
0
and integrating the second equation in (6) against ω ′′ ,
∫
1
0
( ′′′ ′
)
ω λ + ω ′2 ω ′′ λ ds =
∫
1
(−ω ′ ω ′′ ω ′′′ + ω ′ ω ′′′ ω ′′ ) ds = 0 .
0
This observation seems to be related to the special structure of forces resulting from
curvature and from pressure/tension (cp (21)) and to the fact that we are dealing with
a constrained gradient flow, i.e. the evolution is tangential to the constrained set.
15
On the curve straightening flow of inextensible, open, planar curves.
3
Energy dissipation and long time convergence
Testing the first equation in the system (6) formally against ω ′′ and the second one
against λ and taking their difference implies that the energy dissipation is given by
d
d 1
E[ω(t)] =
dt
dt 2
∫
1
ω ′2 ds = −
0
∫
1
[
]
2
2
(ω ′ ω ′′ + λ′ ) + (ω ′′′ − ω ′ λ) ds ≤ 0 .
0
(34)
This can be made rigorous stating that the energy dissipation equality (34) holds
weakly in time (Theorem 4).
Proof of Theorem 4. Here the problem is that we cannot directly set ψ in (9)
equal to ω ′′ , since its time derivative cannot necessarily be interpreted as a function.
Therefore we regularise with respect to t using a sequence of mollifiers (ηk )k=1,2,...
with supp ηk ⊂ [−1/k, 1/k].
∫∞
For t̃ ≥ 1/k we denote by ωk (t̃, s) := (ω ∗t ηk )(t̃, s) = 0 ω(t, s)ηk (t̃ − t) dt the
regularised version of ω and evaluate (9) using ψ(t, s) = ηk (t̃ − t) ωk′′ (t̃, s). The time
integral becomes part of the convolution terms and we find the following expression
which holds pointwise for every t̃,
∫ 1[
(
)
∂t ωk ωk′′ − ωk′′′ ωk′′′ − (ω ′′ (ω ′ )2 ) ∗t ηk ωk′′ −
]
− ((λ′ ω ′ ) ∗t ηk ) ωk′′ + ((λ ω ′ ) ∗t ηk ) ωk′′′ ds = 0 ∀
0
t̃ ≥ 1/k .
Observe that we write the convolution of various expressions and ηk explicitly using
the symbol ∗t .
We omit the tildes and integrate against a test function in time ϑ ∈ D(R+ )
obtaining
∫
∞[
∫
1
∂t ϑ
1/k
0
ωk′2 /2 ds + ϑ
∫ 1[
(
)
−ωk′′′2 − (ω ′′ (ω ′ )2 ) ∗t ηk ωk′′ −
0
] ]
− ((λ′ ω ′ ) ∗t ηk ) ωk′′ + ((λ ω ′ ) ∗t ηk ) ωk′′′ ds dt = 0 , (35)
with k large enough such that(supp ϑ ⊂ [1/k,)∞). Next we use the fact that all the
convolved terms like ωk′ , ωk′′′ , (ω ′′ (ω ′ )2 ) ∗t ηk , etc. converge strongly in L2t,loc L2s to
their original counterparts.
We illustrate this with respect to ω ′′′ . Fix k ∈ N, 1/k < t0 < t1 and 0 ≤ s ≤ 1.
16
D.B. Oelz
Using Fubini’s Theorem and Jensen’s Inequality we find that
∫
t1
2
t0
∫
∫
t0
t1
∫
∫
≤2
=2
t0
∞
ωk′′′ (t, s)2
0
t1
t0
t1
≤
∫
∫
(ωk′′′ − ω ′′′ ) dt̃ =
(∫
∞
ωk′′′ (t̃ − t, s)ηk (t) dt − ω ′′′ (t̃, s)
dt̃ ≤
0
∞
( ′′′
)2
ωk (t̃ − t, s) − ω ′′′ (t̃, s) ηk (t) dt dt̃ ≤
∞
(
0
)
ωk′′′ (t̃ − t, s)2 + ω ′′′ (t̃, s)2 ηk (t) dt dt̃ =
0
∫
)2
∫
t1
t1
ηk (t̃−t) dt̃ dt+2
t0
ω ′′′ (t̃, s)2
∫
t0
∞
0
ηk (t) dt dt̃ ≤ 4∥ω ′′′ ∥2L2 .
t
This result and the strong convergence ωk → ω as k → ∞ for every 0 ≤ s ≤ 1 allows
to apply Lebesgue’s Theorem concluding
∫ 1 ∫ t1
2
′′′
′′′ 2
∥ωk − ω ∥L2 ((t0 ,t1 ))L2s =
(ωk′′′ − ω ′′′ ) dt ds → 0 ,
t
0
t0
hence ωk′′′ → ω ′′′ in L2t,loc L2s . We use this result and the analogous one for the other
convolution terms to pass to the limit as k → ∞ in (35) obtaining
]
∫ ∞[
∫ 1
∫ 1
[ ′′′2
]
∂t ϑ
ω ′2 /2 ds + ϑ
−ω − (ω ′′ ω ′ )2 − λ′ ω ′ ω ′′ + λ ω ′ ω ′′′ ds dt = 0 .
0
0
0
(36)
Setting ϕ = λϑ(t) in (10)
∫ ∞
∫ 1
[ ′′′ ′
]
ϑ(t)
−ω ω λ + ω ′′ ω ′ λ′ + λ′2 + λ2 (ω ′ )2 ds dt = 0 .
0
(37)
0
The difference of (36) and (37) finally yields a weak in time formulation of (34).
As an immediate consequence we obtain
Corollary 9 Let z be a solution of (2) according to Corollary 2 with initial energy EI ,
then it holds that
∫ ∞
∥∂t z∥2L2 L2 =
t
s
D dt = EI .
(38)
0
Furthermore it holds that
∥ω ′′ ∥L2t L∞
, ∥λ∥L2t L∞
s
s
∥ω ′ λ∥L2t L2s , ∥ω ′ ω ′′ ∥L2t L2s
∥λ′ ∥L2t L2s , ∥ω ′′′ ∥L2t L2s
√
EI ,
√
≤
2EI and
√
√
2EI + EI .
≤
≤
Proof . Integrate (34) with respect to time and combine it with (28). Using this
result we go again through the estimates in the proof of Lemma 8.
The energy dissipation equality (34) motivates the definition of [ω] in (12). The
equivalent expression in terms of z,D[z], is then a consequence of
17
On the curve straightening flow of inextensible, open, planar curves.
Lemma 10 Let (z, λ1 ) be a solution of (2) and let ω be the indicatrix of z and
λ = λ1 + ω ′2 such that (ω, λ) satisfy (6), then it holds that
∫
1
D[ω(t, .)] =
|z ′′′′ (t, s) − (λ1 (t, s)z ′ (t, s))′ | ds a.e. in R+ .
2
0
Proof. We compute iterated derivatives of the constraint (3),
0 = z ′ · z ′′ ,
0 = z ′ · z ′′′ + z ′′ · z ′′ = z ′ · z ′′′ + ω ′2 ,
0 = z ′ · z ′′′′ + z ′′ · z ′′′ + 2ω ′ ω ′′ = z ′ · z ′′′′ + ω ′ z ′⊥ · z ′′′ + 2ω ′ ω ′′ = z ′ · z ′′′′ + 3ω ′ ω ′′ ,
making use of (14) and iterated derivatives of (13),
ω ′′ = z ′⊥ · z ′′′ ,
ω
′′′
=z
′′⊥
·z
′′′
(39)
+z
′⊥
·z
′′′′
′ ′
= −ω z · z
′′′
+z
′⊥
·z
′′′′
′3
=ω +z
′⊥
·z
′′′′
.
We obtain z ′′′′ · z ′ = −3ω ′ ω ′′ and z ′′′′ · z ′⊥ = ω ′′′ − ω ′3 , which imply together with
λ1 = λ − ω ′2 (cp. (20)) that
∫
1
[
]
2
2
(ω ′ ω ′′ + λ′ ) + (ω ′′′ − ω ′ λ) ds =
0
∫
1
=
]
[ ′′′′ ′
(z · z − λ′1 )2 + (z ′′′′ · z ′⊥ − λ1 ω ′ )2 ds =
0
∫
1
=
′′′′ ′ ′
z · z z − λ′1 z ′ + z ′′′′ · z ′⊥ z ′⊥ − λ1 ω ′ z ′⊥ 2 ds =
0
∫
1
=
|z ′′′′ − (λ1 z ′ )′ | ds ,
2
0
where we again used (14).
We finally deal the convergence to equilibrium. First we obtain
Lemma 11 Let ω be a function such that D = D[ω] = 0, then ω ′ ≡ 0, i.e. stationary
points of the energy are straight lines.
Proof. We integrate ω ′ ω ′′ + λ′ = 0 and make use of the boundary conditions on
2
λ to conclude that λ = −ω ′2 /2. Furthermore together with (ω ′′′ − ω ′ λ) = 0 this
implies that ω ′′′ + ω ′3 /2 = 0 and the boundary conditions on ω ′ imply that ω ′ ≡ 0 on
[0, 1].
Furthermore we find the exponential decay of the energy formulated in Theorem 5.
Proof of Theorem 5. The proof uses the fact that the best constant in the Poincarétype inequality
∫ 1
∫ 1
2
v ds ≤ C
v ′2 ds
(40)
0
0
18
D.B. Oelz
Figure 1: Plot of the minimiser of the functional D/E with D/E = 505.917
for v ∈ H01 ((0, 1)) is given by the reciprocal value of the first eigenvalue of the
differential operator v ′′ in that space, C = 1/π 2 . Furthermore we use (39) to obtain
∫
2π E = π
2
1
ω ds ≤
1
=
(z
∫
∫
′2
0
∫
0
′′′
′⊥ 2
ω ′′2 ds =
∫
1
· z ) ds =
0
((z ′′′ − λ1 z ′ ) · z ′⊥ )2 ds ≤
0
1
≤
|z
0
≤
1
2
1
π2
′′′
∫
0
1
(41)
′ 2
− λ1 z | ds ≤
1
′ 2
′′′
(z − λ1 z ′ ) ds = 2 D ,
π
where we used (12). As a consequence, the energy dissipation D is coercive with
respect to the energy and and the following Poincaré-type inequality holds
D/E ≥ 2π 4 .
(42)
Observe that the estimate (42) is not sharp, since in the estimations (41) the inequality
(40) is used twice with the optimal constant C, but with different functions and because
the step from line 2 to line 3 involves adding ((z ′′′ −λ1 z ′ )·z ′ )2 = ((|z ′′ |2 +λ1 ))2 = λ2
to the integrand.
The plot in figure 1 show the numerically found ground state of the exponential
dissipation rate, i.e. the minimiser of the functional D/E. Its rate of convergence is
D/E ≈ 505.917, which still leaves a gap to the constant we found in (42), 2π 4 ≈ 200.
We finally are able to give the proof of Theorem 6.
∫1
Proof of Theorem 6. Let ω(t) := 0 ω(t, s) ds be the mean-value of the indicatrix,
19
On the curve straightening flow of inextensible, open, planar curves.
then Theorem 5 implies
∥ω(t, .) − ω(t)∥2L2 ≤
2
E[ω(t, .)]
π2
(43)
due to a Poincaré-inequality which is analogous to (40). Let µ(s) := 6s(1 − s) and
let (ηk ) be a regularizing sequence such that supp ηk ⊂ [−1/k, 1/k]. In (9) we set
ψ(t, s) = ηk (t̃ − t)µ(s) with t̃ > 1/k obtaining
d
dt̃
∫
1
ωk µ ds+
0
∫
+
1
[ ′′′ ′
]
−ωk µ − (ω ′′ (ω ′ )2 ) ∗ ηk µ − (λ′ ω ′ ) ∗ ηk µ + (λ ω ′ ) ∗ ηk µ′ ds = 0 .
0
Let 0 < t1 < t2 and integrate with respect to time, omitting the tilde. We pass to the
limit as k → ∞ to get
∫
∫
t2
ω̂(t2 ) − ω̂(t1 ) +
t1
1
[ ′′ ′ 2
]
−ω (ω ) µ − λ′ ω ′ µ + λ ω ′ µ′ ds dt = 0 ,
0
where
∫
ω̂(t) :=
1
ω(t, s)µ(s) ds
0
∫1
is a weighted mean of the function ω(t, .). Observe that 0 −ω ′′′ µ′ ds =
∫ 1 ′′ ′′
ω µ ds = −12(ω ′ (1) − ω ′ (0)) = 0 has canceled due to the boundary values
0
of ω ′ . We infer
∫
|ω̂(t2 ) − ω̂(t1 )| = t2
t1
∫
0
1
[ ′′ ′ 2
]
−ω (ω ) µ − 2λ′ ω ′ µ − λ ω ′′ µ ds dt ≤
]
3 [ ′′ ′
′′
2
∥ω ω ∥L2t L2s ∥ω ′ ∥L2t L2s + 2∥λ′ ∥L2t L2s ∥ω ′ ∥L2t L2s + ∥λ∥L2t L∞
≤
∥ω
∥
,
∞
Lt Ls
s
2
where norms in the dimension t are evaluated on the interval (t1 , t2 ). We now
interprete the data at t =
√ t1 < t2 as initial datum. Doing so Theorem 5 implies
that ∥ω ′ ∥L2 ((t1 ,t2 );L2s ≤ E[ω(t1 )]/(π 2 ) and together with the results of Corollary 9
this implies
[
(
)
]
3 3√
2
3/2
|ω̂(t2 ) − ω̂(t1 )| ≤
2E[ω(t1 , .)] +
+ 1 E[ω(t1 , .)] .
(44)
2 π2
π2
The decay of the energy by Theorem 5 implies that for a given sequence of times (tn )
the sequence ω̂(tn ) is Cauchy and it yields a limit value which we call ω∞ ∈ R. Setting
t1 = t and t2 = tn in (44) and passing to the limit as n → ∞ we obtain
[
(
)
]
2
3 3√
3/2
2E[ω(t, .)] +
+ 1 E[ω(t, .)] .
(45)
|ω̂(t) − ω∞ | ≤
2 π2
π2
20
D.B. Oelz
Furthermore, since
∫1
0
µ(s) ds = 1, it holds that
∫ 1
|ω(t) − ω̂(t)| = ω(t, s)(1 − µ(s)) ds =
0
∫ 1 (
)
∫ s
= ω(t, 0) +
ω ′ (t, s̃) ds̃ (1 − µ(s)) ds =
0
0
∫ 1 (∫ s
)
′
=
ω (t, s̃) ds̃ (1 − µ(s)) ds ≤
∫
0
1
≤
0
(46)
0
|ω ′ (t, s̃)| ds̃ ≤ ∥ω ′ (t, s̃)∥L2s =
√
2E(t) .
Together, (43), (45) and (46) imply
∥ω(t, .) − ω∞ ∥L2s ≤ ∥ω(t, .) − ω(t)∥L2s + |ω(t) − ω̂(t)| + |ω̂(t) − ω∞ | ≤
(
)
(
[
)
]
√
√
1
3 3√
2
3/2
≤ 2
+1
E[ω(t, .)] +
2E[ω(t, .)] +
+ 1 E[ω(t, .)] .
π
2 π2
π2
We finally use Theorem 5 and control the last term by an interpolation between the
smallest and the largest powers of exp(−π 4 t) using x2 ≤ (x + x3 )/2 for x ≥ 0, which
yields the result.
4
Numerics
In the sequence of figures 2 we visualise the numerical solution of the recursive scheme
(47) for τ = 10−6 at various points in time. Information about time, energy and the
dissipation rate are printed in the title of each frame. Observe the decay of energy as
compared to the changes in the dissipation rate, which seems to evolve according to
the following scheme. Initially it drops rapidly, then, when the energy is around E ≈ 7,
it takes its minimum at D/E ≈ 510 and the profile of the curve results to be U-shaped
and similar to the minimiser of the exponential dissipation rate we found numerically
(figure 1). Finally, the exponential rate increases again up to values D/E ≈ 1000. This
can be explained by claiming that at low energy, due to the bounds of Corollary 9, the
linear components of the model (6) dominate. Stripping (6) from its nonlinearities we
obtain ∂t ω̃ + ω̃ ′′′′ = 0 with ω̃ ′ (0) = ω̃ ′ (1) = ω̃ ′′ (0) = ω̃ ′′ (1) = 0. The exponential
dissipation rate of this model in the sense of (42) would be D/E ≈ 1001.13, namely
twice the first eigenvalue of the respective fourth order operator. We remark that a
future study will be devoted to investigating this asymptotic behaviour in more detail.
A
Construction of solutions by the steepest descent flow
The results Theorem 1 and Corollary 2 respectively are obtained from the usual
construction in the theory of gradient flows and steepest descent flows (cp. [4], [1]).
This result indeed is a special case of the existence proof formulated in [17]. It is based
21
On the curve straightening flow of inextensible, open, planar curves.
t=0.000000 E= 100.00 α=D/E= 677401.0
t=0.000009 E= 63.07 α=D/E= 28698.3
0.5
0.5
0.4
0.4
0.3
0.3
0.2
0.2
0.1
0.1
0
0
−0.1
−0.1
−0.2
−0.2
−0.3
−0.3
−0.4
−0.4
−0.5
−0.5
0
0.5
−0.5
−0.5
t=0.000099 E= 38.30 α=D/E= 3120.3
0.5
0.4
0.4
0.3
0.3
0.2
0.2
0.1
0.1
0
0
−0.1
−0.1
−0.2
−0.2
−0.3
−0.3
−0.4
−0.4
−0.5
−0.5
0
0.5
−0.5
−0.5
t=0.000699 E= 16.16 α=D/E= 860.1
0.5
0.4
0.4
0.3
0.3
0.2
0.2
0.1
0.1
0
0
−0.1
−0.1
−0.2
−0.2
−0.3
−0.3
−0.4
−0.4
0
0.5
−0.5
−0.5
t=0.005999 E= 0.46 α=D/E= 900.8
0
0.5
0
0.5
t=0.009999 E= 0.01 α=D/E= 996.3
0.5
0.5
0.4
0.4
0.3
0.3
0.2
0.2
0.1
0.1
0
0
−0.1
−0.1
−0.2
−0.2
−0.3
−0.3
−0.4
−0.5
−0.5
0.5
t=0.001999 E= 7.29 α=D/E= 515.1
0.5
−0.5
−0.5
0
t=0.000299 E= 24.71 α=D/E= 1444.5
0.5
−0.4
0
0.5
−0.5
−0.5
0
Figure 2: Evolution computed numerically.
0.5
22
D.B. Oelz
on defining the recursive scheme
{∫
Zτ0
= zI
Zτn
and
1
∈ argminw∈A
0
[
] }
1 ′′ 2 |w(s) − Zτn−1 |2
ds , (47)
|w | +
2
2τ
where (Zτn )n=0,1,... is a stepwise in time approximation of z with stepsize τ . The
passage to the limit τ → 0 is based on the estimate which we obtain by summing up
for n = 1, 2, ... the fact that
]
∫ 1
∫ 1[
)
1 ( n
1
1
n−1 2
n ′′ 2
Zτ − Zτ
+ ((Zτ ) ) ds ≤
((Zτn−1 )′′ )2 ds .
(48)
2τ
2
2
0
0
Taking the sum and the passage to the limit with respect to τ with interpolations of the
sequence (Zτn )n implies the L2t L2s a priori bound on ∂t z,
∫
0
∞
∫
∫
1
|∂t z| ds dt ≤ 2 EI =
2
0
1
|zI′′ |2 ds .
(49)
0
Furthermore the variational equation of (47) after letting τ → 0 becomes (5) for
variations δz ∈ H 2 ([0, 1]) and the Lagrange multiplier λ1 = λ1 (t, s) to enforce
the constraint |z ′ | ≡ 1. The system (2) is then the strong formulation of (5)
coupled with the constraint. We remark that the ”natural” boundary conditions
z ′′ , z ′′′ − λ1 z ′ |s=0,1 = 0 are equivalent to those in the formulation (2), since taking
the second derivative of the constraint (3) implies |z ′′ |2 + z ′ · z ′′′ = 0 and therefore it
holds that 0 = z ′ · z ′′′ − λ1 |s=0,1 = −|z ′′ |2 − λ1 |s=0,1 = −λ1 |s=0,1 .
B
Variation of the Kirchhoff bending energy
In this appendix we gather some of the variational formulas needed in Section 2. The
energies defined in (1) and (16) coincide, if the constraint (3) is satisfied. This holds
too for the variations of these energies with respect to variations δz ∈ H 2 ([0, 1], R2 )
which are admissible to the constraint (3), i.e. which satisfy z ′ · δz ′ ≡ 0. Variations in
a general direction δz ∈ H 2 ([0, 1], R2 ) might differ, we compute
)(
)
(δω)′
ω′ ′
ω′
′
δEp [z]δz =
− (p − 2) ′ p z · δz
ds =
|z ′ |p−2
|z ′ |p−2
|z |
0
) ( ′⊥
)′
(
)
∫ 1(
ω′
z · δz ′
ω ′2
=
−
(p
−
2)
z ′ · δz ′ ds
′ |2
′ |2(p−2)
′ |2(p−1)
|z
|z
|z
0
∫
1
(
for a curve satisfying |z ′ (s)| > 0 for all 0 ≤ s ≤ 1 making use of the fact that we can
exchange variation and differentiation in the case of δω ′ = (δω)′ and that the variation
of the indicatrix is given by δω[z]δz = z ′⊥ · δz ′ /|z ′ |2 .
Observe that for p = 5/2 the bending energy corresponds to the square curvature
∫1
functional, E5/2 [z] = 0 (z ′⊥ · z ′′ /|z ′ |3 )2 dz and its variation is invariant with respect
to variations that only act as reparametrisations, i.e. δz = z ′ ϕ for test functions
ϕ ∈ D([0, 1]). In fact it holds that δE5/2 [z]z ′ ϕ = 0.
23
On the curve straightening flow of inextensible, open, planar curves.
Continuing this computation using the assumption that the constraint |z ′ | ≡ 1
holds, the variation of the bending energy reads
∫
1
δEp [z]δz =
ω′
((
z ′⊥ · δz ′
0
[
]1
= ω ′ z ′⊥ · δz ′ 0 +
)′
∫
1
− (p − 2) ω ′ z ′ · δz ′
)
ds =
−ω ′′ z ′⊥ · δz ′ − (p − 2) ω ′2 z ′ · δz ′ ds =
0
∫
)
[
(
]1
= ω ′ z ′⊥ · δz ′ − ω ′′ z ′⊥ + (p − 2) ω ′2 z ′ · δz 0 +
1
( ′′ ′⊥
)′
ω z + (p − 2) ω ′2 z ′ ·δz ds
0
(50)
On the other hand the variation of (1) in a general direction δz but evaluated at a curve
which satisfies the constraint (3) reads
∫
1
′′
∫
′′
z · δz ds =
δE[z]δz =
0
∫
=
1
0
[
1
′′
′ ′
′′
′′
′⊥ ′⊥
′′
|z {z· z} z · δz + z · z z · δz ds =
=0
]1
)′
(
ω ′ z ′⊥ · δz ′′ ds = ω ′ z ′⊥ · δz ′ − ω ′ z ′⊥ · δz +
0
[
(
)
]1
= ω ′ z ′⊥ · δz ′ − ω ′′ z ′⊥ − ω ′2 z ′ · δz 0 +
0
∫
1
(
∫
1
(
ω ′ z ′⊥
)′′
· δz ds =
0
)′
ω ′′ z ′⊥ − ω ′2 z ′ ·δz ds = δE1 [z]δz
0
(51)
and apparently coincides with the variation of E1 .
References
[1] Luigi Ambrosio, Nicola Gigli, and Giuseppe Savaré. Gradient flows in metric
spaces and in the space of probability measures. Lectures in Mathematics ETH
Zürich. Birkhäuser Verlag, Basel, 2005.
[2] A. Balaeff, L. Mahadevan, and K. Schulten. Modeling dna loops using the theory
of elasticity. Phys Rev E Stat Nonlin Soft Matter Phys, 73(3 Pt 1), March 2006.
[3] G. Bijani, N. Hamedani Radja, F. Mohammad-Rafiee, and M. R. Ejtehadi.
Anisotropic elastic model for short dna loops, 2006.
[4] Ennio De Giorgi, Antonio Marino, and Mario Tosques. Problems of evolution in
metric spaces and maximal decreasing curve. Atti Accad. Naz. Lincei Rend. Cl.
Sci. Fis. Mat. Natur. (8), 68(3):180–187, 1980.
[5] Ellis Harold Dill. Kirchhoff’s theory of rods. Arch. Hist. Exact Sci., 44(1):1–23,
1992.
[6] Gerhard Dziuk, Ernst Kuwert, and Reiner Schätzle. Evolution of elastic curves
in Rn : existence and computation. SIAM J. Math. Anal., 33(5):1228–1245
(electronic), 2002.
24
D.B. Oelz
[7] Daniel Henry. Geometric theory of semilinear parabolic equations, volume 840
of Lecture Notes in Mathematics. Springer-Verlag, Berlin, 1981.
[8] G. R. Kirchhoff. Über das Gleichgewicht und die Bewegung eines unendlich
dünnen elastischen Stabes. J. Reine Angew. Math., 56:285–313, 1859.
[9] Norihito Koiso. On the motion of a curve towards elastica. In Actes de la Table
Ronde de Géométrie Différentielle (Luminy, 1992), volume 1 of Sémin. Congr.,
pages 403–436. Soc. Math. France, Paris, 1996.
[10] Joel Langer and David A. Singer. Knotted elastic curves in R3 . J. London Math.
Soc. (2), 30(3):512–520, 1984.
[11] Joel Langer and David A. Singer. Curve straightening and a minimax argument
for closed elastic curves. Topology, 24(1):75–88, 1985.
[12] Joel Langer and David A. Singer. Curve-straightening in Riemannian manifolds.
Ann. Global Anal. Geom., 5(2):133–150, 1987.
[13] Chun-Chi Lin and Hartmut R. Schwetlick. On the geometric flow of Kirchhoff
elastic rods. SIAM J. Appl. Math., 65(2):720–736 (electronic), 2004/05.
[14] Anders Linnér. Some properties of the curve straightening flow in the plane.
Trans. Amer. Math. Soc., 314(2):605–618, 1989.
[15] Anders Linnér. Curve-straightening and the Palais-Smale condition. Trans. Amer.
Math. Soc., 350(9):3743–3765, 1998.
[16] Anders Linnér. Symmetrized curve-straightening. Differential Geometry and its
Applications, 18(2):119 – 146, 2003.
[17] Dietmar Oelz and Christian Schmeiser. Derivation of a model for symmetric
lamellipodia with instantaneous crosslink turnover. preprint, 2009.
[18] Dietmar Oelz and Christian Schmeiser. Simulation of lamellipodia with arbitrary
shape. to be submitted, 2009.
[19] Dietmar Oelz and Christian Schmeiser. Cell mechanics: from single scale-based
models to multiscale modelling, chapter How do cells move? mathematical
modelling of cytoskeleton dynamics and cell migration. Chapman and Hall /
CRC Press, to appear, 2009.
[20] Shinya Okabe. The motion of elastic planar closed curves under the areapreserving condition. Indiana Univ. Math. J., 56(4):1871–1912, 2007.
[21] Yingzhong Wen. L2 flow of curve straightening in the plane. Duke Math. J.,
70(3):683–698, 1993.
[22] Yingzhong Wen. Curve straightening flow deforms closed plane curves with
nonzero rotation number to circles. J. Differential Equations, 120(1):89–107,
1995.
S⃗eMA Journal
no 54(2011), 25–46
ON THE MODELING OF CROWD DYNAMICS: AN OVERVIEW AND
RESEARCH PERSPECTIVES
N. BELLOMO∗ , C. BIANCA∗ , AND V. COSCIA†
∗
†
Department of Mathematics, Politecnico di Torino, Italy,
Department of Mathematics, Università di Ferrara, Italy.
[email protected], [email protected],
[email protected]
Abstract
This paper presents an overview of the mathematical approaches to modeling
crowd dynamics by taking into account the complexity of living systems. The
contents refer to the classical representation scales (microscopic, kinetic, and
macroscopic) and to the mathematical frameworks that can be used for the
modeling approach. The analysis focuses on the kinetic scale and, specifically, on
developments of the mathematical kinetic theory of particles with heterogeneous
behaviors. The existing literature is critically analyzed looking ahead to research
perspectives and, more precisely, to a unified modeling strategy in view also of
swarm dynamics modeling.
A
Introduction
This paper presents an overview, followed by a critical analysis, of the mathematical
literature concerning the modeling of crowd dynamics looking forward to the
challenging objective of depicting, by mathematical equations, the complex dynamics
of swarms. Crowds, swarms, and, to a certain extent, vehicular traffic belong to the
class of complex systems. A complex system can be regarded as a large ensemble of
entities that interact each other by rules, which follow specific strategies, and that have
the ability to communicate and organize their own dynamics according to both their
own individual strategy and their interpretation of that of the others [7].
Received: August 27, 2010. Accepted: November 30, 2010.
NB acknowledges the support by the European Union FP7 Health Research Grant number FP7HEALTH-F4-2008-202047. CB acknowledges the support by the FIRB project RBID08PP3J-Metodi
matematici e relativi strumenti per la modellizzazione e la simulazione della formazione di tumori,
competizione con il sistema immunitario, e conseguenti suggerimenti terapeutici, and by the Compagnia
di SanPaolo, Torino, Italy. VC acknowledges the support by the project ”Environmental effects of vehicular
traffic; Mathematical models and simulations”, University of Ferrara, Italy.
25
26
N. Bellomo, C. Bianca, V. Coscia
A major challenge in the study of complex systems lies in the consideration that the
knowledge of the dynamics and interaction of a few entities is not, in general, sufficient
to describe the collective dynamics of the overall system. A further difficulty resides
on the fact that individual dynamics cannot be observed for every single entity, while
the overall behavior can be described and geometrically interpreted.
The previous considerations lead to conclude that complex systems are difficult to
model or understand at a global level based only on the description of the dynamics of
individual elements. On the other hand, recently, some of the systems cited above have
attracted the attention of scientists involved in the design and construction of models.
The present overview takes advantage of the literature on vehicular traffic, which
has widely developed after the pioneering paper by Prigogine and Hermann [56]
as documented in the review paper [11] and in the collection of papers [9]. A
valuable contribution is offered, from the view point of engineering sciences of traffic
phenomena, by the paper of Daganzo [22], who puts in discussion the continuity
assumptions both of the modeling at the macroscopic scale, based on the paradigms of
continuum mechanics, and of the kinetic theory based on the assumption of continuous
dependence of the distribution function over the state of vehicles at the microscopic
scale. Indeed, he observes that the approach of fluid mechanics refers to thousands of
particles, while only a few vehicles are involved by traffic jams.
Moreover, it is correctly claimed that the modeling approach should take into
account the heterogeneous distribution of the quality of drivers and vehicles. Finally,
it is remarked that models should be characterized by a simple structure, considering
that complex models need a large number of parameters, which may not be identified
in practice. It is plain that these criticisms need to be carefully taken into account
also in the case of crowd modeling. For instance, the comment concerning the
heterogeneous distribution of the quality of interacting entities suggests to use methods
recently developed to model complex systems of the living matter [4, 8]. Moreover,
the modeling needs to take into account additional important characteristics such as,
for instance, a remarkable influence of the geometry and quality of the environment
that contains pedestrians and the transition from normal to panic conditions.
This paper aims at offering to the interested reader not only a review of the state of
the art, but mainly various hints towards research perspectives in a challenging field,
where applied mathematicians generally agree that modeling has not yet reached a
satisfying level and that additional work needs to be developed. This is organized with
three more sections each of them characterized by a well defined objective.
More precisely, Section 2 introduces the concept of scaling and representation
according to the microscopic and macroscopic scales. This section includes
also a review, and critical analysis, of models derived at the microscopic and
macroscopic scales, which refer, respectively, to individually identified pedestrian and
to macroscopic observable quantities such as local density and linear momentum.
Section 3 presents some speculations on the modeling approach by methods
of the generalized kinetic theory. Four mathematical frameworks, selected among
various conceivable ones, are analyzed with the aim of understanding how far they
can act as background paradigms for the derivation of specific models. These
frameworks are derived and critically analyzed to put in evidence positive and negative
properties. Indeed, the modeling strategy starts from the selection of the most
On the Modeling of Crowd Dynamics
27
appropriate mathematical structure. One of them refers to the granular essence of
crowd dynamics, and is based on the discretization of velocity space to overcome the
uncorrect continuity assumptions of the probability distributions used in kinetic theory.
Section 4 further proposes some guidelines toward the modeling by the kinetic
theory of active particles, where the interacting entities are modeled as active particles
with the ability to develop specific strategies that are heterogeneously distributed.
Moreover, an insight to modeling panic conditions is proposed, and finally a brief
overview is given on the modeling of swarms by suitable development of the approach
used to model crowd dynamics.
B
Scaling and Representation Problems
Pedestrian dynamics has not been studied as extensively as vehicular traffic, although
the literature in the field is rapidly developing, as documented in the paper by Helbing
and Molnár [33], Hoogendoorn et al. [40, 41]. The aim of modeling consists in
describing collective and self-organization phenomena from a detailed analysis of the
dynamics at the microscopic scale. As known, the first step of the modeling approach,
for all systems of the real world, consists in the identification of the observation and
modeling scales. Subsequently, for each scale the parameters and the variables to
be used towards modeling need to be identified. Classically, the following types of
descriptions can be considered:
Microscopic description, which refers to entities individually identified. In this case,
their position and velocity identify, as dependent variables of time, the state of the
whole system. Mathematical models are generally stated by systems of ordinary
differential equations.
Macroscopic description, which is used when the state of the system is described
by averaged gross quantities, namely density, linear momentum, and kinetic energy,
regarded as dependent variables of time and space. Mathematical models describe the
evolution of the above variables by systems of partial differential equations.
An alternative to the above two scaling approaches is delivered by the kinetic
theory, which is used when the microscopic state of the interacting entities is still
identified by the position and velocity, but their representation is delivered by a suitable
probability distribution over the microscopic state. Mathematical models describe the
evolution of the above distribution function generally by nonlinear integro-differential
equations. This method is treated in the next sections. This section specifically deals
with the representation and modeling at the two above defined scales. Subsequently
a critical analysis is proposed focusing on the crucial problem of the validation of
models.
Bearing this in mind, let us consider a representation referred to two-space
dimensions, while the generalization to a three-space dimension can be formally
obtained by straightforward calculations. Therefore, let Ω be the closed domain
occupied by the crowd whose boundary is ∂Ω. Let ℓ, nM , and VM , respectively, the
largest dimension (diameter) of the domain Ω, the maximum density corresponding
to the highest admissible packing, and the maximum admissible mean velocity which
may be reached, in average, by pedestrians in free flow conditions. Of course isolated
28
N. Bellomo, C. Bianca, V. Coscia
pedestrians can reach a limit velocity Vl larger than VM , and Vl = (1 + µ)VM where
µ > 0 is a constant which depends on the characteristics of the domain as well as on
the type of pedestrian.
T
~ν (P)
P
∂Ω
Ω
Figure 1: – Geometry of the domain Ω occupied by the crowd.
The assessment of the independent variables follows, namely t = tr /TC is the
dimensionless time variable referred to the critical time TC = ℓ/VM necessary to the
fastest pedestrian to move at the maximum mean velocity VM along the diameter ℓ
of Ω; x = xr /ℓ, y = yr /ℓ, v = vr /VM are the dimensionless space and velocity
variables. Accordingly, the representation at the two scales that have been previously
indicated can be defined as follows.
B.1 Modeling at the microscopic scale
Let Ω be the crowd domain occupied by N pedestrians. The microscopic description
of the pedestrian dynamics is represented, for each i-th pedestrian with i ∈ {1, . . . , N },
by the following dimensionless variables:
The position vector xi = xi (t) = (xi (t), yi (t));
The velocity vector vi = vi (t) = (vxi (t), vyi (t)).
Mathematical models are generally stated as a large system of ordinary differential
equations where xi and vi are the dependent variables.
The mathematical structure used to model the dynamics of a system constituted
by a system of N interacting pedestrians is borrowed from the classical Newtonian
mechanics as follows:

dxi


= vi ,


dt
(1)


dvi


= Fi (x1 , . . . , xN , v1 , . . . , vN ) ,
dt
On the Modeling of Crowd Dynamics
29
which is valid when pedestrians can avoid a discontinuous interaction with the walls.
Here Fi models the interaction of the i-th pedestrian with all the others. In the presence
of time dependent external actions the time can be inserted in the argument of Fi .
The solution of (1) provides the time evolution of position and velocity of
pedestrians. Macroscopic quantities are obtained by suitable average performed either
at fixed time over a suitable space domain or at fixed spatial position over a suitable
time interval. In both cases uncertainties and fluctuations cannot be avoided. Different
modeling approaches correspond to different ways of describing the acceleration term
on the basis of a detailed interpretation of individual behaviors.
An interesting example is the so-called social force model, introduced in [33].
These kinds of models are based on the assumption that interactions among pedestrians
are implemented by using the concept of a social force or social field [44]. Important
examples of social force models can be found in [30, 33, 35], and therein references.
The model proposed by Seyfried et al. [57] deals with pedestrians treated as particles
subject to long-range forces induced by the social behavior of the individuals.
The social force model is able to cover several natural phenomena which occur
during pedestrian movements. For instance:
i) Pedestrians normally choose the fastest route and chase a well defined target T as
visualized in Figure 1;
ii) The concept of desired speed can be introduced to reflect the motivation of the
pedestrians to reach the desired goal with the desired speed;
iii) Pedestrians move with an individual speed, taking into account the situation, sex,
age, handicaps, surroundings, and so on. The speed can be, in this case, assumed to be
Gaussian distributed [36];
iv) Pedestrians keep a certain distance from other pedestrians. The distance depends
on the pedestrian density and walking speed. Suitable repulsive, short-range, potentials
can be introduced to describe these phenomena.
v) The interaction potential can be attractive, for long-range interactions, to model
the aggregation phenomena of pedestrians, who often show a trend to walk in groups.
Once separated (for instance if a pedestrian has to avoid an obstacle), the individual
pedestrians try to reform the group.
The above guidelines define a methodological approach to be followed towards the
derivation of specific models. In general, simplicity of the structure of the model is
needed, namely the design of models should include a small number of parameters to
be identified by experimental data.
Technically different approaches can be developed at the microscopic scale with the
aim of reducing computational complexity. For instance, cellular automata models
have recently been used to simulate pedestrian flows, see among others [15, 29, 49, 50].
The model simulates pedestrians as entities (automata) in cells. The walkway is
modeled as grid cells and a pedestrian is represented as a circle that occupies a cell.
One of the crucial problems of the modeling at the microscopic scale consists
in dealing with a large number of equations and in transferring the microscopic
information to the macroscopic level, namely to physical quantities which can be
possibly observed and measured.
30
N. Bellomo, C. Bianca, V. Coscia
B.2 Modeling at the macroscopic scale
Modeling the crowd dynamics at macroscopic scale is far less developed than vehicular
traffic [11]. However, the interest in this specific field is rapidly growing related
to various motivations such as engineering applications, for instance interactions of
crowds and the structures of lively bridges [28, 46, 58, 60, 61]; or the modeling of
panic situations and related safety initiatives. For instance, Coscia and Canavesio [18]
develop calculations related to crowd dynamics on the pilgrims Jamarat bridge (Saudi
Arabia), where every year safety conditions are violated.
Macroscopic modeling has been initiated by Henderson’s pioneering works [36,
37], that proposes a modeling approach by equations related to the kinetic theory of
the homogeneous gas constituted of statistically independent particles in equilibrium
in a two-dimensional space. This approach is also documented in Henderson and
Lyons [38], Henderson and Jenkins [39]. Subsequently, Hughes [42, 43] extended
Henderson’s fluid dynamics approach to allow the introduction of factors related to
human decision and interaction by a model that represents pedestrians as a continuous
density field, where an evolving potential function models the guide of the density field
optimally towards its goal. Additional developments of the pedestrian flow modeling
at the macroscopic scale are due to Hoogendoorn and Bovy [40] and Hoogendoorn et
al. [41], which developed a pedestrian flow model formulated into a user’s dynamic
equilibrium assignments. Additional literature is given in [11].
The macroscopic representation of a system constituted by a large number of
interacting individuals concerns groups of pedestrians rather than individual units. The
macroscopic description may be selected for high density, large scale systems in which
the knowledge of the local behavior of groups is sufficient. In details, the macroscopic
description is defined by the following dimensionless variables:
The local density ρ = ρ(t, x) which is referred to the maximum density nM of
pedestrians;
The mean velocity V = V(t, x) which is referred to VM of pedestrians.
The relationship among the flow rate q, the mean velocity V, and the pedestrian
density ρ is q = ρ V.
The mathematical structure suitable for the derivation of specific models
corresponds to conservation of mass and linear momentum, see Hughes [42] and [43]:

∂t ρ + ∇x · (ρV) = 0 ,

(2)

∂t V + (V · ∇x )V = A[ρ, V] ,
where the dot-product denotes inner-product of vectors and A[ρ, V] is a psychomechanical acceleration acting on pedestrians in the elementary macroscopic volume
of the physical space.
Specific models can be designed referring to the above frameworks. In details:
• First-order models are obtained by mass conservation only linked to a closure of the
equilibrium velocity Ve = Ve (ρ). The guidelines for the derivation of models are
analogous to those followed for vehicular traffic flow modeling [12] with the addition
of the selection of flow direction towards the target of pedestrians, for instance the
On the Modeling of Crowd Dynamics
31
exit. These models have the advantage of a structural simplicity, however they need
plugging experimental data into the model. Due to their simplicity, first-order models
may be useful for some specific applications as documented in the already cited paper
[18], where simulations are developed in a complex geometry.
The ability of the model to depict real flow conditions encourages further studies in
the field, for instance by inserting in the model an intelligent selection of the optimal
pathways related to local flow conditions as well as the influence of panic conditions.
Specifically, these conditions may attract pedestrians towards overcrowding, which are
avoided in normal flow conditions.
• Second-order models are obtained by both equations (2) with the addition of a
phenomenological relation describing the psycho-mechanic acceleration A[ρ, V]. It
seems quite difficult dealing with higher order models due to problems related to a
correct identification of the energy for a system, where the overall amount of available
energy also depends on the individual ability to develop a strategy.
The acceleration A can be viewed, according to a simple approach, as the
superposition of two contributions due to an adaptation to the mean equilibrium flow
velocity Ve measured in steady uniform flow conditions and to local density gradients.
Both contributions are supposed, in a first approximation, to act along the unit vector
⃗ν = ⃗ν (P) directed from P to the target T, as shown in Figure 1. Namely, individuals
at the point P = (x, y) aim to reach at a destination T = (xT , yT ) along their intended
direction of movement given by the unit vector ⃗ν .
Bearing all above ideas in mind, a brief review of some second-order macroscopic
models is given referring specifically to [10]. In details, the acceleration can be
modeled by the contribution of two terms:
A[ρ, V] = AF [ρ, V] + AP [ρ, V] ,
to be modeled according to the following two assumptions:
2.1. The frictional acceleration AF is proportional to the difference between the actual
velocity V and the mean equilibrium velocity Ve (ρ, ⃗ν ) corresponding to the local
density, that is: AF = cF (ρ) (Ve (ρ, ⃗ν ) − V) ,
2.2. The acceleration AP between pedestrians is determined only by the gradient of
pedestrian density as: AP = −cP (ρ) ∇x ρ .
Several different models of the above terms are proposed in [10], where a
qualitative analysis, focused on the properties of hyperbolic structures, is developed.
Models can be further refined by taking the gradients in the computation of the local
mean velocity, for instance by computing it at a density higher than the real one in the
presence of positive gradients and at a lower one for negative gradients, see [23]. More
in general, A is conditioned by ⃗ν .
It is worth mentioning, in the line of this perspective, that Piccoli and Tosin [54, 55]
have proposed a model of evolution of probability measures occupied by the crowd.
It is an interesting approach which, although developed within the framework of the
macroscopic scale, refers to the dynamics at the lower scale, namely to the strategy
developed by pedestrians.
Their approach can be classified as related to first order models, where the mean
velocity is replaced by a probability measure of the velocity field. The evolution is
32
N. Bellomo, C. Bianca, V. Coscia
obtained by adaptation of pedestrians to the density in their visibility zone that is also
related to the target pursued by them, for instance to the exit area [10]. A perspective
idea, however still limited to vehicular traffic, is offered by [12], where the velocity
variable is linked to a probability density which evolves in time.
B.3 Model validation
The validation of models can be based on a suitable comparison between predictions of
the model and empirical data. A preliminary step consists in analyzing the advantages
and disadvantages of the selection, in the modeling approach, of one of the above
scales that may even be different from that used to collect empirical data. Accordingly,
the following ten remarks are selected among several ones to be taken into account as
guidelines of the modeling approach:
i) Crowds are definitely discrete with finite degrees of freedom. However it
is necessary, for practical purposes, that the model allows the computation of
macroscopic quantities;
ii) The flow is not continuous, hence models derived at the macroscopic scale are not
consistent with the classical paradigms of continuum mechanics;
iii) The number of individual entities is not large enough to allow the use of continuous
distribution functions within the framework of the mathematical kinetic theory;
iv) Interactions are not localized, as in the case of classical particles, considering that
pedestrians adapt their dynamics to the flow conditions within their visibility zone;
v) Pedestrians have the ability to modify their dynamics according to specific strategies.
This ability is heterogeneously distributed;
vi) This self-organizing ability and the overall strategy is substantially modified by
environmental conditions, such as the appearance of panic situations;
vii) Mathematical models should include a limited number of parameters related
to well defined physical phenomena, which should be technically identified by
experiments;
viii) Empirical data should not be artificially plugged into mathematical models, which
should reproduce them after an appropriate choice of the parameters;
ix) The output of experiments is very sensitive to environmental conditions;
x) Mathematical models are required to reproduce empirical data in steady uniform
flow conditions, and at least at a qualitative level, emerging phenomena which are
observed in unsteady flow.
Empirical data can be roughly classified into two main categories, namely
quantitative results, for instance focused on steady flow conditions or outlet flows, and
qualitative description of emerging behaviors related to the collective self-organizing
ability of humans and animals. Both types of data can and should be used to validate
models, while their artificial plugging into models should be avoided. Specifically,
the ETH report by Buchmueller and Weidmann [17] provides a rich amount of data
and parameters concerning pedestrian traffic related to walking facilities. The report
is very detailed on specific parameters such as pedestrian dimension and weight,
On the Modeling of Crowd Dynamics
33
viewed as heterogeneously distributed variables, energy consumption, in different
types of pathways, walking speed, lateral oscillation, flow rates, and other similar
data. The report also analyzes the heterogeneous behavior of individuals related to
age, handicaps, and so on. An important information is delivered by the so-called
velocity and fundamental diagrams, which report the one-directional mean equilibrium
dimensionless velocity Ve = Ve (ρ) and flux qe = Ve (ρ) ρ versus the dimensionless
density ρ. An important test for validation is the analysis of the ability of models to
reproduce these diagrams by transferring the dynamics at the microscopic scale into
the macroscopic behavior in steady uniform flow. This result has been achieved in the
case of vehicular traffic [16, 24], but it still needs to be properly developed in the case
of crowd dynamics.
Experimental activities on crowd emerging phenomena are focused on the selforganizing ability of pedestrians as documented in [34], where a variety of emerging
behaviors are visualized. This type of investigation has some analogy with that related
to swarms that we shall see later. A particularly relevant issue refers to experiments
concerning panic phenomena [31] to capture the substantial difference with respect
to the flow in normal conditions. Crowds exhibit self-organizing abilities in various
circumstances [48], which are not observed in vehicular traffic, while other emerging
phenomena present specific analogies, for instance instabilities that generate phase
transition to jam conditions [52].
A further aspect to be considered is the heterogeneous behavior of drivers and
pedestrians. This aspect is even more critical in the physics of crowds where changes in
the environmental conditions can introduce substantial modifications in the individual
behaviors. Indeed, this is the case of transition from normal to panic conditions, where
individuals loose their trend to the target and are attracted by directions, correctly
or incorrectly, related to escape danger. Modeling has to take into account also the
fact that the human interpretation of danger is not, at least in some cases, correct.
For instance, escaping a danger can be identified by the localization of overcrowded
areas, which constitute additional danger, and a subsequent additional panic. This is
an interesting topic, see [31], which is definitely worth for future research activity to
be properly related to well-defined models.
Finally, let us stress that the modeling of heterogeneous behaviors is lost in the
averaging process induced by the macroscopic approach, while it increases the already
relevant complexity of models at the microscopic scale. This aspect can be consistently
treated by methods of the kinetic theory as we shall see in the following.
C
Models at the Kinetic Scale by Generalized Distribution Function
Methods of the generalized kinetic theory have not yet been systematically applied
to model crowd dynamics although some perspective ideas have been given in [5, 8].
Mathematical models are derived, similarly to the classical kinetic theory, by a balance
of the number of pedestrians, hereinafter also called particles, in the elementary volume
of the phase space. Accordingly, the transport term is equated to the net flow in
such volume as it is computed according to specific models at the microscopic scale.
Different models correspond to different ways of modeling interactions that may be
34
N. Bellomo, C. Bianca, V. Coscia
localized or long-range in space. Models may also include the role on the dynamics
of interactions of the dimension of particles. Modeling of individual dynamics should
take into account both interactions among particles and their trend toward a certain
objective, for instance the exit area.
More in details, this section presents four mathematical frameworks, selected
among various conceivable ones, that can act as background paradigms for the
derivation of specific models. Indeed, we claim that the modeling strategy should
include the selection of the most appropriate mathematical structure. These
frameworks are critically analyzed to put in evidence the elements that are useful to the
modeling approach. In general, it is worth mentioning that none of these frameworks
takes into account the heterogeneous behavior of pedestrians. Moreover, transitions
from normal flow to panic conditions involves substantial modifications. Both issues
will be discussed in the next section.
In general, the kinetic theory approach is such that the description of the overall
state of the system is given by the following distribution function normalized with
respect to nM :
f = f (t, x, v) : [0, T ] × Ω × Dv → R+ ,
(3)
where Dv ⊂ R2 is the domain of the velocity variable. If f is locally integrable,
f (t, x, v) dx dv denotes the number of individuals, which, at the time t, are in the
elementary volume of the microscopic states [x, x + dx] × [v, v + dv].
Macroscopic observable quantities are obtained by moments weighted by the
velocity variable. For instance, the dimensionless local density and the local flux
respectively read:
∫
ρ(t, x) =
f (t, x, v) dv ,
(4)
v f (t, x, v) dv .
(5)
Dv
∫
and
q(t, x) =
Dv
C.1
Phenomenological models
Let us first consider the framework characterized by the following structure:
(
)
∂t + v · ∂x + F(t, x) · ∂v f (t, x, v) = Q[f, ρ, ⃗ν ](t, x, v) ,
(6)
where F = F(t, x) models the acceleration applied to pedestrians by the outer
environment, and Q[f, ρ, ⃗ν ] = Q[f, ρ, ⃗ν ](t, x, v) depends on the local distribution
function. Q can be derived on a phenomenological basis, and that can be parameterized
by local macroscopic quantities.
A simple way to model the term Q consists in describing a trend to equilibrium
analogous to the BGK model in kinetic theory:
(
)
Q[f, ρ, ⃗ν ](t, x, v) = cr (ρ) fe (v|ρ, ⃗ν ) − f (t, x, v) ,
(7)
where the rate of convergence cr = cr (ρ) depends on the local density ρ, and fe =
fe (v|ρ, ⃗ν ) denotes the equilibrium distribution function that may be parameterized by
On the Modeling of Crowd Dynamics
35
the local density ρ and by the direction ⃗ν towards the target T. Models generally
assume F = 0; however, acceleration terms may be imposed by the outer environment,
for instance signalling to accelerate or decelerate that may be useful in the case of rapid
evacuation.
Phenomenological models have the advantage of a structural simplicity. They do
not need modeling interactions at the microscopic scale; a constant trend is assumed
toward an equilibrium velocity prescribed a priori. Such a function, as analyzed
in Section 2, is difficult to measure and should be the output of interactions at the
microscopic scale. Moreover, it should be parameterized with respect to the quality of
the environment.
C.2
Models with long-range interactions
Long-range interactions are important in the modeling of particle interactions
considering that individuals develop their own strategy not only by taking into account
nearby particles, but also long distance ones.
This class of models needs the definition of the following two types of particles
involved in the interaction. Test particles representative of the system, being described
by the distribution function f , and field particles which belong to the visibility zone
∆P of the test particles.
The corresponding structure is as follows:
(∂t + v · ∂x + F(t, x) · ∂v + ∂v F[f ]) f (t, x, v) = 0 ,
(8)
where F[f ] = F[f ](t, x, v) represents the acceleration/deceleration due to the
interactions with the field particles which affect the motion of the test particle. This
term, obtained by summing all actions of the field particles belonging to the visibility
zone ∆P of the test particle P localized in x, see Figure 2, reads:
∫
∫
F[f ](t, x, v) =
∆P
φ(x, x∗ , v, v∗ |ρ, ⃗ν )f (t, x∗ , v∗ ) dx∗ dv∗ .
(9)
Dv
where φ = φ(x, x∗ , v, v∗ |ρ, ⃗ν ) is the positional acceleration/deceleration term
applied by the field particles, localized in x∗ with velocity v∗ , to the test particle,
localized in x with velocity v.
The modeling approach needs the identification of the positional acceleration φ
within a suitable visibility zone. Also in this case, the trend of particles towards the
target has to be taken into account, while local density distribution affects the aforementioned actions.
The advantage of this class of models is that the strategy of individuals is identified
by the term φ, which may be parameterized by ρ and ⃗ν . Therefore the development of
appropriate experiments should be addressed to identify such term. The withdraw is
in the additivity of the action applied by all particles in the visibility zone, while this
is not true in general considering that the strategy developed by the test particle is the
output of the field particles viewed as a whole.
36
N. Bellomo, C. Bianca, V. Coscia
T
~ν (P′ )
Obstacle
P′
∆P′
~ν (P)
P
∆P
∂Ω
Ω
Figure 2: – Target T and visibility zones ∆P and ∆P′ of the particles localized in P
and P′ .
C.3
Models with weighted binary interactions
These models are based on the assumption that the intensity of the interaction is
fractionated among the field particles which belong to the visibility zone ∆x of the
test particle localized in x. Accordingly, in order to measure the different intensity
expressed by the field particles localized in x∗ , a weight function w(x, x∗ ) : Ω×∆x →
R+ is introduced such that the intensity interaction expressed by w(x, x∗ ) tails off
when the ’distance’ between the test particle localized in x and the field particle
localized in x∗ increases. Moreover the following identity holds true:
∫
w(x, x∗ ) dx∗ = 1.
∆x
The mathematical framework is derived considering another kind of particle, called
candidate. The interaction rule is as follows: Candidate particles can acquire, in
probability, the state of the test particles after an interaction with field particles, while
test particles loose their state after interactions. The dynamics is described by the
following quantities:
η = η(v∗ , v∗ |ρ(t, x∗ )) is the encounter rate between the candidate particle (or test),
with velocity v∗ (or v) and the field particle with velocity v∗ .
P = P(v∗ → v|v∗ , v∗ , ρ, ⃗ν ) is the probability density that a candidate particle, with
velocity v∗ , ends up into the state v of the test particle after the interaction with the
field particle, with velocity v∗ .
This quantity, defined the table of games, is also assumed to depend on ρ and ⃗ν . In
On the Modeling of Crowd Dynamics
particular, P satisfies, at fixed ρ ∈]0, 1[, the following condition:
∫
P(v∗ → v|v∗ , v∗ , ρ, ⃗ν ) dv = 1 , ∀ v∗ , v∗ ∈ Dv .
37
(10)
Dv
The corresponding mathematical structure is as follows:
(
)
∂t + v · ∂x + F(t, x) · ∂v f (t, x, v)
∫ ∫ ∫
w(x, x∗ )η(v∗ , v∗ |ρ(t, x∗ ))A(v∗ → v|v∗ , v∗ , ρ, ⃗ν )
=
∆x
Dv
Dv
×f (t, x, v∗ ) f (t, x∗ , v∗ ) dx∗ dv∗ dv∗
∫ ∫
−f (t, x, v)
w(x, x∗ )η(v, v∗ |ρ(t, x∗ ))f (t, x∗ , v∗ ) dx∗ dv∗ . (11)
∆x
Dv
This mathematical structure corresponds to the, so-called, averaged stochastic
games [8]. It is immediate to show that simply by assuming w(x, x∗ ) = δ(x∗ −
x), where δ denotes the Dirac’s delta function, it yields models with long-range
interactions.
A brief account can now be given focusing on the modeling of the term A.
Following [5], particles modify their velocities due to interactions by feeling two
different type of actions:
i) Trend towards a target, for instance the exit zone;
ii) Trend towards the stream of the particles on the visibility zone ∆x of the test
particles;
These trends, namely the modification of velocity induced by them, are related to
α(1 − ρ) in the first case, and to αρ in the second case. More precisely, α(1 − ρ)
and αρ indicate the fraction of particles (pedestrians), which modify their velocity due
to interactions. Therefore, the first trend increases with increasing density, while the
second one with decreasing density. Moreover, α is a parameter modeling the quality
of the environment, in detail α = 0 indicates the worst conditions where the movement
is practically prevented, while α = 1 denotes the best conditions where the movement
is at the easiest level.
The critical analysis of this structure leads to conclusions analogous to that of the
preceding subsection, with the additional advantage of providing a framework that can
be further developed towards modeling nonlinear interactions at least by the weight
function that privileges certain areas with respect to the others.
C.4
Models with discrete set velocity
The contents of this subsection is motivated by the already mentioned criticisms on
the continuity assumption of the distribution function over the microscopic state of
particles. This topic has already been treated in [14, 19, 26] to take into account
the granular nature of vehicular traffic by means of kinetic type models with discrete
velocities. The same approach can be generalized to crowd modeling in a way that
38
N. Bellomo, C. Bianca, V. Coscia
the overall state of the system is described by a discrete probability distributions over
groups of particles with velocity within a certain velocity domain. In fact, particles
tend to move in clusters, which can be identified and distinguished from each other by
a discrete set of velocity values. Perspective ideas on this approach are given in [5],
and in the work to appear [6].
It is worth mentioning that this approach is based on different motivations with
respect to the so-called discrete Boltzmann equation that is a crude approximation of
physical reality. Here, the discrete velocity space is used to simulate the noncontinuous
behavior of the distribution function, namely it is used to simulate granular flow.
Borrowing some ideas from [6] the velocity variable is considered in polar coordinates
(v, θ), where v ∈ [0, 1] is the velocity module of v and θ ∈ [0, 2π) is the angle that
a horizontal line forms with the velocity vector v. The following discrete sets are
considered:
Iv = {v1 = 0 , . . . , vi , . . . , vn = Vl } ,
and
Iθ = {θ1 = 0 , . . . , θj , . . . , θm+1 = 2 π} .
Let fij (t, x) = f (t, x, vi , θj ), then the corresponding discrete representation is as
follows:
n ∑
m
∑
f (t, x, v) =
fij (t, x)δ(v − vi ) δ(θ − θj ) .
(12)
i=1 j=1
Macroscopic quantities are obtained by weighted sums, which replace the integrals (4)
and (5).
Application in this present case of the same approach used to derive the
mathematical structure in the continuous case yields:
(
)
∂t + vi · ∂x fij (t, x) = H[f ](t, x)
n
m ∫
∑
∑
ij
=
η[ρ(t, x∗ )]ω(x, x∗ )Phk,pq
(ρ, ⃗ν ) fhp (t, x) fkq (t, x∗ ) dx∗
h,k=1 p,q=1 ∆x
n ∑
m ∫
∑
− fij (t, x)
k=1 q=1
η[ρ(t, x∗ )]ω(x, x∗ )fkq (t, x∗ ) dx∗ ,
(13)
∆x
ij
where Phk,pq
denotes the probability density that a (hk)-candidate particle reaches the
(ij)-state after interaction with a (pq)-field particle. The meaning of the other terms
can be recovered by the framework derived in Subsection 3.3.
The qualitative analysis of this framework leads to the same conclusions addressed
to the preceding framework with the additional advantage of the use of discrete values
of the velocity that appears to be more consistent with the physics of the system
under consideration. However, the smoothness with respect to v is lost therefore the
application of the force F = F(t, x) cannot be considered in the same way of the
previous cases. This framework has the advantage, with respect to that of Subsection
3.3, to use discrete variables that are more suited to capture the granular nature of
the real system. Modeling the application of external actions needs, as we have seen,
the approximation of the derivatives with respect to the discrete velocity variables.
Analogous reasonings are needed when the also the space variable is discrete [13].
On the Modeling of Crowd Dynamics
D
39
Research perspectives
This final section aims at offering to the interested reader some hints to develop
research activity in the field. The guidelines to pursue such objective can be
summarized, according to the authors’ bias, as follows:
– Crowds need to be regarded as a complex system characterized by some specific
features of living systems. More precisely, the interacting entities have the ability to
develop specific strategies, which depend on the state and localization of the other
entities in their interaction domain. Therefore, interactions are nonlinear and not
additive. –
– This ability is heterogeneously distributed among interacting entities. In other
words, such self-organizing ability is not the same for all individuals, and has to be
regarded as a random variable linked to a probability distribution that might have a
local nature and may be modified by several types of interactions at the microscopic
level. Moreover, modeling individual dynamics does not straightforwardly lead to the
mathematical description of the collective dynamics. –
– Rather than selecting one only modeling scale, the mathematical approach should
also consider the simultaneous interaction of two scales, where the lower scale
modifies the heterogeneous behavior at the higher scale. The lower scale refers to
interactions, which do not follow deterministic rules, but can be classified as stochastic
games. –
Differently from the preceding sections, a mathematical formalization is here
substituted by conceptual reasonings, which wait for a detailed mathematical approach
to be developed within specific research programs. Bearing all above in mind, let
us first consider the problem of modeling the heterogeneous behavior of pedestrians.
This matter can be treated according to the methods of kinetic theory of active
particles theory [8], by introducing an activity variable u in the microscopic state
of pedestrians, called here and after active particles.
The microscopic state is the defined by a tern (x, v, u) with u ∈ [0, 1], where u = 0
corresponds to the worse admissible ability, may be due to handicaps, while u = 1 to
the best one, may be at some athletic level. Therefore, the distribution function is
formally defined as follows:
g = g(t, x, v, u) : [0, T ] × Ω × Dv × [0, 1] → R+ .
(14)
The cases studied in Section 3 correspond to the following approximation:
g = f (t, x, v) × δ(u − u0 ) ,
where δ denotes the Dirac’s function and u0 is the activity expressed by all particles
with the same intensity. In all above cases, macroscopic equations can be computed
simply by adding to Eqs (4) and (5) integration over the activity variable.
The objective of the modeling approach consists in deriving suitable evolution
equations for the distribution function defined in Eq. (14). The mathematical structure
defined in Eq. (13) can be generalized to include the role of this new variable by
40
N. Bellomo, C. Bianca, V. Coscia
introducing in the probability density A also the activity variable u. Accordingly, such
density can be formally written as follows:
A = A(v∗ → v, u∗ → u|v∗ , v∗ , u∗ , u∗ , ρ, ⃗ν ) ,
(15)
which, for practical purposes can be factorized as follows:
A = B(v∗ → v|v∗ , v∗ , u∗ , u∗ , ρ, ⃗ν ) × C(u∗ → u|u∗ , u∗ , ρ) .
(16)
More in general, at least two-scales should be considered: the higher concerning
the mechanics of individuals and the lower related to their ability to organize the
dynamics. Interactions at the lower scale modify the activity variable which modifies
the dynamics at the higher scale. The two systems are coupled by the local density
and the activity variable. A general rule is that heterogeneity decreases with increasing
density and disappears in overcrowding conditions, when active particles cannot any
more express their own ability.
A brief account can now be given concerning the modeling of the term A. Particles,
according to the conjecture proposed in [5] modify their velocity at interactions by
feeling two actions: a trend towards the target and an attraction to the velocity of the
active particles in the interaction domain of the candidate particle. These two trends
are modeled by the discrete probability densities in a way analogous to that presented
in Subsection 3.3. In details
ε1 α(1 − ρ),
1 − ε1 α(1 − ρ) ,
(17)
1 − ε2 α ρ ,
(18)
and
ε2 α ρ,
where, in both cases, the former is the fraction of particles that modifies the velocity,
the latter indicates the fraction that keep its velocity. The modification of the velocity
depends also on the velocities of the interacting pair and on the direction of their
velocity with respect to ⃗ν . Moreover, α ∈ [0, 1] is a parameter related to the quality
of the environment, while the parameters ε1 and ε2 model the respective weights of
the two different actions. Analogous reasonings can be developed referring to the
framework with discrete velocities Eq. (13).
Panic conditions induce some modifications in the modeling of interactions. This
topic has been carefully treated for models at the microscopic scale, see [32] and
therein cited references. Some perspective ideas can be given focusing on the dynamics
at the microscopic scale:
i) The value of the activity variable increases, namely active particles are more active
in panic conditions;
ii) In normal conditions the trend towards the target is more significant than the
sensitivity to the flow, namely ε1 is larger than ε2 . The opposite occurs during panic;
iii) In some cases active particles follow the stream even if not useful to escape danger.
The characteristics that have been listed above can be technically taken into account
in the modeling of interactions involving particles, thus generating a model which uses
the same mathematical structure, but with a different choice of the parameters.
On the Modeling of Crowd Dynamics
41
Let us finally consider the possible development of the mathematical approach
we have presented in this paper to modeling the dynamics of swarms. Definitely,
it is an attractive, and challenging, research perspective that keeps engaging a
growing number of applied mathematicians. The mathematical literature on swarm
modeling shows a rather heterogeneous use of different approaches. Among others,
stochastic differential equations [1], macroscopic equations derived from stochastic
perturbation of individual dynamics [25], modeling swarming patterns [59]. Additional
bibliography can be found in the review paper [11] and in the special issue [9].
A specific characteristic is that the swarm has the ability to express a collective
intelligence related to the environmental conditions, which can evolve by learning
processes. This self-organizing ability amounts also to a typical phenomenon of
swarms concerns the flocking phenomena [20, 21, 47, 53].
The experimental activity on swarms differs from that developed in the case
of crowds. In fact, it is mainly focused on understanding the dynamics of the
interactions corresponding to different animal species [2, 3, 27]. Further, experiments
are addressed to understand emerging behaviors, such as flocking phenomena, break
up and aggregation of swarms, which should be depicted by models. Therefore, the
collection of empirical data is generally focused on qualitative, rather than quantitative,
aspects.
The above reasonings do not claim to be exhaustive, but simply show how
the interest in the field is rapidly growing. This stimulates the interest of applied
mathematicians towards this challenging research field. A few guidelines are given
to support modeling projects:
4.1. Generally, swarms refer to specific animal behaviors, which differ from species to
species and that can be modified by external actions that can induce panic.
4.2. Recent studies [2] conjecture, on the basis of empirical data, that some systems
of animal world develop a common strategy based on interactions depending on
topological rather than metric distances. This definitely is a valuable suggestion to
be used towards modeling.
4.3. The swarm has the ability to express a common strategy, which is a nonlinear
elaboration of all individual contributions, generated by each individual based on the
microscopic state of all the others. In general a swarm has the ability to express a
collective intelligence that is generated by a cooperative strategy [47].
4.4. The above mentioned strategy includes a clustering ability (flocking) that prevents
the fragmentation of the swarm. However, when a fragmentation of the domain
occupied by the swarm occurs, the clustering ability induces an aggregation. The
dynamics of interactions differs in the various zones of the swarm. The activity variable
does not significantly change among individuals. However, it may change due to the
position. For instance, from the border to the center of the swarm. In some cases
position depends on a hierarchy inside the group [20, 51].
Of course, it is necessary considering that interactions among active particles of a
swarm are in three space coordinates, while those of particles of a crowd are defined
over two-space coordinates and the mathematical problems are stated in unbounded
domains with initial conditions with compact support. The solution of problems should
provide the evolution in time of the domain of the initial conditions.
42
N. Bellomo, C. Bianca, V. Coscia
Finally, we can claim that the approach of the kinetic theory of active particles is
an appropriate candidate to model the dynamics of crowds and, possibly, swarms. The
most delicate issue is, in our opinion, the modeling of interactions at the microscopic
scale, which are nonlinear, namely nonadditive. Moreover, interactions involve a
limited number of particles almost independently on their localization as far as particles
remain in a certain interaction domain. Various hints have been given in this paper.
Some of them are already involving research teams. Hopefully the reader, interested in
the challenging research field reviewed in this paper, can take advantage of them.
References
[1] S. Albeverio and W. Alt, Stochastic dynamics of viscolelastic skeins:
Condensation waves and continuum limit, Math. Models Methods Appl. Sci.,
18:1149-1192, (2002).
[2] M. Ballerini, N. Cabibbo, R. Candelier, A. Cavagna, E. Cisbani, I. Giardina,
V. Lecomte, A. Orlandi, G. Parisi, A. Procaccini, M. Viale, and V. Zdravkovic,
Interaction ruling animal collective behavior depends on topological rather than
metric distance: evidence from a field study, Proc. Nat. Acad. of Sci., 105:12321237, (2008).
[3] R.N. Bearon and K.L. Grünbaum, From individual behavior to population
models: A case study using swimming algae, J. Theor. Biol., 251:33-42, (2008).
[4] N. Bellomo, Modelling Complex Living Systems - A Kinetic Theory and
Stochastic Game Approach, Birkäuser, Boston, (2008).
[5] N. Bellomo and A. Bellouquid, On the modelling of vehicular traffic and crowds
by the kinetic theory of active particles, in Mathematical Modelling of Collective
Behaviour in Socio-Economics and Life Sciences, G. Naldi, L. Pareschi, and G.
Toscani, Eds., Birkhäuser, Boston, (2010).
[6] N. Bellomo, and A. Bellouquid, On the modeling of crowd dynamics looking at
the beautiful shapes of swarms, (2011), to be published.
[7] N. Bellomo, H. Berestycki, F. Brezzi, and J-P. Nadal. Mathematics and
complexity in human and life sciences, Math. Models Methods Appl. Sci.,
19:1385-1389, (2009).
[8] N. Bellomo, C. Bianca, and M. Delitala, Complexity analysis and mathematical
tools towards the modelling of living systems, Phys. Life Rev., 6:144-175, (2009).
[9] N. Bellomo and F. Brezzi, Traffic, crowds, and swarms, Math. Models Methods
Appl. Sci., 18:1145-1148, (2008).
[10] N. Bellomo and C. Dogbè. On the modelling crowd dynamics from
scaling to hyperbolic macroscopic models, Math. Models Methods Appl. Sci.,
18(Supplement):1317-1345, (2008).
On the Modeling of Crowd Dynamics
43
[11] N. Bellomo and C. Dogbè, On the modeling of traffic and crowds - A survey of
models, speculations, and perspectives, SIAM Review, to appear.
[12] N. Bellomo and V. Coscia, First order models and closure of the mass
conservation equation in the mathematical theory of vehicular traffic flow, C.R.
Mécanique, 333(11): 843-851, (2005).
[13] C. Bianca, On the modelling of space dynamics in the kinetic theory for active
particles, Math. Comput. Modelling, 51:72-83, (2010).
[14] C. Bianca and V. Coscia, On the coupling of steady and adaptive velocity grids in
vehicular traffic modelling, Appl. Math. Letters, 24(2):149-155, (2011).
[15] V.J. Blue and J.L. Adler, Cellular automata microsimulation of bidirectional
pedestrian flows, Transp. Research Board, 1678:135-141, (2000).
[16] I. Bonzani and L. Mussone, On the derivation of the velocity and fundamental
traffic flow diagram from the modelling of the vehicle-driver behaviors, Math.
Comput. Modelling, 50:1107-1112, (2009).
[17] S. Buchmuller and U. Weidman, Parameters of pedestrians, pedestrian traffic
and walking facilities, ETH Report Nr.132, October, (2006).
[18] V. Coscia and C. Canavesio, First order macroscopic modelling of human crowds,
Math. Models Methods Appl. Sci., 18:1217-1247, (2008).
[19] V. Coscia, M. Delitala, and P. Frasca, On the mathematical theory of vehicular
traffic flow models II. Discrete velocity kinetic models, Int. J. Non-linear Mech.,
42:411-421, (2007).
[20] F. Cucker and Jiu-Gang Dong, On the critical exponent for flocks under
hierarchical leadership, Math. Models Methods Appl. Sci., 19:1391-1404 ,(2009).
[21] F. Cucker and S. Smale, Emergent behavior in flocks, IEEE Trans. Automatic
Control, 52:853-862 (2007).
[22] C.F. Daganzo, Requiem for second-order fluid approximations of traffic flow,
Transp. Research Board, 29:277-286, (1995).
[23] E. De Angelis, Nonlinear Hydrodynamic Models of Traffic Flow Modelling and
Mathematical Problems, Math. Comput. Modelling, 29:83-95, (1999).
[24] P. Degond and M. Delitala, Modelling and simulation of vehicular traffic jam
formation, Kinetic and Related Models, 1:279-293, (2008).
[25] P. Degond and S. Motsch, Continuum limit of self-driven particles with
orientation interaction, Math. Models Methods Appl. Sci., 18:1193–1217, (2008).
[26] M. Delitala and A. Tosin, Mathematical modelling of vehicular traffic: A discrete
kinetic theory approach, Math. Models Methods Appl. Sci., 17:901-932, (2007).
44
N. Bellomo, C. Bianca, V. Coscia
[27] C. Detrain and J-L. Doneubourg, Self-organized structures in a superorganism:
do ants “behave” like molecules?, Physics Life Rev., 3:162-187, (2006).
[28] B. Eckhardt and E. Ott, Crowd synchrony on the London Millennium Bridge,
16:041104, (2006).
[29] M. Fukui and Y. Ishibashi, Self-organized phase transitions in CA-models for
pedestrians, J. Phys. Soc. Japan, 8:2861-2863, (1999).
[30] D. Helbing, A mathematical model for the behavior of pedestrians, Behavioral
Sciences, 36:298-310, (1991).
[31] D. Helbing, I. Farkas, and T. Vicsek, Simulating dynamical feature of escape
panic, Nature, 407:487-490, (2000).
[32] D. Helbing, A.F. Johansson, and H. Z. Al-Abideen, Dynamics of crowd disasters:
An empirical study, Phys. Review E, 75:046109, (2007).
[33] D. Helbing and P. Molnár, Social force model for pedestrian dynamics, Phys.
Review E, 514282-4286, (1995).
[34] D. Helbing, P. Molnár, I. Farkas, and K. Bolay, Self-organizing pedestrian
movement, Environment and Planning B, 28:361-383, (2001).
[35] D. Helbing and T. Vicsek, Optimal self-Organization, New J. of Physics, 1:13.113.17, (1999).
[36] L.F. Henderson, The statistics of crowd fluids, Nature, 229:381-383, (1971).
[37] L.F. Henderson, On the fluid mechanic of human crowd motion, Transp.
Research, 8:509-515, (1975).
[38] L.F. Henderson, and D.J. Lyons, Sexual differences in Human Crowds Motion,
Nature, 240:353-355 (1972).
[39] L.F. Henderson, and D.M. Jenkins, Response of Pedestrians to Traffic Challenge,
Transp. Res., 8:71-72, (1973).
[40] L.S. Hoogendoorn and P.H.L. Bovy, Dynamic user-optimal assignment in
continuous time and space, Transp. Res. B, 38:571-592, (2004).
[41] L.S. Hoogendoorn and P.H.L. Bovy, and W. Daamen, Walking infrastructure
design assignment by continuous space dynamic assignment modeling, J.
Advanced Transp., 38:69-92, (2004).
[42] R. L. Hughes, A continuum theory for the flow of pedestrians, Transp. Res. B,
36:507-536, (2002).
[43] R.L. Hughes, The flow of human crowds, Annual Rev. Fluid Mech., 35:169-183,
(2003).
On the Modeling of Crowd Dynamics
45
[44] K. Lewin, Field Theory in Social Science, Selected theoretical papers. D.
Cartwright (Ed.). New York: Harper & Row, (1951).
[45] B.S. Kerner, The Physics of Traffic, Springer, New York, Berlin, (2004).
[46] J.H.G. Macdonald, Lateral excitation of bridges by balancing pedestrians, Proc.
Royal Society A: Math. Phys. Eng.ng, 465:1055-1073, (2004).
[47] A. Mogilner, L. Edelstein-Keshet, L. Bent, and A. Spiros, Mutual interactions,
potentials, and individual distance in a social aggregation, J. Math. Biol., 47:353389, (2003).
[48] M. Moussaid, D. Helbing, S. Garnier, A. Johanson, M. Combe, and G. Theraulaz,
Experimental study of the behavioral underlying mechanism underlying selforganization in human crowd, Proc. Royal Society B: Biol. Sci., 276:2755-2762,
(2009).
[49] M. Muramatsu and T. Nagatani, Jamming transition in pedestrian counter flow,
Physica A, 267:487-498487-498, (1999).
[50] M. Muramatsu and T. Nagatani, Jamming transition in two-dimensional
pedestrian traffic, Physica A, 275:281-291, (2000).
[51] M. Nagy, Z. Akos, D. Biro, and T. Vicsek, Hierarchical group dynamics in pigeon
flocks, Nature, 464:890-893, (2010).
[52] A. Nakayama, Y. Sugiyama, and K. Hasebe, Instability of pedestrian flow
and phase structure in two-dimensional optimal velocity model, Phys. Rev. E,
71:036121, (2005).
[53] A. Okubo, D. Grunbaum, and L. Edelstein-Keshet, The dynamics of animal
grouping, in Diffusion and Ecological Problems, 2nd ed., A. Okubo and S. Levin,
eds., Interdiscip. Appl. Math. 14., Springer, New York, 197-237, (1999).
[54] B. Piccoli and A. Tosin, Pedestrian flows in bounded domains with obstacles,
Cont. Mech. Thermodyn., 21:85-117, (2009).
[55] B. Piccoli and A. Tosin, Time evolving measures and macroscopic modeling of
pedestrian flows, Arch. Rational Mech. Anal., 199:707-738, (2011).
[56] I. Prigogine and R. Herman, Kinetic Theory of Vehicular Traffic, Elsevier, New
York, (1971).
[57] A. Seyfried, B. Steinen, W. Klingsch, and M. Boltes, The fundamental diagram
of pedestrian movement revisited, J. Statist. Mech., 10:P10002 (2005).
[58] S.H. Strogatz, D.M. Abrams, A. McRobie, B. Eckhardt and E. Ott, Crowd
synchrony on the London Millennium Bridge, Nature, 348:43-44, (2005).
[59] C. M. Topaz and A. Bertozzi, Swarming patterns in a two dimensional kinematic
model for biological groups, SIAM J. Appl. Math., 65:152-174, (2004).
46
N. Bellomo, C. Bianca, V. Coscia
[60] F. Venuti, L. Bruno, and N. Bellomo, Crowd dynamics on a moving platform:
Mathematical modelling and application to lively footbridges, Math. Comput.
Modelling, 45: 252-269, (2007).
[61] F. Venuti and L. Bruno, Crowd structure interaction in lively footbridges under
synchronous lateral excitation: A literature review, Phys. Life Rev., 6:176-206,
(2009).
S⃗eMA Journal
no 54(2011), 47–64
A BRIEF SURVEY OF THE DISCONTINUOUS GALERKIN METHOD FOR
THE BOLTZMANN-POISSON EQUATIONS
YINGDA CHENG∗ , IRENE M. GAMBA∗ , ARMANDO MAJORANA† AND CHI-WANG SHU‡
∗
Department of Mathematics and ICES, University of Texas at Austin,
†
Dipartimento di Matematica e Informatica, Università di Catania,
‡
Division of Applied Mathematics, Brown University
[email protected], [email protected],
[email protected],[email protected]
Abstract
We are interested in the deterministic computation of the transients for
the Boltzmann-Poisson system describing electron transport in semiconductor
devices. The main difficulty of such computation arises from the very high
dimensions of the model, making it necessary to use relatively coarse meshes
and hence requiring the numerical solver to be stable and to have good resolution
under coarse meshes. In this paper we give a brief survey of the discontinuous
Galerkin (DG) method, which is a finite element method using discontinuous
piecewise polynomials as basis functions and numerical fluxes based on upwinding
for stability, for solving the Boltzmann-Poisson system. In many situations, the
deterministic DG solver can produce accurate solutions with equal or less CPU
time than the traditional DSMC (Direct Simulation Monte Carlo) solvers. In order
to make the presentation more concise and to highlight the main ideas of the
algorithm, we use a simplified model to describe the details of the DG method.
Sample simulation results on the full Boltzmann-Poisson system are also given.
A
Introduction
The Boltzmann-Poisson (BP) system, which is a semiclassical description of electron
flow in semiconductors, is an equation in six dimensions (plus time if the device
is not in steady state) for a truly three dimensional device, and four dimensions
for a one-dimensional device. This heavy computational cost explains why the BP
Received: September 15, 2010. Accepted: November 30, 2010.
Support from the Institute of Computational Engineering and Sciences and the University of Texas
Austin is gratefully acknowledged. Research supported for the first author by NSF grant DMS-1016001.
Research supported for the second author by NSF grant DMS-0807712 and DMS-0757450. Research
supported for the third author by PRA 2009 Unict. Research supported for the fourth author by NSF grant
DMS-0809086 and DOE grant DE-FG02-08ER25863.
47
48
Y. Cheng, I.M. Gamba, A. Majorana, C.-W. Shu
system is traditionally simulated by the Direct Simulation Monte Carlo (DSMC)
methods [25]. DSMC methods have the advantage that the increase in computational
cost is not significant with the increase of dimensions. However, the simulation
results are often noisy, and it is difficult to compute transient details (time dependent
states), especially if the probability density function (pdf ) is desired. In recent years,
deterministic solvers to the BP system were considered in the literature, see for
example [23, 28, 3, 2, 4, 5, 6, 24]. These methods provide accurate results which,
in general, agree well with those obtained from DSMC simulations, sometimes at a
comparable or even less computational time. Deterministic solvers have the distinct
advantage in resolving transient details for the pdf. However, the main difficulty of
the deterministic solvers arises from the very high dimensions of the model, making it
necessary to use relatively coarse meshes and hence requiring the numerical solver to
be stable and to have good resolution under coarse meshes. This can be challenging
because under coarse meshes, for a convection dominated problem, the solution may
contain high gradient (relative to the mesh) regions, which may lead to instability if
care is not taken in the design of the algorithm.
One class of very successful numerical solvers for the deterministic solvers of
the BP system is the weighted essentially non-oscillatory (WENO) finite difference
scheme [4, 6]. The advantage of the WENO scheme is that it is relatively simple to
code and very stable even on coarse meshes for solutions containing sharp gradient
regions. However, the WENO finite difference method requires smooth meshes to
achieve high order accuracy, hence it is not very flexible for adaptive meshes.
On the other hand, the Runge-Kutta discontinuous Galerkin (RKDG) method,
which is a class of finite element methods originally devised to solve hyperbolic
conservation laws [17, 16, 15, 14, 18], is a suitable alternative for solving the BP
system. Using a completely discontinuous polynomial space for both the test and
trial functions in the spatial variables and coupled with explicit and nonlinearly stable
high order Runge-Kutta time discretization, the method has the advantage of flexibility
for arbitrarily unstructured meshes, with a compact stencil, and with the ability to
easily accommodate arbitrary hp-adaptivity. For more details about DG scheme for
convection dominated problems, we refer to the review paper [20]. The DG method
was later generalized to the local DG (LDG) method to solve the convection diffusion
equation [19] and elliptic equations [1]. It is L2 stable and locally conservative, which
makes it particularly suitable to treat the Poisson equation.
In recent years, we have initialized a line of research to develop and implement the
RKDG method, coupled with the LDG solution for the Poisson equation, for solving
the full BP system, see [7, 8, 9, 10]. It is demonstrated through extensive numerical
studies that the DG solver produces good resolution on relatively coarse meshes for the
transient and steady state pdf, as well as various orders of moments and I-V curves,
which compare well with DSMC results. Our DG solver has the capability of handling
full energy bands [10] that no other deterministic solver has been able to implement so
far.
In this paper, we give a short survey of the DG solver for the full BP system.
The emphasis is on the algorithm details, explained through a simplified model, and on
sample simulation results to demonstrate the performance of the DG solver. The plan of
the paper is as follows: in Section B, we will introduce the BP system and a simplified
Discontinuous Galerkin method for the Boltzmann-Poisson equations
49
model. In Section C, the DG scheme for this model will be presented. Section D
includes some discussion and extensions of the algorithm. In Section E, we present
some numerical results to show the performance of the scheme. Conclusions and future
work are given in Section F. We collect some technical details of the implementation
of the scheme in the Appendix.
B
The Boltzmann-Poisson system, and a simplified model
The evolution of the electron distribution function f (t, x, k) in semiconductors,
depending on the time t, position x and electron wave vector k, is governed by the
Boltzmann transport equation (BTE) [26]
∂f
1
q
+ ∇k ε · ∇x f − E · ∇k f = Q(f ) ,
∂t
~
~
(1)
where ~ is the reduced Planck constant, and q denotes the positive elementary charge.
The function ε(k) is the energy of the considered crystal conduction band measured
from the band minimum; according to the Kane dispersion relation, ε is the positive
root of
~2 k 2
ε(1 + αε) =
,
(2)
2m∗
where α is the non-parabolicity factor and m∗ the effective electron mass. The electric
field E is related to the doping density ND and the electron density n, which equals
the zero-order moment of the electron distribution function f , by the Poisson equation
∇x [εr (x) ∇x V ] =
q
[n(t, x) − ND (x)] ,
ε0
E = −∇x V ,
(3)
where ε0 is the dielectric constant of the vacuum, εr (x) labels the relative dielectric
function depending on the semiconductor and V is the electrostatic potential. For low
electron densities, the collision operator Q(f ) is
∫
Q(f )(t, x, k) =
[S(k′ , k)f (t, x, k′ ) − S(k, k′ )f (t, x, k)] dk′ ,
(4)
R3
where S(k′ , k) is the kernel depending on the scattering mechanisms between
electrons and phonons in the semiconductor.
In order to more clearly describe the details of the DG method, as well as to
highlight the essential algorithm ingredients, we introduce a simplified model transport
equation, which has the same characteristics as the full BP system. Thus, we consider
the system
∫
∂u
∂
∂
+
[a(v) u] +
[b(v) η(t, x) u] =
K(v, v ′ ) u(t, x, v ′ ) dv ′ − ν(v) u, (5)
∂t
∂x
∂v
R
[
] ∫
∂ϕ
∂ϕ
∂
σ(x)
=
u(t, x, v ′ ) dv ′ − ND (x), η(t, x) = − . (6)
∂x
∂x
∂x
R
Now, the unknown distribution function is u, which depends on time t, space
coordinate x ∈ [0, 1] and the variable v ∈ R (velocity or energy). The functions a,
50
Y. Cheng, I.M. Gamba, A. Majorana, C.-W. Shu
b, σ, ND and the kernel K are given, and K ≥ 0. The collision frequency ν is defined
by the equation
∫
K(v ′ , v) dv ′
ν(v) =
(7)
R
which guarantees mass conservation.
C
The DG solver for the simplified model
We now discuss the DG solver for the model equations (5)-(6). The first step is to
reduce the domain of the variable v to a finite size. This can be justified because we
expect a vanishing behavior of the unknown u for large values of v. For rigorous
justification, one could refer to the discussion in [12]. If I is the finite interval
used for the computation in the variable v, then we will replace K(v, v ′ ) with
K(v, v ′ ) χI (v) χI (v ′ ), where χI is the characteristic function on I. The new collision
frequency is redefined according Eq. (7). Therefore, this adjustment of the domain will
not affect the mass conservation of the system.
For simplicity of discussion, we will use a simple rectangular grid to introduce the
DG scheme, although the algorithm could be easily adjusted to accommodate general
unstructured grids. For the domain [0, 1] × I, we let
[
] [
]
Ωik = xi− 12 , xi+ 12 × vk− 12 , vk+ 21
where,
xi± 12 = xi ±
∆xi
2
vk± 12 = vk ±
∆vk
2
(i = 1, 2, 3, ...Nx ) and (k = 1, 2, 3, ...Nv ) .
We denote by Nx and Nv the number of intervals in the x and v direction, respectively.
The approximation space is defined as
Vhℓ = {vh : (vh )|Ωik ∈ P ℓ (Ωik )},
(8)
where P ℓ (Ωik ) is the set of all polynomials of degree at most ℓ on Ωik . Notice that the
polynomial degree ℓ can actually change from cell to cell (p-adaptivity), although in
this paper it is kept as a constant for simplicity. The DG formulation for the simplified
Boltzmann equation (5) would be: to find uh ∈ Vhℓ , such that
∫
∫
∫
(uh )t vh dx dv −
a(v)uh (vh )x dx dv −
b(v)η(t, x)uh (vh )v dx dv
Ωik
Ωik
Ωik
]
∫ [∫
+Fx+ − Fx− + Fv+ − Fv− =
K(v, v ′ ) uh dv ′ − ν(v) uh vh dx dv (9)
Ωik
I
for any test function vh ∈ Vhℓ . In (9),
∫ v 1
k+
2
Fx+ =
a(v) ǔh vh− (xi+ 12 , v)dv,
vk− 1
2
Discontinuous Galerkin method for the Boltzmann-Poisson equations
Fx− =
∫
∫
vk+ 1
2
vk− 1
a(v) ǔh vh+ (xi− 12 , v)dv,
2
xi+ 1
2
Fv+ =
Fv− =
∫
51
xi− 1
b(vk+ 12 )η(t, x) ũh vh− (x, vk+ 12 )dx,
2
xi+ 1
2
xi− 1
b(vk− 21 )η(t, x) ũh vh− (x, vk− 12 )dx,
2
where the upwind numerical fluxes ǔh , ũh are chosen according to the following rules,
• if a(v) ≥ 0 on the interval [vk− 12 , vk+ 21 ], ǔh = u−
h ; if a(v) < 0 on the interval
+
[vk− 21 , vk+ 12 ], ǔh = uh . Since a(v) is a given function that does not depend on
time, we will always be able to choose the grid such that a(v) holds constant
signs in each cell [vk− 12 , vk+ 12 ].
∫x 1
+
• If x i+12 b(vk+ 12 )η(t, x)dx > 0, ũh = u−
h ; otherwise, ũh = uh . Since the
i−
2
function η depends on time, we can not choose a grid such that b(vk+ 12 )η(t, x)
holds constant sign on each interval [xi− 12 , xi+ 12 ]. Here we relax the condition
to look at the cell averages for the coefficient for easy implementation.
As for the Poisson equation (6), in the simple one-dimensional setting, one can
use an exact Poisson solver or alternatively use a DG scheme designed for elliptic
equations. Below we will describe the local DG methods [19] for (6) with Dirichlet
boundary conditions. First, the Poisson equation is rewritten into the following form,


 q = ∂ϕ
∂x
(10)
∂


(σ(x)q) = R(t, x)
∂x
∫
′
where R(t, x) = I uh (t, x, v )dv ′ −ND (x) is a known function that can
[ be computed
]
at each time step once uh is solved from (9). The grid we use is Ii = xi− 12 , xi+ 12 ,
with i = 1, . . . , Nx , which is consistent with the mesh for the Boltzmann equation.
The approximation space is
Whℓ = {vh : (vh )|Ii ∈ P ℓ (Ii )},
with P ℓ (Ii ) denoting the set of all polynomials of degree at most ℓ on Ii . The LDG
scheme for (10) is given by: to find qh , ϕh ∈ Vhℓ , such that
∫
∫
qh vh dx +
ϕh (vh )x dx − ϕ̂h vh− (xi+ 12 ) + ϕ̂h vh+ (xi− 21 ) = 0,
Ii
Ii
∫
∫
\
\ +
1 ) − σ(x)q p (x
1) =
−
σ(x)qh (ph )x dx + σ(x)q h p−
(x
R(t, x)ph dx, (11)
i+ 2
i− 2
h h
h
Ii
Ii
hold true for any vh , ph ∈ Whℓ . In the above formulation, the flux is chosen as follows,
+
−
+
\
ϕ̂h = ϕ−
h , σ(x)q h = (σ(x)qh ) − [ϕh ], where [ϕh ] = ϕh − ϕh . At x = L we need
52
Y. Cheng, I.M. Gamba, A. Majorana, C.-W. Shu
−
\
to flip the flux to ϕ̂h = ϕ+
h , σ(x)q h = (σ(x)qh ) − [ϕh ] to adapt to the Dirichlet
boundary conditions. Solving (11), we can obtain the numerical approximation of
the electric potential ϕh and electric field ηh = −qh on each cell Ii . The so-called
minimum dissipation LDG method [13] can also be used here.
To summarize, the DG-LDG algorithm advances from tn to tn+1 in the following
steps:
∫
Step 1 Compute I uh (t, x, v ′ ) dv ′ and R(t, x).
Step 2 Solve the electric field ηh (t, x) from (11).
Step 3 Solve (9) and get a method of line ODE for uh .
Step 4 Evolve this ODE by proper time stepping from tn to tn+1 , if partial time step
is necessary, then repeat Step 1 to 3 as needed.
The algorithm described above carries the essential ideas of those in [9] for the BP
system. We include some of the technical details of the scheme in the Appendix for
the reference of the readers.
D
Some considerations
In this section, we will review some aspects of the DG-BP solver developed in the
literature and give an overview of some ongoing and future research directions.
D.1 Mass conservation and positivity of the numerical solution
It is well known that the transport Boltzmann equation associated with the BP system
conserves mass, and the initial value problem propagates positivity. In particular, it is
essential that the numerical schemes preserve these physical properties of the system.
In general, high order than one DG solvers as the one introduced in the previous
section will not enjoy positivity. However it was shown in [12] that the semi-discrete
DG scheme is positivity-preserving and stable for piecewise constants by arguments
following an adaptation of the Crandall-Tartar lemma [22] to low order DG schemes,
which states that any mass preserving, contracting linear first order operator is stable
and monotone preserving. In addition, in [12] the authors also proposed a fully discrete
positivity-preserving DG scheme for Vlasov-Boltzmann transport equation. They used
a maximum-principle-satisfying limiter for conservation laws [32] and achieved a high
order accurate DG solver that ensures the positivity of the numerical solution. A
comparison study of the standard RKDG scheme against the positivity-preserving DG
scheme has also been performed for the linear Boltzmann equation in [12].
Similarly for the DG-BP solver, clearly the following two properties will also hold,
where the second one is a simple proof of positivity for the fully discrete scheme on
piecewise constant basis functions.
Property 1 (Semi-discrete mass conservation) Under zero or periodic boundary
conditions in the x space, we have that
∫ ∫
d 1
uh dvdx = 0.
dt 0 I
Discontinuous Galerkin method for the Boltzmann-Poisson equations
53
Proof. Plug in (9) with vh = 1, and sum over all i, k, the conservation will follow.
Property 2 (Positivity for the first order scheme) Define A = max a(v), B =
max b(v)η(t, x) and V̄ = max ν(v). For first order DG scheme with piecewise
constant approximation and forward Euler time discretization, if the CFL condition
λ1 A + λ2 B + △tV̄ ≤ 1,
and λ2 = min△t
, then the DG scheme is monotone
is satisfied, where λ1 =
k △vk
and will preserve the positivity of the numerical solution.
△t
mini △xi
Proof. Plug in (9) with vh = 1, we have
∫
∫
(uh )t dΩ + Fx+ − Fx− + Fv+ − Fv− =
Ωik
Ωik
∫
where
[∫
]
K(v, v ′ ) uh dv ′ − ν(v) uh dΩ, (12)
I
vk+ 1
2
Fx+ =
a(v) ǔh dv,
vk− 1
Fx−
∫
Fv+
Fv−
∫
2
vk+ 1
2
=
2
xi+ 1
2
=
∫
a(v) ǔh dv,
vk− 1
xi− 1
b(vk+ 21 )η(t, x) ũh dx,
2
xi+ 1
2
=
xi− 1
b(vk− 21 )η(t, x) ũh dx.
2
We assume uh = unik on Ωik at tn , then
un+1
ik
△t
−
=
{F + − Fx− + Fv+ − Fv− +
△xi △vk x
∫ v 1
k+
2
−△xi
ν(v) dv unik }.
∫
∫
unik
Ωik
K(v, v ′ ) uh dv ′ dΩ
I
vk− 1
2
As suggested by the definition for the numerical fluxes, ∫if a(v) ≥ 0 on the
vk+ 1
+
n
2
interval [vk− 12 , vk+ 12 ], ǔh = u−
h , which implies Fx = ( vk− 1 a(v)dv) uik and
2
∫v 1
∫v 1
Fx− = ( v k+12 a(v)dv) uni−1,k . Otherwise, Fx+ = ( v k+12 a(v)dv) uni+1,k and Fx− =
k−
k−
2
2
∫v 1
( v k+12 a(v)dv) unik .
k−
2
∫x 1
= u−
Fv+
=
Similarly, if x i+12 b(vk+ 12 )η(t, x)dx > 0, ũh
h,
i−
2
∫ xi+ 1
∫
x
1
( x 12 b(vk+ 12 )η(t, x) dx) unik and Fv− = ( x i+12 b(vk+ 12 )η(t, x) dx) uni,k−1 ; otheri−
i−
2
2
∫x 1
∫x 1
wise, Fv+ = ( x i+12 b(vk+ 12 )η(t, x) dx) uni,k+1 and Fv− = ( x i+12 b(vk+ 12 )η(t, x) dx) unik .
i−
2
i−
2
By taking derivative of all its variables, it is not difficult to verify that this is a
monotone scheme if
λ1 A + λ2 B + △tV̄ ≤ 1.
Hence it will preserve the positivity of the numerical solution.
54
Y. Cheng, I.M. Gamba, A. Majorana, C.-W. Shu
D.2 Incorporation of full energy band models
The BP system introduced in Section B uses analytical band structures, which means
the energy band function ε(k) has been given explicitly. The analytical band makes
use of the explicit dependence of the carrier energy on the quasimomentum, which
significantly simplifies all expressions as well as implementation of these techniques.
However, the physical details of the band structure are partly or totally ignored, which
is unphysical when hot carriers in high-field phenomena are considered.
Full band models [21], on the other hand, can guarantee accurate physical pictures
of the energy-band function. They are widely used in DSMC simulators, but only
recently the Boltzmann transport equation was considered [31, 29], where approximate
solutions were found by means of spherical harmonics expansion of the distribution
function f . Since only a few terms of the expansion are usually employed, high
order accuracy is not always achieved [27]. Recently in [10], the authors developed
a DG code, which is the first deterministic code that can compute the full band model
directly. The energy band is treated as a numerical input that can be obtained either by
experimental data or the empirical pseudopotential method. The Dirac delta functions
in the scattering kernels can be computed directly, based on the weak formulations of
the PDE. The results in [10] for 1D devices have demonstrated the importance of using
full band model when accurate description of hydromoments under large applied bias
is desired. In a forthcoming manuscript [11], we will present a full implementation
of the scheme with numerical bands as well as a thorough study of stability and error
analysis.
In future work, we will extend the solver to include multi-carrier transport in
devices such as P-N junctions. When modeling the P-N junctions, the carrier flows
of both electrons and holes should be considered. The pdf of electrons and holes will
satisfy the following BTEs coupled with Poisson equation for the field,
∂fi
1
q
+ ∇k εi · ∇x fi ∓ E · ∇k fi = Qi (f ) + Ri (fe , fh ), i = e, h1, h2, h3,
∂t
~
~
q
[ne (t, x) − nh (t, x) − ND (x) + NA (x)] , E = −∇x V .
[εr (x) ∇x V ] =
ε0
In the above equation, the subscript e denotes electrons and h1, h2, h3 denote holes.
One electron conduction band and three hole valence bands (heavy, light and split-off
bands) need to be considered in order to have an accurate physical description. The
Qi terms are the collision terms which have been introduced in Section 2 for electrons.
For holes, those terms should include inter-band scattering as well. Ri (fe , fh ) is the
recombination term. In this model, we need to solve for each carrier a Boltzmann
transport equation and they are all coupled together through the Poisson equation for
the electric field. It will be of particular interest to explore adaptive DG methods for
solving this type of systems in order to reduce computational cost.
E
Numerical results for semiconductor devices
In this section, we will demonstrate the performance of DG schemes through the
calculation for a 2D double gate MOSFET device. The schematic plot of the double
55
Discontinuous Galerkin method for the Boltzmann-Poisson equations
gate MOSFET device is given in Figure 1. The top and bottom shadowed region
denotes the oxide-silicon region, whereas the rest is the silicon region.
y
50nm
50nm
top gate
2nm
1111111111111111
0000000000000000
0000000000000000
1111111111111111
0000000000000000
1111111111111111
0000000000000000
1111111111111111
0000000000000000
1111111111111111
0000000000000000
1111111111111111
x
0000000000000000
1111111111111111
0000000000000000
1111111111111111
0000000000000000
1111111111111111
0000000000000000
1111111111111111
0000000000000000
1111111111111111
00000000000000000000000000000000000000000000
11111111111111111111111111111111111111111111
0000000000000000
1111111111111111
00000000000000000000000000000000000000000000
11111111111111111111111111111111111111111111
drain
source
24nm
11111111111111111111111111111111111111111111
00000000000000000000000000000000000000000000
00000000000000000000000000000000000000000000
11111111111111111111111111111111111111111111
bottom gate
150nm
Figure 1: Schematic representation of a 2D double gate MOSFET device
Since the problem is symmetric about the x-axis, we will only need to compute
for y > 0. At the source and drain contacts, we implement the same boundary
condition as proposed in [6] to realize neutral charges. At the top and bottom of the
computational domain (the silicon region), we impose the classical elastic specular
boundary reflection. The electric potential Ψ = 0 at source, Ψ = 1 at drain and Ψ =
0.5 at gate. For the rest of boundaries, we impose homogeneous Neumann boundary
condition. The relative dielectric constant in the oxide-silicon region is εr = 3.9,
in the silicon region is εr = 11.7. The doping profile has been specified as follows:
ND (x, y) = 5×1017 cm−3 if x < 50nm or x > 100nm, ND (x, y) = 2×1015 cm−3 in
the channel 50nm ≤ x ≤ 100nm. All numerical results are obtained with a piecewise
linear approximation space and second order TVD Runge-Kutta time stepping. We use
a very coarse mesh, 24 × 12 grid in space, 24 points in w, 8 points in µ and 6 points
in φ in our calculation. In Figures 2 and 3, we show the results of the macroscopic
quantities for the top part of the device when it is already at equilibrium.
56
Y. Cheng, I.M. Gamba, A. Majorana, C.-W. Shu
8e+17
7e+17
6e+17
5e+17
4e+17
3e+17
2e+17
1e+17
0
0.00 0.03
0.06 0.09
x
0.12 0.15
0.024
0.018
0.012
y
0.006
0.3
0.25
0.2
0.15
0.00 0.03
0.06 0.09
x
0.12 0.15
1.4e+07
1.3e+07
1.2e+07
1.1e+07
1e+07
9e+06
8e+06
0.00 0.03
0.06 0.09
x
0.12 0.15
0.024
0.018
0.012
y
0.006
0.024
0.018
0.012
y
0.006
1e+06
500000
0
-500000
-1e+06
0.00 0.03
0.06 0.09
x
0.12 0.15
0.024
0.018
0.012
y
0.006
Figure 2: Macroscopic quantities of double gate MOSFET device at t = 0.8ps. Top
left: density in cm−3 ; top right: energy in eV ; bottom left: x-component of velocity in
cm/s; bottom right: y-component of velocity in cm/s. Solution reached steady state.
Discontinuous Galerkin method for the Boltzmann-Poisson equations
0
-20
-40
-60
-80
-100
-120
-140
0.00 0.03
0.06 0.09
x
0.12 0.15
50
0
-50
-100
-150
-200
-250
-300
-350
0.00 0.03
0.06 0.09
x
0.12 0.15
1
0.8
0.6
0.4
0.2
0
0.00 0.03
0.06 0.09
x
0.12 0.15
57
0.024
0.018
0.012
y
0.006
0.024
0.018
0.012
y
0.006
0.024
0.018
0.012
y
0.006
Figure 3: Macroscopic quantities of double gate MOSFET device at t = 0.8ps. Top
left: x-component of electric field in kV /cm; top right: y-component of electric field
in kV /cm; bottom: electric potential in V . Solution has reached steady state.
58
Y. Cheng, I.M. Gamba, A. Majorana, C.-W. Shu
F Concluding remarks and future work
In this paper, we present a brief survey of the current state-of-the-art of DG solvers for
BP systems in semiconductor device simulations. We demonstrate the main ideas of the
algorithm through a simplified model. We include some discussions of the properties
and extensions of the schemes and show numerical results for the full BP system.
Deterministic solvers have recently gained growing attention in the field of
semiconductor device modeling because of the guaranteed accuracy and noise-free
simulation results they provide. However, the relative cost of this type of methods is
still large especially when the dimension of the device is high. From the stand point of
algorithm design and development, it will be interesting to explore ways to utilize fully
the freedom of the DG framework such as hp-adaptivity. The DG schemes also provide
an excellent platform for potential areas such as the hybridization of different level of
models. For practical purposes, we will develop kinetic models and solvers to simulate
nano-scale devices such as bipolar transistors, heterostructure bipolar transistors and
solar cells in the future. Parallel implementation will also be an important component
of our future research.
References
[1] D. Arnold, F. Brezzi, B. Cockburn and L. Marini, Unified analysis of
discontinuous Galerkin methods for elliptic problems, SIAM Journal on
Numerical Analysis, 39 (2002), pp. 1749-1779.
[2] M.J. Caceres, J.A. Carrillo, I.M. Gamba, A. Majorana and C.-W. Shu,
Deterministic kinetic solvers for charged particle transport in semiconductor
devices, in Transport Phenomena and Kinetic Theory Applications to Gases,
Semiconductors, Photons, and Biological Systems. C. Cercignani and E. Gabetta
(Eds.), Birkhäuser (2006), pp. 151-171.
[3] J.A. Carrillo, I.M. Gamba, A. Majorana and C.-W. Shu, A WENO-solver for 1D
non-stationary Boltzmann-Poisson system for semiconductor devices, Journal of
Computational Electronics, 1 (2002), pp. 365-375.
[4] J.A. Carrillo, I.M. Gamba, A. Majorana and C.-W. Shu, A direct solver for
2D non-stationary Boltzmann-Poisson systems for semiconductor devices: a
MESFET simulation by WENO-Boltzmann schemes, Journal of Computational
Electronics, 2 (2003), pp. 375-380.
[5] J.A. Carrillo, I.M. Gamba, A. Majorana and C.-W. Shu, A WENO-solver for the
transients of Boltzmann-Poisson system for semiconductor devices. Performance
and comparisons with Monte Carlo methods, Journal of Computational Physics,
184 (2003), pp. 498-525.
[6] J.A. Carrillo, I.M. Gamba, A. Majorana and C.-W. Shu, 2D semiconductor device
simulations by WENO-Boltzmann schemes: efficiency, boundary conditions and
comparison to Monte Carlo methods, Journal of Computational Physics, 214
(2006), pp. 55-80.
Discontinuous Galerkin method for the Boltzmann-Poisson equations
59
[7] Y. Cheng, I.M. Gamba, A. Majorana and C.-W. Shu, Discontinuous Galerkin
solver for the semiconductor Boltzmann equation, SISPAD 07, T. Grasser and S.
Selberherr, editors, Springer (2007), pp. 257-260.
[8] Y. Cheng, I. Gamba, A. Majorana and C.-W. Shu, Discontinuous Galerkin
solver for Boltzmann-Poisson transients, Journal of Computational Electronics,
7 (2008), pp. 119-123.
[9] Y. Cheng, I. Gamba, A. Majorana and C.-W. Shu, A discontinuous Galerkin
solver for Boltzmann-Poisson systems for semiconductor devices, Computer
Methods in Applied Mechanics and Engineering, 198 (2009), pp. 3130-3150.
[10] Y. Cheng, I. Gamba, A. Majorana and C.-W. Shu, A discontinuous Galerkin
solver for full-band Boltzmann-Poisson models, the Proceeding of IWCE 13,
pp. 211-214, 2009.
[11] Y. Cheng, I.M. Gamba, A. Majorana and C.-W. Shu, High order positive
discontinuous Galerkin schemes for the Boltzmann-Poisson system with full
bands, in preparation.
[12] Y. Cheng, I.M. Gamba and J. Proft, Positivity-preserving discontinuous Galerkin
schemes for linear Vlasov-Boltzmann transport equations, Mathematics of
Computation, to appear.
[13] B. Cockburn and B. Dong, An analysis of the minimal dissipation local
discontinuous Galerkin method for convection-diffusion problems, Journal of
Scientific Computing, 32 (2007), pp. 233-262.
[14] B. Cockburn, S. Hou and C.-W. Shu, The Runge-Kutta local projection
discontinuous Galerkin finite element method for conservation laws IV: the
multidimensional case, Mathematics of Computation, 54 (1990), pp. 545-581.
[15] B. Cockburn, S.-Y. Lin and C.-W. Shu, TVB Runge-Kutta local projection
discontinuous Galerkin finite element method for conservation laws III: one
dimensional systems, Journal of Computational Physics, 84 (1989), pp. 90-113.
[16] B. Cockburn and C.-W. Shu, TVB Runge-Kutta local projection discontinuous
Galerkin finite element method for conservation laws II: general framework,
Mathematics of Computation, 52 (1989), pp. 411-435.
[17] B. Cockburn and C.-W. Shu, The Runge-Kutta local projection P1-discontinuous
Galerkin finite element method for scalar conservation laws, Mathematical
Modelling and Numerical Analysis, 25 (1991), pp. 337-361.
[18] B. Cockburn and C.-W. Shu, The Runge-Kutta discontinuous Galerkin method
for conservation laws V: multidimensional systems, Journal of Computational
Physics, 141 (1998), pp. 199-224.
[19] B. Cockburn and C.-W. Shu, The local discontinuous Galerkin method for timedependent convection-diffusion systems, SIAM Journal on Numerical Analysis,
35 (1998), pp. 2440-2463.
60
Y. Cheng, I.M. Gamba, A. Majorana, C.-W. Shu
[20] B. Cockburn and C.-W. Shu, Runge-Kutta discontinuous Galerkin methods for
convection-dominated problems, Journal of Scientific Computing, 16 (2001),
pp. 173-261.
[21] M. L. Cohen and J. Chelikowsky. Electronic Structure and Optical Properties of
Semiconductors. Springer-Verlag, 1989.
[22] M. G. Crandall and L. Tartar. Some relations between nonexpansive and order
preserving mappings, Proc. Amer. Math. Soc., 78 (1980), pp. 385-390.
[23] E. Fatemi and F. Odeh, Upwind finite difference solution of Boltzmann equation
applied to electron transport in semiconductor devices, Journal of Computational
Physics, 108 (1993), pp. 209-217.
[24] M. Galler and A. Majorana, Deterministic and stochastic simulation of electron
transport in semiconductors, Bulletin of the Institute of Mathematics, Academia
Sinica (New Series), 6th MAFPD (Kyoto) special issue Vol. 2 (2007), No. 2,
pp. 349-365.
[25] C. Jacoboni and P. Lugli, The Monte Carlo Method for Semiconductor Device
Simulation, Spring-Verlag: Wien-New York, 1989.
[26] M. Lundstrom, Fundamentals of Carrier Transport, Cambridge University Press:
Cambridge, 2000.
[27] A. Majorana, A comparison between bulk solutions to the Boltzmann equation
and the spherical harmonic model for silicon devices, in Progress in Industrial
Mathematics at ECMI 2000 - Mathematics in Industry, 1 (2002), pp. 169-173.
[28] A. Majorana and R. Pidatella, A finite difference scheme solving the Boltzmann
Poisson system for semiconductor devices, Journal of Computational Physics, 174
(2001), pp. 649-668.
[29] S. Smirnov and C. Jungemann, A full band deterministic model for semiclassical
carrier transport in semiconductors, Journal of Applied Physics, 99 (1988),
063707.
[30] K. Tomizawa, Numerical Simulation of Submicron Semiconductor Devices,
Artech House: Boston, 1993.
[31] M. C. Vecchi, D. Ventura, A. Gnudi and G. Baccarani. Incorporating full bandstructure effects in spherical harmonics expansion of the Boltzmann transport
equation, in Proceedings of NUPAD V Conference, 8 (1994), pp. 55-58.
[32] X. Zhang and C.-W. Shu, On maximum-principle-satisfying high order schemes
for scalar conservation laws, Journal of Computational Physics, 229 (2010),
pp. 3091-3120.
[33] J.M. Ziman, Electrons and Phonons. The Theory of Transport Phenomena in
Solids, Oxford University Press: Oxford, 2000.
61
Discontinuous Galerkin method for the Boltzmann-Poisson equations
G Appendix
In this appendix, we will include some of the implementation details of the proposed
DG algorithm for the full BP system.
For a silicon device, the collision operator (4) takes into account acoustic
deformation potential and optical intervalley scattering [30, 33]. For low electron
densities, it reads
∫
Q(f )(t, x, k) =
[S(k′ , k)f (t, x, k′ ) − S(k, k′ )f (t, x, k)] dk′
(13)
R3
with the scattering kernel
S(k, k′ )
(nq + 1) K δ(ε(k′ ) − ε(k) + ~ωp )
+ nq K δ(ε(k′ ) − ε(k) − ~ωp ) + K0 δ(ε(k′ ) − ε(k))
=
(14)
and K and K0 being constant for silicon. The symbol δ indicates the usual Dirac
distribution and ωp is the constant phonon frequency. Moreover,
[
(
)
]−1
~ωp
−1
nq = exp
kB TL
is the occupation number of phonons, kB the Boltzmann constant and TL the constant
lattice temperature. In Table 1, we list the physical constants for a typical silicon
device.
m∗ = 0.32 me
K=
K0 =
(Dt K)2
8π 2 ϱ ωp
kB TL
Ξ2
4π 2 ~ v02 ϱ d
εr = 11.7
TL = 300 K
~ωp = 0.063 eV
Dt K = 11.4 eV Å−1
ϱ = 2330 kg m−3
Ξd = 9 eV
v0 = 9040 m s−1
α = 0.5 eV
Table 1. Values of the physical parameters
For the numerical treatment of the Boltzmann-Poisson system (1), (3), it is convenient
to introduce suitable dimensionless quantities and variables. Typical values for length,
time and voltage are ℓ∗ = 10−6 m, t∗ = 10−12 s and V∗ = 1 Volt, respectively. Thus,
we define the dimensionless variables
(x, y, z) =
x
,
ℓ∗
t=
t
,
t∗
Ψ=
V
,
V∗
(Ex , Ey , Ez ) =
with E∗ = 0.1 V∗ ℓ−1
∗ and
Ex = −cv
∂Ψ
,
∂x
Ey = −cv
∂Ψ
,
∂y
cv =
V∗
.
ℓ∗ E∗
E
E∗
62
Y. Cheng, I.M. Gamba, A. Majorana, C.-W. Shu
In correspondence to [28] and [5], we perform a coordinate transformation for k
according to
√ ∗
( √
)
√
2m kB TL √
k=
w(1 + αK w) µ, 1 − µ2 cos φ, 1 − µ2 sin φ ,
~
(15)
ε
, the
kB TL
cosine of the polar angle µ and the azimuth angle φ with αK = kB TL α. The main
advantage of the generalized spherical coordinates (15) is the easy treatment of the
Dirac distribution in the kernel (14) of the collision term. In fact, this procedure enables
us to transform the integral operator (4) with the not regular kernel S into an integraldifference operator, as shown in the following.
where the new independent variables are the dimensionless energy w =
We are interested in studying two-dimensional problems in real space; this requires
the full three-dimensional k-space. Therefore, it is useful to consider the new unknown
function Φ related to the electron distribution function via
Φ(t, x, y, w, µ, φ) = s(w)f (t, x, k)|
√
t=t∗ t , x=ℓ∗ (x,y,z) , k=
2m∗ kB TL
~
√
w(1+αK w) ...
,
where
s(w) =
√
w(1 + αK w)(1 + 2αK w),
(16)
is proportional to the Jacobian of the change of variables (15) and, apart from a
dimensional constant factor, to the density of states. This allows us to write the free
streaming operator of the dimensionless Boltzmann equation in a conservative form,
which is appropriate for applying standard numerical schemes used for hyperbolic
partial differential equations. Due to the symmetry of the problem and of the collision
operator, we have
Φ(t, x, y, w, µ, 2π − φ) = Φ(t, x, y, w, µ, φ) .
(17)
Straightforward but cumbersome calculations end in the following transport equation
for Φ:
∂Φ
∂
∂
∂
∂
∂
+
(g1 Φ) +
(g2 Φ) +
(g3 Φ) +
(g4 Φ) +
(g5 Φ) = C(Φ) . (18)
∂t
∂x
∂y
∂w
∂µ
∂φ
The functions gi (i = 1, 2, .., 5) in the advection terms depend on the independent
variables w, µ, φ as well as on time and position via the electric field. They are given
63
Discontinuous Galerkin method for the Boltzmann-Poisson equations
by
g1 (·)
=
g2 (·)
=
g3 (·)
=
g4 (·)
=
g5 (·)
=
√
w(1 + αK w)
cx
,
1 + 2αK w
√
√
1 − µ2 w(1 + αK w) cos φ
,
cx
1 + 2αK w
√
]
√
w(1 + αK w) [
µ Ex (t, x, y) + 1 − µ2 cos φ Ey (t, x, y) ,
− 2ck
1 + 2αK w
√
]
[√
1 − µ2
− ck √
1 − µ2 Ex (t, x, y) − µ cos φ Ey (t, x, y) ,
w(1 + αK w)
sin φ
√
ck √
Ey (t, x, y)
w(1 + αK w) 1 − µ2
µ
√
t∗ 2 kB TL
t∗ qE∗
cx =
and ck = √ ∗
.
∗
ℓ∗
m
2m kB TL
The right hand side of (18) is the integral-difference operator
{ ∫ π
∫ 1
′
C(Φ)(t, x, y, w, µ, φ) = s(w) c0
dφ
dµ′ Φ(t, x, y, w, µ′ , φ′ )
with
∫
+
0
π
dφ′
∫
−1
0
1
−1
dµ′ [c+ Φ(t, x, y, w + γ, µ′ , φ′ ) + c− Φ(t, x, y, w − γ, µ′ , φ′ )]
}
− 2π[c0 s(w) + c+ s(w − γ) + c− s(w + γ)]Φ(t, x, y, w, µ, φ) ,
where
(c0 , c+ , c− ) =
2m∗ t∗ √
2 m∗ kB TL (K0 , (nq + 1)K, nq K) ,
~3
γ=
~ωp
kB TL
are dimensionless parameters. We remark that the δ distributions in the kernel S have
been eliminated which leads to the shifted arguments of Φ. The parameter γ represents
the jump constant corresponding to the quantum of energy ~ωp . We have also taken
into account (17) in the integration with respect to φ′ .
In terms of the new variables the electron density becomes
(√
)3
∫
2 m∗ kB TL
n(t∗ t, ℓ∗ x, ℓ∗ y) =
f (t∗ t, ℓ∗ x, ℓ∗ y, k) dk =
ρ(t, x, y) ,
~
R3
where
∫
ρ(t, x, y) =
+∞
∫
0
∫
1
dw
π
dµ
−1
dφ Φ(t, x, y, w, µ, φ) .
(19)
0
Further hydrodynamical variables are the dimensionless x-component of the velocity
∫ +∞ ∫ 1
∫ π
dw
dµ
dφ g1 (w) Φ(t, x, y, w, µ, φ)
0
−1
0
,
ρ(t, x, t)
64
Y. Cheng, I.M. Gamba, A. Majorana, C.-W. Shu
and the dimensionless energy
∫
+∞
∫
dw
0
∫
1
π
dµ
−1
dφ w Φ(t, x, y, w, µ, φ)
0
ρ(t, x, t)
.
Using the new dimensionless variables, Poisson equation becomes
(
)
(
)
∂
∂Ψ
∂Ψ
∂
εr
+
εr
= cp [ρ(t, x, t) − ND (x, y)]
∂x
∂x
∂y
∂y
(20)
with
(√
(√
)−3
)3
2 m∗ k B T L
2 m∗ kB TL ℓ2∗ q
ND (x, y) =
ND (ℓ∗ x, ℓ∗ y) and cp =
.
~
~
ε0
To solve the dimensionless Boltzmann-Poisson system, we need the initial conditions
for Φ and boundary conditions both for Φ and Ψ. These depend on the geometry of the
device and on the problem. In the (w, µ, φ)-space, no boundary condition is necessary.
In fact,
• at w = 0, g3 = 0. At w = wmax , Φ is assumed machine zero;
• at µ = ±1, g4 = 0;
• at φ = 0, π, g5 = 0,
so at those boundaries, the numerical flux always vanishes, hence no ghost point is
necessary for the DG method.
S⃗eMA Journal
no 54(2011), 65–90
ON THE NUMERICAL COMPUTATION OF MOUNTAIN PASS SOLUTIONS
TO SOME PERTURBED SEMI-LINEAR ELLIPTIC PROBLEM
L. MONTORO
Dipartimento di Matematica
Università degli Studi della Calabria
[email protected]
Abstract
In this paper we want to visualize and construct the shape of least energy
solutions to a singularly perturbed problem (M̃ε ) with mixed Dirichlet and
Neumann boundary conditions. Such type of problem arises in several situations as
reaction-diffusion systems, nonlinear heat conduction and also as limit of reactiondiffusion systems coming from chemotaxis. We are mainly interested in using an
appropriate numerical method to show the location and the shape of such type
of solutions when the singular perturbation parameter goes to zero, analyzing the
geometrical effect of the curved boundary of the domain.
A
Introduction
In this short survey we are interested in showing the main results and in analyzing the
main steps of the construction of the shape of positive least-energy solutions (Mountain
Pass Type Solutions) for a mixed perturbed semilinear Dirichlet-Neumann problem as

2
p

in Ω;
−ε ∆u + u = u
∂u
(M̃ε )
u = 0 on ∂D Ω;
∂ν = 0 on ∂N Ω;


u>0
in Ω,
(
)
n+2
where Ω is a smooth bounded subset of Rn , p ∈ 1, n−2
if n ≥ 3 and p ∈ (1, +∞)
if n = 2, ε > 0 is a small parameter, and ∂N Ω, ∂D Ω are two subsets of the boundary
of Ω such that the union of their closures coincides with the whole ∂Ω.
Received: May 14, 2010. Accepted: December 01, 2010.
The author would like to thank warmly Prof. Jesus Azorero, Prof. Andrea Malchiodi and Prof.
Ireneo Peral for many fruitful conversations about the topic and express his gratitude to Departamento
de Matemáticas de la Universidad Autónoma de Madrid for the homely hospitality. The author has been
supported by the Project SB2009-0033, MEC, Spain.
65
66
L. Montoro
These stationary mixed boundary problems appear in different situations and
generally the Dirichlet condition is equivalent to impose some state on the physical
parameter represented by u while instead the Neumann condition gives a meaning at
the flux parameter crossing ∂N Ω, see the introduction of [11, 12, 23] for more specific
comments.
Here below, for completeness and for the reader convenience there are some
classical physical situations where such a type of problem (M̃ε ) appears:
(i) Population dynamics. Assume that a species lives in a bounded region Ω such
that the boundary has two parts, ∂N Ω and ∂D Ω, the first one is an obstacle that
blocks the pass across, while the second one is killing zone for the population.
(ii) Nonlinear heat conduction. In this case (M̃ε ) models the heat (for small
conductivity) in the presence of a nonlinear source in the interior of the domain,
with combined isothermal and isolated regions at the boundary.
(iii) Reaction diffusion with semi-permeable boundary. In this framework we have
that the meaning of the Neumann part, ∂N Ω, is an obstacle to the flux of the
matter, while the Dirichlet part, ∂D Ω, stands for a semipermeable region that
allows the outwards transit of the matter produced in the interior of the cell Ω by
a nonlinear reaction.
Singularly perturbed problems with Neumann or Dirichlet boundary conditions, that is

2
p

in Ω;
−ε ∆u + u = u
∂u
(Nε )
=
0
on ∂N Ω;
∂ν


u>0
in Ω,

2
p

in Ω;
−ε ∆u + u = u
(Dε )
u=0
on ∂D Ω;


u>0
in Ω,
are been studied by many authors, see for example [19], [20], [21] and [22]. Indeed
there is a wide literature regarding the study of this type of problems and we refer also
to the list of references [9], [10], [13], [14], [15], [16], [17], [18], [28] and [29].
The typical concentration behavior of solutions uε to the above(two problems
(Nε )
)
and (Dε ), is via a scaling of the variables in the form uε (x) ∼ U
x−Q
ε
, where Q is
some point of Ω, and where U is a solution of
−∆U + U = U p
in Rn
(or in Rn+ = {(x1 , . . . , xn ) ∈ Rn : xn > 0}),
(1)
the domain depending on whether Q lies in the interior of Ω or at the boundary; in
n+2
(and indeed only if
the latter case Neumann conditions are imposed. When p < n−2
Computation of mountain pass solutions to some semi-linear elliptic problem
67
this inequality is satisfied), problem (1) admits positive radial solutions which decay to
zero at infinity, see [2]:
lim er r
r→+∞
n−1
2
U (r) = αn,p .
Solutions of (M̃ε ) which inherit this profile are called spike layers, since they are highly
concentrated near some point of Ω.
W.M.Ni and I.Takagi in [20] proved that least energy solutions of (Nε ), that is
Mountain Pass solutions, have only one local maximum over Ω achieved at a point
that must lie on the boundary and in a subsequent paper [21] they showed that
the maximum, indeed unique, of the least energy solutions concentrates near global
maxima of the mean curvature H at the boundary.
The main results can be summarized as follows:
Theorem 1 ([20], [21]) For ε sufficiently small the problem (Nε ) possesses a leastenergy solution uε , i.e. a Mountain Pass solution with the following properties:
(i) uε has a unique local (thus global) maximum point Pε ∈ Ω. Furthermore Pε
must lie on the boundary ∂Ω;
(ii) uε → 0 everywhere in Ω outside a small neighborhood of Pε and
uε (Pε ) → U (0), with U is the unique solution of the problem (1) in Rn ;
(iii) H(Pε ) → max H(P ) as ε → 0, where H(P ) denotes the mean curvature of
P ∈∂Ω
∂Ω at P .
Concerning instead (Dε ), W.M.Ni and J.Wei in [22] proved that least energy
solutions have only one local maximum over Ω achieved at the most centered point of
Ω when the perturbation parameter goes to zero.
The main result states:
Theorem 2 ([22]) Let uε be a least-energy solution (i.e. Mountain Pass solution) to
the problem (Dε ). Then, for ε sufficiently small, we have:
(i) uε has at most one local maximum and it is achieved at exactly one point Pε in
1
Ω. Moreover, uε (·+Pε ) → 0 in Cloc
(Ω−Pε \{0}), where Ω−Pε = {x−Pε |x ∈
Ω};
(ii) d(Pε , ∂Ω) → max d(P, ∂Ω) as ε → 0.
P ∈Ω
68
L. Montoro
The mixed case has been analyzed in the two recent works [11, 12]. This case is
more involved because there exists an interface which separates the Dirichlet and the
Neumann parts, namely
IΩ := ∂D Ω ∩ ∂N Ω ̸= ∅.
Here below we give the main results proved in the two works that show the
behaviour of the solutions of (M̃ε ) when the perturbation parameter ε goes to zero.
In particular in the first one [11], the authors determine the geometric conditions to
have some sequence of solutions {uε } (spike-layers type) which approach IΩ when ε
tends to zero. Below we state the main theorem proved:
Theorem 3 ([11]) Suppose Ω ⊆ Rn , n ≥ 2, is a smooth bounded domain, and that
n+2
1 < p < n−2
(1 < p < +∞ if n = 2). Suppose ∂D Ω, ∂N Ω are disjoint open sets of
∂Ω such that the union of the closures is the whole boundary of Ω and such that their
intersection IΩ is an embedded hypersurface. Suppose Q ∈ IΩ is such that H|IΩ is
critical and non degenerate at Q, and that ∇H ̸= 0 points toward ∂D Ω. Then for
ε > 0 sufficiently small problem (M̃ε ) admits a solution uε concentrating at Q, with a
unique maximum point in ∂N Ω, at distance of order ε| log ε| from IΩ .
Remark 1 The non-degeneracy condition on Q can be relaxed requiring that either Q
is a strict local maximum or minimum for H|IΩ or that the local degree of ∇H|IΩ is
non zero for any connected (small) neighborhood of Q in IΩ .
In the second one [12], the authors analyze mainly the behavior of the least energy
solutions to the problem (M̃ε ) under generic assumptions on the domain and on the
interface.
It was shown that Mountain Pass solutions are in fact least energy solutions and
that, given a family of least energy solutions {uε }, their points of maximum must lie
on the boundary of the domain Ω, as in the Neumann case. Moreover, as for (Nε ), the
least energy solutions concentrate at boundary points in the closure of ∂N Ω where the
mean curvature is maximal. We state the main theorems proved:
Theorem 4 ([12]) Suppose Ω ⊆ Rn , n ≥ 2, is a smooth bounded domain, and that
n+2
1 < p < n−2
(1 < p < +∞ if n = 2). Suppose ∂D Ω, ∂N Ω are disjoint open sets
of ∂Ω such that the union of the closures is the whole boundary of Ω and such that
their intersection IΩ is an embedded hypersurface. Then, as ε → 0, the least energy
solution of (M̃ε ) have a unique maximum point which converges to Q ∈ ∂N Ω such
that H(Q) = max H.
∂N Ω
Theorem 5 ([12]) Suppose we are in the situation of Theorem 4. Assume also that
max H is attained only at Q ∈ IΩ , where Q is an isolated maximum point of H|IΩ
∂N Ω
for which ∇H(Q) ̸= 0 points toward ∂D Ω. Then the least energy solution of (M̃ε )
Computation of mountain pass solutions to some semi-linear elliptic problem
69
behaves as in Theorem 3. If instead max H is attained both at IΩ and at an isolated
∂N Ω
interior point Q̌ of ∂N Ω, and if Q is as in the previous case, the maxima of least energy
solutions to (M̃ε ) converge to Q̌.
In [23] via a useful diffeomorphism y = Ψ(x) (x = Φ(y) with Ψ ≡ Φ−1 ,
straightening the domain around some point belonging to the boundary), we prove
Theorem 6 below that characterizes the shape of the least energy solutions to the
problem (M̃ε ) as the perturbation parameter ε goes to zero. Here we denote Ωε the
scaled domain (after flatten and scaling around Pε = Ψ−1 (Qε ), with Pε belonging to
the interior of Neumann boundary part) and we suppose Qε to be the origin, that is
Qε = 0.
Theorem 6 ([23]) Let uε be a least-energy solution to the problem (M̃ε ) and Ω ⊆
Rn , n ≥ 2, a bounded smooth domain. Let K ⊂ Ωε be a compact containing the
origin and Br+ ⊂ Ωε the set {z ∈ Br , zn > 0}. Moreover let y = Ψ(x) be the
diffeomorphism that straightens a boundary portion around Pε , where Pε ∈ ∂N Ω is,
for ε small enough, the unique maximum point of uε . Let we denote wε (·) = uε (Φ(ε·))
the scaling of the least-energy solution uε after straightening the boundary. Then for
b ε ⊂ Ω,
any α > 0, there exists ε0 such that for ε < ε0 we have two connected sets Ω
e
Ωε ⊂ Ω and ρ positive constant independent of ε, such that
b ε ) < C1 ε,
1. diam(Ω
2. uε (x) − U ( Ψ(x)
ε )
bε)
C 2 (Ω
b ε,
Pε ∈ ∂ Ω
< C εα2 ,
∥wε (z) − U (z)∥C 2 (K) < α,
b ε,
3. |uε (x)| < Cαe−µ1 δ(x)/ε if x ∈ Ω\Ω
e
−µ1 δ(z)
|wε (z)| < Cαe
if z ∈ Ωε \ B r2 ,
(
)1/2
−µ1 ρ/ε
4. ∥uε ∥H 1 (Ω
.
e ε ) ≤ C̃e
D
e
b ε \ ∂Ω) and C1 , C, C̃, µ1 are
where δ(z)
= dist(z, ∂B r2 \ ∂Ωε ), δ(x) = dist(x, ∂ Ω
positive constants depending only on Ω.
For all the details of the proofs and other related results to the problem (M̃ε ) we
recommend to refer to [11, 12, 23] .
In this work we consider the least energy solutions to the problem (M̃ε ), referring
strongly to the numerical analysis’ point of view, giving an approach of their shape
and then presenting the related results. In this type of problems with pattern
formation phenomena, whose main interest is the profiles’ locations of concentration,
the visualization of solutions via computational and numerical methods turns to be very
useful. We refer to the papers [5], [6] and the bibliographies therein for a complete and
accurate description of all these methods.
70
L. Montoro
Since crucial for the application of the numerical algorithm used, we want to
point out that least energy solutions are actually Mountain Pass solutions, see [12,
Proposition 2.3] .
B
The mountain pass shape: a numerical approach.
For our analysis we consider the case n = 2 and p = 3, that is:

2
3

in Ω;
−ε ∆u + u = u
∂u
u = 0 on ∂D Ω;
∂ν = 0 on ∂N Ω;


u>0
in Ω,
(Mε )
where Ω is an elliptical domain of R2 . We want to point out that in the two dimensional
case the interface IΩ is represented by points belonging to R2 . Figure 1 shows how
Dirichlet and Neumann conditions are imposed.
Γ1N
Neu ma nn Condi t i on
Di r i chl et Condi t i on
Γ2D
Γ1D
Dir ichl et Condit ion
Neumann Condit ion
Γ2N
Figure 1: The Elliptical Domain Ω.
In order to construct and aproach the shape of a positive least energy solution of
problem (Mε ) we take advantage of Mountain Pass Algorithm due to Y.S. Choi and
P.J. McKenna, appeared in the pioneering work [7], taking into account the intrinsic
differences, see [23].
Here we consider a perturbative problem with mixed boundary conditions and
defined in an elliptical domain contained in R2 . We need to consider the problem in a
domain with non constant mean curvature H because our goal is finding a spike layer
solution concentrating at the interface: in fact the variation of the mean curvature H
is crucial to visualize a least energy solution to the problem (Mε ) concentrating at the
interface IΩ as ε → 0, see Theorem 4.
Computation of mountain pass solutions to some semi-linear elliptic problem
71
From numerical point of view a curved boundary domain is difficult to mesh in
order to define all the discrete variables that the algorithm requires. In fact, as known,
such a process is simplest if the geometry is rectangular. However when the domain
has a curved boundary the construction of the mesh is more involved requiring large
manpower and we refer to the books [8] and [27] for more details about.
Moreover to get a Mountain Pass solution concentrating at the interface, we impose
mixed conditions on four disjoint subsets of the boundary ∂Ω, as shown in Figure 1,
in such a way that Dirichlet conditions are imposed on the two boundary parts, Γ1D
and Γ2D , where the curvature H reaches its maximum value. This requires a further
commitment in the processing of the domain geometry and in the code development.
Imposing Dirichlet conditions on Γ1D and Γ2D we avoid that least energy solutions
concentrate where the mean curvature is maximal, see [12]. In this way we get
solutions to (Mε ) concentrating at the interface IΩ according to Theorem 4.
B.1 The Mountain Pass algorithm
First of all, to apply the algorithm, we need to show that the energy functional
associated to the problem (M̃ε ) has the Muontain Pass geometry and satisfies the
Palais-Smale condition. This readily follows by arguments nowadays well understood,
see [3, 4] for example.
In fact, solutions of (Mε ), correspond to critical points of the functional
∫
∫
1
1
Iε (u) =
ε2 |∇u|2 + u2 −
u4 ;
u∈X
2 Ω
4 Ω
(2)
where
1
(Ω) : u ̸≡ 0 and u ≥ 0 in Ω}
X = {u ∈ HD
1
and HD
(Ω) stands for the space of functions in H 1 (Ω) which have zero trace on ∂D Ω,
i.e.
1
HD
(Ω) = {u ∈ H 1 (Ω) : u|∂D Ω = 0}.
Then, it is easy to show, that the functional Iε (see (2)) associated to (Mε ) satisfies
the Mountain Pass Theorem. Indeed by Poincaré inequality we can observe that
Remark 2 For ε fixed,
(∫
∥u∥HD1 =
ε |∇u|
2
2
) 12
Ω
is an equivalent norm to
(∫
∥u∥HD1 =
for the boundary problem (Mε ).
ε |∇u| + u
2
Ω
2
2
) 21
72
L. Montoro
Then by standard arguments Iε has the Mountain Pass geometry. Moreover since the
exponent p is subcritical (and actually since we are in dimension n = 2), a PalaisSmale condition holds true for the functional Iε .
Now we apply to the problem (Mε ) the Mountain Pass algorithm, due to Y.S. Choi
and P.J. McKenna, that is a constructive form of the Mountain-Pass Theorem:
First, we initialize a path from the local minimum w to the point of lower altitude e.
Second, we find the maximum of Iε along this path. Third, we deform the path in such
a way as to make the maximum along the path decrease as fast as possible. Finally,
in the last step we decide whether to stop, if the maximum turns out to have been a
critical point, or repeat the process again.
We outline and rewrite, for completeness and a better understanding, the main steps
in finding an approximate critical point, but for more ideas about the Mountain Pass
algorithm and its implementation, we refer to [6] and [7].
Step 1 of the algorithm: Since for our problem (Mε ) the Mountain Pass geometry
holds, there exist the local minimum w and the point of lower altitude e. We prescribe
a fixed number N , which determine the number of linear segments in the piecewise
linear path Γ0 . One possible starting path would be simply to take a straight line path
from e, to w. It is clear that the initial point e was not optimal. Thus we are led to
make an improvement in the initial path by choosing
g(i) = w +
i
(e − w),
N
1≤i≤N
and then evaluating Iε (g(i)) for i ≥ 1 until we find at i = i0 that
Iε (g(i)) ≤ Iε (w).
Having found such an i0 , we replace the original e with g(i0 ), shortening the distance
between w and e. We now choose our initial Γ0 to be the straight line between w and
e, and the corresponding g(i) to be the equally spaced points along this path.
Step 2 of the algorithm: With a given discretized path g(i), the value of i = im
at which a maximum of Iε (g(i)) occurs is computed. If the number of segments N is
large enough, then 1 ≤ im ≤ N −1, because w is a local minimum and Iε (e) ≤ Iε (w).
This first estimate of the maximum along the path is a little crude. We try to refine it by
the procedures below. Since the path is a discretized one, we will try to locate a higher
maximum between i = im−1 and i = im+1 . Now, we join g(im ) and g(im+1 ) by a
straight line path. In theory we seek α ≥ 0 such that I(g(im ) + αz) stops increasing,
where z = g(im+1 ) − g(im ). In practice, we test successively
g1 =
1
(g(im ) + g(im+1 )),
2
g2 =
1
1
(g(im ) + g(i1 )) g3 = (g(im ) + g(i2 ))
2
2
etc. until either
(a) gi is found such that I(gi ) > I(g(im )) or,
Computation of mountain pass solutions to some semi-linear elliptic problem
73
(b) ||gi − g(im )||HD1 = (1/2i )||gim+1 − g(im )||HD1 is smaller than a prescribed
tolerance.
In case (a), we evaluate gmid = 12 (g(im )+gi ), and perform a quadratic polynomial
interpolation to find a better value for the maximum location of Iε . In case (b), we do
not change g(im ).
Having checked whether we can improve the estimate of the maximum in the
direction of g(im+1 ), we now perform a similar task with g(im−1 ) and our revised
location of g(im ), so that g(im ) will be moved to attain a higher maximum. We now
refine the path in the neighborhood of the local maximum, by moving some of the
nearby nodes closer. If the H 1 norm distance between g(im ) and g(im+1 ) is larger
than a prescribed value, we then move a prescribed number of mesh points (around
this local maximum) to avoid the missing of saddle points.
Step 3 of the algorithm: The steepest descent direction at the local maximum
location is computed by finding a solution of the problem (Pv ) via finite element
method, solving equation (9), below. Thus, a numerical approximation of 2λv =
v − ε2 u can be computed.
If the difference on the right-hand side is smaller than a prescribed tolerance in the
H 1 norm (see Proposition 7 and (11)), we stop the algorithm and find an approximate
critical point of the functional Iε . Otherwise, up to a prescribed maximum, the
maximum point is moved downhill for a certain descent distance. This distance is
determined by a similar procedure as in step 2. If this distance or the difference in
Iε before and after the descent is smaller than a prescribed tolerance, then we have a
numerical approximation of a critical point and stop the algorithm. If not, repeat steps
2 and 3.
Remark 3 All the algorithm was implemented in a MATLAB code, using different
topics of programming language and different tools of standard numerical analysis.
For all that, we refer to the books [24] and [25].
B.2 The steepest descent direction
1
In our case, the steepest descent direction at u ∈ HD
(Ω) corresponds to finding the
1
direction v ∈ HD (Ω) with ||v||HD1 = 1 that minimize the Fréchet derivative Iε′ (u)v
subject to the constraint that ||v||HD1 = 1.
Introducing the Lagrange multiplier λ, we look for the unconstrained minimum of
1
the functional Jε,u : HD
(Ω) → R defined as:
∫
ε2 ∇u∇v + uv − f (u)v + λ|∇v|2
Jε,u (v) :=
Ω
1
where f (u) = u3 and u, v ∈ HD
(Ω).
(3)
74
L. Montoro
Thanks to Remark 2,
∫ it turns out to be more simple to deal with (3): in fact we can
|∇v|2 = 1 since, for ε fixed, ||∇v||L2 (Ω) is an equivalent
simply minimize over
Ω
norm to ∥v∥HD1 .
1
Now we find the direction v ∈ HD
(Ω) that minimizes the Fréchet derivative of
functional Jε,u defined in (3). If such a function v is a minimum of the functional Jε,u
(3), it must satisfy
′
Jε,u
(v) = 0
i.e.
∫
∫
ε2 ∇u∇w + uw − f (u)w = −
2λ∇v∇w.
(4)
Ω
Ω
Denoting v as:
v := 2λv + ε2 u.
(5)
l(u) := f (u) − u,
(6)
and defining the functional l(·) as
we can arrange equation (4) as:
∫
∫
∇w∇v =
Ω
l(u)w,
Ω
1
Then v ∈ HD
(Ω) corresponds to the unique weak solution (by Lax-Milgram lemma)
for the linear problem:


in Ω;
−∆v = l(u)
∂v
(Pv )
∂ν = 0 on ∂N Ω; v = 0 on ∂D Ω;


v>0
in Ω.
Then by (5) we get the direction v. Moreover |λ| is determined so that ||v||HD1 = 1.
Since
1
Jε,u (v + u) = Jε,u (v) + dJε,u (v)[u] + d2 Jε,u (v)[u, u] + o(||u||3 )
2
∫
and
|∇u|2 .
2λ
Ω
we get that v is a minimum (the steepest descent direction) if λ > 0.
B.3 Numerical approximation of problem (Pv )
In order to solve problem (Pv ) we need to find a numerical approximation of the
solution v, with l(u) known at the mesh points. We use the finite element method
(FEM for short) which is the most appropriate for approximating the solutions of
Computation of mountain pass solutions to some semi-linear elliptic problem
75
second-order problem posed in variational form over a given functional space V . A
well-known approach to solve such problems is Galerkin’s method, which consists in
defining similar problems, called discrete problems, over a finite-dimensional subspace
Vh of the space V . The finite element method is a Galerkin’s method characterized by
three basic aspects in the construction of the space Vh :
1. a triangulation Th is established over the set Ω, that is, the set Ω is written as a
finite union of finite elements K ∈ Th ;
2. the function vh ∈ Vh are piecewise polynomials, i.e., the spaces PK = {vh|K :
vh ∈ Vh } consist of polynomials;
3. there should exist a basis in the space Vh whose functions have a compact
support.
For more details about such type of method we refer to the books [8, 26, 27].
To solve problem (Pv ), we consider the linear abstract variational problem:
Find u ∈ V such that
∀v ∈ V,
a(u, v) = F (v),
(7)
where the space V , the bilinear form a(·, ·) and the linear form F satisfy the
assumptions of the Lax-Milgram lemma. In our case
1
(Ω),
(i) V = HD
∫
(ii) a(u, v) =
∇u∇v,
∫
(iii) F (v) =
Ω
lv,
Ω
where l is defined in (6).
Then we have to construct a similar problem in finite-dimensional subspace of the
space V . With any Vh of V we associate the discrete problem:
Find uh ∈ Vh such that
∀vh ∈ Vh ,
a(uh , vh ) = F (vh ).
(8)
By Lax-Milgram lemma, we infer that such a problem has only one solution uh . We
denote Nh the number of all mesh points of the domain. We use a piecewise linear
basis functions φj satisfying
{
0 i ̸= j,
φj (Ni ) = δij =
1 i = j,
76
L. Montoro
where Ni is a generic mesh point and i, j = 1, 2, . . . , Nh . Hence we express the
discrete solution uh as
uh (x) =
Nh
∑
uj φj (x),
∀x ∈ Ω,
uj = uh (Nj ),
∀x ∈ Ω,
vi = vh (Ni ).
j=1
and vh ∈ Vh as
vh (x) =
Nh
∑
vi φi (x),
i=1
Then we have to find the coefficients uj such that they are solutions of the linear
system
Nh
∑
a(φj , φi )uj = F (φi ),
i = 1, 2, . . . , Nh .
j=1
Then we get
Au = b
(9)
where A is a Nh × Nh matrix given by
A = [aij ],
∫
∇φj ∇φi ,
aij =
with
Ω
and u and b are vectors defined as
u = [uj ] with
∫
uj = uh (Nj ),
b = [bi ] with bi =
lφi .
Ω
All the integrals in the above expressions are been approximated by Gaussian 2-d
quadrature rule and 2-d composite trapezoidal rule.
Proposition 7 Let uh be the discrete solution to the problem (8). Then the H 1 norm
is given numerically by
1
||uh ||H 1 = (uT Au) 2 ,
where A and u are as in equation (9).
Proof. By a straight calculation, one has that:
∫
∇uh ∇uh =
Ω


Nh
Nh ∑
Nh
Nh
∑
∑
∑


a
=
a(uj φj , ui φi )
uj φj ,
u i φi
j=1
Nh ∑
Nh
∑
j=1 i=1
j=1 i=1
i=1
ui a(φj , φi )uj
=
Nh ∑
Nh
∑
j=1 i=1
= uT Au.
ui aij uj
Computation of mountain pass solutions to some semi-linear elliptic problem
77
Proposition 7 it is fundamental, through all the algorithm, in finding the H 1 norm of
approximated functions, solutions, and also the H 1 distance between any two given
functions.
A fundamental step in the FEM method is to define the stiffness matrix A
associated to the discretized problem. In our case we perform a mesh via triangular
elements and we use linear piecewise polynomials functions wh over each of such
elements, that is we choose wh = a+bx+dy linear inside each triangle and continuous
across each edge. Then within each triangle, the three coefficients (a, b, d) are uniquely
determined by the values of wh at the three vertices.
Let us denote by (pix , piy ), with i = 1, 2, 3, the coordinates of the three vertices of
each triangle in the cartesian coordinate system. Define the matrix Q as


1 p1x p1y
Q :=  1 p2x p2y  .
1 p3x p3y
Obviously the area ael of each triangle is given by
ael =
1
| det(Q)|.
2
Then denote the matrix c as
c = [cij ] := Q−1 .
The 3 × 3 stiffness element matrix Ael associated to each triangular element K, in
our case is given by

c221 + c231


Ael = ael 
 c22 c21 + c32 c31

c23 c21 + c33 c31
c21 c22 + c31 c32
c222 + c232
c23 c22 + c33 c32
The 3 × 1 element vector b for each K is defined as


h(p1x , p1y )



ael 
 h(p2x , p2y )  .
bel =


3 

h(p3x , p3y )
c21 c23 + c31 c33



c22 c23 + c32 c33 
.

c223 + c233
(10)
Then by summation over all Nh elements we obtain the assembly of the global
stiffness matrix A, that means that the matrix elements are added together, after being
78
L. Montoro
placed in the right position in the global array. In our problem, since the boundary
is curved we need to construct, in a useful way, a table containing the geometric and
topological properties of the mesh. Then we pass from a local problem, defined in each
element K, to the global one defined in Ω, getting the global stiffness matrix A.
For example, here below in Table 1 and Figure 2, we show a good assembly scheme
used in order to construct the stiffness matrix A. For more details about the assembly
scheme we refer to the book [26].
Table 1: Assembly scheme.
Element
1
2
3
4
5
..
.
1
6
7
7
632
722
..
.
Nodes
2
3
1
15
6
2
6
15
650 614
714 713
..
..
.
.
1
1
15
6
614
3
15
2
2
4
7
632
650
Figure 2: Number of elements and nodes.
Remark 4 All the integrals in the above expressions are been approximated by
Gaussian 2-d quadrature rule and 2-d composite trapezoidal rule. Denoting K a
triangular element, we can compute the integral of a function g(x) defined in K as
∫
area(K) ∑
g(pix , piy ).
3
i=1
3
g(x)dx =
K
Computation of mountain pass solutions to some semi-linear elliptic problem
79
In this way, for example, we get (10).
Another important goal is to create a mesh for the domain Ω which is elliptical.
With respect to the domains considered in [7], in our case the main difficulty is that
the boundary is curved. In general a mesh requires a choice of mesh points (vertex
Figure 3: Subdivision of the domain Ω into triangles.
nodes) and a triangulation, see Figure 3. The FEM triangulation of a domain with
a curved boundary is not a trivial task, see [6], because the coding work increases a
lot since we need a mesh refinement to better approximate the curved boundary. The
mesh should have small elements to resolve the fine details of the geometry, but larger
size, where possible, to reduce the total number of nodes. We have chosen triangular
elements because triangles are obviously better at approximating a curved boundary.
We subdivide the basic region Ω into triangles. The union of these triangles will be
a polygon Ωh . In general, if ∂Ω is a curved boundary, there will be a nonempty skin
Ω \ Ωh . We assume that Ωh is a subset of Ω and that no vertex of any triangles lies
along the edge of another one.
C
Graphic results: the spike-layer solution in the mixed case
Here below we present the graphic results obtained applying the numerical method
described in the previous section. The algorithm was implemented using a MATLAB
code. A detailed description of the code of all procedures of the algorithm is included
in [1].
Below, in the different figures, we analyze the behavior of the least energy solution
when the perturbation parameter ε goes to zero. We run the algorithm fixing, every
time, a smaller different ε-value.
80
L. Montoro
We denote dn the absolute convergence error defined here as
{
}
dn = max ||unε − un−1
||HD1 (Ω) , |Iε (unε ) − Iε (un−1
)| ,
ε
ε
(11)
where unε ( resp. un−1
) is the approximate least energy solution at the Step n (Step
ε
n−1) of the algorithm. The process stops when a fixed desired dn indicator is achieved.
In this work, we set dn = 10−3 .
As Figure 1 shows, we have considered an elliptical domain with boundary
subdivided in four parts, that is, we set:
∂N Ω =
Γ1N ∪ Γ2N ,
∂D Ω =
Γ1D ∪ Γ2D ,
with Γ1N ∩ Γ2N = ∅, Γ1D ∩ Γ2D = ∅ and ∂Ω = ∂N Ω ∪ ∂D Ω.
Since we are working in a domain Ω ⊂ R2 , the interface IΩ consists in four points
belonging to R2 space. On the two boundary parts, where the curvature reaches its
maximum value we impose Dirichlet conditions, see Figure 1.
In [12], we have showed that the concentration of the limit point Pε cannot occur
on the Dirichlet boundary part. The point Pε is such that
uε (Pε ) = maxuε (P ),
x∈Ω
and up to a subsequence, we suppose that Pε → P0 . In fact if
dist(Pε , ∂D Ω)
→ ∞,
ε
x
the least energy
ε
solution uε would converge to the ground state solution U of problem (1) in Rn in C 2
sense, with a critical level
cε = εn {I(U ) + o(1)}
as ε → 0, see [11, Corollary 3.7], up to a scaling of the type x′ =
bigger than the least energy solution critical level
cε = εn {
I(U )
+ o(1)},
2
which is a contradiction (see [12]). Then, since the two points where the curvature
maximizes belong to Γ1D and Γ2D , the maximum point Pε should concentrate at a point
P0 ∈ IΩ , see Theorem 4.
For each ε-value considered we run the algorithm getting a three-dimensional view,
see for example Figure 4, and a two-dimensional view, see for example Figure 5.
Computation of mountain pass solutions to some semi-linear elliptic problem
81
The three-dimensional view shows better the profile of such a solution: in fact
that is called spike layer, since it is highly concentrated near some point of Ω. As
the perturbation parameter ε becomes smaller, see Figure 6, Figure 8 and Figure 10,
the Mountain Pass type solutions uε tend to zero uniformly on every compact set of
Ω \ {P0 } (see Theorem 6), while there exist δ ≥ 1 and Pε such that uε (Pε ) ≥ δ. This
is evident utilizing the equation
−ε2 ∆u + u = u3
when Pε is in the interior and otherwise, when Pε ∈ ∂N Ω this follows from the
boundary Hopf lemma.
Instead, in the two dimensional view, see Figure 7, Figure 9 and Figure 11, we note
that the maximum point Pε belongs to the Neumann boundary part and, as ε goes to
zero, this point reaches the interface IΩ .
Moreover the support of the least energy solution becomes smaller when ε goes
to zero. In fact, for example, giving a look at Figure 10 and Figure 11, we note that
the maximum point Pε concentrates at the interface IΩ and the spike layer’s profile is
concentrated around the maximum point in a small support, as Theorem 6 states.
82
L. Montoro
Figure 4: Three-dimensional view with ε = 0.7 and max uε = 2.14.
Figure 5: Two-dimensional view with ε = 0.7.
Computation of mountain pass solutions to some semi-linear elliptic problem
Figure 6: Three-dimensional view with ε = 0.4 and max uε = 2.21.
Figure 7: Two-dimensional view with ε = 0.4.
83
84
L. Montoro
Figure 8: Three-dimensional view with ε = 0.2 and max uε = 2.52.
Figure 9: Two-dimensional view with ε = 0.2.
Computation of mountain pass solutions to some semi-linear elliptic problem
Figure 10: Three-dimensional view with ε = 0.1 and max uε = 1.99.
Figure 11: Two-dimensional view with ε = 0.1.
85
86
C.1
L. Montoro
A concluding case: the spike-layer solution in the Dirichlet case
As last application, we show as the Mountain Pass algorithm can be applied to the
singularly perturbed Dirichlet problem

2
3

in Ω;
−ε ∆u + u = u
(12)
u=0
on ∂Ω;


u>0
in Ω.
In fact, also in this case, the least energy solutions come from the Mountain Pass
Theorem. While in the Mixed case (M̃ε ) the spike layer solution with its unique
maximum concentrates in the closure of the Neumann boundary part ∂N Ω where the
curvature maximize, in the Dirichlet case (Dε ) the spike layer solution has one local
maximum achieved at a point that maximize the distance from the boundary ∂Ω, see
Theorem 2. We can use the Mountain Pass algorithm to treat the perturbed problem
(12) over the same elliptical domain, making the necessary changes with respect to
the mixed case. As Figures 12, 13, 14 and 15 show, we get a spike-layer solution
concentrating, as ε goes to zero, at the center of symmetry of the elliptical domain in a
support becoming smaller around the unique maximum point Pε .
Figure 12: Three-dimensional view with ε = 0.8 and max uε = 2.91.
Computation of mountain pass solutions to some semi-linear elliptic problem
Figure 13: Three-dimensional view with ε = 0.4 and max uε = 2.26.
Figure 14: Three-dimensional view with ε = 0.2 and max uε = 2.36.
87
88
L. Montoro
Figure 15: Three-dimensional view with ε = 0.1 and max uε = 1.98.
References
[1] Files are available online at http://www.mat.unical.it/∼ montoro
[2] A. Ambrosetti and A. Malchiodi, Perturbation methods and semilinear elliptic
problems on Rn , Progress in Mathematics, vol. 240, Birkhäuser Verlag, Basel,
2006.
[3] A. Ambrosetti and A. Malchiodi, Nonlinear analysis and semilinear elliptic
problems, Cambridge Studies in Advanced Mathematics, vol. 104, Cambridge
University Press, Cambridge, 2007.
[4] A. Ambrosetti and P. H. Rabinowitz, Dual variational methods in critical point
theory and applications, J. Functional Analysis, 14 (1973), 349–381.
[5]
G. Chen and W. M. Ni and A. Perronnet and J. Zhou, Algorithms and
visualization for solutions of nonlinear elliptic equations. II. Dirichlet, Neumann
and Robin boundary conditions and problems in 3D, Internat. J. Bifur. Chaos
Appl. Sci. Engrg., 11 (2001), no. 7, 1781-1799.
[6] G. Chen and W. M. Ni and J. Zhou, Algorithms and visualization for solutions
of nonlinear elliptic equations, Internat. J. Bifur. Chaos Appl. Sci. Engrg., 10
(2000), no. 7, 1565-1612.
Computation of mountain pass solutions to some semi-linear elliptic problem
89
[7] Y. S. Choi, and P. J. McKenna, A mountain pass method for the numerical
solution of semilinear elliptic problems, Nonlinear Anal., 20 (1993), no. 4, 417437.
[8] P. G. Ciarlet, The finite element method for elliptic problems, Classics in Applied
Mathematics, vol. 40, Society for Industrial and Applied Mathematics (SIAM),
Philadelphia, PA, 2002.
[9]
E. N. Dancer and S. Yan, Multipeak solutions for a singularly perturbed
Neumann problem, Pacific J. Math., 189 (1999), no. 2, 241-262.
[10] M. del Pino and P. L. Felmer and J. Wei, On the role of mean curvature in some
singularly perturbed Neumann problems, SIAM J. Math. Anal., 31 (1999), no. 1,
63-79 (electronic).
[11] J. Garcia Azorero and A. Malchiodi and L. Montoro and I. Peral, Concentration
of solutions for some singularly perturbed mixed problems: existence results,
Arch. Rational Mech. Anal., 196 (2010), 907-950.
[12] J. Garcia Azorero and A. Malchiodi and L. Montoro and I. Peral, Concentration
of solutions for some singularly perturbed mixed problems: asymptotics of
minimal energy solutions, Ann. Inst. H. Poincaré Anal. Non Linéaire., 27 (2010),
no. 1, 37-56.
[13] M. Grossi and A. Pistoia and J. Wei, Existence of multipeak solutions for a
semilinear Neumann problem via nonsmooth critical point theory, Calc. Var.
Partial Differential Equations, 11 (2000), no. 2, 143-175.
[14] C. Gui, Multipeak solutions for a semilinear Neumann problem, Duke Math. J.,
84 (1996), no. 3, 739-769.
[15] C. Gui and J. Wei, On multiple mixed interior and boundary peak solutions for
some singularly perturbed Neumann problems, Canad. J. Math., 52 (2000), no.
3, 522-538.
[16] C. Gui and J. Wei and M. Winter, Multiple boundary peak solutions for some
singularly perturbed Neumann problems, Ann. Inst. H. Poincaré Anal. Non
Linéaire, 17 (2000), no. 1, 47-82.
[17] Y. Y. Li, On a singularly perturbed equation with Neumann boundary condition,
Comm. Partial Differential Equations, 23 (1998), no. 3-4, 487-545.
[18] Y. Li and L. Nirenberg, The Dirichlet problem for singularly perturbed elliptic
equations, Comm. Pure Appl. Math., 51 (1998), no. 11-12, 1445-1490.
[19] C. S. Lin and W. M. Ni and I. Takagi, Large amplitude stationary solutions to a
chemotaxis system, J. Differential Equations, 72 (1988), no. 1, 1-27.
[20] W. M. Ni and I. Takagi, On the shape of least-energy solutions to a semilinear
Neumann problem, Comm. Pure Appl. Math., 44 (1991), no. 7, 819-851.
90
L. Montoro
[21] W. M. Ni and I. Takagi, Locating the peaks of least-energy solutions to a
semilinear Neumann problem, Duke Math. J., 70 (1993), no. 2, 247-281.
[22] W. M. Ni and J. Wei, On the location and profile of spike-layer solutions to
singularly perturbed semilinear Dirichlet problems, Comm. Pure Appl. Math.,
48 (1995), no. 7, 731-768.
[23] L. Montoro, On the shape of the least-energy solutions to some singularly
perturbed mixed problems, Commun. Pure Appl. Anal., 9 (2010), no.6, 17311752.
[24] A. Quarteroni and R. Sacco and F. Saleri, Numerical mathematics, Texts in
Applied Mathematics, vol. 37, Springer-Verlag, New York, 2000.
[25] A. Quarteroni and F. Saleri, Scientific computing with MATLAB and Octave,
second ed., Texts in Computational Science and Engineering, vol. 2, SpringerVerlag, Berlin, 2006.
[26] A. Quarteroni and A. Valli, Numerical approximation of partial differential
equations, Springer Series in Computational Mathematics, vol. 23, SpringerVerlag, Berlin, 1994.
[27] G. Strang and G. J. Fix, An analysis of the finite element method, PrenticeHall Inc., Englewood Cliffs, N. J., 1973, Prentice-Hall Series in Automatic
Computation.
[28] Z. Q. Wang, On the existence of multiple, single-peaked solutions for a semilinear
Neumann problem, Arch. Rational Mech. Anal., 120 (1992), no. 4, 375-399.
[29] J. Wei, On the boundary spike layer solutions to a singularly perturbed Neumann
problem, J. Differential Equations, 134 (1997), no. 1, 104-133.
S⃗eMA JOURNAL BEST PAPER AWARD
The Spanish Society of Applied Mathematics (S⃗eMA), aware of the remarkable
development of applied mathematics and its growing influence in the everyday life
of modern societies and also aware of the need to encourage the transfer of scientific
knowledge within our community, is proud to announce the “S⃗eMA Journal Best Paper
Award”, for outstanding papers published in the S⃗eMA Journal. The rules of this
prize are established below. One of the main interests of S⃗eMA is to promote the
dissemination of applied mathematics, emphasizing its relevance and potential. There
is an increasingly large body of applied interests in mathematics and the guidelines
below seek to enforce those subjects that have traditionally been in the core of the
scientific activities in S⃗eMA, that is, theoretical and numerical analysis, control theory
and computational techniques in mathematical modelling in science.
GENERAL RULES
1. The Spanish Society of Applied Mathematics (S⃗eMA) will award the
“S⃗eMA Journal Best-Paper Award” on a yearly basis.
2. All papers published in the volumes of the S⃗eMA Journal of the corresponding
year will be eligible for the prize.
3. There will be an ad-hoc Prize Committee, composed by five members: The
Editor-in-Chief of the S⃗eMA Journal, who will be the chair, the President of
S⃗eMA (or a member of the Executive Committee of S⃗eMA acting on behalf) and
three members of the Editorial Board the S⃗eMA Journal, proposed by the Editorin-Chief and designated by the Executive Committee. The Prize Committee will
determine its own procedure to carry out the selection process.
4. The award includes a diploma containing the citation and a cash prize to
be shared among the authors of the awarded paper. If the authors are
S⃗eMA members, the S⃗eMA membership fee will be waived for a one year period.
If any of the authors is not a member of S⃗eMA, he or she will be automatically
granted membership in the Society, and the fee will be waived for one year.
5. The decision of the Committee will be irrevocable. The Committee will report
on the merits found in the awarded paper.
6. One of the authors of the awarded paper should agree to deliver a presentation
concerning the contents of the paper in one of the activities organized by
S⃗eMA during the year in which the prize is awarded.
91