4 A variational point of view
In this chapter, we are going to learn about a more robust and powerful way
to approach and understand transient growth. Instead of trying to think about
G(t) as a function of t, let us fix some target time T, and just look at G(T).
For any two times $t_i < t_f$, we can define a propagator function such that
$$u(t_f) = \Phi(t_f, t_i)\, u(t_i)$$
for any solution $u$ of our equations. In the linear approximation, this propagator
is a linear map between appropriate function spaces. Writing $\Phi = \Phi(T, T_0)$,
the gain problem is then equivalent to maximizing
$$G(T, T_0) = \frac{E(T)}{E(T_0)} = \frac{\langle u_p(T), u_p(T)\rangle}{\langle u_p(T_0), u_p(T_0)\rangle} = \frac{\langle \Phi u_p(T_0), \Phi u_p(T_0)\rangle}{\langle u_p(T_0), u_p(T_0)\rangle} = \frac{\langle u_p(T_0), \Phi^\dagger \Phi\, u_p(T_0)\rangle}{\langle u_p(T_0), u_p(T_0)\rangle}.$$
Here the angled brackets denote the natural inner product leading to the energy
norm. Note that the operator $\Phi^\dagger \Phi$ is necessarily self-adjoint, and so this $G$
is maximized when $u_p(T_0)$ is chosen to be the eigenvector of $\Phi^\dagger\Phi$ with maximum
eigenvalue.
There is a general method to find the maximum eigenvector of a (self-adjoint)
operator $\Phi$. We start with a random vector $x$. Then we have
$$\Phi^n x \sim \lambda_1^n v_1 \quad \text{as } n \to \infty,$$
where $\lambda_1$ is the maximum eigenvalue with associated eigenvector $v_1$.
Indeed, if we write $x$ as a linear combination of eigenvectors, then as we apply
$\Phi$ many times, the sum is dominated by the term with the largest eigenvalue.
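As a concrete (if simplistic) illustration, here is a sketch of power iteration in Python for a small symmetric matrix standing in for $\Phi^\dagger\Phi$; the routine and variable names are illustrative, not from any particular library.

```python
import numpy as np

def power_iteration(apply_op, x0, tol=1e-10, max_iter=1000):
    """Dominant eigenvalue/eigenvector of a self-adjoint operator.

    apply_op : function returning the action of the operator on a vector
    x0       : random starting vector
    """
    x = x0 / np.linalg.norm(x0)
    lam = 0.0
    for _ in range(max_iter):
        y = apply_op(x)
        lam_new = np.dot(x, y)        # Rayleigh quotient <x, Ax> for unit x
        x = y / np.linalg.norm(y)     # renormalise so the iterates stay O(1)
        if abs(lam_new - lam) <= tol * max(abs(lam_new), 1.0):
            break
        lam = lam_new
    return lam_new, x

# Toy example: a symmetric 2x2 matrix playing the role of Phi^dagger Phi.
A = np.array([[2.0, 1.0], [1.0, 3.0]])
rng = np.random.default_rng(0)
gain, mode = power_iteration(lambda x: A @ x, rng.standard_normal(2))
```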
So if we want to find the mode with maximal transient growth, we only need
to be able to compute $\Phi^\dagger\Phi$. The forward propagator $\Phi(T, T_0)$ is something we
know how to compute (at least numerically): we simply numerically integrate
the Navier–Stokes equations. So we need to understand $\Phi(T, T_0)^\dagger$.
Let $u(t)$ be a solution to the linear equation
$$\frac{\mathrm{d}u}{\mathrm{d}t}(t) = L(t)\, u(t).$$
Here we may allow $L$ to depend on time. Let $L^\dagger$ be the adjoint, and suppose
$v(t)$ satisfies
$$\frac{\mathrm{d}v}{\mathrm{d}t}(t) = -L(t)^\dagger v(t).$$
Then the product rule tells us
$$\frac{\mathrm{d}}{\mathrm{d}t}\langle v(t), u(t)\rangle = \langle -L(t)^\dagger v(t), u(t)\rangle + \langle v(t), L(t)\, u(t)\rangle = 0.$$
So we know that
$$\langle v(T_0), u(T_0)\rangle = \langle v(T), u(T)\rangle = \langle v(T), \Phi(T, T_0)\, u(T_0)\rangle.$$
Thus, given a $v_0$, to compute $\Phi(T, T_0)^\dagger v_0$, we have to integrate the adjoint equation
$$\frac{\mathrm{d}v}{\mathrm{d}t} = -L(t)^\dagger v$$
backwards in time, using the "initial" condition $v(T) = v_0$, and
then we have
$$\Phi(T, T_0)^\dagger v_0 = v(T_0).$$
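As a sanity check, here is a small numerical sketch (a toy matrix ODE, not the Navier–Stokes problem) verifying that integrating the adjoint equation backwards from $v(T) = v_0$ does reproduce the adjoint action, i.e. $\langle v(T_0), u(T_0)\rangle = \langle v_0, \Phi(T, T_0) u(T_0)\rangle$. The choice of $L(t)$ below is arbitrary.

```python
import numpy as np
from scipy.integrate import solve_ivp

T0, T = 0.0, 2.0
L = lambda t: np.array([[0.0, 1.0 + 0.5 * t],
                        [-0.3, -0.1]])        # arbitrary time-dependent L(t)

def propagate(u0):
    """u(T) = Phi(T, T0) u0, by integrating du/dt = L(t) u forwards."""
    sol = solve_ivp(lambda t, u: L(t) @ u, (T0, T), u0, rtol=1e-10, atol=1e-12)
    return sol.y[:, -1]

def propagate_adjoint(v0):
    """v(T0) = Phi(T, T0)^T v0, by integrating dv/dt = -L(t)^T v backwards from v(T) = v0."""
    sol = solve_ivp(lambda t, v: -L(t).T @ v, (T, T0), v0, rtol=1e-10, atol=1e-12)
    return sol.y[:, -1]

u0 = np.array([1.0, 0.5])
v0 = np.array([0.2, -1.0])
# Both numbers agree to integration tolerance: <v0, Phi u0> = <Phi^T v0, u0>.
print(np.dot(v0, propagate(u0)), np.dot(propagate_adjoint(v0), u0))
```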
What does the adjoint equation look like? For a time-dependent background
shear flow $U(t)$, the linearized forward/direct equation for a perturbation $u_p$ is given
by
$$\frac{\partial u_p}{\partial t} + (U(t)\cdot\nabla)\, u_p = -\nabla p_p - (u_p\cdot\nabla)\, U(t) + Re^{-1}\nabla^2 u_p.$$
The adjoint (linearized) Navier–Stokes equation is then
$$-\frac{\partial u_a}{\partial t} = \Omega \times u_a - \nabla\times(U\times u_a) - \nabla p_a + Re^{-1}\nabla^2 u_a,$$
where we again have
$$\nabla\cdot u_a = 0, \qquad \Omega = \nabla\times U.$$
This PDE is ill-posed if we wanted to integrate it forwards in time, but that
does not concern us, because to compute $\Phi^\dagger$, we have to integrate it backwards
in time.
Thus, to find the maximal transient mode, we have to run the direct-adjoint
loop
$$u_p(T_0) \;\xrightarrow{\;\Phi\;}\; u_p(T) \;\xrightarrow{\;\Phi^\dagger\;}\; \Phi^\dagger\Phi\bigl(u_p(T_0)\bigr),$$
and keep running this until it converges.
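In pseudocode-like Python, the loop might look as follows; `integrate_direct` and `integrate_adjoint` are stand-ins for numerical solvers of the direct and adjoint equations (forwards and backwards in time respectively), not actual library routines.

```python
import numpy as np

def maximal_transient_mode(integrate_direct, integrate_adjoint, u0_guess,
                           max_iter=100, tol=1e-8):
    """Direct-adjoint loop: power iteration on Phi^dagger Phi.

    integrate_direct(u0)  -> u(T)   (the action of Phi(T, T0))
    integrate_adjoint(uT) -> v(T0)  (the action of Phi(T, T0)^dagger)
    """
    u0 = u0_guess / np.linalg.norm(u0_guess)
    gain = 0.0
    for _ in range(max_iter):
        uT = integrate_direct(u0)     # forward sweep
        w = integrate_adjoint(uT)     # backward (adjoint) sweep: Phi^dagger Phi u0
        new_gain = np.dot(u0, w)      # Rayleigh quotient = energy gain G(T, T0)
        u0 = w / np.linalg.norm(w)    # renormalised guess for the next pass
        if abs(new_gain - gain) <= tol * max(new_gain, 1.0):
            break
        gain = new_gain
    return new_gain, u0
```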
Using this analysis, we can find some interesting results. For example, in
a shear layer flow, 3-dimensional modes with both streamwise and spanwise
perturbations exhibit the largest transient growth. However, in the long run, the Kelvin–Helmholtz modes dominate.
Variational formulation
We can also use variational calculus to find the maximally unstable mode. We
think of the calculation as a constrained optimization problem with the following
requirements:
(i) For all $T_0 \le t \le T$, we have
$$\frac{\partial q}{\partial t} = D_t q = Lq.$$
(ii) The initial state is given by $q(0) = q_0$.
We will need Lagrange multipliers to impose these constraints, and so the
augmented Lagrangian is
$$G = \frac{\langle q_T, q_T\rangle}{\langle q_0, q_0\rangle} - \int_0^T \langle \tilde q, (D_t - L)\, q\rangle\,\mathrm{d}t + \langle \tilde q_0, q(0) - q_0\rangle.$$
Taking variations with respect to $\tilde q$ and $\tilde q_0$ recovers the evolution equation and
the initial condition. The integral can then be written as
$$-\int_0^T \langle \tilde q, (D_t - L)\, q\rangle\,\mathrm{d}t = \int_0^T \langle q, (D_t + L^\dagger)\,\tilde q\rangle\,\mathrm{d}t + \langle \tilde q_0, q_0\rangle - \langle \tilde q_T, q_T\rangle.$$
Now if we take a variation with respect to $q$, we see that $\tilde q$ has to satisfy
$$(D_t + L^\dagger)\,\tilde q = 0.$$
So the Lagrange multiplier within such a variational problem evolves according
to the adjoint equation!
Taking appropriate adjoints, we can write
$$G = \frac{\langle q_T, q_T\rangle}{\langle q_0, q_0\rangle} + \int_0^T \langle q, (D_t + L^\dagger)\,\tilde q\rangle\,\mathrm{d}t + \langle \tilde q_0, q_0\rangle - \langle \tilde q_T, q_T\rangle + \text{boundary terms}.$$
But if we take variations with respect to our initial conditions, then $\dfrac{\delta G}{\delta q_0} = 0$
gives
$$\tilde q_0 = \frac{2\langle q_T, q_T\rangle}{\langle q_0, q_0\rangle^2}\, q_0.$$
Similarly, setting $\dfrac{\delta G}{\delta q_T} = 0$, we get
$$\tilde q_T = \frac{2}{\langle q_0, q_0\rangle}\, q_T.$$
Applying $\langle\,\cdot\,, q_0\rangle$ to the first equation and $\langle\,\cdot\,, q_T\rangle$ to the second, we find that
$$\langle \tilde q_0, q_0\rangle = \langle \tilde q_T, q_T\rangle.$$
This is the relation we previously derived for the adjoint equation, and it provides
a consistency condition for checking whether we have found the optimal solution.
Previously, our algorithm used power iteration, which requires us to integrate
forwards and backwards many times. However, we now have gradient information
for the gain:
$$\frac{\delta G}{\delta q_0} = \tilde q_0 - \frac{2\langle q_T, q_T\rangle}{\langle q_0, q_0\rangle^2}\, q_0.$$
This allows us to exploit optimization algorithms (steepest descent, conjugate
gradient, etc.), which offers the opportunity for faster convergence.
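A sketch of a simple steepest-ascent update using this gradient is shown below; `forward_solve` and `adjoint_solve` are placeholders for the direct solve and for the backward adjoint solve started from the terminal condition $\tilde q_T = 2 q_T / \langle q_0, q_0\rangle$, and a fixed step size is assumed.

```python
import numpy as np

def ascend_gain(forward_solve, adjoint_solve, q0, step=0.1, n_iter=100):
    """Maximise G = <q_T, q_T> / <q_0, q_0> by steepest ascent on q_0.

    forward_solve(q0)       -> q_T       (direct equation, integrated forwards)
    adjoint_solve(qT_tilde) -> q0_tilde  (adjoint equation, integrated backwards)
    """
    for _ in range(n_iter):
        qT = forward_solve(q0)
        num, den = np.dot(qT, qT), np.dot(q0, q0)
        q0_tilde = adjoint_solve(2.0 * qT / den)   # terminal condition q~_T = 2 q_T / <q0, q0>
        grad = q0_tilde - 2.0 * num / den**2 * q0  # delta G / delta q_0
        q0 = q0 + step * grad                      # fixed-step ascent (could be CG, etc.)
    qT = forward_solve(q0)
    return np.dot(qT, qT) / np.dot(q0, q0), q0
```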
There are many ways we can modify this. One way is to modify the inner
product. Note that we actually had to use an inner product on three occasions:
(i) the inner product in the objective functional $J$;
(ii) the inner product in the initial normalization/constraint of the state vector;
(iii) the inner product in the definition of the adjoint operator.
We used the same energy norm for all of these, but there is no reason we have
to use the same norm for all three. However, there are strong arguments that
an energy norm is natural for norm (ii). It is a wide open research question as to
whether there is an appropriate choice for inner product (iii). On the other hand,
variation of inner product (i) has been widely explored.
As an example, consider $p$-norms
$$J = \left(\frac{1}{V}\int e(x, T)^p \,\mathrm{d}\Omega\right)^{1/p}, \qquad e(x, T) = \frac{1}{2}|u(x, T)|^2.$$
This has the attraction that for large values of $p$, the objective is dominated by
the peak values of $e$, not the average.
When we set $p = 1$, i.e. we use the usual energy norm, we get a beautiful
example of the Orr mechanism, in perfect agreement with what the SVD tells us.
For, say, $p = 50$, we get more exotic center/wall modes.
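For reference, the discrete analogue of this objective on a gridded velocity field can be evaluated as in the sketch below; the grid, cell volume `dV` and domain volume `V` are assumed inputs, not quantities defined in the text.

```python
import numpy as np

def p_norm_objective(u, dV, V, p):
    """J = ( (1/V) * sum_cells e^p * dV )^(1/p), with e = |u|^2 / 2.

    u  : array of shape (..., 3) holding the velocity field at time T
    dV : volume of each (uniform) grid cell; V : total domain volume
    """
    e = 0.5 * np.sum(u**2, axis=-1)              # local kinetic energy density
    return (np.sum(e**p) * dV / V) ** (1.0 / p)  # large p emphasises peaks of e
```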
Non-linear adjoints
In the variational formulation, there is nothing in the world that stops us from
using a non-linear evolution equation! We shall see that this results in slightly
less pleasant formulas, but it can still be done.
Consider plane Couette flow with $Re = 1000$, and allow arbitrary amplitude
in the perturbation:
$$u_{\mathrm{tot}} = U + u, \qquad \frac{\partial u}{\partial t} + (u + U)\cdot\nabla(u + U) = -\nabla p + Re^{-1}\nabla^2 u,$$
where $U = y\hat{x}$. We can define a variational problem with Lagrangian
$$\mathcal{L} = \frac{E(T)}{E_0} - [\partial_t u + N(u) + \nabla p,\, v] - [\nabla\cdot u,\, q] - c\left(\tfrac{1}{2}\langle u_0, u_0\rangle - E_0\right) - \langle u(0) - u_0,\, v_0\rangle,$$
where
$$N(u_i) = U_j \partial_j u_i + u_j \partial_j U_i + u_j \partial_j u_i - \frac{1}{Re}\partial_j\partial_j u_i, \qquad [v, u] = \frac{1}{T}\int_0^T \langle v, u\rangle\,\mathrm{d}t.$$
Variations with respect to the direct variable $u$ can once again define a non-linear
adjoint equation
$$\frac{\delta\mathcal{L}}{\delta u} = \partial_t v + N^\dagger(v, u) + \nabla q + \left.\left(\frac{u}{E_0} - v\right)\right|_{t=T} + \left.(v - v_0)\right|_{t=0} = 0,$$
where
$$N^\dagger(v_i, u) = \partial_j(u_j v_i) - v_j \partial_i u_j + \partial_j(U_j v_i) - v_j \partial_i U_j + \frac{1}{Re}\partial_j\partial_j v_i.$$
We also have
$$\frac{\delta\mathcal{L}}{\delta p} = \nabla\cdot v = 0, \qquad \frac{\delta\mathcal{L}}{\delta u_0} = v_0 - c\, u_0 = 0.$$
Note that the equation for the adjoint variable $v$ depends on the direct variable
$u$, but is linear in $v$. Computationally, this means that we need to remember
our solution for $u$ when we do our adjoint loop. If $T$ is large, then it may not be
feasible to store all information about $u$ across the whole period, as that is too
much data. Instead, the method of checkpointing is used (a sketch is given after the list):
(i) Pick "checkpoints" $0 = T_0 < \cdots < T_K = T$.
(ii) When integrating $u$ forwards, we remember high-resolution data for $u(x, T_k)$ only at each checkpoint $T_k$.
(iii) When integrating $v$ backwards in the interval $(T_{k-1}, T_k)$, we use the data
remembered at $T_{k-1}$ to re-integrate and obtain detailed information about
$u$ in the interval $(T_{k-1}, T_k)$.
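A minimal sketch of this bookkeeping is given below; `step_u`, `resolve_u` and `step_v_back` stand in for the direct solver (endpoint only), the direct solver storing a finely resolved trajectory, and the backward adjoint solver using that trajectory. None of these names come from a particular code.

```python
def adjoint_with_checkpointing(step_u, resolve_u, step_v_back,
                               u_init, v_terminal, checkpoints):
    """Backward adjoint sweep using checkpointed direct data.

    step_u(u, t0, t1)            -> u(t1), storing nothing in between
    resolve_u(u, t0, t1)         -> finely resolved trajectory of u on [t0, t1]
    step_v_back(v, traj, t1, t0) -> v(t0), using that trajectory of u
    checkpoints                  -> times 0 = T_0 < ... < T_K = T
    """
    # Forward sweep: keep only the states of u at the checkpoints.
    saved = {checkpoints[0]: u_init}
    for t0, t1 in zip(checkpoints[:-1], checkpoints[1:]):
        saved[t1] = step_u(saved[t0], t0, t1)

    # Backward sweep: on each interval, re-integrate u from the stored checkpoint,
    # then march the adjoint variable v backwards through that interval.
    v = v_terminal
    for t0, t1 in zip(reversed(checkpoints[:-1]), reversed(checkpoints[1:])):
        u_traj = resolve_u(saved[t0], t0, t1)
        v = step_v_back(v, u_traj, t1, t0)
    return v
```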
This powerful algorithmic approach allows us to identify the minimal seed of turbulence,
i.e. the smallest finite-amplitude perturbation required to trigger transition to turbulence.
Note that this is something that cannot be understood by linear approximations!
This is rather useful, because in real life, it allows us to figure out how to modify
our system to reduce the chance of turbulence arising.