Below are some of my markdown notes for concepts covered and reviewed throughout the semester - seperate from my written notes.
Multivariable Calculus
The nabla operator represents the vector of partial derivatives:
∇ = ⟨ ∂ ∂ x , ∂ ∂ y , ∂ ∂ z ⟩ \boxed{
\nabla = \left\langle \frac{\partial}{\partial x}, \frac{\partial}{\partial y}, \frac{\partial}{\partial z}\right\rangle
}
∇ = ⟨ ∂ x ∂ , ∂ y ∂ , ∂ z ∂ ⟩
Gradient
∇ f = ⟨ ∂ f ∂ x , ∂ f ∂ y , ∂ f ∂ z ⟩ \boxed{
\nabla f = \left\langle \frac{\partial f}{\partial x}, \frac{\partial f}{\partial y}, \frac{\partial f}{\partial z}\right\rangle
}
∇ f = ⟨ ∂ x ∂ f , ∂ y ∂ f , ∂ z ∂ f ⟩
The gradient of a scalar function gives the direction and rate of the greatest increase.
Written as g r a d f \mathrm{grad}\,f grad f and ∇ f \nabla f ∇ f .
Divergence
∇ ⋅ F = ∂ F 1 ∂ x + ∂ F 2 ∂ y + ∂ F 3 ∂ z \boxed{
\nabla\cdot F = \frac{\partial F_1}{\partial x} + \frac{\partial F_2}{\partial y} + \frac{\partial F_3}{\partial z}
}
∇ ⋅ F = ∂ x ∂ F 1 + ∂ y ∂ F 2 + ∂ z ∂ F 3
The divergence of a vector field is the extent to which the vector field flux behaves like a source or a sink at a given point.
Written as d i v F \mathrm{div}\,F div F , and ∇ ⋅ F \nabla\cdot F ∇ ⋅ F . It is proper to write the ∇ \nabla ∇ as ∇ ⃗ \vec{\nabla} ∇ since it behaves like a vector operator in three-dimensional space.
Divergence Theorem
∫ V ( ∇ ⋅ F ) d V = ∫ S ( F ⋅ n ⃗ ) d S \boxed{
\int_V(\nabla\cdot F)\,dV = \int_S(F\cdot\vec n)\,dS
}
∫ V ( ∇ ⋅ F ) d V = ∫ S ( F ⋅ n ) d S
The dot product F ⋅ n F \cdot n F ⋅ n measures the component of F F F that is perpendicular to the surface at each point, which is essential for calculating the flux through the surface.
Note: a common notation in the Chorin textbook is to use W W W to represent a region in the fluid at a particular time t t t , and ∂ W \partial W ∂ W to represent the surface that encloses W W W ("the boundary of W W W ").
In topology, ∂ \partial ∂ is used to denote the boundary of a set
Week 2 recap
density: ρ ( x , t ) \rho(x, t) ρ ( x , t )
velocity: u ( x , t ) u(x,t) u ( x , t )
pressure: p ( x , t ) p(x,t) p ( x , t )
Goal #1: Derive Euler’s Equation using the Net Force Equation
Net Force Equation (Newton’s second law of motion):
F ⃗ = m a ⃗ = m d u ⃗ d t = d d t ( m a ⃗ ) = d p ⃗ d t , where p ⃗ is momentum \begin{align}
\vec F &= m\vec a\\
&= m \frac{d\vec u}{dt} = \frac{d}{dt}(m\vec a) = \frac{d\vec p}{dt}, \textrm{where }\vec p\textrm{ is momentum}\\
\end{align}
F = m a = m d t d u = d t d ( m a ) = d t d p , where p is momentum
Inviscid Fluid: no (dynamic) viscosity ( μ = 0 ) (\mu = 0) ( μ = 0 )
Making the assumption that there is no viscosity allows us to represent the following quantities in a simplified form:
Note: pressure is p p p , momentum is p ⃗ \vec p p .
Momentum in V = ∫ V ρ u ⃗ d V Gravity force (body force) = ∫ V ρ g ⃗ d V Force due to pressure = − ∫ S p n ⃗ d S \begin{align}
\textrm{Momentum in }V&=\int_V \rho\vec u\, dV\\
\underset{\textrm{(body force)}}{\textrm{Gravity force}} &= \int_V\rho\vec g \, dV\\
\textrm{Force due to pressure} &= -\int_S p\vec n \, dS
\end{align}
Momentum in V (body force) Gravity force Force due to pressure = ∫ V ρ u d V = ∫ V ρ g d V = − ∫ S p n d S
Question: What are the units for the total momentum in V V V ?
Density is expressed as mass over a cubic volume, and velocity is expressed as length over time, so we have that [ ρ ] [ u ⃗ ] = M L 3 L T = M L 2 T [\rho][\vec u]=\frac{M}{L^3}\frac{L}{T}=\frac{M}{L^2T} [ ρ ] [ u ] = L 3 M T L = L 2 T M . Then, we integrate over volume, so we multiply our units by L 3 L^3 L 3 to get [ p ⃗ ] = M L T [\vec p]=M\frac{L}{T} [ p ] = M T L
units of pressure: [ p ] = F L 2 [p] = \frac{F}{L^2} [ p ] = L 2 F
d d t ∫ V ρ u ⃗ d V = − ∫ S p n ⃗ d S + ∫ V ρ g ⃗ d V \frac{d}{dt}\int_V \rho\vec{u}\,dV=-\int_Sp\vec n\,dS+\int_V\rho\vec g \,dV
d t d ∫ V ρ u d V = − ∫ S p n d S + ∫ V ρ g d V
Leibniz integral rule (swap integral and derivative)
∫ V ∂ ∂ t ( ρ u ⃗ ) d V = − ∫ S p n ⃗ d S + ∫ V ρ g ⃗ d V \int_V\frac{\partial}{\partial t}(\rho \vec u)\,dV = -\int_Sp\vec n\,dS+\int_V\rho\vec g\, dV
∫ V ∂ t ∂ ( ρ u ) d V = − ∫ S p n d S + ∫ V ρ g d V
Divergence Theorem: ∫ V ( ∇ ⋅ F ) d V = ∫ S ( F ⋅ n ⃗ ) d S \int_V(\nabla\cdot F)\,dV = \int_S(F\cdot\vec n)\,dS ∫ V ( ∇ ⋅ F ) d V = ∫ S ( F ⋅ n ) d S
− ∫ S p n ⃗ d S = ∫ V ∇ ⃗ p d V -\int_Sp\vec n\,dS = \int_V\vec\nabla p\,dV
− ∫ S p n d S = ∫ V ∇ p d V
Here, because p p p is a scalar, we write ∇ ⃗ p \vec\nabla p ∇ p instead of ∇ ⃗ ⋅ p \vec\nabla\cdot p ∇ ⋅ p .
Now, we go back to the net force equation, which states that F ⃗ = d p ⃗ d t \vec F=\frac{d\vec p}{dt} F = d t d p . We have two forces we are considering in this simplified example:
Gravity force, which acts in the direction of the gravity vector
Pressure force, which acts in the direction opposite of the normal vector of the volume
We also consider the “net rate of momentum flow into V V V ” as a separate force into V V V . Because the volume is fixed but the particles are in motion, we account for the momentum of the volume changing as a result of fluid carrying momentum into or out of the volume through its boundaries. If we were dealing with solids, this would not be a factor since the particles would be fixed.
d d t ( momentum in V ) = net force on V + net rate of momentum flow into V \frac{d}{dt}(\textrm{momentum in V}) = \textrm{net force on V} + \textrm{net rate of momentum flow into V}
d t d ( momentum in V ) = net force on V + net rate of momentum flow into V
Note: Eulerian vs Lagrangian Perspective
Lagrangian: ∂ f ∂ t \frac{\partial f}{\partial t} ∂ t ∂ f
Eulerian: ∂ f ∂ t + ( u ⃗ ⋅ ∇ ⃗ ) f \frac{\partial f}{\partial t} + (\vec u \cdot \vec\nabla)f ∂ t ∂ f + ( u ⋅ ∇ ) f
Eulerian fluid analysis focuses on fixed points in space, whereas Lagrangian fluid analysis tracks individual particles by following them around and tracking how they change over time. Eulerian is more fitting for studying overall flow characteristics.
Back to derivation:
d d t ( momentum in V ) = net force on V + net rate of momentum flow into V \frac{d}{dt}(\textrm{momentum in V}) = \textrm{net force on V} + \textrm{net rate of momentum flow into V}
d t d ( momentum in V ) = net force on V + net rate of momentum flow into V
⇒ d d t ∫ V ρ u ⃗ d V = − ∫ S p n ⃗ d S + ∫ V ρ g ⃗ d V − ∫ S ( ρ u ⃗ ) ( u ⃗ ⋅ n ⃗ ) ⏟ vol per time d S \Rightarrow \frac{d}{dt}\int_V \rho\vec{u}\,dV=-\int_Sp\vec n\,dS+\int_V\rho\vec g \,dV - \int_S(\rho\vec{u})\underbrace{(\vec u \cdot \vec n)}_\textrm{vol per time}\,dS
⇒ d t d ∫ V ρ u d V = − ∫ S p n d S + ∫ V ρ g d V − ∫ S ( ρ u ) vol per time ( u ⋅ n ) d S
Apply Leibnez, divergence theorem, and reorder terms / move parentheses
∫ V ∂ ∂ t ( ρ u ⃗ ) d V = − ∫ V ∇ ⃗ p d V + ∫ V ρ g ⃗ d V − ∫ S ( ρ u ⃗ u ⃗ ) ⋅ n ⃗ d S \int_V\frac{\partial}{\partial t}(\rho \vec u)\,dV = -\int_V\vec\nabla p\,dV+\int_V\rho\vec g\, dV-\int_S(\rho\vec u \vec u)\cdot\vec n\,dS
∫ V ∂ t ∂ ( ρ u ) d V = − ∫ V ∇ p d V + ∫ V ρ g d V − ∫ S ( ρ u u ) ⋅ n d S
Combine like terms:
∫ V ∂ ∂ t ( ρ u ⃗ ) + ∇ ⃗ p − ρ g ⃗ d V = − ∫ S ( ρ u ⃗ u ⃗ ) ⋅ n ⃗ d S \int_V\frac{\partial}{\partial t}(\rho \vec u)+ \vec\nabla p -\rho\vec g\, dV=-\int_S(\rho\vec u \vec u)\cdot\vec n\,dS
∫ V ∂ t ∂ ( ρ u ) + ∇ p − ρ g d V = − ∫ S ( ρ u u ) ⋅ n d S
divergence theorem
∫ V ∂ ∂ t ( ρ u ⃗ ) + ∇ ⃗ p − ρ g ⃗ d V = − ∫ V ∇ ⃗ ⋅ ( ρ u ⃗ u ⃗ ) d V \int_V\frac{\partial}{\partial t}(\rho \vec u)+ \vec\nabla p -\rho\vec g\, dV=-\int_V\vec\nabla\cdot(\rho\vec u \vec u)\,dV
∫ V ∂ t ∂ ( ρ u ) + ∇ p − ρ g d V = − ∫ V ∇ ⋅ ( ρ u u ) d V
Inner expressions must be equal
∂ ∂ t ( ρ u ⃗ ) + ∇ ⃗ p − ρ g ⃗ + ∇ ⃗ ⋅ ( ρ u ⃗ u ⃗ ) = 0 ⃗ \frac{\partial}{\partial t}(\rho \vec u)+ \vec\nabla p -\rho\vec g +\vec\nabla\cdot(\rho\vec u \vec u)=\vec 0
∂ t ∂ ( ρ u ) + ∇ p − ρ g + ∇ ⋅ ( ρ u u ) = 0
product rule
∂ ρ ∂ t u ⃗ + ρ ∂ u ⃗ ∂ t + ∇ ⃗ ⋅ ( ρ u ⃗ ) u ⃗ + ρ ( u ⃗ ⋅ ∇ ⃗ u ⃗ ) = − ∇ ⃗ p + ρ g ⃗ \frac{\partial \rho}{\partial t}\vec u + \rho \frac{\partial \vec u}{\partial t} + \vec\nabla\cdot(\rho \vec u)\vec u + \rho (\vec u \cdot \vec\nabla \vec u) = - \vec\nabla p + \rho \vec g
∂ t ∂ ρ u + ρ ∂ t ∂ u + ∇ ⋅ ( ρ u ) u + ρ ( u ⋅ ∇ u ) = − ∇ p + ρ g
( ∂ ρ ∂ t + ∇ ⃗ ⋅ ( ρ u ⃗ ) ) ⏟ = 0 u ⃗ + ρ ( ∇ u ⃗ ∂ t + ( u ⃗ ⋅ ∇ ⃗ ) u ⃗ ) = − ∇ ⃗ + ρ g \underbrace{(\frac{\partial \rho}{\partial t} + \vec\nabla\cdot(\rho\vec u))}_{=0}\vec u + \rho (\frac{\nabla \vec u}{\partial t} + (\vec u\cdot \vec \nabla)\vec u) = -\vec\nabla + \rho g
= 0 ( ∂ t ∂ ρ + ∇ ⋅ ( ρ u )) u + ρ ( ∂ t ∇ u + ( u ⋅ ∇ ) u ) = − ∇ + ρ g
Total derivative: D f a = [ ∂ f ∂ x 1 ( a ) ⋯ ∂ f ∂ x n ( a ) ] Df_a = [\frac{\partial f}{\partial x_1}(a) \cdots \frac{\partial f}{\partial x_n}(a)] D f a = [ ∂ x 1 ∂ f ( a ) ⋯ ∂ x n ∂ f ( a )]
⇒ ( ∂ u ⃗ ∂ t + ( u ⃗ ⋅ ∇ ⃗ ) u ⃗ ) = D u ⃗ D t \Rightarrow(\frac{\partial \vec u}{\partial t}+(\vec u \cdot \vec\nabla)\vec u) = \frac{D\vec u}{D t}
⇒ ( ∂ t ∂ u + ( u ⋅ ∇ ) u ) = D t D u
⇒ ρ D u ⃗ D t = − ∇ p + ρ g ⃗ \Rightarrow \rho \frac{D\vec u}{D t} = -\nabla p+\rho\vec g
⇒ ρ D t D u = − ∇ p + ρ g
Euler's Equations
Law of conservation of mass
The rate of increase of mass in W W W equals the rate at which mass is crossing ∂ W \partial W ∂ W in the inward direction:
d d t ∫ W ρ d V = − ∫ ∂ W ρ u ⃗ ⋅ n ⃗ d A \frac{d}{dt}\int_W \rho dV = -\int_{\partial W} \rho \vec{u}\cdot \vec{n} d A
d t d ∫ W ρ d V = − ∫ ∂ W ρ u ⋅ n d A
This is the "integral form" of the law of conservation of mass .
Continuity Equation
By the divergence theorem, this is equivalent to
∫ W [ ∂ ρ ∂ t + d i v ( ρ u ⃗ ) ] d V = 0 \int_W\left[\frac{\partial\rho}{\partial t} + \mathrm{div}(\rho \vec u)\right] dV = 0
∫ W [ ∂ t ∂ ρ + div ( ρ u ) ] d V = 0
Because this is true for all W, it is equivalent to:
∂ ρ ∂ t + d i v ( ρ u ⃗ ) = 0 \boxed{
\frac{\partial\rho}{\partial t} + \mathrm{div}(\rho \vec u) = 0
}
∂ t ∂ ρ + div ( ρ u ) = 0
This is the continuity equation , also known as the "differential form" of the law of conservation of mass .
Material Derivative
D D t f = ( ∂ ∂ t + u ⃗ ⋅ ∇ ⃗ ) f \frac{D}{Dt}f = \left(\frac{\partial}{\partial t} + \vec{u}\cdot\vec{\nabla}\right)f
D t D f = ( ∂ t ∂ + u ⋅ ∇ ) f
Main idea : the material deriviative takes into account the fact that the fluid is moving and that the positions of fluid particles change with time.
Example:
D u ⃗ D t = d d t u ⃗ ( x ⃗ ( t ) , t ) \frac{D\vec u}{Dt} = \frac{d}{dt}\vec{u}(\vec{x}(t),t)
D t D u = d t d u ( x ( t ) , t )
gives the velocity following a fluid particle
Balance of momentum
An ideal fluid is one with the following property:
For any motion of the fluid there is a function p ( x ⃗ , t ) p(\vec x,t) p ( x , t ) called the pressure such that is S S S is a surface in the fluid with a chosen unit normal n ⃗ \vec n n , the force of stress exerted across the surface S S S per unit area at x ⃗ ∈ S \vec x \in S x ∈ S at time t t t is p ( x ⃗ , t ) n p(\vec x,t)n p ( x , t ) n .
Note that the force is in the direction n \mathbf{n} n and that the force acts orthogonally to the surface S S S ; that is, there are no tangental forces (see Figure 1.1.3).
If b ⃗ ( x ⃗ , t ) \vec{b}(\vec{x}, t) b ( x , t ) denotes the given body force per unit mass , we have the total body force B ⃗ = ∫ W ρ b ⃗ d V \vec B = \int_W \rho \vec b \,dV B = ∫ W ρ b d V . Thus, on any piece of fluid material, the force per unit volume is equal to − g r a d p + ρ b ⃗ -\mathrm{grad}\,p + \rho \vec b − grad p + ρ b .
This along with Newton's second law (F = d ρ / d t F=d\rho/dt F = d ρ / d t ) leads us to the differential form of the law of balance of momentum :
ρ D u ⃗ D t = − g r a d p + ρ b ⃗ \boxed{
\rho \frac{D \vec u}{D t} = -\mathrm{grad}\,p+\rho \vec b
}
ρ D t D u = − grad p + ρ b
In some other resources, the body force b b b is omitted (assumed to be zero) for simplicity. However, including b b b makes the formulation more general. Including ρ \rho ρ on both sides covers both compressible and incompressible fluids.
Balance of momentum is often used interchangeably with conservation of momentum. The use of the word balance emphasizes the ongoing equilibrium between the forces applied to a fluid and the resulting change in its momentum - accounting for the fact inflows, outflows, and forces must be accounted for at every instant, not just that total momentum doesn't change over time.
Newton's third law ensures that momentum exchanged internally cancels out, leaving only external forces to change total momentum.
Reynolds Transport Theorem
d d t ∫ W t ρ f d V = ∫ W t ρ D f D t d V \boxed{
\frac{d}{dt}\int_{W_t}\rho f\,dV=\int_{W_t}\rho\frac{Df}{Dt}\,dV
}
d t d ∫ W t ρ f d V = ∫ W t ρ D t D f d V
d d t ∫ W t f d V = ∫ W t ( ∂ f ∂ t + d i v ( f u ⃗ ) ) d V \boxed{
\frac{d}{dt}\int_{W_t} f\,dV=\int_{W_t}\left(\frac{\partial f}{\partial t}+ \mathrm{div}(f\vec u)\right)\,dV
}
d t d ∫ W t f d V = ∫ W t ( ∂ t ∂ f + div ( f u ) ) d V
momentum conservation
Incompressibility
We call a flow incompressible if for any fluid subregion W W W ,
volume ( W t ) = ∫ W T d V = constant in t \textrm{volume}(W_t) = \int_{W_T} dV = \textrm{constant in }t
volume ( W t ) = ∫ W T d V = constant in t
The following statements are equivalent:
the fluid is incompressible
d i v u ⃗ = 0 \mathrm{div}\,\vec u = 0 div u = 0
J ≡ 1 J \equiv 1 J ≡ 1
D ρ / D t = 0 D\rho/D t = 0 D ρ / D t = 0
"The mass density is constant following the fluid"
ρ \rho ρ is constant
∇ ⋅ u ⃗ = 0 \nabla\cdot\vec{u} = 0 ∇ ⋅ u = 0
Homogeneous fluid means that ρ \rho ρ is constant in space. Incompressible fluid means that ρ \rho ρ is constant in time.
[!NOTE]
Problems involving inhomogeneous incompressible flow occur, for example, in oceanography
Main idea : An increasing amount of fluid cannot be stored in a fixed volume, because that would necessitate that the fluid be compressed into that volume. The rate of fluid entering must be exactly matched by the rate leaving which mathematically implies that the div of the mass flow rate is zero: An increasing amount of fluid cannot be stored in a fixed volume, because that would necessitate that the fluid be compressed into that volume. The rate of fluid entering must be exactly matched by the rate leaving which mathematically implies that the div of the mass flow rate is zero.
Wikipedia article on incompressible flow
The Euler equations
Assuming incompressibility, the Euler equations are:
ρ D u ⃗ D t = − g r a d p + ρ b ⃗ D ρ D t = 0 d i v u ⃗ = 0 \begin{align}
\rho\frac{D\vec u}{D t} &= -\mathrm{grad}\,p + \rho \vec{b}\\
\frac{D\rho}{D t} &= 0\\
\mathrm{div}\,\vec{u} &= 0\\
\end{align}
ρ D t D u D t D ρ div u = − grad p + ρ b = 0 = 0
with the boundary conditions u ⃗ ⋅ n ⃗ = 0 on ∂ D \vec{u}\cdot\vec{n} = 0\textrm{ on }\partial D u ⋅ n = 0 on ∂ D .
Balance of momentum
Density conservation as fluid particles move (incompressibility)
Rate of fluid entering must exactly match the rate leaving (incompressibility)
See also: Elementary Fluid Dynamics
non-linear dynamics - chaos
bifercation analysis
solution looks normal, as parameter varys solution oscillates, etc
Boyce D. Prima
skim chapter 2
7.1 - 7.8
7.1 Intro 355
7.2 Review of Matrices 364
7.3 Linear Algebraic Equations; Linear Independence, Eigenvalues, Eigenvectors 374
7.4 Basic Theory of Systems of First Order Linear Equations 385
7.5 Homogeneous Linear Systems with Constant Coefficients 390
7.6 Complex Eigenvalues 401
7.7 Fundamental Matrices 414
7.8 Repeated Eigenvalues 422
Differential Equations
First-order ordinary differential equations (ODEs)
involve the first derivative of a function as its highest derivative
y ‘ + a ( x ) y = b ( x ) y` + a(x)y = b(x) y ‘ + a ( x ) y = b ( x )
1) Systems of First Order Linear Equations
x ( n ) + a n − 1 x ( n − 1 ) + ⋯ + a 2 x ′ ′ + a 1 x ′ + a 0 x = 0 x^{(n)} + a_{n-1}x^{(n-1)} + \dots + a_2x^{\prime\prime}+a_1x^\prime + a_0x = 0
x ( n ) + a n − 1 x ( n − 1 ) + ⋯ + a 2 x ′′ + a 1 x ′ + a 0 x = 0
Key idea : Every higher-order ODE can be rewritten as a system of first-order ODEs.
This is done by introducing new variables for each derivative:
x 1 = y , x 2 = y ′ , x 3 = y ′ ′ , … , x n = y ( n − 1 ) x_1 = y,\, x_2 = y^\prime,\, x_3 = y^{\prime\prime},\,\dots,\, x_n = y^{(n-1)}
x 1 = y , x 2 = y ′ , x 3 = y ′′ , … , x n = y ( n − 1 )
Then the system becomes:
x 1 ′ = x 2 , x 2 ′ = x 3 , x 3 ′ = x 4 , … , x n − 1 ′ = x n x_1^\prime = x_2,\\
x_2^\prime = x_3,\\
x_3^\prime = x_4,\\
\dots,\\
x_{n-1}^\prime = x_n
x 1 ′ = x 2 , x 2 ′ = x 3 , x 3 ′ = x 4 , … , x n − 1 ′ = x n
And we have:
x n ′ = F ( t , x 1 , x 2 , … , x n ) ⇒ x ( n ) = − a 0 x 1 − a 1 x 2 − a 2 x 3 − ⋯ − a n − 1 x n \begin{align}
x_n^\prime &= F(t, x_1, x_2, \dots, x_n)\\
\Rightarrow x^{(n)} &= -a_0x_1 - a_1x_2 - a_2x_3 - \dots - a_{n-1}x_n
\end{align}
x n ′ ⇒ x ( n ) = F ( t , x 1 , x 2 , … , x n ) = − a 0 x 1 − a 1 x 2 − a 2 x 3 − ⋯ − a n − 1 x n
This system of equations can be written in matrix form:
x ⃗ ′ = A x ⃗ \vec x^\prime = \mathbf{A}\vec x
x ′ = A x
Key idea : eigenvalues of A \mathbf{A} A are roots of characteristic polynomial; eigenvalue equation of equivalent system is equivalent to characteristic polynomial of the original higher-order equation
RecalL: e i g s ( A ) are λ ’s that satisfy d e t ( A − λ I ) = 0 \mathrm{eigs}(A) \textrm{ are } \lambda\textrm{'s that satisfy } \mathrm{det}(A-\lambda I) = 0 eigs ( A ) are λ ’s that satisfy det ( A − λ I ) = 0
5) Homogeneous Linear Systems with Constant Coefficients
For general system x ⃗ ′ = A x ⃗ \vec x^\prime = \mathbf{A}\vec x x ′ = A x , we want a change of coordinates x = T z ⃗ x = \mathbf{T}\vec z x = T z that diagonalizes my ODE
in this z-coordinate system, I want z ⃗ ′ = D z ⃗ \vec z^\prime = \mathbf{D}\vec z z ′ = D z , where D \mathbf{D} D is decoupled and diagonal.
z ⃗ ′ = D z ⃗ ⇒ z ⃗ ( t ) = e D t z ⃗ ( 0 ) \vec z^\prime = \mathbf{D}\vec z \Rightarrow \vec{z}(t) = e^{Dt}\vec z(0)
z ′ = D z ⇒ z ( t ) = e D t z ( 0 )
Eigenvalues
The eigenvalues r 1 , … , r n r_1,\,\dots,\,r_n r 1 , … , r n (which need not all be different) are roots of the $n$th degree polynomial equation
d e t ( A − r I ) = 0 \mathrm{det}(\mathbf{A}-r\mathbf{I}) = 0
det ( A − r I ) = 0
If A \mathbf{A} A is a real-valued matrix, there are three possibilities for the eigenvalues of A \mathbf{A} A :
All eigenvalues are real and different from each other.
Some eigenvalues occur in complex conjugate pairs.
Some eigenvalues are repeated.
If the eigenvalues are all real and different, then we have n n n eigenvectors v ⃗ 1 , … , v ⃗ n \vec{v}_1,\,\dots,\,\vec{v}_n v 1 , … , v n which are linearly independent. The corresponding solutions of the differential system are
x ( 1 ) ( t ) = v ⃗ 1 e r 1 t , … , x ( n ) ( t ) = v ⃗ n e r n t x^{(1)}(t) = \vec{v}_1e^{r_1t},\, \dots,\, x^{(n)}(t) = \vec{v}_ne^{r_nt}
x ( 1 ) ( t ) = v 1 e r 1 t , … , x ( n ) ( t ) = v n e r n t
To find eigenvalues, solve d e t ( A − λ I ) = 0 \mathrm{det}(A-\lambda I ) = 0 det ( A − λ I ) = 0 . For each eigenvalue λ \lambda λ , solve the equation ( A − λ I ) v = 0 (A-\lambda I)v = 0 ( A − λ I ) v = 0 to find the corresponding eigenvectors v v v .
Eigenvalues: three cases
eigenvalues are real and have opposite signs; x = 0 x=0 x = 0 is a saddle point.
eigenvalues are real and have the same sign but are unequal; x = 0 x=0 x = 0 is a node.
eigenvalues are complex with nonzero real part; x = 0 x=0 x = 0 is a spiral point.
Initial Value Problem
A = [ a 11 a 12 a 21 a 22 ] A=
\left[ {\begin{array}{cc}
a_{11} & a_{12} \\
a_{21} & a_{22} \\
\end{array} } \right]
A = [ a 11 a 21 a 12 a 22 ]
Compute eigenvalues λ 1 , λ 2 \lambda_1, \lambda_2 λ 1 , λ 2 and eigenvectors v 1 , v 2 v_1, v_2 v 1 , v 2 .
Then the general solution is:
x ( t ) = c 1 e λ 1 t v 1 + c 2 e λ 2 t v 2 x(t) = c_1e^{\lambda_1t}v_1 + c_2e^{\lambda_2t}v_2
x ( t ) = c 1 e λ 1 t v 1 + c 2 e λ 2 t v 2
The constants c 1 , c 2 c_1, c_2 c 1 , c 2 are chosen to meet initial conditions
IVP Using Diagonalization:
A = [ a 11 a 12 a 21 a 22 ] A=
\left[ {\begin{array}{cc}
a_{11} & a_{12} \\
a_{21} & a_{22} \\
\end{array} } \right]
A = [ a 11 a 21 a 12 a 22 ]
x ( t ) = e A t x 0 x(t) = e^{At} x_0
x ( t ) = e A t x 0
If A A A diagonalizable (has n n n linearly independent eigenvectors), then A = P D P − 1 A=PDP^{-1} A = P D P − 1 , with D D D diagonal and P P P invertable. The matrix P P P has the eigenvectors as its columns.
If A A A has n n n distinct eigenvalues, it is guaranteed to be diagonalizable.
A n × n A^{n\times n} A n × n having n n n distinct eigenvalues implies that A A A has n n n linearly independent eigenvectors (diagonalizable)
converse is not necessarily true.
If there are fewer than n n n linearly independent eigenvectors, the matrix cannot be diagonalized
D D D can be constructed by placing the eigenvalues on the main diagonal in any order, with zeros elsewhere. P P P is constructed using the corresponsing eigenvectors as its columns; the order of the eigenvectors must correspond to the order of the eigenvalues in matrix D D D .
Then, e A t = P e D t P − 1 e^{At} = Pe^{Dt}P^{-1} e A t = P e D t P − 1 , where e D t e^{Dt} e D t is a diagonal matrix with e λ 1 t e^{\lambda_1t} e λ 1 t and e λ 2 t e^{\lambda_2t} e λ 2 t on the diagonal
Steve Brunton - Motivating Eigenvalues and Eigenvectors with Differential Equations
Defective matrix
wiki article
note: distinct eigenvalues always have linearly independent eigenvectors.
eigenvalue has geometric multiplicity < \lt < algebraic multiplicity (defective eigenvalue)
A A A has fewer than n n n distinct eigenvalues
⇒ \Rightarrow ⇒ not possible to find n n n linearly independent eigenvectors
algebraic multiplicity: number of times eigenvalue appears as a root of the characteristic polynomial
geometric multiplicity: number of linearly independent eigenvectors
γ ( λ ) ≤ μ ( λ ) , ∀ λ \gamma(\lambda) \leq \mu(\lambda)\,, \forall \lambda γ ( λ ) ≤ μ ( λ ) , ∀ λ
defect: not enough eigenvectors to form a complete basis (not diagonalizable)
A A A defective if and only if it does not have n n n linearly independent eigenvectors.
geometric and algebraic multiplicity of eigenvalues
matrix diagonalization
( λ 1 = λ 2 = 2 ) (\lambda_1 = \lambda_2 = 2) ( λ 1 = λ 2 = 2 ) , algebraic mutliplicity equal to 2, μ ( λ 1 ) = 2 \mu(\lambda_1) = 2 μ ( λ 1 ) = 2
eigenspace of λ 1 \lambda_1 λ 1 is the linear space that contains all vectors v 1 v_1 v 1 of the form: v = v 11 ⋅ [ 1 0 ] T + v 21 ⋅ [ 0 1 ] T v=v_{11}\cdot [1\, 0]^T + v_{21}\cdot [0\, 1]^T v = v 11 ⋅ [ 1 0 ] T + v 21 ⋅ [ 0 1 ] T , eigenspace has dimension 2, geometric multiplicity equal to 2, γ ( λ 1 ) = 2 \gamma(\lambda_1) = 2 γ ( λ 1 ) = 2
Two cool notes
Matrix Power: A n = P D n P − 1 A^n = PD^nP^{-1} A n = P D n P − 1
Matrix Inverse: A − 1 = P D − 1 P − 1 A^{-1} = PD^{-1}P^{-1} A − 1 = P D − 1 P − 1
D − 1 = [ 1 / D 11 0 0 1 / D 22 ] D^{-1}=
\left[ {\begin{array}{cc}
1/D_{11} & 0 \\
0 & 1/D_{22} \\
\end{array} } \right]
D − 1 = [ 1/ D 11 0 0 1/ D 22 ]
Repetition of eigenvalues
pg 398
Condition Number
https://www.google.com/search?q=condition+numbers+of+a+matrix&rlz=1C5CHFA_enUS1119US1119&oq=condition+numbers+of+a+matrix&gs_lcrp=EgZjaHJvbWUyBggAEEUYOTIGCAEQRRg5MgoIAhAAGAoYFhgeMg0IAxAAGIYDGIAEGIoFMg0IBBAAGIYDGIAEGIoFMgoIBRAAGIAEGKIEMgoIBhAAGKIEGIkF0gEINTE5OGowajGoAgCwAgA&sourceid=chrome&ie=UTF-8
cs357
Condition Numbers - Numberical Methods Course
Loss Function Analysis
Semi simplicity :
equivalent to being diagonalizable for algebraically closed fields (field of complex numbers, not but not the field of real numbers)
diagonalizable :
non-defective
means there are enough independent eigenvectors
high condition number is a sign that a matrix is nearly non-diagonalizable
near-defective matrices exhibit unstable numerical properties and high sensitivity to perturbations (indicated by large condition numbers)
A square matrix that is not diagonalizable is called defective. It can happen that a matrix A A A with real entries is defective over the real numbers, meaning that A = P D P − 1 A=PDP^{-1} A = P D P − 1 is impossible for any invertible P P P and diagonal D D D with real entries, but it is possible with complex entries, so that A A A is diagonalizable over the complex numbers. For example, this is the case for a generic rotation matrix.
Here we assume T < < 1 T<<1 T << 1
J ( θ ) = ∫ 0 T ∥ x ⃗ θ ( t ) − x ⃗ ( t ) ∥ 2 d t J(\theta) = \int_0^T \| \vec x_\theta(t) - \vec x(t) \|^2 \,\text{d}t
J ( θ ) = ∫ 0 T ∥ x θ ( t ) − x ( t ) ∥ 2 d t
∂ J ∂ θ i = ∫ 0 T ∂ ∂ θ i [ ( x θ − x ) ⋅ ( x θ − x ) ] d t = 0 , ∀ i = 1 , 2 , 3 , 4 \frac{\partial J}{\partial \theta_i} = \int_0^T \frac{\partial}{\partial\theta_i}\left[ (x_\theta - x) \cdot (x_\theta - x)\right]\,\text{d}t = 0,\, \forall i=1,2,3,4
∂ θ i ∂ J = ∫ 0 T ∂ θ i ∂ [ ( x θ − x ) ⋅ ( x θ − x ) ] d t = 0 , ∀ i = 1 , 2 , 3 , 4
Condition Number
https://www.google.com/search?q=condition+numbers+of+a+matrix&rlz=1C5CHFA_enUS1119US1119&oq=condition+numbers+of+a+matrix&gs_lcrp=EgZjaHJvbWUyBggAEEUYOTIGCAEQRRg5MgoIAhAAGAoYFhgeMg0IAxAAGIYDGIAEGIoFMg0IBBAAGIYDGIAEGIoFMgoIBRAAGIAEGKIEMgoIBhAAGKIEGIkF0gEINTE5OGowajGoAgCwAgA&sourceid=chrome&ie=UTF-8
cs357
Condition Numbers - Numberical Methods Course