Template:Differential equations

Template:Short descriptionIn mathematics, variation of parameters, also known as variation of constants, is a general method to solve inhomogeneous linear ordinary differential equations.

For first-order inhomogeneous linear differential equations it is usually possible to find solutions via integrating factors or undetermined coefficients with considerably less effort, although those methods leverage heuristics that involve guessing and do not work for all inhomogeneous linear differential equations.

Variation of parameters extends to linear partial differential equations as well, specifically to inhomogeneous problems for linear evolution equations like the heat equation, wave equation, and vibrating plate equation. In this setting, the method is more often known as Duhamel's principle, named after Jean-Marie Duhamel (1797–1872) who first applied the method to solve the inhomogeneous heat equation. Sometimes variation of parameters itself is called Duhamel's principle and vice versa.

HistoryEdit

The method of variation of parameters was first sketched by the Swiss mathematician Leonhard Euler (1707–1783), and later completed by the Italian-French mathematician Joseph-Louis Lagrange (1736–1813).<ref>See:

A forerunner of the method of variation of a celestial body's orbital elements appeared in Euler's work in 1748, while he was studying the mutual perturbations of Jupiter and Saturn.<ref>Euler, L. (1748) "Recherches sur la question des inégalités du mouvement de Saturne et de Jupiter, sujet proposé pour le prix de l'année 1748, par l’Académie Royale des Sciences de Paris" [Investigations on the question of the differences in the movement of Saturn and Jupiter; this subject proposed for the prize of 1748 by the Royal Academy of Sciences (Paris)] (Paris, France: G. Martin, J.B. Coignard, & H.L. Guerin, 1749).</ref> In his 1749 study of the motions of the earth, Euler obtained differential equations for the orbital elements.<ref>Euler, L. (1749) "Recherches sur la précession des équinoxes, et sur la nutation de l’axe de la terre," Histoire [or Mémoires ] de l'Académie Royale des Sciences et Belles-lettres (Berlin), pages 289–325 [published in 1751].</ref> In 1753, he applied the method to his study of the motions of the moon.<ref>Euler, L. (1753) Theoria motus lunae: exhibens omnes ejus inaequalitates ... [The theory of the motion of the moon: demonstrating all of its inequalities ... ] (Saint Petersburg, Russia: Academia Imperialis Scientiarum Petropolitanae [Imperial Academy of Science (St. Petersburg)], 1753).</ref>

Lagrange first used the method in 1766.<ref>Lagrange, J.-L. (1766) “Solution de différens problèmes du calcul integral,” Mélanges de philosophie et de mathématique de la Société royale de Turin, vol. 3, pages 179–380.</ref> Between 1778 and 1783, he further developed the method in two series of memoirs: one on variations in the motions of the planets<ref>See:

Description of methodEdit

Given an ordinary non-homogeneous linear differential equation of order n

Template:NumBlk

Let <math>y_1(x), \ldots, y_n(x)</math> be a basis of the vector space of solutions of the corresponding homogeneous equation

Template:NumBlk

Then a particular solution to the non-homogeneous equation is given by

Template:NumBlk

where the <math>c_i(x)</math> are differentiable functions which are assumed to satisfy the conditions

Template:NumBlk

Starting with (Template:EquationNote), repeated differentiation combined with repeated use of (Template:EquationNote) gives Template:NumBlk

One last differentiation gives

Template:NumBlk

By substituting (Template:EquationNote) into (Template:EquationNote) and applying (Template:EquationNote) and (Template:EquationNote) it follows that Template:NumBlk

The linear system (Template:EquationNote and Template:EquationNote) of n equations can then be solved using Cramer's rule yielding

<math>c_i'(x) = \frac{W_i(x)}{W(x)}, \, \quad i=1,\ldots,n</math>

where <math>W(x)</math> is the Wronskian determinant of the basis <math>y_1(x), \ldots, y_n(x)</math> and <math>W_i(x)</math> is the Wronskian determinant of the basis with the i-th column replaced by <math>(0, 0, \ldots, b(x)).</math>

The particular solution to the non-homogeneous equation can then be written as

<math>\sum_{i=1}^n y_i(x) \, \int \frac{W_i(x)}{W(x)}\, \mathrm dx.</math>

Intuitive explanationEdit

Consider the equation of the forced dispersionless spring, in suitable units:

<math>x(t) + x(t) = F(t).</math>

Here Template:Math is the displacement of the spring from the equilibrium Template:Math, and Template:Math is an external applied force that depends on time. When the external force is zero, this is the homogeneous equation (whose solutions are linear combinations of sines and cosines, corresponding to the spring oscillating with constant total energy).

We can construct the solution physically, as follows. Between times <math>t=s</math> and <math>t=s+ds</math>, the momentum corresponding to the solution has a net change <math>F(s)\,ds</math> (see: Impulse (physics)). A solution to the inhomogeneous equation, at the present time Template:Math, is obtained by linearly superposing the solutions obtained in this manner, for Template:Math going between 0 and Template:Math.

The homogeneous initial-value problem, representing a small impulse <math>F(s)\,ds</math> being added to the solution at time <math>t=s</math>, is

<math>x(t)+x(t)=0,\quad x(s)=0,\ x'(s)=F(s)\,ds.</math>

The unique solution to this problem is easily seen to be <math>x(t) = F(s)\sin(t-s)\,ds</math>. The linear superposition of all of these solutions is given by the integral:

<math>x(t) = \int_0^t F(s)\sin(t-s)\,ds.</math>

To verify that this satisfies the required equation:

<math>x'(t)=\int_0^t F(s)\cos(t-s)\,ds</math>
<math>x(t) = F(t) - \int_0^tF(s)\sin(t-s)\,ds = F(t)-x(t),</math>

as required (see: Leibniz integral rule).

The general method of variation of parameters allows for solving an inhomogeneous linear equation

<math>Lx(t)=F(t)</math>

by means of considering the second-order linear differential operator L to be the net force, thus the total impulse imparted to a solution between time s and s+ds is F(s)ds. Denote by <math>x_s </math> the solution of the homogeneous initial value problem

<math>Lx(t)=0, \quad x(s)=0,\ x'(s)=F (s)\,ds. </math>

Then a particular solution of the inhomogeneous equation is

<math>x (t)=\int_0^t x_s (t)\,ds,</math>

the result of linearly superposing the infinitesimal homogeneous solutions. There are generalizations to higher order linear differential operators.

In practice, variation of parameters usually involves the fundamental solution of the homogeneous problem, the infinitesimal solutions <math>x_s </math> then being given in terms of explicit linear combinations of linearly independent fundamental solutions. In the case of the forced dispersionless spring, the kernel <math>\sin(t-s)=\sin t\cos s - \sin s\cos t </math> is the associated decomposition into fundamental solutions.

ExamplesEdit

First-order equationEdit

<math> y' + p(x)y = q(x) </math>

The complementary solution to our original (inhomogeneous) equation is the general solution of the corresponding homogeneous equation (written below):

<math> y' + p(x)y = 0 </math>

This homogeneous differential equation can be solved by different methods, for example separation of variables:

<math>\frac{d}{dx} y + p(x)y = 0 </math>
<math>\frac{dy}{dx}=-p(x)y </math>
<math>{dy \over y} = -{p(x)\,dx},</math>
<math>\int \frac{1}{ y} \, dy = -\int p(x) \, dx </math>
<math>\ln |y| = -\int p(x) \, dx + C </math>
<math>y = \pm e^{-\int p(x) \, dx +C } = C_0 e^{-\int p(x) \, dx}</math>

The complementary solution to our original equation is therefore:

<math>y_c = C_0 e^{-\int p(x) \, dx}</math>

Now we return to solving the non-homogeneous equation:

<math> y' + p(x)y = q(x)</math>

Using the method variation of parameters, the particular solution is formed by multiplying the complementary solution by an unknown function C(x):

<math>y_p = C(x) e^{-\int p(x) \, dx}</math>

By substituting the particular solution into the non-homogeneous equation, we can find C(x):

<math> C' (x) e^{-\int p(x) \, dx} - C(x) p(x) e^{-\int p(x) \, dx} + p(x) C(x) e^{-\int p(x) \, dx} = q(x)</math>
<math> C' (x) e^{-\int p(x) \, dx} = q(x)</math>
<math> C' (x) = q(x) e^{\int p(x) \, dx} </math>
<math> C(x) =\int q(x) e^{\int p(x) \, dx} \, dx + C_1 </math>

We only need a single particular solution, so we arbitrarily select <math>C_1=0</math> for simplicity. Therefore the particular solution is:

<math>y_p =e^{-\int p(x) \, dx} \int q(x) e^{\int p(x) \, dx} \, dx</math>

The final solution of the differential equation is:

<math>\begin{align}

y &= y_c + y_p\\ &=C_0 e^{-\int p(x) \, dx} + e^{-\int p(x) \, dx} \int q(x) e^{\int p(x) \, dx} \, dx \end{align}</math>

This recreates the method of integrating factors.

Specific second-order equationEdit

Let us solve

<math> y+4y'+4y = \cosh x</math>

We want to find the general solution to the differential equation, that is, we want to find solutions to the homogeneous differential equation

<math>y+4y'+4y=0. </math>

The characteristic equation is:

<math>\lambda^2+4\lambda+4=(\lambda+2)^2=0 </math>

Since <math>\lambda=-2</math> is a repeated root, we have to introduce a factor of x for one solution to ensure linear independence: <math> u_1 = e^{-2x} </math> and <math> u_2 =x e^{-2x}</math>. The Wronskian of these two functions is

<math>W=\begin{vmatrix}
 e^{-2x} & xe^{-2x} \\

-2e^{-2x} & -e^{-2x}(2x-1)\\ \end{vmatrix} = -e^{-2x}e^{-2x}(2x-1)+2xe^{-2x}e^{-2x} = e^{-4x}. </math>

Because the Wronskian is non-zero, the two functions are linearly independent, so this is in fact the general solution for the homogeneous differential equation (and not a mere subset of it).

We seek functions A(x) and B(x) so A(x)u1 + B(x)u2 is a particular solution of the non-homogeneous equation. We need only calculate the integrals

<math>A(x) = - \int {1\over W} u_2(x) b(x)\,\mathrm dx,\; B(x) = \int {1 \over W} u_1(x)b(x)\,\mathrm dx</math>

Recall that for this example

<math>b(x) = \cosh x</math>

That is,

<math>A(x) = - \int {1\over e^{-4x}} xe^{-2x} \cosh x \,\mathrm dx = - \int xe^{2x}\cosh x \,\mathrm dx = -{1\over 18}e^x\left(9(x-1)+e^{2x}(3x-1)\right)+C_1</math>
<math>B(x) = \int {1 \over e^{-4x}} e^{-2x} \cosh x \,\mathrm dx = \int e^{2x}\cosh x\,\mathrm dx ={1\over 6}e^x\left(3+e^{2x}\right)+C_2 </math>

where <math>C_1</math> and <math>C_2</math> are constants of integration.

General second-order equationEdit

We have a differential equation of the form

<math>u+p(x)u'+q(x)u=f(x)</math>

and we define the linear operator

<math>L=D^2+p(x)D+q(x)</math>

where D represents the differential operator. We therefore have to solve the equation <math>L u(x)=f(x)</math> for <math>u(x)</math>, where <math>L</math> and <math>f(x)</math> are known.

We must solve first the corresponding homogeneous equation:

<math>u+p(x)u'+q(x)u=0</math>

by the technique of our choice. Once we've obtained two linearly independent solutions to this homogeneous differential equation (because this ODE is second-order) — call them u1 and u2 — we can proceed with variation of parameters.

Now, we seek the general solution to the differential equation <math> u_G(x)</math> which we assume to be of the form

<math>u_G(x)=A(x)u_1(x)+B(x)u_2(x).</math>

Here, <math>A(x)</math> and <math>B(x)</math> are unknown and <math>u_1(x)</math> and <math>u_2(x)</math> are the solutions to the homogeneous equation. (Observe that if <math>A(x)</math> and <math>B(x)</math> are constants, then <math>Lu_G(x)=0</math>.) Since the above is only one equation and we have two unknown functions, it is reasonable to impose a second condition. We choose the following:

<math>A'(x)u_1(x)+B'(x)u_2(x)=0.</math>

Now,

<math>\begin{align}

u_G'(x) &= \left (A(x)u_1(x)+B(x)u_2(x) \right )' \\ &= \left (A(x)u_1(x) \right )'+ \left (B(x)u_2(x) \right )'\\ &=A'(x)u_1(x)+A(x)u_1'(x)+B'(x)u_2(x)+B(x)u_2'(x)\\ &=A'(x)u_1(x)+B'(x)u_2(x)+A(x)u_1'(x)+B(x)u_2'(x) \\ &= A(x)u_1'(x)+B(x)u_2'(x) \end{align}</math>

Differentiating again (omitting intermediary steps)

<math>u_G(x)=A(x)u_1(x)+B(x)u_2(x)+A'(x)u_1'(x)+B'(x)u_2'(x).</math>

Now we can write the action of L upon uG as

<math>Lu_G=A(x)Lu_1(x)+B(x)Lu_2(x)+A'(x)u_1'(x)+B'(x)u_2'(x).</math>

Since u1 and u2 are solutions, then

<math>Lu_G=A'(x)u_1'(x)+B'(x)u_2'(x).</math>

We have the system of equations

<math>\begin{bmatrix}

u_1(x) & u_2(x) \\ u_1'(x) & u_2'(x) \end{bmatrix} \begin{bmatrix} A'(x) \\ B'(x)\end{bmatrix} = \begin{bmatrix} 0 \\ f \end{bmatrix}.</math>

Expanding,

<math>\begin{bmatrix}

A'(x)u_1(x)+B'(x)u_2(x)\\ A'(x)u_1'(x)+B'(x)u_2'(x) \end{bmatrix} = \begin{bmatrix} 0\\f\end{bmatrix}.</math>

So the above system determines precisely the conditions

<math>A'(x)u_1(x)+B'(x)u_2(x)=0.</math>
<math>A'(x)u_1'(x)+B'(x)u_2'(x)=Lu_G=f.</math>

We seek A(x) and B(x) from these conditions, so, given

<math>\begin{bmatrix}

u_1(x) & u_2(x) \\ u_1'(x) & u_2'(x) \end{bmatrix} \begin{bmatrix} A'(x) \\ B'(x)\end{bmatrix} = \begin{bmatrix} 0\\ f\end{bmatrix}</math>

we can solve for (A′(x), B′(x))T, so

<math>\begin{bmatrix} A'(x) \\ B'(x) \end{bmatrix} =

\begin{bmatrix} u_1(x) & u_2(x) \\ u_1'(x) & u_2'(x) \end{bmatrix}^{-1} \begin{bmatrix} 0\\ f \end{bmatrix} =\frac{1}{W} \begin{bmatrix} u_2'(x) & -u_2(x) \\ -u_1'(x) & u_1(x) \end{bmatrix} \begin{bmatrix} 0\\ f \end{bmatrix},</math>

where W denotes the Wronskian of u1 and u2. (We know that W is nonzero, from the assumption that u1 and u2 are linearly independent.) So,

<math> \begin{align}

A'(x) &= - {1\over W} u_2(x) f(x), & B'(x) &= {1 \over W} u_1(x)f(x) \\ A(x) &= - \int {1\over W} u_2(x) f(x)\,\mathrm dx, & B(x) &= \int {1 \over W} u_1(x)f(x)\,\mathrm dx \end{align}</math>

While homogeneous equations are relatively easy to solve, this method allows the calculation of the coefficients of the general solution of the inhomogeneous equation, and thus the complete general solution of the inhomogeneous equation can be determined.

Note that <math>A(x)</math> and <math> B(x)</math> are each determined only up to an arbitrary additive constant (the constant of integration). Adding a constant to <math>A(x)</math> or <math>B(x)</math> does not change the value of <math>Lu_G(x)</math> because the extra term is just a linear combination of u1 and u2, which is a solution of <math>L</math> by definition.

See alsoEdit

NotesEdit

Template:Reflist

ReferencesEdit

External linksEdit