Mean value theorem

Revision as of 00:59, 4 May 2025 by imported>Hillers45 (Added Calculus temp;ate, as all links have the template apart from Mean Value Theorem)
(diff) ← Previous revision | Latest revision (diff) | Newer revision → (diff)

Template:Short description Template:For{{#invoke:sidebar|collapsible | class = plainlist | titlestyle = padding-bottom:0.25em; | pretitle = Part of a series of articles about | title = Calculus | image = <math>\int_{a}^{b} f'(t) \, dt = f(b) - f(a)</math> | listtitlestyle = text-align:center; | liststyle = border-top:1px solid #aaa;padding-top:0.15em;border-bottom:1px solid #aaa; | expanded = | abovestyle = padding:0.15em 0.25em 0.3em;font-weight:normal; | above =

Template:Startflatlist

Template:EndflatlistTemplate:Startflatlist

Template:Endflatlist

| list2name = differential | list2titlestyle = display:block;margin-top:0.65em; | list2title = Template:Bigger | list2 ={{#invoke:sidebar|sidebar|child=yes

 |contentclass=hlist
 | heading1 = Definitions
 | content1 =
 | heading2 = Concepts
 | content2 =
 | heading3 = Rules and identities
 | content3 =
}}

| list3name = integral | list3title = Template:Bigger | list3 ={{#invoke:sidebar|sidebar|child=yes

 |contentclass=hlist
 | content1 =

| heading2 = Definitions

 | content2 =
 | heading3 = Integration by
 | content3 =
}}

| list4name = series | list4title = Template:Bigger | list4 ={{#invoke:sidebar|sidebar|child=yes

 |contentclass=hlist
 | content1 =
 | heading2 = Convergence tests
 | content2 =
}}

| list5name = vector | list5title = Template:Bigger | list5 ={{#invoke:sidebar|sidebar|child=yes

 |contentclass=hlist
 | content1 =
 | heading2 = Theorems
 | content2 =
}}

| list6name = multivariable | list6title = Template:Bigger | list6 ={{#invoke:sidebar|sidebar|child=yes

 |contentclass=hlist
 | heading1 = Formalisms
 | content1 =
 | heading2 = Definitions
 | content2 =
}}

| list7name = advanced | list7title = Template:Bigger | list7 ={{#invoke:sidebar|sidebar|child=yes

 |contentclass=hlist
 | content1 =
}}

| list8name = specialized | list8title = Template:Bigger | list8 =

| list9name = miscellanea | list9title = Template:Bigger | list9 =

}} In mathematics, the mean value theorem (or Lagrange's mean value theorem) states, roughly, that for a given planar arc between two endpoints, there is at least one point at which the tangent to the arc is parallel to the secant through its endpoints. It is one of the most important results in real analysis. This theorem is used to prove statements about a function on an interval starting from local hypotheses about derivatives at points of the interval.

Template:TOC limit

HistoryEdit

A special case of this theorem for inverse interpolation of the sine was first described by Parameshvara (1380–1460), from the Kerala School of Astronomy and Mathematics in India, in his commentaries on Govindasvāmi and Bhāskara II.<ref>J. J. O'Connor and E. F. Robertson (2000). Paramesvara, MacTutor History of Mathematics archive.</ref> A restricted form of the theorem was proved by Michel Rolle in 1691; the result was what is now known as Rolle's theorem, and was proved only for polynomials, without the techniques of calculus. The mean value theorem in its modern form was stated and proved by Augustin Louis Cauchy in 1823.<ref>{{#invoke:citation/CS1|citation |CitationClass=web }}</ref> Many variations of this theorem have been proved since then.<ref>Template:Cite journal</ref><ref>Template:Cite book</ref>

StatementEdit

File:Mittelwertsatz3.svg
The function <math>f</math> attains the slope of the secant between <math>a</math> and <math>b</math> as the derivative at the point <math>\xi\in(a,b)</math>.
File:Mittelwertsatz6.svg
It is also possible that there are multiple tangents parallel to the secant.

Let <math>f:[a,b]\to\R</math> be a continuous function on the closed interval Template:Nowrap and differentiable on the open interval Template:Nowrap where Template:Nowrap Then there exists some <math>c</math> in <math>(a,b)</math> such that:Template:Sfn

<math>f'(c)=\frac{f(b)-f(a)}{b-a}.</math>

The mean value theorem is a generalization of Rolle's theorem, which assumes <math>f(a)=f(b)</math>, so that the right-hand side above is zero.

The mean value theorem is still valid in a slightly more general setting. One only needs to assume that <math>f:[a,b]\to\R</math> is continuous on <math>[a,b]</math>, and that for every <math>x</math> in <math>(a,b)</math> the limit

<math>\lim_{h\to 0}\frac{f(x+h)-f(x)}{h}</math>

exists as a finite number or equals <math>\infty</math> or <math>-\infty</math>. If finite, that limit equals <math>f'(x)</math>. An example where this version of the theorem applies is given by the real-valued cube root function mapping <math>x \mapsto x^{1/3}</math>, whose derivative tends to infinity at the origin.

ProofEdit

The expression <math display="inline">\frac{f(b)-f(a)}{b-a}</math> gives the slope of the line joining the points <math>(a,f(a))</math> and <math>(b,f(b))</math>, which is a chord of the graph of <math>f</math>, while <math>f'(x)</math> gives the slope of the tangent to the curve at the point <math>(x,f(x))</math>. Thus the mean value theorem says that given any chord of a smooth curve, we can find a point on the curve lying between the end-points of the chord such that the tangent of the curve at that point is parallel to the chord. The following proof illustrates this idea.

Define <math>g(x)=f(x)-rx</math>, where <math>r</math> is a constant. Since <math>f</math> is continuous on <math>[a,b]</math> and differentiable on <math>(a,b)</math>, the same is true for <math>g</math>. We now want to choose <math>r</math> so that <math>g</math> satisfies the conditions of Rolle's theorem. Namely

<math>\begin{align}

g(a)=g(b)&\iff f(a)-ra=f(b)-rb\\ &\iff r(b-a)=f(b)-f(a) \\ &\iff r=\frac{f(b)-f(a)}{b-a} . \end{align}</math>

By Rolle's theorem, since <math>g</math> is differentiable and <math>g(a)=g(b)</math>, there is some <math>c</math> in <math>(a,b)</math> for which <math>g'(c)=0</math> , and it follows from the equality <math>g(x)=f(x)-rx</math> that,

<math>\begin{align}

&g'(x) = f'(x) -r \\ & g'(c) = 0\\ &g'(c) = f'(c) - r = 0 \\ &\Rightarrow f'(c) = r = \frac{f(b)-f(a)}{b-a} \end{align}</math>

ImplicationsEdit

Theorem 1: Assume that <math>f</math> is a continuous, real-valued function, defined on an arbitrary interval <math>I</math> of the real line. If the derivative of <math>f</math> at every interior point of the interval <math>I</math> exists and is zero, then <math>f</math> is constant on <math>I</math>.

Proof: Assume the derivative of <math>f</math> at every interior point of the interval <math>I</math> exists and is zero. Let <math>(a, b)</math> be an arbitrary open interval in <math>I</math>. By the mean value theorem, there exists a point <math>c</math> in <math>(a, b)</math> such that

<math>0=f'(c)=\frac{f(b)-f(a)}{b-a}.</math>

This implies that Template:Nowrap. Thus, <math>f</math> is constant on the interior of <math>I</math> and thus is constant on <math>I</math> by continuity. (See below for a multivariable version of this result.)

Remarks:

Theorem 2: If <math>f'(x) = g'(x)</math> for all <math>x</math> in an interval <math>(a, b)</math> of the domain of these functions, then <math>f - g</math> is constant, i.e. <math>f = g + c</math> where <math>c</math> is a constant on <math>(a, b)</math>.

Proof: Let <math>F(x) = f(x) - g(x)</math>, then <math>F'(x)=f'(x)-g'(x)=0</math> on the interval <math>(a, b)</math>, so the above theorem 1 tells that <math>F(x) = f(x) - g(x)</math> is a constant <math>c</math> or <math>f = g + c</math>.

Theorem 3: If <math>F</math> is an antiderivative of <math>f</math> on an interval <math>I</math>, then the most general antiderivative of <math>f</math> on <math>I</math> is <math>F(x) + c</math> where <math>c</math> is a constant.

Proof: It directly follows from the theorem 2 above.

Cauchy's mean value theoremEdit

Cauchy's mean value theorem, also known as the extended mean value theorem, is a generalization of the mean value theorem.<ref>{{#invoke:citation/CS1|citation |CitationClass=web }}</ref>Template:Sfn It states: if the functions <math>f</math> and <math>g</math> are both continuous on the closed interval <math>[a,b]</math> and differentiable on the open interval <math>(a,b)</math>, then there exists some <math>c \in (a,b)</math>, such that

File:Cauchy.svg
Geometrical meaning of Cauchy's theorem
<math>(f(b)-f(a))g'(c)=(g(b)-g(a))f'(c).</math>

Of course, if <math>g(a) \neq g(b)</math> and <math>g'(c) \neq 0</math>, this is equivalent to:

<math>\frac{f'(c)}{g'(c)}=\frac{f(b)-f(a)}{g(b)-g(a)}.</math>

Geometrically, this means that there is some tangent to the graph of the curve<ref>Template:Cite news</ref>

<math>\begin{cases}[a,b] \to \R^2\\t\mapsto (f(t),g(t))\end{cases}</math>

which is parallel to the line defined by the points <math>(f(a), g(a))</math> and <math>(f(b), g(b))</math>. However, Cauchy's theorem does not claim the existence of such a tangent in all cases where <math>(f(a), g(a))</math> and <math>(f(b), g(b))</math> are distinct points, since it might be satisfied only for some value <math>c</math> with <math>f'(c) = g'(c) = 0</math>, in other words a value for which the mentioned curve is stationary; in such points no tangent to the curve is likely to be defined at all. An example of this situation is the curve given by

<math>t \mapsto \left(t^3,1-t^2\right),</math>

which on the interval <math>[-1,1]</math> goes from the point <math>(-1, 0)</math> to <math>(1, 0)</math>, yet never has a horizontal tangent; however it has a stationary point (in fact a cusp) at <math>t = 0</math>.

Cauchy's mean value theorem can be used to prove L'Hôpital's rule. The mean value theorem is the special case of Cauchy's mean value theorem when <math>g(t) = t</math>.

ProofEdit

The proof of Cauchy's mean value theorem is based on the same idea as the proof of the mean value theorem.

Define <math>h(x) = (g(b)-g(a))f(x) - (f(b)-f(a))g(x)</math>, then we easily see <math>h(a)=h(b)=f(a)g(b)-f(b)g(a)</math>.

Since <math>f</math> and <math>g</math> are continuous on <math>[a,b]</math> and differentiable on <math>(a,b)</math>, the same is true for <math>h</math>. All in all, <math>h</math> satisfies the conditions of Rolle's theorem. Consequently, there is some <math>c</math> in <math>(a,b)</math> for which <math>h'(c) = 0</math>. Now using the definition of <math>h</math> we have:

<math>0=h'(c)=(g(b)-g(a))f'(c)-(f(b)-f(a))g'(c)</math>

The result easily follows.

Mean value theorem in several variablesEdit

The mean value theorem generalizes to real functions of multiple variables. The trick is to use parametrization to create a real function of one variable, and then apply the one-variable theorem.

Let <math>G</math> be an open subset of <math>\R^n</math>, and let <math>f:G\to\R</math> be a differentiable function. Fix points <math>x,y\in G</math> such that the line segment between <math>x, y</math> lies in <math>G</math>, and define <math>g(t)=f\big((1-t)x+ty\big)</math>. Since <math>g</math> is a differentiable function in one variable, the mean value theorem gives:

<math>g(1)-g(0)=g'(c)</math>

for some <math>c</math> between 0 and 1. But since <math>g(1)=f(y)</math> and <math>g(0)=f(x)</math>, computing <math>g'(c)</math> explicitly we have:

<math>f(y)-f(x)=\nabla f\big((1-c)x+cy\big)\cdot (y-x)</math>

where <math>\nabla</math> denotes a gradient and <math>\cdot</math> a dot product. This is an exact analog of the theorem in one variable (in the case <math>n=1</math> this is the theorem in one variable). By the Cauchy–Schwarz inequality, the equation gives the estimate:

<math>\Bigl|f(y)-f(x)\Bigr| \le \Bigl|\nabla f\big((1-c)x+cy\big)\Bigr|\ \Bigl|y - x\Bigr|.</math>

In particular, when <math>G</math> is convex and the partial derivatives of <math>f</math> are bounded, <math>f</math> is Lipschitz continuous (and therefore uniformly continuous).

As an application of the above, we prove that <math>f</math> is constant if the open subset <math>G</math> is connected and every partial derivative of <math>f</math> is 0. Pick some point <math>x_0\in G</math>, and let <math>g(x)=f(x)-f(x_0)</math>. We want to show <math>g(x)=0</math> for every <math>x\in G</math>. For that, let <math>E=\{x\in G:g(x)=0\}</math>. Then <math>E</math> is closed in <math>G</math> and nonempty. It is open too: for every <math>x\in E</math> ,

<math>\Big|g(y)\Big|=\Big|g(y)-g(x)\Big|\le (0)\Big|y-x\Big|=0</math>

for every <math>y</math> in open ball centered at <math>x</math> and contained in <math>G</math>. Since <math>G</math> is connected, we conclude <math>E=G</math>.

The above arguments are made in a coordinate-free manner; hence, they generalize to the case when <math>G</math> is a subset of a Banach space.

Mean value theorem for vector-valued functionsEdit

There is no exact analog of the mean value theorem for vector-valued functions (see below). However, there is an inequality which can be applied to many of the same situations to which the mean value theorem is applicable in the one dimensional case:Template:Sfn

Template:Math theorem

Template:Math proof

Mean value inequalityEdit

Template:See also Jean Dieudonné in his classic treatise Foundations of Modern Analysis discards the mean value theorem and replaces it by mean inequality as the proof is not constructive and one cannot find the mean value and in applications one only needs mean inequality. Serge Lang in Analysis I uses the mean value theorem, in integral form, as an instant reflex but this use requires the continuity of the derivative. If one uses the Henstock–Kurzweil integral one can have the mean value theorem in integral form without the additional assumption that derivative should be continuous as every derivative is Henstock–Kurzweil integrable.

The reason why there is no analog of mean value equality is the following: If Template:Math is a differentiable function (where Template:Math is open) and if Template:Math, Template:Math is the line segment in question (lying inside Template:Mvar), then one can apply the above parametrization procedure to each of the component functions Template:Math of f (in the above notation set Template:Math). In doing so one finds points Template:Math on the line segment satisfying

<math>f_i(x+h) - f_i(x) = \nabla f_i (x + t_ih) \cdot h.</math>

But generally there will not be a single point Template:Math on the line segment satisfying

<math>f_i(x+h) - f_i(x) = \nabla f_i (x + t^* h) \cdot h.</math>

for all Template:Mvar simultaneously. For example, define:

<math>\begin{cases}

f : [0, 2 \pi] \to \R^2 \\ f(x) = (\cos(x), \sin(x)) \end{cases}</math>

Then <math>f(2\pi) - f(0) = \mathbf{0} \in \R^2</math>, but <math>f_1'(x) = -\sin (x)</math> and <math>f_2'(x) = \cos (x)</math> are never simultaneously zero as <math>x</math> ranges over <math>\left[0, 2 \pi\right]</math>.

The above theorem implies the following: Template:Math theorem

In fact, the above statement suffices for many applications and can be proved directly as follows. (We shall write <math>f</math> for <math>\textbf{f}</math> for readability.) Template:Math proof

Cases where the theorem cannot be appliedEdit

All conditions for the mean value theorem are necessary:

  1. <math>\boldsymbol{f(x)} </math> is differentiable on <math>\boldsymbol{(a,b)} </math>
  2. <math>\boldsymbol{f(x)} </math> is continuous on <math>\boldsymbol{[a,b]} </math>
  3. <math>\boldsymbol{f(x)} </math> is real-valued

When one of the above conditions is not satisfied, the mean value theorem is not valid in general, and so it cannot be applied.

The necessity of the first condition can be seen by the counterexample where the function <math>f(x)=|x| </math> on [-1,1] is not differentiable.

The necessity of the second condition can be seen by the counterexample where the function <math display="block">f(x) = \begin{cases} 1, & \text{at }x=0 \\ 0, & \text{if }x\in( 0,1] \end{cases} </math> satisfies criteria 1 since <math>f'(x)=0 </math> on <math>(0,1) </math> but not criteria 2 since <math display="block">\frac{f(1)-f(0)}{1-0}=-1 </math> and <math>-1\neq 0 =f'(x) </math> for all <math>x\in (0,1) </math> so no such <math>c </math> exists.

The theorem is false if a differentiable function is complex-valued instead of real-valued. For example, if <math>f(x) = e^{xi}</math> for all real <math>x</math>, then <math display="block">f(2\pi)-f(0)=0=0(2\pi-0)</math> while <math>f'(x)\ne 0</math> for any real <math>x</math>.

Mean value theorems for definite integralsEdit

First mean value theorem for definite integralsEdit

File:积分中值定理.jpg
CitationClass=web }}</ref>

Let f : [a, b] → R be a continuous function. Then there exists c in (a, b) such that

<math>\int_a^b f(x) \, dx = f(c)(b - a).</math>

This follows at once from the fundamental theorem of calculus, together with the mean value theorem for derivatives. Since the mean value of f on [a, b] is defined as

<math>\frac{1}{b-a} \int_a^b f(x) \, dx,</math>

we can interpret the conclusion as f achieves its mean value at some c in (a, b).<ref name="Comenetz2002">Template:Cite book</ref>

In general, if f : [a, b] → R is continuous and g is an integrable function that does not change sign on [a, b], then there exists c in (a, b) such that

<math>\int_a^b f(x) g(x) \, dx = f(c) \int_a^b g(x) \, dx.</math>

Second mean value theorem for definite integralsEdit

There are various slightly different theorems called the second mean value theorem for definite integrals. A commonly found version is as follows:

If <math>G : [a,b]\to \mathbb{R}</math> is a positive monotonically decreasing function and <math>\varphi : [a,b]\to \mathbb{R}</math> is an integrable function, then there exists a number x in (a, b] such that
<math> \int_a^b G(t)\varphi(t)\,dt = G(a^+) \int_a^x \varphi(t)\,dt. </math>

Here <math>G(a^+)</math> stands for <math display="inline">{\lim_{x\to a^+}G(x)}</math>, the existence of which follows from the conditions. Note that it is essential that the interval (a, b] contains b. A variant not having this requirement is:<ref>Template:Cite journal</ref>

If <math>G : [a,b]\to \mathbb{R}</math> is a monotonic (not necessarily decreasing and positive) function and <math>\varphi : [a,b]\to \mathbb{R}</math> is an integrable function, then there exists a number x in (a, b) such that
<math> \int_a^b G(t)\varphi(t)\,dt = G(a^+) \int_a^x \varphi(t)\,dt + G(b^-) \int_x^b \varphi(t)\,dt. </math>

If the function <math>G</math> returns a multi-dimensional vector, then the MVT for integration is not true, even if the domain of <math>G</math> is also multi-dimensional.

For example, consider the following 2-dimensional function defined on an <math>n</math>-dimensional cube:

<math>\begin{cases}

G: [0,2\pi]^n \to \R^2 \\ G(x_1, \dots, x_n) = \left(\sin(x_1 + \cdots + x_n), \cos(x_1 + \cdots + x_n) \right) \end{cases} </math>

Then, by symmetry it is easy to see that the mean value of <math>G</math> over its domain is (0,0):

<math>\int_{[0,2\pi]^n} G(x_1,\dots,x_n) dx_1 \cdots dx_n = (0,0)</math>

However, there is no point in which <math>G=(0,0)</math>, because <math>|G|=1</math> everywhere.

GeneralizationsEdit

Linear algebraEdit

Assume that <math>f, g,</math> and <math>h</math> are differentiable functions on <math>(a,b)</math> that are continuous on <math>[a,b]</math>. Define

<math> D(x) = \begin{vmatrix}

f(x) & g(x) & h(x)\\ f(a) & g(a) & h(a)\\ f(b) & g(b) & h(b) \end{vmatrix}</math>

There exists <math>c\in(a,b)</math> such that <math> D'(c)=0</math>.

Notice that

<math> D'(x) = \begin{vmatrix}

f'(x) & g'(x)& h'(x)\\ f(a) & g(a) & h(a)\\ f(b) & g(b)& h(b) \end{vmatrix}</math> and if we place <math>h(x)=1</math>, we get Cauchy's mean value theorem. If we place <math>h(x)=1</math> and <math>g(x)=x</math> we get Lagrange's mean value theorem.

The proof of the generalization is quite simple: each of <math>D(a)</math> and <math>D(b)</math> are determinants with two identical rows, hence <math>D(a)=D(b)=0</math>. The Rolle's theorem implies that there exists <math>c\in (a,b)</math> such that <math>D'(c)=0</math>.

Probability theoryEdit

Let X and Y be non-negative random variables such that E[X] < E[Y] < ∞ and <math>X \leq_{st} Y</math> (i.e. X is smaller than Y in the usual stochastic order). Then there exists an absolutely continuous non-negative random variable Z having probability density function

<math> f_Z(x)={\Pr(Y>x)-\Pr(X>x)\over {\rm E}[Y]-{\rm E}[X]}\,, \qquad x\geqslant 0.</math>

Let g be a measurable and differentiable function such that E[g(X)], E[g(Y)] < ∞, and let its derivative g′ be measurable and Riemann-integrable on the interval [x, y] for all yx ≥ 0. Then, E[g′(Z)] is finite and<ref>Template:Cite journal</ref>

<math> {\rm E}[g(Y)]-{\rm E}[g(X)]={\rm E}[g'(Z)]\,[{\rm E}(Y)-{\rm E}(X)].</math>

Complex analysisEdit

Template:See also As noted above, the theorem does not hold for differentiable complex-valued functions. Instead, a generalization of the theorem is stated such:<ref>1 J.-Cl. Evard, F. Jafari, A Complex Rolle’s Theorem, American Mathematical Monthly, Vol. 99, Issue 9, (Nov. 1992), pp. 858-861.</ref>

Let f : Ω → C be a holomorphic function on the open convex set Ω, and let a and b be distinct points in Ω. Then there exist points u, v on the interior of the line segment from a to b such that

<math>\operatorname{Re}(f'(u)) = \operatorname{Re}\left ( \frac{f(b)-f(a)}{b-a} \right),</math>
<math>\operatorname{Im}(f'(v)) = \operatorname{Im}\left ( \frac{f(b)-f(a)}{b-a} \right).</math>

Where Re() is the real part and Im() is the imaginary part of a complex-valued function.

See alsoEdit

NotesEdit

Template:Reflist

ReferencesEdit

External linksEdit

|_exclude=urlname, _debug, id |url = https://mathworld.wolfram.com/{{#if:Mean-ValueTheorem%7CMean-ValueTheorem.html}} |title = Mean value theorem |author = Weisstein, Eric W. |website = MathWorld |access-date = |ref = Template:SfnRef }}

  • {{#invoke:Template wrapper|{{#if:|list|wrap}}|_template=cite web

|_exclude=urlname, _debug, id |url = https://mathworld.wolfram.com/{{#if:CauchysMean-ValueTheorem%7CCauchysMean-ValueTheorem.html}} |title = Cauchy's Mean-Value Theorem |author = Weisstein, Eric W. |website = MathWorld |access-date = |ref = Template:SfnRef }}

Template:Calculus topics Template:Joseph-Louis Lagrange Template:Authority control