Basel problem

Template:Short description Template:Pi box The Basel problem is a problem in mathematical analysis with relevance to number theory, concerning an infinite sum of inverse squares. It was first posed by Pietro Mengoli in 1650 and solved by Leonhard Euler in 1734,<ref>Template:Citation</ref> and read on 5 December 1735 in The Saint Petersburg Academy of Sciences.<ref>E41 – De summis serierum reciprocarum</ref> Since the problem had withstood the attacks of the leading mathematicians of the day, Euler's solution brought him immediate fame when he was twenty-eight. Euler generalised the problem considerably, and his ideas were taken up more than a century later by Bernhard Riemann in his seminal 1859 paper "On the Number of Primes Less Than a Given Magnitude", in which he defined his zeta function and proved its basic properties. The problem is named after the city of Basel, hometown of Euler as well as of the Bernoulli family who unsuccessfully attacked the problem.

The Basel problem asks for the precise summation of the reciprocals of the squares of the natural numbers, i.e. the precise sum of the infinite series: <math display="block">\sum_{n=1}^\infty \frac{1}{n^2} = \frac{1}{1^2} + \frac{1}{2^2} + \frac{1}{3^2} + \cdots. </math>

The sum of the series is approximately equal to 1.644934.<ref>Template:Cite OEIS</ref> The Basel problem asks for the exact sum of this series (in closed form), as well as a proof that this sum is correct. Euler found the exact sum to be <math display="inline">\frac {\pi^2}{6}</math> and announced this discovery in 1735. His arguments were based on manipulations that were not justified at the time, although he was later proven correct. He produced an accepted proof in 1741.

The solution to this problem can be used to estimate the probability that two large random numbers are coprime. Two random integers in the range from 1 to Template:Mvar, in the limit as Template:Mvar goes to infinity, are relatively prime with a probability that approaches <math display="inline">\frac {6}{\pi^2}</math>, the reciprocal of the solution to the Basel problem.<ref>Template:Citation</ref>

Euler's approachEdit

Euler's original derivation of the value <math display="inline">\frac{\pi^2}{6}</math> essentially extended observations about finite polynomials and assumed that these same properties hold true for infinite series.

Of course, Euler's original reasoning requires justification (100 years later, Karl Weierstrass proved that Euler's representation of the sine function as an infinite product is valid, by the Weierstrass factorization theorem), but even without justification, by simply obtaining the correct value, he was able to verify it numerically against partial sums of the series. The agreement he observed gave him sufficient confidence to announce his result to the mathematical community.

To follow Euler's argument, recall the Taylor series expansion of the sine function <math display=block> \sin x = x - \frac{x^3}{3!} + \frac{x^5}{5!} - \frac{x^7}{7!} + \cdots </math> Dividing through by Template:Mvar gives <math display=block> \frac{\sin x}{x} = 1 - \frac{x^2}{3!} + \frac{x^4}{5!} - \frac{x^6}{7!} + \cdots .</math>

The Weierstrass factorization theorem shows that the right-hand side is the product of linear factors given by its roots, just as for finite polynomials. Euler assumed this as a heuristic for expanding an infinite degree polynomial in terms of its roots, but in fact it is not always true for general <math>P(x)</math>.<ref>A priori, since the left-hand-side is a polynomial (of infinite degree) we can write it as a product of its roots as <math display=block>\begin{align} \sin(x) & = x (x^2-\pi^2)(x^2-4\pi^2)(x^2-9\pi^2) \cdots \\

    & = Ax \left(1 - \frac{x^2}{\pi^2}\right)\left(1 - \frac{x^2}{4\pi^2}\right)\left(1 - \frac{x^2}{9\pi^2}\right) \cdots.

\end{align} </math> Then since we know from elementary calculus that <math>\lim_{x \rightarrow 0} \frac{\sin(x)}{x} = 1</math>, we conclude that the leading constant must satisfy <math>A = 1</math>.</ref> This factorization expands the equation into: <math display=block>\begin{align} \frac{\sin x}{x} &= \left(1 - \frac{x}{\pi}\right)\left(1 + \frac{x}{\pi}\right)\left(1 - \frac{x}{2\pi}\right)\left(1 + \frac{x}{2\pi}\right)\left(1 - \frac{x}{3\pi}\right)\left(1 + \frac{x}{3\pi}\right) \cdots \\

                   &= \left(1 - \frac{x^2}{\pi^2}\right)\left(1 - \frac{x^2}{4\pi^2}\right)\left(1 - \frac{x^2}{9\pi^2}\right) \cdots

\end{align}</math>

If we formally multiply out this product and collect all the Template:Math terms (we are allowed to do so because of Newton's identities), we see by induction that the Template:Math coefficient of Template:Math is <ref>In particular, letting <math>H_n^{(2)} := \sum_{k=1}^n k^{-2}</math> denote a generalized second-order harmonic number, we can easily prove by induction that <math>[x^2] \prod_{k=1}^{n} \left(1-\frac{x^2}{\pi^2}\right) = -\frac{H_n^{(2)}}{\pi^2} \rightarrow -\frac{\zeta(2)}{\pi^2}</math> as <math>n \rightarrow \infty</math>.</ref> <math display=block> -\left(\frac{1}{\pi^2} + \frac{1}{4\pi^2} + \frac{1}{9\pi^2} + \cdots \right) = -\frac{1}{\pi^2}\sum_{n=1}^{\infty}\frac{1}{n^2}.</math>

But from the original infinite series expansion of Template:Math, the coefficient of Template:Math is Template:Math. These two coefficients must be equal; thus, <math display=block>-\frac{1}{6} = -\frac{1}{\pi^2}\sum_{n=1}^{\infty}\frac{1}{n^2}.</math>

Multiplying both sides of this equation by −Template:Pi² gives the sum of the reciprocals of the positive square integers.<ref name="HAVIL-GAMMA">Template:Citation</ref> <math display=block>\sum_{n=1}^{\infty}\frac{1}{n^2} = \frac{\pi^2}{6}.</math>

Generalizations of Euler's method using elementary symmetric polynomialsEdit

Using formulae obtained from elementary symmetric polynomials,<ref>Cf., the formulae for generalized Stirling numbers proved in: Template:Citation</ref> this same approach can be used to enumerate formulae for the even-indexed even zeta constants which have the following known formula expanded by the Bernoulli numbers: <math display=block>\zeta(2n) = \frac{(-1)^{n-1} (2\pi)^{2n}}{2 \cdot (2n)!} B_{2n}. </math>

For example, let the partial product for <math>\sin(x)</math> expanded as above be defined by <math>\frac{S_n(x)}{x} := \prod\limits_{k=1}^n \left(1 - \frac{x^2}{k^2 \cdot \pi^2}\right)</math>. Then using known formulas for elementary symmetric polynomials (a.k.a., Newton's formulas expanded in terms of power sum identities), we can see (for example) that <math display=block> \begin{align} \left[x^4\right] \frac{S_n(x)}{x} & = \frac{1}{2\pi^4}\left(\left(H_n^{(2)}\right)^2 - H_n^{(4)}\right) \qquad \xrightarrow{n \rightarrow \infty} \qquad \frac{1}{2\pi^4}\left(\zeta(2)^2-\zeta(4)\right) \\[4pt] & \qquad \implies \zeta(4) = \frac{\pi^4}{90} = -2\pi^4 \cdot [x^4] \frac{\sin(x)}{x} +\frac{\pi^4}{36} \\[8pt] \left[x^6\right] \frac{S_n(x)}{x} & = -\frac{1}{6\pi^6}\left(\left(H_n^{(2)}\right)^3 - 2H_n^{(2)} H_n^{(4)} + 2H_n^{(6)}\right) \qquad \xrightarrow{n \rightarrow \infty} \qquad \frac{1}{6\pi^6}\left(\zeta(2)^3-3\zeta(2)\zeta(4) + 2\zeta(6)\right) \\[4pt] & \qquad \implies \zeta(6) = \frac{\pi^6}{945} = -3 \cdot \pi^6 [x^6] \frac{\sin(x)}{x} - \frac{2}{3} \frac{\pi^2}{6} \frac{\pi^4}{90} + \frac{\pi^6}{216}, \end{align} </math>

and so on for subsequent coefficients of <math>[x^{2k}] \frac{S_n(x)}{x}</math>. There are other forms of Newton's identities expressing the (finite) power sums <math>H_n^{(2k)}</math> in terms of the elementary symmetric polynomials, <math>e_i \equiv e_i\left(-\frac{\pi^2}{1^2}, -\frac{\pi^2}{2^2}, -\frac{\pi^2}{3^2}, -\frac{\pi^2}{4^2}, \ldots\right), </math> but we can go a more direct route to expressing non-recursive formulas for <math>\zeta(2k)</math> using the method of elementary symmetric polynomials. Namely, we have a recurrence relation between the elementary symmetric polynomials and the power sum polynomials given as on this page by <math display=block>(-1)^{k}k e_k(x_1,\ldots,x_n) = \sum_{j=1}^k (-1)^{k-j-1} p_j(x_1,\ldots,x_n)e_{k-j}(x_1,\ldots,x_n),</math>

which in our situation equates to the limiting recurrence relation (or generating function convolution, or product) expanded as <math display=block> \frac{\pi^{2k}}{2}\cdot \frac{(2k) \cdot (-1)^k}{(2k+1)!} = -[x^{2k}] \frac{\sin(\pi x)}{\pi x} \times \sum_{i \geq 1} \zeta(2i) x^i. </math>

Then by differentiation and rearrangement of the terms in the previous equation, we obtain that <math display=block>\zeta(2k) = [x^{2k}]\frac{1}{2}\left(1-\pi x\cot(\pi x)\right). </math>

Consequences of Euler's proofEdit

By the above results, we can conclude that <math>\zeta(2k)</math> is always a rational multiple of <math>\pi^{2k}</math>. In particular, since <math>\pi</math> and integer powers of it are transcendental, we can conclude at this point that <math>\zeta(2k)</math> is irrational, and more precisely, transcendental for all <math>k \geq 1</math>. By contrast, the properties of the odd-indexed zeta constants, including Apéry's constant <math>\zeta(3)</math>, are almost completely unknown.

The Riemann zeta functionEdit

The Riemann zeta function Template:Math is one of the most significant functions in mathematics because of its relationship to the distribution of the prime numbers. The zeta function is defined for any complex number Template:Math with real part greater than 1 by the following formula: <math display=block>\zeta(s) = \sum_{n=1}^\infty \frac{1}{n^s}.</math>

Template:AnchorTaking Template:Math, we see that Template:Math is equal to the sum of the reciprocals of the squares of all positive integers: <math display=block>\zeta(2) = \sum_{n=1}^\infty \frac{1}{n^2}

               = \frac{1}{1^2} + \frac{1}{2^2} + \frac{1}{3^2} + \frac{1}{4^2} + \cdots = \frac{\pi^2}{6} \approx 1.644934.</math>

Convergence can be proven by the integral test, or by the following inequality: <math display=block>\begin{align}

 \sum_{n=1}^N \frac{1}{n^2} & < 1 + \sum_{n=2}^N \frac{1}{n(n-1)} \\
                            & = 1 + \sum_{n=2}^N \left( \frac{1}{n-1} - \frac{1}{n} \right) \\
                            & = 1 + 1 - \frac{1}{N} \;{\stackrel{N \to \infty}{\longrightarrow}}\; 2.

\end{align}</math>

This gives us the upper bound 2, and because the infinite sum contains no negative terms, it must converge to a value strictly between 0 and 2. It can be shown that Template:Math has a simple expression in terms of the Bernoulli numbers whenever Template:Math is a positive even integer. With Template:Math:<ref>Template:Citation</ref> <math display=block>\zeta(2n) = \frac{(2\pi)^{2n}(-1)^{n+1}B_{2n}}{2\cdot(2n)!}.</math>

A proof using Euler's formula and L'Hôpital's ruleEdit

The normalized sinc function <math>\text{sinc}(x)=\frac{\sin (\pi x)}{\pi x}</math> has a Weierstrass factorization representation as an infinite product: <math display=block>\frac{\sin (\pi x)}{\pi x} = \prod_{n=1}^\infty \left(1-\frac{x^2}{n^2}\right).</math>

The infinite product is analytic, so taking the natural logarithm of both sides and differentiating yields <math display=block>\frac{\pi \cos (\pi x)}{\sin (\pi x)}-\frac{1}{x}=-\sum_{n=1}^\infty \frac{2x}{n^2-x^2}</math>

(by uniform convergence, the interchange of the derivative and infinite series is permissible). After dividing the equation by <math>2x</math> and regrouping one gets <math display=block>\frac{1}{2x^2}-\frac{\pi \cot (\pi x)}{2x}=\sum_{n=1}^\infty \frac{1}{n^2-x^2}.</math>

We make a change of variables (<math>x=-it</math>): <math display=block>-\frac{1}{2t^2}+\frac{\pi \cot (-\pi it)}{2it}=\sum_{n=1}^\infty \frac{1}{n^2+t^2}.</math>

Euler's formula can be used to deduce that <math display=block>\frac{\pi \cot (-\pi i t)}{2it}=\frac{\pi}{2it}\frac{i\left(e^{2\pi t}+1\right)}{e^{2\pi t}-1}=\frac{\pi}{2t}+\frac{\pi}{t\left(e^{2\pi t} - 1\right)}.</math> or using the corresponding hyperbolic function: <math display=block>\frac{\pi \cot (-\pi i t)}{2it}=\frac{\pi}{2t}{i\cot (\pi i t)}=\frac{\pi}{2t}\coth(\pi t).</math>

Then <math display=block>\sum_{n=1}^\infty \frac{1}{n^2+t^2}=\frac{\pi \left(te^{2\pi t}+t\right)-e^{2\pi t}+1}{2\left(t^2 e^{2\pi t}-t^2\right)}=-\frac{1}{2t^2} + \frac{\pi}{2t} \coth(\pi t).</math>

Now we take the limit as <math>t</math> approaches zero and use L'Hôpital's rule thrice. By Tannery's theorem applied to <math display="inline">\lim_{t\to\infty}\sum_{n=1}^\infty 1/(n^2+1/t^2)</math>, we can interchange the limit and infinite series so that <math display="inline">\lim_{t\to 0}\sum_{n=1}^\infty 1/(n^2+t^2)=\sum_{n=1}^\infty 1/n^2</math> and by L'Hôpital's rule <math display=block>\begin{align}\sum_{n=1}^\infty \frac{1}{n^2}&=\lim_{t\to 0}\frac{\pi}{4}\frac{2\pi te^{2\pi t}-e^{2\pi t}+1}{\pi t^2 e^{2\pi t} + te^{2\pi t}-t}\\[6pt] &=\lim_{t\to 0}\frac{\pi^3 te^{2\pi t}}{2\pi \left(\pi t^2 e^{2\pi t}+2te^{2\pi t} \right)+e^{2\pi t}-1}\\[6pt] &=\lim_{t\to 0}\frac{\pi^2 (2\pi t+1)}{4\pi^2 t^2+12\pi t+6}\\[6pt] &=\frac{\pi^2}{6}.\end{align}</math>

A proof using Fourier seriesEdit

Use Parseval's identity (applied to the function Template:Math) to obtain <math display=block>\sum_{n=-\infty}^\infty |c_n|^2 = \frac{1}{2\pi}\int_{-\pi}^\pi x^2 \, dx,</math> where <math display=block>\begin{align}

 c_n &= \frac{1}{2\pi}\int_{-\pi}^\pi x e^{-inx} \, dx \\[4pt]
     &= \frac{n\pi \cos(n\pi)-\sin(n\pi)}{\pi n^2} i \\[4pt]
     &= \frac{\cos(n\pi)}{n} i \\[4pt]
     &= \frac{(-1)^n}{n} i

\end{align}</math>

for Template:Math, and Template:Math. Thus, <math display=block>|c_n|^2 = \begin{cases} \dfrac{1}{n^2}, & \text{for } n \neq 0, \\ 0, & \text{for } n = 0, \end{cases} </math>

and <math display=block>\sum_{n=-\infty}^\infty |c_n|^2 = 2\sum_{n=1}^\infty \frac{1}{n^2} = \frac{1}{2\pi} \int_{-\pi}^\pi x^2 \, dx.</math>

Therefore, <math display=block>\sum_{n=1}^\infty \frac{1}{n^2} = \frac{1}{4\pi}\int_{-\pi}^\pi x^2 \, dx = \frac{\pi^2}{6}</math> as required.

Another proof using Parseval's identityEdit

Given a complete orthonormal basis in the space <math>L^2_{\operatorname{per}}(0, 1)</math> of L2 periodic functions over <math>(0, 1)</math> (i.e., the subspace of square-integrable functions which are also periodic), denoted by <math>\{e_i\}_{i=-\infty}^{\infty}</math>, Parseval's identity tells us that <math display=block>\|x\|^2 = \sum_{i=-\infty}^{\infty} |\langle e_i, x\rangle|^2, </math>

where <math>\|x\| := \sqrt{\langle x,x\rangle}</math> is defined in terms of the inner product on this Hilbert space given by <math display=block>\langle f, g\rangle = \int_0^1 f(x) \overline{g(x)} \, dx,\ f,g \in L^2_{\operatorname{per}}(0, 1).</math>

We can consider the orthonormal basis on this space defined by <math>e_k \equiv e_k(\vartheta) := \exp(2\pi\imath k \vartheta)</math> such that <math>\langle e_k,e_j\rangle = \int_0^1 e^{2\pi\imath (k-j) \vartheta} \, d\vartheta = \delta_{k,j}</math>. Then if we take <math>f(\vartheta) := \vartheta</math>, we can compute both that <math display=block> \begin{align} \|f\|^2 & = \int_0^1 \vartheta^2 \, d\vartheta = \frac{1}{3} \\ \langle f, e_k\rangle & = \int_0^1 \vartheta e^{-2\pi\imath k\vartheta} \, d\vartheta = \Biggl\{\begin{array}{ll} \frac{1}{2}, & k = 0 \\ -\frac{1}{2\pi\imath k} & k \neq 0, \end{array} \end{align} </math>

by elementary calculus and integration by parts, respectively. Finally, by Parseval's identity stated in the form above, we obtain that <math display=block> \begin{align} \|f\|^2 = \frac{1}{3} & = \sum_{\stackrel{k=-\infty}{k \neq 0}}^{\infty} \frac{1}{(2\pi k)^2}+ \frac{1}{4}

    = 2 \sum_{k=1}^{\infty} \frac{1}{(2\pi k)^2}+ \frac{1}{4} \\
    & \implies \frac{\pi^2}{6} = \frac{2 \pi^2}{3} - \frac{\pi^2}{2} = \zeta(2).

\end{align} </math>

Generalizations and recurrence relationsEdit

Note that by considering higher-order powers of <math>f_j(\vartheta) := \vartheta^j \in L^2_{\operatorname{per}}(0, 1)</math> we can use integration by parts to extend this method to enumerating formulas for <math>\zeta(2j)</math> when <math>j > 1</math>. In particular, suppose we let <math display=block>I_{j,k} := \int_0^1 \vartheta^j e^{-2\pi\imath k\vartheta} \, d\vartheta, </math>

so that integration by parts yields the recurrence relation that <math display=block> \begin{align} I_{j,k} & = \begin{cases} \frac{1}{j+1}, & k=0; \\[4pt] -\frac{1}{2\pi\imath \cdot k} + \frac{j}{2\pi\imath \cdot k} I_{j-1,k}, & k \neq 0\end{cases} \\[6pt]

    & = \begin{cases} \frac{1}{j+1}, & k=0; \\[4pt] -\sum\limits_{m=1}^j \frac{j!}{(j+1-m)!} \cdot \frac{1}{(2\pi\imath \cdot k)^m}, & k \neq 0. \end{cases}

\end{align} </math>

Then by applying Parseval's identity as we did for the first case above along with the linearity of the inner product yields that <math display=block> \begin{align} \|f_j\|^2 = \frac{1}{2j+1} & = 2 \sum_{k \geq 1} I_{j,k} \bar{I}_{j,k} + \frac{1}{(j+1)^2} \\[6pt]

    & = 2 \sum_{m=1}^j \sum_{r=1}^j \frac{j!^2}{(j+1-m)! (j+1-r)!} \frac{(-1)^r}{\imath^{m+r}} \frac{\zeta(m+r)}{(2\pi)^{m+r}} + \frac{1}{(j+1)^2}.

\end{align} </math>

Proof using differentiation under the integral signEdit

It's possible to prove the result using elementary calculus by applying the differentiation under the integral sign technique to an integral due to Freitas:<ref>Template:Cite arXiv</ref> <math display=block>I(\alpha) = \int_0^\infty \ln\left(1+\alpha e^{-x}+e^{-2x}\right)dx.</math>

While the primitive function of the integrand cannot be expressed in terms of elementary functions, by differentiating with respect to <math>\alpha</math> we arrive at

<math display=block>\frac{dI}{d\alpha} = \int_0^\infty \frac{e^{-x}}{1+\alpha e^{-x}+e^{-2x}}dx,</math> which can be integrated by substituting <math>u=e^{-x}</math> and decomposing into partial fractions. In the range <math>-2\leq\alpha\leq 2</math> the definite integral reduces to

<math display=block>\frac{dI}{d\alpha} = \frac{2}{\sqrt{4-\alpha^2}}\left[\arctan\left(\frac{\alpha+2}{\sqrt{4-\alpha^2}}\right)-\arctan\left(\frac{\alpha}{\sqrt{4-\alpha^2}}\right)\right].</math>

The expression can be simplified using the arctangent addition formula and integrated with respect to <math>\alpha</math> by means of trigonometric substitution, resulting in

<math display=block>I(\alpha) = -\frac{1}{2}\arccos\left(\frac{\alpha}{2}\right)^2 + c.</math>

The integration constant <math>c</math> can be determined by noticing that two distinct values of <math>I(\alpha)</math> are related by

<math display=block>I(2) = 4I(0),</math> because when calculating <math>I(2)</math> we can factor <math>1+2e^{-x}+e^{-2x} = (1+e^{-x})^2</math> and express it in terms of <math>I(0)</math> using the logarithm of a power identity and the substitution <math>u=x/2</math>. This makes it possible to determine <math>c = \frac{\pi^2}{6}</math>, and it follows that

<math display=block>I(-2) = 2\int_0^\infty \ln(1-e^{-x})dx = -\frac{\pi^2}{3}.</math>

This final integral can be evaluated by expanding the natural logarithm into its Taylor series:

<math display=block>\int_0^\infty \ln(1-e^{-x})dx = - \sum_{n=1}^\infty \int_0^\infty \frac{e^{-nx}}{n}dx = -\sum_{n=1}^\infty\frac{1}{n^2}.</math>

The last two identities imply

<math display=block>\sum_{n=1}^\infty\frac{1}{n^2} = \frac{\pi^2}{6}.</math>

Cauchy's proofEdit

While most proofs use results from advanced mathematics, such as Fourier analysis, complex analysis, and multivariable calculus, the following does not even require single-variable calculus (until a single limit is taken at the end).

For a proof using the residue theorem, see here.

History of this proofEdit

The proof goes back to Augustin Louis Cauchy (Cours d'Analyse, 1821, Note VIII). In 1954, this proof appeared in the book of Akiva and Isaak Yaglom "Nonelementary Problems in an Elementary Exposition". Later, in 1982, it appeared in the journal Eureka,<ref>Template:Citation</ref> attributed to John Scholes, but Scholes claims he learned the proof from Peter Swinnerton-Dyer, and in any case he maintains the proof was "common knowledge at Cambridge in the late 1960s".<ref>Template:Citation; this anecdote is missing from later editions of this book, which replace it with earlier history of the same proof.</ref>

The proofEdit

File:Limit circle FbN.jpeg

The inequality
<math>\tfrac{1}{2}r^2\tan\theta > \tfrac{1}{2}r^2\theta > \tfrac{1}{2}r^2\sin\theta</math>
is shown pictorially for any <math>\theta \in (0, \pi/2)</math>. The three terms are the areas of the triangle OAC, circle section OAB, and the triangle OAB. Taking reciprocals and squaring gives
<math>\cot^2\theta<\tfrac{1}{\theta^2}<\csc^2\theta</math>.

The main idea behind the proof is to bound the partial (finite) sums <math display=block>\sum_{k=1}^m \frac{1}{k^2} = \frac{1}{1^2} + \frac{1}{2^2} + \cdots + \frac{1}{m^2}</math> between two expressions, each of which will tend to Template:Sfrac as Template:Math approaches infinity. The two expressions are derived from identities involving the cotangent and cosecant functions. These identities are in turn derived from de Moivre's formula, and we now turn to establishing these identities.

Let Template:Math be a real number with Template:Math, and let Template:Math be a positive odd integer. Then from de Moivre's formula and the definition of the cotangent function, we have <math display=block>\begin{align}

 \frac{\cos (nx) + i \sin (nx)}{\sin^n x} &= \frac{(\cos x + i\sin x)^n}{\sin^n x} \\[4pt]
                                            &= \left(\frac{\cos x + i \sin x}{\sin x}\right)^n \\[4pt]
                                            &= (\cot x + i)^n.

\end{align}</math>

From the binomial theorem, we have <math display=block>\begin{align} (\cot x + i)^n = & {n \choose 0} \cot^n x + {n \choose 1} (\cot^{n - 1} x)i + \cdots + {n \choose {n - 1}} (\cot x)i^{n - 1} + {n \choose n} i^n \\[6pt] = & \Bigg( {n \choose 0} \cot^n x - {n \choose 2} \cot^{n - 2} x \pm \cdots \Bigg) \; + \; i\Bigg( {n \choose 1} \cot^{n-1} x - {n \choose 3} \cot^{n - 3} x \pm \cdots \Bigg). \end{align}</math>

Combining the two equations and equating imaginary parts gives the identity <math display=block>\frac{\sin (nx)}{\sin^n x} = \Bigg( {n \choose 1} \cot^{n - 1} x - {n \choose 3} \cot^{n - 3} x \pm \cdots \Bigg).</math>

We take this identity, fix a positive integer Template:Math, set Template:Math, and consider Template:Math for Template:Math. Then Template:Math is a multiple of Template:Pi and therefore Template:Math. So, <math display=block>0 = {{2m + 1} \choose 1} \cot^{2m} x_r - {{2m + 1} \choose 3} \cot^{2m - 2} x_r \pm \cdots + (-1)^m{{2m + 1} \choose {2m + 1}}</math>

for every Template:Math. The values Template:Math are distinct numbers in the interval Template:Math. Since the function Template:Math is one-to-one on this interval, the numbers Template:Math are distinct for Template:Math. By the above equation, these Template:Math numbers are the roots of the Template:Mathth degree polynomial <math display=block>p(t) = {{2m + 1} \choose 1}t^m - {{2m + 1} \choose 3}t^{m - 1} \pm \cdots + (-1)^m{{2m+1} \choose {2m + 1}}.</math>

By Vieta's formulas we can calculate the sum of the roots directly by examining the first two coefficients of the polynomial, and this comparison shows that <math display=block>\cot ^2 x_1 + \cot ^2 x_2 + \cdots + \cot ^2 x_m = \frac{\binom{2m + 1}3} {\binom{2m + 1}1} = \frac{2m(2m - 1)}6.</math>

Substituting the identity Template:Math, we have <math display=block>\csc ^2 x_1 + \csc ^2 x_2 + \cdots + \csc ^2 x_m = \frac{2m(2m - 1)}6 + m = \frac{2m(2m + 2)}6.</math>

Now consider the inequality Template:Math (illustrated geometrically above). If we add up all these inequalities for each of the numbers Template:Math, and if we use the two identities above, we get <math display=block>\frac{2m(2m - 1)}6 < \left(\frac{2m + 1}{\pi} \right)^2 + \left(\frac{2m + 1}{2\pi} \right)^2 + \cdots + \left(\frac{2m + 1}{m \pi} \right)^2 < \frac{2m(2m + 2)}6.</math>

Multiplying through by Template:Math, this becomes <math display=block>\frac{\pi ^2}{6}\left(\frac{2m}{2m + 1}\right)\left(\frac{2m - 1}{2m + 1}\right) < \frac{1}{1^2} + \frac{1}{2^2} + \cdots + \frac{1}{m^2} < \frac{\pi ^2}{6}\left(\frac{2m}{2m + 1}\right)\left(\frac{2m + 2}{2m + 1}\right).</math>

As Template:Math approaches infinity, the left and right hand expressions each approach Template:Sfrac, so by the squeeze theorem, <math display=block>\zeta(2) = \sum_{k=1}^\infty \frac{1}{k^2} =

 \lim_{m \to \infty}\left(\frac{1}{1^2} + \frac{1}{2^2} + \cdots + \frac{1}{m^2}\right) = \frac{\pi ^2}{6}</math>

and this completes the proof.

Proof assuming Weil's conjecture on Tamagawa numbers

A proof is also possible assuming Weil's conjecture on Tamagawa numbers.<ref>Template:Citation|</ref> The conjecture asserts for the case of the algebraic group SL₂(R) that the Tamagawa number of the group is one. That is, the quotient of the special linear group over the rational adeles by the special linear group of the rationals (a compact set, because <math>SL_2(\mathbb Q)</math> is a lattice in the adeles) has Tamagawa measure 1: <math display="block">\tau(SL_2(\mathbb Q)\setminus SL_2(A_{\mathbb Q}))=1.</math>

To determine a Tamagawa measure, the group <math>SL_2</math> consists of matrices <math display="block">\begin{bmatrix}x&y\\z&t\end{bmatrix}</math> with <math>xt-yz=1</math>. An invariant volume form on the group is <math display="block">\omega = \frac1x dx\wedge dy\wedge dz.</math>

The measure of the quotient is the product of the measures of <math>SL_2(\mathbb Z)\setminus SL_2(\mathbb R)</math> corresponding to the infinite place, and the measures of <math>SL_2(\mathbb Z_p)</math> in each finite place, where <math>\mathbb Z_p</math> is the p-adic integers.

For the local factors, <math display="block">\omega(SL_2(\mathbb Z_p)) = |SL_2(F_p)|\omega(SL_2(\mathbb Z_p,p))</math> where <math>F_p</math> is the field with <math>p</math> elements, and <math>SL_2(\mathbb Z_p,p)</math> is the congruence subgroup modulo <math>p</math>. Since each of the coordinates <math>x,y,z</math> map the latter group onto <math>p\mathbb Z_p</math> and <math>\left|\frac1x\right|_p=1</math>, the measure of <math>SL_2(\mathbb Z_p,p)</math> is <math>\mu_p(p\mathbb Z_p)^3=p^{-3}</math>, where <math>\mu_p</math> is the normalized Haar measure on <math>\mathbb Z_p</math>. Also, a standard computation shows that <math>|SL_2(F_p)|=p(p^2-1)</math>. Putting these together gives <math>\omega(SL_2(\mathbb Z_p))=(1-1/p^2)</math>.

At the infinite place, an integral computation over the fundamental domain of <math>SL_2(\mathbb Z)</math> shows that <math>\omega(SL_2(\mathbb Z)\setminus SL_2(\mathbb R)=\pi^2/6</math>, and therefore the Weil conjecture finally gives <math display="block">1 = \frac{\pi^2}6\prod_p \left(1-\frac1{p^2}\right).</math> On the right-hand side, we recognize the Euler product for <math>1/\zeta(2)</math>, and so this gives the solution to the Basel problem.

This approach shows the connection between (hyperbolic) geometry and arithmetic, and can be inverted to give a proof of the Weil conjecture for the special case of <math>SL_2</math>, contingent on an independent proof that <math>\zeta(2)=\pi^2/6</math>.

Geometric proof

The Basel problem can be proved with Euclidean geometry, using the insight that the real line can be seen as a circle of infinite radius. An intuitive, if not completely rigorous, sketch is given here.

Choose an integer <math>N</math>, and take <math>N</math> equally spaced points on a circle with circumference equal to <math>2N</math>. The radius of the circle is <math>N/\pi</math> and the length of each arc between two points is <math>2</math>. Call the points <math>P_{1..N}</math>.
Take another generic point <math>Q</math> on the circle, which will lie at a fraction <math>0 < \alpha < 1</math> of the arc between two consecutive points (say <math>P_1</math> and <math>P_2</math> without loss of generality).
Draw all the chords joining <math>Q</math> with each of the <math>P_{1..N}</math> points. Now (this is the key to the proof), compute the sum of the inverse squares of the lengths of all these chords, call it <math>sisc</math>.
The proof relies on the notable fact that (for a fixed <math>\alpha</math>), the <math>sisc</math> does not depend on <math>N</math>. Note that intuitively, as <math>N</math> increases, the number of chords increases, but their length increases too (as the circle gets bigger), so their inverse square decreases.
In particular, take the case where <math>\alpha = 1/2</math>, meaning that <math>Q</math> is the midpoint of the arc between two consecutive <math>P</math>'s. The <math>sisc</math> can then be found trivially from the case <math>N=1</math>, where there is only one <math>P</math>, and one <math>Q</math> on the opposite side of the circle. Then the chord is the diameter of the circle, of length <math>2/\pi</math>. The <math>sisc</math> is then <math>\pi^2/4</math>.
When <math>N</math> goes to infinity, the circle approaches the real line. If you set the origin at <math>Q</math>, the points <math>P_{1..N}</math> are positioned at the odd integer positions (positive and negative), since the arcs have length 1 from <math>Q</math> to <math>P_1</math>, and 2 onward. You hence get this variation of the Basel Problem:

<math display="block"> \sum_{z=-\infty}^{\infty} \frac{1}{(2z-1)^2} = \frac{\pi^2}{4} </math>

From here, you can recover the original formulation with a bit of algebra, as:

   \sum_{n=1}^\infty \frac{1}{n^2} = \sum_{n=1}^\infty \frac{1}{(2n-1)^2} + \sum_{n=1}^\infty \frac{1}{(2n)^2} = \frac{1}{2}\sum_{z=-\infty}^{\infty} \frac{1}{(2z-1)^2} + \frac{1}{4}\sum_{n=1}^\infty \frac{1}{n^2}

</math>

that is,

   \frac{3}{4}\sum_{n=1}^\infty \frac{1}{n^2} = \frac{\pi^2}{8}

</math>

or

   \sum_{n=1}^\infty \frac{1}{n^2} = \frac{\pi^2}{6}

</math>.

The independence of the <math>sisc</math> from <math>N</math> can be proved easily with Euclidean geometry for the more restrictive case where <math>N</math> is a power of 2, i.e. <math>N = 2^n</math>, which still allows the limiting argument to be applied. The proof proceeds by induction on <math>n</math>, and uses the Inverse Pythagorean Theorem, which states that:

where <math>a</math> and <math>b</math> are the legs and <math>h</math> is the height of a right triangle.

In the base case of <math>n=0</math>, there is only 1 chord. In the case of <math>\alpha = 1/2</math>, it corresponds to the diameter and the <math>sisc</math> is <math>\pi^2/4</math> as stated above.
Now, assume that you have <math>2^n</math> points on a circle with radius <math>2^n/\pi</math> and center <math>O</math>, and <math>2^{n+1}</math> points on a circle with radius <math>2^{n+1}/\pi</math> and center <math>R</math>. The induction step consists in showing that these 2 circles have the same <math>sisc</math> for a given <math>\alpha</math>.

Start by drawing the circles so that they share point <math>Q</math>. Note that <math>R</math> lies on the smaller circle. Then, note that <math>2^{n+1}</math> is always even, and a simple geometric argument shows that you can pick pairs of opposite points <math>P_1</math> and <math>P_2</math> on the larger circle by joining each pair with a diameter. Furthermore, for each pair, one of the points will be in the "lower" half of the circle (closer to <math>Q</math>) and the other in the "upper" half.

File:Induction step 1728668638527.jpg

The sum of inverse squares of distances of P1 and P2 from Q equals the inverse square distance from P to Q.

The diameter of the bigger circle <math>P_1P_2</math> cuts the smaller circle at <math>R</math> and at another point <math>P</math>. You can then make the following considerations:
- <math>P_1 \widehat{Q} P_2</math> is a right angle, since <math>P_1P_2</math> is a diameter.
- <math>Q \widehat{P} R</math> is a right angle, since <math>QR</math> is a diameter.
- <math>Q \widehat{R} P_2 = Q \widehat{R} P</math> is half of <math>Q \widehat{O} P</math> for the Inscribed Angle Theorem.
- Hence, the arc <math>QP</math> is equal to the arc <math>QP_2</math>, again because the radius is half.
- The chord <math>QP</math> is the height of the right triangle <math>QP_1P_2</math>, hence for the Inverse Pythagorean Theorem:

<math display="block"> \frac{1}{\overline{QP}^2} = \frac{1}{\overline{QP_1}^2} + \frac{1}{\overline{QP_2}^2} </math>

Hence for half of the points on the bigger circle (the ones in the lower half) there is a corresponding point on the smaller circle with the same arc distance from <math>Q</math> (since the circumference of the smaller circle is half that of the bigger circle, the last two points closer to <math>R</math> must have arc distance 2 as well). Vice versa, for each of the <math>2^n</math> points on the smaller circle, we can build a pair of points on the bigger circle, and all of these points are equidistant and have the same arc distance from <math>Q</math>.
Furthermore, the total <math>sisc</math> for the bigger circle is the same as the <math>sisc</math> for the smaller circle, since each pair of points on the bigger circle has the same inverse square sum as the corresponding point on the smaller circle.<ref>{{#invoke:citation/CS1|citation

|CitationClass=web }}</ref>

Other identities

See the special cases of the identities for the Riemann zeta function when <math>s = 2.</math> Other notably special identities and representations of this constant appear in the sections below.

Series representations

The following are series representations of the constant:<ref name="MWZETA2">Template:Mathworld</ref> <math display=block>\begin{align}

 \zeta(2) &= 3 \sum_{k=1}^\infty \frac{1}{k^2 \binom{2k}{k}} \\[6pt]
    &= \sum_{i=1}^\infty \sum_{j=1}^\infty \frac{(i-1)! (j-1)!}{(i+j)!}.
\end{align}</math>

There are also BBP-type series expansions for Template:Math.<ref name="MWZETA2" />

Integral representations

The following are integral representations of <math>\zeta(2)\text{:}</math><ref>Template:Cite arXiv</ref><ref>Template:Mathworld</ref><ref>Template:Mathworld</ref> <math display=block> \begin{align}

 \zeta(2) & = -\int_0^1 \frac{\log x}{1-x} \, dx \\[6pt]
    & = \int_0^{\infty} \frac{x}{e^x-1} \, dx \\[6pt]
    & = \int_0^1 \frac{(\log x)^2}{(1+x)^2} \, dx \\[6pt]
    & = 2 + 2\int_1^{\infty} \frac{\lfloor x \rfloor -x}{x^3} \, dx \\[6pt]
    & = \exp\left(2 \int_2^{\infty} \frac{\pi(x)}{x(x^2-1)} \,dx\right) \\[6pt]
    & = \int_0^1 \int_0^1 \frac{dx \, dy}{1-xy} \\[6pt]
    & = \frac{4}{3} \int_0^1 \int_0^1 \frac{dx \, dy}{1-(xy)^2} \\[6pt]
    & = \int_0^1 \int_0^1 \frac{1-x}{1-xy} \, dx \, dy + \frac{2}{3}.
\end{align}</math>

Continued fractions

In van der Poorten's classic article chronicling Apéry's proof of the irrationality of <math>\zeta(3)</math>,<ref>Template:Citation</ref> the author notes as "a red herring" the similarity of a simple continued fraction for Apery's constant, and the following one for the Basel constant: <math display=block>\frac{\zeta(2)}{5} = \cfrac{1}{\widetilde{v}_1 - \cfrac{1^4}{\widetilde{v}_2-\cfrac{2^4}{\widetilde{v}_3-\cfrac{3^4}{\widetilde{v}_4-\ddots}}}}, </math> where <math>\widetilde{v}_n = 11n^2-11n+3 \mapsto \{3,25,69,135,\ldots\}</math>. Another continued fraction of a similar form is:<ref name="Berndt">Template:Citation</ref> <math display=block>\frac{\zeta(2)}{2} = \cfrac{1}{v_1 - \cfrac{1^4}{v_2-\cfrac{2^4}{v_3-\cfrac{3^4}{v_4-\ddots}}}}, </math> where <math>v_n = 2n-1 \mapsto \{1,3,5,7,9,\ldots\}</math>.

ReferencesEdit

NotesEdit

Template:Reflist

External linksEdit

An infinite series of surprises by C. J. Sangwin
From ζ(2) to Π. The Proof. step-by-step proof
Template:Citation, English translation with notes of Euler's paper by Lucas Willis and Thomas J. Osler
Template:Citation
Template:Citation
Robin Chapman, Evaluating Template:Math (fourteen proofs)
Visualization of Euler's factorization of the sine function
Template:Citation
- Template:YouTube (animated proof based on the above)

Basel problem

Contents

Euler's approachEdit

Generalizations of Euler's method using elementary symmetric polynomialsEdit

Consequences of Euler's proofEdit

The Riemann zeta functionEdit

A proof using Euler's formula and L'Hôpital's ruleEdit

A proof using Fourier seriesEdit

Another proof using Parseval's identityEdit

Generalizations and recurrence relationsEdit

Proof using differentiation under the integral signEdit

Cauchy's proofEdit

History of this proofEdit

The proofEdit

Proof assuming Weil's conjecture on Tamagawa numbers

Geometric proof

Other identities

Series representations

Integral representations

Continued fractions

See alsoEdit

ReferencesEdit

NotesEdit

External linksEdit