Matrix representation of conic sections

Revision as of 14:35, 15 March 2025 by imported>MediaKyle (Adding short description: "Concept in mathematics")
(diff) ← Previous revision | Latest revision (diff) | Newer revision → (diff)

Template:Short description In mathematics, the matrix representation of conic sections permits the tools of linear algebra to be used in the study of conic sections. It provides easy ways to calculate a conic section's axis, vertices, tangents and the pole and polar relationship between points and lines of the plane determined by the conic. The technique does not require putting the equation of a conic section into a standard form, thus making it easier to investigate those conic sections whose axes are not parallel to the coordinate system.

Conic sections (including degenerate ones) are the sets of points whose coordinates satisfy a second-degree polynomial equation in two variables, <math display="block">Q(x,y) = Ax^2 + Bxy + Cy^2 + Dx + Ey + F = 0.</math> By an abuse of notation, this conic section will also be called <math>Q</math> when no confusion can arise.

This equation can be written in matrix notation, in terms of a symmetric matrix to simplify some subsequent formulae, as<ref>Template:Harvnb</ref>

<math display="block">\begin{pmatrix} x & y \end{pmatrix} \begin{pmatrix} A & B/2 \\ B/2 & C \end{pmatrix} \begin{pmatrix} x \\ y \end{pmatrix} + \begin{pmatrix} D & E \end{pmatrix} \begin{pmatrix} x\\y \end{pmatrix} + F = 0.</math>

The sum of the first three terms of this equation, namely <math display="block">Ax^2 + Bxy + Cy^2 = \begin{pmatrix} x & y \end{pmatrix} \begin{pmatrix} A & B/2 \\ B/2 & C \end{pmatrix} \begin{pmatrix}x\\y\end{pmatrix},</math> is the quadratic form associated with the equation, and the matrix <math display="block">A_{33} = \begin{pmatrix} A & B/2 \\ B/2 & C\end{pmatrix}</math> is called the matrix of the quadratic form. The trace and determinant of <math>A_{33} </math> are both invariant with respect to rotation of axes and translation of the plane (movement of the origin).<ref name=petto110>Template:Harvnb</ref><ref name=Spainsec>Template:Harvnb</ref>

The quadratic equation can also be written as

<math display="block">\mathbf{x}^\mathsf{T} A_Q\mathbf{x} = 0,</math>

where <math>\mathbf{x}</math> is the homogeneous coordinate vector in three variables restricted so that the last variable is 1, i.e.,

<math display="block">\begin{pmatrix} x \\ y \\ 1 \end{pmatrix}</math>

and where <math>A_Q</math> is the matrix

<math display="block">A_Q = \begin{pmatrix}

 A & B/2 & D/2 \\
 B/2 & C & E/2 \\
 D/2 & E/2 & F

\end{pmatrix}.</math> The matrix <math>A_Q</math> is called the matrix of the quadratic equation.<ref>It is also a matrix of a quadratic form, but this form has three variables and is <math>Ax^2 + Bxy + Cy^2 + Dxz + Eyz + Fz^2</math>.</ref> Like that of <math>A_{33}</math>, its determinant is invariant with respect to both rotation and translation.<ref name=Spainsec />

The 2 × 2 upper left submatrix (a matrix of order 2) of <math>A_Q</math>, obtained by removing the third (last) row and third (last) column from <math>A_Q</math> is the matrix of the quadratic form. The above notation <math>A_{33}</math> is used in this article to emphasize this relationship.

ClassificationEdit

Proper (non-degenerate) and degenerate conic sections can be distinguished<ref name=Lawrence>Template:Harvnb</ref><ref>Template:Harvnb</ref> based on the determinant of <math>A_Q=(AC-\frac{B^2}{4})F+\frac{BDE-C{D}^2-A{E}^2}{4}</math>:

If <math>\det A_Q = 0</math>, the conic is degenerate.

If <math>\det A_Q \neq 0</math> so that <math>Q</math> is not degenerate, we can see what type of conic section it is by computing the minor, <math>\det A_{33}=AC-\frac{B^2}{4}</math>:

  • <math>Q</math> is a hyperbola if and only if <math> \det A_{33} < 0 </math>,
  • <math>Q</math> is a parabola if and only if <math> \det A_{33} = 0 </math>, and
  • <math>Q</math> is an ellipse if and only if <math> \det A_{33} > 0 </math>.

In the case of an ellipse, we can distinguish the special case of a circle by comparing the last two diagonal elements corresponding to the coefficients of <math>x^2</math> , <math>xy</math> and <math>y^2</math>:

  • If <math>A=C</math> and <math>B=0</math>, then <math>Q</math> is a circle.

Moreover, in the case of a non-degenerate ellipse (with <math>\det A_{33} > 0 </math> and <math>\det A_Q \ne 0</math>), we have a real ellipse if <math>(A + C)\det A_Q < 0</math> but an imaginary ellipse if <math>(A + C)\det A_Q > 0</math>. An example of the latter is <math>x^2 + y^2 + 10 = 0 </math>, which has no real-valued solutions.

If the conic section is degenerate (<math>\det A_Q = 0</math>), <math>\det A_{33}</math> still allows us to distinguish its form:

  • Two intersecting lines (a hyperbola degenerated to its two asymptotes) if and only if <math>\det A_{33} < 0</math>.
  • Two parallel straight lines (a degenerate parabola) if and only if <math>\det A_{33} = 0</math>. These lines are distinct and real if <math>D^2+E^2 > 4(A+C)F</math>, coincident if <math>D^2+E^2 = 4(A+C)F</math>, and non-existent in the real plane if <math>D^2+E^2 < 4(A+C)F</math>.
  • A single point (a degenerate ellipse) if and only if <math>\det A_{33} > 0</math>.

The case of coincident lines occurs if and only if the rank of the 3 × 3 matrix <math>A_Q</math> is 1; in all other degenerate cases its rank is 2.<ref name=petto110 />

Central conicsEdit

When <math> \det A_{33} \neq 0 </math> a geometric center of the conic section exists and such conic sections (ellipses and hyperbolas) are called central conics.<ref>Template:Harvnb</ref>

CenterEdit

The center of a conic, if it exists, is a point that bisects all the chords of the conic that pass through it. This property can be used to calculate the coordinates of the center, which can be shown to be the point where the gradient of the quadratic function Template:Math vanishes—that is,<ref>Template:Harvnb</ref> <math display="block"> \nabla Q = \left[ \frac{\partial Q}{\partial x} , \frac{\partial Q}{\partial y} \right] = [0,0]. </math> This yields the center as given below.

An alternative approach that uses the matrix form of the quadratic equation is based on the fact that when the center is the origin of the coordinate system, there are no linear terms in the equation. Any translation to a coordinate origin Template:Math, using Template:Math, Template:Math gives rise to

<math display="block">\begin{pmatrix} x^* + x_0 & y ^* + y_0 \end{pmatrix} \begin{pmatrix}A & B/2\\B/2 & C \end{pmatrix} \begin{pmatrix} x^* + x_0\\y^* + y_0 \end{pmatrix} + \left(\begin{matrix}D & E \end{matrix}\right) \left(\begin{matrix}x^* + x_0 \\ y^* + y_0\end{matrix}\right) +F= 0. </math>

The condition for Template:Math to be the conic's center Template:Math is that the coefficients of the linear Template:Mvar and Template:Mvar terms, when this equation is multiplied out, are zero. This condition produces the coordinates of the center: <math display="block"> \begin{pmatrix} x_c \\ y_c \end{pmatrix}

  = \begin{pmatrix} A & B/2 \\ B/2 & C \end{pmatrix}^{\!-1}
    \begin{pmatrix} -D/2 \\ -E/2 \end{pmatrix}
  = \begin{pmatrix} (BE-2CD)/(4AC-B^2) \\ (DB-2AE)/(4AC-B^2) \end{pmatrix}.</math>

This calculation can also be accomplished by taking the first two rows of the associated matrix Template:Math, multiplying each by Template:Math and setting both inner products equal to 0, obtaining the following system:

<math display="block">\begin{align} Ax + (B/2)y + D/2 &= 0, \\ (B/2)x + Cy + E/2 &= 0. \end{align}</math>

This yields the above center point.

In the case of a parabola, that is, when Template:Math, there is no center since the above denominators become zero (or, interpreted projectively, the center is on the line at infinity.)

Centered matrix equationEdit

A central (non-parabola) conic <math>Ax^2 + Bxy + Cy^2 + Dx + Ey + F = 0</math> can be rewritten in centered matrix form as <math display="block">\begin{pmatrix} x-x_c & y-y_c \end{pmatrix} \begin{pmatrix} A & B/2 \\ B/2 & C \end{pmatrix} \begin{pmatrix} x-x_c \\ y-y_c \end{pmatrix} = K,</math> where <math display="block">K = -\frac{\det (A_Q)}{AC-(B/2)^2} = -\frac{\det(A_Q)}{\det(A_{33})}.</math>

Then for the ellipse case of Template:Math, the ellipse is real if the sign of Template:Math equals the sign of Template:Math (that is, the sign of each of Template:Math and Template:Math), imaginary if they have opposite signs, and a degenerate point ellipse if Template:Math. In the hyperbola case of Template:Math, the hyperbola is degenerate if and only if Template:Math.

Standard form of a central conicEdit

Template:Main article

The standard form of the equation of a central conic section is obtained when the conic section is translated and rotated so that its center lies at the center of the coordinate system and its axes coincide with the coordinate axes. This is equivalent to saying that the coordinate system's center is moved and the coordinate axes are rotated to satisfy these properties. In the diagram, the original Template:Mvar-coordinate system with origin Template:Mvar is moved to the Template:Mvar-coordinate system with origin Template:Mvar.

File:Conic ref syst.svg
Translating and rotating coordinates

The translation is by the vector <math>\mathbf{t} = \begin{pmatrix} x_c \\ y_c \end{pmatrix}.</math>

The rotation by angle Template:Mvar can be carried out by diagonalizing the matrix Template:Math. Thus, if <math>\lambda_1</math> and <math>\lambda_2</math> are the eigenvalues of the matrix A33, the centered equation can be rewritten in new variables Template:Mvar and Template:Mvar as<ref>Template:Harvnb</ref>

<math display="block">\lambda_1 x'^2 + \lambda_2 y'^2 = - \frac{\det A_Q}{\det A_{33}}.</math>

Dividing by <math>K = -\frac{\det A_Q}{\det A_{33}}</math> we obtain a standard canonical form.

For example, for an ellipse this form is <math display="block">\frac{{x'}^2}{a^2} + \frac{{y'}^2}{b^2} = 1.</math> From here we get Template:Math and Template:Math, the lengths of the semi-major and semi-minor axes in conventional notation.

For central conics, both eigenvalues are non-zero and the classification of the conic sections can be obtained by examining them.<ref>Template:Harvnb</ref>

AxesEdit

By the principal axis theorem, the two eigenvectors of the matrix of the quadratic form of a central conic section (ellipse or hyperbola) are perpendicular (orthogonal to each other) and each is parallel to (in the same direction as) either the major or minor axis of the conic. The eigenvector having the smallest eigenvalue (in absolute value) corresponds to the major axis.<ref>Template:Harvnb</ref>

Specifically, if a central conic section has center Template:Math and an eigenvector of Template:Math is given by Template:Math then the principal axis (major or minor) corresponding to that eigenvector has equation, <math display="block">

\frac{x-x_c}{v_1} = \frac{y-y_c}{v_2}.

</math>

VerticesEdit

The vertices of a central conic can be determined by calculating the intersections of the conic and its axes — in other words, by solving the system consisting of the quadratic conic equation and the linear equation for alternately one or the other of the axes. Two or no vertices are obtained for each axis, since, in the case of the hyperbola, the minor axis does not intersect the hyperbola at a point with real coordinates. However, from the broader view of the complex plane, the minor axis of an hyperbola does intersect the hyperbola, but at points with complex coordinates.<ref>Template:Citation</ref>

Poles and polarsEdit

Template:Main article Using homogeneous coordinates,<ref>This permits the algebraic inclusion of infinite points and a line at infinity which are necessary to have for some of the following results</ref> the points<ref>This section follows Template:Citation</ref> <math display="block">\mathbf{p} = \begin{pmatrix} p_0 \\ p_1 \\ p_2 \end{pmatrix} </math> and <math>\mathbf{r} = \begin{pmatrix} r_0 \\ r_1 \\ r_2 \end{pmatrix} </math> are conjugate with respect to the conic Template:Mvar provided <math display="block"> \mathbf{p}^\mathsf{T} A_Q \mathbf{r} = 0.</math>

The conjugates of a fixed point Template:Math either form a line or consist of all the points in the plane of the conic. When the conjugates of Template:Math form a line, the line is called the polar of Template:Math and the point Template:Math is called the pole of the line, with respect to the conic. This relationship between points and lines is called a polarity.

If the conic is non-degenerate, the conjugates of a point always form a line and the polarity defined by the conic is a bijection between the points and lines of the extended plane containing the conic (that is, the plane together with the points and line at infinity).

If the point Template:Math lies on the conic Template:Mvar, the polar line of Template:Math is the tangent line to Template:Mvar at Template:Math.

The equation, in homogeneous coordinates, of the polar line of the point Template:Math with respect to the non-degenerate conic Template:Mvar is given by <math display="block"> \mathbf{p}^T A_Q \begin{pmatrix} x \\ y \\ z \end{pmatrix} = 0.</math>

Just as Template:Math uniquely determines its polar line (with respect to a given conic), so each line determines a unique pole Template:Math. Furthermore, a point Template:Math is on a line Template:Math which is the polar of a point Template:Math, if and only if the polar of Template:Math passes through the point Template:Math (La Hire's theorem).<ref>Template:Harvnb</ref> Thus, this relationship is an expression of geometric duality between points and lines in the plane.

Several familiar concepts concerning conic sections are directly related to this polarity. The center of a non-degenerate conic can be identified as the pole of the line at infinity. A parabola, being tangent to the line at infinity, would have its center being a point on the line at infinity. Hyperbolas intersect the line at infinity in two distinct points and the polar lines of these points are the asymptotes of the hyperbola and are the tangent lines to the hyperbola at these points of infinity. Also, the polar line of a focus of the conic is its corresponding directrix.<ref>Template:Citation</ref>

TangentsEdit

Let line Template:Math be the polar line of point Template:Math with respect to the non-degenerate conic Template:Mvar. By La Hire's theorem, every line passing through Template:Math has its pole on Template:Math. If Template:Math intersects Template:Mvar in two points (the maximum possible) then the polars of those points are tangent lines that pass through Template:Math and such a point is called an exterior or outer point of Template:Mvar. If Template:Math intersects Template:Mvar in only one point, then it is a tangent line and Template:Math is the point of tangency. Finally, if Template:Math does not intersect Template:Mvar then Template:Math has no tangent lines passing through it and it is called an interior or inner point.<ref>Interpreted in the complex plane such a point is on two complex tangent lines that meet Template:Mvar in complex points.</ref>

The equation of the tangent line (in homogeneous coordinates) at a point Template:Mvar on the non-degenerate conic Template:Mvar is given by,

<math display="block"> \mathbf{p}^\mathsf{T} A_Q \begin{pmatrix} x \\ y \\ z \end{pmatrix} = 0. </math>

If Template:Math is an exterior point, first find the equation of its polar (the above equation) and then the intersections of that line with the conic, say at points Template:Math and Template:Math. The polars of Template:Math and Template:Math will be the tangents through Template:Math.

Using the theory of poles and polars, the problem of finding the four mutual tangents of two conics reduces to finding the intersection of two conics.

See alsoEdit

NotesEdit

Template:Reflist

ReferencesEdit

Template:Matrix classes