{{Short description|Vector behavior under coordinate changes}} {{About||use of "covariance" in the context of special relativity|Lorentz covariance|other uses of "covariant" or "contravariant"|Covariance and contravariance (disambiguation)}} [[File:Vector 1-form.svg|upright=1.35|thumb| A {{color box|#CC0000}} vector, '''v''', represented in terms of {{glossary}} {{term|tangent basis}} {{defn|{{color box|orange}} '''e'''{{sub|1}}, '''e'''{{sub|2}}, '''e'''{{sub|3}} to the {{color box|black}} coordinate curves (''left''),}} {{term|dual basis, covector basis, or reciprocal basis}} {{defn|{{color box|blue}} '''e'''{{sup|1}}, '''e'''{{sup|2}}, '''e'''{{sup|3}} to {{color box|#4B545B}} coordinate surfaces (''right''),}} {{glossary end}} in [[three-dimensional|3-d]] general ''[[curvilinear coordinates]]'' {{nowrap|(''q''{{sup|1}}, ''q''{{sup|2}}, ''q''{{sup|3}})}}, a [[tuple]] of numbers that defines a point in a [[position space]]. Note that the basis and cobasis coincide only when the basis is [[orthonormal basis|orthonormal]].<ref> {{cite book | title=Gravitation |title-link=Gravitation (book) | first3=J.A. |last3=Wheeler | first1=C. |last1=Misner | first2=K.S. |last2=Thorne | publisher=W.H. Freeman | year=1973 | isbn=0-7167-0344-0 }}</ref>{{specify}} ]] In [[physics]], especially in [[multilinear algebra]] and [[tensor analysis]], '''covariance''' and '''contravariance''' describe how the quantitative description of certain geometric or physical entities changes with a [[change of basis]].<ref>{{Cite book |last=Frankel |first=Theodore |url=https://www.worldcat.org/oclc/739094283 |title=The geometry of physics : an introduction |date=2012 |publisher=Cambridge University Press |isbn=978-1-107-60260-1 |page=42 |location=Cambridge |oclc=739094283}}</ref> Briefly, a contravariant vector is a list of numbers that transforms oppositely to a change of basis, and a covariant vector is a list of numbers that transforms in the same way. Contravariant vectors are often just called ''vectors'' and covariant vectors are called ''covectors'' or ''dual vectors''. The terms ''covariant'' and ''contravariant'' were introduced by [[James Joseph Sylvester]] in 1851.<ref>{{cite book |first=J.J. |last=Sylvester |chapter=On the general theory of associated algebraical forms |chapter-url=https://books.google.com/books?id=mZ9EAAAAcAAJ&pg=PA289 |title=Cambridge and Dublin Mathematical Journal |date=1851 |volume=6 |pages=289–293}}</ref><ref>{{Cite book |title=The collected mathematical papers of James Joseph Sylvester |volume=3, 1870–1883 |last=Sylvester |first=J.J. |date=16 February 2012 |publisher=Cambridge University Press |isbn=978-1107661431 |oclc=758983870}}</ref> [[Curvilinear coordinate system]]s, such as [[cylindrical coordinates|cylindrical]] or [[spherical coordinates]], are often used in physical and geometric problems. Associated with any coordinate system is a natural choice of coordinate basis for vectors based at each point of the space, and covariance and contravariance are particularly important for understanding how the coordinate description of a vector changes by passing from one coordinate system to another. [[Tensor]]s are objects in [[multilinear algebra]] that can have aspects of both covariance and contravariance.
==Introduction== In physics, a vector typically arises as the outcome of a measurement or series of measurements, and is represented as a list (or [[tuple]]) of numbers such as :<math>(v_1,v_2,v_3) .</math> The numbers in the list depend on the choice of [[coordinate system]]. For instance, if the vector represents position with respect to an observer ([[position vector]]), then the coordinate system may be obtained from a system of rigid rods, or reference axes, along which the components ''v''<sub>1</sub>, ''v''<sub>2</sub>, and ''v''<sub>3</sub> are measured. For a vector to represent a geometric object, it must be possible to describe how it looks in any other coordinate system. That is to say, the components of the vector will ''transform'' in a certain way in passing from one coordinate system to another. A simple illustrative case is that of a [[Euclidean vector]]. Once a set of basis vectors has been defined, the components of such a vector will always vary ''oppositely'' to the basis vectors, and the vector is therefore called a ''contravariant'' tensor. Take a standard position vector for example. By changing the scale of the reference axes from meters to centimeters (that is, ''dividing'' the scale of the reference axes by 100, so that the basis vectors are now <math>0.01</math> meters long), the components of the measured [[Position (geometry)|position]] [[Vector (mathematics and physics)|vector]] are ''multiplied'' by 100. A vector's components change scale ''inversely'' to changes in the scale of the reference axes, with example transformations including [[rotation (mathematics)|rotation]] and [[Dilation (metric space)|dilation]]. [[Active and passive transformation#Passive transformation|The vector itself does not change under these operations]]; instead, the components of the vector change in a way that cancels the change in the spatial axes. In other words, if the reference axes were rotated in one direction, the component representation of the vector would rotate in exactly the opposite way. Similarly, if the reference axes were stretched in one direction, the components of the vector would shrink in an exactly compensating way. Mathematically, if the coordinate system undergoes a transformation described by an <math>n\times n</math> [[invertible matrix]] ''M'', so that the basis vectors transform according to <math>\begin{bmatrix} \mathbf{e}_1^\prime\ \mathbf{e}_2^\prime\ \cdots \ \mathbf{e}_n^\prime\end{bmatrix}=\begin{bmatrix} \mathbf{e}_1\ \mathbf{e}_2\ \cdots \ \mathbf{e}_n\end{bmatrix}M</math>, then the components <math>v^i</math> of a vector '''v''' in the original basis must transform via <math>\begin{bmatrix} v^1{^\prime} \\ v^2{^\prime} \\ \vdots \\ v^n{^\prime} \end{bmatrix}=M^{-1}\begin{bmatrix} v^1 \\ v^2 \\ \vdots \\ v^n \end{bmatrix}</math>. The components of a ''vector'' are often represented arranged in a column. By contrast, a ''covector'' has components that transform like the reference axes themselves. It lives in the dual vector space, and represents a linear map from vectors to scalars. The operation of taking the dot product with a fixed vector is a good example of a covector. To illustrate, assume we have a covector defined as <math>\mathbf{v}\ \cdot</math> , where <math>\mathbf{v}</math> is a vector.
The components of this covector in some arbitrary basis are <math>\begin{bmatrix} \mathbf{v}\cdot\mathbf{e}_1 & \mathbf{v}\cdot\mathbf{e}_2 & \cdots & \mathbf{v}\cdot\mathbf{e}_n \end{bmatrix}</math>, with <math>\begin{bmatrix} \mathbf{e}_1\ \mathbf{e}_2\ \cdots \ \mathbf{e}_n\end{bmatrix}</math> being the basis vectors in the corresponding vector space. (These components can be derived by requiring that the covector give the correct value of the dot product when applied to an arbitrary vector <math>\mathbf{w}</math> with components <math>\begin{bmatrix} w^1 \\ w^2 \\ \vdots \\ w^n \end{bmatrix}</math>.) The covariance of these covector components is then seen by noting that if a transformation described by an <math>n\times n</math> [[invertible matrix]] ''M'' were applied to the basis vectors of the corresponding vector space, <math>\begin{bmatrix} \mathbf{e}_1^\prime\ \mathbf{e}_2^\prime\ \cdots \ \mathbf{e}_n^\prime\end{bmatrix}=\begin{bmatrix} \mathbf{e}_1\ \mathbf{e}_2\ \cdots \ \mathbf{e}_n\end{bmatrix}M</math>, then the components of the covector <math>\mathbf{v}\ \cdot</math> would transform with the same matrix <math>M</math>, namely, <math>\begin{bmatrix} \mathbf{v}\cdot\mathbf{e}_1^\prime & \mathbf{v}\cdot\mathbf{e}_2^\prime & \cdots & \mathbf{v}\cdot\mathbf{e}_n^\prime \end{bmatrix}=\begin{bmatrix} \mathbf{v}\cdot\mathbf{e}_1 & \mathbf{v}\cdot\mathbf{e}_2 & \cdots & \mathbf{v}\cdot\mathbf{e}_n \end{bmatrix}M</math>. The components of a ''covector'' are often represented arranged in a row. A third concept related to covariance and contravariance is [[Invariant (physics)|invariance]]. A [[scalar (physics)|scalar]] (also called a type-0 or rank-0 tensor) is an object that does not vary with the change in basis. An example of a physical [[observable]] that is a scalar is the [[mass]] of a particle. The single, scalar value of mass is independent of changes in basis vectors and is consequently called ''invariant''. The magnitude of a vector (such as [[Euclidean distance|distance]]) is another example of an invariant, because it remains fixed even as the vector's components vary. (For example, for a position vector of length <math>3</math> meters, if all [[Cartesian coordinate system|Cartesian]] basis vectors are changed from <math>1</math> meter in length to <math>0.01</math> meters in length, the length of the position vector remains unchanged at <math>3</math> meters, although the vector components all increase by a factor of <math>100</math>.) The scalar product of a vector and a covector is invariant, because one has components that vary with the base change, and the other has components that vary oppositely, and the two effects cancel out. One thus says that covectors are ''dual'' to vectors. Thus, to summarize: * A vector or [[tangent vector]] has components that ''contra-vary'' with a change of basis to compensate. That is, the matrix that transforms the vector components must be the inverse of the matrix that transforms the basis vectors. The components of vectors (as opposed to those of covectors) are said to be '''contravariant'''. In [[Einstein notation]] (implicit summation over a repeated index), contravariant components are denoted with ''upper indices'' as in *:<math>\mathbf{v} = v^i \mathbf{e}_i</math> * A covector or [[cotangent vector]] has components that ''co-vary'' with a change of basis in the corresponding (initial) vector space. That is, the components must be transformed by the same matrix as the change of basis matrix in the corresponding (initial) vector space.
The components of covectors (as opposed to those of vectors) are said to be '''covariant'''. In [[Einstein notation]], covariant components are denoted with ''lower indices'' as in *:<math>\mathbf{w} = w_i \mathbf{e}^i.</math> * The scalar product of a vector and covector is the scalar <math>v^iw_i</math>, which is invariant. It is the duality pairing of vectors and covectors.
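These transformation rules are easy to check numerically. The following minimal sketch (in Python with NumPy; the particular basis, change-of-basis matrix <math>M</math>, and vectors are arbitrary choices for illustration, not taken from the text) verifies that vector components transform with <math>M^{-1}</math>, covector components transform with <math>M</math>, and the scalar pairing is invariant:

<syntaxhighlight lang="python">
import numpy as np

# Columns of E are the basis vectors e_1, e_2 expressed in fixed
# background coordinates (an arbitrary illustrative choice).
E = np.array([[1.0, 0.5],
              [0.0, 1.0]])
M = np.array([[2.0, 1.0],
              [0.0, 3.0]])   # an arbitrary invertible change-of-basis matrix

E_new = E @ M                # new basis vectors: e'_j = sum_i e_i M^i_j

v = np.array([5.0, 7.0])     # contravariant components in the old basis
v_new = np.linalg.inv(M) @ v # vector components transform with M^{-1}

# The geometric vector itself is unchanged by the change of basis:
assert np.allclose(E @ v, E_new @ v_new)

# A covector "take the dot product with w": its components form a row
# alpha_i = w . e_i, which transforms with M itself (covariantly).
w = np.array([1.0, -2.0])
alpha = w @ E
alpha_new = alpha @ M
assert np.allclose(alpha_new, w @ E_new)

# The scalar pairing alpha(v) = alpha_i v^i is invariant:
assert np.allclose(alpha @ v, alpha_new @ v_new)
</syntaxhighlight>

The opposite transformation laws for <code>v</code> and <code>alpha</code> cancel in the pairing, which is exactly the invariance of the scalar product described above.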
==Definition== The general formulation of covariance and contravariance refers to how the components of a coordinate vector transform under a [[change of basis]] ([[Active and passive transformation|passive transformation]]).<ref>{{cite book|title=Ricci calculus|page=6|author=J A Schouten|year=1954|publisher=Springer|edition=2}}</ref> Thus let ''V'' be a [[vector space]] of dimension ''n'' over a [[field (mathematics)|field]] of [[scalar (mathematics)|scalars]] ''S'', and let each of {{nowrap|1='''f''' = (''X''<sub>1</sub>, ..., ''X''<sub>''n''</sub>)}} and {{nowrap|1='''f'''′ = (''Y''<sub>1</sub>, ..., ''Y''<sub>''n''</sub>)}} be a [[basis of a vector space|basis]] of ''V''.<ref group="note">A basis '''f''' may here profitably be viewed as a [[linear isomorphism]] from '''R'''<sup>''n''</sup> to ''V''. Regarding '''f''' as a row vector whose entries are the elements of the basis, the associated linear isomorphism is then <math>\mathbf{x}\mapsto \mathbf{f}\mathbf{x}.</math></ref> Also, let the [[change of basis]] from '''f''' to '''f'''′ be given by {{NumBlk|:|<math>\mathbf{f}\mapsto \mathbf{f}' = \biggl(\sum_i a^i_1X_i,\dots,\sum_i a^i_nX_i\biggr) = \mathbf{f}A</math>|{{EquationRef|1}}}} for some [[invertible matrix|invertible]] ''n''×''n'' matrix ''A'' with entries <math>a^i_j</math>. Here, each vector ''Y''<sub>''j''</sub> of the '''f'''′ basis is a linear combination of the vectors ''X''<sub>''i''</sub> of the '''f''' basis, so that :<math>Y_j=\sum_i a^i_jX_i,</math> which are the columns of the matrix product <math>\mathbf f A</math>. ===Contravariant transformation=== {{Main|Covariant_transformation#Contravariant_transformation|l1=Contravariant transformation}} A vector <math>v</math> in ''V'' is expressed uniquely as a [[linear combination]] of the elements <math>X_i</math> of the '''f''' basis as {{NumBlk|:|<math>v=\sum_i v^i[\mathbf{f}]X_i,</math>|{{EquationRef|2}}}} where ''v''{{i sup|''i''}}['''f'''] are elements of the field ''S'' known as the '''components''' of ''v'' in the '''f''' basis. Denote the [[column vector]] of components of ''v'' by '''v'''['''f''']: :<math>\mathbf{v}[\mathbf{f}] = \begin{bmatrix}v^1[\mathbf{f}]\\v^2[\mathbf{f}]\\\vdots\\v^n[\mathbf{f}]\end{bmatrix}</math> so that ({{EquationNote|2}}) can be rewritten as a matrix product :<math>v = \mathbf{f}\, \mathbf{v}[\mathbf{f}].</math> The vector ''v'' may also be expressed in terms of the '''f'''′ basis, so that :<math>v = \mathbf{f'}\, \mathbf{v}[\mathbf{f'}].</math> However, since the vector ''v'' itself is invariant under the choice of basis, :<math>\mathbf{f}\, \mathbf{v}[\mathbf{f}] = v = \mathbf{f'}\, \mathbf{v}[\mathbf{f'}].</math> The invariance of ''v'' combined with the relationship ({{EquationNote|1}}) between '''f''' and '''f'''′ implies that :<math>\mathbf{f}\, \mathbf{v}[\mathbf{f}] = \mathbf{f}A\, \mathbf{v}[\mathbf{f}A],</math> giving the transformation rule :<math>\mathbf{v}[\mathbf{f'}] = \mathbf{v}[\mathbf{f}A] = A^{-1}\mathbf{v}[\mathbf{f}].</math> In terms of components, :<math>v^i[\mathbf{f}A] = \sum_j \tilde{a}^i_jv^j[\mathbf{f}]</math> where the coefficients <math>\tilde{a}^i_j</math> are the entries of the [[inverse matrix]] of ''A''. Because the components of the vector ''v'' transform with the ''inverse'' of the matrix ''A'', these components are said to '''transform contravariantly''' under a change of basis. The way ''A'' relates the two pairs is depicted in the following informal diagram using an arrow. The reversal of the arrow indicates a contravariant change: :<math>\begin{align} \mathbf{f} &\longrightarrow \mathbf{f'} \\ v[\mathbf{f}] &\longleftarrow v[\mathbf{f'}] \end{align}</math> ===Covariant transformation=== {{main|Covariant transformation}} A [[linear functional]] ''α'' on ''V'' is expressed uniquely in terms of its '''components''' (elements in ''S'') in the '''f''' basis as :<math>\alpha(X_i) = \alpha_i[\mathbf{f}] , \quad i=1,2,\dots,n.</math> These components are the action of ''α'' on the basis vectors ''X''<sub>''i''</sub> of the '''f''' basis. Under the change of basis from '''f''' to '''f'''′ (via {{EquationNote|1}}), the components transform so that {{NumBlk|:|<math>\begin{align} \alpha_i[\mathbf{f}A] &= \alpha(Y_i) \\ &= \alpha\biggl(\sum_j a^j_i X_j\biggr) \\ &= \sum_j a^j_i \alpha(X_j) \\ &= \sum_j a^j_i \alpha_j[\mathbf{f}]. \end{align}</math>|{{EquationRef|3}}}} Denote the [[row vector]] of components of ''α'' by ''α''['''f''']: :<math>\mathbf{\alpha}[\mathbf{f}] = \begin{bmatrix}\alpha_1[\mathbf{f}],\alpha_2[\mathbf{f}],\dots,\alpha_n[\mathbf{f}]\end{bmatrix}</math> so that ({{EquationNote|3}}) can be rewritten as the matrix product :<math>\alpha[\mathbf{f}A] = \alpha[\mathbf{f}]A.</math> Because the components of the linear functional α transform with the matrix ''A'', these components are said to '''transform covariantly''' under a change of basis. The way ''A'' relates the two pairs is depicted in the following informal diagram using an arrow.
A covariant relationship is indicated since the arrows travel in the same direction: :<math>\begin{align} \mathbf{f} &\longrightarrow \mathbf{f'} \\ \alpha[\mathbf{f}] &\longrightarrow \alpha[\mathbf{f'}] \end{align}</math> Had a column vector representation been used instead, the transformation law would be the [[matrix transpose|transpose]] :<math>\alpha^\mathrm{T}[\mathbf{f}A] = A^\mathrm{T}\alpha^\mathrm{T}[\mathbf{f}].</math> ==Coordinates== The choice of basis '''f''' on the vector space ''V'' uniquely defines a set of coordinate functions on ''V'', by means of :<math>x^i[\mathbf{f}](v) = v^i[\mathbf{f}].</math> The coordinates on ''V'' are therefore contravariant in the sense that :<math>x^i[\mathbf{f}A] = \sum_{k=1}^n \tilde{a}^i_kx^k[\mathbf{f}].</math> Conversely, a system of ''n'' quantities ''v''<sup>''i''</sup> that transform like the coordinates ''x''<sup>''i''</sup> on ''V'' defines a contravariant vector (or simply vector). A system of ''n'' quantities that transform oppositely to the coordinates is then a covariant vector (or covector). This formulation of contravariance and covariance is often more natural in applications in which there is a coordinate space (a [[manifold]]) on which vectors live as [[tangent vector]]s or [[cotangent vector]]s. Given a local coordinate system ''x''<sup>''i''</sup> on the manifold, the reference axes for the coordinate system are the [[vector field]]s :<math>X_1 = \frac{\partial}{\partial x^1},\dots,X_n=\frac{\partial}{\partial x^n}.</math> This gives rise to the frame {{nowrap|1='''f''' = (''X''<sub>1</sub>, ..., ''X''<sub>''n''</sub>)}} at every point of the coordinate patch. If ''y''<sup>''i''</sup> is a different coordinate system and :<math>Y_1=\frac{\partial}{\partial y^1},\dots,Y_n = \frac{\partial}{\partial y^n},</math> then the frame '''f'''′ is related to the frame '''f''' by the inverse of the [[Jacobian matrix]] of the coordinate transition: :<math>\mathbf{f}' = \mathbf{f}J^{-1},\quad J = \left(\frac{\partial y^i}{\partial x^j}\right)_{i,j=1}^n.</math> Or, in indices, :<math>\frac{\partial}{\partial y^i} = \sum_{j=1}^n\frac{\partial x^j}{\partial y^i}\frac{\partial}{\partial x^j}.</math> A tangent vector is by definition a vector that is a linear combination of the coordinate partials <math>\partial/\partial x^i</math>. Thus a tangent vector is defined by :<math>v = \sum_{i=1}^n v^i[\mathbf{f}] X_i = \mathbf{f}\ \mathbf{v}[\mathbf{f}].</math> Such a vector is contravariant with respect to change of frame. Under changes in the coordinate system, one has :<math>\mathbf{v}\left[\mathbf{f}'\right] = \mathbf{v}\left[\mathbf{f}J^{-1}\right] = J\, \mathbf{v}[\mathbf{f}].</math> Therefore, the components of a tangent vector transform via :<math>v^i\left[\mathbf{f}'\right] = \sum_{j=1}^n \frac{\partial y^i}{\partial x^j}v^j[\mathbf{f}].</math> Accordingly, a system of ''n'' quantities ''v''<sup>''i''</sup> depending on the coordinates that transform in this way on passing from one coordinate system to another is called a contravariant vector.
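To make the abstract rule concrete, the following sketch (Python with NumPy; the polar-coordinate change and the particular base point are illustrative choices, not part of the derivation above) verifies numerically that tangent-vector components transform with the Jacobian <math>\partial y^i/\partial x^j</math>:

<syntaxhighlight lang="python">
import numpy as np

# Coordinate change: Cartesian x = (x1, x2) -> polar y = (r, theta).
def to_polar(x):
    return np.array([np.hypot(x[0], x[1]), np.arctan2(x[1], x[0])])

def jacobian(x):
    # J^i_j = dy^i/dx^j evaluated at the point x
    r = np.hypot(x[0], x[1])
    return np.array([[ x[0] / r,     x[1] / r   ],
                     [-x[1] / r**2,  x[0] / r**2]])

x = np.array([3.0, 4.0])        # base point (here the manifold is R^2)
v_cart = np.array([1.0, 2.0])   # tangent-vector components in the Cartesian frame

v_polar = jacobian(x) @ v_cart  # contravariant rule: v^i[f'] = (dy^i/dx^j) v^j[f]

# Cross-check against a finite-difference directional derivative of y(x):
eps = 1e-6
numeric = (to_polar(x + eps * v_cart) - to_polar(x)) / eps
assert np.allclose(v_polar, numeric, atol=1e-4)
</syntaxhighlight>

The finite-difference check works because a tangent vector acts on the coordinate functions as a directional derivative, which is precisely the sense in which its components "transform like the coordinates".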
==Covariant and contravariant components of a vector with a metric== [[File:Covariantcomponents.gif|thumb|upright=1.75|Covariant and contravariant components of a vector when the basis is not orthogonal.]] In a finite-dimensional [[vector space]] ''V'' over a field ''K'' with a non-degenerate symmetric [[bilinear form]] {{nowrap|''g'' : ''V'' × ''V'' → ''K''}} (which may be referred to as the [[metric tensor]]), there is little distinction between covariant and contravariant vectors, because the [[bilinear form]] allows covectors to be identified with vectors. That is, a vector ''v'' uniquely determines a covector ''α'' via :<math>\alpha(w) = g(v, w) </math> for all vectors ''w''. Conversely, each covector ''α'' determines a unique vector ''v'' by this equation. Because of this identification of vectors with covectors, one may speak of the '''covariant components''' or '''contravariant components''' of a vector, that is, they are just representations of the same vector using the [[Dual basis|reciprocal basis]]. Given a basis {{nowrap|1='''f''' = (''X''<sub>1</sub>, ..., ''X''<sub>''n''</sub>)}} of ''V'', there is a unique reciprocal basis {{nowrap|1='''f'''<sup>#</sup> = (''Y''<sup>1</sup>, ..., ''Y''<sup>''n''</sup>)}} of ''V'' determined by requiring that :<math>g(Y^i,X_j) = \delta^i_j,</math> the [[Kronecker delta]]. In terms of these bases, any vector ''v'' can be written in two ways: :<math>\begin{align} v &= \sum_i v^i[\mathbf{f}]X_i = \mathbf{f}\,\mathbf{v}[\mathbf{f}]\\ &=\sum_i v_i[\mathbf{f}]Y^i = \mathbf{f}^\sharp\mathbf{v}^\sharp[\mathbf{f}]. \end{align} </math> The components ''v''<sup>''i''</sup>['''f'''] are the '''contravariant components''' of the vector ''v'' in the basis '''f''', and the components ''v''<sub>''i''</sub>['''f'''] are the '''covariant components''' of ''v'' in the basis '''f'''. The terminology is justified because under a change of basis, :<math>\mathbf{v}[\mathbf{f}A] = A^{-1}\mathbf{v}[\mathbf{f}],\quad \mathbf{v}^\sharp[\mathbf{f}A] = A^T\mathbf{v}^\sharp[\mathbf{f}]</math> where <math>A</math> is an invertible <math>n\times n</math> matrix, and the [[matrix transpose]] has its usual meaning. ===Euclidean plane=== In the Euclidean plane, the [[dot product]] allows vectors to be identified with covectors. If <math>\mathbf{e}_1,\mathbf{e}_2</math> is a basis, then the dual basis <math>\mathbf{e}^1,\mathbf{e}^2</math> satisfies :<math>\begin{align} \mathbf{e}^1\cdot\mathbf{e}_1 = 1, &\quad \mathbf{e}^1\cdot\mathbf{e}_2 = 0 \\ \mathbf{e}^2\cdot\mathbf{e}_1 = 0, &\quad \mathbf{e}^2\cdot\mathbf{e}_2 = 1. \end{align}</math> Thus, '''e'''<sup>1</sup> and '''e'''<sub>2</sub> are perpendicular to each other, as are '''e'''<sup>2</sup> and '''e'''<sub>1</sub>, and the lengths of '''e'''<sup>1</sup> and '''e'''<sup>2</sup> are normalized against '''e'''<sub>1</sub> and '''e'''<sub>2</sub>, respectively. ====Example==== For example,<ref>{{cite book |last1=Bowen |first1=Ray |first2=C.-C. |last2=Wang |chapter=§3.14 Reciprocal Basis and Change of Basis |chapter-url=https://books.google.com/books?id=Hr5bhIVWr4wC&pg=PA76 |title=Introduction to Vectors and Tensors |year=2008 |orig-year=1976 |publisher=Dover |pages=78, 79, 81 |isbn=9780486469140}}</ref> suppose that we are given a basis '''e'''<sub>1</sub>, '''e'''<sub>2</sub> consisting of a pair of vectors making a 45° angle with one another, such that '''e'''<sub>1</sub> has length 2 and '''e'''<sub>2</sub> has length 1.
Then the dual basis vectors are given as follows: * '''e'''<sup>2</sup> is the result of rotating '''e'''<sub>1</sub> through an angle of 90° (where the sense is measured by assuming the pair '''e'''<sub>1</sub>, '''e'''<sub>2</sub> to be positively oriented), and then rescaling so that {{nowrap|'''e'''<sup>2</sup> ⋅ '''e'''<sub>2</sub> {{=}} 1}} holds. * '''e'''<sup>1</sup> is the result of rotating '''e'''<sub>2</sub> through an angle of 90°, and then rescaling so that {{nowrap|'''e'''<sup>1</sup> ⋅ '''e'''<sub>1</sub> {{=}} 1}} holds. Applying these rules, we find :<math>\mathbf{e}^1 = \frac{1}{2}\mathbf{e}_1 - \frac{1}{\sqrt{2}}\mathbf{e}_2</math> and :<math>\mathbf{e}^2 = -\frac{1}{\sqrt{2}}\mathbf{e}_1 + 2\mathbf{e}_2.</math> Thus the change of basis matrix in going from the original basis to the reciprocal basis is :<math>R = \begin{bmatrix} \frac{1}{2} & -\frac{1}{\sqrt{2}} \\ -\frac{1}{\sqrt{2}} & 2 \end{bmatrix},</math> since :<math>[\mathbf{e}^1\ \mathbf{e}^2] = [\mathbf{e}_1\ \mathbf{e}_2]\begin{bmatrix} \frac{1}{2} & -\frac{1}{\sqrt{2}} \\ -\frac{1}{\sqrt{2}} & 2 \end{bmatrix}.</math> For instance, the vector :<math>v = \frac{3}{2}\mathbf{e}_1 + 2\mathbf{e}_2</math> is a vector with contravariant components :<math>v^1 = \frac{3}{2},\quad v^2 = 2.</math> The covariant components are obtained by equating the two expressions for the vector ''v'': :<math>v = v_1\mathbf{e}^1 + v_2\mathbf{e}^2 = v^1\mathbf{e}_1 + v^2\mathbf{e}_2</math> so :<math>\begin{align} \begin{bmatrix}v_1\\ v_2\end{bmatrix} &= R^{-1}\begin{bmatrix}v^1 \\ v^2\end{bmatrix} \\ &= \begin{bmatrix}4 & \sqrt{2} \\ \sqrt{2} & 1\end{bmatrix} \begin{bmatrix}v^1 \\ v^2\end{bmatrix} \\ &= \begin{bmatrix}6 + 2\sqrt{2} \\ 2 + \frac{3}{\sqrt{2}}\end{bmatrix} \end{align}.</math>
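This worked example can be checked numerically. The sketch below (Python with NumPy) uses one convenient Cartesian realization of the basis, consistent with the stated lengths and the 45° angle; any other realization gives the same component values:

<syntaxhighlight lang="python">
import numpy as np

# A Cartesian realization of the example: |e1| = 2, |e2| = 1, 45 deg apart.
e1 = np.array([2.0, 0.0])
e2 = np.array([np.cos(np.pi / 4), np.sin(np.pi / 4)])

G = np.array([[e1 @ e1, e1 @ e2],
              [e2 @ e1, e2 @ e2]])   # covariant metric g_ij = e_i . e_j
Ginv = np.linalg.inv(G)              # equals the matrix R above

# Dual basis via the inverse metric: e^i = g^ij e_j
d1 = Ginv[0, 0] * e1 + Ginv[0, 1] * e2
d2 = Ginv[1, 0] * e1 + Ginv[1, 1] * e2
assert np.allclose([d1 @ e1, d1 @ e2, d2 @ e1, d2 @ e2], [1, 0, 0, 1])

# The vector v = (3/2) e1 + 2 e2 and its covariant components v_i = v . e_i:
v = 1.5 * e1 + 2.0 * e2
v_co = np.array([v @ e1, v @ e2])
assert np.allclose(v_co, [6 + 2 * np.sqrt(2), 2 + 3 / np.sqrt(2)])
</syntaxhighlight>

The final assertion reproduces the covariant components <math>\bigl(6+2\sqrt{2},\ 2+\tfrac{3}{\sqrt{2}}\bigr)</math> computed above, here obtained directly from the dot products <math>v_i = \mathbf{v}\cdot\mathbf{e}_i</math>.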
===Three-dimensional Euclidean space=== In the three-dimensional [[Euclidean space]], one can also determine explicitly the dual basis to a given set of [[basis vector]]s '''e'''<sub>1</sub>, '''e'''<sub>2</sub>, '''e'''<sub>3</sub> of ''E''<sub>3</sub> that are not necessarily assumed to be orthogonal or of unit norm. The dual basis vectors are: :<math> \mathbf{e}^1 = \frac{\mathbf{e}_2 \times \mathbf{e}_3}{\mathbf{e}_1 \cdot (\mathbf{e}_2 \times \mathbf{e}_3)} ; \qquad \mathbf{e}^2 = \frac{\mathbf{e}_3 \times \mathbf{e}_1}{\mathbf{e}_2 \cdot (\mathbf{e}_3 \times \mathbf{e}_1)}; \qquad \mathbf{e}^3 = \frac{\mathbf{e}_1 \times \mathbf{e}_2}{\mathbf{e}_3 \cdot (\mathbf{e}_1 \times \mathbf{e}_2)}. </math> Even when the '''e'''<sub>i</sub> and '''e'''<sup>i</sup> are not [[Orthonormality|orthonormal]], they are still mutually reciprocal: :<math>\mathbf{e}^i \cdot \mathbf{e}_j = \delta^i_j.</math> Then the contravariant components of any vector '''v''' can be obtained by the [[dot product]] of '''v''' with the dual basis vectors: :<math> q^1 = \mathbf{v} \cdot \mathbf{e}^1; \qquad q^2 = \mathbf{v} \cdot \mathbf{e}^2; \qquad q^3 = \mathbf{v} \cdot \mathbf{e}^3 .</math> Likewise, the covariant components of '''v''' can be obtained from the dot product of '''v''' with the basis vectors, viz. :<math> q_1 = \mathbf{v} \cdot \mathbf{e}_1; \qquad q_2 = \mathbf{v} \cdot \mathbf{e}_2; \qquad q_3 = \mathbf{v} \cdot \mathbf{e}_3 .</math> Then '''v''' can be expressed in two (reciprocal) ways, viz. :<math> \mathbf{v} = q^i \mathbf{e}_i = q^1 \mathbf{e}_1 + q^2 \mathbf{e}_2 + q^3 \mathbf{e}_3 </math> or :<math> \mathbf{v} = q_i \mathbf{e}^i = q_1 \mathbf{e}^1 + q_2 \mathbf{e}^2 + q_3 \mathbf{e}^3 .</math> Combining the above relations, we have :<math> \mathbf{v} = (\mathbf{v} \cdot \mathbf{e}^i) \mathbf{e}_i = (\mathbf{v} \cdot \mathbf{e}_i) \mathbf{e}^i </math> and we can convert between the basis and dual basis with :<math>q_i = \mathbf{v}\cdot \mathbf{e}_i = (q^j \mathbf{e}_j)\cdot \mathbf{e}_i = (\mathbf{e}_j\cdot\mathbf{e}_i) q^j </math> and :<math>q^i = \mathbf{v}\cdot \mathbf{e}^i = (q_j \mathbf{e}^j)\cdot \mathbf{e}^i = (\mathbf{e}^j\cdot\mathbf{e}^i) q_j .</math> If the basis vectors are [[orthonormal]], then they are the same as the dual basis vectors.
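The cross-product construction is likewise easy to exercise numerically. In this sketch (Python with NumPy; the non-orthogonal basis is an arbitrary illustrative choice), the dual basis is built from exactly those formulas and both reciprocal expansions of '''v''' are recovered:

<syntaxhighlight lang="python">
import numpy as np

# An arbitrary non-orthogonal, non-unit basis of R^3 (illustrative choice).
e1 = np.array([2.0, 0.0, 0.0])
e2 = np.array([1.0, 1.0, 0.0])
e3 = np.array([0.5, 0.5, 3.0])

vol = e1 @ np.cross(e2, e3)     # scalar triple product e1 . (e2 x e3)
d1 = np.cross(e2, e3) / vol     # dual basis from the cross-product formulas
d2 = np.cross(e3, e1) / vol
d3 = np.cross(e1, e2) / vol

D = np.array([d1, d2, d3])      # rows: e^1, e^2, e^3
B = np.array([e1, e2, e3])      # rows: e_1, e_2, e_3

# Mutual reciprocity: e^i . e_j = delta^i_j
assert np.allclose(D @ B.T, np.eye(3))

v = np.array([1.0, 2.0, 3.0])   # some vector, in background Cartesian coordinates
q_up = D @ v                    # contravariant components q^i = v . e^i
q_dn = B @ v                    # covariant components     q_i = v . e_i

# Both (reciprocal) expansions reproduce v:
assert np.allclose(q_up @ B, v) # sum_i q^i e_i
assert np.allclose(q_dn @ D, v) # sum_i q_i e^i
</syntaxhighlight>

A single <code>vol</code> suffices for all three denominators because the three scalar triple products above are equal up to cyclic permutation.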
===Vector spaces of any dimension=== The following applies to any vector space of dimension {{math|''n''}} equipped with a non-degenerate, symmetric, bilinear dot product, and thus also to Euclidean spaces of any dimension. All indices in the formulas run from 1 to {{math|''n''}}. The [[Einstein notation]] for the implicit summation of terms with the same upstairs (contravariant) and downstairs (covariant) indices is followed. The historical and geometrical meaning of the terms ''contravariant'' and ''covariant'' is explained at the end of this section. ====Definitions==== # '''Covariant basis''' of a vector space of dimension ''n'': <math>\mathbf{e_j} \triangleq</math> {any basis of ''n'' linearly independent vectors, for which in general <math> \mathbf{e_i} \cdot \mathbf{e_j} \neq \delta_{ij}</math>}, i.e. not necessarily [[orthonormal]] (D.1). # '''Contravariant components''' of a vector <math>\mathbf{v}</math>: <math>v^i \triangleq \{ v^i \mid \mathbf{v} = v^i \mathbf{e_i}\}</math> (D.2). # '''Dual (contravariant) basis''' of a vector space of dimension ''n'': <math>\mathbf{e^i} \triangleq \{ \mathbf{e^i} : \mathbf{e^i} \cdot \mathbf{e_j} = \delta^i_j \}</math> (D.3). # '''Covariant components''' of a vector <math>\mathbf{v}</math>: <math>v_i \triangleq \{ v_i \mid \mathbf{v} = v_i \mathbf{e^i}\}</math> (D.4). # '''Components of the covariant metric tensor''': <math>g_{ij} \triangleq \mathbf{e_i} \cdot \mathbf{e_j}</math>; the metric tensor can be considered a square matrix, since it has only two covariant indices: <math>G \triangleq \{ g_{ij} \}</math>; by the commutativity of the dot product, the <math>g_{ij}</math> are symmetric (D.5). # '''Components of the contravariant metric tensor''': <math>g^{ij} \triangleq \{ h_{ij} : G^{-1} = \{ h_{ij} \}\}</math>; these are the elements of the inverse <math>G^{-1}</math> of the covariant metric tensor/matrix, and since the inverse of a symmetric matrix is symmetric, they are also symmetric (D.6). ====Corollaries==== * <span style="border: 1px solid black; padding: 5px;"><math>g^{ij}g_{jk} = \delta^i_k</math> (1).</span> <br>''Proof:'' from the properties of the inverse matrix (D.6). * <span style="border: 1px solid black; padding: 5px;"><math>\mathbf{e^i} = g^{ij} \mathbf{e_j}</math> (2).</span><br>''Proof:'' suppose that <math>\{A^{ij} \mid \mathbf{e^i} = A^{ij} \mathbf{e_j}\}</math>; we will show that <math>A^{ij} = g^{ij}</math>. Taking the dot product of both sides with <math>\mathbf{e_k}</math>:<div style="display: inline-block; overflow-x: auto; max-width: 100%;"><math>\mathbf{e^i} \cdot \mathbf{e_k} = (A^{ij} \mathbf{e_j}) \cdot \mathbf{e_k} \stackrel{\text{(D.3,D.5)}}{\to} A^{ij}g_{jk} = \delta^i_k</math></div>; multiplying both sides by <math>g^{mk}</math>:<div style="display: inline-block; overflow-x: auto; max-width: 100%;"><math>g^{mk}A^{ij}g_{jk} = g^{mk}\delta^i_k \stackrel{\text{(D.5)}}{\to} g^{mk}g_{kj}A^{ij} = g^{mi} \stackrel{\text{(1)}}{\to} \delta^m_j A^{ij} = g^{mi} \to A^{im} = g^{mi}\stackrel{\text{(D.6)}}{=} g^{im},\ \text{Q.E.D.} \quad \blacksquare</math></div> * <span style="border: 1px solid black; padding: 5px;"><math>\mathbf{e^i} \cdot \mathbf{e^j} = g^{ij}</math> (3).</span><br>''Proof:'' :<div style="display: inline-block; overflow-x: auto; max-width: 100%;"><math>\mathbf{e^i} \stackrel{\text{(2)}}{=} g^{ik} \mathbf{e_k}; \mathbf{e^j} \stackrel{\text{(2)}}{=} g^{jm} \mathbf{e_m} \to \mathbf{e^i} \cdot \mathbf{e^j} = g^{ik} g^{jm} (\mathbf{e_k} \cdot \mathbf{e_m}) \stackrel{\text{(D.5)}}{=} g^{ik}g^{jm}g_{km} \stackrel{\text{(1)}}{=} \delta^i_m g^{jm} = g^{ji} \stackrel{\text{(D.6)}}{=} g^{ij},\ \text{Q.E.D.} \quad \blacksquare</math></div> * <span style="border: 1px solid black; padding: 5px;"><math>\mathbf{e_i} = g_{ij} \mathbf{e^j}</math> (4).</span><br>''Proof:'' suppose that <math>\{B_{ij} \mid \mathbf{e_i} = B_{ij} \mathbf{e^j}\}</math>; we will show that <math>B_{ij} = g_{ij}</math>. Taking the dot product of both sides with <math>\mathbf{e^k}</math>: <math>\mathbf{e_i} \cdot \mathbf{e^k} = (B_{ij} \mathbf{e^j}) \cdot \mathbf{e^k} \stackrel{\text{(D.3,3)}}{\to} B_{ij}g^{jk} = \delta_i^k</math>; multiplying both sides by <math>g_{mk}</math>:<div style="display: inline-block; overflow-x: auto; max-width: 100%;"><math>g_{mk}B_{ij}g^{jk} = g_{mk}\delta_i^k \stackrel{\text{(D.5)}}{\to} g_{mk}g^{kj}B_{ij} = g_{mi} \stackrel{\text{(1)}}{\to} \delta_m^j B_{ij} = g_{mi} \to B_{im} = g_{mi} \stackrel{\text{(D.5)}}{=} g_{im},\ \text{Q.E.D.} \quad \blacksquare</math></div> * <span style="border: 1px solid black; padding: 5px;"><math>v^i = g^{ij} v_j</math> (5).</span><br>''Proof:'' :<div style="display: inline-block; overflow-x: auto; max-width: 100%;"><math>\mathbf{v} \stackrel{\text{(D.2)}}{=} v^i \mathbf{e_i}; \mathbf{v} \stackrel{\text{(D.4)}}{=} v_j \mathbf{e^j} \stackrel{\text{(2)}}{=} v_j(g^{ji} \mathbf{e_i}) \to v^i \mathbf{e_i} = v_j g^{ji} \mathbf{e_i}, \forall i \to v^i = g^{ji}v_j \stackrel{\text{(D.6)}}{=} g^{ij} v_j,\ \text{Q.E.D.} \quad \blacksquare</math></div> * <span style="border: 1px solid black; padding: 5px;"><math>v_i = g_{ij} v^j</math> (6).</span><br>''Proof:'' analogous to (5). * <span style="border: 1px solid black; padding: 5px;"><math>v_i = \mathbf{v} \cdot \mathbf{e_i}</math> (7).</span><br>''Proof:'' :<div style="display: inline-block; overflow-x: auto; max-width: 100%;"><math>\mathbf{v} \cdot \mathbf{e_i} \stackrel{\text{(D.4)}}{=} (v_j \mathbf{e^j}) \cdot \mathbf{e_i} \stackrel{\text{(D.3)}}{=} v_j \delta^j_i = v_i,\ \text{Q.E.D.} \quad \blacksquare</math></div> * <span style="border: 1px solid black; padding: 5px;"><math>v^i = \mathbf{v} \cdot \mathbf{e^i}</math> (8).</span><br>''Proof:'' analogous to (7).
* <span style="border: 1px solid black; padding: 5px;"><math>\mathbf{u} \cdot \mathbf{v} = g_{ij}u^iv^j</math> (9).</span><br>''Proof:'' :<div style="display: inline-block; overflow-x: auto; max-width: 100%;"><math>\mathbf{u} \cdot \mathbf{v} \stackrel{\text{(D.2)}}{=} (u^i\mathbf{e_i}) \cdot (v^j\mathbf{e_j}) = (\mathbf{e_i} \cdot \mathbf{e_j})u^iv^j \stackrel{\text{(D.5)}}{=} g_{ij}u^iv^j,\ \text{Q.E.D.} \quad \blacksquare</math>.</div> * <span style="border: 1px solid black; padding: 5px;"><math>\mathbf{u} \cdot \mathbf{v} = g^{ij}u_iv_j</math> (10).</span><br>''Proof:'' analogous to (9).
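The corollaries above are mechanical to verify numerically. The following small sketch (Python with NumPy; the randomly generated basis is an arbitrary illustrative choice) exercises corollaries (1) and (5)–(10):

<syntaxhighlight lang="python">
import numpy as np

rng = np.random.default_rng(0)
B = rng.normal(size=(3, 3))   # rows: an arbitrary (generically independent) basis
G = B @ B.T                   # covariant metric g_ij = e_i . e_j       (D.5)
Ginv = np.linalg.inv(G)       # contravariant metric g^ij               (D.6)

assert np.allclose(Ginv @ G, np.eye(3))              # corollary (1)

u = rng.normal(size=3)        # two vectors, in background Cartesian coordinates
v = rng.normal(size=3)

v_dn = B @ v                  # covariant components  v_i = v . e_i     (7)
v_up = Ginv @ v_dn            # raising the index     v^i = g^ij v_j    (5)
assert np.allclose(G @ v_up, v_dn)                   # lowering it back (6)

u_up = Ginv @ (B @ u)
# The dot product computed from components, both ways:
assert np.allclose(u @ v, u_up @ G @ v_up)           # g_ij u^i v^j     (9)
assert np.allclose(u @ v, (B @ u) @ Ginv @ (B @ v))  # g^ij u_i v_j     (10)
</syntaxhighlight>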
====Historical and geometrical meaning==== [[File:Covariant and contravariant vector components.svg|thumb|Aid for explaining the geometrical meaning of covariant and contravariant vector components.]] Consider the figure for the case of a Euclidean space with <math>n = 2</math>. Since <math>\mathbf{v} = \mathbf{OA} + \mathbf{OB}</math>, if we want to express <math>\mathbf{v}</math> in terms of the covariant basis, we have to multiply the basis vectors by the coefficients <math>v^1 = \frac{\vert \mathbf{OA} \vert}{\vert \mathbf{e_1} \vert}, v^2 = \frac{\vert \mathbf{OB} \vert}{\vert \mathbf{e_2} \vert}</math>. With <math>\mathbf{v}</math>, and thus <math>\mathbf{OA}</math> and <math>\mathbf{OB}</math>, fixed, if the magnitude of <math>\mathbf{e_i}</math> increases, the value of the component <math>v^i</math> decreases; this is why such components are called ''contra''-variant (with respect to variations in the magnitudes of the basis vectors). Symmetrically, corollary (7) states that the <math>v_i</math> components equal the dot product <math>\mathbf{v} \cdot \mathbf{e_i}</math> between the vector and the covariant basis vectors, and since this is directly proportional to the magnitudes of the basis vectors, such components are called ''co''-variant. If we consider the dual (contravariant) basis instead, the situation is exactly mirrored: the covariant components are ''contra''-variant with respect to the magnitudes of the dual basis vectors, while the contravariant components are ''co''-variant. In the end, the naming is a matter of convention: historically, the first non-[[orthonormal]] basis of the vector space of choice was called "covariant", its dual basis "contravariant", and the corresponding components were named accordingly. If the covariant basis becomes [[orthonormal]], the dual contravariant basis aligns with it and the covariant components coincide with the contravariant ones, which is the most familiar situation when dealing with geometrical Euclidean vectors. <math>G</math> and <math>G^{-1}</math> become the identity matrix <math>I</math>, and: <div style="display: inline-block; overflow-x: auto; max-width: 100%;"><math>g_{ij}=\delta_{ij}, g^{ij}=\delta^{ij}, \mathbf{u} \cdot \mathbf{v} = \delta_{ij}u^iv^j = \sum_{i}u^iv^i = \delta^{ij}u_iv_j = \sum_{i}u_iv_i.</math></div> If the metric is not Euclidean but, for instance, [[Minkowski space#Minkowski metric|Minkowskian]] as in the [[special relativity]] and [[general relativity]] theories, the bases are never orthonormal, even in the case of special relativity, where for <math>n=4</math> both <math>G</math> and <math>G^{-1}</math> become <math>\eta \triangleq \operatorname{diag}(1,-1,-1,-1)</math>. In this scenario, the covariant and contravariant components always differ. ==Use in tensor analysis== The distinction between covariance and contravariance is particularly important for computations with [[tensor]]s, which often have '''mixed variance'''. This means that they have both covariant and contravariant components, or both vector and covector components. The valence of a tensor is the number of covariant and contravariant indices, and in [[Einstein notation]], covariant components have lower indices, while contravariant components have upper indices. The duality between covariance and contravariance intervenes whenever a vector or tensor quantity is represented by its components, although modern [[differential geometry]] uses more sophisticated [[Tensor (intrinsic definition)|index-free methods to represent tensors]]. In [[tensor analysis]], a '''covariant''' vector varies more or less reciprocally to a corresponding contravariant vector. Expressions for lengths, areas and volumes of objects in the vector space can then be given in terms of tensors with covariant and contravariant indices. Under simple expansions and contractions of the coordinates, the reciprocity is exact; under affine transformations the components of a vector intermingle on going between covariant and contravariant expression. On a [[manifold]], a [[tensor field]] will typically have multiple upper and lower indices, where Einstein notation is widely used. When the manifold is equipped with a [[metric tensor|metric]], covariant and contravariant indices become very closely related to one another. Contravariant indices can be turned into covariant indices by [[contraction of a tensor|contracting]] with the metric tensor. The reverse is possible by contracting with the (matrix) inverse of the metric tensor. Note that in general, no such relation exists in spaces not endowed with a metric tensor. Furthermore, from a more abstract standpoint, a tensor is simply "there" and its components of either kind are only calculational artifacts whose values depend on the chosen coordinates. The explanation in geometric terms is that a general tensor will have contravariant indices as well as covariant indices, because it has parts that live in the [[tangent bundle]] as well as the [[cotangent bundle]]. A contravariant vector is one which transforms like <math>\frac{dx^{\mu}}{d\tau}</math>, where <math>x^{\mu} \!</math> are the coordinates of a particle at its [[proper time]] <math>\tau</math>. A covariant vector is one which transforms like <math>\frac{\partial \varphi}{\partial x^{\mu}}</math>, where <math>\varphi</math> is a scalar field. ==Algebra and geometry== In [[category theory]], there are [[covariant functor]]s and [[contravariant functor]]s. The assignment of the [[dual space]] to a vector space is a standard example of a contravariant functor. Contravariant (resp. covariant) vectors are contravariant (resp. covariant) functors from a <math>\text{GL}(n)</math>-[[torsor]] to the fundamental representation of <math>\text{GL}(n)</math>. Similarly, tensors of higher degree are functors with values in other representations of <math>\text{GL}(n)</math>. However, some constructions of [[multilinear algebra]] are of "mixed" variance, which prevents them from being functors. In [[differential geometry]], the components of a vector relative to a basis of the [[tangent bundle]] are covariant if they change with the same linear transformation as a change of basis. They are contravariant if they change by the inverse transformation. This is sometimes a source of confusion for two distinct but related reasons.
The first is that vectors whose components are covariant (called covectors or [[1-form]]s) actually [[pullback (differential geometry)|pull back]] under smooth functions, meaning that the operation assigning the space of covectors to a smooth manifold is actually a ''contravariant'' functor. Likewise, vectors whose components are contravariant [[pushforward (differential)|push forward]] under smooth mappings, so the operation assigning the space of (contravariant) vectors to a smooth manifold is a ''covariant'' functor. Secondly, in the classical approach to differential geometry, it is not bases of the tangent bundle that are the most primitive object, but rather changes in the coordinate system. Vectors with contravariant components transform in the same way as changes in the coordinates (because these actually change oppositely to the induced change of basis). Likewise, vectors with covariant components transform in the opposite way as changes in the coordinates. ==See also== * [[Active and passive transformation]] * [[Mixed tensor]] * [[Two-point tensor]], a generalization allowing indices to reference multiple vector bases ==Notes== {{Reflist|group="note"}} ==Citations== {{Reflist}} ==References== {{refbegin}} * {{citation |last1=Kusse |first1=Bruce R. |last2=Westwig |first2=Erik A. |title=Mathematical Physics: Applied Mathematics for Scientists and Engineers |edition=2nd |publisher=Wiley |year=2010 |isbn=978-3-527-61814-9}}. * {{citation |last1=Arfken |first1=George B.|authorlink1=George B. Arfken |first2=Hans J. |last2=Weber |title=Mathematical Methods for Physicists |edition=6th |publisher=Harcourt |year=2005 |isbn=0-12-059876-0}}. * {{Citation | last1=Dodson | first1=C. T. J. | last2=Poston | first2=T. | title=Tensor geometry | publisher=Springer | edition=2nd | series=Graduate Texts in Mathematics | isbn=978-3-540-52018-4 | mr=1223091 | year=1991 | volume=130}}. * {{Citation | last1=Greub | first1=Werner Hildbert | title=Multilinear algebra | publisher=Springer | series=Die Grundlehren der Mathematischen Wissenschaften, Band 136 | mr=0224623 | year=1967 | bibcode=1967mual.book.....G |isbn=9780387038278}}. * {{Citation | last1=Sternberg | first1=Shlomo | author1-link=Shlomo Sternberg | title=Lectures on differential geometry | publisher=Chelsea | isbn=978-0-8284-0316-0 | year=1983}}. * {{citation |doi=10.1098/rstl.1853.0018 |first=J.J. |last=Sylvester |jstor=108572 |title=On a Theory of the Syzygetic Relations of Two Rational Integral Functions, Comprising an Application to the Theory of Sturm's Functions, and That of the Greatest Algebraical Common Measure |year=1853 |journal=Philosophical Transactions of the Royal Society of London |volume=143 |pages=407–548 |url=https://zenodo.org/record/1432412 |doi-access=free }}. 
* {{Citation | last1=Weinreich | first1=Gabriel | title=Geometrical Vectors | publisher=The University of Chicago Press | series=Chicago Lectures in Physics | year=1998 | isbn=9780226890487 | pages=126}} {{refend}} ==External links== * {{springer|title=Covariant tensor|id=p/c026880}} * {{springer|title=Contravariant tensor|id=p/c025970}} * {{MathWorld | urlname=CovariantTensor | title=Covariant Tensor}} * {{MathWorld | urlname=ContravariantTensor | title=Contravariant Tensor}} * [http://www.mathpages.com/home/kmath398/kmath398.htm Invariance, Contravariance, and Covariance] * {{cite web |first1=Kees |last1=Dullemond |first2=Kasper |last2=Peeters |title=Introduction to tensor calculus |date=2010 |publisher= |url=http://www.ita.uni-heidelberg.de/~dullemond/lectures/tensor/tensor.pdf}} {{tensors}} [[Category:Tensors]] [[Category:Differential geometry]] [[Category:Riemannian geometry]] [[Category:Vectors (mathematics and physics)]]