Four-momentum

Revision as of 18:40, 9 May 2025 by imported>Eulersidentity (Fixes formatting)
(diff) ← Previous revision | Latest revision (diff) | Newer revision → (diff)

Template:Short description Template:Special relativity sidebar

In special relativity, four-momentum (also called momentum–energy or momenergy<ref>Template:Cite book</ref>) is the generalization of the classical three-dimensional momentum to four-dimensional spacetime. Momentum is a vector in three dimensions; similarly four-momentum is a four-vector in spacetime. The contravariant four-momentum of a particle with relativistic energy Template:Mvar and three-momentum Template:Math, where Template:Math is the particle's three-velocity and Template:Mvar the Lorentz factor, is <math display="block">p = \left(p^0 , p^1 , p^2 , p^3\right) = \left(\frac E c , p_x , p_y , p_z\right).</math>

The quantity Template:Math of above is the ordinary non-relativistic momentum of the particle and Template:Mvar its rest mass. The four-momentum is useful in relativistic calculations because it is a Lorentz covariant vector. This means that it is easy to keep track of how it transforms under Lorentz transformations.

Minkowski normEdit

Calculating the Minkowski norm squared of the four-momentum gives a Lorentz invariant quantity equal (up to factors of the speed of light Template:Math) to the square of the particle's proper mass:

<math display="block">p \cdot p = \eta_{\mu\nu} p^\mu p^\nu = p_\nu p^\nu = -{E^2 \over c^2} + |\mathbf p|^2 = -m^2 c^2</math> where the following denote:

<math display="inline">p </math>, the four-momentum vector of a particle,

<math display="inline">p \cdot p</math>, the Minkowski inner product of the four-momentum with itself,

<math display="inline">p^\mu </math>and <math display="inline">p^\nu </math>, the contravariant components of the four-momentum vector,

<math display="inline">p_\nu </math>, the covariant form,

<math display="inline">E </math>, the energy of the particle,

<math display="inline">c </math>, the speed of light,

<math display="inline">|\mathbf p| </math>, the magnitude of the four-momentum vector,

<math display="inline">m </math>, the invariant mass (rest) of the particle,

and <math display="block"> \eta_{\mu\nu} = \begin{pmatrix}

 -1 & 0 & 0 & 0\\
  0 & 1 & 0 & 0\\
  0 & 0 & 1 & 0\\
  0 & 0 & 0 & 1

\end{pmatrix} </math> is the metric tensor of special relativity with metric signature for definiteness chosen to be Template:Math. The negativity of the norm reflects that the momentum is a timelike four-vector for massive particles. The other choice of signature would flip signs in certain formulas (like for the norm here). This choice is not important, but once made it must for consistency be kept throughout.

The Minkowski norm is Lorentz invariant, meaning its value is not changed by Lorentz transformations/boosting into different frames of reference. More generally, for any two four-momenta Template:Mvar and Template:Mvar, the quantity Template:Math is invariant.

Relation to four-velocityEdit

For a massive particle, the four-momentum is given by the particle's invariant mass Template:Mvar multiplied by the particle's four-velocity, <math display="block">p^\mu = m u^\mu,</math> where the four-velocity Template:Mvar is <math display="block"> u = \left(u^0 , u^1 , u^2 , u^3\right) = \gamma_v \left(c , v_x , v_y , v_z\right), </math> and <math display="block">\gamma_v := \frac{1}{\sqrt{1 - \frac{v^2}{c^2}}}</math> is the Lorentz factor (associated with the speed <math>v</math>), Template:Math is the speed of light.

DerivationEdit

There are several ways to arrive at the correct expression for four-momentum. One way is to first define the four-velocity Template:Math and simply define Template:Math, being content that it is a four-vector with the correct units and correct behavior. Another, more satisfactory, approach is to begin with the principle of least action and use the Lagrangian framework to derive the four-momentum, including the expression for the energy.<ref>Template:Harvnb</ref> One may at once, using the observations detailed below, define four-momentum from the action Template:Mvar. Given that in general for a closed system with generalized coordinates Template:Math and canonical momenta Template:Math,<ref>Template:Harvnb</ref> <math display="block">p_i = \frac{\partial S}{\partial q_i} = \frac{\partial S}{\partial x_i}, \quad E = -\frac{\partial S}{\partial t} = - c \cdot \frac{\partial S}{\partial x^{0}},</math> it is immediate (recalling Template:Math, Template:Math, Template:Math, Template:Math and Template:Math, Template:Math, Template:Math, Template:Math in the present metric convention) that <math display="block">p_\mu =\frac{\partial S}{\partial x^\mu} = \left(-{E \over c}, \mathbf p\right)</math> is a covariant four-vector with the three-vector part being the canonical momentum. Template:Hidden begin Consider initially a system of one degree of freedom Template:Mvar. In the derivation of the equations of motion from the action using Hamilton's principle, one finds (generally) in an intermediate stage for the variation of the action, <math display="block">\delta S = \left. \left[ \frac{\partial L}{\partial \dot q}\delta q\right]\right|_{t_1}^{t_2} + \int_{t_1}^{t_2} \left( \frac{\partial L}{\partial q} - \frac{d}{dt} \frac{\partial L}{\partial \dot q}\right)\delta q dt.</math>

The assumption is then that the varied paths satisfy Template:Math, from which Lagrange's equations follow at once. When the equations of motion are known (or simply assumed to be satisfied), one may let go of the requirement Template:Math. In this case the path is assumed to satisfy the equations of motion, and the action is a function of the upper integration limit Template:Math, but Template:Math is still fixed. The above equation becomes with Template:Math, and defining Template:Math, and letting in more degrees of freedom, <math display="block">\delta S = \sum_i \frac{\partial L}{\partial \dot{q}_i}\delta q_i = \sum_i p_i \delta q_i.</math>

Observing that <math display="block">\delta S = \sum_i \frac{\partial S}{\partial {q}_i}\delta q_i,</math> one concludes <math display="block">p_i = \frac{\partial S}{\partial q_i}.</math>

In a similar fashion, keep endpoints fixed, but let Template:Math vary. This time, the system is allowed to move through configuration space at "arbitrary speed" or with "more or less energy", the field equations still assumed to hold and variation can be carried out on the integral, but instead observe <math display="block">\frac{dS}{dt} = L</math> by the fundamental theorem of calculus. Compute using the above expression for canonical momenta, <math display="block">

 \frac{dS}{dt} = \frac{\partial S}{\partial t} + \sum_i \frac{\partial S}{\partial q_i}\dot{q}_i =
 \frac{\partial S}{\partial t} + \sum_i p_i\dot{q}_i = L.

</math>

Now using <math display="block">H = \sum_i p_i \dot{q}_i - L,</math> where Template:Mvar is the Hamiltonian, leads to, since Template:Math in the present case, <math display="block">E = H = -\frac{\partial S}{\partial t}.</math>

Incidentally, using Template:Math with Template:Math in the above equation yields the Hamilton–Jacobi equations. In this context, Template:Mvar is called Hamilton's principal function.


Template:Hidden end

The action Template:Mvar is given by <math display="block">S = -mc\int ds = \int L dt, \quad L = -mc^2\sqrt{1 - \frac{v^2}{c^2}},</math> where Template:Mvar is the relativistic Lagrangian for a free particle. From this, Template:Hidden begin The variation of the action is <math display="block">\delta S = -mc\int \delta ds.</math>

To calculate Template:Math, observe first that Template:Math and that <math display="block">\delta ds^2

 = \delta \eta_{\mu\nu}dx^\mu dx^\nu
 = \eta_{\mu\nu} \left(\delta \left(dx^\mu\right) dx^\nu + dx^\mu \delta \left(dx^\nu\right)\right)
 = 2\eta_{\mu\nu} \delta \left(dx^\mu\right) dx^\nu.

</math>

So <math display="block">\delta ds = \eta_{\mu\nu} \delta dx^\mu \frac{dx^\nu}{ds} = \eta_{\mu\nu} d\delta x^\mu \frac{dx^\nu}{ds},</math> or <math display="block">\delta ds = \eta_{\mu\nu} \frac{d\delta x^\mu}{d\tau} \frac{dx^\nu}{cd\tau}d\tau,</math> and thus <math display="block">\delta S =

 -m\int \eta_{\mu\nu} \frac{d\delta x^\mu}{d\tau} \frac{dx^\nu}{d\tau}d\tau =
 -m\int \eta_{\mu\nu} \frac{d\delta x^\mu}{d\tau} u^\nu d\tau =
 -m\int \eta_{\mu\nu} \left[\frac{d}{d\tau} \left(\delta x^\mu u^\nu\right) - \delta x^\mu\frac{d}{d\tau}u^\nu\right] d\tau

</math> which is just <math display="block">\delta S = \left[-mu_\mu\delta x^\mu\right]_{t_1}^{t_2} + m \int_{t_1}^{t_2} \delta x^\mu\frac{du_\mu}{ds}ds</math>


Template:Hidden end <math display="block">\delta S = \left[ -mu_\mu\delta x^\mu\right]_{t_1}^{t_2} + m\int_{t_1}^{t_2}\delta x^\mu\frac{du_\mu}{ds}ds = -mu_\mu\delta x^\mu = \frac{\partial S}{\partial x^\mu}\delta x^\mu = -p_\mu\delta x^\mu,</math>

where the second step employs the field equations Template:Math, Template:Math, and Template:Math as in the observations above. Now compare the last three expressions to find <math display="block">p^\mu = \partial^\mu[S] = \frac{\partial S}{\partial x_\mu} = mu^\mu = m\left(\frac{c}{\sqrt{1 - \frac{v^2}{c^2}}}, \frac{v_x}{\sqrt{1 - \frac{v^2}{c^2}}}, \frac{v_y}{\sqrt{1 - \frac{v^2}{c^2}}}, \frac{v_z}{\sqrt{1 - \frac{v^2}{c^2}}}\right),</math> with norm Template:Math, and the famed result for the relativistic energy,

Template:Equation box 1} = m_{r}c^2,</math>

|cellpadding = 6
|border
|border colour = #0073CF
|bgcolor = #F9FFF7

}}

where Template:Math is the now unfashionable relativistic mass, follows. By comparing the expressions for momentum and energy directly, one has

Template:Equation box 1

that holds for massless particles as well. Squaring the expressions for energy and three-momentum and relating them gives the energy–momentum relation,

Template:Equation box 1

Substituting <math display="block">p_\mu \leftrightarrow -\frac{\partial S}{\partial x^\mu}</math> in the equation for the norm gives the relativistic Hamilton–Jacobi equation,<ref>Template:Harvnb</ref>

Template:Equation box 1

It is also possible to derive the results from the Lagrangian directly. By definition,<ref>Template:Harvnb</ref> <math display="block">\begin{align}

 \mathbf p &= \frac{\partial L}{\partial \mathbf v} 
            = \left({\partial L\over \partial \dot x}, {\partial L\over\partial \dot y}, {\partial L\over\partial \dot z}\right)
            = m(\gamma v_x, \gamma v_y, \gamma v_z) = m\gamma \mathbf v
            = m \mathbf u , \\[3pt]
         E &= \mathbf p \cdot \mathbf v - L = \frac{mc^2}{\sqrt{1 - \frac{v^2}{c^2}}},

\end{align}</math> which constitute the standard formulae for canonical momentum and energy of a closed (time-independent Lagrangian) system. With this approach it is less clear that the energy and momentum are parts of a four-vector.

The energy and the three-momentum are separately conserved quantities for isolated systems in the Lagrangian framework. Hence four-momentum is conserved as well. More on this below.

More pedestrian approaches include expected behavior in electrodynamics.<ref>Template:Harvnb</ref> In this approach, the starting point is application of Lorentz force law and Newton's second law in the rest frame of the particle. The transformation properties of the electromagnetic field tensor, including invariance of electric charge, are then used to transform to the lab frame, and the resulting expression (again Lorentz force law) is interpreted in the spirit of Newton's second law, leading to the correct expression for the relativistic three- momentum. The disadvantage, of course, is that it isn't immediately clear that the result applies to all particles, whether charged or not, and that it doesn't yield the complete four-vector.

It is also possible to avoid electromagnetism and use well tuned experiments of thought involving well-trained physicists throwing billiard balls, utilizing knowledge of the velocity addition formula and assuming conservation of momentum.<ref>Template:Harvnb</ref><ref>Template:Harvnb Wikisource version</ref> This too gives only the three-vector part.

Conservation of four-momentumEdit

As shown above, there are three conservation laws (not independent, the last two imply the first and vice versa):

  • The four-momentum Template:Mvar (either covariant or contravariant) is conserved.
  • The total energy Template:Math is conserved.
  • The 3-space momentum <math>\mathbf{p} = \left(p^1, p^2, p^3\right)</math> is conserved (not to be confused with the classic non-relativistic momentum <math>m\mathbf{v}</math>).

Note that the invariant mass of a system of particles may be more than the sum of the particles' rest masses, since kinetic energy in the system center-of-mass frame and potential energy from forces between the particles contribute to the invariant mass. As an example, two particles with four-momenta Template:Nowrap and Template:Nowrap each have (rest) mass 3Template:NbspGeV/c2 separately, but their total mass (the system mass) is 10Template:NbspGeV/c2. If these particles were to collide and stick, the mass of the composite object would be 10Template:NbspGeV/c2.

One practical application from particle physics of the conservation of the invariant mass involves combining the four-momenta Template:Math and Template:Math of two daughter particles produced in the decay of a heavier particle with four-momentum Template:Math to find the mass of the heavier particle. Conservation of four-momentum gives Template:Math, while the mass Template:Math of the heavier particle is given by Template:Math. By measuring the energies and three-momenta of the daughter particles, one can reconstruct the invariant mass of the two-particle system, which must be equal to Template:Mvar. This technique is used, e.g., in experimental searches for Z′ bosons at high-energy particle colliders, where the Z′ boson would show up as a bump in the invariant mass spectrum of electronpositron or muon–antimuon pairs.

If the mass of an object does not change, the Minkowski inner product of its four-momentum and corresponding four-acceleration Template:Math is simply zero. The four-acceleration is proportional to the proper time derivative of the four-momentum divided by the particle's mass, so <math display="block">p^\mu A_\mu = \eta_{\mu\nu} p^\mu A^\nu = \eta_{\mu\nu} p^\mu \frac{d}{d\tau} \frac{p^{\nu}}{m} = \frac{1}{2m} \frac{d}{d\tau} p \cdot p = \frac{1}{2m} \frac{d}{d\tau} \left(-m^2c^2\right) = 0 .</math>

Canonical momentum in the presence of an electromagnetic potentialEdit

For a charged particle of charge Template:Math, moving in an electromagnetic field given by the electromagnetic four-potential: <math display="block"> A = \left(A^0 , A^1 , A^2 , A^3\right) = \left({\phi \over c}, A_x , A_y , A_z\right) </math> where Template:Mvar is the scalar potential and Template:Math the vector potential, the components of the (not gauge-invariant) canonical momentum four-vector Template:Mvar is <math display="block"> P^\mu = p^\mu + q A^\mu. </math>

This, in turn, allows the potential energy from the charged particle in an electrostatic potential and the Lorentz force on the charged particle moving in a magnetic field to be incorporated in a compact way, in relativistic quantum mechanics.

Four-momentum in curved spacetimeEdit

In the case when there is a moving physical system with a continuous distribution of matter in curved spacetime, the primary expression for four-momentum is a four-vector with covariant index: <ref>Template:Cite journal</ref>

<math> P_\mu = \left(\frac {E} {c} , -\mathbf P \right).</math>

Four-momentum <math> P_\mu</math> is expressed through the energy <math> E</math> of physical system and relativistic momentum <math> \mathbf P </math>. At the same time, the four-momentum <math> P_\mu </math> can be represented as the sum of two non-local four-vectors of integral type:

<math> P_\mu = p_\mu + K_\mu.</math>

Four-vector <math> p_\mu</math> is the generalized four-momentum associated with the action of fields on particles; four-vector <math> K_\mu</math> is the four-momentum of the fields arising from the action of particles on the fields.

Energy <math> E</math> and momentum <math> \mathbf P </math>, as well as components of four-vectors <math> p_\mu</math> and <math> K_\mu</math> can be calculated if the Lagrangian density <math> \mathcal{L} =\mathcal{L}_p + \mathcal{L}_f </math> of the system is given. The following formulas are obtained for the energy and momentum of the system:

<math> E = \int_{V} \frac {\partial}{\partial \mathbf v} \left( \frac { \mathcal{L}_p }{u^0} \right) \cdot \mathbf v u^0 \sqrt {-g} dx^1 dx^2 dx^3 -\int_{V} \left (\mathcal{L}_p + \mathcal{L}_f \right ) \sqrt {-g} dx^1 dx^2 dx^3 +\sum_{n=1}^N \left( \mathbf v_n \cdot \frac {\partial L_f}{\partial \mathbf v_n}\right ) .</math>
<math> \mathbf P = \int_{V} \frac {\partial}{\partial \mathbf v} \left( \frac { \mathcal{L}_p }{u^0} \right) u^0 \sqrt {-g} dx^1 dx^2 dx^3 +\sum_{n=1}^N \frac {\partial L_f}{\partial \mathbf v_n} .</math>

Here <math> \mathcal{L}_p </math> is that part of the Lagrangian density that contains terms with four-currents; <math> \mathbf v </math> is the velocity of matter particles; <math> u^0 </math> is the time component of four-velocity of particles; <math> g </math> is determinant of metric tensor; <math> L_f = \int_{V} \mathcal{L}_f \sqrt {-g} dx^1 dx^2 dx^3 </math> is the part of the Lagrangian associated with the Lagrangian density <math> \mathcal{ L}_f </math>; <math> \mathbf v_n </math> is velocity of a particle of matter with number <math> n </math>.

See alsoEdit

Template:Portal

ReferencesEdit

Template:Reflist