Template:Short description Template:Redirect Template:Quantum mechanics Template:Quantum field theory
In physics, specifically relativistic quantum mechanics (RQM) and its applications to particle physics, relativistic wave equations predict the behavior of particles at high energies and velocities comparable to the speed of light. In the context of quantum field theory (QFT), the equations determine the dynamics of quantum fields. The solutions to the equations, universally denoted as Template:Math or Template:Math (Greek psi), are referred to as "wave functions" in the context of RQM, and "fields" in the context of QFT. The equations themselves are called "wave equations" or "field equations", because they have the mathematical form of a wave equation or are generated from a Lagrangian density and the field-theoretic Euler–Lagrange equations (see classical field theory for background).
In the Schrödinger picture, the wave function or field is the solution to the Schrödinger equation, <math display="block">
i\hbar\frac{\partial}{\partial t}\psi = \hat{H} \psi,
</math> one of the postulates of quantum mechanics. All relativistic wave equations can be constructed by specifying various forms of the Hamiltonian operator Ĥ describing the quantum system. Alternatively, Feynman's path integral formulation uses a Lagrangian rather than a Hamiltonian operator.
More generally – the modern formalism behind relativistic wave equations is Lorentz group theory, wherein the spin of the particle has a correspondence with the representations of the Lorentz group.<ref name="T Jaroszewicz, P.S Kurzepa"> Template:Cite journal</ref>
HistoryEdit
Early 1920s: Classical and quantum mechanicsEdit
The failure of classical mechanics applied to molecular, atomic, and nuclear systems and smaller induced the need for a new mechanics: quantum mechanics. The mathematical formulation was led by De Broglie, Bohr, Schrödinger, Pauli, and Heisenberg, and others, around the mid-1920s, and at that time was analogous to that of classical mechanics. The Schrödinger equation and the Heisenberg picture resemble the classical equations of motion in the limit of large quantum numbers and as the reduced Planck constant Template:Math, the quantum of action, tends to zero. This is the correspondence principle. At this point, special relativity was not fully combined with quantum mechanics, so the Schrödinger and Heisenberg formulations, as originally proposed, could not be used in situations where the particles travel near the speed of light, or when the number of each type of particle changes (this happens in real particle interactions; the numerous forms of particle decays, annihilation, matter creation, pair production, and so on).
Late 1920s: Relativistic quantum mechanics of spin-0 and spin-1/2 particlesEdit
A description of quantum mechanical systems which could account for relativistic effects was sought for by many theoretical physicists from the late 1920s to the mid-1940s.<ref name="Esposito">Template:Cite journal</ref> The first basis for relativistic quantum mechanics, i.e. special relativity applied with quantum mechanics together, was found by all those who discovered what is frequently called the Klein–Gordon equation: Template:NumBlk by inserting the energy operator and momentum operator into the relativistic energy–momentum relation: Template:NumBlk
The solutions to (Template:EquationNote) are scalar fields. The KG equation is undesirable due to its prediction of negative energies and probabilities, as a result of the quadratic nature of (Template:EquationNote) – inevitable in a relativistic theory. This equation was initially proposed by Schrödinger, and he discarded it for such reasons, only to realize a few months later that its non-relativistic limit (what is now called the Schrödinger equation) was still of importance. Nevertheless, (Template:EquationNote) is applicable to spin-0 bosons.<ref>Template:Cite book</ref>
Neither the non-relativistic nor relativistic equations found by Schrödinger could predict the fine structure in the Hydrogen spectral series. The mysterious underlying property was spin. The first two-dimensional spin matrices (better known as the Pauli matrices) were introduced by Pauli in the Pauli equation; the Schrödinger equation with a non-relativistic Hamiltonian including an extra term for particles in magnetic fields, but this was phenomenological. Weyl found a relativistic equation in terms of the Pauli matrices; the Weyl equation, for massless spin-1/2 fermions. The problem was resolved by Dirac in the late 1920s, when he furthered the application of equation (Template:EquationNote) to the electron – by various manipulations he factorized the equation into the form Template:NumBlk and one of these factors is the Dirac equation (see below), upon inserting the energy and momentum operators. For the first time, this introduced new four-dimensional spin matrices Template:Math and Template:Math in a relativistic wave equation, and explained the fine structure of hydrogen. The solutions to (Template:EquationNote) are multi-component spinor fields, and each component satisfies (Template:EquationNote). A remarkable result of spinor solutions is that half of the components describe a particle while the other half describe an antiparticle; in this case the electron and positron. The Dirac equation is now known to apply for all massive spin-1/2 fermions. In the non-relativistic limit, the Pauli equation is recovered, while the massless case results in the Weyl equation.
Although a landmark in quantum theory, the Dirac equation is only true for spin-1/2 fermions, and still predicts negative energy solutions, which caused controversy at the time (in particular – not all physicists were comfortable with the "Dirac sea" of negative energy states).
1930s–1960s: Relativistic quantum mechanics of higher-spin particlesEdit
The natural problem became clear: to generalize the Dirac equation to particles with any spin; both fermions and bosons, and in the same equations their antiparticles (possible because of the spinor formalism introduced by Dirac in his equation, and then-recent developments in spinor calculus by van der Waerden in 1929), and ideally with positive energy solutions.<ref name="Esposito"/>
This was introduced and solved by Majorana in 1932, by a deviated approach to Dirac. Majorana considered one "root" of (Template:EquationNote): Template:NumBlk where Template:Math is a spinor field, now with infinitely many components, irreducible to a finite number of tensors or spinors, to remove the indeterminacy in sign. The matrices Template:Math and Template:Math are infinite-dimensional matrices, related to infinitesimal Lorentz transformations. He did not demand that each component of Template:EquationNote satisfy equation (Template:EquationNote); instead he regenerated the equation using a Lorentz-invariant action, via the principle of least action, and application of Lorentz group theory.<ref>Template:Cite journal</ref><ref name = "Bekaert, Traubenberg, Valenzuela">Template:Cite journal</ref>
Majorana produced other important contributions that were unpublished, including wave equations of various dimensions (5, 6, and 16). They were anticipated later (in a more involved way) by de Broglie (1934), and Duffin, Kemmer, and Petiau (around 1938–1939) see Duffin–Kemmer–Petiau algebra. The Dirac–Fierz–Pauli formalism was more sophisticated than Majorana's, as spinors were new mathematical tools in the early twentieth century, although Majorana's paper of 1932 was difficult to fully understand; it took Pauli and Wigner some time to understand it, around 1940.<ref name="Esposito"/>
Dirac in 1936, and Fierz and Pauli in 1939, built equations from irreducible spinors Template:Math and Template:Math, symmetric in all indices, for a massive particle of spin Template:Nobr for integer Template:Math (see Van der Waerden notation for the meaning of the dotted indices): Template:NumBlk A_{\epsilon_1\epsilon_2\cdots\epsilon_n}^{\dot{\alpha}\dot{\beta}_1\dot{\beta}_2\cdots\dot{\beta}_n} = mcB_{\gamma\epsilon_1\epsilon_2\cdots\epsilon_n}^{\dot{\beta}_1\dot{\beta}_2\cdots\dot{\beta}_n}, </math>|Template:EquationRef}} Template:NumBlk B_{\gamma\epsilon_1\epsilon_2\cdots\epsilon_n}^{\dot{\beta}_1\dot{\beta}_2\cdots\dot{\beta}_n} = mcA_{\epsilon_1\epsilon_2\cdots\epsilon_n}^{\dot{\alpha}\dot{\beta}_1\dot{\beta}_2\cdots\dot{\beta}_n}, </math>|Template:EquationRef}} where Template:Math is the momentum as a covariant spinor operator. For Template:Math, the equations reduce to the coupled Dirac equations, and Template:Math and Template:Math together transform as the original Dirac spinor. Eliminating either Template:Math or Template:Math shows that Template:Math and Template:Math each fulfill (Template:EquationNote).<ref name="Esposito"/> The direct derivation of the Dirac–Pauli–Fierz equations using the Bargmann–Wigner operators is given by Isaev and Podoinitsyn.<ref> Template:Cite journal</ref>
In 1941, Rarita and Schwinger focussed on spin-3/2 particles and derived the Rarita–Schwinger equation, including a Lagrangian to generate it, and later generalized the equations analogous to spin Template:Math for integer Template:Math. In 1945, Pauli suggested Majorana's 1932 paper to Bhabha, who returned to the general ideas introduced by Majorana in 1932. Bhabha and Lubanski proposed a completely general set of equations by replacing the mass terms in (Template:EquationNote) and (Template:EquationNote) by an arbitrary constant, subject to a set of conditions which the wave functions must obey.<ref>Template:Cite journal</ref>
Finally, in the year 1948 (the same year as Feynman's path integral formulation was cast), Bargmann and Wigner formulated the general equation for massive particles which could have any spin, by considering the Dirac equation with a totally symmetric finite-component spinor, and using Lorentz group theory (as Majorana did): the Bargmann–Wigner equations.<ref name="Esposito"/><ref>Template:Cite journal</ref> In the early 1960s, a reformulation of the Bargmann–Wigner equations was made by H. Joos and Steven Weinberg, the Joos–Weinberg equation. Various theorists at this time did further research in relativistic Hamiltonians for higher spin particles.<ref name="T Jaroszewicz, P.S Kurzepa"/><ref name="E.A. Jeffery 1978"> Template:Cite journal</ref><ref> Template:Cite journal</ref>
1960s–presentEdit
The relativistic description of spin particles has been a difficult problem in quantum theory. It is still an area of the present-day research because the problem is only partially solved; including interactions in the equations is problematic, and paradoxical predictions (even from the Dirac equation) are still present.<ref name = "Bekaert, Traubenberg, Valenzuela"/>
Linear equationsEdit
The following equations have solutions which satisfy the superposition principle, that is, the wave functions are additive.
Throughout, the standard conventions of tensor index notation and Feynman slash notation are used, including Greek indices which take the values 1, 2, 3 for the spatial components and 0 for the timelike component of the indexed quantities. The wave functions are denoted Template:Math, and Template:Math are the components of the four-gradient operator.
In matrix equations, the Pauli matrices are denoted by Template:Math in which Template:Math, where Template:Math is the Template:Math identity matrix: <math display="block">\sigma^0 = \begin{pmatrix} 1&0 \\ 0&1 \\ \end{pmatrix} </math> and the other matrices have their usual representations. The expression <math display="block">\sigma^\mu \partial_\mu \equiv \sigma^0 \partial_0 + \sigma^1 \partial_1 + \sigma^2 \partial_2 + \sigma^3 \partial_3 </math> is a Template:Math matrix operator which acts on 2-component spinor fields.
The gamma matrices are denoted by Template:Math, in which again Template:Math, and there are a number of representations to select from. The matrix Template:Math is not necessarily the Template:Math identity matrix. The expression <math display="block">i\hbar \gamma^\mu \partial_\mu + mc \equiv i\hbar(\gamma^0 \partial_0 + \gamma^1 \partial_1 + \gamma^2 \partial_2 + \gamma^3 \partial_3) + mc \begin{pmatrix}1&0&0&0\\ 0&1&0&0 \\ 0&0&1&0 \\ 0&0&0&1 \end{pmatrix} </math> is a Template:Math matrix operator which acts on 4-component spinor fields.
Note that terms such as "Template:Math" scalar multiply an identity matrix of the relevant dimension, the common sizes are Template:Math or Template:Math, and are conventionally not written for simplicity.
Particle spin quantum number s | Name | Equation | Typical particles the equation describes |
---|---|---|---|
0 | Klein–Gordon equation | <math>(\hbar \partial_{\mu} + imc)(\hbar \partial^{\mu} -imc)\psi = 0</math> | Massless or massive spin-0 particle (such as Higgs bosons). |
1/2 | Weyl equation | <math> \sigma^\mu\partial_\mu \psi=0</math> | Massless spin-1/2 particles. |
Dirac equation | <math>\left( i \hbar \partial\!\!\!/ - m c \right) \psi = 0 </math> | Massive spin-1/2 particles (such as electrons). | |
Two-body Dirac equations | <math>[(\gamma_1)_\mu (p_1-\tilde{A}_1)^\mu+m_1 + \tilde{S}_1]\Psi=0,</math>Template:Br
<math>[(\gamma_2)_\mu (p_2-\tilde{A}_2)^\mu+m_2 + \tilde{S}_2]\Psi=0.</math> |
Massive spin-1/2 particles (such as electrons). | |
Majorana equation | <math> i \hbar \partial\!\!\!/ \psi - m c \psi_c = 0</math> | Massive Majorana particles. | |
Breit equation | <math> i\hbar\frac{\partial \Psi}{\partial t} = \left(\sum_{i}\hat{H}_{D}(i) + \sum_{i>j}\frac{1}{r_{ij}} - \sum_{i>j}\hat{B}_{ij} \right) \Psi </math> | Two massive spin-1/2 particles (such as electrons) interacting electromagnetically to first order in perturbation theory. | |
1 | Maxwell's equations (in QED using the Lorenz gauge) | <math>\partial_\mu\partial^\mu A^\nu = e \overline{\psi} \gamma^\nu \psi </math> | Photons, massless spin-1 particles. |
Proca equation | <math>\partial_\mu(\partial^\mu A^\nu - \partial^\nu A^\mu)+\left(\frac{mc}{\hbar}\right)^2 A^\nu=0</math> | Massive spin-1 particle (such as W and Z bosons). | |
3/2 | Rarita–Schwinger equation | <math> \epsilon^{\mu \nu \rho \sigma} \gamma^5 \gamma_\nu \partial_\rho \psi_\sigma + m\psi^\mu = 0</math> | Massive spin-3/2 particles. |
s | Bargmann–Wigner equations | <math>\begin{align}
(-i\hbar \gamma^\mu \partial_\mu + mc)_{\alpha_1 \alpha_1'}\psi_{\alpha'_1 \alpha_2 \alpha_3 \cdots \alpha_{2s}} &= 0 \\ (-i\hbar \gamma^\mu \partial_\mu + mc)_{\alpha_2 \alpha_2'}\psi_{\alpha_1 \alpha'_2 \alpha_3 \cdots \alpha_{2s}} &= 0 \\ &\;\; \vdots \\ (-i\hbar \gamma^\mu \partial_\mu + mc)_{\alpha_{2s} \alpha'_{2s}}\psi_{\alpha_1 \alpha_2 \alpha_3 \cdots \alpha'_{2s}} &= 0 \end{align}</math>Template:Br where Template:Math is a rank-2s 4-component spinor. |
Free particles of arbitrary spin (bosons and fermions).<ref name="E.A. Jeffery 1978"/><ref>
Template:Cite news</ref> |
Joos–Weinberg equation | <math> [(i\hbar )^{2s}\gamma ^{\mu _{1}\mu _{2}\cdots \mu _{2s}}\partial _{\mu _{1}}\partial _{\mu _{2}}\cdots \partial _{\mu _{2s}}+(mc)^{2s}]\psi =0</math> | Free particles of arbitrary spin (bosons and fermions). |
Linear gauge fieldsEdit
The Duffin–Kemmer–Petiau equation is an alternative equation for spin-0 and spin-1 particles: <math display="block">(i \hbar \beta^{a} \partial_a - m c) \psi = 0</math>
Constructing RWEsEdit
Using 4-vectors and the energy–momentum relationEdit
{{#invoke:Labelled list hatnote|labelledList|Main article|Main articles|Main page|Main pages}}
Start with the standard special relativity (SR) 4-vectors
- 4-position <math>X^\mu = \mathbf{X} = (ct,\vec{\mathbf{x}})</math>
- 4-velocity <math>U^\mu = \mathbf{U} = \gamma(c,\vec{\mathbf{u}})</math>
- 4-momentum <math>P^\mu = \mathbf{P} = \left(\frac{E}{c},\vec{\mathbf{p}}\right)</math>
- 4-wavevector <math>K^\mu = \mathbf{K} = \left(\frac{\omega}{c},\vec{\mathbf{k}}\right)</math>
- 4-gradient <math>\partial^\mu = \mathbf{\partial} = \left(\frac{\partial_t}{c},-\vec{\mathbf{\nabla}}\right)</math>
Note that each 4-vector is related to another by a Lorentz scalar:
- <math>\mathbf{U} = \frac{d}{d\tau} \mathbf{X}</math>, where <math>\tau</math> is the proper time
- <math>\mathbf{P} = m_0 \mathbf{U}</math>, where <math>m_0</math> is the rest mass
- <math>\mathbf{K} = (1/\hbar) \mathbf{P}</math>, which is the 4-vector version of the Planck–Einstein relation & the de Broglie matter wave relation
- <math>\mathbf{\partial} = -i \mathbf{K}</math>, which is the 4-gradient version of complex-valued plane waves
Now, just apply the standard Lorentz scalar product rule to each one:
- <math>\mathbf{U} \cdot \mathbf{U} = (c)^2</math>
- <math>\mathbf{P} \cdot \mathbf{P} = (m_0 c)^2</math>
- <math>\mathbf{K} \cdot \mathbf{K} = \left(\frac{m_0 c}{\hbar}\right)^2</math>
- <math>\mathbf{\partial} \cdot \mathbf{\partial} = \left(\frac{-i m_0 c}{\hbar}\right)^2 = -\left(\frac{m_0 c}{\hbar}\right)^2</math>
The last equation is a fundamental quantum relation.
When applied to a Lorentz scalar field <math>\psi</math>, one gets the Klein–Gordon equation, the most basic of the quantum relativistic wave equations.
- <math>\left[\mathbf{\partial} \cdot \mathbf{\partial} + \left(\frac{m_0 c}{\hbar}\right)^2\right]\psi = 0</math>: in 4-vector format
- <math>\left[\partial_\mu \partial^\mu + \left(\frac{m_0 c}{\hbar}\right)^2\right]\psi = 0</math>: in tensor format
- <math>\left[(\hbar \partial_{\mu} + i m_0 c)(\hbar \partial^{\mu} -i m_0 c)\right]\psi = 0</math>: in factored tensor format
The Schrödinger equation is the low-velocity limiting case (Template:Math) of the Klein–Gordon equation.
When the relation is applied to a four-vector field <math>A^\mu</math> instead of a Lorentz scalar field <math>\psi</math>, then one gets the Proca equation (in Lorenz gauge): <math display="block">\left[\mathbf{\partial} \cdot \mathbf{\partial} + \left(\frac{m_0 c}{\hbar}\right)^2\right]A^\mu = 0</math>
If the rest mass term is set to zero (light-like particles), then this gives the free Maxwell equation (in Lorenz gauge) <math display="block">[\mathbf{\partial} \cdot \mathbf{\partial}]A^\mu = 0</math>
Representations of the Lorentz groupEdit
Under a proper orthochronous Lorentz transformation Template:Math in Minkowski space, all one-particle quantum states Template:Math of spin Template:Math with spin z-component Template:Math locally transform under some representation Template:Math of the Lorentz group:<ref name="Weinberg">Template:Cite journal; Template:Cite journal; Template:Cite journal</ref><ref name="Kenmoku"> Template:Cite arXiv</ref> <math display="block">\psi(x) \rightarrow D(\Lambda) \psi(\Lambda^{-1}x) </math> where Template:Math is some finite-dimensional representation, i.e. a matrix. Here Template:Math is thought of as a column vector containing components with the allowed values of Template:Math. The quantum numbers Template:Math and Template:Math as well as other labels, continuous or discrete, representing other quantum numbers are suppressed. One value of Template:Math may occur more than once depending on the representation. Representations with several possible values for Template:Math are considered below.
The irreducible representations are labeled by a pair of half-integers or integers Template:Math. From these all other representations can be built up using a variety of standard methods, like taking tensor products and direct sums. In particular, space-time itself constitutes a 4-vector representation Template:Math so that Template:Math. To put this into context; Dirac spinors transform under the Template:Math representation. In general, the Template:Math representation space has subspaces that under the subgroup of spatial rotations, SO(3), transform irreducibly like objects of spin j, where each allowed value: <math display="block">j = A + B, A + B - 1, \dots, |A - B|,</math> occurs exactly once.<ref> Template:Citation</ref> In general, tensor products of irreducible representations are reducible; they decompose as direct sums of irreducible representations.
The representations Template:Math and Template:Math can each separately represent particles of spin Template:Math. A state or quantum field in such a representation would satisfy no field equation except the Klein–Gordon equation.
Non-linear equationsEdit
There are equations which have solutions that do not satisfy the superposition principle.
Nonlinear gauge fieldsEdit
- Yang–Mills equation: describes a non-abelian gauge field
- Yang–Mills–Higgs equations: describes a non-abelian gauge field coupled with a massive spin-0 particle
Spin 2Edit
- Einstein field equations: describe interaction of matter with the gravitational field (massless spin-2 field): <math display="block">R_{\mu \nu} - \frac{1}{2} g_{\mu \nu}\,R + g_{\mu \nu} \Lambda = \frac{8 \pi G}{c^4} T_{\mu \nu}</math> The solution is a metric tensor field, rather than a wave function.
See alsoEdit
- List of equations in nuclear and particle physics
- List of equations in quantum mechanics
- Lorentz transformation
- Mathematical descriptions of the electromagnetic field
- Minimal coupling
- Scalar field theory
- Status of special relativity