Yang–Mills theory

Revision as of 14:49, 26 May 2025 by imported>OAbot (Open access bot: url-access updated in citation with #oabot.)
(diff) ← Previous revision | Latest revision (diff) | Newer revision → (diff)

Template:Short description Template:Use American English Template:Unsolved Template:Quantum field theory Yang–Mills theory is a quantum field theory for nuclear binding devised by Chen Ning Yang and Robert Mills in 1953, as well as a generic term for the class of similar theories. The Yang–Mills theory is a gauge theory based on a special unitary group Template:Math, or more generally any compact Lie group. A Yang–Mills theory seeks to describe the behavior of elementary particles using these non-abelian Lie groups and is at the core of the unification of the electromagnetic force and weak forces (i.e. Template:Math) as well as quantum chromodynamics, the theory of the strong force (based on Template:Math). Thus it forms the basis of the understanding of the Standard Model of particle physics.

History and qualitative descriptionEdit

Gauge theory in electrodynamicsEdit

All known fundamental interactions can be described in terms of gauge theories, but working this out took decades.<ref name=oraifearthagh>Template:Cite journal</ref> Hermann Weyl's pioneering work on this project started in 1915 when his colleague Emmy Noether proved that every conserved physical quantity has a matching symmetry, and culminated in 1928 when he published his book applying the geometrical theory of symmetry (group theory) to quantum mechanics.<ref name=Baggott40/>Template:Rp Weyl named the relevant symmetry in Noether's theorem the "gauge symmetry", by analogy to distance standardization in railroad gauges.

Erwin Schrödinger in 1922, three years before working on his equation, connected Weyl's group concept to electron charge. Schrödinger showed that the group <math>U(1)</math> produced a phase shift <math>e^{i\theta}</math> in electromagnetic fields that matched the conservation of electric charge.<ref name=Baggott40/>Template:Rp As the theory of quantum electrodynamics developed in the 1930's and 1940's the <math>U(1)</math> group transformations played a central role. Many physicists thought there must be an analog for the dynamics of nucleons. Chen Ning Yang in particular was obsessed with this possibility.

Yang and Mills find the nuclear force gauge theoryEdit

Yang's core idea was to look for a conserved quantity in nuclear physics comparable to electric charge and use it to develop a corresponding gauge theory comparable to electrodynamics. He settled on conservation of isospin, a quantum number that distinguishes a neutron from a proton, but he made no progress on a theory.<ref name=Baggott40/>Template:Rp Taking a break from Princeton in the summer of 1953, Yang met a collaborator who could help: Robert Mills. As Mills himself describes:

"During the academic year 1953–1954, Yang was a visitor to Brookhaven National Laboratory ... I was at Brookhaven also ... and was assigned to the same office as Yang. Yang, who has demonstrated on a number of occasions his generosity to physicists beginning their careers, told me about his idea of generalizing gauge invariance and we discussed it at some length ... I was able to contribute something to the discussions, especially with regard to the quantization procedures, and to a small degree in working out the formalism; however, the key ideas were Yang's."<ref>

Template:Cite book </ref>

In the summer 1953, Yang and Mills extended the concept of gauge theory for abelian groups, e.g. quantum electrodynamics, to non-abelian groups, selecting the group Template:Math to provide an explanation for isospin conservation in collisions involving the strong interactions. Yang's presentation of the work at Princeton in February 1954 was challenged by Pauli, asking about the mass in the field developed with the gauge invariance idea.<ref name=Baggott40>Template:Cite book</ref>Template:Rp Pauli knew that this might be an issue as he had worked on applying gauge invariance but chose not to publish it, viewing the massless excitations of the theory to be "unphysical 'shadow particles'".<ref name=oraifearthagh/>Template:Rp Yang and Mills published in October 1954; near the end of the paper, they admit:

<templatestyles src="Template:Blockquote/styles.css" />

Template:ErrorTemplate:Main other{{#if:|{{#if:|}}

}}

{{#invoke:Check for unknown parameters|check|unknown=Template:Main other|preview=Page using Template:Blockquote with unknown parameter "_VALUE_"|ignoreblank=y| 1 | 2 | 3 | 4 | 5 | author | by | char | character | cite | class | content | multiline | personquoted | publication | quote | quotesource | quotetext | sign | source | style | text | title | ts }}

This problem of unphysical massless excitation blocked further progress.<ref name=Baggott40/>

The idea was set aside until 1960, when the concept of particles acquiring mass through symmetry breaking in massless theories was put forward, initially by Jeffrey Goldstone, Yoichiro Nambu, and Giovanni Jona-Lasinio. This prompted a significant restart of Yang–Mills theory studies that proved successful in the formulation of both electroweak unification and quantum chromodynamics (QCD). The electroweak interaction is described by the gauge group Template:Math, while QCD is an Template:Math Yang–Mills theory. The massless gauge bosons of the electroweak Template:Math mix after spontaneous symmetry breaking to produce the three massive bosons of the weak interaction (Template:Math, Template:Math, and Template:Math) as well as the still-massless photon field. The dynamics of the photon field and its interactions with matter are, in turn, governed by the Template:Math gauge theory of quantum electrodynamics. The Standard Model combines the strong interaction with the unified electroweak interaction (unifying the weak and electromagnetic interaction) through the symmetry group Template:Math. In the current epoch the strong interaction is not unified with the electroweak interaction, but from the observed running of the coupling constants it is believedTemplate:Citation needed they all converge to a single value at very high energies.

Phenomenology at lower energies in quantum chromodynamics is not completely understood due to the difficulties of managing such a theory with a strong coupling. This may be the reason why confinement has not been theoretically proven, though it is a consistent experimental observation. This shows why QCD confinement at low energy is a mathematical problem of great relevance, and why the Yang–Mills existence and mass gap problem is a Millennium Prize Problem.

Parallel work on non-Abelian gauge theoriesEdit

In 1953, in a private correspondence, Wolfgang Pauli formulated a six-dimensional theory of Einstein's field equations of general relativity, extending the five-dimensional theory of Kaluza, Klein, Fock, and others to a higher-dimensional internal space.<ref name = Straumann>Template:Cite arXiv</ref> However, there is no evidence that Pauli developed the Lagrangian of a gauge field or the quantization of it. Because Pauli found that his theory "leads to some rather unphysical shadow particles", he refrained from publishing his results formally.<ref name=Straumann/> Although Pauli did not publish his six-dimensional theory, he gave two seminar lectures about it in Zürich in November 1953.<ref name = Straumann />

In January 1954 Ronald Shaw, a graduate student at the University of Cambridge also developed a non-Abelian gauge theory for nuclear forces.<ref name=shaw> Template:Cite journal </ref> However, the theory needed massless particles in order to maintain gauge invariance. Since no such massless particles were known at the time, Shaw and his supervisor Abdus Salam chose not to publish their work.<ref name=shaw/> Shortly after Yang and Mills published their paper in October 1954, Salam encouraged Shaw to publish his work to mark his contribution. Shaw declined, and instead it only forms a chapter of his PhD thesis published in 1956.<ref> Template:Cite thesis </ref><ref> Template:Cite book </ref>

Mathematical overviewEdit

Template:See also Template:Multiple image

Yang–Mills theories are special examples of gauge theories with a non-abelian symmetry group given by the Lagrangian

<math>\ \mathcal{L}_\mathrm{gf} = -\tfrac{1}{2}\operatorname{tr}(F^2) = - \tfrac{1}{4} F^{a\mu \nu} F_{\mu \nu}^a\ </math>

with the generators <math>\ T^a\ </math> of the Lie algebra, indexed by Template:Mvar, corresponding to the Template:Mvar-quantities (the curvature or field-strength form) satisfying

<math>\ \operatorname{tr}\left( T^a\ T^b \right) = \tfrac{1}{2} \delta^{ab}\ , \qquad \left[ T^a,\ T^b \right] = i\ f^{abc}\ T^c ~.</math>

Here, the Template:Mvar are structure constants of the Lie algebra (totally antisymmetric if the generators of the Lie algebra are normalised such that <math>\ \operatorname{tr}(T^a\ T^b)\ </math> is proportional to <math>\ \delta^{ab}\ </math>), the covariant derivative is defined as

<math>\ D_\mu = I\ \partial_\mu - i\ g\ T^a\ A^a_\mu\ ,</math>

Template:Mvar is the identity matrix (matching the size of the generators), <math>\ A^a_\mu\ </math> is the vector potential, and Template:Mvar is the coupling constant. In four dimensions, the coupling constant Template:Mvar is a pure number and for a Template:Math group one has <math>\ a, b, c = 1 \ldots n^2-1 ~.</math>

The relation

<math>\ F_{\mu \nu}^a = \partial_\mu A_\nu^a - \partial_\nu A_\mu^a + g\ f^{abc}\ A_\mu^b\ A_\nu^c\ </math>

can be derived by the commutator

<math>\ \left[ D_\mu, D_\nu \right] = -i\ g\ T^a\ F_{\mu\nu}^a ~.</math>

The field has the property of being self-interacting and the equations of motion that one obtains are said to be semilinear, as nonlinearities are both with and without derivatives. This means that one can manage this theory only by perturbation theory with small nonlinearities.Template:Citation needed

Note that the transition between "upper" ("contravariant") and "lower" ("covariant") vector or tensor components is trivial for a indices (e.g. <math>\ f^{abc} = f_{abc}\ </math>), whereas for μ and ν it is nontrivial, corresponding e.g. to the usual Lorentz signature, <math>\ \eta_{\mu \nu } = {\rm diag}(+---) ~.</math>

From the given Lagrangian one can derive the equations of motion given by

<math>\ \partial^\mu F_{\mu\nu}^a + g\ f^{abc}\ A^{\mu b}\ F_{\mu\nu}^c = 0 ~.</math>

Putting <math>\ F_{\mu\nu}=T^aF^a_{\mu\nu}\ ,</math> these can be rewritten as

<math>\ \left( D^\mu F_{\mu\nu} \right)^a = 0 ~.</math>

A Bianchi identity holds

<math>\ \left( D_\mu\ F_{\nu \kappa} \right)^a + \left( D_\kappa\ F_{\mu \nu} \right)^a + \left( D_\nu\ F_{\kappa \mu} \right)^a = 0\ </math>

which is equivalent to the Jacobi identity

<math>\ \left[ D_{\mu}, \left[ D_{\nu},D_{\kappa} \right] \right] + \left[ D_{\kappa}, \left[ D_{\mu},D_{\nu} \right] \right] + \left[ D_{\nu}, \left[ D_{\kappa},D_{\mu} \right] \right] = 0\ </math>

since <math>\ \left[ D_{\mu},F^a_{\nu\kappa} \right] = D_{\mu}\ F^a_{\nu\kappa} ~.</math> Define the dual strength tensor <math>\ \tilde{F}^{\mu\nu} = \tfrac{1}{2}\varepsilon^{\mu\nu\rho\sigma}F_{\rho\sigma}\ ,</math> then the Bianchi identity can be rewritten as

<math>\ D_{\mu}\tilde{F}^{\mu\nu} = 0 ~.</math>

A source <math>\ J_\mu^a\ </math> enters into the equations of motion as

<math>\ \partial^\mu F_{\mu\nu}^a + g\ f^{abc}\ A^{b\mu}\ F_{\mu\nu}^c = -J_\nu^a ~.</math>

Note that the currents must properly change under gauge group transformations.

We give here some comments about the physical dimensions of the coupling. In Template:Mvar dimensions, the field scales as <math>\ \left[A\right]=\left[ L^{\left(\tfrac{2-D}{2}\right)} \right]\ </math> and so the coupling must scale as <math>\ \left[g^2\right] = \left[L^{\left(D-4\right)}\right] ~.</math> This implies that Yang–Mills theory is not renormalizable for dimensions greater than four. Furthermore, for Template:Nobr the coupling is dimensionless and both the field and the square of the coupling have the same dimensions of the field and the coupling of a massless quartic scalar field theory. So, these theories share the scale invariance at the classical level.

QuantizationEdit

A method of quantizing the Yang–Mills theory is by functional methods, i.e. path integrals. One introduces a generating functional for Template:Mvar-point functions as

<math>\ Z[j] = \int [\mathrm{d}A]\ \exp\left[- \tfrac{i}{2} \int \mathrm{d}^4x\ \operatorname{tr}\left( F^{\mu \nu}\ F_{\mu \nu}\right) + i\ \int \mathrm{d}^4x\ j^a_\mu(x)\ A^{a\mu}(x) \right]\ ,</math>

but this integral has no meaning as it is because the potential vector can be arbitrarily chosen due to the gauge freedom. This problem was already known for quantum electrodynamics but here becomes more severe due to non-abelian properties of the gauge group. A way out has been given by Ludvig Faddeev and Victor Popov with the introduction of a ghost field (see Faddeev–Popov ghost) that has the property of being unphysical since, although it agrees with Fermi–Dirac statistics, it is a complex scalar field, which violates the spin–statistics theorem. So, we can write the generating functional as

<math>\begin{align}

Z[j,\bar\varepsilon,\varepsilon] & = \int [\mathrm{d}\ A] [\mathrm{d}\ \bar c] [\mathrm{d}\ c]\ \exp\Bigl\{ i\ S_F\ \left[\partial A, A\right] + i\ S_{gf}\left[\partial A\right] + i\ S_g\left[\partial c, \partial\bar c, c,\bar c, A \right] \Bigr\} \\ &\exp\left\{i\int \mathrm{d}^4x\ j^a_\mu(x)A^{a\mu}(x)+i\int \mathrm{d}^4x\ \left[\bar c^a(x)\ \varepsilon^a(x) + \bar\varepsilon^a(x)\ c^a(x)\right]\right\} \end{align}</math>

being

<math>S_F=- \tfrac{1}{2} \int \mathrm{d}^4 x\ \operatorname{tr}\left( F^{\mu \nu}\ F_{\mu \nu} \right)\ </math>

for the field,

<math>S_{gf} = -\frac{1}{2\xi} \int \mathrm{d}^4 x\ (\partial\cdot A)^2\ </math>

for the gauge fixing and

<math>\ S_g = -\int \mathrm{d}^4 x\ \left(\bar c^a\ \partial_\mu\partial^\mu c^a + g\ \bar c^a\ f^{abc}\ \partial_\mu\ A^{b\mu}\ c^c \right)\ </math>

for the ghost. This is the expression commonly used to derive Feynman's rules (see Feynman diagram). Here we have Template:Mvar for the ghost field while Template:Mvar fixes the gauge's choice for the quantization. Feynman's rules obtained from this functional are the following

These rules for Feynman's diagrams can be obtained when the generating functional given above is rewritten as

<math>\begin{align}

Z[j,\bar\varepsilon,\varepsilon] &= \exp\left(-i\ g\int \mathrm{d}^4x\ \frac{\delta}{i\ \delta\ \bar\varepsilon^a(x)}\ f^{abc}\partial_\mu\ \frac{i\ \delta}{\delta\ j^b_\mu(x)}\ \frac{i\ \delta}{\delta\ \varepsilon^c(x)} \right)\\ & \qquad \times \exp\left(-i\ g\int \mathrm{d}^4x\ f^{abc}\partial_\mu\frac{i\ \delta}{\delta\ j^a_\nu(x)}\frac{i\ \delta}{\delta\ j^b_\mu(x)}\ \frac{i\ \delta}{\delta\ j^{c\nu}(x)}\right)\\ & \qquad \qquad \times \exp\left(-i\ \frac{g^2}{4}\int \mathrm{d}^4x\ f^{abc}\ f^{ars}\frac{i\ \delta}{\delta\ j^b_\mu(x)}\ \frac{i\ \delta}{\delta\ j^c_\nu(x)}\ \frac{\ i\delta}{\delta\ j^{r\mu}(x)} \frac{i\ \delta}{\delta\ j^{s\nu}(x)} \right) \\ & \qquad \qquad \qquad \times Z_0[j,\bar\varepsilon,\varepsilon] \end{align}</math>

with

<math> Z_0[j,\bar\varepsilon,\varepsilon] = \exp \left( -\int \mathrm{d}^4x\ \mathrm{d}^4y\ \bar\varepsilon^a(x)\ C^{ab}(x-y)\ \varepsilon^b(y) \right)\exp \left( \tfrac{1}{2} \int \mathrm{d}^4x\ \mathrm{d}^4y\ j^a_\mu(x)\ D^{ab\mu\nu}(x-y)\ j^b_\nu(y) \right)\ </math>

being the generating functional of the free theory. Expanding in Template:Mvar and computing the functional derivatives, we are able to obtain all the Template:Mvar-point functions with perturbation theory. Using LSZ reduction formula we get from the Template:Mvar-point functions the corresponding process amplitudes, cross sections and decay rates. The theory is renormalizable and corrections are finite at any order of perturbation theory.

For quantum electrodynamics the ghost field decouples because the gauge group is abelian. This can be seen from the coupling between the gauge field and the ghost field that is <math>\ \bar c^a\ f^{abc}\ \partial_\mu A^{b\mu}\ c^c ~.</math> For the abelian case, all the structure constants <math>\ f^{abc}\ </math> are zero and so there is no coupling. In the non-abelian case, the ghost field appears as a useful way to rewrite the quantum field theory without physical consequences on the observables of the theory such as cross sections or decay rates.

One of the most important results obtained for Yang–Mills theory is asymptotic freedom. This result can be obtained by assuming that the coupling constant Template:Mvar is small (so small nonlinearities), as for high energies, and applying perturbation theory. The relevance of this result is due to the fact that a Yang–Mills theory that describes strong interaction and asymptotic freedom permits proper treatment of experimental results coming from deep inelastic scattering.

To obtain the behavior of the Yang–Mills theory at high energies, and so to prove asymptotic freedom, one applies perturbation theory assuming a small coupling. This is verified a posteriori in the ultraviolet limit. In the opposite limit, the infrared limit, the situation is the opposite, as the coupling is too large for perturbation theory to be reliable. Most of the difficulties that research meets is just managing the theory at low energies. That is the interesting case, being inherent to the description of hadronic matter and, more generally, to all the observed bound states of gluons and quarks and their confinement (see hadrons). The most used method to study the theory in this limit is to try to solve it on computers (see lattice gauge theory). In this case, large computational resources are needed to be sure the correct limit of infinite volume (smaller lattice spacing) is obtained. This is the limit the results must be compared with. Smaller spacing and larger coupling are not independent of each other, and larger computational resources are needed for each. As of today, the situation appears somewhat satisfactory for the hadronic spectrum and the computation of the gluon and ghost propagators, but the glueball and hybrids spectra are yet a questioned matter in view of the experimental observation of such exotic states. Indeed, the Template:Mvar resonance<ref name=ccl> Template:Cite journal </ref><ref name=ygp> Template:Cite journal </ref> is not seen in any of such lattice computations and contrasting interpretations have been put forward. This is a hotly debated issue.

Open problemsEdit

Yang–Mills theories met with general acceptance in the physics community after Gerard 't Hooft, in 1972, worked out their renormalization, relying on a formulation of the problem worked out by his advisor Martinus Veltman.<ref> Template:Cite journal </ref> Renormalizability is obtained even if the gauge bosons described by this theory are massive, as in the electroweak theory, provided the mass is only an "acquired" one, generated by the Higgs mechanism.

The mathematics of the Yang–Mills theory is a very active field of research, yielding e.g. invariants of differentiable structures on four-dimensional manifolds via work of Simon Donaldson. Furthermore, the field of Yang–Mills theories was included in the Clay Mathematics Institute's list of "Millennium Prize Problems". Here the prize-problem consists, especially, in a proof of the conjecture that the lowest excitations of a pure Yang–Mills theory (i.e. without matter fields) have a finite mass-gap with regard to the vacuum state. Another open problem, connected with this conjecture, is a proof of the confinement property in the presence of additional fermions.

In physics the survey of Yang–Mills theories does not usually start from perturbation analysis or analytical methods, but more recently from systematic application of numerical methods to lattice gauge theories.

See alsoEdit

Template:Div col

Template:Div col end

ReferencesEdit

Template:Reflist

Further readingEdit

Template:Refbegin

Books
Articles

|CitationClass=web }}

Template:Refend

External linksEdit

|CitationClass=web }}

  • {{#invoke:citation/CS1|citation

|CitationClass=web }}

Template:Quantum field theories