Four-vector
In the theory of relativity, a four-vector or 4-vector is a vector in a four-dimensional real vector space, called Minkowski space. It differs from a Euclidean vector in that four-vectors transform by the Lorentz transformations. The term four-vector tacitly assumes that its components refer to a vector basis. In a standard basis, the components transform between these bases as the space and time coordinate differences, (cΔt, Δx, Δy, Δz) under spatial translations, spatial rotations, spatial and time inversions and boosts (a change by a constant velocity to another inertial reference frame). The set of all such translations, rotations, inversions and boosts (called Poincaré transformations) forms the Poincaré group. The set of rotations, inversions and boosts (Lorentz transformations, described by 4×4 matrices) forms the Lorentz group.
This article considers four-vectors in the context of special relativity. Although the concept of four-vectors also extends to general relativity, some of the results stated in this article require modification in general relativity.
Mathematics of four-vectors
Four-vectors in a real-valued basis
A four-vector A is defined as, in various equivalent notations, a vector with a "timelike" component and three "spacelike" components:[1]
where the upper index denotes it to be contravariant. Here the standard convention of Latin indices take values for spatial components (so i = 1, 2, 3), and Greek indices take values for space and time components (so α = 0, 1, 2, 3), used with the summation convention. The split between the time component and the spatial components is a useful one to make when determining contractions of one four vector with other tensor quantities, such as for calculating Lorentz invariants in inner products (examples given throughout below), or raising and lowering indices.
It is also customary to represent the bases by column vectors:
so that:
The relation between the covariant and contravariant coordinates is through the Minkowski metric tensor, η:
and in various equivalent notations the covariant components are:
where the lowered index indicates it to be covariant. Often the metric is diagonal, as is the case for orthogonal coordinates (see line element), but not in general curvilinear coordinates.
Again, the bases can be represented by column vectors:
so that:
Geometrically, a four-vector can still be interpreted as an arrow. In relativity, the arrows are drawn as part of a spacetime diagram or Minkowski diagram. In this article, four-vectors will be referred to simply as vectors.
Properties
Linearity
Four-vectors have the same linearity properties as Euclidean vectors in three dimensions. They can be added in the usual entrywise way:
and similarly scalar multiplication by a scalar λ is defined entrywise by:
Then subtraction is the inverse operation of addition, defined entrywise by:
Inner product
The inner product (also called the scalar product) of two four-vectors A and B is defined, using Einstein notation, as
where η is the Minkowski metric. The inner product in this context is also called the Minkowski inner product. For visual clarity, it is convenient to rewrite the definition in matrix form:
in which case ημν above is the entry in row μ and column ν of the Minkowski metric as a square matrix.
The inner product of a four-vector A with itself is the square of the norm of the vector, denoted and defined by:
and intuitively represents (the square of) the length or magnitude of the vector. However, the Minkowski metric is not a Euclidean metric, because it is indefinite (see metric signature). In general, four-vectors can have nonpositive length, contrary to three dimensional vectors in Euclidean space.
(+−−−) signature
In the (+−−−) metric signature, evaluating the summation over indices gives:
while in matrix form:
It is a recurring theme in special relativity to take the expression
in one reference frame, where C is the value of the inner product in this frame, and:
in another frame, in which C′ is the value of the inner product in this frame. Then since the inner product is an invariant, these must be equal:
that is:
This has the appearance of a "conservation law", but there is no "conservation" involved, since the primary significance of the Minkowski inner product is that for any two four-vectors, its value is invariant for all observers; a change of coordinates does not result in a change in value of the inner product. The components of the four-vectors change from one frame to another; A and A′ are connected by a Lorentz transformation, and similarly for B and B′, although the inner products are the same in all frames. Nevertheless, this type of expression is exploited in relativistic calculations on a par with conservation laws, since the magnitudes of components can be determined without explicitly performing any Lorentz transformations. A particular example is with energy and momentum in the energy-momentum relation derived from the four-momentum vector (see also below).
In this signature, the norm of the vector A is:
With the signature (+−−−), four-vectors may be classified as either spacelike if ||A|| < 0, timelike if ||A|| > 0, and null vectors if ||A|| = 0.
(−+++) signature
Some authors define η with the opposite sign, in which case we have the (−+++) metric signature. Evaluating the summation with this signature:
while the matrix form is:
Note that in this case, in one frame:
while in another:
so that:
which is equivalent to the above expression for C in terms of A and B. Either convention will work. With the Minkowski metric defined in the two ways above, the only difference between covariant and contravariant four-vector components are signs, therefore the signs depend on which sign convention is used.
The square of the norm in this signature is:
With the signature (−+++), four-vectors may be classified as either spacelike if ||A|| > 0, timelike if ||A|| < 0, and null vectors if ||A|| = 0.
Dual vectors
The inner product is often expressed as the effect of the dual vector of one vector on the other:
Here the Aνs are the components of the dual vector A* of A in the dual basis and called the covariant coordinates of A, while the original Aν components are called the contravariant coordinates. Lower and upper indices indicate always covariant and contravariant coordinates, respectively.
Derivatives and differentials
In special relativity (but not general relativity), the derivative of a four-vector with respect to a scalar λ (invariant) is itself a four-vector. It is also useful to take the differential of the four-vector, dA and divide it by the differential of the scalar, dλ:
where the contravariant components are:
while the covariant components are:
In relativistic mechanics, one often takes the differential of a four-vector and divides by the differential in proper time (see below).
Lorentz transformation
For two inertial or rotated frames of reference, all four-vectors transform in the same way according to the Lorentz transformation matrix Λ:
or in index notation:
in which the matrix Λ has components Λμν in row μ and column ν.
Pure rotations about an arbitrary axis
For two frames rotated by a fixed angle θ about an axis defined by the unit vector , without any boosts, the matrix Λ has components given by:[2]
where δij is the Kronecker delta, and εijk is the three dimensional Levi-Civita symbol. The spacelike components of 4-vectors rotated, while the time-like components remain unchanged.
For the case of rotations about the z-axis only, the spacelike part of the Lorentz matrix reduces to the rotation matrix about the z-axis:
Pure boosts in an arbitrary direction
For two frames moving at constant relative 3-velocity (not 4-velocity) v = (v1, v2, v3) without rotations, the matrix Λ has components given by:[3]
where for convenience the Lorentz factor defined by:
the relative velocity in units of c:
and δij is the Kronecker delta, are used for simplicity. Contrary to the case for pure rotations, the spacelike and timelike components are mixed together under boosts.
For the case of a boost in the x-direction only, the matrix reduces to;[4][5]
Written in terms of the hyperbolic functions of rapidity ϕ, this Lorentz matrix illustrates the boost to be a hyperbolic rotation in four dimensional spacetime, analogous to the circular rotation above in three dimensional space.
Four-vectors in the Pauli basis
A four-vector A can also be defined in using the Pauli matrices as a basis, again in various equivalent notations:[6]
or explicitly:
and in this formulation, the four-vector is a represented as a unitary matrix (the matrix transpose and complex conjugate of the matrix leaves it unchanged), rather than a real-valued column or row vector. The determinant of the matrix is the modulus of the four-vector, so the determinant is an invariant:
This idea of using matrices as basis vectors is employed in spacetime algebra and geometric algebra, which are examples of Clifford algebra.
Four-position
A point in Minkowski space is called an "event" and is described in a standard basis by a set of four coordinates such as
where = 0, 1, 2, 3, labels the spacetime dimensions and where c is the speed of light. The definition ensures that all the coordinates have the same units (of distance).[7][8][9] These coordinates are the components of the position four-vector for the event. The displacement four-vector is defined to be an "arrow" linking two events:
The position vector is the displacement vector when one of the two events is the origin of the coordinate system. Position vectors are relatively trivial; the general theory of four-vectors is concerned with displacement vectors.
The scalar product of the 4-position with itself is;[10]
which defines the spacetime interval s and proper time τ in Minkowski spacetime, which are invariant. The scalar product of the differential 4-position with itself is:
defining the differential line element ds and differential proper time increment dτ.
Dynamics
When considering physical phenomena, differential equations arise naturally; however, when considering space and time derivatives of functions, it is unclear which reference frame these derivatives are taken with respect to. It is agreed that time derivatives are taken with respect to the proper time τ. As proper time is an invariant, this guarantees that the proper-time-derivative of any four-vector is itself a four-vector. It is then important to find a relation between this proper-time-derivative and another time derivative (using the coordinate time t of an inertial reference frame). This relation is provided by taking the differential invariant spacetime interval;
where
is the distance differential of Cartesian coordinates x, y, z measured in some frame. Then dividing by (cdt)2 gives:
where u = dr/dt is the 3-velocity of an object measured in the same frame as the coordinates x, y, z, and coordinate time t, and
is the Lorentz factor, so
It can also be found from the time transformation in the Lorentz transformations. Important four-vectors in relativity theory can now be defined.
Four-velocity
The four-velocity of an world line is defined by:
where, using suffix notation,
for i = 1, 2, 3. Using the differential of the 4-position, the magnitude of the 4-velocity can be obtained;
in short
The geometric meaning of 4-velocity is the unit vector tangent to the world line in Minkowski space.
Four-acceleration
The four-acceleration is given by:
Since the magnitude of is a constant, the four acceleration is (pseudo-)orthogonal to the four velocity, i.e. the Minkowski inner product of the four-acceleration and the four-velocity is zero:
which is true for all world lines.
The geometric meaning of 4-acceleration is the curvature vector of the world line in Minkowski space.
Four-momentum
The four-momentum for a massive particle is given by:
where m is the invariant mass of the particle and p is the relativistic momentum.
Four-force
The four-force is defined by:
For a particle of constant mass, this is equivalent to
where
Thermodynamics
Four-heat flux
The 4-heat flux vector field, is essentially similar to the 3d heat flux vector field q:[11]
where T is absolute temperature and k is thermal conductivity.
The components are:
in the local frame of the fluid, where ∂α = ∂/∂xα is the four-gradient.
Four-baryon number flux
The flux of baryons is:[12]
where n is the number density of baryons in the local rest frame of the baryon fluid (positive values for baryons, negative for antibaryons), and U the 4-velocity field (of the fluid) as above.
Four-entropy
The 4-entropy vector is defined by:[13]
where s is the entropy per baryon, and T the absolute temperature, in the local rest frame of the fluid.[14]
Electromagnetism
Examples of four-vectors in electromagnetism include the following.
Four-current
The four-current is defined by
formed from the current density j and charge density ρ.
Four-potential
The electromagnetic four-potential defined by
formed from the vector potential a and the scalar potential . The four-potential is not uniquely determined, because it depends on a choice of gauge.
Four-frequency
A plane electromagnetic wave can be described by the four-frequency defined as
where is the frequency of the wave and n is a unit vector in the travel direction of the wave. Notice that
so that the four-frequency is always a null vector.
Four-wavevector
A wave packet of nearly monochromatic light can be characterized by the wave vector, or four-wavevector
Quantum theory
In relativistic quantum mechanics, the 4-probability current is:[15]
where ρ is the probability density function corresponding to the time component, in turn Ψ is the wavefunction, and j is the probability current vector.
Physics of four-vectors
One advantage of the four-vector formalism, over that of three-vectors, is illuminated by showing that known relations between energy and matter are contained in the four-momentum vector.
Energy of massive particles
Here, an expression for the total energy of a particle
will be derived. The kinetic energy (K) of a particle is defined analogously to the classical definition, namely as
with f as above. Noticing that and expanding this out we get
Hence
which yields
for some constant S. When the particle is at rest (u = 0), we take its kinetic energy to be zero (K = 0). This gives
Thus, we interpret the total energy E of the particle as composed of its kinetic energy K and its rest energy m c2. Thus, we have
Total energy and invariant mass
We can also derive the energy–momentum relation:
using the four-vector formalism. Using the relation
we can write the four-momentum as
Taking the inner product of the four-momentum with itself in two different ways, we obtain the relation
reducing to
Hence
This last relation is useful in many areas of physics.
See also
- Relativistic mechanics
- paravector
- wave vector
- Dust (relativity) for the number-flux four-vector
- Basic introduction to the mathematics of curved spacetime
- Minkowski space
References
- ^ Relativity DeMystified, D. McMahon, Mc Graw Hill (BSA), 2006, ISBN 0-07-145545-0
- ^ C.B. Parker (1994). McGraw Hill Encyclopaedia of Physics (2nd ed.). McGraw Hill. p. 1333. ISBN 0-07-051400-3.
- ^ Gravitation, J.B. Wheeler, C. Misner, K.S. Thorne, W.H. Freeman & Co, 1973, ISAN 0-7167-0344-0
- ^ Dynamics and Relativity, J.R. Forshaw, B.G. Smith, Wiley, 2009, ISAN 978-0-470-01460-8
- ^ Relativity DeMystified, D. McMahon, Mc Graw Hill (ASB), 2006, ISAN 0-07-145545-0
- ^ J.A. Wheeler, C. Misner, K.S. Thorne (1973). Gravitation. W.H. Freeman & Co. pp. 1142–1143. ISBN 0-7167-0344-0.
{{cite book}}
: CS1 maint: multiple names: authors list (link) - ^ Jean-Bernard Zuber & Claude Itzykson, Quantum Field Theory, pg 5 , ISBN 0-07-032071-3
- ^ Charles W. Misner, Kip S. Thorne & John A. Wheeler,Gravitation, pg 51, ISBN 0-7167-0344-0
- ^ George Sterman, An Introduction to Quantum Field Theory, pg 4 , ISBN 0-521-31132-2
- ^ Dynamics and Relativity, J.R. Forshaw, A.G. Smith, Wiley, 2009, ISBN 978-0-470-01460-8
- ^ Ali, Y. M.; Zhang, L. C. (2005). "Relativistic heat conduction". Int. J. Heat Mass Trans. 48 (12). doi:10.1016/j.ijheatmasstransfer.2005.02.003.
- ^ J.A. Wheeler, C. Misner, K.S. Thorne (1973). Gravitation. W.H. Freeman & Co. p. 558-559. ISBN 0-7167-0344-0.
{{cite book}}
: CS1 maint: multiple names: authors list (link) - ^ J.A. Wheeler, C. Misner, K.S. Thorne (1973). Gravitation. W.H. Freeman & Co. p. 567. ISBN 0-7167-0344-0.
{{cite book}}
: CS1 maint: multiple names: authors list (link) - ^ J.A. Wheeler, C. Misner, K.S. Thorne (1973). Gravitation. W.H. Freeman & Co. p. 558. ISBN 0-7167-0344-0.
{{cite book}}
: CS1 maint: multiple names: authors list (link) - ^ Vladimir G. Ivancevic, Tijana T. Ivancevic (2008) Quantum leap: from Dirac and Feynman, across the universe, to human body and mind. World Scientific Publishing Company, ISBN 978-981-281-927-7, p. 41
- Rindler, W. Introduction to Special Relativity (2nd edn.) (1991) Clarendon Press Oxford ISBN 0-19-853952-5