General Relativity PDF
General Relativity PDF
General Relativity
Autumn semester 2016
Revision: Antoine Klein, Raymond Angélil, Cédric Huwyler, Simone Balmelli, Yannick Boetzel
• R. Sachs and H. Wu, General Relativity for mathematicians, Springer Verlag, 1977
2
CONTENTS
Contents
I Introduction 6
II Special Relativity 9
3 Lorentz transformations 9
3.1 Galilean invariance . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9
3.2 Lorentz transformations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
3.3 Proper time . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
4 Relativistic mechanics 13
4.1 Equations of motion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
4.2 Energy and momentum . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 13
4.3 Equivalence between mass and energy . . . . . . . . . . . . . . . . . . . . . . . . . . 14
6 Electrodynamics 17
10 Time dilation 28
10.1 Proper time . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 28
10.2 Redshift . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 28
10.3 Photon in a gravitational field . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29
3
CONTENTS
11 Geometrical considerations 30
11.1 Curvature of space . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30
IV Differential Geometry 32
12 Differentiable manifolds 32
12.1 Tangent vectors and tangent spaces . . . . . . . . . . . . . . . . . . . . . . . . . . . 33
12.2 The tangent map . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 36
14 Lie derivative 40
15 Differential forms 42
15.1 Exterior derivative of a differential form . . . . . . . . . . . . . . . . . . . . . . . . . 43
15.2 Stokes theorem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46
15.3 The inner product of a p-form . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46
18 Riemannian connections 60
V General Relativity 64
4
CONTENTS
24 Light deflection 80
25 Perihelion precession 83
25.1 Quadrupole moment of the Sun . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 86
28 Friedmann equations 91
5
1 NEWTON’S THEORY OF GRAVITATION
Part I
Introduction
1 Newton’s theory of gravitation
In his book Principia in 1687, Isaac Newton laid the foundations of classical mechanics and made a
first step in unifying the laws of physics.
The trajectories of N point masses, attracted to each other via gravity, are the solutions to the equation
of motion
N
d2~ri X mi mj (~ri − ~rj )
mi = −G i = 1 . . . N, (1.1)
dt2 j=1
|~ri − ~rj |3
j6=i
with ~ri (t) being the position of point mass mi at time t. Newton’s constant of gravitation is determined
experimentally to be
G = 6.6743 ± 0.0007 × 10−11 m3 kg−1 s−2 (1.2)
where it has been assumed that the mass is smeared out in a small volume d3 r. The mass is given by
dm = ρ(~r 0 )d3 r0 , ρ(~r 0 ) being the mass density. For point-like particles we have ρ(~r 0 ) ∼ mj δ (3) (~r 0 −~rj ).
The gradient of the gravitational potential can then be used to produce the equation of motion:
d2~r
m = −m∇φ(~r). (1.4)
dt2
According to (1.3), the field φ(~r) is determined through the mass of the other particles. The corre-
sponding field equation derived from (1.3) is given by1
The so called Poisson equation (1.5) is a linear partial differential equation of 2nd order. The source of
the field is the mass density. Equations (1.4) and (1.5) show the same structure as the field equation
of electrostatics:
∆φe (~r) = −4πρe (~r), (1.6)
6
2 GOALS OF GENERAL RELATIVITY
analogy one could consider the “gravitational mass” (right side) as a charge, not to be confused with
the “inertial mass” (left side). Experimentally, one finds to very high accuracy (∼ 10−13 ) that they
are the same. As a consequence, all bodies fall at a rate independent of their mass (Galileo Galilei).
This appears to be just a chance in Newton’s theory, whereas in GR it will be an important starting
point.
For many applications, (1.4) and (1.5) are good enough. It must however be clear that these
equations cannot be always valid. In particular (1.5) implies an instantaneous action at a distance,
what is in contradiction with the predictions of special relativity. We therefore have to suspect that
Newton’s theory of gravitation is only a special case of a more general theory.
The Laplace operator ∆ is completed such as to get the D’Alembert operator (wave equation)
1 ∂2
∆⇒= −∆ (2.1)
c2 ∂t2
Changes in ρe travel with the speed of light to another point in space. If we consider inertial coordinate
frames in relative motion to each other it is clear that the charge density has to be related to a
current density. In other words, charge density and current density transform into each other. In
electrodynamics we use the current density j α (α = 0, 1, 2, 3):
ρe → (ρe c, ρe v i ) = j α , (2.2)
where the v i are the cartesian components of the velocity ~v (i = 1, 2, 3). An analogous generalization
can be performed for the potential:
φe → (φe , Ai ) = Aα . (2.3)
7
2 GOALS OF GENERAL RELATIVITY
is q = qe + qp = 0, but for the total mass we get mH 6= mp + me (binding energy). Formally this
means that charge is a Lorentz scalar (does not depend on the frame in which the measurement is
performed). Therefore we can assign a charge to an elementary particle, and not only a “charge at
rest”, whereas for the mass we must specify the rest mass.
δq
Since charge is a Lorentz scalar, the charge density (ρe = δV ) transforms like the 0-component
1 √ 1
of a Lorentz vector ( δV gets a factor γ = 2 2
due to length contraction). The mass density
1−v /c
δm
(ρ = δV ) transforms instead like the 00-component of a Lorentz tensor, which we denote as the
energy-momentum tensor Tαβ . This follows from the fact that the energy (mass is energy E = mc2 )
is the 0-component of a 4-vector (energy-momentum vector pα ) and transforms as such. Thus, instead
of (2.2), we shall have !
ρc2 ρcv i
ρ⇒ ∼ T αβ i, j = 1, 2, 3 (2.5)
ρcv i ρv i v j
This implies that we have to generalize the gravitational potential φ to a quantity depending on 2
indices which we shall call the metric tensor g αβ . Hence we get
∆φ = 4πGρ ⇒ g αβ ∼ GT αβ . (2.6)
In GR one finds (2.6) for a weak gravitational field (linearized case), e.g. used for the description of
gravitational waves.
Due to the equivalence between mass and energy, the energy carried by the gravitational field is
also mass and thus also a source of the gravitational field itself. This leads to non-linearities. One can
note that photons do not have a charge and thus Maxwell’s equations can be linear.
To summarize:
8
3 LORENTZ TRANSFORMATIONS
Part II
Special Relativity
3 Lorentz transformations
A reference system with a well defined choice of coordinates is called a coordinate system. Inertial
reference systems (IS) are (from a “practical” point of view) systems which move with constant speed
with respect to distant (thus fixed) stars in the sky. Newton’s equations of motion are valid in IS. Non-
IS are reference systems which are accelerated with respect to an IS. In this chapter we will establish
how to transform coordinates between different inertial systems.
x0i = αi k xk + v i t + ai , (3.1)
t0 = t + τ, (3.2)
where
9
3 LORENTZ TRANSFORMATIONS
remains invariant. α can be defined by giving 3 Euler angles. Eqs. (3.1) and (3.2) define a 10
(a = 3, v = 3, τ = 1 and α = 3) parametric group of transformations, the so-called Galilean
group.
The laws of mechanics are left invariant under transformations (3.1) and (3.2). But Maxwell’s equations
are not invariant under Galilean transformations, since they contain the speed of light c. This led
Einstein to formulate a new relativity principle (special relativity, SR): All physical laws, including
Maxwell’s equations, are valid in any inertial system. This leads us to Lorentz transformations (instead
of Galilean), thus the law of mechanics have to be modified.
x0 = ct, x1 = x, x2 = y, x3 = z. (3.5)
x0α = Λα β xβ + aα , (3.6)
where aα is a space and time translation. The relative rotations and boosts are described by the 4 × 4
matrix Λ. Linear means in this context that the coefficients Λα β and aα do not depend on xα . In
order to preserve the speed of light appearing in Maxwell’s equations as a constant, the Λα
β have to be
such that the square of the line element
Because of ds2 = ds02 ⇔ c2 dτ 2 = c2 dτ 02 , the proper time is an invariant under Lorentz transformations.
2
+dy 2 +dz 2
x 2
Indeed for light dτ 2 = dt2 − dx
= 0. Thus, c2 = d~ and c = d~x . Applying a Lorentz
0 c2 dt dt
x
transformation results in c = d~
dt 0 . This has the important consequence that the speed of light c is
the same in all coordinate systems (what we intended by the definition of (3.7)).
10
3 LORENTZ TRANSFORMATIONS
A 4-dimensionial space together with this metric is called a Minkowski space. Inserting (3.6) into the
invariant condition ds2 = ds02 gives
Then we get
Λα γ Λβ δ ηαβ = ηγδ or ΛT ηΛ = η. (3.10)
Rotations are special subcases incorporated in Λ: x0α = Λα β xβ with Λi k = αi k , and Λ0 0 = 1,
Λi 0 = Λ0 i = 0. The entire group of Lorentz transformations (LT) is the so called Poincaré group (and
has 10 parameters). The case aα 6= 0 corresponds to the Poincaré group or inhomogeneous Lorentz
group, while the subcase aα = 0 can be described by the homogeneous Lorentz group. Translations and
rotations are subgroups of Galilean and Lorentz groups.
Consider now a Lorentz ’boost’ in the direction of the x-axis: x02 = x2 , x03 = x3 . v denotes the
relative velocity difference between IS and the boosted IS’. Then
Λ0 0 Λ0 1 0 0
1
Λ 0 Λ1 1 0 0
Λα β =
0
. (3.11)
0 1 0
0 0 0 1
For the origin of IS’ we have x01 = 0 = Λ1 0 ct + Λ1 1 vt. This way we find
Λ1 0 v
tanh Ψ = − = , (3.14)
Λ0 0 c
and as a function of velocity:
1
Λ0 0 = Λ1 1 = γ = q , (3.15a)
v2
1− c2
−v/c
Λ0 1 = Λ1 0 = q . (3.15b)
2
1 − vc2
11
3 LORENTZ TRANSFORMATIONS
A Lorentz transformation (called a boost) along the x-axis can then be written explicitly as
x − vt
x0 = q , (3.16a)
2
1 − vc2
y 0 = y, (3.16b)
z 0 = z, (3.16c)
ct − x vc
ct0 = q , (3.16d)
2
1 − vc2
which is valid only for |v| < c. For |v| c, (3.16) recovers the special (no rotation) Galilean transfor-
mation x0 = x − vt, y 0 = y, z 0 = z and t0 = t. The parameter
v
Ψ = arctanh (3.17)
c
is called the rapidity. From this we find for the addition of parallel velocities:
Ψ = Ψ1 + Ψ2
v1 + v2
⇒v= (3.18)
1 + v1c2v2
Zt2 r
v 2 (t)
τ= dt 1 − (3.20)
c2
t1
This is the time interval measured by an observer moving at a speed v (t) between t1 and t2 (as given
by a clock at rest in IS). This effect is called time dilation.
12
4 RELATIVISTIC MECHANICS
4 Relativistic mechanics
Let us now perform the relativistic generalization of Newton’s equation of motion for a point particle.
u0α = Λα β uβ (4.2)
All quantities which transforms this way are Lorentz vectors or form-vectors. The generalized equation
of motion is then
duα
m = f α. (4.3)
dτ
α
du
Both dτ and f α are Lorentz vectors, therefore, (4.3) is a Lorentz vector equation: if we perform a
0α
0α
Lorentz transformation, we get m du dτ = f . Eq. (4.3) is covariant under Lorentz transformations
d~v
v c itreduces
and for to Newton’s equations. (left hand side becomes m 0, dt and the right hand
0 ~ ~
side f , f = 0, K ). The Minkowski force f 0α is determined in any IS through a corresponding
2
−1/2 1
LT: f 0α = Λα β f β . For example ~v = −v~e1 with γ = 1 − vc2 , leads to f 00 = γvK
c , f
01
= γK 1 ,
f 02 = K 2 and f 03 = K 3 . For a general direction of velocity (−~v ) we get:
~
~v · K ~
f 00 = γ , ~ + (γ − 1) ~v ~v · K .
f~0 = K (4.4)
c v 2
m~v vc
momentum : p~ = q = γm~v −−−→ p~ = m~v . (4.6b)
2
1 − vc2
13
5 TENSORS IN MINKOWSKI SPACE
This justifies to call the quantity E = γmc2 an energy. From ds2 = c2 dτ 2 = ηαβ dxα dxβ it follows
ηαβ pα pβ = m2 c2 and thus
E 2 = m2 c4 + c2 p~ 2 , (4.8)
with p = |~
p|. For particles with no rest mass (photons): E = cp (exact relation).
E0 = mc2 (4.10)
and the kinetic energy Ekin = E − E0 = E − mc2 . The quantities defined in (4.6) are conserved when
more particles are involved. Due to the equivalence between energy and mass, the mass or the mass
density becomes a source of the gravitational field.
V α → V 0α = Λα β V β . (5.1)
This is a so-called contravariant 4-vector (indices are up). The coordinate system transforms according
to X α → X 0α = Λα β X β . A covariant 4-vector is defined through
Vα = ηαβ V β . (5.2)
In our case
1 0 0 0
0 −1 0 0
η αβ = ηαβ = . (5.4)
0 0
−1 0
0 0 0 −1
With (5.3) we can express (5.2) equally as
V α = η αβ Vβ . (5.5)
14
5 TENSORS IN MINKOWSKI SPACE
with
Λ̄δ α = ηαβ Λβ γ η γδ (5.7)
(one can use Λα β instead of Λ̄βα but one should be very careful in writings since Λα β 6= Λβ α ). Thanks
to (3.10), we find
Λ̄γ α Λα β = ηαδ η γ Λδ Λα β = η γ ηβ = δ γ β (5.8)
And similarly, we get Λβ α Λ̄αγ = δ β γ . In matrix notation, we have ΛΛ̄ = Λ̄Λ = I and thus Λ̄ = Λ−1 .
To summarize the transformations of 4-vectors:
V α Uα = Vα U α = η αβ Vα Uβ = ηαβ V α U β (5.9)
A quantity is a rank r contravariant tensor if its components transform like the coordinates xα :
Tensors of rank 0 are scalars, tensors of rank 1 are vectors. For “mixed” tensors we have for example:
T 0α βγ = Λα δ Λ̄ β Λ̄µγ T δ µ
1. Linear combinations of tensors with the same upper and lower indices: T α β = aRα β + bS α β
4. Differentiation of a tensor field: ∂α T βα (the derivative ∂α of any tensor is a tensor with one
additional lower index α: ∂α T βγ ≡ Rα βγ )
5. Going from a covariant to a contravariant component of a tensor is defined like in (5.2) and (5.5)
(lowering and raising indices with η αβ , ηαβ ).
15
5 TENSORS IN MINKOWSKI SPACE
• Λα β is not a tensor.
0 (3.10) (5.8)
ηαβ = Λ̄γ α Λ̄δ β ηγδ = Λ̄γ α Λ̄δ β Λµ γ Λν δ ηµν = ηαβ
η appears in the line element (ds2 = ηαβ dxα dxβ ) and is thus the metric tensor in Minkowski space.
We also have η α β = η αγ ηγβ = δ α β = ηβ α , and thus the Kronecker symbol is also a tensor.
0αβγδ = αβγδ ,
0 0
γ 0 δ0
αβγδ = ηαα0 ηββ 0 ηγγ 0 ηδδ0 α β = −αβγδ .
The functions S(x), V α (x), T αβ . . . with x = (x0 , x1 , x2 , x3 ) are a scalar field, a vector field, or a tensor
field . . . respectively if:
S 0 (x0 ) = S(x)
V 0α (x0 ) = Λα β
β V (x)
T 0αβ (x0 ) = Λα β δγ
δ Λγ T (x)
..
.
16
6 ELECTRODYNAMICS
6 Electrodynamics
~ r, t), B(~
Maxwell’s equations relate the fields E(~ ~ r, t), the charge density ρe (~r, t) and the current density
~(~r, t):
~ = 4πρc
div E
~ =0
div B
inhomogeneous homogeneous (6.1)
~ ~
~ = 4π ~ + 1 ∂ E ~ = − 1 ∂B
rot B
rot E
c c ∂t c ∂t
The continuity equation
div ~ + ρ̇c = 0 → ∂α j α = 0 (6.2)
α
Z j = (cρe , ~) follows from the conservation of charge, which for an isolated system implies
with
∂t j 0 d3 r = 0. ∂α j α is a Lorentz scalar. We can define the field strength tensor which is given
by the antisymmetric matrix
0 −Ex −Ey −Ez
Ex 0 −Bz By
F αβ = . (6.3)
E
y Bz 0 −Bx
Ez −By Bx 0
Both equations are covariant under a Lorentz transformation. Eq. (6.5) allows to represent F αβ as a
“curl” of a 4-vector Aα :
F αβ = ∂ α Aβ − ∂ β Aα . (6.6)
We can then reformulate Maxwell’s equations for Aα = (φ, Ai ). From (6.6) it follows that the gauge
transformation
Aα → Aα + ∂ α Θ (6.7)
of the 4-vector Aα leaves F αβ unchanged, where Θ(x) is an arbitrary scalar field. The Lorenz gauge
∂α Aα = 0 leads to the decoupling of the inhomogeneous Maxwell’s equation (6.4) to
4π α
Aα = j . (6.8)
c
The generalized equation of motion for a particle with charge q is
duα q
m = F αβ uβ (6.9)
dτ c
17
7 ACCELERATED REFERENCE SYSTEMS IN SPECIAL RELATIVITY
p
d~ ~ + ~v ∧ B
~ with p~ = γm~v .
The spatial components give the expression of the Lorentz force =q E
dt c
The energy-momentum tensor for the electromagnetic field is
αβ 1 1
Tem = F α γ F γβ + η αβ Fγδ F γδ (6.10)
4π 4
The 00-component represents the energy density of the field Tem 00
= uem = 8π 1 ~2 + B
E ~ 2 and the
i
0i-components the Poynting vector S ~ i = cTem0i c
= 4π ~ ∧B
E ~ . In terms of these tensors, Maxwell’s
αβ 1
equations are ∂α Tem = − F βγ jγ . Tem αβ αβ
is symmetric and conserved: ∂α Tem = 0. Setting β = 0
c
αk
leads to energy conservation whereas ∂α Tem = 0 leads to conservation of the k th component of the
αβ
momentum. One should note that ∂α Tem = 0 is valid only if there is no external force, otherwise we
αβ
can write ∂α Tem = f β , where f β is the external force density. Such an external force can often be
included in the energy-momentum tensor.
and assume that ω 2 (x02 + y 02 ) c2 . Then we insert (7.1) into the line element ds (in the known IS
form):
The resulting line element is more complicated. For arbitrary coordinates x0µ , ds2 is a quadratic form
of the coordinate differentials dx0µ . Consider a general coordinate transformation from xµ (in IS) to
x0µ (in KS’):
xα ≡ xα (x0 ) = xα (x00 , x01 , x02 , x03 ), (7.3)
18
7 ACCELERATED REFERENCE SYSTEMS IN SPECIAL RELATIVITY
∂xα ∂xβ 0µ 0ν
ds2 = ηαβ dxα dxβ = ηαβ dx dx = gµν (x0 )dx0µ dx0ν , (7.4)
∂x0µ ∂x0ν
with
∂xα ∂xβ
gµν (x0 ) = ηαβ . (7.5)
∂x0µ ∂x0ν
The quantity gµν is the metric tensor of the KS’ system. It is symmetric (gµν = gνµ ) and depends on
x0 . It is called metric because it defines distances between points in coordinate systems.
In an accelerated reference system we get inertial forces. In the rotating frame we expect to
~ which can be written in terms of a centrifugal potential φ:
experience the centrifugal force Z,
ω 2 02 ~ = −m∇φ.
~
φ=− (x + y 02 ) and Z (7.6)
2
This enables us to see that g00 from (7.2) is
2φ
g00 = 1 + . (7.7)
c2
The centrifugal potential appears in the metric tensor. We will see later that the first derivatives of
the metric tensor are related to the forces in the relativistic equations of motion. To get the meaning
of t0 in KS’ we evaluate (7.2) at a point with dx0 = dy 0 = dz 0 = 0:
r r
dsclock √ 0 2φ 0 v2
dτ = = g00 dt = 1 + 2 dt = 1− 2 dt (7.8)
c c | {z c }
correspond to clocks
time computed in
an inertial system
ds2 = c2 dt2 − dx2 − dy 2 − dz 2 = c2 dt2 − dρ2 − ρ2 dθ2 − dz 2 = gµν (x0 )dx0µ dx0ν . (7.9)
19
8 THE EQUIVALENCE PRINCIPLE
Part III
3. In a local inertial frame, we experience the known laws of special relativity without gravitation
d2~r
mt = ms~g (8.2)
dt2
20
8 THE EQUIVALENCE PRINCIPLE
This expression is valid for a reference system which is at rest on Earth’s surface (∼ to a good
approximation an IS). Then we perform the following transformation to an accelerated KS system:
1
~r = ~r 0 + gt02 , t = t0 , (8.3)
2
and we assume gt c. The origin of KS ~r 0 = 0 moves in IS with ~r(t) = 21 gt2 . Then, inserting (8.3)
into (8.2) results in
d2
0 1 02
mt 02 ~r + gt = ms~g
dt 2
d2~r 0
⇒ mt = (ms − mt ) ~g . (8.4)
dt02
r0
d2 ~
If ms = mt , the resulting equation in KS is the one of a free moving particle dt02 = 0; the gravitational
force vanishes. As another example in a “free falling elevator” the “observer” does not feel any gravity.
Einstein generalized this finding postulating that (this is the Einstein equivalence principle) “in a
free falling accelerated reference system all physical processes run as if there is no gravitational field”.
Notice that the “mechanical” finding is now expanded to all types of physical processes (at all times
and places). Moreover also non-homogeneous gravitational fields are allowed. The equality of inertial
and gravitational mass is also called the weak equivalence principle (or universality of the free fall).
As an example of a freely falling system, consider a satellite in orbit around Earth (assuming that
the laboratory on the satellite is not rotating). Thus the equivalence principle states that in such a
system all physical processes run as if there would be no gravitational field. The processes run as in an
inertial system: the local IS. However, the local IS is not an inertial system, indeed the laboratory on
the satellite is accelerated compared to the reference system of the fixed distant stars. The equivalence
principle implies that in a local IS the rules of special relativity apply.
• For an observer on the satellite laboratory all physical processes follow special relativity and
there are neither gravitational nor inertial forces.
• For an observer on Earth, the laboratory moves in a gravitational field and moreover inertial
forces are present, since it is accelerated.
The motion of the satellite laboratory, i.e. its free falling trajectory, is such that the gravitational
forces and inertial forces just compensate each other (cf (8.4)). The compensation of the forces is
exactly valid only for the center of mass of the satellite laboratory. Thus the equivalence principle
applies only to a very small or local satellite laboratory (”how small” depends on the situation).
on a curved surface we may erect a locally Cartesian coordinate system in which distances obey the law of Pythagoras.
21
8 THE EQUIVALENCE PRINCIPLE
The equivalence principle allows us to set up the relativistic laws including gravitation; indeed one can
just perform a coordinate transformation to another KS:
special relativity laws
o nrelativistic laws
coordinate
without −−−−−−−−−−→ with
gravitation transf ormation gravitation
The coordinate transformation includes the relative acceleration between the laboratory system and
KS which corresponds to the gravitational field. Thus from the equivalence principle we can derive the
relativistic laws in a gravitational field. However, it does not fix the field equations for gµν (x) since
those equations have no analogue in special relativity.
From a geometrical point of view the coordinate dependence of the metric tensor gµν (x) means
that space is curved. In this sense the field equations describe the connection between curvature of
space and the sources of the gravitational field in a quantitative way.
∂ξ α ∂ξ β µ ν
ds2 = ηαβ dx dx = gµν (x)dxµ dxν , (8.6)
∂xµ ∂xν
∂ξ α ∂ξ β
and thus gµν (x) = ηαβ . A space with such a path element of the form (8.6) is called a Riemann
∂xµ ∂xν
space.
The coordinate transformation (expressed via gµν ) also describes the relative acceleration between
KS and the local IS. Since at two different points of the local IS the accelerations are (in general)
different, there is no global transformation in the form (8.6) that can be brought to the Minkowski
form (8.5). We shall see that gµν are the relativistic gravitational potentials, whereas their derivatives
determine the gravitational forces.
22
8 THE EQUIVALENCE PRINCIPLE
Figure 1: An experimenter and his two stones freely floating somewhere in outer space, i.e. in the
absence of forces.
Figure 2: Constant acceleration upwards mimics the effect of a gravitational field: experimenter and
stones drop to the bottom of the box.
23
8 THE EQUIVALENCE PRINCIPLE
24
9 PHYSICS IN A GRAVITATIONAL FIELD
ξ α = aα τ + bα . (9.2)
Light (or a photon) moves in the local IS on straight lines. However, for photons τ cannot be identified
with the proper time since on the light cone ds = cdτ = 0. Thus we denote by λ a parameter of the
trajectory of photons:
d2 ξ α
= 0. (9.3)
dλ2
Let us now consider a global coordinate system KS with xµ and metric gµν (x). At all points xµ , one can
locally bring ds2 into the form ds2 = ηαβ dξ α dξ β . Thus at all points P there exists a transformation
ξ α (x) = ξ α (x0 , x1 , x2 , x3 ) between ξ α and xµ . The transformation varies from point to point. Consider
a small region around point P . Inserting the coordinate transformation into the line element, we get
∂ξ α ∂ξ β
ds2 = ηαβ dξ α dξ β = ηαβ µ ν dxµ dxν . (9.4)
| ∂x{z ∂x }
≡gµν (x) metric tensor
25
9 PHYSICS IN A GRAVITATIONAL FIELD
dxµ
(assume dτ 6= 0 and m 6= 0). Due to (9.7) only 3 of the 4 components of dτ are independent (this
is also the case for the 4-velocity in special relativity). For photons (m = 0) one finds instead, using
(9.3), a completely analogous equation for the trajectory:
d2 xκ µ
κ dx dx
ν
= −Γµν , (9.8)
dλ2 dλ dλ
and since dτ = ds = 0, one has instead of (9.7):
dxµ dxν
0 = gµν (for m = 0).
dλ dλ
∂ 2 ξ α ∂ξ β
= 2ηαβ . (9.9)
∂xµ ∂xλ ∂xν
On the other hand
gνσ Γσ
µλ
z }| {z }| {
∂ξ α ∂ξ β ∂xσ ∂ 2 ξ ρ
gνσ Γσµλ = ηαβ ν σ
∂x ∂x ∂ξ ρ ∂xµ ∂xλ
| {z }
δρβ
∂ξ α ∂ 2 ξ β
= ηαβ
∂xν ∂xµ ∂xλ
1 ∂gµν ∂gλν ∂gµλ
= + − . (9.10)
2 ∂xλ ∂xµ ∂xν
We introduce the inverse matrix g µν such that g µν gνσ = δ µ σ . Therefore we can solve with respect to
the Christoffel symbols:
1 κν ∂gµν ∂gλν ∂gµλ
Γκµλ = g + − . (9.11)
2 ∂xλ ∂xµ ∂xν
26
9 PHYSICS IN A GRAVITATIONAL FIELD
Note that the Γ’s are symmetric in the lower indices Γκµν = Γκνµ . The gravitational forces on the right
hand side of (9.6) are given by derivatives of gµν . Comparing with the equation of motion of a particle
in a electromagnetic field shows that the Γλµν correspond to the field F αβ , whereas the gµν correspond
to the potentials Aα .
small
velocity 2
2 κ µ ν
dx0
d x dx dx
= −Γκµν ≈ −Γκ00
z}|{
. (9.12)
dτ 2 dτ dτ dτ
27
10 TIME DILATION
distribution. Then
2φ(R)
≈ 1.4 × 10−9 at Earth surface,
c2
10 Time dilation
We study a clock in a static gravitational field and the phenomenon of gravitational redshift.
x = (xµ ) are the coordinates of the clock. The time shown by the clock depends on both the gravita-
tional field and of its motion (the gravitational field being described by gµν ).
Special cases:
10.2 Redshift
Let us now consider objects which emit or absorb light with a given frequency. Consider only a static
gravitational field (gµν does not depend on time). A source in ~rA (at rest) emits a monochromatic
28
10 TIME DILATION
As a time interval we consider the time between two following peaks departing from A or arriving
at B. In this case dτA and dτB correspond to the period of the electromagnetic waves at A and B,
respectively, and therefore
1 1
dτA = , dτB = . (10.4)
νA νB
Going from A to B needs the same time ∆t for the first and the second peak of the electromagnetic
wave. Consequently, they will arrive with a time delay which is equal to the one with which they were
emitted, thus dtA = dtB . With (10.3) and (10.4) we get:
s
νA g00 (~rB ) νA λB
= , with z = −1= − 1. (10.5)
νB g00 (~rA ) νB λA
2φ
For weak fields with g00 = 1 + c2 we have
φ(~rB ) − φ(~rA )
z= (|φ| c2 ), (10.7)
c2
such a redshift is observed by measuring spectral lines from stars. As an example take solar light with
(10.7)
φ(rB ) − φ(rA ) φ(rA ) GM
z= 2
≈− 2 = 2 ≈ 2 × 10−6 ,
c c c R
with M ≈ 2 × 1030 kg and R ≈ 7 × 108 m. For a white dwarf we find z ≈ 10−4 and for a neutron star
z ≈ 10−1 . In general there are 3 effects which can lead to a modification in the frequency of spectral
lines:
1. Doppler shift due to the motion of the source (or of the observer)
2. Gravitational redshift due to the gravitational field at the source (or at the observer)
3. Cosmological redshift due to the expansion of the Universe (metric tensor is time dependent)
29
11 GEOMETRICAL CONSIDERATIONS
∆ν gh
=− 2. (10.9)
ν c
E E
The photon changes its energy by ∆Eγ = − c2γ gh (like a particle with mass c2γ = m). This effect has
∆νexp
been measured in 1965 (through the Mössbauer effect) as = 1.00 ± 0.01 (1% accuracy)3 .
∆νth
11 Geometrical considerations
In general, the coordinate dependence of gµν (x) means that spacetime, defined through the line element
ds2 , is curved. The trajectories in a gravitational field are the geodesic lines in the corresponding
spacetime.
ds2 = g11 dx1 dx1 + 2g12 dx1 dx2 + g22 dx2 dx2 . (11.1)
Examples:
The line element (11.2) can, via a coordinate transformation, be brought into the form (11.3). However,
there is no coordinate transformation which brings (11.4) into (11.2). Thus:
• The metric tensor determines the properties of the space, among which is also the curvature.
• The form of the metric tensor is not uniquely determined by the space, in other words it depends
on the choice of coordinates.
3 Pound, R. V. and Snider, J. L., Effect of Gravity on Gamma Radiation, Physical Review, 140
30
11 GEOMETRICAL CONSIDERATIONS
The curvature of the space is determined via the metric tensor (and it does not depend on the coordinate
choice)4 . If gik = const then the space is not curved. In an Euclidian space, one can introduce Cartesian
coordinates gik = δik . For a curved space gik 6= const (does not always imply that space is curved).
For instance by measuring the angles of a triangle and checking if their sum amounts to 180 degrees
or differs, one can infer if the space is curved or not (for instance by being on the surface of a sphere).
4 Beside the curvature discussed here, there is also an exterior curvature. We only consider intrinsic curvatures here.
31
12 DIFFERENTIABLE MANIFOLDS
Part IV
Differential Geometry
12 Differentiable manifolds
A manifold is a topological space that locally looks like the Euclidean Rn space with its usual topology.
A simple example of a curved space is the S 2 sphere: one can setup local coordinates (θ, ϕ) which map
S 2 onto a plane R2 (a chart). Collections of charts are called atlases. There is no one-to-one map of
S 2 onto R2 ; we need several charts to cover S 2 .
Definition: Given a (topological) space M, a chart on M is a one-to-one map φ from an open subset
U ⊂ M to an open subset φ(U) ⊂ Rn , i.e. a map φ : M → Rn . A chart is often called a coordinate
S
system. A set of charts with domain Uα is called an atlas of M, if Uα = M, {φα |α ∈ I}.
α
Definition: dim M = n
The collection of all such C ∞ -related charts forms a maximal atlas. If M is a space and A its maximal
atlas, the set (M, A) is a (C ∞ )-differentiable manifold. (If for each φ in the atlas the map φ : U → Rn
has the same n, then the manifold has dimension n.)
Important notions:
32
12 DIFFERENTIABLE MANIFOLDS
M
U1
U2
φ1 φ2
Rn Rn
x x̄
φ2 ◦ φ−1
1
X X̄
(Chart φ1 ) ⊂ Rn (Chart φ2 ) ⊂ Rn
33
12 DIFFERENTIABLE MANIFOLDS
Definition: The tangent vector γ̇p to a curve γ(t) at a point p is a map from the set of real functions
f defined in a neighbourhood of p to R defined by
d
γ̇p : f 7→ (f ◦ γ) = (f ◦ γ)• p = γ̇p (f ). (12.2)
dt p
Given a chart φ with coordinates xi , the components of γ̇p with respect to the chart are
i • d i
(x ◦ γ) p = x (γ(t)) . (12.3)
dt p
Theorem: If the dimension of M is n, then Tp (M) is a vector space of dimension n (without proof).
We set γ(0) = p (t = 0), Xp = γ̇p , and Xp f = γ̇p (f ). Eq. (12.3) determines Xp (xi ), the components
of Xp with respect to a given basis:
Xp f = [f ◦ γ]• (0)
•
= f ◦ φ−1 ◦ φ ◦ γ (0)
n
X ∂ d (12.4)
= i
(f ◦ φ−1 ) (xi ◦ γ)(0)
i=1
∂x dt
X ∂
f (x , . . . , x ) Xp (xi ) .
1 n
= i
i
∂x
34
12 DIFFERENTIABLE MANIFOLDS
Basis of Tp (M): Tp = TP (M) has dimension n. In any basis (e1 , . . . , en ) we have X = X i ei . Changes
of basis are given by
ēi = φi k ek , X̄ i = φi k X k . (12.7)
∂xk ∂ x̄i
φi k = , φi k = . (12.8)
∂ x̄i ∂xk
Definition: The cotangent space Tp∗ (or dual space Tp∗ of Tp ) consists of covectors ω ∈ Tp∗ , which are
linear one-forms ω : X 7→ ω(X) ≡< ω, X >∈ R (ω : Tp → R).
∂f
In particular for functions f , df : X 7→ Xf is an element of Tp∗ . The elements df = f,i dxi = ∂xi dxi
form a linear space of dimension n, therefore all of Tp∗ .
We can define a dual basis (e1 , . . . , en ) of Tp∗ : ω = ωi ei . In particular the dual basis of a basis
(e1 , . . . , en ) of Tp is given by < ei , X >= X i or < ei , X j ej >= X j < ei , ej > = X i . Thus ωi =< ω, ei >.
| {z }
δi j
Upon changing the basis, the ωi transform like the ei and the ei like the X i (see (12.7)). In particular
∂ ∂
we have for the coordinate basis ei = ∂xi , ei = dxi (< ei , ej >=< dxi , ∂x i
j >= δ j ). The change of
basis is:
∂ ∂xk ∂ ∂
i
= i k
= φi k k
∂ x̄ ∂ x̄ ∂x ∂x
∂ x̄i k
dx̄i = dx = φi k dxk
∂xk
(Similar to co- and contravariant vectors.)
1
Tensors on Tp are multilinear forms on Tp∗ and Tp , i.e. a tensor T of type
2 (for short T ∈ ⊗12 Tp ):
T (ω, X, Y ) is a trilinear form on Tp∗ × Tp × Tp . The tensor product is defined between tensors of any
type, i.e. T (ω, X, Y ) = R(ω, X)S(Y ) : T = R ⊗ S. In components:
hence T = T i jk ei ⊗ ej ⊗ ek . Any tensor of any type can therefore be obtained as a linear combination
of tensor products X ⊗ ω ⊗ ω 0 with X ∈ Tp , ω, ω 0 ∈ Tp∗ . A change of basis can be performed similarly
35
12 DIFFERENTIABLE MANIFOLDS
i
T̄jk = T α βγ φi α φj β φk γ (12.10)
Trace: any bilinear form b ∈ Tp∗ ⊗ Tp determines a linear form l ∈ (Tp ⊗ Tp∗ )∗ such that l(X ⊗ ω) =
b(X, ω). In particular tr T is a linear form on tensors T of type 11 , defined by tr(X ⊗ ω) =< ω, X >.
(a) For any f¯ ∈ Fp (M̄) (F: space of all smooth functions on M (or M̄), that is C ∞ map f : M → R):
(b) Let γ be a representative of X (X = γ̇p , see (12.2) and (12.3)). Then let γ̄ = ϕ ◦ γ be a
d ¯ d ¯
representative of ϕ∗ X. This agrees with (a) since dt f (γ̄(t))t=0 = dt (f ◦ ϕ)(γ(t))t=0 .
With respect to bases (e1 , . . . , en ) of Tp and (ē1 , . . . , ēn ) of Tp̄ (M̄), this reads X̄ = ϕ∗ X: X̄ i = (ϕ∗ )i k X k
∂ x̄i
with (ϕ∗ )i k =< ēi , ϕ∗ ek > or in case of coordinate bases: (ϕ∗ )i k = .
∂xk
−1
i.e. maps ϕ such that ϕ exists in a neighbourhood of p̄. Note that
Consider (local) diffeomorphisms,
∂ x̄i
dim M = dim M̄ and det ∂xj 6= 0. Then ϕ∗ and ϕ∗ , as defined above, are invertible and may be
extended to tensors of arbitrary types.
1
Example: tensor of type 1
36
13 VECTOR AND TENSOR FIELDS
∂
Given a coordinate system xi and associated basis ∂x i
p
for each Tp (M), Xp has components Xpi with
∂
Xp = Xpi ∂x i
p
and Xpi = Xp (xi ) (see (12.5)). Eq. (12.8) shows how Xpi transform under coordinate
transformations. The quantity Xf is called the derivative of f with respect to the vector field X. The
following rules apply:
X(f + g) = Xf + Xg,
(13.1)
X(f g) = (Xf )g + f (Xg) (Leibnitz rule).
The vector fields on M form a linear space on which the following operations are defined as well:
X 7→ f X (multiplication by f ∈ F),
X, Y 7→ [X, Y ] = XY − Y X (commutator).
[X, Y ], unlike XY , satisfies the Leibniz rule (13.1). The components of the commutator of two vector
fields X, Y relative to a local coordinate basis can be obtained by its action on xi . Thus using
∂ k ∂
X = X i ∂x i and Y = Y ∂xk
we get
j
[X, Y ] = (XY − Y X)xj
∂xj
Y xj = Y k = Y k δj k = Y j
∂xk
∂
XY j = X k (Y j ) = X k Y j ,k
∂xk |{z}
∂Y j
∂xk
⇒ XY j − Y X j = X k Y j ,k − Y k X j ,k
In a local coordinate basis, the bracket [∂k , ∂j ] clearly vanishes (X k = 1 and Y k = 1, and thus Y,jk = 0).
The Jacobi identity holds:
[X, [Y, Z]] + [Y, [Z, X]] + [Z, [X, Y ]] = 0. (13.2)
37
13 VECTOR AND TENSOR FIELDS
Definition: Let Tp (M)rs be the set of all tensors of rank (r, s) defined on Tp (M) (contravariant of rank
r, covariant of rank s). If we assign to every p ∈ M a tensor tp ∈ Tp (M)rs , then the map t : p 7→ tp
defines a tensor field of type rs .
Algebraic operations on tensor fields are defined point-wise; for instance the sum of two tensor fields
is defined by (t + s̃)p = tp + s̃p where t, s̃ ∈ Tp (M)rs . Tensor products and contractions of tensor fields
are defined analogously. Multiplication by a function f ∈ F(M) is given by (f t)p = f (p)tp . In a
neighbourhood U of p, having coordinates (x1 , . . . , xn ) a tensor field can be expanded in the form
i1 ...ir ∂ ∂
⊗ dxj1 ⊗ . . . ⊗ dxjs .
t= t j1 ...js i
⊗ ... ⊗ i
(13.3)
| {z } ∂x 1 ∂x r
components of t relative
to the coordinate system
(x1 , . . . , xn )
If the coordinates are transformed to (x̄1 , . . . , x̄n ) the components of t transform according to
(ii) For every p ∈ M, gp is a non-degenerate (6= 0) bilinear form on Tp (M). This means that
gp (X, Y ) = 0 for all X ∈ Tp (M) if and only if Y = 0.
The tensor field g ∈ T20 (M) is a (proper) Riemannian metric if gp is positive definite at every point p.
d
i.e. Xp = dt γ(t)t=0 = γ̇(0) (see (12.2) and (12.3)). γ̇(0) is the tangent vector to γ at the point
p = γ(0). At the point γ(t) we have then
d d
γ̇(t) = ϕt (p) = (ϕs ◦ ϕt ) (p) = Xϕt (p)
dt ds s=0
38
13 VECTOR AND TENSOR FIELDS
The generating vector field determines the flow uniquely. Not always does (13.6) admit global solutions
(i.e. for all t ∈ R), however for most purposes, “local flows” are good enough.
39
14 LIE DERIVATIVE
14 Lie derivative
The derivative of a vector field V rests on the comparison of Vp and Vp0 at nearby points p, p0 . Since
Vp ∈ Tp and Vp0 ∈ Tp0 belong to different spaces their difference can be taken only after Vp0 has been
transported to Vp . This can be achieved by means of the tangent map ϕ∗ (Lie transport). The Lie
derivative LX R of a tensor field R in direction of a vector field X is defined by
d ∗
LX R = ϕt R , (14.1)
dt t=0
d ∗
or more explicitly (LX R)p = ϕt Rϕt (p) . Here ϕt is the (local) flow generated by X, where
dt t=0
ϕ∗t Rϕt (p) is a tensor on Tp depending on t.
R ◦ ϕt (p)
Rp
ϕ−t∗ (R ◦ ϕt (p)) = ϕ∗t (R ◦ ϕt (p))
ϕt (p)
p
d ∗ 1
LX R = ϕ R = lim (ϕ∗ R − R)
dt t t=0 t→0 t t
d
t → ϕt (p) = γ(t); Xp = dt γ(t)|t=0 = γ̇(0)
(ϕ∗ is the inverse of ϕ∗ )
In order to express LX in components we write ϕt in a chart: ϕt : x 7→ x̄(t), and linearize it for small
2 i 2 i
∂ x̄ ∂ x
t: x̄i = xi + tX i (x) + O(t2 ), xi = x̄i − tX i (x̄) + O(t2 ), thus ∂x i
k ∂t = − ∂ x̄k ∂t = X ,k at t = 0.
i
∂ x̄β
As an example, let R be of type 11 . By (12.12) we have (ϕ∗t R)i j (x) = Rα β (x̄) ∂∂x
x̄α ∂xj . Taking
(according to (14.1)) a derivative with respect to t at t = 0 yields
40
14 LIE DERIVATIVE
Properties of LX :
(a) LX is a linear map from tensor fields to tensor fields of the same type.
(d) LX f = Xf (f ∈ F(M))
(proof: (a) follows from (14.1), (b) and (c) from (12.11), (d) from (13.5), whereas (e) is more involved).
For higher rank tensors the derivation follows from the use of (c).
If [X, Y ] = 0 then LX LY = LY LX and for φ and ψ, which are the flows generated by X and Y , one
finds: φs ◦ ψt = ψt ◦ φs .
41
15 DIFFERENTIAL FORMS
15 Differential forms
0
Definition: A p-form Ω is a totally antisymmetric tensor field of type p
for any permutation π of {1, . . . , p} (π ∈ Sp (group of permutations)) with sign π being its parity. For
p > dim M, Ω ≡ 0. Any tensor field of type p0 can be antisymmetrized by means of the operation A:
1 X
(AT )(X1 , . . . , Xp ) = (sign π)T (Xπ(1) , . . . , Xπ(p) ) (15.1)
p!
π∈Sp
with A2 = A. The exterior product of a p1 -form Ω1 with a p2 -form Ω2 is the (p1 + p2 )-form:
(p1 + p2 )!
Ω1 ∧ Ω2 = A(Ω1 ⊗ Ω2 ) (15.2)
p1 ! p2 !
Properties:
• Ω1 ∧ Ω2 = (−1)p1 p2 Ω2 ∧ Ω1
(p1 + p2 + p3 )!
• Ω1 ∧ (Ω2 ∧ Ω3 ) = (Ω1 ∧ Ω2 ) ∧ Ω3 = A(Ω1 ⊗ Ω2 ⊗ Ω3 )
p1 ! p2 ! p3 !
The components in a local basis (e1 , . . . , en ) of 1-forms are
A covariant tensor of rank p, which is antisymmetric under exchange of any pair of indices (i.e. is a
p-form), in n dimensions has np = (n−p)!p!
n!
independent components.
Examples:
42
15 DIFFERENTIAL FORMS
since
(1 + 2)!
A∧B = A(A ⊗ B)
1! 2!
3! 1
= (Aik Bl ) ei ∧ ek ∧ el
1! 2! 3!
1
= (Aik Bl )ei ∧ ek ∧ el
2
11
= (Aik Bl + cyclic permutations)ei ∧ ek ∧ el
23
1 i
= (Aik Bl + cyclic permutations) e ∧ ek ∧ el .
3!
then
d(Ω ∧ f ) = dΩ ∧ f − Ω ∧ df
43
15 DIFFERENTIAL FORMS
and
∂ ∂ ∂f
X1 (f Ω)(X2 ) = X1i (f Ωk X2k ) = f X1i i Ωk X2k + X1i i Ωk X2k .
∂x i ∂x ∂x
| {z } | {z }
f X1 Ω(X2 ) df (X1 )Ω(X2 )
So
d(f ∧ Ω)(X1 , X2 ) = f dΩ(X1 , X2 ) + Ω(X2 )df (X1 ) − Ω(X1 )df (X2 ) . (15.7)
| {z } | {z }
dΩ∧f −Ω∧df
2
Moreover we have d f = 0, since
= X1 X2 f − X2 X1 f − [X1 , X2 ]f = 0.
(15.8)
p+1
X
+ (−1)i+j Ω([Xi , Xj ], X1 , .. , X̂i , .. , X̂j , .. , Xp+1 ), (15.9)
i<j
44
15 DIFFERENTIAL FORMS
Components:
Examples:
• p = 1:
(dΩ)ik = Ωk,i − Ωi,k (15.12)
• p = 2:
(dΩ)ikl = Ωik,l + Ωkl,i + Ωli,k (15.13)
ϕ∗ ◦ d = d ◦ ϕ∗ . (15.14)
A “proof” is found by using (15.10), (12.11) and property (b). It suffices to verify (15.14) on 0-forms
and 1-forms. For 0-forms f¯, (15.14) is identical to (12.10a). For 1-forms which are differentials df¯,
due to (c) we have
d ∗
Setting ϕ = ϕt (the flow generated by X) and forming (14.1) (LX R = dt ϕt R t=0 ), one obtains the
infinitesimal version of (15.14):
LX ◦ d = d ◦ LX . (15.15)
• dω = 0 is closed
An exact p-form is always closed (d2 η = 0), but the converse is not generally true (Poincaré lemma
gives conditions under which the converse is valid).5 6
5η is not unique since gauge transformations η 7→ η + dρ, with ρ any (p − 1)-form, leave dη unchanged.
6 This is a generalization of the results of three-dimensional vector analysis: rot grad f = 0 and div rot ~k = 0.
45
15 DIFFERENTIAL FORMS
is determined by the single component ω(x); under a change of coordinates ω(x) transforms like
∂xi1 ∂xin
i
∂x
ω̄(x̄) = ω̄1...n = ωi1 ...in ··· = ω(x) det . (15.17)
| {z } ∂ x̄1 ∂ x̄n ∂ x̄j
totally
antisymmetric
Properties:
7 Actually,it is often impossible to cover the whole manifold with a single set of coordinates. In the general case it is
necessary to introduce different sets of coordinates in different overlapping patches of the manifold, with the constraint
that in the overlap between the patch with coordinate xi and another patch with coordinate x̄i , the xi can be expressed
in a smooth one-to-one way as functions of x̄i and vice-versa (orientable manifold).
8 The integral over a p-form over the overlap between two patches (xi and x̄i ) can be evaluated using either coordinate
∂xi
system, provided det ∂ x̄j
> 0.
46
15 DIFFERENTIAL FORMS
(c) iX 2 = 0,
(e) LX = iX ◦ d + d ◦ iX .
LX f = Xf,
iX ◦ df +
d ◦iX
f = iX df = Xf,
=0
LX df = d(LX f ) = d(Xf ),
(15.15)
LX ◦d=d◦LX
iX ◦ + d ◦ iX df = d(Xf ).
ddf
=0
Let X be a vector field. Then d(iX η) is an n-form with dim M = n. η is an n-form, and if ηp 6= 0 ∀p ∈
M, then η is a “volume form”. A function divη X ∈ F is defined through
We can apply Stokes theorem since d(iX η) is an n-form and thus iX η an (n − 1)-form:
Z Z Z
d(iX η) = (divη X)η = iX η. (15.21)
D D ∂D
p
The standard volume form η is given by η = |g| dx1 ∧ . . . ∧ dxn .
47
16 AFFINE CONNECTIONS: COVARIANT DERIVATIVE OF A VECTOR FIELD
∂
Let η = a(x) dx1 ∧ . . . ∧ dxn , X = X i ∂x i . Then since (divη X)η = LX η, we have (using property (c)
∂ i
Since d(Xxi ) = d(X k k
x ) = dX i (x) = X i ,j dxj , but dx1 ∧ . . . ∧ dxj ∧ . . . ∧ dxn 6= 0 only if j = i
∂x
| {z }
i
δk
(otherwise we have two identical dxi ) we find
n
X
Xa dx1 ∧ . . . ∧ dxn + a
LX η = |{z} X i ,i dx1 ∧ . . . ∧ dxn
Xi ∂a i=1
∂xi
1
= (X i a,i + aX i ,i ) η
a
1 1 p
⇒ divη X = (aX i ),i = p |g|X i for the “standard” η. (15.22)
a |g| ,i
∇f X Y = f ∇X Y,
∇X (f Y ) = f ∇X Y + X(f )Y.
(16.1)
Lemma: Let X and Y be vector fields. If X vanishes at the point p on M, then ∇X Y also vanishes at
p.
∂
Proof: Let U be a coordinate neighbourhood of p. On U we have the representation X = ξ i ∂x i,
48
16 AFFINE CONNECTIONS: COVARIANT DERIVATIVE OF A VECTOR FIELD
The n3 functions Γkij ∈ F(U) are called Christoffel symbols (or connection coefficients) of the connection
∇ in a given chart.10
∂xk ∂
∂ ∂
∇ ∂a b
= Γ̄cab c = Γ̄cab c k . (16.3)
∂ x̄ ∂ x̄ ∂ x̄ ∂ x̄ ∂x
If we use (16.1):
∂xi ∂xj k ∂ ∂ 2 xj ∂
= a b
Γij k + .
∂ x̄ ∂ x̄ ∂x ∂ x̄a ∂ x̄b ∂xj
Definition: for every vector field X we can introduce the tensor ∇X ∈ T11 (M) defined by
10 Fora pseudo-Riemannian manifold, the corresponding connection coefficients are given by (9.6) or (9.11).
11 semicolonshall denote the covariant derivative (”normal derivative” + additional terms, that vanish in (cartesian)
Euclidean or Minkowski space)
49
16 AFFINE CONNECTIONS: COVARIANT DERIVATIVE OF A VECTOR FIELD
∇γ̇ X = 0. (16.7)
DX ∇X
The vector ∇γ̇ X is sometimes denoted as dt or dt (covariant derivative along γ). In terms of
i
i dx
coordinates, we have X = ξ ∂i , γ̇ = dt ∂i (see (12.3)). With (16.1) and (16.2) we get
∇γ̇ X = ∇ dxi ∂ (ξ k ∂k )
dt i
dxi
= ∇∂i (ξ k ∂k )
dt
dxi h k j i
= ξ Γik ∂j + ∂i ξ k ∂k
dt
dxi j k
ξ Γij ∂k + ∂i ξ k ∂k
=
dt
k i
dξ k dx j
= + Γij ξ ∂k , (16.8)
dt dt
dxi ∂ξ k dξ k
where we used = . This shows that ∇γ̇ X only depends on the values of X along γ. In
dt ∂xi dt
terms of coordinates we get for (16.7)
dξ k dxi j
+ Γkij ξ = 0. (16.9)
dt dt
For a curve γ and any two point γ(s) and γ(t) consider the mapping
which transforms a vector v(s) at γ(s) into the parallel transported vector v(t) at γ(t). The mapping
τt,s is the parallel transport along γ from γ(s) to γ(t). We have τs,s = 1 and τr,s ◦ τs,t = τr,t .
We can now give a geometrical interpretation of the covariant derivative that will be generalized
to tensors. Let X be a vector field along γ, then
d
∇γ̇ X(γ(t)) = τt,s X(γ(s)), (16.10)
ds s=t
Proof: Let’s work in a given chart. By construction, v(t) = τt,s v(s) with v(s) ∈ Tγ(s) (M) and due to
(16.8) it satisifies: v̇ i + Γikj ẋk v j = 0. If we write (τt,s v(s))i = (τt,s )i j v j (s) = v i (t) (with τt,s = (τs,t )−1
50
16 AFFINE CONNECTIONS: COVARIANT DERIVATIVE OF A VECTOR FIELD
τt,s v(t)
v(s) M
γ
γ(t)
γ(s)
dxj (γ(s))
= Γikj ẋk X j +X i
,j ,
ds
s=t
dξ i
which is again (16.8) (X = ξ i ∂i and the second term gives dt ).
51
16 AFFINE CONNECTIONS: COVARIANT DERIVATIVE OF A VECTOR FIELD
Definition: Let X be a vector field such that ∇X X = 0. Then the integral curves of X are called
geodesics.
d i
In local coordinates xi the curve is given by (using (12.3) and (13.6)) the requirement dt x (t) =
2 i i
i d x dX
X (x(t)). Inserting this into (16.8) and using dt2 = dt , we get
For a vector parallel transported along a geodesic, its length and angle with the geodesic does not
change.
Let γ : [0, 1] → M be a closed path, wih γ(0) = p = γ(1). Displace a vector v0 ∈ Tp (M) parallel along
γ and obtain the field v(t) = τt,0 v0 ∈ Tγ(t) (M). We assume that the closed path is sufficiently small
(such that we can work in the image of some chart), thus we can expand Γikj (x) around the point
x(0) = x0 on the curve:
∂ i
Γikj (x) Γikj (x0 ) ρ
xρ0 )
' + (x − Γ kj (x) + ··· (16.14)
∂xρ
x=x0
Zt Zt Zt
i i
v̇ dt = v (t) − v0i =− Γikj j k
v ẋ dt ≈
|{z} −Γikj (x0 )v0j ẋk dt,
0 0 'v0j 0
| {z }
xk (t)−xk
0
52
16 AFFINE CONNECTIONS: COVARIANT DERIVATIVE OF A VECTOR FIELD
N 1
W
C E
α
4 3
Figure 10: Illustration of the path dependence of parallel transport on a curved space: vector 1 at N
can be parallel transported along the geodesic N-S to C, giving rise to vector 2. Alternatively, it can
be first transported along the geodesic N-S to E (vector 3) and then along E-W to C to give the vector
4. Clearly these two are different. The angle α between them reflects the curvature of the two-sphere.
By plugging (16.14) and (16.15) into (16.13), we obtain an equation valid to second order:
Z1 Z1
i
v̇ dt = − Γikj ẋk v j dt (16.16)
0 0
Z1
∂ i
i
v (1) − v0i '− Γikj (x0 ) + (x − ρ
xρ0 ) Γ (x0 ) + · · · ×
∂xρ kj
0
v0j − Γjk̃j̃ (x0 )(xk̃ (t) − xk̃0 )v0j̃ + · · · ẋk dt.
53
16 AFFINE CONNECTIONS: COVARIANT DERIVATIVE OF A VECTOR FIELD
Multiplying out and discarding terms of third order or higher in xk − xk0 , we get:
Z1 Z1
∂ i
i
v (1) ' v0i − Γikj (x0 )v0j k
ẋ dt − Γ (x0 ) − Γkj̃ (x0 )Γρj (x0 ) v0j (xρ − xρ0 )ẋk dt.
i j̃
∂xρ kj
0 0
| {z }
xk (1)−xk (0)
=0
R1
Since we are considering a closed path ( 0 ẋ dt = xk (1) − xk (0) = 0),
Z1
∂ i
∆v i = v i (1) − v i (0) = − Γ (x 0 ) − Γi
(x
kl 0 )Γl
(x0 ) v0j xρ ẋk dt,
∂xρ kj ρj
0
with
I1 I1 I1 I1
ρ k d ρ k
x ẋ dt = (x x ) dt − ẋ x dt = − ẋρ xk dt,
ρ k
dt
0 0 0 0
| {z }
=0
Z1
i 1 ∂ i i l ∂ i j
∆v = − Γ − Γkl Γρj − Γ + Γρl Γkj (x0 ) v0 xρ ẋk dt,
i l
2 ∂xρ kj ∂xk ρj
| {z } 0
i
−Rjkρ
Z1
1 i
i
∆v = Rjkρ (x0 )v0j xρ ẋk dt. (16.17)
2
0
i
We shall see that Rjkρ is the curvature tensor.
i ∂ i ∂ i
Rjkρ = Γρj − Γ + Γlρj Γikl − Γlkj Γiρl (16.18)
∂xk ∂xρ kj
Thus an arbitrary vector v i will not change when parallel transported around an arbitrary small closed
i
curve at x0 if and only if Rjkρ vanishes at x0 .
τs,t c = c (c ∈ R).
54
16 AFFINE CONNECTIONS: COVARIANT DERIVATIVE OF A VECTOR FIELD
1
For e.g. a covariant vector ω, hτs,t ω, τs,t Xiγ(s) = hω, Xiγ(t) and for a tensor of type 1 : τs,t T (τs,t ω, τs,t X) =
T (ω, X). In components:
i β
(τs,t T ) k = T α β (τs,t )i α (τs,t )k (16.19)
(τi k is inverse transpose of τ i k ). The covariant derivative ∇X (X vector field, T tensor field) associated
to τ is
d
(∇X T )p = τ0,t Tγ(t) , (16.20)
dt t=0
with γ(t) any curve with γ(0) = p and γ̇(0) = Xp (generalization of (16.10)).
(b) ∇X f = Xf ,
= tr(∇X (ω ⊗ Y )) − tr(ω ⊗ ∇X Y )
= ∇X tr(ω ⊗ Y ) − ω(∇X Y )
1
General differentiation rule for a tensor field of type 1 :
Due to (a)-(d), the operation ∇X is completely determined by its action on vector fields Y , which are
the affine connections (see (16.1) and (16.2)).
55
16 AFFINE CONNECTIONS: COVARIANT DERIVATIVE OF A VECTOR FIELD
Moreover,
(16.21)
(∇X dxj )(∂i ) X dxj , ∂i − dxj , ∇X ∂i
=
| {z }
δj i
| {z }
0
= −X k Γjki ,
Using (16.23), (16.24) and (16.25) for ω j = dxj , Yi = ∂i we obtain the following expression for ∇X T :
i
T i1 ...ip j1 ...jq ;k = T i1 ...ip j1 ...jq ,k + Γikl1 T li2 ...ip j1 ...jq + . . . + Γklp T i1 ...ip−1 l j1 ...jq
Examples:
ξ i ;k = ξ i ,k + Γikl ξ l ,
• Kronecker tensor:
δ i j;k = 0,
1
• Tensor 1 :
T i k;r = T i k,r + Γirl T l k − Γlrk T i l .
The covariant derivative of a tensor is again a tensor. Consider the covariant derivative of the metric
gµν :
∂gµν
gµν;λ = − Γρλµ gρν − Γρλν gρµ . (16.27)
∂xλ
Inserting into this the expressions of Γρλµ given by (9.11) leads us to
gµν;λ = 0. (16.28)
This is not surprising since gµν;λ vanishes in locally inertial coordinates and being a tensor it is then
zero in all systems.
Covariance principle: Write the appropriate special relativistic equations that hold in the absence of
gravitation, replace ηαβ by gαβ , and replace all derivatives with covariant derivatives (, → ;). The resulting
equations will be generally covariant and true in the presence of gravitational fields.
56
17 CURVATURE AND TORSION OF AN AFFINE CONNECTION, BIANCHI IDENTITIES
Definition:
1
T (X, Y ) is antisymmetric and f -linear in X, Y and then defines a tensor of type 2 through:
(ω, X, Y ) → hω, T (X, Y )i is thus a 12 tensor field called the torsion tensor.
f -linearity:
T (f X, gY ) = f gT (X, Y ) f, g, ∈ F(M).
In local coordinates, the components of the torsion tensor are given by:
* +
k
k k
T ij = dx , T (∂i , ∂j ) = dx , ∇∂i ∂j −∇∂j ∂i − [∂i , ∂j ]
| {z } | {z }
=Γlij ∂l =0
Ri jkl = dxi , R(∂k , ∂l )∂j = dxi , (∇∂k ∇∂l − ∇∂l ∇∂k )∂j 12
57
17 CURVATURE AND TORSION OF AN AFFINE CONNECTION, BIANCHI IDENTITIES
Eq. (17.4) is exactly the the same as defined in (16.18). It is antisymmetric in the last two indices:
Ri jkl = −Ri jlk .
Definition: The Ricci tensor is the following contraction of the curvature tensor:
R ≡ g lj Rjl = Rl l (17.6)
Example: For a pseudo-Riemannian manifold the connection coefficients are given by (9.11). Consider
a two-sphere (which is a pseudo Riemannian manifold) with the metric ds2 = a2 (dθ2 + sin2 θdφ2 ),
then: ! !
2 1 0 θφ 1 1 0
gθφ = a , g = 2 1
.
0 sin2 θ a 0 sin2 θ
The non-zero Γ are:
= sin2 θ.
Rθθ = 1,
Rθφ = Rφθ = 0.
1 1
= 2
+ 2 2 sin2 θ
a a sin θ
2
= .
a2
58
17 CURVATURE AND TORSION OF AN AFFINE CONNECTION, BIANCHI IDENTITIES
The Ricci scalar is constant over this two-sphere and positive, thus the sphere is “positively curved”.
13 14 15
(∇X ∇Y − ∇Y ∇X )Z + (∇Z ∇X − ∇X ∇Z )Y
13 For a position independent metric (e.g. Cartesian coordinates) the Riemann tensor (and thus the scalar curvature)
59
18 RIEMANNIAN CONNECTIONS
18 Riemannian connections
Metric: Let M be equipped with a pseudo-Riemannian metric: a symmetric, non-degenerate tensor
field: g(X, Y ) = hX, Y i of type 02 .
X̃i = gik X k , ω̃ i = g ik ωk ,
where g ik denotes the inverse of gik . It also works for tensor fields of different types: T i k =
Tlk g il = T il glk .
• Given a basis (e1 , . . . , en ) of Tp , the covectors of the dual basis (e1 , . . . , en ) become themselves
vectors; indeed ei = gij ej .
Rt2
• The length of a curve with tangent vector X between t1 and t2 is L(t1 , t2 ) = d(X) dt.
t1
• If (ea ) is a basis of Tp (M), the components of g with respect to this basis are gab = g(ea , eb ).
• Like in special relativity we classify vectors at a point as timelike (g(X, X) > 0), null (g(X, X) =
0) and space like (g(X, X) < 0).
∇g = 0. (18.2)
60
18 RIEMANNIAN CONNECTIONS
or
Theorem: For every pseudo-Riemannian manifold (M, g), there exists a unique affine connection such
that
(a) ∇ has vanishing torsion (∇ is symmetric),
(b) ∇ is metric.
Proof: T = 0 (vanishing torsion) means ∇X Y = ∇Y X + [X, Y ]. Inserting this into (18.3) (and the
linearity of g) gives
Taking the linear combination (18.5) + (18.6) - (18.4), we get (Koszul formula):
(18.7)
The right hand side is independent of ∇. Since g is non-degenerate, the uniqueness of ∇ follows from
(18.7).
Definition: the unique connection on (M, g) from the above theorem is called the Riemannian or
Levi-Civita connection.
We determine the Christoffel symbols for the Riemannian connection in a given chart (U, x1 , . . . , xn ).
For this purpose we take X = ∂k , Y = ∂j , Z = ∂i in (18.7) and we use [∂i , ∂j ] = 0 as well as
h∂i , ∂j i = gij . The result is
61
18 RIEMANNIAN CONNECTIONS
or
1
glk Γlij = (gkj,i + gik,j − gji,k ). (18.8)
2
1 lk
Γlij = g (gkj,i + gik,j − gji,k ), (18.9)
2
which is exactly equation (9.11).
(i) The inner product of any two vectors remains constant upon parallel transporting them along
any curve γ (g(X, Y )γ(t) = g(X, Y )γ(0) ).
i
(ii) The covariant derivative commutes with raising or lowering indices, e.g. Tk;l = (gkm T im );l
= gkm T im ;l , because gkm;l = 0.
Riemann tensor: the curvature tensor of a Riemannian connection has the following additional sym-
metry properties (without proof):
X
Ri jkl;m = 0 2nd Bianchi identity. (18.14)
(klm)
P
Eqs. (18.13) and (18.14) are valid for vanishing torsion. Here denotes the cyclic sum. Additionally,
(jkl)
62
18 RIEMANNIAN CONNECTIONS
1
Gik = Rik − Rgik Einstein tensor (18.19)
2
Gk i;k = 0, (18.21)
(jm)-trace:
⇒ (18.20)
For (18.21):
1 1
Gk i = Rk i − g k i R = Rk i − δ k i R
2 2
1 1 1
Gk i;k = Rk i;k − (δik R);k = R;i − R;i = 0
2 |2{z } 2
(18.20)
n2 (n2 −1)
Without proof in n dimensions, the Riemann tensor has cn = 12 independent components
(c1 = 0, c2 = 1, c3 = 6, c4 = 20).
63
19 PHYSICAL LAWS WITH GRAVITATION
Part V
General Relativity
19 Physical laws with gravitation
19.1 Mechanics
The physical laws are relations among tensors (scalars and vectors being tensors of rank 0 and 1
respectively). Thus the physical laws read the same in all coordinate systems (provided the physical
quantities are transformed suitably) and satisfy general covariance (same form). Practically, this means
that from the special relativity laws that hold in absence of gravitation, we have to replace ηαβ by gαβ
and replace derivation by covariant derivation.
D dxi
(∇X → dτ ) with ξ µ → uµ (dt → dτ and dτ = ui ) and thus
Duµ duµ
= + Γµνλ uν uλ .
dτ dτ
Then equation (19.2) reads
duµ
= f µ − mΓµνλ uν uλ .
m (19.3)
dτ
We see that on the right hand side there are now gravitational forces appearing explicitly (via Γµνλ ).
Equation (19.3) (or (19.2)) is covariant (it has the same form in all coordinate systems) and reduces for
gµν → ηαβ (thus Γµνλ = 0) to equation (19.1) (in a local IS). The components of uµ are not independent
but satisfy the condition gµν uµ uν = c2 .
19.2 Electrodynamics
According to the equivalence principle, Maxwell’s equations (see (6.4) and (6.5))
4π β
∂α F αβ = j and αβγδ ∂β Fγδ = 0
c
are valid in a local IS. Applying the covariance principle, they become as follows in a general KS:
4π µ
F µν ;ν = j and µνλκ Fλκ;ν = 0, (19.4)
c
64
19 PHYSICAL LAWS WITH GRAVITATION
• uµ : four-velocity,
In a KS this becomes p
T µν = ρ + 2 uµ uν − g µν p. (19.9)
c
In the IS the conservation law implies T µν ,ν = 0 and in the KS T µν ;ν = 0 (explicitely, T µν ;ν =
√
∂ g
T µν ,ν + Γµνλ T νλ + Γννλ T µλ = 0). With (19.5), Γννλ = √1
g ∂xλ , we get instead
√
1 ∂ gT µν
T µν ;ν = √ + Γµνλ T νλ = 0. (19.10)
g ∂xν
This is no longer a conservation law, as we cannot form any constant of motion from (19.10). This
should also not be expected, since the system under consideration can exchange energy and momentum
with the gravitational field.
n
∂g
P ∂gki ∂gki ∂gkm m
16 g = det(gik ) = i1 ...in g1i1 . . . gnin . Consider ∂xl
= i1 ...in g1i1 . . . ∂xl
k
. . . gnin and use ∂xl
k
= ∂xl
δ ik =
k=1
∂gkm mr
∂xl
g grik . Due to the antisymmetry of , only the term r = k survives. Thus
∂g ∂gkm mk
= g g.
∂xl ∂xl
Plugging this into the definition of Γkkl (one contraction):
√ √
g km ∂gmk ∂gml ∂gkl g km ∂gmk ∂ ln g 1 ∂ g
Γkkl = + − = = = √ . (19.5)
2 ∂xl ∂xk ∂xm 2 ∂x l ∂x l g ∂xl
| {z }
vanish by interchanging
(m↔k)
With (19.5) one can show that the inhomogeneous Maxwell equation in KS can be written as
√
1 ∂( gF µν ) 4π
√ = 2 jµ, (19.6)
g ∂xν c
and the continuity equation: j µ ;µ = 0 becomes
√
∂( gj µ )
= 0. (19.7)
∂xµ
65
20 EINSTEIN’S FIELD EQUATIONS
Requirements:
• The Newtonian limit is well confirmed through all observations: ∆φ = 4πGρ.
• From the Newtonian limit of the equation of motion for a particle we derived (equation (9.17))
g00 ≈ 1 + 2 cφ2 .
(2) Gµν has the “dimension” of a second derivative. If we assume that no new dimensional constant
enter in Gµν then it has to be a linear combination of terms which are either second derivatives of
the metric gµν or quadratic in the first derivative of gµν .
(3) Since Tµν is symmetric, Gµν also has to be symmetric and due to the fact that Tµν is covariantly
conserved, i.e. T µν ;ν = 0, it follows that Gµν must satisfy Gµν = Gνµ and Gµν ;ν = 0.
(4) For a weak stationary field we shall get (20.1), thus G00 ' ∆g00 .
Conditions (1)-(4) determine Gµν uniquely. (1) and (2) imply that Gµν has to be a linear combination
of Rµν , the Ricci tensor, and R, the Ricci scalar17 . The symmetry of Gµν is automatically satisfied.
The contracted Bianchi identity (18.20), (18.21) suggests that Gµν ;ν = 0 on the Einstein tensor, what
implies b = − a2 . Thus we find
1 8πG
Gµν = a(Rµν − gµν R) = 4 Tµν . (20.3)
2 c
The constant a has to be determined by performing the Newtonian limit. Consider weak fields:
gµν = ηµν + hµν , |hµν | 1 (non relativistic velocities: v i c), then |Tik | |T00 | ⇒ |Gik | |G00 |.
Compute the trace of Gµν :
= a(R − 2R) = −aR from (20.3)
g µν Gµν ≈ G00 = a R00 − R g00 = a(R00 − R/2) . (20.4)
2 |{z}
≈η00
=1
17 It can be shown that indeed the Ricci tensor is the only tensor made of the metric tensor and first and second
deivatives of it, and which is linear in the second derivative.
66
20 EINSTEIN’S FIELD EQUATIONS
For weak fields all terms quadratic in hµν can be neglected in the Riemann tensor; we get to leading
order:
∂Γρµν ∂Γρρµ
Rµν = Rρ µρν ' − (|hµν | 1).
∂xρ ∂xν
For weak stationary fields we find:
∂Γi00 1 ∂g00
R00 = with Γi00 = .
∂xi 2 ∂xi
∂Γi00 ! 18
Thus G00 ≈ 2a ∂xi = a∆g00 = ∆g00 , therefore a = 1. Einstein’s field equations are (found 1915 by
Albert Einstein):
R 8πG
Rµν − gµν = 4 Tµν (20.6)
2 c
Together with the geodesic equation ((16.12) or (19.3)), these are the fundamental equations of general
relativity. By contraction of (20.6), we find also
R µ 8πG
Rµ µ − δ µ = −R = 4 T. (20.7)
2 |{z} c
=4
an equivalent version of the field equations. For the vacuum case where Tµν = 0 we have
Rµν = 0. (20.9)
Einstein’s equation constitutes a set of non-linear coupled partial differential equations whose general
solution is not known. Usually one makes some assumptions, for instance spherical symmetry. Because
the Ricci tensor is symmetric, the Einstein equations constitute a set of 10 algebraically independent
second order differential equations for gµν .
The Einstein equations are generally covariant, so that they can at best determine the metric
up to coordinate transformation (→ 4 functions). Therefore we expect only 6 independent generally
covariant equations for the metric. Indeed the (contracted) Bianchi identities tell us that (equation
(18.21)) Gν µ;ν = 0 and hence there are 4 differential relations among the Einstein’s equations. Bianchi
identities can also be understood as a consequence of the general covariance of the Einstein equations.
18 Depending
on the convention used for the Riemann tensor, one could also encounter a minus in front of the energy-
momentum tensor, as for example in Weinberg.
67
21 THE EINSTEIN-HILBERT ACTION
(g = det gik , d4 x in 4 dimensions). The Euler-Lagrange equations are the field equations in vacuum:
δSD [g] = 0.
We have
√ µν √ √
Z Z Z Z
µν 4
δ R(g) dv = δ(g Rµν −g) d x = (δRµν )g −g d x + Rµν δ(g µν −g) d4 x.
4
(21.3)
D D D D
δRµν = δΓα α
20
µν ,α − δΓµα ,ν . (21.5)
δRµν = δΓα α
µν ;α − δΓµα ;ν (21.6)
19 Note that gµν;σ = 0.
20 variation (with respect to g) δ and normal derivative commute
68
21 THE EINSTEIN-HILBERT ACTION
g µν δRµν = g µν δΓα µν α
µν ;α − g δΓµα ;ν (21.7)
= ω α ;α
= ω α ,α + Γααµ ωµ
|{z}
√
∂ −g
√1 ∂xµ
−g
√
1 ∂( −g ω µ )
=√ . (21.8)
−g ∂xµ
Inserting this into the integral (21.3) and applying (15.21) (Gauss theorem), we get
Z Z
(divg ω)η = iω η,
D ∂D
α
where divg ω = ω ;α and thus
µν √ √
Z Z
(δRµν )g 4
−g d x = ω α −g d0α .
D ∂D
ω α = g µν δΓα
µν − g
µα
δΓνµν (21.9)
is a vector field. If the variations of δg µν vanish outside a region contained in D, then the boundary
term vanishes as well.
√
As for the second term in (21.3) ( Rµν δ(g µν −g) d4 x), we recall that for an n × n matrix A(λ)
R
D
we have (see linear algebra):
d
det A = det A tr A−1 dA
i) dλ dλ ,
−1
ii) d
dλ (A )A = −A−1 dA
dλ .
Thus (δg µν )gνσ = −g µν δgνσ comes from ii) and δg = gg µν δgνµ comes from i) with A−1 = g µν . Hence
we find the desired expressions
√ 1√ 1√
δ −g = −g g µν δgνµ = − −g gαβ δg αβ ,
2 2
√ √ 1√
δ(g µν −g) = −g δg µν − −g g µν gαβ δg αβ . (21.10)
2
And thus
√ √
Z Z 1
0= Rµν δ(g µν −g) d4 x = −g d4 x Rµν δg µν − Rµν g µν gαβ δg αβ
| {z } 2 | {z } | {z }
D D dv R α→µ
β→ν
Z
1
= dv Rµν − Rgµν δg µν .
2
D | {z }
=Gµν =0
69
21 THE EINSTEIN-HILBERT ACTION
R√ R√
Therefore δSD [gµν ] = 0 ⇒ Gµν = Rµν − 12 Rgµν = 0. Since δ −g d4 x = −g d4 x 12 g µν δgµν =
R√ D D
− 21 −g d4 x gµν δg µν , it follows that if we have a cosmological constant , the Einstein’s vacuum
D
equations are obtained from the action principle applied on
√
Z
SD [g] = (R − 2Λ) −g d4 x. (21.11)
D
The variational principle extends to matter described by any field ψ = (ψA ) (A = 1, . . . , N ), (we include
also the electromagnetic field among the ψA ) transforming as a tensor under change of coordinates.
Consider an action of the form
√
Z
SD [ψ] = L(ψ, ∇g ψ) −g d4 x, (21.12)
D
where ∇g is the Riemannian connection of the metric g. If we know L in flat space, the equivalence
principle prescribes to replace ηαβ by gαβ and replace ordinary derivatives by covariant ones.
1 1
L=− Fµν F µν = − Fµν Fσρ g σµ g ρν ,
16π 16π
and the Euler-Lagrange equations in this case (Fµν = Aν;µ − Aµ;ν = Aν,µ − Aµ,ν ) for the basic
4-potential Aµ field read:
∂L ∂L
− ∇µ = 0, with ∇µ Aν = Aν;µ ;
∂Aν ∂∇µ Aν
∂L ∂L 1
in this case ∂Aν = 0, and ∂∇µ Aν = − 4π F µν . The Euler-Lagrange equations are then F µν ;ν = 0, which
4π µ 1
are the Maxwell equations for vanishing current j µ (F µν ;ν = c j and L = − 16π Fµν F µν − 1c j µ Aµ
with j µ Aµ = g µν jν Aµ ).
Variations in (21.12) with respect to the fields ψA lead to the Euler-Lagrange equations, whereas
variations with respect to the metric (which is also a function and is determined by solving Einstein’s
equations) gives (without proof)
√ √
Z Z
1
4
δg L(ψ, ∇g ψ) −g d x = − T µν δgµν −g d4 x. (21.13)
2
D D
This term has to be added to the one proportional to δgµν in Einstein’s action:
√ c4
Z
1
−g d4 x Gµν − Tµν δg µν
16πG 2
D | {z }
=0
8πG 1
Fµσ Fνσ − 14 Fσρ F σρ gµν (or T αβ = −Fµα F µβ −
and thus Gµν = c4 Tµν . For electrodynamics: Tµν = 4π
Lg αβ ). And similarly for other “matter” fields.
70
22 STATIC ISOTROPIC METRIC
Due to isotropy and time independence, A, B and C cannot depend on θ, φ and t (and no linear terms
in dθ and dφ). Freedom in the choice of coordinates allows to introduce a new radial coordinate in
(22.1): C(r)r2 → r2 , thus C(r) can be absorbed into r. We get the standard form:
(θ and φ have the same significance as in Minkowski coordinates). Due to our asymptotic requirements
(r → ∞) we can assume that B(r) → 1 and A(r) → 1.
71
22 STATIC ISOTROPIC METRIC
22.3 Christoffel symbols and Ricci tensor for the standard form
The metric tensor gµν is diagonal.
g00 = B(r) g11 = −A(r) g22 = −r2 g33 = −r2 sin2 θ (22.4)
1 1 1 1
g 00 = g 11 = − g 22 = − g 33 = − (22.5)
B(r) A(r) r2 r2 sin2 θ
g σν
∂gµν ∂gλν ∂gµλ
The non-vanishing components of Γσλµ = + − are
2 ∂xλ ∂xµ ∂xν
B0 B0 A0
Γ001 = Γ010 = Γ100 = Γ111 =
2B 2A 2A
1 r r sin2 θ
Γ212 = Γ221 = Γ122 = − Γ133 = − (22.6)
r A A
1
Γ313 = Γ331 = Γ323 = Γ332 = cot θ Γ233 = − sin θ cos θ
r
where 0 stands for ∂
∂r . With
− g = r4 AB sin2 θ (22.7)
we get √
A0 B0
∂ ln −g 2
Γρµρ
= = 0, + + , cot θ, 0 . (22.8)
∂xµ r 2A 2B
The Ricci tensor can then be calculated as
∂Γρµν ∂Γρρµ
Rµν = − + Γσµν Γρρσ − Γσρµ Γρνσ , (22.9)
∂xρ ∂xν
and we get as a result
B 00 A0 B 0 B 02 B0 2 A0 B0
R00 = − − + + + ,
2A 2A2 2AB 2A r 2A 2B
B 00 B 0 A0 B0 B0
= − + + , (22.10)
2A 4A A B rA
B 00 B 0 A0 B0 A0
R11 =− + + + , (22.11)
2B 4B A B rA
0
B0
r A 1
R22 =1+ − − , (22.12)
2A A B A
72
22 STATIC ISOTROPIC METRIC
Similarly, the pressure P (r) is thought to vanish for r > r0 . The four velocity vector within the mass
distribution in the static case is uµ = (u0 = constant, 0, 0, 0). This way, the energy-momentum tensor
(describing matter) does not depend on time. We then adopt the ansatz for the metric elaborated
in (22.2): gµν = diag(B(r), −A(r), −r2 , −r2 sin2 θ). Outside the mass distribution (r ≥ r0 ), the Ricci
tensor vanishes: Rµν = 0. We have already calculated the coefficients Rµν in equations (22.10) –
(22.13). For µ 6= ν, Rµν = 0 is trivially satisfied while the diagonal components should be set to zero:
R00 = R11 = R22 = R33 = 0 (r ≥ r0 ).
0
A0
R00 R11 1 B d
Consider + =− + = 0 and thus (ln AB) = 0 (since rA 6= 0) or AB = constant
B A rA B A dr
1
(or ln AB = constant). For r → ∞ we require A = B = 1, therefore AB = 1 ⇒ A(r) = B(r) .
Introducing this into R22 (22.12) and R11 (22.11) leads to
R22 = 1 − rB 0 − B = 0, (22.15)
B 00 B0 rB 00 + 2B 0 1 dR22
R11 = − − =− = = 0. (22.16)
2B rB 2rB 2rB dr
With (22.15), (22.16) is automatically satisified (since R22 = 0 also its derivative vanishes). We write
(22.15) as
d
(rB) = 1. (22.17)
dr
| {z } = r − 2a. Then
We integrate it and get rB = r + constant
−2a
2a
B(r) = 1 − ,
r
(22.18)
1
A(r) = ,
1 − 2a
r
for r ≥ r0 . This solution for the vacuum Einstein’ equations was found in 1916 by Schwarzschild. The
Schwarzschild solution is
dr2
2 2a 2 2
ds = 1 − c dt − − r2 (dθ2 + sin2 θdϕ2 ) (22.19)
r 1 − 2a
r
73
23 GENERAL EQUATIONS OF MOTION
2GM
The Schwarzschild radius of the Sun is rs, = c2 ' 3 km (M ' 2 × 1030 kg, R = 7 × 105 km) so
rS, 2GM
√
R = c 2 R ' 4 × 10−6 21 dt
. A clock at rest in r has the proper time dτ = B dt, thus dτ diverges
at r → rS . This implies that a photon emitted at r = rS will be infinitely redshifted (t is not a good
coordinate either for events taking place at r ≤ rS ). A star, whose radius rstar is smaller than rS , is
a black hole since photons emitted at its surface cannot reach regions with r > rS .
rS
Expanding the Schwarzschild metric in power of r and comparing it with the Robertson expansion
(22.3), one finds β = γ = 1 for general relativity.
with the coordinates (x0 , x1 , x2 , x3 ) = (ct, r, θ, φ). Equations (23.1) – (23.3) define the relativistic
Kepler problem. Using the Christoffel symbols given in (22.6), we get for (23.1):
d2 x0 B 0 dx0 dr
2
=− , (23.4)
dλ B dλ dλ
2 2 2 2
d2 r B 0 dx0 A0 dr r sin2 θ dφ
r dθ
=− − + + , (23.5)
dλ2 2A dλ 2A dλ A dλ A dλ
2
d2 θ
2 dθ dr dφ
2
=− + sin θ cos θ , (23.6)
dλ r dλ dλ dλ
d2 φ 2 dφ dr dθ dφ
2
=− − 2 cot θ . (23.7)
dλ r dλ dλ dλ dλ
Equation (23.6) can be solved by
π
θ=
= constant. (23.8)
2
π
Without loss of generality we can choose the coordinate system such that θ = 2, this way the trajectory
2
π d θ
lies on the plane with θ = 2 . dλ2 = 0 corresponds to angular momentum conservation. With (23.8)
21 Apparently it seems that the Schwarzschild metric is singular for r = r , but this is not the case. It is only an
s
artefact of the coordinate choice. To be discussed later.
74
23 GENERAL EQUATIONS OF MOTION
dx0
B = constant = F. (23.12)
dλ
In (23.5) we use (23.8), (23.10) and (23.12) and get:
2
d2 r F 2B0 A0 l2
dr
2
+ 2
+ − = 0. (23.13)
dλ 2AB 2A dλ Ar3
dr
We multiply it with 2A dλ and get
" #
2
d dr l2 F2 22
A + 2− = 0. (23.14)
dλ dλ r B
Integration gives
2
l2 F2
dr
A + − = − = constant. (23.15)
dλ r2 B
Integrating it once more we get r = r(λ). Inserting then this result into (23.10) and (23.12), we obtain
with one more integration φ = φ(λ) and t = t(λ). Next we eliminate λ and get r = r(t) and φ = φ(t).
π
Together with θ = 2, this is then a complete solution (generally it has to be done numerically).
Equation (23.2) becomes
2 2 2 2
dxµ dxν dx0
dr 2 dθ 2 2 dφ
gµν =B −A −r − r sin θ = , (23.16)
dλ dλ dλ dλ dλ dλ
75
23 GENERAL EQUATIONS OF MOTION
23.1 Trajectory
From (23.15) we get s
F2 2
dr B − rl 2 −
= , (23.17)
dλ A
and with (23.10), s
dφ dφ dλ l A
= = 2 F2 2 . (23.18)
dr dλ dr r B − rl 2 −
Thus, s
Z
dr A(r)
φ(r) = F2
. (23.19)
r2 B(r)l2 − r12 −
l2
With this we can find the trajectory φ = φ(r) in the orbital plane. (Massive particles: 2 integration
F2 F2
constants l2 and l2 , massless particles: only l2 ).
rS
Insert Schwarzschild metric: B(r) = A−1 (r) = 1 − r =1− 2a
r and write:
dt dr dφ
ṫ = , ṙ = , φ̇ = .
dλ dλ dλ
Then with (23.8), (23.10), (23.12) and (23.15) we get
π 2a
θ = , cṫ 1 − = F, r2 φ̇ = l. (23.20)
2 r
Multiplying (23.15) with B and using AB = 1, we have
ṙ2 a l2 al2 F2 −
− + 2− 3 = = constant. (23.21)
2 r 2r r 2
The radial component can be written as
ṙ2
+ Veff (r) = constant, (23.22)
2
2GM
with the effective potential (2a = c2 , = {c2 , 0})
− GM + l2 − GM l2
(m 6= 0)
r 2r 2 c2 r 3
Veff (r) = . (23.23)
l2 − GM l2 (m = 0)
2r 2 c2 r 3
76
23 GENERAL EQUATIONS OF MOTION
Veff collision
circular unstable
scattering
circular stable
GM l2 GM v 2
− 2 2
'− (23.25)
r c r r c2
1 dr dr
Eq. (23.22) differs from the non-relativistic case by an additional r3 term and ṙ = dτ differs from dt
2
v
by terms of order c2 .
Observations:
• Where Veff has a minimum there are bounded solutions, however due to the relativistic effects
there will be small deviations from the elliptical orbits (precession of the perihelion). As a special
case, with ṙ = 0, the circular orbit is a possible solution (in which case the constant in (23.22) is
equal to the value of Veff at its minimum).
• If the constant is positive one gets non-bounded trajectories (corresponding to hyperbolic solu-
tions in the non-relativistic case).
77
23 GENERAL EQUATIONS OF MOTION
• If the constant is larger than the maximum value of the potential, the particle falls into the
center.
dVef f
• At minimum and maximum we have dr = 0. For m 6= 0 we get
c2 2 r
2
r − 2 + 3 = 0. (23.26)
l rS
3c2 1
In order to have two real solutions, we need l2 < 2 .
rS
That means
√
l ≥ lcrit = 3 rS c. (23.27)
For l → lcrit , the angular momentum barrier gets smaller and smaller until the maximum and
minimum fall together for l = lcrit . For l < lcrit , the potential decreases monotonically for r → 0.
Veff collision
circular unstable
scattering
78
23 GENERAL EQUATIONS OF MOTION
Here both terms are proportional to l2 , thus the shape of Veff does not depend on l. At rmax = 32 rS
the potential has a maximum. At rmax the photons can move on a circular orbit, which is unstable. If
the constant in (23.22) is smaller than Veff (rmax ) then the incoming photon will be scattered, whereas
25
if the constant is bigger the photon will be absorbed at the center.
79
24 LIGHT DEFLECTION
Part VI
∆φ
r0
φ r
Sun
light ray
We will now show that light is deflected by a massive body, carrying through calculations for the Sun.
In fig. 13, the following quantities are defined: light is deflected by ∆φ and r0 is the minimal distance
(or impact parameter) from the Sun. For simplification we assume also r0 rS .
As starting point of the integration we choose the minimum distance r0 , where we set φ(r0 ) = 0.
Going from r0 till r∞ the angle changes by φ(∞). Along the drawn trajectory the radial vector turns
by 2φ(∞). If the trajectory would be a straight line, then 2φ(∞) = π.
Thus ∆φ = π − π = 0 for a straight line and in general (φ(r0 ) = 0):
∆φ = 2φ(∞) − π. (24.2)
80
24 LIGHT DEFLECTION
light ray
π π
φ(r) → φ(∞) = 2 φ(r) → φ(∞) = 2
Sun
B(r0 ) r2 r2
1 1
− 1 ' 2 1 + 2a − −1
B(r) r02 r0 r r0
2
r 2ar
= 2 −1 1− .
r0 r0 (r + r0 )
√
We get using 1 + x = 1 + x2 ,
Z∞
dr r0 a ar
φ(∞) ' p 1+γ +
r2 − r02 r r r0 (r + r0 )
r0
" p #∞
r2 − r02
r
r
0 a a r − r0
= arccos +γ +
r r0 r r0 r + r0
r0
π a a
= +γ + . (24.6)
2 r0 r0
With (24.2) we get
4a 1+γ 2rS 1+γ
∆φ = = . (24.7)
r0 2 r0 2
81
24 LIGHT DEFLECTION
2GM
For general relativity, γ = 1, rS = c2 and thus
2rS
∆φ =
r0
For a light ray which just grazes the surface of the Sun (r0 = R = 7×105 km) we get (π = 180×360000 ):
00 1+γ
∆φ = 1.75 . 26 (24.8)
2
26 “cheating” with Newton’s theory gives half this result that is 0.8400
82
25 PERIHELION PRECESSION
25 Perihelion precession
Consider the elliptical orbit of a planet around the Sun:
y
Mercury
r
p
φ
rmax rmin
Sun x
We will use the following notations: minimum distance r− = rmin , maximum distance r+ = rmax ,
φ± = φ(r± ), A± = A(r± ), B± = B(r± ). The relativistic orbit follows from equation (23.19) for
r = r(φ) with = c2 . The integral gives for the change in angle between r− and r+ :
Zr+ s Zr+ s
dr A(r) dr A(r)
φ+ − φ− = F2 c2
= . (25.1)
r2 1
B(r)l2 − r 2 − l2
r2 K(r)
r− r−
For a full orbit the angle is 2π, i.e. twice the integral (25.1). The shift of the perihelion (per complete
orbit) is given by
∆φ = 2(φ+ − φ− ) − 2π. (25.2)
dφ dr
p
The integrand in (25.1) is equal to dr . For r = r± due to dφ = 0, K(r) r2 has to vanish, thus
K(r± ) = 0:
F2 1 c2
2
= 2 + 2. (25.3)
B± l r± l
83
25 PERIHELION PRECESSION
B+ B− r2 r2
2 − r2
−
B+ − B−
+
c2 r+ −
= − = − . (25.5)
l2 B+ − B− 2 r2
r+ 1
− 1
− B+ B−
c̃ can be obtained by comparing with (25.6) for r → ∞. With (25.9) one gets
a a
c̃ = 1 − (2 − β + γ) + . (25.11)
r+ r−
84
25 PERIHELION PRECESSION
π
r+ and r− correspond to ψ = 2 and ψ = − π2 , respectively. With
1 1 1 1 1
d = − 2 dr = − cos ψ dψ, (25.14)
r r 2 r+ r−
1 1 1 1 1
− = − (1 + sin ψ), (25.15)
r− r 2 r− r+
1 1 1 1 1
− = − (1 − sin ψ), (25.16)
r r+ 2 r− r+
6πa
∆φ =
p
Consider Mercury: p = 55 × 106 km, 2a ≈ 3 km, π = 180◦ × 360000 which give ∆φ = 6πa
p =
00
0.104 (per full orbit). In 100 years Mercury fulfills 415 orbits around the Sun, this way we get
∆φ = 43 (per century). For more distant planets (Venus, Earth,. . . ) ∆φ is at most ∼ 500 per century.
00
Already in 1882, Newcomb found a perihelion precession of 4300 per century for Mercury. Full perihelion
precession amounts to 57500 per century of which 53200 are due to the influence of other planets (this
within Newtonian theory). One finds
2 − β + 2γ
= 1.003 ± 0.005, (25.21)
3
in good agreement with general relativity. So far, the parameters of the Robertson expansion are
constrained to |γ − 1| < 3 × 10−4 and |β − 1| < 3 × 10−3 . More recently (radar echoes delay from
Cassini spacecraft): |γ − 1| = (2.1 ± 2.3) × 10−5 .28
28 B. Bertotti et al. Nature 425, 374 (2003)
85
25 PERIHELION PRECESSION
Sun
Mercury
Figure 17: Illustration of the perihelion precession of Mercury (effect strongly exagerated)
2 2 Rk − R⊥
Q = J2 M R with J2 = , (25.22)
5 R
and
• R⊥ : radius orthogonal to the plane containing the planet orbits and parallel to the rotation axis
of the Sun.
The induced gravitational potential in the planet’s orbital plane (which is also the equatorial plane of
the Sun) is
GM GQ
φ(r) = − − 3. (25.23)
r 2r
The additional term has the same r dependence than the additional relativistic term:
GM l2 GM l2
Veff = − + 2− 2 3 . (25.24)
r 2r c r
86
26 LIE DERIVATIVE OF THE METRIC AND KILLING VECTORS
GM
With l ∼ pv and v 2 ∼ p we can compare the two terms (their relative strength)
2 2 2
GQ J2 R J2 R J2 R
2 ∼ 2 ∼ ∼ .
GM cl 2 p2 vc2 p GM
c2
pa
We see that the full expression for the perihelion precession is given by
2
J2 R
6πa 2 − β + 2γ
∆φ = + . (25.25)
p 3 2ap
2
J2 R
From observations one finds J2 ∼ (1 − 1.7) × 10−7 ; thus the additional term is 2ap ≈ 5 × 10−4 ,
accounting for at most 1/10 of the error given in (25.21) and is thus negligible.
Kσ = gσµ K µ ,
∂Kσ ∂Kρ
= + − 2Kµ Γµρσ (26.2)
∂xρ ∂xσ
= Kσ;ρ + Kρ;σ .
29
Any 4-vector Kσ (x) satisfying this equation will be said to form a Killing vector.
Example: Consider a stationary gravitational field, for which there exists coordinates {xµ } such that the
components of gµν do not depend on ct = x0 (for instance Schwarzschild metric). Let K µ = δ µ 0 with
the corresponding vector field δ µ 0 ∂µ (→ ∂0 ). Inserting K µ into (26.1) one gets LK gµν = gµν,0 + 0 + 0 =
∂
∂x0 gµν = 0 (since gµν does not depend on x0 ). K is a Killing vector or Killing field or an infinitesimal
isometry.
29 Named after 19th century mathematician Wilhelm Killing
87
27 MAXIMALLY SYMMETRIC SPACES
Notice that, due to the properties of the Lie derivative, if K1 and K2 are Killing vectors, LK1 gµν = 0,
LK2 gµν = 0 then [K1 , K2 ] is also a Killing vector since
We are used to the fact that symmetries lead to conserved quantities: in classical mechanics the angular
momentum of a particle moving in a rotationally symmetric field is conserved. In the present context,
the concept of “symmetries of a gravitational field” is replaced by “symmetries of the metric” and we
therefore expect conserved quantities to be associated with the presence of Killing vectors.
Let K µ be a Killing vector and xµ (τ ) be a geodesic. Then the quantity Kµ ẋµ is constant along the
geodesic. Indeed,
D
(Kµ ẋµ ) = (∇ν Kµ ẋν )ẋµ + Kµ (∇ν ẋµ ) ẋν
dτ | {z }
=0 geodesic
1
= (∇ν Kµ + ∇µ Kν ) ẋµ ẋν = 0. (26.5)
2| {z }
=0 (26.3)
Taking into account the first Bianchi identity, it is possible to find that for a Killing vector Kµ , one
has (no proof here)
ρ
∇λ ∇µ Kν (x) = Rλµν Kρ (x) (27.3)
88
27 MAXIMALLY SYMMETRIC SPACES
for x = x0 . Thus a Killing vector K µ (x) is completely determined everywhere by the values of K µ (x0 )
(i)
and ∇µ Kν (x0 ) at a single point x0 (think of Taylor expansion). A set of Killing vectors {Kµ (x)} is
said to be independent if any linear relation of the form
X
ci Kµ(i) (x) = 0, (27.4)
i
• Homogeneous space is meaning that the n-dimensional space(-time) admits n-translational Killing
vectors.
• We define a maximally symmetric space to be a space with a metric with a maximal number of
n(n+1)
2 Killing vectors.
The Riemann curvature tensor of a maximally symmetric space becomes simpler. One can show (no
proof) that it becomes
Rijkl = k(gik gjl − gil gjk ) (27.6)
dσ 2 = A(r)dr2 + r2 dΩ2
|{z}
dθ 2 +sin2 θdφ2
For the Christoffel symbols, we use the ones for the general form of a static isotropic metric (22.6) with
B(r) = 0. Since the Christoffel symbols are invariant under an overall sign change of the metric, also
89
27 MAXIMALLY SYMMETRIC SPACES
the Ricci tensor is and thus one can apply (22.10)-(22.13) with B(r) = 0 for this three-dimensional
space, without caring for the sign in front of A. Hence we get for equation (27.7)
A0
Rrr = R11 = ,
rA
(27.9)
1 rA0
Rθθ = R22 =− +1+ .
A 2A2
From eq. (27.7), we have Rrr = 2kA, and Rθθ = 2kgθθ = 2kr2 . Thus from equating the two first
equation leads us to
A0
2kA = ⇒ A0 = 2krA2 , (27.10)
rA
while we get for the second one
1 rA0 1 2kr2 A2 1 1
2kr2 = − +1+ 2
=− +1+ = − + 1 + kr2 ⇒ kr2 = − + 1,
A 2A A 2A2 A A
which leads to
1
A= , (27.11)
1 − kr2
and solves also (27.10). Then the metric on the 3-dimensional subspace (maximally symmetric) is
dr2
dσ 2 = + r2 dΩ2 . (27.12)
1 − kr2
It can be shown that k can have the following values: 0, ±1.
+1 sphere, positive curvature
k = −1 hyperbola, negative curvature
0 plane, zero curvature
The full metric (with time coordinate) has then the form:
dr2
ds2 = c2 dt2 − a2 (t) + r 2
(dθ 2
+ sin 2
θ dφ 2
) (27.13)
1 − kr2
where a(t) is the cosmic scale factor, which has to be determined by solving the Einstein’s equations via
the matter content of the universe. This metric (first discovered by Friedmann, Lemaître, Roberston
and Walker) is a reasonable ansatz for describing the universe. There is good evidence that the universe
(on large scales) is surprisingly homogeneous and isotropic (from redshift surveys of galaxies and cosmic
microwave background radiation).
90
28 FRIEDMANN EQUATIONS
28 Friedmann equations
We write the metric (27.13) as follows:
where tildes denote 3-dimensional quantities calculated with the metric g̃ij . The Christoffel symbols
are given by (notice Γµ00 = 0):
ȧ i
Γijk = Γ̃ijk , Γij0 = δ j, Γ0ij = ȧ a g̃ij , (28.2)
a
where dot denotes derivation with respect to t. The relevant components of the Riemann tensor are:
ä
Ri 0j0 = − δ i j , R0 i0j = a ä g̃ij , Rk ikj = R̃ij + 2ȧ2 g̃ij (28.3)
a
We can make use of R̃ij = 2kg̃ij (maximal symmetry of the 3-dimensional subspace) to compute Rµν .
The non-zero components are then
ȧ2
ä ä k
R00 = −3 , Rij = (aä + 2ȧ2 + 2k)g̃ij = − +2 2 +2 2 gij , (28.4)
a a a a
where gij = −a2 g̃ij . The Ricci scalar becomes R = − a62 (aä + ȧ2 + k) and the non-zero components of
the Einstein tensor are
ȧ2 ä ȧ2
k k
G00 = 3 2
+ 2 , G0i = 0, Gij = 2 + 2+ 2 gij . (28.5)
a a a a a
Next we have to specify the matter content. We treat here the universe as non-interacting particles or
a perfect fluid. A perfect fluid has energy-momentum tensor (19.8)
p
T µν = 2 + ρ uµ uν − g µν p, (28.6)
c
where p is the pressure, ρ the energy density and uµ the velocity field of the fluid (uµ = (c, 0, 0, 0) in
a comoving coordinate system). The trace of the energy-momentum tensor is then
p = wρ, (28.8)
Examples:
• For radiation the energy-momentum tensor is (like in Maxwell’s theory) traceless and hence
radiation has the equation of state
1
p= ρ, (28.9)
3
thus w = 31 .
91
28 FRIEDMANN EQUATIONS
Integration gives ρa4 = constant or ρ ∝ a−4 . More generally for (28.8) one gets:
8πG
Gµν = Tµν + Λgµν .
c4
Using (19.8) and that uµ = (c, 0, 0, 0) in a comoving coordinate system, let us write down the 00-
component and the ij-component of this equation:
2
ȧ k
3 + 2 = 8πGρ + Λ, (28.14)
a2 a
ä ȧ2
k
2 + 2 + 2 gij = (−8πGp + Λ) gij . (28.15)
a a a
One has in addition equation (28.10) from the conservation law. Using the first equation to eliminate
ȧ2 k
a2 + a2 from the second one, one obtains the Friedmann equations:
ȧ2 k 8πG Λ
2
+ 2 = ρ+ , (28.16)
a a 3 3
ä
−3 = 4πG(ρ + 3p) − Λ, (28.17)
a
ȧ
ρ̇ = −3(ρ + p) . (28.18)
a
Notice that one could also use the form (20.8) of the Einstein equations to derive the above equations.
ȧ(t)
Introducing the Hubble parameter: H(t) = a(t) and the deceleration parameter: q(t) = − a(t)ä(t)
ȧ2 (t) , with
their present day values denoted by H0 = H(t0 ) and q0 = q(t0 ), where t0 is the age of the universe;
92
28 FRIEDMANN EQUATIONS
we get instead
8πG k Λ
H2 = ρ− 2 +
3 a 3
1
q= (4πG(ρ + 3p) − Λ) (28.19)
3H 2
d
(ρa3 ) = −3Hpa3
dt
3H 2 ρ
In the case of Λ = 0, we define a critical density ρcrit = 8πG and a density parameter Ω = ρcrit . Then
ρ < ρcrit ⇔ k = −1 open universe
ρ = ρcrit ⇔ k = 0 flat universe
crit ⇔ k = +1 close universe
ρ > ρ
Let us now assume that the density is a combination of dust (that we shall simply denote as
“matter”) and radiation: ρ = ρm + ρr . Moreover, we assume that ρm ∼ a−3 and ρr ∼ a−4 . This is
valid if radiation and matter are decoupled, or if one density is much bigger that the other one (notice
8πG 3 8πG 4
that in today’s universe ρm ρr ). Let us introduce the constants Km = 3 ρm a and Kr = 3 ρr a .
Inserting them into equation (28.16) leads to
Kr Km 1
ȧ2 − − − Λa2 = −k. (28.20)
a2 a 3
This equation reads as
ȧ2 + V (a) = −k,
where
Kr Km 1
V (a) = − − − Λa2 (28.21)
a2 a 3
plays the role of an effective potential, see figure 18.
Kr Km
Consider the solution for a → 0: in that case the terms a2 and a dominate and the behavior
does not depend neither on k nor on Λ.
Kr √
ȧ2 ≈ → a(t) ∼ t (28.22)
a2
Km 2
ȧ2 ≈ → a(t) ∼ t 3 (28.23)
a
√
For a → 0, ȧ goes to ∞. If Kr 6= 0, then for a → 0 a ∼ t. From figure 18 we can discriminate
different types of solutions:
1. For Λ < 0, there is for all k-values a maximal amax . Thus there will be a periodic solution going
between a = 0 and amax .
2. Λ = 0 (Einstein-de Sitter universe): for k = 1 there is a bounded solution as in the case 1. For
k = 0 the expansion velocity goes towards 0, instead for k = −1 it goes towards a constant value.
93
28 FRIEDMANN EQUATIONS
a) Λ = Λcrit and k = 1. The value of Λcrit is obtained for the horizontal line −k (figure 18)
just going through the maximum of the potential. Assuming Kr ≈ 0 (as in today’s universe)
dV
we get with ȧ = 0, V = −1 → da = 0:
4 3Km
Λ = Λcrit = 2
and a = astat = . (28.24)
9Km 2
This corresponds to the static Einstein solution. Einstein introduced a cosmological constant
in order to get such a static solution, which is however unstable. Small perturbations lead
either to a contraction or to an exponential growth.
b) Λ < Λcrit and k = 1. The horizontal line −k intercepts the curve V (a) in two points a1 and
a2 . We obtain either a periodic solution between 0 and a1 or an unbounded solution with
a > a2 .
c) Λ = Λcrit (1 + ) and k = 1. For 0 < 1 the horizontal line −k lies just above the
maximum value of V (a). Thus the expansion velocity ȧ will be very low in this region
(Lemaître universe).
d) Λ > Λcrit and k = 1. The line −k does not intercept the V (a) curve. Around the maximum
of V (a) the expansion is lowered.
e) Λ > 0 and k = −1, 0 : as in the previous case, but there may be less deceleration in the
region of the maximum of V (a).
Λ<0
−k
a
Λ=0
Λ>0
Figure 18: Sketch of the “effective potential” V (a) for different values of Λ.
For Λ > 0, all solutions are unbounded. In the limit a → ∞, V (a) is dominated by the Λ-term:
Λ 2
ȧ2 ≈ a ,
3
94
28 FRIEDMANN EQUATIONS
thus r !
Λ
a(t) ∼ exp t . (28.25)
3
The expansion is exponentially accelerated.
According to the currently most accepted model (so-called ΛCDM-model), the main contributions
to the density are
Moreover, the universe seems to be almost flat: k ≈ 0. These cosmological parameters would thus
correspond to the case 3e) of the previous discussion.
Finally, H0−1 is related to the age of the universe. H0 ∼ 67.80 ± 0.77 sec/Mpc
km
gives an age of ∼ 13.8
billion years.30
95