Advanced Quantum Field Theory Roberto Casalbuoni

  • November 2019
  • PDF

This document was uploaded by user and they confirmed that they have the permission to share it. If you are author or own the copyright of this book, please report to us by using this DMCA report form. Report DMCA


Overview

Download & View Advanced Quantum Field Theory Roberto Casalbuoni as PDF for free.

More details

  • Words: 84,262
  • Pages: 261
Advanced Quantum Field Theory Roberto Casalbuoni Dipartimento di Fisica Universita di Firenze

Lectures given at the Florence University during the academic year 1998/99.

Contents Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1

1 Notations and Conventions 1.1 1.2 1.3 1.4

Units . . . . . . . . . . . . . . . . . . . . . . . . Relativity and Tensors . . . . . . . . . . . . . . The Noether's theorem for relativistic elds . . Field Quantization . . . . . . . . . . . . . . . . 1.4.1 The Real Scalar Field . . . . . . . . . . . 1.4.2 The Charged Scalar Field . . . . . . . . 1.4.3 The Dirac Field . . . . . . . . . . . . . . 1.4.4 The Electromagnetic Field . . . . . . . . 1.5 Perturbation Theory . . . . . . . . . . . . . . . 1.5.1 The Scattering Matrix . . . . . . . . . . 1.5.2 Wick's theorem . . . . . . . . . . . . . . 1.5.3 Feynman diagrams in momentum space . 1.5.4 The cross-section . . . . . . . . . . . . .

. . . . . . . . . . . . .

2 One-loop renormalization 2.1 2.2 2.3 2.4 2.5 2.6 2.7

. . . . . . . . . . . . .

. . . . . . . . . . . . .

Divergences of the Feynman integrals . . . . . . . . . Higher order corrections . . . . . . . . . . . . . . . . The analysis by counterterms . . . . . . . . . . . . . Dimensional regularization of the Feynman integrals . Integration in arbitrary dimensions . . . . . . . . . . One loop regularization of QED . . . . . . . . . . . . One loop renormalization . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . . .

. . . . . . . . . . . . . . . . . . . .

4

4 6 6 9 10 12 14 17 27 27 29 30 33

36

36 38 45 52 54 56 63

3 Vacuum expectation values and the S matrix

73

4 Path integral formulation of quantum mechanics

85

3.1 In- and Out-states . . . . . . . . . . . . . . . . . . . . . . . . . . . . 73 3.2 The S matrix . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 79 3.3 The reduction formalism . . . . . . . . . . . . . . . . . . . . . . . . . 81 4.1 Feynman's formulation of quantum mechanics . . . . . . . . . . . . . 85 4.2 Path integral in conguration space . . . . . . . . . . . . . . . . . . . 88 1

4.3 4.4 4.5 4.6 4.7 4.8 4.9

The physical interpretation of the path integral . . The free particle . . . . . . . . . . . . . . . . . . . The case of a quadratic action . . . . . . . . . . . . Functional formalism . . . . . . . . . . . . . . . . . General properties of the path integral . . . . . . . The generating functional of the Green's functions . The Green's functions for the harmonic oscillator .

. . . . . . .

. . . . . . .

. . . . . . .

. . . . . . .

. . . . . . .

. . . . . . .

. . . . . . .

. . . . . . .

. . . . . . .

. 91 . 97 . 99 . 104 . 107 . 113 . 116

The path integral for a free scalar eld . . . . . . . . . . . . The generating functional of the connected Green's functions The perturbative expansion for the theory '4 . . . . . . . . The Feynman's rules in momentum space . . . . . . . . . . . Power counting in '4 . . . . . . . . . . . . . . . . . . . . . Regularization in '4 . . . . . . . . . . . . . . . . . . . . . . Renormalization in the theory '4 . . . . . . . . . . . . . .

. . . . . . .

. . . . . . .

. . . . . . .

. . . . . . .

. 123 . 127 . 129 . 135 . 137 . 142 . 145

5 The path integral in eld theory 5.1 5.2 5.3 5.4 5.5 5.6 5.7

123

6 The renormalization group

152

7 The path integral for Fermi elds

169

8 The quantization of the gauge elds

180

6.1 6.2 6.3 6.4 6.5

The renormalization group equations . . . . . . . . . . . . . . . . . . 152 Renormalization conditions . . . . . . . . . . . . . . . . . . . . . . . . 156 Application to QED . . . . . . . . . . . . . . . . . . . . . . . . . . . 160 Properties of the renormalization group equations . . . . . . . . . . . 161 The coecients of the renormalization group equations and the renormalization conditions . . . . . . . . . . . . . . . . . . . . . . . . . . . 167

7.1 Fermionic oscillators and Grassmann algebras . . . . . . . . . . . . . 169 7.2 Integration over Grassmann variables . . . . . . . . . . . . . . . . . . 174 7.3 The path integral for the fermionic theories . . . . . . . . . . . . . . . 178 8.1 8.2 8.3 8.4 8.5 8.6

QED as a gauge theory . . . . . . . . . . . . . . . . . . . . . Non-abelian gauge theories . . . . . . . . . . . . . . . . . . . Path integral quantization of the gauge theories . . . . . . . Path integral quantization of QED . . . . . . . . . . . . . . Path integral quantization of the non-abelian gauge theories The  -function in non-abelian gauge theories . . . . . . . . .

9 Spontaneous symmetry breaking 9.1 9.2 9.3 9.4 9.5

. . . . . .

. . . . . .

. . . . . .

. . . . . .

. 180 . 182 . 186 . 196 . 200 . 204

213

The linear -model . . . . . . . . . . . . . . . . . . . . . . . . . . . . 213 Spontaneous symmetry breaking . . . . . . . . . . . . . . . . . . . . . 219 The Goldstone theorem . . . . . . . . . . . . . . . . . . . . . . . . . . 223 The Higgs mechanism . . . . . . . . . . . . . . . . . . . . . . . . . . 225 Quantization of a spontaneously broken gauge theory in the R -gauge 230 2

9.6  -cancellation in perturbation theory . . . . . . . . . . . . . . . . . . 235

10 The Standard model of the electroweak interactions

239

A

259

10.1 The Standard Model of the electroweak interactions . . . . . . . . . . 239 10.2 The Higgs sector in the Standard Model . . . . . . . . . . . . . . . . 244 10.3 The electroweak interactions of quarks and the Kobayashi-MaskawaCabibbo matrix . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 249 10.4 The parameters of the SM . . . . . . . . . . . . . . . . . . . . . . . . 257 A.1 Properties of the real antisymmetric matrices . . . . . . . . . . . . . 259

3

Chapter 1 Notations and Conventions 1.1 Units In quantum relativistic theories the two fundamental constants c e /h, the light velocity and the Planck constant respectively, appear everywhere. Therefore it is convenient to choose a unit system where their numerical value is given by

c = /h = 1

(1.1)

For the electromagnetism we will use the Heaviside-Lorentz system, where we take also 0 = 1 (1.2) From the relation 0 0 = 1=c2 it follows

0 = 1

(1.3)

In these units the Coulomb force is given by jF~ j = e41 e2 j~x ;1 ~x j2 (1.4) 1 2 and the Maxwell equations appear without any visible constant. For instance the gauss law is r~  E~ = (1.5) dove is the charge density. The dimensionless ne structure constant

is given by

2 = e / 4 0 hc

(1.6)

2 = 4e

(1.7)

4

Any physical quantity can be expressed equivalently by using as fundamental unit energy, mass, lenght or time in an equivalent fashion. In fact from our choice the following equivalence relations follow

ct E E Et

   

` =) time  lenght mc2 =) energy  mass pv =) energy  momentum /h =) energy  (time);1  (lenght);1

(1.8)

In practice, it is enough to notice that the product ch/ has dimensions E `]. Therefore

ch/ = 3  108 mt  sec;1  1:05  10;34 J  sec = 3:15  10;26 J  mt Recalling that it follows From which

1 eV = e  1 = 1:602  10;19 J

ch/ =

3:15  10;26 MeV  mt = 197 MeV  fermi 1:6  10;13

(1.9) (1.10) (1.11)

1 MeV;1 = 197 fermi (1.12) Using this relation we can easily convert a quantity given in MeV (the typical unit used in elementary particle physics) in fermi. For instance, using the fact that also the elementary particle masses are usually given in MeV , the wave lenght of an electron is given by 1  fermi  400 fermi  200 0MeV (1.13) eCompton = m1  0:5 MeV :5 MeV e Therefore the approximate relation to keep in mind is 1 = 200 MeV  fermi. Furthermore, using c = 3  1023 fermi  sec;1 (1.14) we get 1 fermi = 3:3  10;24 sec (1.15) and 1 MeV;1 = 6:58  10;22 sec (1.16) Also, using 1 barn = 10;24 cm2 (1.17) it follows from (1.12) 1 GeV;2 = 0:389 mbarn (1.18)

5

1.2 Relativity and Tensors Our conventions are as follows: the metric tensor g is diagonal with eigenvalues (+1 ;1 ;1 ;1). The position and momentum four-vectors are given by

x = (t ~x)

p = (E ~p)  = 0 1 2 3

(1.19)

where ~x and ~p are the three-dimensional position and momentum. The scalar product between two four-vectors is given by a  b = a b g = a b = a b = a0 b0 ; ~a  ~b (1.20) where the indices have been lowered by

a = g a

(1.21)

and can be raised by using the inverse metric tensor g  g . The four-gradient is dened as @ @    @ ~ @ = @x = @t  @~x = @t  r (1.22) The four-momentum operator in position space is ~) p ! i @x@ = (i@t  ;ir (1.23)  The following relations will be useful (1.24) p2 = pp ! ; @x@  @x@ = ; 

x  p = Et ; p~  ~x

(1.25)

1.3 The Noether's theorem for relativistic elds We will now review the Noether's theorem. This allows to relate symmetries of the action with conserved quantities. More precisely, given a transformation involving both the elds and the coordinates, if it happens that the action is invariant under this transformation, then a conservation law follows. When the transformations are limited to the elds one speaks about internal transformations. When both types of transformations are involved the total variation of a local quantity F (x) (that is a function of the space-time point) is given by F (x) = F~ (x0) ; F (x) = F~ (x + x) ; F (x) = F~ (x) ; F (x) + x @F (x) (1.26) @x 6

The total variation  keeps into account both the variation of the reference frame and the form variation of F . It is then convenient to dene a local variation F , depending only on the form variation F (x) = F~ (x) ; F (x) (1.27) Then we get F (x) = F (x) + x @F (x) (1.28) @x Let us now start form a generic four-dimensional action

S=

Z

V

d4x L(i x)

i = 1 : : :  N

(1.29)

and let us consider a generic variation of the elds and of the coordinates, x0  = x + x @ (1.30) i(x) = ~i(x0 ) ; i(x)  i(x) + x @x  If the action is invariant under the transformation, then S~V = SV (1.31) This gives rise to the conservation equation   @ L @ L  i  i @ Lx + @i  ; x @i  = 0 (1.32)   This is the general result expressing the local conservation of the quantity in parenthesis. According to the choice one does for the variations x and i, and of the corresponding symmetries of the action, one gets dierent kind of conserved quantities. Let us start with an action invariant under space and time translations. In the case we take x = a with a independent on x e i = 0. From the general result in eq. (1.32) we get the following local conservation law @ L i ; Lg @ T  = 0 T = @ (1.33)    i   T is called the energy-momentum tensor of the system. From its local conservation we get four constant of motion 0

P =

Z

d3xT0

(1.34)

P is the four-momentum of the system. In the case of internal symmetries we take x = 0. The conserved current will be @ L i = @ L i @ J  = 0 (1.35) J  = @  i @i  7

with an associated constant of motion given by

Q=

Z

d3 xJ 0

(1.36)

In general, if the system has more that one internal symmetry, we may have more that one conserved charge Q, that is we have a conserved charge for any . The last case we will consider is the invariance with respect to Lorentz transformations. Let us recall that they are dened as the transformations leaving invariant the norm of a four-vector x2 = x0 2 (1.37) For an innitesimal transformation

x0 =  x  x +  x

(1.38)

with

 = ; (1.39) We see that the number of independent parameters characterizing a Lorentz transformation is six. As well known, three of them correspond to spatial rotations, whereas the remaining three correspond to Lorentz boosts. In general, the relativistic elds are chosen to belong to a representation of the Lorentz group ( for instance the Klein-Gordon eld belongs to the scalar representation). This means that under a Lorentz transformation the components of the eld mix together, as, for instance, a vector eld does under rotations. Therefore, the transformation law of the elds i under an innitesimal Lorentz transformation can be written as (1.40) i = ; 21 ij  j where we have required that the transformation of the elds is of rst order in the Lorentz parameters  . The coecients  (antisymmetric in the indices (  )) dene a matrix in the indices (i j ) which can be shown to be the representative of the innitesimal generators of the Lorentz group in the eld representation. Using this equation and the expression for x we get the local conservation law 0 = =

 @L



@ L ij j @ @i i ; Lg  x + 21 @ i    1  @ T x ; T x + @ L ij j 2      @i 

;







(1.41)

and dening (watch at the change of sign)

@ L ij j M = x T ; x T ; @ i  

8

(1.42)

it follows the existence of six locally conserved currents (one for each Lorentz transformation) @ M = 0 (1.43) and consequently six constants of motion (notice that the lower indices are antisymmetric) Z M = d3 xM0 (1.44) Three of these constants (the ones with  and assuming spatial values) are nothing but the components of the angular momentum of the eld.

1.4 Field Quantization The quantization procedure for a eld goes through the following steps:  construction of the lagrangian density and determination of the canonical momentum density (1.45) (x) = @@ L(x) 

where the index describes both the spin and internal degrees of freedom  quantization through the requirement of the equal time canonical commutation relations for a spin integer eld or through canonical anticommutation relations for half-integer elds (~x t)  (~y t)] = i 3(~x ; ~y)  (~x t)  (~y t)] = 0

 (~x t)  (~y t)] = 0

(1.46) (1.47)

In the case of a free eld, or for an interacting eld in the interaction picture (and therefore evolving with the free hamiltonian), we can add the following steps  expansion of (x) in terms of a complete set of solutions of the Klein-Gordon equation, allowing the denition of creation and annihilation operators  construction of the Fock space through the creation and annihilation operators. In the following we will give the main results about the quantization of the scalar, the Dirac and the electromagnetic eld.

9

1.4.1 The Real Scalar Field

The relativistic real scalar free eld (x) obeys the Klein-Gordon equation ( + m2 )(x) = 0

(1.48)

which can be derived from the variation of the action

S= where L is the lagrangian density

Z

d4xL



L = 21 @  @   ; m2 2

(1.49)



(1.50)

giving rise to the canonical momentum

 = @ L_ = _ @ From this we get the equal time canonical commutation relations (~x t) _ (~y t)] = i3 (~x ; ~y) (~x t) (~y t)] = _ (~x t) _ (~y t)] = 0

(1.51) (1.52)

By using the box normalization, a complete set of solutions of the Klein-Gordon equation is given by f~k (x) = p1 p 1 e;ikx  f~k(x) = p1 p 1 eikx (1.53) V 2!k V 2!k corresponding respectively to positive and negative energies, with

q

and with

k0 = !k = j~kj2 + m2

(1.54)

~k = 2 ~n L

(1.55)

~n = n1~i1 + n2~i2 + n3~i3  ni 2 ZZ (1.56) and L being the side of the quantization box (V = L3 ). For any two solutions f and g of the Klein-Gordon equation, the following quantity is a constant of motion and denes a scalar product (not positive denite)

Z

where

hf jgi = i d3~xf  @t(;) g

(1.57)

f  @t(;) g = f g_ ; f_g

(1.58)

10

The solutions of eq. (1.53) are orthogonal with respect to this scalar product, for instance Y3 hf~k jf~k i = ~k~k  nini (1.59) 0

0

0

i=1

We can go to the continuum normalization by the substitution p1 ! p 1 3 (1.60) V (2 ) and sending the Kronecker delta function into the Dirac delta function. The expansion of the eld in terms of these solutions is then (in the continuum)

(x) = or, more explicitly

Z

d3~kf~k (x)a(~k) + f~k(x)ay(~k)]

(1.61)

Z

d3~k p 1 3 a(~k)e;ikx + ay(~k)eikx ] (1.62) (2 ) 2!k using the orthogonality properties of the solutions one can invert this expression (x) =

a(~k) = i

Z

d3~xf~k(x)@t(;) (x)

ay(~k) = i

Z

d3~x(x)@t(;) f~k (x)

and obtain the commutation relations a(~k) ay(~k0)] = 3 (~k ; ~k0 )

(1.63) (1.64)

a(~k) a(~k0)] = ay(~k) ay(~k0)] = 0 (1.65) From these equations one constructs the Fock space starting from the vacuum state which is dened by a(~k)j0i = 0 (1.66) for any ~k. The states ay(~k1)    ay(~kn)j0i (1.67) describe n Bose particles of equal mass m2 = ki2 , and of total four-momentum k = k1 +    kn. This follows by recalling the expression (1.33) for the energymomentum tensor, that for the Klein-Gordon eld is ;  (1.68) T = @ @  ; 21 @ @   ; m2 2 g from which Z Z   1 0 3 00 3 2 2 2 2 ~ P = H = d xT = 2 d x  + jrj + m  (1.69) 11

Pi

=

Z

d3xT 0i

=;

Z

@  d3x @x i

(P~ = ;

Z

~ ) d3xr

(1.70)

and for the normal ordered operator P we get (the normal ordering is dened by putting all the creation operators to the left of the annihilation operators)

Z

d3~k kay(~k)a(~k)

(1.71)

: P : ay(~k)j0i = k ay(~k)j0i The propagator for the scalar eld is given by

(1.72)

: P := implying

where and

h0jT ((x)(y))j0i = ;iF (x ; y)

(1.73)

Z d4k ;ikx F (k) F (x) = ; e 4 (2 )

(1.74)

F (k) = k2 ; m1 2 + i

(1.75)

1.4.2 The Charged Scalar Field

The charged scalar eld can be described in terms of two real scalar elds of equal mass 2 X

1 L = 2 (@ i)(@  i) ; m2 2i (1.76) i=1

and we can write immediately the canonical commutation relations i(~x t) _ j (~y t)] = iij 3(~x ; ~y) i(~x t) j (~y t)] = _ i(~x t) _ j (~y t)] = 0 The charged eld is better understood in a complex basis  = p1 (1 + i2) y = p1 (1 ; i2) 2 2 The lagrangian density becomes

L = @ y@   ; m2 y

(1.77) (1.78) (1.79) (1.80)

The theory is invariant under the phase transformation  ! ei , and therefore it admits the conserved current (see eq. (1.35))





j = i (@ )y ; (@ y) 12

(1.81)

with the corresponding constant of motion

Q=i

Z

d3 x y@t(;) 

(1.82)

The commutation relations in the new basis are given by (~x t) _ y(~y t)] = i3 (~x ; ~y)

(1.83)

and (~x t) (~y t)] = y(~x t) y(~y t)] = 0 _ (~x t) _ (~y t)] = _ y(~x t) _ y(~y t)] = 0

(1.84)

Let us notice that these commutation relations could have also been obtained directly from the lagrangian (1.80), since  = @ L_ = _ y  = @ L_ y = _ (1.85) @ @ Using the expansion for the real elds y

i(x) = we get

Z

Z

h

i

d3k f~k (x)ai(~k) + f~k (x)ayi(~k)

(1.86)





(x) = f~k (x) p1 (a1 (~k) + ia2 (~k)) + f~k(x) p1 (ay1(~k) + iay2 (~k)) 2 2 Introducing the combinations b(~k) = p1 (a1 (~k) ; ia2 (~k)) a(~k) = p1 (a1(~k) + ia2 (~k)) 2 2 it follows i R h (x) = d3k f~k (x)a(~k) + f~ (x)by(~k) d3k

h i R y 3  y ~ ~  (x) = d k f~k (x)b(k) + f~k (x)a (k)

(1.87)

(1.88)

k

(1.89)

from which we can evaluate the commutation relations for the creation and annihilation operators in the complex basis a(~k) ay(~k0 )] = b(~k) by(~k0)] = 3(~k ; ~k0) (1.90) a(~k) b(~k0)] = a(~k) by(~k0)] = 0 13

(1.91)

We get also : P :=

Z

d3kk

Z 2 h i X y ~ ~ a i(k)ai(k) = d3kk ay(~k)a(~k) + by(~k)b(~k) i=1

(1.92)

Therefore the operators ay(~k) e by(~k) both create particles states with momentum k, as the original operators ayi. For the normal ordered charge operator we get : Q :=

Z

h

i

d3k ay(~k)a(~k) ; by(~k)b(~k)

(1.93)

showing explicitly that ay and by create particles of charge +1 and ;1 respectively. The only non vanishing propagator for the scalar eld is given by h0jT ((x)y(y))j0i = ;iF (x ; y) (1.94)

1.4.3 The Dirac Field

The Dirac eld is described by the action

S=

Z

V

d4 x !(i@^ ; m)

(1.95)

where we have used the following notations for four-vectors contracted with the  matrices v^  v   (1.96) The canonical momenta result to be  = @ L_ y = 0 (1.97)  = @ L_ = iy @ @ The canonical momenta do not depend on the velocities. In principle, this creates a problem in going to the hamiltonian formalism. In fact a rigorous treatment requires an extension of the classical hamiltonian approach which was performed by Dirac himself. In this particular case, the result one gets is the same as proceeding in a naive way. For this reason we will avoid to describe this extension, and we will proceed as in the standard case. From the general expression for the energy momentum tensor (see eq. 1.33) we get !   ; g(!(i@^ ; m)) T = i (1.98) and using the Dirac equation !  T = i (1.99) The momentum of the eld is given by y

Pk

=

Z

d3x

T 0k

=) P~ = ;i 14

Z

~ d3 x yr

(1.100)

and the hamiltonian

H=

Z

d3x

T 00

=) H = i

Z

d3x y@t 

(1.101)

For a Lorentz transformation, the eld variation is dened by i = ; 12 ij  j (1.102) In the Dirac case one has (x) = 0(x0 ) ; (x) = ; i   (x) (1.103) 4 from which (1.104)  = 2i  = ; 41   ] Therefore the angular momentum density (see eq. (1.42)) is     i 1    ! x@ ; x @ ;   = i ! x @ ; x @ +   ]  M = i 2 4 (1.105) By taking the spatial components we obtain   Z 1 23 31 12 3 y ~ + ~ 12  (1.106) J~ = (M  M  M ) = d x  ;i~x ^ r 2 where 12 is the identity matrix in 2 dimensions, and we have dened  ~ 0  ~ 12 = 0 ~ (1.107) The expression of J~ shows the decomposition of the total angular momentum in the orbital and in the spin part. The theory has a further conserved quantity, the !   resulting from the phase invariance  ! ei . current  The decomposition of the Dirac eld in plane waves is obtained by using the spinors u(p n) e v(p n) solutions of the Dirac equation in momentum space for positive and negative energies respectively. We write i X Z d3 p r m h p(2 )3 Ep b(p n)u(p n)e;ipx + dy(p n)v(p n)eipx (1.108) (x) = n i X Z d3 p r m h y p(2 )3 Ep d(p n)!v(p n)e;ipx + by(p n)!u(p n)eipx 0  (x) = n (1.109) p 2 2 where Ep = jp~j + m . We will collect here the various properties of the spinors. The four-vector n is the one that identies the direction of the spin quantization in the rest frame. For instance, by quantizing the spin along the third axis one takes n in the rest frame as nR = (0 0 0 1). Then n is obtained by boosting from the rest frame to the frame where the particle has four-momentum p . 15

 Dirac equation (^p ; m)u(p n) = u!(p n)(^p ; m) = 0 (^p + m)v(p n) = v!(p n)(^p + m) = 0

(1.110)

 Orthogonality u!(p n)u(p n0) = ;v!(p n)v(p n0) = nn uy(p n)u(p n0) = vy(p n)v(p n0) = Emp nn v!(p n)u(p n0) = vy(p n)u(~p n0) = 0 0

0

(1.111)

where, if p = (Ep ~p), then p~ = (Ep ;p~).  Completeness

X n

X n

u(p n)!u(p n) = p^2+mm v(p n)!v(p n) = p^2;mm

(1.112)

The Dirac eld is quantized by canonical anticommutation relations in order to satisfy the Pauli principle



(~x t) (~y t)]+ = y(~x t) y(~y t) + = 0

(1.113)

 (~x t) (~y t)]+ = i3 (~x ; ~y)

(1.114)

= 3(~x ; ~y) The normal ordered four-momentum is given by

(1.115)

and or

(~x t) y(~y t)

: P :=

XZ n

+

d3p p by(p n)b(p n) + dy(p n)d(p n)]

(1.116)

If we couple the Dirac eld to the electromagnetism through the minimal substitution we nd that the free action (1.95) becomes

S=

Z

V

d4x!(i@^ ; eA^ ; m)

Therefore the electromagnetic eld is coupled to the conserved current !  j  = e 16

(1.117) (1.118)

This forces us to say that the integral of the fourth component of the current should be the charge operator. In fact, we nd :Q: = e =

Z

d3x y

XZ n





d3p e by(p n)b(p n) ; dy(p n)d(p n)

(1.119)

The expressions for the momentum angular momentum and charge operators show that the operators by(p~) and dy(p~) create out of the vacuum particles of spin 1=2, four-momentum p and charge e and ;e respectively. The propagator for the Dirac eld is given by h0jT ((x)! (y))j0i = iSF (x ; y) (1.120) where and

SF (x) = ;(i@^ + m)F (x) =

Z d4k ;ikx SF (k) e 4 (2 )

^ m 1 SF (k) = k2 ;k + = m2 + i k^ ; m + i

(1.121) (1.122)

1.4.4 The Electromagnetic Field

The lagrangian density for the free electromagnetic eld, expressed in terms of the four-vector potential is (1.123) L = ; 14 F F  where F = @ A ; @ A (1.124) and ~ A0 ; @ A~  B~ = r ~ ^ A~ E~ = ;r (1.125) @t from which E~ = (F 10 F 20 F 30 ) B~ = (;F 23  ;F 31  ;F 12) (1.126) The resulting equations of motion are A ; @ (@  A ) = 0 (1.127) The potentials are dened up to a gauge transformation

A(x) ! A0(x) = A (x) + @ (x) (1.128) In fact, A and A0 satisfy the same equations of motion and give rise to the same electromagnetic eld. It is possible to use the gauge invariance to require some 17

particular condition on the eld A . For instance, we can perform a gauge transformation in such a way that the transformed eld satises

@ A = 0 This is called the Lorentz gauge. Or we can choose a gauge such that ~  A~ = 0 A0 = r

(1.129) (1.130)

This is called the Coulomb gauge. Since in this gauge all the gauge freedom is completely xed (in contrast with the Lorentz gauge), it is easy to count the independent degrees of freedom of the electromagnetic eld. From the equations (1.130) we see that A has only two degrees of freedom. Another way of showing that there are only two independent degrees of freedom is through the equations of motion. Let us consider the four dimensional Fourier transform of A(x)

A (x) =

Z

d4k eikx A (k)

(1.131)

Substituting this expression in the equations of motion we get

;k2 A (k) + k(k A (k)) = 0

(1.132)

Let us now decompose A(k) in terms of four independent four vectors, which can be chosen as k = (E ~k), k~ = (E ;~k), and two further four vectors e  (k),  = 1 2, orthogonal to k k e  = 0  = 1 2 (1.133) The decomposition of A(k) reads A (k) = a (k)e  + b(k)k + c(k)k~ (1.134) From the equations of motion we get

;k2 (a e  + bk + ck~ ) + k(bk2 + c(k  k~)) = 0

(1.135)

The term in b(k) cancels, therefore it is left undetermined by the equations of motion. For the other quantities we have

k2 a (k) = c(k) = 0

(1.136)

The arbitrariness of b(k) is a consequence of the gauge invariance. In fact if we gauge transform A (x) A(x) ! A(x) + @ (x) (1.137) then A (k) ! A (k) + ik (k) (1.138) 18

where

(x) =

Z

d4k eikx (k)

(1.139)

Since the gauge transformation amounts to a translation in b(k) by an arbitrary function of k, we can always to choose it equal to zero. Therefore we are left with the two degrees of freedom described by the amplitudes a (k),  = 1 2. Furthermore these amplitudes are dierent from zero only if the dispersion relation k2 = 0 is satised. This shows that the corresponding quanta will have zero mass. With the choice b(k) = 0, the eld A(k) becomes

A (k) = a (k)e (k)

(1.140)

showing that k A (k) = 0. Therefore the choice b(k) = 0 is equivalent to the choice of the Lorentz gauge. Let us consider now the quantization of this theory. Due to the lack of manifest covariance of the Coulomb gauge we will discuss here the quantization in the Lorentz gauge, where however we will encounter other problems. If we want to maintain the explicit covariance of the theory we have to require non trivial commutation relations for all the component of the eld. That is

with

A(~x t)  (~y t)] = ig 3(~x ; ~y)

(1.141)

A (~x t) A (~y t)] =  (~x t)  (~y t)] = 0

(1.142)

(1.143)  = @ L_ @ A To evaluate the conjugated momenta is better to write the lagrangian density (see eq. (1.123) in the following form L = ; 41 A ; A ]A ; A ] = ; 21 A A + 12 A A (1.144) Therefore @ L = ;A + A = F  (1.145) @A implying

It follows



 = @ L_ = F 0 @ A

0 = @ L_ = 0 @ A0 We see that it is impossible to satisfy the condition A0(~x t) 0(~y t)] = i3 (~x ; ~y) 19

(1.146) (1.147) (1.148)

We can try to nd a solution to this problem modifying the lagrangian density in such a way that 0 6= 0. But doing so we will not recover the Maxwell equation. However we can take advantage of the gauge symmetry, modifying the lagrangian density in such a way to recover the equations of motion in a particular gauge. For instance, in the Lorentz gauge we have

A (x) = 0

(1.149)

and this equation can be obtained by the lagrangian density L = ; 21 A A (1.150) (just think to the Klein-Gordon case). The minus sign is necessary to recover a positive hamiltonian density. We now express this lagrangian density in terms of the gauge invariant one, given in eq. (1.123). To this end we observe that the dierence between the two lagrangian densities is nothing but the second term of eq. (1.144)   1 A A = @  1 A A ; 1 (@  A )A   2   21  2 1  1     = @ 2 A A ; @ 2 (@ A)A + 2 (@  A )2 (1.151) Then, up to a four divergence, we can write the new lagrangian density in the form (1.152) L = ; 14 F F  ; 21 (@  A )2 One can check that this form gives the correct equations of motion. In fact from @ L = ;A + A ; g (@ A ) @L = 0 (1.153) @A @A we get 0 = ;A + @  (@  A ) ; @  (@ A ) = ;A (1.154) the term ; 12 (@  A )2 (1.155) which is not gauge invariant, is called the gauge xing term. More generally, we could add to the original lagrangian density a term of the form ; 2 (@  A )2 (1.156) The corresponding equations of motion would be

A ; (1 ; )@  (@ A ) = 0 20

(1.157)

In the following we will use  = 1. From eq. (1.153) we see that 0 = @ L_ = ;@  A @ A0

(1.158)

In the Lorentz gauge we nd again 0 = 0. To avoid the corresponding problem we can ask that @  A = 0 does not hold as an operator condition, but rather as a condition upon the physical states

hphysj@  A jphysi = 0

(1.159)

The price to pay to quantize the theory in a covariant way is to work in a Hilbert space much bigger than the physical one. The physical states span a subspace which is dened by the previous relation. A further bonus is that in this way one has to do with local commutation relations. On the contrary, in the Coulomb gauge, one needs to introduce non local commutation relations for the canonical variables. We will come back later to the condition (1.159). Since we don't have to worry any more about the operator condition 0 = 0, we can proceed with our program of canonical quantization. The canonical momentum densities are  = @ L_ = F 0 ; g0(@ A ) (1.160) @ A or, explicitly ~  A~ 0 = ;@ A = ;A_ 0 ; r i = @ i A0 ; @ 0 Ai = ;A_ i + @ i A0 (1.161) Since the spatial gradient of the eld commutes with the eld itself at equal time, the canonical commutator (1.141) gives rise to A (~x t) A_  (~y t)] = ;ig 3(~x ; ~y) (1.162) To get the quanta of the eld we look for plane wave solutions of the wave equation. We need four independent four vectors in order to expand the solutions in the momentum space. In a given frame, let us consider the unit four vector which denes the time axis. This must be a time-like vector, n2 = 1, and we will choose n0 > 0. For instance, n = (1 0 0 0). Then we take two four vectors ( ) ,  = 1 2,in the plane orthogonal to n and k . Notice that now k2 = 0,since we are considering solutions of the wave equation. Therefore

k ( ) = n( ) = 0

 = 1 2

(1.163)

The four vectors ( ) , being orthogonal to n are space-like, then they will be chosen orthogonal and normalized in the following way

( ) ( )  = ; 0

0

21

(1.164)

Next, we dene a unit space-like four vector, orthogonal to n and lying in the plane (k n) n (3) : = 0 (1.165) with (3)  = ;1 (3) (1.166)   ( ) By construction (3)  is orthogonal to  . This four vector is completely xed by the previous conditions, and we get k ; (n  k)n (3) (1.167)  = (n  k) As a last unit four vector we choose n (0) (1.168)  = n These four vectors are orthonormal ( ) ( ) = g (1.169) and linearly independent. Then they satisfy the completeness relation ( ) ( ) g = g (1.170) In the frame where n = (1 ~0) and k = (k 0 0 k), we have 0

0

0

0

(1)  = (0 1 0 0) (2)  = (0 0 1 0) The plane wave expansion of A is

(3)  = (0 0 0 1)

(1.171)

Z

3 h i 3k X d ( ) (k ) a (k)e;ikx + ay (k )eikx A (x) = p  (1.172) 2!k (2 )3 =0  where we have included the hermiticity condition for A(x). For any xed , this expansion is the same as the one that we wrote for the Klein-Gordon eld, with the substitution ( ) a (k) ! a(k). Then, from eq. (1.63)

( ) (k)a (k) = i

Z

d3x f~k(x)@t(;) A (x)

(1.173)

with the functions f~k (x) dened as in eq. (1.53). Using the orthogonality of the ( ) 's we nd Z a (k) = ig d3x ( ) (k)f~k (x)@t(;) A(x) (1.174) 0

0

0

and analogously

ay (k) = ig



0

Z

d3x ( ) (k)A (x)@t(;) f~k (x) 0

22

(1.175)

From these expressions we can evaluate the commutator Za (k) ay (k0 )] = h  _     0 3 3 ( ) ( ) 3 = d x d y ; f~k (x)f~k (y) ig g  (k) (k )g  (~x ; ~y) 0

00



000

i

00

0

0

000

; f_~k(x)f~k (y) ;ig g ( ) (k)( )  (k0 )g 3(~x ; ~y) = ;

Z

00

000

00

0

0

d3x f~k(x)i@t(;) f~k (x)g 0

000

(1.176)

0

and using the orthogonality relations for the functions f )~k a (k) ay (k0)] = ;g 3(~k ; k~0) (1.177) Analogously a (k) a (k0)] = ay (k) ay (k0)] = 0 (1.178) The commutation rules we have derived for the operators a (k) create some problem. Let us consider a one-particle state 0

0

0

j1 i = its norm is given by

h1 j1 i = =

0

Z

d3k f (k)ay (k)j0i

Z d3k d3k0 f ?(k)f (k0)h0ja (k)ay (k0)j0i Z d3k d3k0 f ?(k)f (k0)h0ja (k) ay (k0)j0i Z

= ;g

d3k jf (k)j2

(1.179)

(1.180)

Therefore the states with  = 0 have negative norm. This problem does not come out completely unexpected. In fact, our expectation is that only the transverse states ( = 1 2), are physical states. For the moment being we have ignored the gauge xing condition hphysj@  A jphysi = 0, but its meaning is that only part of the total Hilbert space is physical. Therefore the relevant thing is to show that the states satisfying the Lorentz condition have positive norm. To discuss the gauge xing condition, let us notice that formulated in the way we did, being bilinear in the states, it could destroy the linearity of the Hilbert space. So we will try to modify the condition in a linear one @  A jphys:i = 0 (1.181) But this would be a too strong requirement. Not even the vacuum state satises it. However, if we consider the positive and negative frequency parts of the eld Z d3k X 3 (+) A (x) = p ( ) (k)a (k)e;ikx  A(;) (x) = (A(+) (x))y (1.182) 3 2!k (2 ) =0 23

it is possible to weaken the condition, and require

@  A(+)  (x)jphys:i = 0

(1.183)

This allows us to satisfy automatically the original requirement  (;) hphys:j(@  A(+)  + @ A )jphys:i = 0

(1.184)

To make this condition more explicit let us evaluate the four divergence of A(+) 

i@  A(+)  (x) =

Z

3

X

k

=03

p2!d (2k )3 e;ikx

k( ) (k)a (k)

(1.185)

Using eq. (1.167), we get

k (3)  = ;(n  k) from which

k(0)  = (n  k)

a0(k) ; a3 (k)]jphys:i = 0

Notice that

a0(k) ; a3 (k) ay0(k0) ; ay3 (k0)] = ;3 (~k ; k~0) + 3 (~k ; k~0) = 0

(1.186) (1.187) (1.188)

Let us denote by #~k (n0 n3 ) the state with n0 scalar photons (that is with polarization (3) (0)  (k)), and with n3 longitudinal photons (that is with polarization  (k)). Then the following states satisfy the condition (1.183) #~(km) = 1 (ay0(k) ; ay3 (k))m#~k (0 0) (1.189) m! These states have vanishing norm

jj#~(km) jj2 = 0

(1.190)

More generally we can make the following observation. Let us consider the number operator for scalar and longitudinal photons

N=

Z

d3k (ay3 (k)a3 (k) ; ay0 (k)a0 (k))

(1.191)

Notice the minus sign that is a consequence of the commutation relations, and it ensures that N has positive eigenvalues. For instance

Nay (k)j0i = ; 0

Z

d3k0 ay0(k0 )a0(k0 ) ay0(k)]j0i = ay0 (k)j0i 24

(1.192)

Let us consider a physical state with a total number n of scalar and longitudinal photons. Then h'njN j'ni = 0 (1.193) since a0 and a3 act in the same way on a physical state (see eq. (1.187)). It follows

nh'nj'ni = 0

(1.194)

Therefore all the physical states with a total denite number of scalar and longitudinal photons have zero norm, except for the vacuum state (n = 0). Then

h'nj'ni = n0

(1.195)

A generic physical state with zero transverse photons is a linear superposition of the previous states X j'i = c0 j'0i + cij'ii (1.196) i6=0

This state has a positive denite norm

h'j'i = jc0j2  0

(1.197)

The proof that a physical state has a positive norm can be extended to the case in which also transverse photons are present. Of course, the coecients ci, appearing in the expression of a physical state, are completely arbitrary, but this is not going to modify the values of the observables. For instance, consider the hamiltonian, we have

H =

Z Z

d3x :  A_  ; L] :





1 F F  + 1 (@ A )2 : (1.198) = : 4  2 One can easily show the hamiltonian is given by the sum of all the degrees of freedom appearing in A

d3x

#  2 ~ Ai)2 ; A_ 20 ; r ~ A0 : d3 x : A_ 2i + (r =1 "iX # Z 3 y y 3

H = 12 =

F 0 A_  ; (@ A )A_ 0 +

"X 3 

Z

d k !k :

=1

a (k)a (k) ; a0 (k)a0 (k) :

(1.199)

Since on the physical states a0 and a3 act in the same way, we get

hphys:jH jphys:i = hphys:j

Z

d3 k !k 25

2 X

=1

ay (k)a (k)jphys:i

(1.200)

The generic physical state is of the form j'T i j'i. with j'i dened as in eq. (1.196). Since only j'T i, contributes to the evaluation of an observable quantity , we can always choose j'i proportional to j'0i. However, this does not mean that we are always working in the restricted physical space, because in a sum over the intermediate states we need to include all the degrees of freedom. This is crucial for the explicit covariance and locality of the theory. The arbitrariness in dening the state j'i has, in fact, a very simple interpretation. It corresponds to add to A a four gradient, that is it corresponds to perform a gauge transformation. Consider the following matrix element

h'jA(x)j'i =

X nm

c?ncm h'njA (x)j'mi

(1.201)

Since A change the occupation number by one unit and all the states j'ni have zero norm (except for the state with n = 0), the only non vanishing contributions come from n = 0, m = 1 and n = 1, m = 0

Z

h'jA(x)j'i = c?0c1 h0j

(0) p2!d (2k )3 e;ikx (3)  (k)a3 (k) +  (k)a0 (k)]j'1 i + c:c: 3

k

In order to satisfy the gauge condition the state j'1i must be of the form

d3q f (~q)ay3 (q) ; ay0 (q)]j0i

(1.203)

(0) ? ;ikx f (~k) + c:c:] p2!d (2k )3 (3)  (k) +  (k)]c0 c1 e

(1.204)

j'1i = and therefore

h'jA(x)j'i =

Z

Z

(1.202)

3

k

From eqs. (1.167) and (1.168) we have

from which with

(x) =

Z

k (0) (3)  +  = (k  n)

(1.205)

h'jA(x)j'i = @ (x)

(1.206)

p2!d (2k )3 n 1 k (ic?0c1e;ikx f (~k) + c:c:) k 3

(1.207)

It is important to notice that this gauge transformation leaves A in the Lorentz gauge, since  = 0 (1.208) 26

because the momentum k inside the integral satises k2 = 0. From the expansion in eq. (1.172) one gets immediately the expression for the photon propagator Z d4k e;ikx h0jT (A(x)A (y)j0i = ;ig (2 )4 k2 + i (1.209) By dening Z d4k (1.210) D(x) = (2 )4 e;ikx D(k) and

we get

1 D(k) = ; k2 + i

(1.211)

h0jT (A(x)A (y)j0i = ig D(x ; y)

(1.212)

1.5 Perturbation Theory 1.5.1 The Scattering Matrix

In order to describe a scattering process we will assign to the vector of state a condition at t = ;1 j#(;1)i  j#ii (1.213) where the state #i will be specied by assigning the set of incoming free particles in terms of eigenstates of momentum, spin and so on. For instance, in QED we will have to specify how many electrons, positrons and photons are in the initial state and we will have to specify their momenta, the spin projection of fermions and the polarization of the photons. The equations of motion will tell us how this state evolves with time and it will be possible to evaluate the state at t = +1, where, ideally, we will detect the nal states. In practice the preparation and the detection processes are made at some nite times. It follows that our ideal description will be correct only if these times are much bigger than the typical interaction time of the scattering process. Once we know #(+1), we are interested to evaluate the probability amplitude of detecting at t = +1 a given set of free particles specied by a vector state #f . The amplitude is

Sfi = h#f j#(+1)i

(1.214)

We will dene the S matrix as the operator that give us j#(+1)i once we know j#(;1)i j#(+1)i = S j#(;1)i (1.215) The amplitude Sfi is then Sfi = h#f jS j#ii (1.216) 27

Therefore Sfi is the S matrix element between free states (j#ii and h#f j are called in and out states respectively). In the interaction representation dened by 0 0 0 j#(t)i = eiHS t j#S (t)i O(t) = eiHS tOS (t)e;iHS t (1.217)

where the index S identies the Schrodinger representation and HS0 is the free hamiltonian, the state vectors satisfy the Schrodinger equation with the interaction hamiltonian (evaluated in the interaction picture) @ j#(t)i = H j#(t)i i @t (1.218) I whereas the operators evolve with the free hamiltonian. To evaluate the S matrix we rst transform the Schrodinger equation in the interaction representation in an integral equation

j#(t)i = j#(;1)i ; i

Zt

;1

dt1 H I (t1 )j#(t1 )i

(1.219)

One can verify that this is indeed a solution, and that it satises explicitly the boundary condition at t = ;1. The perturbative expansion consists in evaluating j#(t)i by iterating this integral equation. The result in terms of ordered T -products is Z +1 1 n Z +1 X ; I  ( ; i ) I dt dt (1.220) S =1+ 1 n T H (t1 )    H (tn ) n! ;1

n=1

;1

The T -product of n terms means that the factors have to be written from left to right with decreasing times. For instance, if t1  t2      tn, then ;  T H I (t1 )    H I (tn) = H I (t1 )    H I (tn) (1.221) This result can be written in a more compact form by introducing the T -ordered exponential Z +1  ;i dt H I (t) S = T e ;1 (1.222) This expression is a symbolic one and it is really dened by its series expansion. The motivation for introducing the T -ordered exponential is that it satises the following factorization property



Zt

O(t)dt



Zt

Zt

O(t)dt  O(t)dt T e t1 = T e t2 T e t1 (1.223) If there are no derivative interactions we have Z +1 Z I  ;i dt H (t)  +i d4x Lint  ;1 S=T e =T e (1.224) It follows that if the theory is Lorentz invariant, also the S matrix enjoys the same property. 3

3

28

2

1.5.2 Wick's theorem

The matrix elements of the S matrix between free particle states can be expressed as vacuum expectation values (VEV's) of T -products. These V EV 0s satisfy an important theorem due to Wick that states that the T -products of an arbitrary number of free elds (the ones we have to do in the interaction representation) can be expressed as combinations of T -products among two elds, that is in terms of Feynman propagators. The Wick's theorem is summarized in the following equation

Z  ;i d4x j (x)(x) T e Z ;i

=: e

d4x

Z 1 j (x)(x) ; 2 d4x d4y j (x)j (y)h0jT ((x)(y))j0i (1.225) :e

where (x) is a free real scalar eld and j (x) an ordinary real function. The previous formula can be easily extended to charged scalar, fermionic and photon elds. The Wick's theorem is then obtained by expanding both sides of this equation in powers of j (x) and taking the VEV of both sides. Let us now expand both sides of eq. (1.225) in a series of j (x) and compare term by term. We will use the simplied notation i  (xi). We get T () = :  : (1.226)

T (12) = : 1 2 : +h0jT (12)j0i T (123) = : 123 : + T (1234) =

3 X

i6=j 6=k=1

: 12 34 : +

: i : h0jT (j k )j0i

4 h X

i6=j 6=k6=l=1

i

(1.227) (1.228)

: ij : h0jT (kl )j0i

+ h0jT (ij )j0ih0jT (kl)j0i

(1.229)

and so on. By taking the VEV of these expression, and recalling that the VEV of a normal product is zero, we get the Wick's theorem. The T -product of two eld operators is sometimes called the contraction of the two operators. Therefore to evaluate the VEV of a T -product of an arbitrary number of free elds, it is enough to consider all the possible contractions of the elds appearing in the T -product. For instance, from the last of the previous relations we get

h0jT (12 34)j0i =

4 X

h0jT (ij )j0ih0jT (kl )j0i

i6=j 6=k6=l=1

29

(1.230)

An analogous theorem holds for the photon eld. For the fermions one has to remember that the T -product is dened in a slightly dierent way. This gives a minus sign any time we have a permutation of the fermion elds which is odd with respect to the original ordering. As an illustration the previous formula becomes

h0jT (12 3 4 )j0i =

4 X

i6=j 6=k6=l=1

P h0jT (ij )j0ih0jT (kl )j0i

(1.231)

where P = 1 is the sign of the permutation (i j k l) with respect to the fundamental one (1 2 3 4) appearing on the right hand side. More explicitly

h0jT (12 3 4)j0i = h0jT (12 )j0ih0jT (34)j0i ; h0jT (13 )j0ih0jT (24)j0i + h0jT (14 )j0ih0jT (23)j0i

(1.232)

1.5.3 Feynman diagrams in momentum space

We will discuss here the Feynman rules for the scalar and spinor QED. Let us recall that the electromagnetic interaction is introduced via the minimal substitution

@ ! @ + ieA

(1.233)

For instance, for the charged scalar eld we get Lfree = @ y@   ; m2y ; 14 F F  ! ! (@ + ieA ))]y (@  + ieA ) ] ; m2y ; 41 F F  The interaction part is given by





Lint: = ;ie y@  ; (@ y) A + e2A2 y

(1.234) (1.235)

Therefore the electromagnetic eld is coupled to the current





j = ie y@  ; (@ y)

(1.236)

but another interaction term appears (the seagull term). In the spinor case we have a simpler situation Lfree = !(i@^ ; m) ; 14 F F  ! !(i@^ ; eA^ ; m) ; 14 F F  (1.237) giving the interaction term ! A Lint = ;e (1.238) 30

In a typical experiment in particle physics we prepare beams of particles with definite momentum, polarization, etc. At the same time we measure momenta and polarizations of the nal states. For this reason it is convenient to work in a momentum representation. The result for a generic matrix element of the S matrix can be expressed in the following form

X

hf jS jii = (2 )44 (

in

pin ;

X out

pout )

Y r

ext: bosons

1 2EV

Y rm VE M

ext: fermions

(1.239)

Here f 6= i, and we are using the box normalization. The Feynman amplitude M (often one denes a matrix T wich diers from M by a factor i) can be obtained by drawing at a given perturbative order all the connected and topologically inequivalent Feynman's diagrams. The diagrams are obtained by combining together the graphics elements given in Figures 1.5.1 and 1.5.2. The amplitude M is given by the sum of the P amplitudes associated to these diagrams, after extraction of the factor (2 )44 ( p) in (1.239). . Fermion line

Photon line

Scalar line

Fig. 1.5.1 - Graphical representation for both internal and external particle lines. . QED vertex

Scalar QED seagull

Fig. 1.5.2 - Graphical representation for the vertices in scalar and spinor QED. 31

At each diagram we associate an amplitude obtained by multiplying together the following factors

 for each ingoing and/or outgoing external photon line a factor   (or its com     

plex conjugate for outgoing photons if we consider complex polarizations as the circular one) for each ingoing and/or outgoing external fermionic line a factor u(p) and/or u!(p) for each ingoing and/or outgoing external antifermionic line a factor v!(p) and/or v(p) for each internal scalar line a factor iF (p) (propagator) for each internal photon line a factor ig D(p) (propagator) for each internal fermion line a factor iSF (p) (propagator) P P for each vertex in scalar QED a factor ;ie(p + p0)(2 )44( in pi ; out pf ), where p and p0 are the momenta of the scalar particle (taking p ingoing and p0 outgoing) P P For each seagull term in scalar QED a factor 2ie2 g (2 )44( in pi ; out pf ) P P for each vertex in spinor QED a factor ;ie (2 )44 ( ingoing pi ; outgoing pf )

   Integrate over all the internal momenta with measure d4p=(2 )4. This gives

rise automatically to the conservation of the total four-momentum.  In a graph containing closed lines (loops), a factor (;1) for each fermionic loop (see Fig. 1.5.3)

. Fermion loop

Fig. 1.5.3 - A fermion loop can appear inside any Feynman diagram (attaching the rest of the diagram by photonic lines).

In the following we will use need also the rules for the interaction with a static external e.m. eld. The correspending graphical element is given in Fig. 1.5.4. The 32

rule is to treat it as a normal photon, except that one needs to substitute the wave function of the photon  1 1=2 ( ) (q) (1.240) 2EV with the Fourier transform of the external eld Z ext A (~q ) = d3qe;i~q  ~x Aext (1.241)  (~x ) where ~q is the momentum in transfer at the vertex where the external eld line is attached to. Also, sincePthe staticPexternal eld breaks translational invariance, we loose the factor (2 )3( in ~pin ; out p~out ). .

Fig. 1.5.4 - Graphical representation for an external electromagnetic eld. The upper end of the line can be attached to any fermion line.

1.5.4 The cross-section

Let us consider a scattering process with a set of initial particles with four momenta pi = (Ei ~pi) which collide and produce a set of nal particles with four momenta pf = (Ef  ~pf ). From the rules of the previous Chapter we know that each external photon line contributes with a factor (1=2V E )1=2, whereas each external fermionic line contributes with (m=V E )1=2 .PFurthermore the conservation of the total four P 4 4 momentum gives a term (2 )  ( i pi ; f pf ). If we exclude the uninteresting case f = i we can write

Sfi

= (2 )44

X i

X ! Y  m 1=2 Y  pi ; pf f

fermioni

VE

bosoni

1 2V E

1=2

M

(1.242)

where M is the Feynman amplitude which can be evaluated by using the rules of the previous Section. Let us consider the typical case of a two particle collision giving rise to an N particles nal state. Therefore, the probability per unit time of the transition is given by w = jSfij2 with Y Y Y w = V (2 )44(Pf ; Pi) 2V1E 2V1E (2m)jMj2 (1.243) i f i f fermioni 33

where, for reasons of convenience, we have written m = (2m) 1 (1.244) VE 2V E w gives the probability per unit time of a transition toward a state with well dened quantum numbers, but we are rather interested to the nal states having momenta between ~pf and ~pf + d~pf . Since in the volume V the momentum is given by ~p = 2 ~n=L, the number of nal states is given by

 L 3

3 (1.245) 2 d p The cross-section is dened as the probability per unit time divided by the &ux of the ingoing particles, and has the dimensions of a length to the square

t;1 `2t] = `2]

(1.246)

In fact the &ux is dened as vrel = vrel=V , since in our normalization we have a particle in the volume V , and vrel is the relative velocity of the ingoing particles. For the bosons this follows from the normalization condition of the functions f~k (x). For the fermions recall that in the box normalization the wave function is rm ;ipx (1.247) EV u(p)e from which Z Z 3 (1.248) d x (x) = d3x VmE uy(p)u(p) = 1 V

V

Then the cross-section for getting the nal states with momenta between ~pf e p~f +d~pf is given by Y V d3pf d = w vV dNF = w vV (1.249) (2 )3 rel

rel f

We obtain

Y V d3 pf 1 Y 1 Y (2m)jMj2 4  4 (P ; P ) V (2

) d = vV f i 4V 2 E1E2 f 2V Ef fermioni rel f (2 )3 Y Y 3 = (2 )44(Pf ; Pi) 4E E1 v (2m) (2 d)3p2fE jMj2 (1.250) 1 2 rel fermioni f f

Notice that the dependence on the quantization volume V disappears, as it should be, in the nal equation. Furthermore, the total cross-section, which is obtained by integrating the previous expression over all the nal momenta, is Lorentz invariant. 34

In fact, as it follows from the Feynman rules, M is invariant, as the factors d3p=2E . Furthermore ~vrel = ~v1 ; ~v2 = E~p1 ; E~p2 (1.251) 1 2 but in the frame where the particle 2 is at rest (laboratory frame) we have p2 = (m2  ~0) and ~vrel = ~v1, from which

q 2 2 j p ~ 1j E1 E2j~vrelj = E1 m2 E = m2jp~1j = m2 E1 ; m1 =

q

1 m22 E12 ; m21 m22

We see that also this factor is Lorentz invariant.

35

q

= (p1  p2)2 ; m21 m22

(1.252)

Chapter 2 One-loop renormalization 2.1 Divergences of the Feynman integrals Let us consider the Coulomb scattering. If we expand the S matrix up to the third order in the electric charge, and we assume that the external eld is weak enough such to take only the rst order, we can easily see that the relevant Feynman diagrams are the ones of Fig. 2.1.1 . . p

p’

p

p’

= ’

p −p



p −p p

p p−k



p p’−k p’

k

k ’



p −p

p −p p

a) p

b)

p’ p−κ k

c)

k+q

p’ k ’

p −k ’

p −p



p −p

d)

Fig. 2.1.1 - The Feynman diagram for the Coulomb scattering at the third order in the electric charge and at the rst order in the external eld.

36

The Coulomb scattering can be used, in principle, to dene the physical electric charge of the electron. This is done assuming that the amplitude is linear in ephys, from which we get an expansion of the type ephys = e + a2 e3 +    = e(1 + a2 e2 +   ) (2.1) in terms of the parameter e which appears in the original lagrangian. The rst problem we encounter is that we would like to have the results of our calculation in terms of measured quantities as ephys . This could be done by inverting the previous expansion, but, and here comes the second problem, the coecient of the expansion are divergent quantities. To show this, consider, for instance the self-energy contribution to one of the external photons (as the one in Fig. 2.1.1a). We have 2

i 0 (2.2) Ma = u!(p0)(;ie  )Aext  (p~ ; p~) p^ ; m + i ie (p) u(p) where Z d4k ;ig i 2 ie (p) = ( ; ie )  2 4 (2 ) k + i p^ ; k^ ; m + i (;ie ) Z d4k 1 1 2   (2.3) = ;e  2 4 (2 ) k + i p^ ; k^ ; m + i or Z d4k p^ ; k^ + m   1 (p) = i  (2.4)  (2 )4 (p ; k)2 ; m2 + i k2 + i For large momentum, k, the integrand behaves as 1=k3 and the integral diverges linearly. Analogously one can check that all the other third order contributions diverge. Let us write explicitly the amplitudes for the other diagrams 0 (2.5) Mb = u!(p0)ie2 (p0 ) p^ 0 ; mi + i (;ie  )Aext  (p~ ; p~)u(p) 0 0 Mc = u!(p0)(;ie  ) q;2 ig+i ie2 (q)Aext (2.6)  (p~ ; ~p)u(p) q = p ; p where  Z d4k  i i   2  (;ie ) ^ (;ie ) (2.7) ie  (q) = (;1) (2 )4 Tr ^ k + q^ ; m + i k ; m + i (the minus sign originates from the fermion loop) and therefore  Z d4k  1 1     (q) = i (2 )4 Tr ^   (2.8) k + q^ ; m k^ ; m + i The last contribution is 0 Md = u!(p0)(;ie)e2 (p0 p)u(p)Aext (2.9)  (p~ ; ~p) 37

where

e2 (p0 p) =

Z d4k i i )   ) ;ig ( ; ie  ( ; ie 4 (2 ) k2 + i p^ 0 ; k^ ; m + i p^ ; k^ ; m + i

or (p0 p) = ;i

Z d4k 1 1 1       4 2 (2 ) p^ 0 ; k^ ; m + i p^ ; k^ ; m + i k + i

(2.10) (2.11)

The problem of the divergences is a serious one and in order to give some sense to the theory we have to dene a way to dene our integrals. This is what is called the regularization procedure of the Feynman integrals. That is we give a prescription in order to make the integrals nite. This can be done in various ways, as introducing an ultraviolet cut-o, or, as we shall see later by the more convenient means of dimensional regularization. However, we want that the theory does not depend on the way in which we dene the integrals, otherwise we would have to look for some physical meaning of the regularization procedure we choose. This bring us to the other problem, the inversion of eq. (2.1). Since now the coecients are nite we can indeed perform the inversion and obtaining e as a function of ephys and obtain all the observables in terms of the physical electric charge (that is the one measured in the Coulomb scattering). By doing so, a priori we will introduce in the observables a dependence on the renormalization procedure. We will say that the theory is renormalizable when this dependence cancels out. Thinking to the regularization in terms of a cut-o this means that considering the observable quantities in terms of ephys , and removing the cut-o (that is by taking the limit for the cut-o going to the innity), the result should be nite. Of course, this cancellation is not obvious at all, and in fact in most of the theories this does not happen. However there is a restrict class of renormalizable theories, as for instance QED. We will not discuss the renormalization at all order and neither we will prove which criteria a theory should satisfy in order to be renormalizable. We will give these criteria without a proof but we will try only to justify them On a physical basis. As far QED is concerned we will study in detail the renormalization at one-loop.

2.2 Higher order corrections In this section we will illustrate, in a simplied context, the previous considerations. Consider again the Coulomb scattering. At the lowest order the Feynman amplitude in presence of an external static electromagnetic eld is simply given by the following expression (see Fig. 2.1.1), (f 6= i) Sfi = (2 )(Ef ; Ei) VmE M(1) (2.12) where 0 M(1) = u!(p0 )(;ie  )u(p)Aext (2.13)  (p~ ; p~ ) 38

and

Aext  (~q) =

Z

;i~q  ~x d3x Aext  (~x)e

(2.14)

is the Fourier transform of the static e.m. eld. In the case of a point-like source, of charge Ze we have Ze Aext (2.15)  (~x) = (; 4 j~xj  ~0)

For the following it is more convenient to re-express the previous amplitude in terms of the external current generating the external e.m. eld. Since we have

A = j we can invert this equation in momentum space, by getting A (q) = ; q2g+ i j  (q) from which  ; ig (1) 0 M = u!(p )(;ie )u(p) q2 + i (;ijext (p~ 0 ; p~))

with

(2.16) (2.17) (2.18)

jext (~x) = (Ze3(~x) ~0) (2.19) From these expressions one can get easily the dierential cross-section d=d'. This quantity could be measured and then we could compare the result with the theoretical result obtained from the previous expression. In this way we get a measure of the electric charge. However the result is tied to a theoretical equation evaluated at the lowest order in e. In order to get a better determination one has to evaluate the higher order corrections, and again extract the value of e from the data. The lowest order result is at the order e2 in the electric charge. the next-to-leading contributions ar at the order e4 . For instance, the diagram in Fig. 2.1.1c gives one of these contribution. This is usually referred to as the vacuum polarization contribution and in this Section we will consider only this correction, just to exemplify how the renormalization procedure works. This contribution is given in eqs. (2.6) and (2.7). The total result is  ;ig ;ig   ; ig 2 (1) 0 M + Mc = u!(p )(;ie )u(p) q2 + i + q2 + i (ie  ) q2 + i (;ijxt (~q)) (2.20) The sum gives a result very similar the to one obtained from M(1) alone, but with the photon propagator modied in the following way ;ig ! ;ig + ;ig (ie2  (q2)) ;ig  = ;ig + ;ie2  (2.21)  q2 q2 q2 q2 q2 q4 39

As discussed in the previous Section  is a divergent quantity. In fact, at high values of the momentum it behaves as Z d4 p (2.22) p2 and therefore it diverges in a quadratic way. This follows by evaluating the integral by cutting the upper limit by a cut-o . As we shall see later on, the real divergence is weaker due to the gauge invariance. We will see that the correct behaviour is given by Z d4 p (2.23) p4 The corresponding divergence is only logarithmic, but in any case the integral is divergent. By evaluating it with a cut-o we get  (q2) = ;g q2I (q2) +   

(2.24)

The terms omitted are proportional to q q , and since the photon is always attached to a conserved current, they do not contribute to the nal result. The functions I (q2) can be evaluated by various tricks (we will evaluate it explicitly in the following) and the result is  q2z(1 ; z)  Z1 2  1 1 2 dz z(1 ; z) log 1 ; m2 (2.25) I (q ) = 12 2 log m2 ; 2 2 0 For small values of q2 (;q2 << m2 ) one can expand the logarithm obtaining 2 + 1 q2 I (q2)  121 2 log m 2 60 2 m2

(2.26)

;ig  ! ;ig 1 ; e2I (q2)]

(2.27)

Summarizing the propagator undergoes the following modication

q2 q2 This relation is shown in graphical terms in Fig. 2.2.1. . +

Fig. 2.2.1 - The vacuum polarization correction to the propagator. The total result is then

M(1) + M

c

= u!(p0)(;ie

 e2 2 e2 q2  ; i log ; (;iZe) (2.28) 0 )u(p) 2 1 ; q 12 2 m2 60 2 m2 40

in the limit q ! 0 the result looks exactly as at the lowest order, except that we need to introduce a renormalized charge eR given by



2 2 eR = e 1 ; 12e 2 log m 2

1=2

(2.29)

In term of eR and taking into account that we are working at the order e4 we can write  e2 q2  ; i (1) M + Mc = u!f (ieR 0)ui q2 1 ; 60R 2 m2 (;iZeR ) (2.30) Now we compare this result with the experiment where we measure d=d' in the limit q ! 0. We can extract again from the data the value of the electric charge, but this time we will measure eR . Notice that in this expression there are no more divergent quantities, since they have been eliminated by the denition of the renormalized charge. It must be stressed that the quantity that is xed by the experiments is not the "parameter" e appearing in the original lagrangian, but rather eR. This means that the quantity e is not really dened unless we specify how to relate it to measured quantities. This relation is usually referred to as the renormalization condition. As we have noticed the nal expression we got does not contain any explicit divergent quantity. Furthermore, also eR , which is obtained from the experimental data, is a nite quantity. It follows that the parameter e, must diverge for  ! 1. In other words, the divergence which we have obtained in the amplitude, is compensated by the divergence implicit in e. Therefore the procedure outlined here has meaning only if we start with an innite charge. Said that, the renormalization procedure is a way of reorganizing the terms of the perturbative expansion in such a way that the parameter controlling it is the renormalized charge eR . If this is possible, the various terms in the expansion turn out to be nite and the theory is said to be renormalizable. As already stressed this is not automatically satised. In fact renormalizability is not a generic feature of the relativistic quantum eld theories. Rather the class of renormalizable theories (that is the theories for which the previous procedure is meaningful) is quite restricted. We have seen that the measured electric charge is dierent from the quantity appearing in the expression of the vertex. It will be convenient in the following to call this quantity eB (tha bare charge). On the contrary, the renormalized charge will be called e, rather than eR as done previously. For the moment being we have included only the vacuum polarization corrections, however if we restrict our problem to the charge renormalization, the divergent part coming from the self-energy and vertex corrections cancel out (see later). Let us consider now a process like e; ; ! e; ;. Neglecting other corrections the total amplitude is given by Fig. 2.2.2. We will dene the charge by choosing an arbitrary value of the square of the momentum in transfer q , such that Q2 = ;q2 = 2. In this way the renormalized charge depends on the scale . In the previous example we were using q ! 0. The graphs in Fig. 2.2.2 sum up to give (omitting the contribution of the fermionic 41

. e− eB

e−

e− eB

q

eB

eB

eB

eB

eB

eB

µ−

+

+ eB

+

eB

eB

µ−

eB

µ−

Fig. 2.2.2 - The vacuum polarization contributions to the e;; ! e;; scattering. lines,)

e2B

h ;ig ;ig 2 ;ig  ;ig 2 ;ig  2 ;ig i + (ie  ) + (ie  ) (ie  ) +   q2

q2

B

q2

B

q2

q2

B

q2

Q2 =2

(2.31) By considering only the g contribution to  , we get

 1 ; e2B I (q2) + e4B I 2(q2) +    Q2=2 (2.32) e2B ;ig 2 q This expression looks as the lowest order contribution to the propagator be dening the renormalized electric charge as

e2 = e2B 1 ; e2B I (2) + e4B I 2(2) +   ]

(2.33)

Since the fermionic contribution is the same for all the diagrams, by calling it F (Q2), we get the amplitude in the form

M(eB ) = e2B F (Q2)1 ; e2B I (Q2 ) + e4B I 2(Q2 )   ]

(2.34)

where e2B F (Q2) is the amplitude at the lowest order in e2B . As we know I (Q2) is a divergent quantity and therefore the amplitude is not dened. However, following the discussion of the previous Section we can try to re-express everything in terms of the renormalized charge (2.33). We can easily invert by series the eq. (2.33) obtaining at the order e4

e2B = e21 + e2 I (2) +   ] 42

(2.35)

By substituting this expression in M(e0) we get

M(eB ) = e2 1 + e2I (2) +   ]F (Q2 )1 ; e2 I (Q2 ) +   ] = = e2 F (Q2)1 ; e2 (I (Q2) ; I (2)) +   ]  MR (e) From the expression (2.25) for I (Q2) we get



(2.36)



1 log 2 ; 1 Z 1 dz z(1 ; z) log 1 + Q2 z(1 ; z) (2.37) 12 2 m2 2 2 0 m2 Since the divergent part is momentum independent it cancels out in the expression for MR(e). Furthermore, the renormalized charge e will be measured through the cross-section, and therefore, by denition, it will be nite. Therefore, the charge renormalization makes the amplitude nite. Evaluating I (Q2) ; I (2) per large Q2 (and 2) we get

I (Q2) =

e2 (I (Q2) ; I (2))

and in this limit





Z1 2 2 2 = ; dz z (1 ; z) log m 2 + Q2 z(1 ; z) !

0 m +  z(1 ; z) Z 1 2 2 ! ; 2 dz z(1 ; z) log Q2 = ; 3 log Q2 (2.38) 0 



2 MR 1 ; 3 log Q2 +    (2.39) At rst sight MR(e) depends on , but actually this is not the case. In fact, e is dened through the eq. (2.33) and therefore it depends also on . What is going on is that the implicit dependecnce of e on  cancels out the explicit dependence of MR (e). This follows from MR(e) = M(eB ) and on the observation that M(eB ) does not depend on . This can be formalized in a way that amounts to introduce the concept of renormalization group that we will discuss in much more detail later on. In fact, let us be more explicit by writing

(e) = e2F (Q2 )

M(eB ) = MR (e() )

(2.40)

By dierentiating this expression with respect to  we get dM(eB ) = @ MR + @ MR @e = 0 (2.41) d @ @e @ This equation say formally that the amplitude expressed as a function of the renormalized charge is indeed independent on . This equation is the prototype of the renormalization group equations, and tells us how we have to change the coupling, when we change the denition scale . Going back to the original expression for M(e0), we see that this is given by the tree amplitude e2B F (Q2) times a geometric series arising from the iteration of 43

the vacuum polarization diagram. Therefore, it comes natural to dene a coupling running with the momentum of the photon. 2 e2 (Q2 ) = e2B 1 ; e2B I (Q2 ) + e4B I 2(Q2 ) +   ] = 1 + ee2BI (Q2) (2.42) B Also this expression is cooked up in terms of the divergent quantities eB and I (Q2), but again e2(Q2 ) is nite. This can be shown explicitly by evaluating the renormalized charge, e(2), in terms of the bare one 2 e2 (2) = 1 + ee2BI (2)

B

We can put the last two equations in the form 1 = 1 + I (2) 1 = 1 + I (Q2 ) 2 2 2 2 e (Q ) eB e (2) e2B and subtracting them 1 ; 1 = I (Q2 ) ; I (2) 2 e (Q2 ) e2 (2) we get 2 2 e2(Q2 ) = 1 + e2 (2)eI((Q2)) ; I (2)]

(2.43)

(2.44)

(2.45) (2.46)

By using this result we can write MR(e) as

MR(e) = e2 (Q2 )F (Q2)

(2.47)

It follows that the true expansion parameter in the perturbative series is the running coupling e(Q2 ). By using eq. (2.38) we obtain

(2) (2.48) 2) 2 (  Q 1; log 2 3

 We see that (Q2) increases with Q2, therefore the perturbative approach looses validity for those values Q2 of such that (Q2 )  1, that is when 2 2 (2.49) 1 ; ( ) log Q2 = (2) 3

 or  1  Q2 = e3 (2) ; 1 (2.50) 2 (Q2 ) =

44

For (2)  1=137 we get Actually (Q2) is divergent for

Q2  e1281  10556 2

(2.51)

3

Q2 = e (2)  e1291  10560 (2.52) 2 This value of Q2 is referred to as the Landau pole. Therefore in the case of QED the perturbative approach maintains its validity up to gigantic values of the momenta. This discussion shows clearly that the behaviour of the running coupling constant is determined by I (Q2), that is from the vaccum polarization contribution. In particular, (Q2 ) increases with Q2 since in the expression e2 (2)(I (Q2) ; I (2)) the coecient of the logarithm is negative (equal to ; (2)=(3 )). We will see that in the so called non-abelian gauge theories the running is opposite, that is the theory becomes more and more perturbative, by increasing the energy.

2.3 The analysis by counterterms The previous way of dening a renormalizable theory amounts to say that the original parameters in the lagrangian, as e, should be innite and that their divergences should compensate the divergences of the Feynman diagrams. Then one can try to separate the innite from the nite part of the parameters (this separation is ambiguous, see later). The innite contributions are called counterterms, and by denition they have the same operator structure of the original terms in the lagrangian. On the other hand, the procedure of regularization can be performed by adding to the original lagrangian counterterms cooked in such a way that their contribution kills the divergent part of the Feynman integrals. This means that the coecients of these counterterms have to be innite. However they can also be regularized in the same way as the other integrals. We see that the theory will be renormalizable if the counterterms we add to make the theory nite have the same structure of the original terms in the lagrangian, in fact, if this is the case, they can be absorbed in the original parameters, which however are arbitrary, because they have to be xed by the experiments (renormalization conditions). We will show now that at one-loop the divergent contributions come only from the three diagrams discussed before. In fact, let us dene D as the supercial degree of divergence of a one-loop diagram the dierence between 4 (the number of integration over the mimentum) and the number of powers of momentum coming from the propagators (we assume here that no powers of momenta are coming from the vertices, which is not true in scalar QED). We get D = 4 ; IF ; 2IB (2.53) 45

where IF and IB are the number of internal fermionic and bosonic lines. Going through the loop one has to have a number of vertices, V equal to the number of internal lines V = IF + IB (2.54) Also, if at each vertex there are Ni lines of the particle of type i, then we have

NiV = 2Ii + Ei from which

(2.55)

2V = 2IF + EF  V = 2IB + EB (2.56) From these equations we can get V , IF and IB in terms of EF and EB (2.57) IF = 21 EF + EB  IB = 21 EF  V = EF + EB from which D = 4 ; 32 EF ; EB (2.58) It follows that the only one-loop supercially divergent diagrams in spinor QED are the ones in Fig. 2.3.1. In fact, all the diagrams with an odd number of external photons vanish. This is trivial at one-loop, since the trace of an odd number of  matrices is zero. In general (Furry's theorem) it follows from the conservation of charge conjugation, since the photon is an eigenstate of C with eigenvalue equal to -1. Furthermore, the eective degree of divergence can be less than the supercial degree. In fact, gauge invariance often lowers the divergence. For instance the diagram of Fig. 2.3.1 with four external photons (light-light scattering) has D = 0 (logarithmic divergence), but gauge invariance makes it nite. Also the vacuum polarization diagram has an eective degree of divergence equal to 2. Therefore, there are only three divergent one-loop diagrams and all the divergences can be brought back to the three functions (p),  (q) e  (p0 p). This does not mean that an arbitrary diagram is not divergence, but it can be made nite if the previous functions are such. In such a case one has only to show that eliminating these three divergences (primitive divergences) the theory is automatically nite. In particular we will show that the divergent part of (p) can be absorbed into the denition of the mass of the electron and a redenition of the electron eld (wave function renormalization). The divergence in  , the photon self-energy, can be absorbed in the wave function renormalization of the photon (the mass of the photon is not renormalized due to the gauge invariance). And nally the divergence of  (p0 p) goes into the denition of the parameter e. To realize this program we divide up the lagrangian density in two parts, one written in terms of the physical parameters, the other will contain the counterterms. We will call also the original parameters and elds of the theory the bare parameters and the bare elds and we will use an index B in order to distinguish them from the physical quantities. Therefore the 46

. EF

EB

0

2

2

0

4

0

2

0

1

2

1

0

D

Fig. 2.3.1 - The supercially divergent one-loop diagrams in spinor QED two pieces of the lagrangian should look like as follows: the piece in terms of the physical parameters Lp ! A ; 1 F F  ; 1 (@ A)2 Lp = !(i@^ ; m) ; e (2.59) 4 2 and the counter terms piece Lc:t: ! ; C F ; E (@ A )2 ; eD!A ^ Lc:t: = iB !@^ ; A (2.60) 4 2 and we have to require that la sum of these two contributions should coincide with the original lagrangian written in terms of the bare quantities. Adding together Lp and Lc:t: we get ! ; e(1 + D) ! A ; 1 + C F F  + gauge ; xing L = (1 + B )i!@^ ; (m + A) 4 (2.61) where, for sake of simplicity, we have omitted the gauge xing term. Dening the renormalization constant of the elds Z1 = (1 + D) Z2 = (1 + B ) Z3 = (1 + C ) (2.62) we write the bare elds as B = Z21=2  AB = Z31=2 A (2.63) 47

we obtain L = i!B @^B ; mZ+ A !B B ; eZ11=2 !B  B AB ; 14 FB FB + gauge ; xing 2 Z2Z3 (2.64) and putting mB = mZ+ A  eB = eZ11=2 (2.65) 2 Z2 Z3 we get L = i!B @^B ; mB !B B ; eB !B  B AB ; 14 FB FB + gauge ; xing (2.66) So we have succeeded in the wanted separation. Notice that the division of the parameters in physical and counter term part is well dened, because the nite piece is xed to be an observable quantity. This requirement gives the renormalization conditions. The counter terms A, B , ... are determined recursively at each perturbative order in such a way to eliminate the divergent parts and to respect the renormalization conditions. We will see later how this works in practice at one-loop level. Another observation is that Z1 and Z2 have to do with the self-energy of the electron, and as such they depend on the electron mass. Therefore if we consider the theory for a dierent particle, as the muon, which has the same interactions as the electron and diers only for the value of the mass (m  200me), one would get a dierent bare electric charge for the two particles. Or, phrased in a dierent way, one would have to tune the bare electric charge at dierent values in order to get the same physical charge. This looks very unnatural, but the gauge invariance of the theory implies that at all the perturbative orders Z1 = Z2. As a consequence eB = e=Z31=2 , and since Z3 comes from the photon self-energy, the relation between the bare and the physical electric charge is universal (that is it does not depend on the kind of charged particle under consideration). Summarizing, one starts dividing the bare lagrangian in two pieces. Then we regularize the theory giving some prescription to get nite Feynman integrals. The part containing the counter terms is determined, order by order, by requiring that the divergences of the Feynman integrals, which come about when removing the regularization, are cancelled out by the counter term contributions. Since the separation of an innite quantity into an innite plus a nite term is not well dened, we use the renormalization conditions, to x the nite part. After evaluating a physical quantity we remove the regularization. Notice that although the counter terms are divergent quantity when we remove the cut o, we will order them according to the power of the coupling in which we are doing the perturbative calculation. That is we have a double limit, one in the coupling and the other in some parameter (regulator) which denes the regularization. The order of the limit is rst to work at some order in the coupling, at xed regulator, and then remove the regularization. Before going into the calculations for QED we want to illustrate some general results about the renormalization. If one considers only theory involving scalar, 48

fermion and massless spin 1 (as the photon) elds, it is not dicult to construct an algorithm which allows to evaluate the ultraviolet (that is for large momenta) divergence of any Feynman diagram. In the case of the electron self-energy (see Figs. 2.1.1a and 2.1.1b) one has an integration over the four momentum p and a behaviour of the integrand, coming from the propagators, as 1=p3, giving a linear divergence (it turns out that the divergence is only logarithmic). From this counting one can see that only the lagrangian densities containing monomials in the elds with mass dimension smaller or equal to the number of space-time dimensions have a nite number of divergent diagrams. It turns out also that these are renormalizable theories ( a part some small technicalities). The mass dimensions of the elds can be easily evaluated from the observation that the action is dimensionless in our units (/h = 1). Therefore, in n space-time dimensions, the lagrangian density, dened as

Z

dnx L

(2.67)

has a mass dimension n. Looking at the kinetic terms of the bosonic elds (two derivatives) and of the fermionic elds (one derivative), we see that (2.68) dim] = dimA ] = n ; 1 dim] = n ; 1 2 2 In particular, in 4 dimensions the bosonic elds have dimension 1 and the fermionic 3=2. Then, we see that QED is renormalizable, since all the terms in the lagrangian density have dimensions smaller or equal to 4 ! ] = 3 dim !  A] = 4 dim(@  A )2] = 4 dim (2.69) The condition on the dimensions of the operators appearing in the lagrangian can be translated into a condition over the coupling constants. In fact each monomial Oi will appear multiplied by a coupling gi

L= therefore The renormalizability requires from which

X i

giOi

(2.70)

dimgi] = 4 ; dimOi ]

(2.71)

dimOi]  4

(2.72)

dimgi]  0 (2.73) that is the couplings must have positive dimension in mass are to be dimensionless. In QED the only couplings are the mass of the electron and the electric charge 49

which is dimensionless. As a further example consider a single scalar eld. The most general renormalizable lagrangian density is characterized by two parameters L = 21 @ @   ; 21 m2 2 ; 3 ; 4 (2.74) Here has dimension 1 and  is dimensionless. We see that the linear -models are renormalizable theories. Giving these facts let us try to understand what makes renormalizable and non renormalizable theories dierent. In the renormalizable case, if we have written the most general lagrangian, the only divergent diagrams which appear are the ones corresponding to the processes described by the operators appearing in the lagrangian. Therefore adding to L the counter term

Lc:t: =

X i

giOi

(2.75)

we can choose the gi in such a way to cancel, order by order, the divergences. The theory depends on a nite number of arbitrary parameters equal to the number of parameters gi. Therefore the theory is a predictive one. In the non renormalizable case, the number of divergent diagrams increase with the perturbative order. At each order we have to introduce new counter terms having an operator structure dierent from the original one. At the end the theory will depend on an innite number of arbitrary parameters. As an example consider a fermionic theory with an ! )2. Since this term has dimension 6, the relative coupling interaction of the type ( has dimension -2 ! )2 Lint = ;g2( (2.76) At one loop the theory gives rise to the divergent diagrams of Fig. 2.3.2. The divergence of the rst diagram can be absorbed into a counter term of the original type ! )2 ;g2 ( (2.77) The other two need counter terms of the type ! )3 + g4( ! )4 g3 ( (2.78) These counter terms originate new one-loop divergent diagrams, as for instance the ones in Fig. 2.3.3. The rst diagram modies the already introduced counter term ! )4 , but the second one needs a new counter term ( ! )5 g5( (2.79) This process never ends. The renormalization requirement restricts in a fantastic way the possible eld theories. However one could think that this requirement is too technical and one could imagine other ways of giving a meaning to lagrangians which do not satisfy this 50

.

a)

b)

c)

! )2. Fig. 2.3.2 - Divergent diagrams coming from the interaction ( condition. But suppose we try to give a meaning to a non renormalizable lagrangian, simply requiring that it gives rise to a consistent theory at any energy. We will show that this does not happen. Consider again a theory with a four-fermion interaction. Since dim g2 = ;2, if we consider the scattering  +  !  +  in the high energy limit (where we can neglect all the masses), on dimensional ground we get that the total cross-section behaves like   g22E 2 (2.80) Analogously, in any non renormalizable theory, being there couplings with negative dimensions, the cross-section will increase with the energy. But the cross-section has to do with the S matrix which is unitary. Since a unitary matrix has eigenvalues of .

a)

b)

! )2 and ( ! )4. Fig. 2.3.3 - Divergent diagrams coming from the interactions ( 51

modulus 1, it follows that its matrix elements must be bounded. Translating this argument in the cross-section one gets the bound 2   Ec 2

(2.81)

g2E 2  c

(2.82)

where c is some constant. For the previous example we get This implies a violation of the S matrix unitarity at energies such that rc E g 2

(2.83)

It follows that we can give a meaning also to non renormalizable theories, but only for a limited range of values of the energy. This range is xed by the value of the non renormalizable coupling. It is not diculty to realize that non renormalizability and bad behaviour of the amplitudes at high energies are strictly connected.

2.4 Dimensional regularization of the Feynman integrals As we have discussed in the previous Section we need a procedure to give sense at the otherwise divergent Feynman diagrams. The simplest is just R of these procedures R to introduce a cut-o  and dene our integrals as 01 ! lim!1 0). Of course, in the spirit of renormalization we have rst to perform the perturbative expansion and then take the limit over the cut-o. Although this procedure is very simple it results to be inadequate in situations like gauge theories. In fact one can show that the cut-o regularization breaks the translational invariance and creates problems in gauge theories. One kind of regularization which nowadays is very much used is the dimensional regularization. This consists in considering the integration in an arbitrary number of space-time dimensions and taking the limit of four dimensions at the end. This way of regularizing is very convenient because it respects all the symmetries. In fact, except for very few cases the symmetries do not depend on the number of space-time dimensions. Let us what is dimensional regularization about. We want to evaluate integrals of the type

I4 (k) =

Z

d4p F (p k)

(2.84)

with F (p k)  p;2 or p;4 . The idea is that integrating on a lower number of dimensions the integral improves the convergence properties in the ultraviolet. For 52

instance, if F (p k)  p;4, the integral is convergent in 2 and in 3 dimensions. Therefore, we would like to introduce a quantity

I (! k) =

Z

d2! p F (p k)

(2.85)

to be regarded as a function of the complex variable !. Then, if we can dene a complex function I 0(! k) on the entire complex plane, with denite singularity, and as such that it coincides with I on some common domain, then by analytic continuation I and I 0 dene the same analytic function. A simple example of this procedure is given by the Euler's ;. This complex function is dened for Re z > 0 by the integral representation ;(z) = If Re z  0, the integral diverges as

Z1 0

dt e;t tz;1

dt

(2.86) (2.87)

t1+jRe zj

in the limit t ! 0.However it is easy to get a representation valid also for Re z  0. Let us divide the integration region in two parts dened by a parameter Z Z1 ; t z ; 1 ;(z) = dt e t + dt e;t tz;1 (2.88) 

0

Expanding the exponential in the rst integral and integrating term by term we get Z1 1 nZ  X ( ; 1) n + z ; 1 ;(z) = dt t + dt e;t tz;1 n ! 0  n=0 Z 1 X (;1)n n+z 1 ;t z;1 + dt e t (2.89) =  n=0 n! n + z

The second integral converges for any z since > 0. This expression coincides with the representation for the ; function for Re z > 0, but it is dened also for Re z < 0 where it has simple poles located at z = ;n. Therefore it is a meaningful expression on all the complex plane z. Notice that in order to isolate the divergences we need to introduce an arbitrary parameter . However the result does not depend on the particular value of this parameter. This the Weierstrass representation of the Euler ;(z). From this example we see that we need the following three steps  Find a domain where I (! k) is convergent. Typically this will be for Re ! < 2.  Construct an analytic function identical to I (! k) in the domain of convergence, but dened on a larger domain including the point ! = 2.  At the end of the calculation take the limit ! ! 2. 53

2.5 Integration in arbitrary dimensions Let us consider the integral

IN =

Z

dN pF (p2)

(2.90)

with N an integer number and p a vector in an Euclidean N -dimensional space. Since the integrand is invariant under rotations of the N -dimensional vector p, we can perform the angular integration by means of

dN p = d'N pN ;1dp

R

(2.91)

where d'N is the solid angle element in N -dimensions. Therefore d'N = SN , with SN the surface of the unit sphere in N -dimensions. Then

IN = SN

Z1 0

pN ;1 F (p2)dp

(2.92)

The value of the sphere surface can be evaluated by the following trick. Consider

I=

Z +1 ;1

p

e;x dx =

2

(2.93)

By taking N of these factors we get

IN

=

Z

dx1    dxN e;(x1 +    + xN ) = N=2 2

2

(2.94)

The same integral can be evaluated in polar coordinates

N=2

= SN

Z1 0

N ;1e; d 2

(2.95)

By putting x = 2 we have N  Z1 1 1 N= 2 N= 2 ; 1 ; x x e dx = 2 SN ; 2 (2.96)

= 2 SN 0 where we have used the representation of the Euler ; function given in the previous Section. Therefore N=2 SN = 2 ; N  (2.97) ; 2 and N=2 Z 1

xN=2;1 F (x)dx (2.98) IN = ; N  ; 2 0 with x = p2. 54

The integrals we will be interested in are of the type Z N (M ) IN = (p2 ; da2 p+ i)A (2.99) with p a vector in a N dimensional Minkowski space. We can perform an anticlockwise rotation of 900 (Wick's rotation) in the complex plane of p0 without hitting any singularity P. Then we do a change of variables p0 ! ip0, and we use the denition p2 ! ;p2 = Ni=1 p2i , obtaining Z N (M ) (2.100) IN = i (;p2d;pa2 )A = i(;1)AIN where IN is an integral of the kind discussed at the beginning of this Section with F (x) given by F (x) = (x + a2 );A (2.101) It follows N=2 Z 1 xN=2;1

IN = ; N  dx (2.102) ; 2 0 (x + a2 )A By putting x = a2y we get N=2 Z 1

2 N= 2 ; A ;  yN=2;1(1 + y);Adx IN = ( a ) (2.103) ; N2 0 and recalling the integral representation for the Euler B (x y) function (valid for Rex y > 0) Z1 ;( x );( y ) tx;1(1 + t);(x+y) dt (2.104) B (x y) = ;(x + y) = it follows

0

2) 1 IN = N=2 ;(A;(;AN= ) (a2)A;N=2

(2.105)

We have obtained this representation for N=2 > 0 and Re(A ; N=2) > 0. But we know how to extend the Euler gamma-function to the entire complex plane, and therefore we can extend this formula to complex dimensions N = 2! A ; !) 1 I2! = ! ;(;( (2.106) A) (a2 )A;! This shows that I2! has simple poles located at ! = A A + 1    (2.107) Therefore our integral will be perfectly dened at all ! such that ! 6= A A + 1   . At the end we will have to consider the limit ! ! 2. The original integral in Minkowski space is then Z d2! p 1 ! (;1)A ;(A ; ! ) = i

(2.108) (p2 ; a2 )A ;(A) (a2)A;! 55

For the following it will be useful to derive another formula. Let us put in the previous equation p = p0 + k and b2 = ;a2 + k2, then Z 1 d2! p0 ! (;1)A ;(A ; ! ) = i

(2.109) 0 2 0 2 2 A 2 (p + 2p  k + k ; a ) ;(A) (a )A;! from which Z d2! p 1 ! (;1)A ;(A ; ! ) = i

(2.110) (p2 + 2p  k + b2 )A ;(A) (k2 ; b2 )A;! Dierentiating with respect to k we get various useful relations as Z A ; !) ;k (2.111) d2! p (p2 + 2ppk + b2 )A = i ! (;1)A ;(;( A) (k2 ; b2 )A;! and Z ! A d2! p (p2 + 2pppk + b2 )A = i ;(A)( k(2;;1)b2 )A;!   1 2 2 (2.112)  ;(A ; !)kk ; 2 g (k ; b );(A ; ! ; 1) Since at the end of our calculation we will have to take the limit ! ! 2, it will be useful to recall the expansion of the Gamma function around its poles (2.113) ;() = 1 ;  + O()  where  = 0:5772::: (2.114) is the Euler-Mascheroni constant, and for (n  1):  n 1 ( ; 1) ;(;n + ) = n!  + (n + 1) + O() (2.115) where (2.116) (n + 1) = 1 + 21 +    + n1 ; 

2.6 One loop regularization of QED In this Section we will regularize, by using dimensional regularization, the relevant divergent quantities in QED, that is (p),  (q) e (p p0). Furthermore, in order to dene the counter terms we will determine the expressions which become divergent in the limit of ! ! 2. It will be also convenient to introduce a parameter  such that these quantities have the same dimensions in the space with d = 2! as in d = 4. 56

The algebra of the Dirac matrices is also easily extended to arbitrary dimensions d. For instance, starting from   ]+ = 2g (2.117) we get    = d (2.118) and    = (2 ; d) (2.119) Other relations can be obtained by starting from the algebraic properties of the  -matrices. Let us start with the electron self-energy which we will require to have dimension 1 as in d = 4. From eq. (2.4) we have Z d2! k p^ ; k^ + m   1 4 ; 2 ! (p) = i  (2.120)  (2 )2! (p ; k)2 ; m2 + i k2 + i In order to use the equations of the previous Section it is convenient to combine together the denominators of this expression into a single one. This is done by using a formula due to Feynman 1 = ab which is proven using

Z1 0

dz az + b(1 ; z)]2





1 = 1 1;1 = 1 ab b ; a a b b ; a and doing the change of variables

(2.121)

Z b dx a

x2

x = az + b(1 ; z) We get

Z 1 Z d2! k dz 

p^ ; k^ + m  2 ! 2 2 2 2 (2

) ( p ; k ) z ; m z + k (1 ; z )] 0 The denominator can be written in the following way (p) = i4;2!

:::] = p2z ; m2 z + k2 ; 2p  kz

(2.122) (2.123) (2.124) (2.125)

and the term p  k can be eliminated through the following change of variables k = k0 + pz. We nd :::] = (p2 ; m2 )z + (k0 + pz)2 ; 2p  (k0 + pz)z = k02 ; m2 z + p2z(1 ; z) (2.126) 57

It follows (we put k0 = k) (p) = i4;2!

Z 1 Z d2! k ^ dz (2 )2!  k2 ;p^(1m;2zz+) ;p2kz(1+ ;mz)]2   0

(2.127)

The linear term in k has vanishing integral, therefore Z 1 Z d2! k p^(1 ; z) + m 4 ; 2 !  (p) = i dz (2 )2!  k2 ; m  (2.128) 2 2 2 z + p z(1 ; z)] 0 and integrating over k Z1 ; !)  p^(1 ; z) + m   (2.129) 4 ; 2 ! (p) = i dz (2 1)2! i ! ;(2;(2)  2 m z ; p2 z(1 ; z)]2;! 0 By dening  = 4 ; 2! we get Z1 ; z) + m   (2.130)  (p) = ; dz (2 1)4; (4;)=2 ;(=2) m2 zp^(1 ; p2 z(1 ; z)]=2 0 Contracting the  matrices Z1 p(1 ; z) + (4 ; )m 1 (2.131) (p) = ; 2 ;(=2) dz(4 2)=2 ( ; 2)^ 2 16

m z ; p2z(1 ; z)]=2 0 we obtain (p) = 1 2 ;(=2)  m2z ; p2z(1 ; z) ;=2 Z 1 16

 dz2^p(1 ; z) ; 4m ; (^p(1 ; z) ; m)] (2.132) 4 2 Dening

0

A = 2^p(1 ; z) ; 4m B = ;p^(1 ; z) + m C = m z ;4p z2(1 ; z) 2

and expanding for  ! 0 (p) = = =

; =



2

(2.133)



1 Z 1 dz 2A + 2B ; A h1 ;  log C i 16 2 0  2   Z 1 1 2 A 16 2 0 dz  ; A log C + 2B ; A 1 (^p ; 4m) ; 1 ^p ; 2m +  (^p ; 4m)] 8 2Z 16 2 1 1 m2 z ; p2z(1 ; z) dz ^ p (1 ; z ) ; 2 m ] log 8 2 0 4 2 1 (^p ; 4m) + nite terms 8 2  58

(2.134)

Consider now the vacuum polarization (see eq. (2.8))  Z d2! k  1 1  4 ; 2 !   Tr ^  (q) = i   (2 )2! k + q^ ; m k^ ; m Z d2! k Tr(k^ + m) (k^ + q^ + m)] 4 ; 2 ! (2.135) = i (2 )2! (k2 ; m2)((k + q)2 ; m2 ) Using again the Feynman representation for the denominators Z 1 Z d2! k (k^ + m) (k^ + q^ + m)]  4 ; 2 !  (q) = i dz (2 )2! (k2 ;Trm2 )(1 ; z) + ((k + q)2 ; m2 )z]2 (2.136) 0 We write the denominator as :::] = k2 + q2z ; m2 + 2k  qz (2.137) through the change of variable k = k0 ; qz we cancel the mixed term obtaining :::] = (k0 ; qz)2 + 2(k0 ; qz)  qz + q2z ; m2 = k0 2 + q2 z(1 ; z) ; m2 (2.138) and therefore (we put again k0 = k) Z 1 Z d2! k Tr(k^ ; q^z + m) (k^ + q^(1 ; z) + m)]  4 ; 2 !  (q) = i dz (2 )2! k2 + q2z(1 ; z) ; m2]2 0 (2.139) Since the integral of the odd terms in k is zero, it is enough to evaluate the contribution of the even term to the trace Tr:::]even = Tr(k^ ; q^z + m) (k^ + q^(1 ; z)]pari + m2 Tr ] = Trk^ k^] ; Trq^ q^]z(1 ; z) + m2 Tr ] (2.140) If we dene the  as matrices of dimension 2!  2! we can repeat the standard calculation by obtaining a factor 2! instead of 4. Therefore Tr:::]even = 2! 2k k ; g k2 ; (2qq ; g q2)z(1 ; z) + m2 g ] = 2! 2k k ; 2z(1 ; z)(q q ; g q2 ) ; g (k2 ; m2 + q2z(1 ; z)](2.141) and we nd Z 1 Z d2! k h  4 ; 2 ! !  (q) = i 2 dz (2 )2! k2 + q2z2(1k;kz) ; m2 ]2 0 i 2 z (1 ; z )( q g  q ; g q 2 )  ; k2 + q2z(1 ; z) ; m2 ]2 ; k2 + q2z(1 ; z) ; m2 ] (2.142) From the relations of the previous Section we get Z ; !) (2.143) d2! p p22p;pa2 ]2 = ;ig ! ;(1 (a2 )1;! 59

whereas

Z

; !) d2! p p2 ;1 a2 ] = ;i ! ;(1 2 (a )1;!

(2.144)

Therefore the rst and the third contribution to the vacuum polarization cancel out and we are left with Z1 Z d2! k  4 ; 2 ! ! 2  (q) = ;i 2 (q q ; g q ) dz 2z(1 ; z) (2 )2! k2 + q2 z(1 1; z) ; m2 ]2 0 (2.145) Notice that the original integral was quadratically divergent, but due to the previous cancellation the divergence is only logarithmic. The reason is again gauge invariance. In fact it is possible to show that this implies q  (q) = 0. Performing the momentum integration we have Z1 ! ; !)  4 ; 2 ! ! 2  (q) = ;i2 2 (q q ; g q ) dz z(1 ; z) (2i

)2! m2 ;;(2 2 q z(1 ; z)]2;! 0 (2.146) By putting again  = 4 ; 2! and expanding the previous expression Z1 2;=2

  2 ; = 2 2  (q) = 2 2 (q q ; g q ) dz z(1 ; z) (2 )4; m2 ; q;(2z=(12); z)]=2 0  m2 ; q2z(1 ; z) ;=2 Z1 2 2 ; = 2 2 2 (q q ; g q ) dz z(1 ; z);(=2) = 16 2 4 2 0 = 162 2 (4 ; 2 log 2)(qq ; g q2) Z1 h i (2.147)  dz z(1 ; z)( 2 ;  ) 1 ; 2 log C 0 where C is denite in eq. (2.133). Finally 8 h  i Z 1 2  (q) = 8 2 (qq ; g q ) dz z(1 ; z)  ; 4 ; 4 log 2 1 ; 2 log C = 2 1 2 (qq ; g q2)  m2 ; q2z(1 ; z)  1  Z (2.148)  3 ; 6 ; dz z(1 ; z) log 2 2 or, in abbreviated way  (q) = 6 1 2 (q q ; g q2 ) 1 + nite terms (2.149) We have now to evaluate (p0 p). From eq. (2.11) we have Z d2! k p^ 0 ; k^ + m ^   p^ ; k + m  1  0 4 ; 2 !    (p  p) = ;i (2 )2! (p0 ; k)2 ; m2 (p ; k)2 ; m2  k2 (2.150) 60

the general formula to reduce n denominators to a single one is

Z 1 Yn (1 ; Pn i) = (n ; 1)! di Pn i=1a ]n a i 0 i=1 i i i=1 i=1

Yn 1

To show this equation notice that

Yn 1 Z 1 Yn i=1 ai

=

Introducing the identity 1= and changing variables i = i

0

Z1 0

Yn 1 Z 1 Yn = a i i=1

=

Z 1 Yn 0

i=1

0

i=1

i=1

d( ;

i=1

n X i=1

d id( ;

;

din d e 

d ie

;

n X

n X i=1

iai

(2.151)

iai

(2.152)

i)

n X i=1

(2.153)

i)e

(1 ;

;

n X i=1

n X i=1

iai

i)

(2.154)

The integration over i can be restricted to the interval 0 1] due to the delta function, and furthermore Z1 (2.155) dn;1e;  = (n ;n 1)! 0 In our case we get Z d2! k Z 1 Z 1;x  0 4 ; 2 ! dx dy  (p  p) = ;2i (2 )2! 0 0  (^ 0 ; k^ + m)  (^ ^  p   k2 (1 ; x ; y) + (p ; k)2x ; mp2x;+k (+p0m ; k)2y ; m2 y]3 (2.156) The denominator is :::] = k2 ; m2(x + y) + p2x + p02 y ; 2k  (px + p0y)

(2.157)

Changing variable, k = k0 + px + py :::] = (k0 + px + p0 y)2 ; m2 (x + y) + p2 x + p0 2y ; 2(k0 + px + p0y)  (px + p0 y) = k0 2 ; m2 (x + y) + p2x(1 ; x) + p02y(1 ; y) ; 2p  p0xy (2.158) 61

Letting again k0 ! k (p0 p) = ;2i4;2!

Z 1 Z 1;x Z d2! k dx dy

(2 )2!  0  p(1 ; x) ; p^0 y ; k^ + m) ^   k(^2p;(1m;2(xy)+;yp^)x+;p2kx+(1m;)x) (^ (2.159) 2 0 0 3 + p y(1 ; y) ; 2p  p xy] The odd term in k is zero after integration, the term in k2 is logarithmically divergent, whereas the remaining part is convergent. Separating the divergent piece, (3) (1)  , from the convergent one,  , (2)  = (1) (2.160)  +  we get for the rst term Z 1 Z 1;x Z d2! k (1) 0 4 ; 2 !  (p  p) = ;2i dx dy (2 )2! 0 0 k k     2 2 2 02 0 3 Z 1k ;Zm1;(xx +i y)! (+;p1)x3 (1 ;1x) + p y(1 ; y) ; 2p  p xy] = ;2i4;2! dx dy ;(3) ; 2 ;(2 ; !) 0 0       m2 (x + y) ; p2x(1 ; x) ; p0 2y(1 ; y) + 2p  p0xy]2;!  1 ! Z 1 Z 1;x 1 4 ; 2 ! = 2 4 ;(2 ; !) 0 dx 0 dy       (2.161) m2 (x + y) ; p2x(1 ; x) ; p0 2y(1 ; y) + 2p  p0xy]2;! Using the relation       = (2 ; d)2  (2.162) we obtain ( = 4 ; 2!) 0

0

  1  1 2;=2 ;(=2)( ; 2)2 Z 1 dx Z 1;x dy  2 4

0 0 1 2 2 m (x + y) ; p x(1 ; x) ; p0 2y(1 ; y) + 2p  p0xy]=2 2  Z 1 Z 1;x 1 = 32 2  ;  4 ; 4] dx dy 0 0  m2(x + y) ; p2x(1 ; x) ; p02y(1 ; y) + 2p  p0xy ;=2 4 2 Z 1 Z 1;x 1 1 1 dy = 8 2  ; 16 2 ( + 2) ; 8 2  dx 0 0

0 (1)  (p  p) =

62

 log

 m2(x + y) ; p2x(1 ; x) ; p02y(1 ; y) + 2p  p0xy  4 2

(2.163)

and nally

1 0 (1) (2.164)  (p  p) = 8 2   + nite terms In the convergent part we can put directly ! = 2 Z 1 Z 1;x i 2(;1)3 i (2) 0  (p  p) = ; 8 4 dx dy ;(3) 0 0  (^p0 (1 ; y) ; p^x + m)  (^p(1 ; x) ; p^0y + m) m2(x + y) ; p2x(1 ; x) ; p02 y(1 ; y) + 2p  p0xy Z 1 Z 1;x 1 = ; 16 2 dx dy 0 0  (^p0 (1 ; y) ; p^x + m)  (^p(1 ; x) ; p^0y + m) (2.165) m2(x + y) ; p2x(1 ; x) ; p02 y(1 ; y) + 2p  p0xy

2.7 One loop renormalization We summarize here the results of the previous Section (p) = 12 (^p ; 4m) + f (p) (2.166) 8    1 f 2 (2.167)  (q) = (q q ; g q ) 6 2 +  (q)  (q q ; g q2)(q2 )  (p0 p) = 12  + f (p0 p) (2.168) 8  where the functions with the superscript f represent the nite contributions. Let us start discussing the electron self-energy. As shown in eq. (2.2), the eect of (p) is to correct the electron propagator. In fact we have (see Fig. 2.7.1): SF (p) = p^ ;i m + p^ ;i m ie2 (p) p^ ;i m +    (2.169) .

Fig. 2.7.1 - The loop expansion for the electron propagator. from which, at the same order in the perturbative expansion



2 (p) SF (p) = p^ ; m 1 + ep^ ; m

i

63

;1

= p^ ; m +i e2 (p)

(2.170)

Therefore the eect of the divergent terms is to modify the coecients of p^ and m:









2 2 e e 1 + 8 2 ; m 1 + 2 2 + nite terms (2.171) F This allows us to dene the counter terms to be added to the lagrangian expressed in terms of the physical parameters in such a way to cancel these divergences ! (L1)ct = iB !@^ ; A (2.172)

iS ;1(p) = p^ ; m + e2 (p) = p^

(L1)ct modies the Feynman rules adding two particles interacting terms. These can be easily evaluated noticing that the expression of the propagator, taking into account (L1)ct, is





 1 ; B p^ ; A (1 + B )^p ; (m + A) p^ ; m p^ ; m  p^ ;i m + p^ ;i m (iB p^ ; iA) p^ ;i m i

i

(2.173)

where, consistently with our expansion we have taken only the rst order terms in A and B . We can associate to these two terms the diagrams of Fig. 2.7.2, with contributions ;iA to the mass term, and iB p^ to p^. . -iA

iBp/

Fig. 2.7.2 - The counter terms for the self-energy (/p in the gure should be read as

p^).

The propagator at the second order in the coupling constant is then given by adding the diagrams of Fig. 2.7.3. At this order we get ;  SF (p) = p^ ;i m + p^ ;i m ie2 (p) + iB p^ ; iA p^ ;i m (2.174) and the correction to the free propagator is given by

e2(p) + B p^ ; A =

 e2



 e2  8 2 + B p^ ; 2 2 m + A + nite terms

We can now x the counter terms by choosing



 

2 B = ; 8e 2 1 + F2 m 

64

(2.175)

(2.176)

. =

Fig. 2.7.3 - The second order contributions at the electron propagator.



 

2 1 m A = ; me + F (2.177) m 2 2   with F2 andFm nite for  ! 0. Notice also that these two functions are dimensionless and arbitrary up to this moment. However they can be determined by the renormalization conditions, that is by xing the arbitrary constants appearing in the lagrangian. In fact, given

iSF;1(p)  ;(2) (p) = p^ ; m + B p^ ; A + e2(p)

(2.178)

we can require that at the physical pole, p^ = m, the propagator coincides with the free propagator (2.179) SF (p)  p^ ;i m  for p^ = m

From here we get two conditions. The rst one is









e2 (^p ; 4m)+ e2f (^p = m) ; e2 1 + F p^+ me2 1 + F 8 2 8 2  2 2 2  m (2.180) from which 2 me2 F = 0 e2 f (^p = m) ; me F + (2.181) 8 2 2 2 2 m The second condition is @ ;(2) (p)  = 1 (2.182) @ p^ p^=m 0 = ;(2) (^p = m) =

65

which gives

 e2 F = 0 ;  @ p^ p^=m 8 2 2

e2 @ 

f (p)

(2.183)

One should be careful because these particular conditions of renormalization gives some problems related to the zero mass of the photon. In fact one nds some illdened integral in the infrared region. However these are harmless divergences, because giving a small to the photon and letting it to zero at the end of the calculations gives rise to nite results. Notice that these conditions of renormalization have the advantage of being expressed directly in terms of the measured parameters, as the electron mass. However, one could renormalize at an arbitrary mass scale, M . In this case the parameters comparing in Lp are not the directly measured parameters, but they can be correlated to the actual parameters by evaluating some observable quantity. From this point of view one could avoid the problems mentioned above by choosing a dierent point of renormalization. As far as the vacuum polarization is concerned,  gives rise to the following correction of the photon propagator (illustrated in Fig. 2.7.4) 0 (q ) = ;ig + ;ig ie2   (q ) ;ig +    D (2.184) q2 q2 q2 .

Fig. 2.7.4 - The loop expansion for the photon propagator. from which

0 (q ) = ;ig + ;ig (ie2 )(q q  ; g q 2 )(q 2 ) ;ig +    D q2 q2 q2  2 (q 2 ) ; i q q e2 (q 2 ) +    = ;ig 1 ; e (2.185) q2 q4

We see that the one loop propagator has a divergent part in g , and also divergent and nite pieces in the term proportional to the momenta. Therefore the propagator is not any more in the Feynman gauge. It follows that we should add to Lp L2 = ; 41 F F  ; 12 (@ A )2 = 21 A g A (2.186) 66

the two counterterms 1  E ;C E 1 C   2   2 (L2 )ct = ; 4 F F ; 2 (@ A ) = ;C 4 F F + 2 (@ A ) ; 2 (@ A )2 (2.187) As for the electron propagator, we can look at these two contributions as perturbations to the free lagrangian, and evaluate the corresponding Feynman rules, or evaluate the eect on the propagator, and then expand in the counter terms. The eect of these two terms to the equation which denes the propagator in momentum space is (1 + C )q2g ; (C ; E )q q ]D (q) = ;ig (2.188) We solve this equation by putting D (q) = (q2)g +  (q2)q q (2.189) Substituting in the previous equation we determine e  . The result is E = ; q2 (1 i+ C )   = ; qi4 (1 +CC ; (2.190) )(1 + E ) The free propagator, including the corrections at the rst order in C and E is q q (C ; E )  D (q) = ;ig (1 ; C ) ; i (2.191) q2 q4 and the total propagator 0 (q ) = ;ig 1 ; e2 (q 2 ) ; C ] ; i q q e2 (q 2 ) + C ; E ] (2.192) D q2 q4 We can now choose E = 0 and cancel the divergence by the choice m 2 e (2.193) C = ; 6 2  + F3  In fact we are free to choose the nite term of the gauge xing, since this choice does not change the physics. This is because the terms proportional to qq , as it follows from the gauge invariance, and the conservation of the electromagnetic current. For instance, if we have a vertex with a virtual photon (that is the vertex is connected to an internal photon line) and two external electrons, the term proportional to q q is saturated with u!(p0)u(p) q = p0 ; p (2.194) The result is zero, bay taking into account the Dirac equation. Let us see what the full propagator says about the mass of the photon. We have 0 (q ) = ;ig 1 ; F ; e2 f ; i q q F + e2 f  ;ig 1 ;  ~ ] ; i q q4  ~ D 3 3 2 4 2 q q q q (2.195) 67

where

~ = e2 f + F3 (2.196) At the second order in the electric charge we can write the propagator in the form   q ; i  q ~ 0 g + D (q) = 2 (q) (2.197) q (1 + ~ (q))  q2 and we see that the propagator has a pole at q2 = 0, since ~ (q2) is nite for q2 ! 0. Therefore the photon remains massless after renormalization. This is part of a rather general aspect of renormalization which says that if the regularization procedure respects the symmetries of the original lagrangian, the symmetries are preserved at any perturbative order. An exception is the case of the anomalous symmetries, which are symmetries at the classical level, but are broken by the quantum corrections. All the anomalous symmetries can be easily identied in a given theory. Also in this case we will require that at the physical pole, q2 ! 0, the propagator has the free form, that is ~ (0) = 0 (2.198) from which F3 = ;e2 f (0) (2.199) For small momenta we can write f  ~(q)  e2q2 d (2q)  (2.200) dq q2=0 since F3 is a constant. Using the expression for f from the previous Section, we get f (q)  ;





1  + 1 log m2 2 2 6 6 2 2

and from which



Z1 2 1 + 2 2 dz z2 (1 ; z)2 mq 2 +    (2.201) 0

2 2 ~ (q)  60e 2qm2





(2.202)

2q2 e g  (2.203)  = ;i q 2 1 ; 60 2 m2 + gauge terms The rst term, 1=q2, gives rise to the Coulomb potential, e2 =4 r. Therefore this is modied by a constant term in momentum space, or by a term proportional to the delta function in the real space 1 Z Z d4q 2  e e2 ; e4 3(~r) 2 ; iq ( x ; x ) 1 2 12 = e dt e ; = ; (2 )4 q2 60 2m2 4 r 60 2m2 (2.204)

D0

68

This modication of the Coulomb potential modies the energy levels of the hydrogen atom, and it is one of the contributions to the Lamb shift, which produces a splitting of the levels 2S1=2 and 2P1=2 . The total Lamb shift is the sum of all the self-energy and vertex corrections, and turns out to be about 1057:9 MHz. The contribution we have just calculated is only ;27:1 MHz, but it is important since the agreement between experiment and theory is of the order of 0:1 MHz. We have now to discuss the vertex corrections. We have seen that the divergent contribution is (1)  (p0  p), and this is proportional to  . The counter term to add to interacting part of Lp  ! Lint (2.205) p = ;e A is ! A (L3)ct = ;eD (2.206) The complete vertex is given by (see Fig. 2.7.5)



;ie  + e2 + D



(2.207)

.

(counter-term)

Fig. 2.7.5 - The one loop vertex corrections. We x the counter term by





2 D = ; 8e 2 1 + F2 (2.208) with F2 the same as in eq. (2.7). The reason is that with this choice we get the equality of the wave function renormalization factors Z1 = Z2. This equality follows from the conservation of the current, and therefore we can use the arbitrariness in the nite part of the vertex counter term to guarantee it. The one-loop vertex is then   F 2  0 2 f (2.209) ( ) =  + e ( ; 8 2 ) One can see that this choice of F2 is such that for spinors on shell the vertex is the free one u!(p)()0u(p)jp^=m = u!(p) u(p) (2.210)

69

We will evaluate now the radiative corrections to the g ; 2 of the electron. Here g is the gyromagnetic ratio, which the Dirac equation predicts to be equal to be two. To this end we rst need to prove the Gordon identity for the current of a Dirac particle   p + p0 i  0 0 (2.211) u!(p )u(p) = u!(p ) 2m + 2m  q u(p)

con q = p0 ; p. For the proof we start from

p^u(p) = (;m + 2p )u(p)

(2.212)

and

p^u(p) = m (2.213) Subtracting these two expressions we obtain p i    u(p) = m ; m  p u(p) (2.214) An analogous operation on the barred spinor leads to the result. We observe also that the Gordon identity shows immediately that g = 2, because it implies that the coupling with the electromagnetic eld is just e  F  (q) (2.215) 2m  To evaluate the correction to this term from the one loop diagrams, it is enough to evaluate the matrix element 0 e2 u!(p0)(2)  (p  p)u(p)

(2.216)

in the limit p0 ! p and for on-shell momenta. In fact (1)  contributes only to the terms in , and in the previous limit they have to build up the free vertex, as implied by the renormalization condition. Therefore we will ignore all the terms proportional to  and we will take the rst order in the momentum q. For momenta on shell, the denominator of (2)  is given by :::] = m2 (x + y) ; m2 x(1 ; x) ; m2 y(1 ; y) + 2m2 xy = m2(x + y)2

(2.217)

In order to evaluate the numerator, let us dene

V = u!(p0)  ^p0(1 ; y) ; p^x + m]   ^p(1 ; x) ; p^0y + m] u(p)

(2.218)

Using p^ = ; p^ + 2p, and an analogous equation for p^0 , we can bring p^ to act on the spinor at the right of the expression, and p^0 on the spinor at the left, obtaining





V = u!(p0) my  + 2(1 ; y)p0 ;   p^x   mx + 2(1 ; x)p ; p^0 y] u(p) (2.219) 70

Making use of

   = ;2      = 4g p^ p^0  = ;2^p0  p^

we get

(2.220) (2.221) (2.222)

h

V = u!(p0) ; 2m2 xy  + 2my(1 ; x)(;m  + 2p) ; 4my2p0  + 2mx(1 ; y)(;m  + 2p0 ) + 4(1 ; x)(1 ; y)im2  ; 2y(1 ; y)m2  ; 4mx2 p ; 2m2x(1 ; x)  ; 2xym2  u(p) (2.223) and for the piece which does not contain 

V = 4mu!(p0 ) p(y ; xy ; x2 ) + p0 (x ; xy ; y2) u(p)

(2.224)

Therefore the relevant part of the vertex contribution is Z 1;x 4 h (y ; xy ; x2) 2 Z 1 e 2 0 (2) 0 dx dy m u!(p0 ) p (x + y)2 e u!(p ) (p  p)u(p) ! ; 16 2 0 0 i 2 (2.225) + p0 (x ; xy ;2y ) u(p) (x + y) By changing variable z = x + y we have Z 1 Z 1;x y ; xy ; x2 Z 1 Z 1  1 ; x x  dx dy (x + y)2 = dx dz z ; z2 0 0 0 x =

from which

Z1 

0

;(1 ; x) log x + x ; 1]



1 2 2 2 = ;x log x + x + x2 log x ; x4 + x2 ; x = 14 0

(2.226)

2 0  p)u(p) ! ; e u!(p0 )p + p0 ]u(p) e2 u!(p0)(2) ( p (2.227)    16 2 m Using the Gordon in this expression, and eliminating the further contribution in  we obtain 2 ie 2 0 (2) 0 (2.228) e u!(p ) (p  p)u(p)jmom: magn: ! 16 2m u!(p0) q u(p) Finally we have to add this correction to the vertex part taken at p0 = p, which coincides with the free vertex 2 p + p0 i  e2   ie  0 0 u!(p ) + 16 2m  q ]u(p) = u!(p ) 2m + 2m 1 + 8 2  q ]u(p) (2.229)

71

Therefore the correction is

e ! e 1 +  (2.230) 2m 2m 2

Recalling that g is the ratio between S~  B~ and e=2m, we get g = 1 + + O( 2) (2.231) 2 2

This correction was evaluated by Schwinger in 1948. Actually we know the rst three terms of the expansion  2 ath = 12 (g ; 2) = 12 ; 0:32848 + 1:49( )3 +    = (1159652:4 0:4)  10;9 (2.232) to be compared with the experimental value aexp = (1159652:4 0:2)  10;9

72

(2.233)

Chapter 3 Vacuum expectation values and the S matrix 3.1 In- and Out-states For the future we will need a formulation of eld theory showing that any S -matrix element can be reduced to the evaluation of the vacuum expectation value of a T product of eld operators. Such a formulation is called LSZ (from the authors l (Lehaman, Symanzik and Zimmermann). This formalism is based on a series of very general properties as displacement and Lorentz invariance. It is interesting to see how far one can go in determining the features of a eld theory by using only invariance arguments. In the following we will work in the Heisenberg picture, that is the operators will be time-dependent and evolving with the full hamiltonian. Formally this representation can be obtained from eq. (1.217) by sending the free hamiltonian into the full hamiltonian. It should be also kept in mind that since no exact solution of an interacting eld theory is known, we are not sure that the assumptions we are going to make are consistent. Said that, our assumptions are: 1. The eigenvalues of the four-momentum lie within the forward light cone p2 = pp  0 p0  0 (3.1) 2. There exists a nondegenerate Lorentz invariant states of minimum energy. This is called the vacuum state #0  j0i (3.2) By convention, we assume that the corresponding energy is zero P0j0i = 0 (3.3) From eq. (3.1) we get also P~ j0i = 0 ! Pj0i = 0 (3.4) 73

.

Since p = 0 is a Lorentz invariant condition, it follows that #0 appears as the vacuum states in all the Lorentz frames. 3. For each stable particle of mass m, there exists a stable stae of single particle #1  jpi (3.5) where #1 is a momentum eigenstate with eigenvalue p such that p2 = m2 . 4. Neglecting the complications from the zero mass states we can add a further requirement: the vacuum and the single particle states give a discrete spectrum in p. For instance, the case of the mesons is given in Fig. 3.1. E

ep

gl sin

2m

ar

tic

le

sta

te

xxx Continuum xxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx xx

xx xx

m p

xx xx

Vacuum

Fig. 3.1 - Energy-momentum spectrum for particles of mass m. Remember that the pion is not a stable particle and decays into  + ! with a lifetime of the order of  2  10;8 sec., but this number is very large with respect to the natural decay frequency 1=m  5  10;24 sec. (m  140 MeV ). Therefore, by neglecting the weak interactions (responsible for the decay) we can assume that the pion is stable. The same argument can be applied to quarks and leptons that will be the main actors in the following. However we mentioned also the pions, to say that the arguments presented here can be applied (in the appropriate range of energy) also to composite particles. Notice that this assumption is very much in the spirit of perturbation theory. That is we introduce a eld for each particle, assuming that the interactions do not modify the spectrum in a violent way. The main tools of investigation in the eld of elementary particle physics are the scattering experiments. Ideally these experiments are realized by preparing at 74

t = ;1 a system of free particles. That is assuming that they do not interact each other (this requires that the particles are well separated in space). These particles will interact with a target at some nite time. After that scattering process we imagine that at t = +1 the emerging particles will behave again as free particles. In real life the times t = 1 correspond to laboratory time. That is we prepare the beam at some nite time ;T , the scattering process will occur around the time t = 0 (assuming this as the time scale) and then we will measure the outgoing particles at a time T . The ideal description considered before can be considered as correct if the interaction time t around t = 0 is much smaller that T . For instance, imagine an electron prepared in an eigenstate of momentum p, impinging over an atom. Here there is a further simplication that we are doing since in a nite time or a nite volume we cannot prepare a state of denite momentum. This is one of the reasons why analyzing a scattering process it is always convenient to quantize in a box of the dimension of the experiment itself (that is about the distance between the source of the beam and the detector). The same consideration holds for the energy, that is one has always to remember that the experiments is taking place between to nite times ;T and T . Having prepared the electron in the initial state at the time ;T , the particle will travel since it will reach the target where it will be subject to the interaction with the atoms of target itself. The interaction time will be of the order of the time necessary to cross the atomic dimensions for the case of a single scattering, or, at maximum, of the order of the thickness of the target for a multiple scattering. After that we will let again the electron to move freely since it will reach the detector where we will measure its properties as momentum and polarization. In conclusion we can think to describe the particles at t = 1 in terms of free elds, except that they will be subject to the self-interaction which, as we know, cannot be neglected. The free elds that we are going to use will be denoted by in(x) for t ! ;1 and by out (x) for t ! +1. The interacting eld (x)) can be thought of as constructed in terms of these free elds operators and such that at t ! 1 it reduces to out(x) or in(x). In order that the in- and out-elds describe correctly the incoming and outgoing free particles we have to require some properties. In particular we will require 1. in(x) must transform as the corresponding (x) with respect to the symmetries of the theory. In particular, with respect to translation it must satisfy P ] = ;i @in (x) (3.6) @x equivalent to require in(x + a) = eiP  ain(x)e;iP  a (3.7) 2. in(x) must satisfy the free Klein-Gordon equation (for a scalar eld, or the Dirac free equation for a spinor eld, etc.) corresponding to the physical mass m ( + m2 )in(x) = 0 (3.8) 75

Notice that this requirement implies that we have taken into account the selfinteractions of the eld which are responsible for the mass renormalization (we will mention the wave function renormalization later). The same properties are required for the out-elds. By using these two properties it follows at once that the in-eld creates from the vacuum the physical one particle state from the vacuum. In fact, let jpi an eigenstate ofP with eigenvalue p. We have ;i @x@  hpjin(x)j0i = hpjP in(x)]j0i = p hpjin(x)j0i (3.9) By applying ;i@=@x once more, we nd

;hpjin(x)j0i = p2hpjin(x)j0i

(3.10)

and using the Klein-Gordon equation (p2 ; m2 )hpjin(x)j0i = 0

(3.11)

We see that the only state that in(x) can create out of the vacuum is the one with p such that p2 = m2 . From our starting hypotheses we have that this is the state of single particle with mass m. Since the in- and out-elds are free elds we can apply all the corresponding formalism. In particular they can be Fourier expanded

Z

in(x) =

h

i

d3~k ain(~k)f~k (x) + ayin (~k)f~k(x)

with

f~k (x) = p 1 3 e;ikx  (2 ) 2!~k Inverting this relation, we nd

(3.12)

q

!~k = ~k2 + m2

(3.13)



ain(~k) = i d3~x f~k (x)@0 in(x) ; @0f~k (x)in(x)

(3.14)

Z

from which Pi ain(~k)] =

Z

d3~x

Z



 @ @ (x)   @  @ (x)  in in  ; @x0 f~k (x) @x ~k 0 @xi i @x

@f (x) ! # " @f (x)   @ @ ~ ~

f (x)

= ; d3~x

= ;kiain(~k)

k

@xi

@x0 in(x)

k ; @x0 @x i

in(x)

(3.15)

Since f~k(x) and in(x) both satisfy the free Klein-Gordon equation, we see that the operators dened in eq. (3.14) are time independent. This follows immediately, by 76

recalling that if 1(x) and 2(x) are both solutions of the Klein-Gordon equation, then the four-vector @2 ; @1  j = 1 @x (3.16)  @x 2 is a conserved current, and therefore the charge

Q=

Z

d3~xj0

(3.17)

is a constant of motion. Using this, it follows  @ (x)  Z   @ @in(x)   @ in  3  P0 ain(~k)] = d ~x f~k (x) @x0 @x0 ; @x0 f~k (x) @x 0 #  @ @f~(x) ! Z " @f~(x)  @ k = ; d3~x k 0  in (x) ; 0 0 @x @x @x @x0 in(x) = ;k0 ain(~k) (3.18) In conclusion

P ain(~k)] = ;kain (~k) Also, by hermitian conjugation we get P ayin(~k)] = k ayin(~k)

(3.19) (3.20)

It follows that ayin (~k) creates states of four-momentum k whereas ain(~k) annihilates states with momentum k. For instance, Payin(~k1)ayin (~k2)j0i = P ayin(~k1)ayin(~k2)]j0i = (k1 + k2) ayin(~k1)ayin(~k2)j0i (3.21)

Pain (~k1)ayin(~k2)j0i = P ain(~k1)ayin (~k2)]j0i = (k2 ; k1) ain(~k1)ayin(~k2)j0i (3.22)

By using the condition on the spectrum of the momentum operator we get also ain(~k)j0i = 0 (3.23) since, otherwise, this state would have a negative eigenvalue of the energy. We get also hp1     pM  injk1     kN  ini = 0 (3.24) unless M = N and the set (p1     pM ) is identical to the set (k1     kN ). Let us now look for a relation between the in- and the -eld. The eld (x) will satisfy an equation of the type ( + m2)(x) = j (x)

(3.25)

where j (x) (the scalar current) contains the self-interactions of the elds and possible interactions with other elds in the theory. We need to solve this equation with the 77

boundary conditions at t = 1 relating (x) with the in- and out-elds. Notice that in writing the previous equation we have included the eects of mass renormalization inside the current j (x), but we know that at the same time we have also a wave function renormalization. Therefore asymptotically the eldp (x) cannot p be the same as the in- and out-elds but it will dier by a factor Z , where Z is the wave function renormalization. Therefore, the required asymptotic conditions are

p

lim (x) = t!;1 lim Zin(x) t!;1 The solution for (x) is then

p

p

lim (x) = t!lim Zout (x) t!+1 +1

Z

(x) = Zin(x) + d4yret(x ; y m2)j (y)

(3.26)

(3.27)

We will see in a moment the meaning of Z in this context. The function ret (x m2 ) is the retarded Green's function, dened by ( + m2 )ret (x m2) = 4(x)

(3.28)

such that

ret(x m2 ) = 0 for x0 < 0 (3.29) The need of Z in this context, is because we want the the in-eld properly normalized in order to produce single particle states. In fact, both h1jin(x)j0i e h1j(x)j0i have the same x-dependence (they must be both proportional to exp(ipx), see eq. (3.6)), but (x) may create out of the vacuum states other than the single particle state, and therefore the normalization of the two previous matrix elements cannot be the same. It can be shown that the asymptotic condition (3.26), cannot hold as an operator statement. This is because it is not possible to isolate the operator (x) from the scalar current j (x) at t = ;1, since this includes also the self-interactions. However the statement can be given in a weak form for the matrix elements. The correct way of doing it is rst to smear out the eld over a space-like region

f (t) = i

Z

d3 x (f (~x t)@0 (~x t) ; @0 f (~x t)(~x t))

(3.30)

with f (~x t) an arbitrary normalizable solution of the Klein-Gordon equation ( + m2 )f (x) = 0

(3.31)

Then the asymptotic condition is formulated as

p

lim h jf (t)j i = Z t!;1 lim h jf (t)in j i t!;1 where j i and j i are any two normalizable states. 78

(3.32)

Similar considerations can be made for the out-eld. In particular we can nd a relation between the in- and out-elds

p

Z

(x) = Zout (x) + d4yadv (x ; y m2)j (y)

(3.33)

where

adv (x m2 ) = 0 for x0 > 0 (3.34) ( + m2 )adv (x m2 ) = 4(x) (3.35) is the advanced Green's function. By comparing this expression with eq. (3.27) we nd Z p p Zin(x) = Zout(x) + d4y(x ; y m2)j (y) (3.36)

with

(x ; y m2) = adv (x ; y m2) ; ret(x ; y m2) (3.37) The construction of the out-operators of creation and annihilation goes along the same lines discussed for the in-operators.

3.2 The S matrix We have shown in the previous Section that an initial state of n non-interacting particles can be described by

jp1     pn ini = ayin(p1)    ayin(pn)j0 ini  j  ini

(3.38)

where stands for the all collection of the quantum numbers of the initial state. After the scattering process we are interested to evaluate the probability amplitude for the nal state to be a state of non-interacting particles described by the outoperators jp01     p0n outi = ayout (p01 )    ayout (p0n)j0 outi  j  outi (3.39) The probability amplitude is given by the S matrix elements

S = h  outj  ini

(3.40)

We can also dene an operator S transforming the in-states into the out-states

h  injS = h  outj such that

(3.41)

S = h  injS j  ini (3.42) From the denition we can derive some important properties for the operator S 79

1. Since the vacuum is stable and not degenerate, it follows jS00 j = 1, and therefore h0injS = h0outj = eih0inj (3.43) By choosing the phase equal to zero we get

j0 outi = j0 ini  j0i

(3.44)

2. In an analogous way the stability of the one particle state requires

jp ini = jp outi

(3.45)

3. The operator S maps the in- to the out-elds

in(x) = Sout (x)S ;1

(3.46)

In fact, let us consider the following matrix element

h  outjoutj  ini = h  injSoutj  ini

(3.47)

But h  outjout is an out-state and therefore

h  outjout = h  injinS

(3.48)

Then

h  outjout (x)j  ini = h  injSout (x)j  ini = h  injin(x)S j  ini (3.49) which shows the relation (3.46). 4. The S is unitary. This follows from the very denition (3.41) implying from which showing that

S yj  ini = j  outi

(3.50)

h  injSS yj  ini = h  outj  outi = 

(3.51)

SS y = 1 From eq. (3.50) and the unitarity of S we get

j  ini = S y;1j  outi = S j  outi

(3.52) (3.53)

5. S is Lorentz invariant, as it follows from the fact that transform in- into outelds having the same Lorentz properties. For the same reason the S matrix is invariant under any symmetry of the theory. 80

3.3 The reduction formalism In this Section we will describe a systematic way of evaluating the S matrix elements in terms of the vacuum expectation values of T -products of eld operators. Let us start considering the following S matrix element Sp = h  outj  p ini (3.54) where j  p ini is the state of a set of particles plus an additional incoming particle of momentum p . Here we will consider the case of scalar particles, but the formalism can be easily extended to other cases. We want to show that, by using the asymptotic condition, it is possible to extract the particle of momentum p from the state by introducing the corresponding eld operator. Let us consider the following identities h  outj  p ini = h  outjayin(p)j  ini = h  outjayout (p)j  ini + h  outjayin(p) ; ayout (p)j  ini = hZ ; p outj  in i @  h 3 ; i d ~x fp~(x) @x0 h  outjin(x) ; out(x)j  ini  @ i (3.55) ; @x0 fp~(x) h  outjin(x) ; out (x)j  ini where we have used the adjoint of eq. (3.14). We have denoted j ; p outi the outstate obtained by removing the particle with momentum p, if present in the state, otherwise it must be taken as the null vector. As we know the integral appearing in the previous expression is time-independent since fp~(x), in(x) and out(x) are solutions of the Klein-Gordon equation. Therefore we are allowed to make the following substitutions (in the weak sense, since we are taking matrix elements) 1 lim  in(x) = 0lim p (x) 0 x !;1 x !;1 Z 1 (x) p lim  ( x ) = lim (3.56) out x0 !+1 x0 !+1 Z obtaining h  outj  p ini = h ;p outj  ini Z  h @ i 3 ; lim d ~x fp~(x) @x0 h  outj(x)j  ini + p x0lim !+1 x0 !;1 Z @  i ; @x0 fp~(x) h  outj(x)j  ini (3.57) By using the identity  @   Z +1 @   @ 4 d x @x0 g1(x) @x0 g2(x) ; @x0 g1(x) g2(x) = ;1 81

= =

 Z

lim ; lim

Z

d3~x



@  @   g1(x) @x0 g2 (x) ; @x0 g1(x) g2 (x)

x0 !+1 x0 !;1 +1 d4 x g1(x)g2(x) ; g1 (x)g2(x)]

;1

(3.58)

we can write

h  outj  p ini = h ;Zp outj  ini h i i 4 + p d xh  outj fp~(x)(x) ; fp~(x)(x) j  ini (3.59)

Z By using the Klein-Gordon equation for fp~(x) we get

h  out j  p ini = h ; p outj  ini Z h ;  i + pi d4xh  outj fp~(x)(x) ; (r2 ; m2)fp~(x) (x) j  ini (3.60)

Z Integrating by parts

h  outj  p ini = h ;Zp outj  ini + pi d4xfp~ (x)( + m2 )h  outj(x)j  ini

(3.61) Z We can iterate this procedure in order to extract all the in- and out-particles from the states. Let us see also the case in which we want to extract from the previous matrix element an out-particle. This exercise is interesting because it shows how the T -product comes about. Let us suppose that the out state is of the type  = ( p0), with p0 the momentum of a single particle. We want to extract this particle from the state

h  outj(x)j  ini = h p0 outj(x)j  ini = h  outjaout (p0)(x)j  ini = h  outjaout (p0)(x) ; (x)ain(p0)]j  ini + h  outj(x)ain(p0 )j  ini = hZ outj(x)j ;p0  ini  h @ 3 + i d ~y fp~ (y) @y0 h  outj(out (y)(x) ; (x)in(y))j  ini 0

@  i  ; @y0 fp~ (y) h  outj(out (y)(x) ; (x)in(y))j  ini 0

(3.62)

Using again the time independence of the integral we have

h  out jout(y)(x) ;(x)in(y)]j  ini  ; lim h  outjT ((y)(x))j  ini = p1 y lim !+1 y !;1 Z

0

0

82

(3.63)

and eq. (3.58)

h  out j(x)j  ini = h Zoutj(x)j ;p0 ini   h i @ 3 ; lim d ~y fp~ (y) @y0 h  outjT ((y)(x))j  ini + p y lim !+1 y !;1

 Z@ 0  0 i ; @y0 fp~ (y) h  outjT ((y)(x))j  ini = h  outj(x)j ; p0 ini Z h i @ 2 T ((y)(x)) ; f  (y)T ((y)(x))ij  ini + p d4 yh  outj fp~ (y) @y p~ 2 Z 0 = h  out j(x)j ; p0 ini Z (3.64) + pi d4 yfp~ (y)( + m2 )y h  outjT ((y)(x))j  ini Z Substituting inside the (3.61) h  outj  p ini = h ;Zp outj  ini + pi d4xfp~(x)( + m2 )xh  outj(x)j ; p0 ini  Zi 2 Z d4xd4 y + p Z fp~(x)fp~ (y)( + m2 )x( + m2 )y h  outjT ((y)(x))j  ini (3.65) More generally if we want to remove all the particles in (q1     qm) and in (p1     pn) with the condition pi 6= qj we get hp1     pm outjq1     qn ini  i m+n Z Y mn = p d4xid4yj f~qi (xi)fp~j (yj )( + m2 )xi ( + m2 )yj  Z ij =1 h0jT ((y1)    (yn)(x1 )    (xm ))j0i (3.66) If some of the qi's are equal to some of the pj 's, we can reduce further the matrix elements of the type h  outj ; p ini. Therefore we have shown that it is possible to evaluate all the S matrix elements once one knows the vacuum expectation values of the T -products (called also the n-point Green's function). The result obtained here can be easily extended to generic elds simply by substituting to the wave functions fp~(x) the corresponding solutions of the wave equation (for instance, for spinor elds, spinors times plane waves) and to the Klein-Gordon operator the corresponding wave operators (for the spinor Dirac operator (i@/ ; m)). pIn the Dirac case one p case the p halso to change i= Z to ;i= Z2 for fermions and to +i= Z2 for antifermions, where Z2 is the wave function renormalization for the spinor eld. 0

0

0

0

0

0

83

We will show in the following that a simple technique to evaluate perturbatively the vacuum expectation value of n elds, a n-point Green's function, can be obtained in the context of the path-integral formulation of quantum mechanics.

84

Chapter 4 Path integral formulation of quantum mechanics 4.1 Feynman's formulation of quantum mechanics In 1948 Feynman gave a formulation of quantum mechanics quite dierent from the standard one. This formulation did not make use of operators and of Schrodinger equation, but rather it was giving an explicit expression for the quantum-mechanical amplitude, although using a mathematical formalism which was far from being well dened, which was obtained directly from the physical idea that the probability amplitude to go from one state to another can be obtained by summing together all the possible ways with an appropriate weight. This was as apply the ordinary composition laws in probability theory, to the probability amplitudes (remember that the probability is the square of the amplitude). The key observation to get the appropriate weight was taken by the Dirac's book where it was shown that, for one degree of freedom, the probability amplitude

hq0 t + tjq ti

(4.1)

for t innitesimal, is given by the eiS where S is the classical action to go from q at the time t to q0 at the tome t + t. In fact we will show that the Feynman's formulation can be derived from the usual formulation of quantum mechanics. Although the path integral formulation (as the Feynman's approach is usually called) is just a dierent way of formulating quantum mechanics, at the beginning of the 70's it turned out to be a crucial tool for the quantization of gauge theories. Also it is one the few methods which can be used for studying situations outside the perturbative approach. In order to derive the Feynman's formulation we will start studying a quantum system described by one degree of freedom, and we will see later how this can be extended also to innite degrees of freedom as needed in quantum eld theory. First of all we notice that the quantum mechanical problem is to evaluate 85

the time evolution, which is given by the propagator. This can be evaluated in any basis, for instance in conguration space, its matrix elements are given by hq0 t0jq ti t0  t (4.2) It will be convenient to work in the Heisenberg representation, that is using timedependent operators q(t). The states of the previous equation are dened as eigenstates of the operator q(t) at the time t, that is q(t)jq ti = jq tiq (4.3) It follows immediately that the relation between these states at dierent times is given by jq ti = eiHtjq 0i (4.4) for a time independent hamiltonian. The matrix element (4.2) can be evaluated by slicing the time interval t0 ; t in innitesimal pieces and using the completeness relation. we get

hq0 t0jq ti where Therefore

Z

= nlim dq1    dqn;1hq0 t0jqn;1 tn;1ihqn;1 tn;1jqn;2 tn;2i  !1   hq1 t1jq ti (4.5)

tk = t + k

hq0 t0jq ti = nlim !1

t0  t

Z nY ;1 ! nY ;1 k=1

dqk

k=0

tn  t0

!

hqk+1 tk+1jqk  tk i

(4.6) (4.7)

We have still to evaluate the matrix element (4.2), but now for an innitesimal time interval . let us use again the completeness in momentum space with states taken at a time t~ in between tk e tk+1:

hqk+1 tk + jqk  tk i =

Z

dphqk+1 tk + jp t~ihp t~jqk  tk i

The mixed matrix elements can be written as hqk+1 tk + jp t~i = hqk+1 0je;iH (p q)(tk +  ; t~)jp 0i

(4.8)

(4.9) (remember /h = 1). making use of the canonical commutation relations for p and q, the hamiltonian can be written in such a way that all the operators q are at the left of all the operators p. The result will be denoted by the symbol H+(q p). By doing so we get ;  hqk+1 tk + jp t~i ' hqk+1 0jp 0i 1 ; iH+(qk+1 p)(tk +  ; t~) ' hqk+1 0jp 0ie;iH+ (qk+1 p)(tk +  ; t~) (4.10) 86

The function H+(q p) is obtained by substituting in H+(q p) the operators q and p with their eigenvalues. Notice that although H (q p) and H+(q p) are the same operator, H (q p) e H+(q p) are, in general, dierent functions. In this way we get hqk+1 tk + jp t~i ' p1 ei(pqk+1 ; H+(qk+1  p)(tk+1 ; t~)) (4.11) 2

and hp t~jqk  tk i ' p1 ei(;pqk ; H;(qk  p)(t~ ; tk )) (4.12) 2

where H; is the eigenvalue of the operator H;(p q) dened by putting the operators q to the right of p by using the canonical commutation relations. By choosing t~ = (tk + tk+1)=2 and the previous two equations, we get Z dp hqk+1 tk + jqk  ti ' 2 ei(p(qk+1 ; qk )= ; Hc(qk+1 qk  p)) (4.13) where Hc(qk+1 qk  p) = 21 (H+(qk+1 p) + H;(qk  p)) (4.14) Since in the limit  ! 0 we have

hqk+1 tk + jqk  tk i ! (qk+1 ; qk ) it is reasonable to dene a variable velocity as q_ = qk+1 ; qk k



and write

(4.15) (4.16)

Hc(qk+1 qk  p)  H (qk  p) (4.17) In the same limit H (qk  p) is nothing but the hamiltonian of the system evaluated in the phase space. Therefore Z dp hqk+1 tk + jqk  tk i ' 2 eiL(qk  p) (4.18) where L(qk  p) = pq_k ; H (qk  p) (4.19) is the lagrangian in phase space. Coming back to (4.7) and using (4.18) we obtain

! i X L(qk  pk) Z nY ;1 ! nY ;1 dpk e k=0 hq0 t0 jq ti = nlim dq k !1 2

n;1

k=1

k=0

87

(4.20)

In the limit, the argument of the exponential gives the integral between t and t0 of the lagrangian in the phase space, therefore we get the canonical action

S=

Zt

0

t

dt L(q p)

(4.21)

In the limit n ! 1, the multiple integral in eq. (4.20) is called the functional integral and we will denote it symbolically as

hq0 t0jq ti = where

d(q(t)) =

Z q t 0

0

qt

Y t
i d(q(t))d(p(t))e

dq(t00) d(p(t)) =

0

Zt

0

t

dtL(q p)

Y dp(t00) tt t 00

0

2

(4.22) (4.23)

The asymmetry between the integration in q(t) and p(t) implies that the integral is not invariant under a generic canonical transformation. Notice that the integration is made over all the functions q(t00) such that q(t) = q e q(t0 ) = q0. On the contrary there are no limitations on p(t00 ), as it required by the uncertainty principle.

4.2 Path integral in conguration space The path integral in eq. (4.20) can be easily written as a functional integral in conguration space when the hamiltonian of the system is of the type

H (q p) = 2pm + V (q) 2

In such a case

(4.24)

2 H+(q p) = H;(q p) = H (q p) = 2pm + V (q) The momentum integration can be done naively

hq0 t0 jq ti

= nlim !1



(4.25)

Z nY;1 ! k=1

dqk

Y dpk ipk (qk+1 ; qk ) ; p2k =2m ; (V (qk+1) + V (qk ))=2] e = k=0 2 Z nY;1 ! 1 r 2m !n nY;1 2

n;1

= nlim !1

dqk

2 i k=1  e;i(V (qk+1) + V (qk ))=2 = 88

k=0

e(;2m=i)(qk+1 ; qk ) =4

=



Z r m n nY;1 ! lim dqk n!1 2i  k=1 nY ;1 im((qk+1 ; qk )=)2 =2 ; (V (qk+1) + V (qk ))=2] k=0

e

with the result

hq0 t0jq ti =

(4.26)

Z r m n nY ;1 ! lim dqk n!1 2i  k=1 n;1 X 2 i

 e

k=0

m((qk+1 ; qk )=) =2 ; (V (qk+1) + V (qk ))=2]

(4.27)

We have also

2 V (q ) + V (q ) # Z t n;1 "  X m q k+1 ; qk k lim  ; k+1 = dtL(q q_) 0

n!1

k=0

2



2

t

(4.28)

where L(q q_) is the lagrangian in conguration space. Therefore we will write

hq0 t0jq ti =

Z q t 0

0

qt

i

D(q(t))e

Zt

0

t

dtL(q q_)

(4.29)

with the functional measure given by

r m n nY;1 dqk D(q(t)) = nlim !1 2i  k=1

(4.30)

It is the expression (4.29) (really dened by (4.27)) which was originally formulated by Feynman. In fact it has a simple physical. But before discussing the interpretation let us notice that for N degrees of freedom and with hamiltonians of the type

H (qi pi) =

N 2 X pi

(4.31) 2m + V (qi) i = 1 : : :  N the expression (4.29) can be easily generalized. However for arbitrary system one has to start with the formulation in the phase space of the previous Section, which also can be easily generalized to an arbitrary number of degrees of freedom. For discussing the interpretation of the expression (4.29), let us recall the boundary conditions q(t) = q q(t0) = q0 (4.32) i=1

89

. t'

t

q

q'

Fig. 4.2.1 - The Feynman denition for the path integral. The expression in the exponent of (4.29) is nothing but the action evaluated for a continuous path starting from q at time t and ending at q0 at time t0 . This path is approximated in eq. (4.27) by many innitesimal straight paths as shown in Fig. 4.2.1. Therefore the interpretation is that in order to evaluate the amplitude to go from (q t) to (q0 t0) is obtained by multiplying together the amplitudes corresponding to the innitesimal straight paths. Each of these amplitudes (neglecting the normalp ization factor m=2i ) is given by the exponential of i times the action evaluated along the path. Furthermore we need to sum (or integrate functionally) over all the possible paths joining the two given end points. In a more concise way one says that the amplitude to from one point to another is obtained by summing over all the possible paths joining the two points with a weight proportional to the exponential of iS , where S is the classical action evaluated along the path.

90

4.3 The physical interpretation of the path integral Feynman arrived to the path integral formulation in conguration space on a more physical basis. First of all he knew that the amplitude for innitesimal time dierences is proportional to the exponential of iS , with S the classical action evaluated along the innitesimal path. From that one can derive directly the nal formula in the following way. Let us consider the classical double slit experiment, as illustrated in Fig. 4.3.1, where S is the electron source. The electrons come out of the slit A and arrive to the screen C, where they are detected by D, going through one of the two silts A or B. By counting the electrons with the detectpr D we can determine, as a function of x, how many electrons arrive on C, and therefore the probability distribution of the electrons on the screen C. Under the conditions of Fig. 4.3.1, that is with both slits 1 and 2 open, one gets the distribution given in Fig. 4.3.2. In order to explain this distribution it would seem reasonable to make the following assumptions: . D 1 x S

2 A

C

B

Fig. 4.3.1 - The double slit experiment. . P

x

Fig. 4.3.2 - The experimental probability distribution, P , with both slits open.  Each electron must go through the slit 1 or through the slit 2. 91

 As a consequence the probability for the electron to reach the screen at the

point x must be given by the sum of the probability to reach x through 1 with the probability to go through 2. These hypotheses can be easily checked by closing one of the two slits and doing again the experiment. The results obtained in the two cases are given in Fig. 4.3.3. . P2

P1

x

Fig. 4.3.3 - The experimental probability distributions, P1 and P2 obtained keeping open the sli1 1 or the slit 2 respectively.

We see that P 6= P1 + P2, therefore one of our hypotheses must be wrong. In fact, quantum mechanics tells us that we can say that an electron went through one of the two slits with absolute certainty, only if we detect its passage through the slit. Only in this case one would get P = P1 + P2. In fact, this is the result that one obtains when doing the experiment with a further detector telling us which of the holes have been crossed by the electron. This means that with this experimental setting the interference gets destroyed. On the contrary the experiment tells us that when we do not detect the hole crossed by the electron, the total amplitude for the process is given by the sum of the amplitudes corresponding to the two dierent possibilities, that is P = jAj2 = jA1 +A2j2 = jA1j2 +jA2j2 +A?1A2 +A1A?2 6= jA1j2 +jA2j2 = P1 +P2 (4.33) The two expressions dier for a term which is not positive denite and which is the responsible for the interference pattern. Therefore, quantum mechanics tells us that the usual rules for combining together probabilities are correct, but the result that we get follows from our ignorance about the path followed by the particle, unless we measure it. That is it does not make sense, under such circumstances, dene a concept as the path followed by a particle. The Feynnman's interpretation is a little bit dierent although at the end the theories turn out to be equivalent. He says that it is still possible to speak about trajectories of the electron if one gives up to the usual rules of combining probabilities. This point of view requires taking into account dierent type of alternatives according to dierent experimental settings. In particular one denes the following alternatives 92

1. Exclusive alternative - We will speak about exclusive alternatives when we use an apparatus to determine the alternative chosen by the system. For instance one we use a detector to determine the slit crossed by the electrons. In this case the total probability is given by the sum of the probabilities corresponding to the dierent alternatives,

P=

X

Pi:

i

(4.34)

2. Interfering alternatives - This is the situation where the alternative chosen by the system is not detected by an experimental apparatus. In this case we associate to each alternative an amplitude Ai and assume that the total amplitude is given by the sum of all the amplitudes Ai

A=

X i

Ai

(4.35)

The total probability for the process is evaluated as

P = jAj2 = j

X i

Ai j2

(4.36)

Therefore the probability rules are strictly related to the experimental setting. There are situations in which one may have both type of alternatives. For instance, if we want to know the total probability of having an electron in C within the interval (a b), having a detector in C and not looking at the slit crossed by the electrons. In this case the total probability will be given by

P=

Zb a

dxjA1(x) + A2 (x)j2

(4.37)

Since the alternatives of getting the electron at dierent points in C are exclusive ones. From this point of view it is well conceivable to speak about trajectories of the electron, but we have to associate to each possible path an amplitude and then sum all over the possible paths. For instance, suppose that we want to evaluate the amplitude to go from S to the screen C when the screen B is eliminated. This can be done by increasing the number of holes in the screen B. Then, if we denote by A(x y) the amplitude for the electron to reach x going through the hole placed at a distance y from the center of B, the total amplitude will be given by

A(x) =

Z

dyA(x y)

(4.38)

We can continue by using more and more screen of type B with more and more holes. That is we can construct a lattice of points between A and C and construct 93

all the possible paths from A to the point x on C joining the sites of the lattice, as shown in Fig. 4.3.4. By denoting with C the generic path we get that the total amplitude is given by X A(x) = A(x C ) (4.39) C

. D C1

x

S C2

C

A

Fig. 4.3.4 - The lattice for the evaluation of equation (4.39). From this point it is easy to get the path integral expression for the total amplitude, since each path is obtained by joining together many innitesimal paths, and for each of these paths the amplitude is proportional to exp(iS ) where S is the action for the innitesimal path. Therefore, for the single path C the amplitude will be given by exp(iS (C ). This is the nal result, except for a proportionality factor. This is easy to get if one starts from the phase space formulation. This is important because in many cases the simple result outlined above is not the correct one. For instance, consider the following lagrangian L = m2 q_2f (q) ; V (q) (4.40) where f (q) is an arbitrary function of q. The canonical momentum is given by

p = mqf _ (q)

(4.41)

and the corresponding hamiltonian is

H = 2mf1 (q) p2 + V (q) 94

(4.42)

Notice that for qk  qk+1 we have f (!qk )  (f (qk ) + f (qk+1))=2, where q!k = (qk + qk+1)=2. Therefore, form eq. (4.20) we get

hq0 t0jq ti = nlim !1 

Z nY;1 ! k=1

dqk

Y dpk ipk (qk+1 ; qk ) ; p2k =(2mf (!qk )) ; (V (qk+1) + V (qk ))=2] e (4.43)

n;1 k=0

2

This expression is formally the same as the one in eq. (4.26) after sending m ! mf (!qk ). Therefore we get

Z r mf (!qk ) !n nY ;1 ! lim dqk

hq0 t0 jq ti =

n!1 n;1

 e

i

2i 

X k=0

k=1

mf (!qk )((qk+1 ; qk )=)2=2 ; (V (qk+1) + V (qk ))=2]

(4.44)

We can rewrite this expression in a way similar to the one used in eq. (4.29)

hq0 t0jq ti =

Z q t 0

0

qt

i

Zt

D(q(t))e

0

t

dtL(q q_)

(4.45)

with L(q q_) given in (4.40), but the functional measure is now

D(q(t)) = nlim !1

r

mf (!qk ) 2i 

!n nY;1 k=1

dqk

(4.46)

The formal expression for the path integral is always the same, but the integration measure must be determined by using the original phase space formulation. In the continuum limit we have n;1 1X nY ;1 p 2 k=0 log f (!qk ) lim f (!qk ) = nlim e = n!1 !1 k=0

= nlim e !1

n;1 1X 2 k=0  log f (!qk )

where we have used lim n!1

n ;1 X k=0

=

Zt

0

t

1 (0) Z t dt log f (q) t = e2 (4.47)

hk = (t ; t ) lim h k n!1 

dt  95

0

(4.48)

Therefore the result can be also interpreted by saying that the classical lagrangian gets modied by quantum eects as follows L ! L ; 2i (0) log f (q) (4.49) The reason why we say that this is a quantum modication is because the second term must be proportional to /h, as it follows from dimensional arguments. An interesting question is how one gets the classical limit from this formulation. Let us recall that the amplitude for a single path is exp(iS (C )=/h) (where we have put back the factor h/ ). To nd the total amplitude we must sum over all the possible paths C . Since the amplitudes are phase factors the will cancel except for the paths which are very close to a stationary points of S . Around this point the phases will add coherently, whereas they will cancel going away from this point due to the fast variation of the phase. Roughly we can expect that the coherence eect will take place for those paths such that their phase diers from the stationary value, S (Cclass)=/h, of the order of one. For a macroscopical particle (m  1 gr and times of about 1 sec) the typical action is about 1027 /h, and the amplitude gets contribution only from the classical path. For instance, take a free particle starting from the origin at t = 0 and arriving at x0 at the time t0 . The classical path (the one making S stationary) is xclass(t) = x0 tt (4.50) 0 The corresponding value for S is

Z t0 1 x2 1 (4.51) S (xclass) = 2 m q_2 = 2 m t 0 0 0 Let us now consider another path joining the origin with x0 as, for instance, 2 t x(t) = x0 t2 0

The corresponding action is and we get

(4.52)

2 S (x) = 32 m xt 0 0

(4.53)

2 1 x S = S (x) ; S (xclass) = 6 m t 0 (4.54) 0 Let us take t0 = 1 sec and x0 = 1 cm. We get S = m=6. For a macroscopic particle (say m = 1 gr) and recalling that h/  10;27 ergsec, we get S = 1:6  1026/h and this non-classical path can be safely ignored. However, for an electron m  10;27 gr, and S = /h=6. It follows that in the microscopic world, many paths may contribute to the amplitude.

96

4.4 The free particle We shall now consider the case of the one-dimensional free motion. In this case we can easily make the integrations going back to the original denition (eq. (4.27). The expression we wish to evaluate is

X m((qk+1 ; qk )=)2=2 Z r m n nY ;1 ! i dqk e k=0 hq0 t0jq ti = lim n;1

2i 

n!1

k=1

since the lagrangian is

(4.55)

L = 21 mq_2 (4.56) The integrations can be done immediately starting from the gaussian integral r 2 Z1 2 + bx ax dxe = ; a e;b =4a Re a  0 (4.57) ;1 which allows us to evaluate the expression r

Z 2 + b(x ; x)2 2 a ( x ; x ) 1 2 dxe = ; a + b eab(x1 ; x2 ) =(a + b) (4.58) let us start by integrating q1 r Z 2 2 im(q2 ; q0 )2=(2(2) e dq1eim(q2 ; q1) + (q1 ; q0 ) ]=(2) = i  m 1  

i m 2 eim(q2 ; q0)2 =(2(2) = 2m (4.59) 2 i(2) Multiplying by the exponential depending on q2 and integrating over this variable we get  m  21 Z 2 i im(q3 ; q2 )2 + (q2 ; q0)2 =2]=(2) dq 2e m 2 i(2)  m  21  2 i(2)  12 2

i im(q3 ; q0 )2=(2(3)) = m 2 i(2) e 3m  2 i  23  m  12 im(q3 ; q0 )2 =(2(3)) = e (4.60) m 2 i(3) By repeating this procedure, after n ; 1 steps we nd

X ! i m((qk+1 ; qk )=)2 =2 Z nY ;1 n;1

dqk e

=

k=0

 2 ik=1 n  m  2 eim(qn ; q0 ) =(2(n)) 1 2

2

m

2 i(n)

97

(4.61)

The rst factor in the right hand side of this equation cancels out an analogous factor in the integration measure (see eq. (4.55))





1

2 m im(q0 ; q)2=(2(t0 ; t)) e (4.62) 2 i(t0 ; t) This expression can be generalized to a free system with N degrees of freedom

hq0 t0jq ti =



hqi0  t0jqi ti = 2 i(m t0 ; t)

 N im 2

e

N X i=1

(qi0 ; qi )2=(2(t0 ; t))



i = 1     N (4.63)

Notice also that

lim hq0 t0jq ti = (q0 ; q) (4.64) The classical path connecting (q t) to (q0 t0) for the free particle is given by t (q0 ; q) (4.65) qcl( ) = q + t0 ; ;t The corresponding classical action is t !t 0

Ztm 0 ; q )2 Z t m ( q m (q0 ; q)2 2 Scl = q _cld = d = 2 2 (t0 ; t)2 2 (t0 ; t) 0

0

t

t

Therefore our result can be written as





(4.66)

1

2 iScl hq0 t0 jq ti = 2 i(m (4.67) t0 ; t) e We shall see that for all the lagrangians quadratic in the coordinates an in the momenta this result, that is that the amplitude is proportional to the exponential of iS with S the action evaluated along the classical path, is generally true. Of course, one could formulate quantum mechanics directly in terms of the path integral. It is not dicult to show that it is possible to recover all the results of the ordinary formulation and prove that one has complete equivalence. However we will not do it here, but we will limit ourselves to notice that the formulations agree in the free case. we take a xed initial point of propagation, for instance (q t) = (0 0). Then we dene the wave function as

(q0 t0) = hq0 t0j0 0i =

Z q t 0

00

0

D(q(t))eiS

(4.68)

For the free particle we get

h m i i mq2 (q t) = 2 it e 2t 1 2

98

(4.69)

This is indeed the wave function for a free particle in conguration space, as it can be checked by going in the momentum space h m i 12 Z ;ipq i mq2 h m i 12  2 it  12 ;i p2 t ;iEt 2m = e (p t) = 2 it dq e e 2t = 2 it (4.70) m e where E = p2 =2m.

4.5 The case of a quadratic action In this Section we will consider the case of a quadratic lagrangian for a single degree of freedom (it can be easily extended to many degrees of freedom) L = 21 mq_2 + b( )qq _ ; 12 c( )q2 + d( )q_ ; e( )q ; f ( ) (4.71) To evaluate the integral (4.29) let us start by the Euler-Lagrange equations associated to eq. (4.71) d mq_ + b( )q + d( )] ; b( )q_ ; c( )q ; e( )] = 0 (4.72) d that is mq + b_ q + d_ + cq + e = 0 (4.73) Let us denote by qcl( ) the solution of the classical equations of motion satisfying the boundary conditions

qcl(t) = q qcl(t0 ) = q0 (4.74) We then perform the change of variables ( ) = q( ) ; qcl( ) (4.75) which in terms of our original denition (4.29) means k = qk ; qclk  k = 1     n ; 1 (4.76) Since this is a translation the functional measure remains invariant. The new variable ( ) satises the boundary condition (t) = (t0) = 0 (4.77) The lagrangian in terms of the new variables is given by _ ; 1 c2 L(q) = L(qcl + ) = L(qcl ) + 21 m_ 2 + b 2 + d_ ; e + mq_cl _ + bq_cl  + bq _ cl ; cqcl (4.78) 99

Inside the action we can integrate by parts the terms containing both qcl and . By using the boundary conditions for  we get h i L(q) = L(qcl ) + 21 m_ 2 + b _ ; 1 c2 + d_ ; e ; mqcl + b_ qcl + cqcl  (4.79) 2 Recalling the equations of motion for qcl and performing a further integration by parts we nd h i _ ; 12 c2 + d_ ; e ; ;d_ ; e  L(q) = L(qcl) + 21 m_ 2 + b = L(qcl) + 1 m_ 2 + b _ ; 1 c2 (4.80) 2 2 Therefore we can write the path integral as follows Zt 1 2 + b _ ; 1 c 2 )d Z 0t m  _ i ( 0 2 (4.81) hq0 t0jq ti = eiScl(q q ) D(( ))e t 2 0

0

0t

Notice that here all the dependence on the variables q and q0 is in the classical action. In fact the path integral which remains to be evaluated depends only on t and t0 due to the boundary conditions satised by . For the future purposes of extension of this approach to eld theory, this term turns out to be irrelevant. However it is important for systems with a nite number of degrees of freedom and therefore we will give here an idea how one can evaluate it in general. First, let us notice that the term b_ is a trivial one, since we can absorb it into a re-denition of c( ), through an integration by part 1  1 d b_ = d 2 b2 ; 2 b_ 2 (4.82) and dening c~ = c + b_ (4.83) In this way we get   Z t 1 n;1  X 1 m 1 2 2 2 2 i (k+1 ; k ) ; 2 c~k k i !1 2 m_ ; 2 c~ d = nlim t k=0 2  n;1  X ;  im i 2 2 = nlim 2k ; 2k k+1 ; 2 c~k k !1 2  k=1 T  = nlim i (4.84) !1 where we have used the boundary conditions on  (0 = n = 0) and we have introduced the vector 2 3 66 12 77  = 66  77 (4.85) 0

4  5 n;1

100

and the matrix 22 66 ;1 6  = 2m 66 0 64 0 0

;1 0 2 ;1 ;1 2   0 0

    0  0 

Therefore

G(t0 t)  =

0 0 0

0

;1 2

 ;1

Z 0t

0

3 2 c~ 0    0 77 66 01 c~2    0 77 ;  66         77 2 66       4 0 0    c~n;2 2 ;1 5

0 0 0

3 77  77  77 0 5

0 0 0

D(( ))e

i

Zt

0 0 

0

t

0

0 0

c~n;1 (4.86)

( 21 m_ 2 + b_ ; 12 c2)d

 m  n Z nY;1 ! iT  dk e lim n!1 2 i 0t

(4.87)

2

k=1

Since  is a symmetric matrix, it can be diagonalized by an orthogonal transformation  = U T D U (4.88) The eigenvectors of ,  = U, are real and the Jacobian of the transformation is one, since detjU j = 1. We get Z nY Z nY;1 ! T n 1 ;1 ! T nY ;1 r 2



i  i   D p dk e = dk e = = det(;i) k=1 k=1 k=1 (;ik ) (4.89) We get m1  12  m n n;1  12 1 0 = nlim (4.90) G(t  t) = nlim !1 2 i  ( 2mi )n;1 det(;i ) !1 2 i det(;i ) Let us dene "  n;1 # 2 i det(;i) (4.91) f (t0 t) = nlim  m !1 ;

then

r

m 2 if (t0 t) In order to evaluate f (t0 t), let us dene (for j = 1     n ; 1) 2 2 ;1 0    0 0 0 3 2 c~ 0 0 66 ;1 2 ;1    0 0 0 77 66 01 c~2 0 6 7 2 66 0 0 c~3 Pj = 66 0 ;1 2    0 0 0 77 ; m 64 0 0 0    ;1 2 ;1 75 664 0 0 0 0 0 0    0 ;1 2 0 0 0 G(t0 t) =

101

(4.92)

 0  0  0      c~j;1  0

3

0 0 77 0 77

 77

05 c~j (4.93)

and Clearly

pj = detjPj j

 2i n;1

det(;i) = pn;1 m For the rst values of j , pj is given by 2 c~ p1 = 2 ; m 1  2  p2 = 2 ; m c~2 p1 ; 1  2  p3 = 2 ; m c~3 p2 ; p1 Therefore we nd the recurrency relation  2  pj+1 = 2 ; m c~j+1 pj ; pj;1 j = 1     n ; 1 where we dene p0 = 1. It is convenient to put this expression in the form pj+1 ; 2pj + pj;1 = ; c~j+1 pj 2 m It is also convenient to introduce the nite dierence operator pj = pj+1; pj a time  = t + j, and a function ( ) such that lim p = ( ) !0 j in the limit in which we keep  xed. It follows d( ) lim  p = j !0 d The equation (4.98) can be written as 2 pj;1 = ; c~j+1pj m therefore we get the following dierential equation for ( )

d2( ) = ; c~( ) ( ) d 2 m 102

(4.94) (4.95)

(4.96)

(4.97)

(4.98) (4.99) (4.100) (4.101) (4.102)

(4.103)

Furthermore, ( ) satises the following boundary conditions

(t) = lim p = 0 !0 0 d( ) j = lim  p1 ; p0 = lim 2 ; 2 c~ ; 1 = 1 !0 d  =t !0  m1 Recalling eq. (4.91) we have f (t0 t) = lim p = (t0) !0 n;1 Therefore, for quadratic lagrangian we get the general result r m 0 0 hq  t jq ti = 2 if (t0 t) eiScl

(4.104) (4.105) (4.106)

with f (t0  t) = (t0).

Examples: The free particle. We need only to evaluate f (t0 t). The function ( ) satises

with solution

d2( ) = 0 (t) = 0 d( ) j = 1 d 2 d  =t

(4.107)

( ) =  ; t

(4.108)

from which

f (t0  t) = t0 ; t in agreement with the direct calculation.

(4.109)

The harmonic oscillator. In this case b = 0 and c = m!2. Therefore d2 ( ) = ;!2( ) (t) = 0 d( ) j = 1 d 2 d  =t from which ( ) = !1 sin !( ; t) by putting T = t0 ; t: f (t0 t) = !1 sin !T 103

(4.110) (4.111) (4.112)

To evaluate the classical action we need to solve the classical equations of motion

mq + m!2q = 0 with boundary conditions

q(t) = q q(t0) = q0 The general solution is given by q( ) = A sin !( ; t) + B cos !( ; t)

(4.113) (4.114) (4.115)

From the boundary conditions 0 !T  B = q A = q ;sinq cos (4.116) !T and substituting inside the lagrangian

L = 12 mq_2 ; 21 m!2 q2 = 12 m!2 (A2 ; B 2 ) cos 2!( ; t) ; 2AB sin 2!( ; t) (4.117) Recalling the following integrals

Zt

0

t

d cos 2!( ; t) = sin22!!T 

we get

Zt

0

t

2!T d sin 2!( ; t) = 1 ; cos 2!

(q2 + q02) cos !T ; 2qq0

Scl = 21 sinm! !T

and

hq0 t0jq ti =

r m! i m! (q2 + q02) cos !T ; 2qq0

2 sin !T 2 i sin !T e

(4.118) (4.119)

(4.120)

In the limit ! ! 0 we get back correctly the free case.

4.6 Functional formalism The most natural mathematical setting to deal with the path integral is the theory of functionals. The concept of functional is nothing but an extension of the concept of function. Recall that a function is simply a mapping from a manifold M to R1 , that a law which associates a number to each point of the manifold. A real functional is dened as an application from a space of functions (and therefore an 1-dimensional space) to R1. There are many spaces of functions. For instance, the space of the square integrable real functions of a real variable. Therefore a real 104

functional associate a real number to a real function. If we denote the space of functions by L, a real functional is the mapping

F:

L ! R1

(4.121)

We will make use of the notation

F ] F F ] 2L (4.122) to denote the functional of dened in eq. (4.121). Examples of functionals are  L=L1;1 +1]= the space of the integrable functions in R1 with respect to the measure w(x)dx: Z +1 F1] = dxw(x)(x) (4.123) ;1

 L=L2;1 +1]= space of the square integrable functions in R1: ; 12 F2 ] = e

Z

dx2 (x)

(4.124)

 L=C = space of the continuous functions: F3] = (x0 ) (4.125) F3 is the functional that associates to each continuous function its value at x0. The next step is to dene the derivative of a functional. We need to understand how a functional varies when we vary the function. A simple way of understand this point is to consider some path on the denition manifold of the functions. For instance consider functions from R1 to R1 , and take an interval (t t0) of R1 made of many innitesimal pieces, as we did for dening the path integral. Then we can approximate a function (t) with a convenient limit of its discrete approximation (1  2     n) obtained by evaluating the functions at the discrete points in which the interval has been divided. In this approximation we can think of the functional F as an application from Rn ! R1 . That is F ] ! F (1      n)  F (i) i = 1     n (4.126) Then we can consider the variation of F (i) when we vary the quantities i F (i + i) ; F (i) = Let us put

n X @F (i) j =1

@F Ki = 1 @

i

105

@j j

(4.127) (4.128)

where is the innitesimal interval where we evaluate the discrete apoproximation to the function. Then, for  ! 0

F (i + i ) ; F (i) = that is

F ] =

Z

n X j =1

Kj j  !

Z

Kdx

K (x)(x)dx

(4.129) (4.130)

Since (x) is the variation of the function evaluated at x, we will dene the functional derivative as Z F (4.131) F ] = (x) (x)dx

In other words. to evaluate the functional derivative of a functional we rst evaluate its innitesimal variation and then we can get the functional derivative by comparison with the previous formula. Looking at the eq. (4.128) we get F ] = lim 1 @F (i ) !0  @i (x) x1  i = 1     n (4.132) i = (xi) xi = x1 + (i ; 1)  = xnn ; ;1 let us now evaluate the functional derivatives of the previous functionals:

F1] = giving Then

Z

w(x)(x)dx

F1] = w(x) (x)

Z 1 ; 2 dx2 (x) Z (; (x)(x)) F2 ] = e

and therefore

F2 ] = ;(x)F ] 2 (x) In order to evaluate the derivative of F3 let us write F3   ] =

Z

(x)(x ; x0 )dx

In this way F3 looks formally as a functional of type F1 , and F3 ] = (x ; x ) 0 (x) 106

(4.133) (4.134) (4.135) (4.136) (4.137) (4.138)

Let us consider also

then

; 21 F4] = e

Z

Z 1 ; dxdyK (x y)(x)(y) F4] = e 2 dxdyK (x y)(x)(y)

(4.139)

Z

(; dxdyK (x y)(x)(y))

(4.140)

where we have used the symmetry of the integral with respect to x $ y. We obtain F4] = ; Z dyK (x y)(y)F ] (4.141) 4 (x) A further example is the action functional. In fact the action is a functional of the path Zt S q] = L(q q_)dt (4.142) 0

By varying it

t

Z t  @L







Z t @L d @L @L (4.143) S q] = @q q + @ q_ q_ dt = t @q ; dt @ q_ qdt t Since we consider variations with respect to functions with xed boundary conditions, we have q(t) = q(t0) = 0. We get S = @L ; d @L (4.144) q(t) @q(t) dt @ q_(t) Starting from the functional derivative one can generalize the Taylor expansion to the functional case. This generalization is called the Volterra expansion of a functional  Z 1 k F  ] X  1 F 0 + 1 ] = k! dx1    dxk (x )    (x )  1(x1 )    1 (xk ) (4.145) 1 k =0 k=0 0

0

All these denitions are easily generalized to the case of  a function from Rm ! Rn, or from a manifold M to a manifold N .

4.7 General properties of the path integral In this Section we would like to stress some important properties of the path integral, namely 107

 Invariance of the functional measure under translations - It follows immediately from the denition that q( ) ! q( ) + ( ) p( ) ! p( ) + ( )

(t) = (t0) = 0 for any (t) (t0 )

(4.146)

 Factorization of the path integral. From the very denition of the path it

follows that in order to evaluate the amplitude for going from q at time t to q0 at time t0 we can rst go to q" to a time t  t"  t0, sum up over all the paths fromq a q" and from q" to q0, and eventually integrating over all the possible intermediate points q" (see Fig. 4.7.1)).

. t'

t"

t

q

q'

q"

Fig. 4.7.1 - The factorization of the path integral. In equations we get

hq0 t0jq ti = =

Z qt

0 0

qt

Z

 =

Z

d(q( ))d(p( ))e

dq"

Z q"t"

Z q t 0

qt 0

q"t"

i

Z t"

Ld + i

t

d(q( ))d(p( ))e

d(q( ))d(p( ))e

i

i

Zt

0

t"

Z t" t

Zt

0

t"

Ld

Ld

Ld

dq"hq0 t0jq" t"ihq" t"jq ti

(4.147)

as it follows from the factorization properties of the measure

d(q( )) =

Y

t<
dq( ) = dq" 0

108

Y

t<
dq( )

Y

t"<
0

dq( )

d(p( )) =

Y t t"

dp( ) =

Y t t"

dp( )

Y

dp( )

t" t

(4.148)

0

It is clear that the factorization property is equivalent to the completeness of the states. Using these properties it is possible to show that the wave function, as dened in (4.68) satises the Schrodinger equation. The next important property of the path integral is that it gives an easy way of evaluating expectation values. To this end we will consider expressions of the type (which will be called the average value of F )

Z q t 0

qt

0

d(q( ))d(p( ))F q( ) p( )]eiS

(4.149)

where F is an arbitrary functional of q, p and of their time derivatives. Let us start with Z q t d(q( ))d(p( ))eiS q(t") t  t"  t0 (4.150) 0

0

qt

We begin by the observation that

Z q t

d(q( ))d(p( ))eiS q(t0) = q0

Z q t

d(q( ))d(p( ))eiS = q0hq0 t0jq ti qt qt (4.151) since q(t) satises the boundary conditions q(t) = q, q(t0 ) = q0 . By using the factorization we get 0

0

Z q t 0

0

qt

d(q( ))d(p( ))eiS q(t") =

0

Z

 =

0

dq"

Z q t 0

0

Z q"t"

Z q"t" qt

d(q( ))d(p( ))eiS q(t")

d(q( ))d(p( ))eiS

dq"hq0 t0jq" t"iq"hq" t"jq ti

(4.152)

Since jq ti is an eigenstate of q(t) it follows

Z q t 0

qt

0

d(q( ))d(p( ))eiS q(t") =

Z

dq"hq0 t0jq(t")jq" t"ihq" t"jq ti

= hq0 t0jq(t")jq ti

(4.153)

Therefore the average value of q(t") evaluated with exp(iS ) coincides with the expectation value of the operator q(t"). Now let us study the average value of q(t1 )q(t2) con t  t1  t2  t0 . By proceeding as before we will get the expectation value of q(t1 )q(t2) if t1  t2 , or q(t2)q(t1 ) if 109

t2  t1. In terms of the T -product we get

Z q t

d(q( ))d(p( ))eiS q(t1 )q(t2) = hq0 t0jT (q(t1)q(t2 )jq ti

(4.154)

d(q( ))d(p( ))eiS q(t1 )    q(tn) = hq0 t0jT (q(t1)    q(tn))jq ti

(4.155)

0

0

qt

Analogously

Z q t 0

qt

0

Let us now consider a functional of q(t)such to admit an expansion in a Volterra's series

Z q t 0

0

qt

d(q( ))d(p( ))F q]eiS



Z k F q]  1 = dt 1    dtk q(t1 )    q(tk ) q=0 k=0 k! 1 X



Z q t 0

qt

0

d(q( ))d(p( ))eiS q(t1 )    q(tk )



1 X 1Z

k F q]  hq0 t0jT (q(t )    q(t ))jq ti = dt    dt 1 k 1 k q(t1 )    q(tk ) q=0 k=0 k!  hq0 t0jT (F q])jq ti (4.156) where

T (F q]) 

Z 1 X 1 k=0 k!



k  dt1    dtk q(t ) F qq] (t )  T (q(t1 )    q(tk )) 1 k q=0

(4.157)

If the functional depends also on the time derivatives a certain caution is necessary. For instance, let us study the average value of q(t1)q_(t2) (t  t1 t2  t0):

Z q t 0

qt

= lim !0

0

d(q( ))d(p( ))eiS q(t1)q_(t2 )

Z q t 0

0

d(q( ))d(p( ))eiS q(t1) q(t2 + ) ; q(t2)

    1 0 0 hq  t jT q(t1 )q(t2 + ) ; T q(t1)q(t2 ) jq ti = lim !0    d 0 0 = dt2 hq  t jT q(t1)q(t2 ) jq ti Let us dene an ordered T? -product (or covariant T-product) as     T ? q(t1 )q_ (t2 ) = dtd T q(t1 )q(t2 ) 2 qt

110

(4.158) (4.159)

Then it is not dicult to show that for an arbitrary functional of q(t) e q_(t) one has

Z q t 0

qt

0

d(q( ))d(p( ))F q q_]eiS = hq0 t0 jT ?(F q q_ ])jq ti

(4.160)

The proof can be done by expanding F q q_] in a Volterra's series of q and q_, integrating by parts and using (4.156). At the end one has to integrate again by parts. The T? product is obtained as in (4.158), The power of this formulation of quantum mechanics comes about when we want to get the fundamental properties of quantum mechanics as the equations of motion for the operators, the commutation relations and so on so forth. The fundamental relation follows from the following property of the path integral

Z q t 0

qt

0

h i d(q( ))d(p( )) q( ) F q( ) p( )]eiS = 0

(4.161)

This is a simple consequence of the denition of the path integral and of the functional derivative

2 3 n;1 X i L(qk  pk )77 Z nY;1 ! nY;1 dpk ! 1 @ 66 6 77 = 0 F (q1      qn;1)e k=0 lim dqk 6 n!1 2

 @q k4 5 k=1 k=0

(4.162)

(of course we are assuming that the integrand goes to zero for large values of the qi 's. To this end one has to dene the integral in an appropriate way as it will be shown later on). On the other hand the previous equation follows from the invariance under translation of the functional measure. In fact, let us introduce the following notation dqp = d(q( ))d(p( )) (4.163) It follows for an innitesimal function  such that (t) = (t0) = 0

Z q t

Z q t

Z q t

Z q t

Z

q] dt (t) A q(t) qt qt qt qt (4.164) Proving the validity of eq. (4.161). By performing explicitly the functional derivative in eq. (4.161)we get a relation which is the basis of the Schwinger's formulation of quantum mechanics, that is    F q]  S 0 0 ? 0 0 ? ihq  t jT F q] q( ) jq ti = ;hq  t jT q( ) jq ti (4.165) The importance of this relation follows from the fact that we can use this relation to get all the properties of quantum mechanics by selecting the functional F in a 0

0

dqp Aq] =

0

0

dqp Aq + ] =

0

111

0

dqp Aq]+

0

0

dqp

convenient way. For instance, by choosing F q] = 1 we get  S  0 0 ? hq  t jT q(t ) jq ti = 0 1 that is the quantum equations of motion. With the choice F q] = q we get   S 0 0 ? hq  t jT q(t1 ) q(t ) jq ti = ihq0 t0jq ti(t1 ; t2) 2 and since this relation holds for arbitrary states   S ? T q(t1 ) q(t ) = i(t1 ; t2 ) 2 For simplicity, let us consider the following action  Z t 1 2 S= 2 mq_ ; V (q) dt t then   S = ; mq(t ) + @V (q(t2 )) 2 q(t2) @q(t2 ) from which    @V ( q ) ? T q(t1 ) mq(t2 ) + @ q(t ) = ;i(t1 ; t2 ) 0

2

(4.166) (4.167) (4.168) (4.169) (4.170) (4.171)

By using the denition of the T ?-product we get 2 T ?(q(t1 )q(t2) = dtd 2 T (q(t1 )q(t2 )) 2 2 d = dt2 (t1 ; t2 )q(t1)q(t2 ) + (t2 ; t1)q(t2 )q(t1 )] 2 h d = dt2 ; (t1 ; t2)q(t1 )q(t2 ) + (t1 ; t2 )q(t2)q(it1 ) + theta(t1 ; t2)q(t1 )q_ (t2 ) + (t2 ; t1 )q_ (t2 )q(t1) = (t1 ; t2 )q(t1 )q_ (t2) + (t1 ; t2 )q_ (t2)q(t1 ) + T (q(t1)q(t2)) = (t1 ; t2 )q(t1 ) q_ (t1)] + T (q(t1 )q(t2 )) (4.172) Inserting this inside the eq. (4.171    @V ;m(t1 ; t2 )q(t1 ) q_ (t2)] + T q(t1 ) mq(t2 ) + @ q(t ) = ;i(t1 ; t2) (4.173) 2 and using the equations of motion, we get q(t) mq_ (t)] = i (4.174) Therefore we get the canonical commutator q(t) p(t)] = i (4.175) 112

4.8 The generating functional of the Green's functions In eld theory the relevant amplitudes are the ones evaluated between t = ;1 and t = +1. We have seen that using the reduction formulas we are lead to evaluate the vacuum expectation value of T -products of local operators. Therefore we will consider matrix elements of the type (limiting ourselves, for the moment being, to a single degree of freedom)

h0jT (q(t1)    q(tn))j0i

(4.176)

Here j0i is the ground state of system. Notice that more generally one can consider the matrix element

hq0 t0jT (q(t

1 )    q(tn ))jq ti =

Z q t 0

0

qt

d(q( ))d(p( ))eiS q(t1 )    q(tn)

(4.177)

These matrix elements can be derived in a more compact form, by introducing the generating functional

hq0 t0jq ti

J

where

=

SJ =

Z q t 0

0

qt

Zt

0

t

d(q( ))d(p( ))eiSJ

(4.178)

d pq_ ; H + Jq]

(4.179)

and J ( ) is an arbitrary external source. By dierentiating this expression with respect to the external source and evaluating the derivatives at J = 0 we get

hq0 t0 jT (q(t

1 )    q(tn ))jq ti =

 1 n



0  t0 jq ti  h q J J =0 J (t1 )    J (tn)

n

(4.180) i Now we want to see how it is possible to bring back the evaluation of the matrix element in (4.176) to the knowledge of the generating functional. Let us introduce the wave function of the ground state #0 (q t) = e;iE0 thqj0i = hqje;iHtj0i = hq tj0i (4.181) then

Z

h0jT (q(t1)    q(tn ))j0i = dq0dqh0jq0 t0ihq0 t0jT (q(t1)    q(tn ))jq tihq tj0i =

Z

dq0dq#?0 (q0 t0)hq0 t0jT (q(t1)    q(tn))jq ti#0(q t) 113

(4.182)

Next we dene the functional Z J ], the generating functional of the vacuum amplitudes, that is

Z J ] =

Z

dq0dq#?0(q0 t0)hq0 t0jq tiJ #0 (q t) = h0j0iJ

It follows

 1 n

(4.183)



 Z  J ] (4.184) J (t1)    J (tn) J =0 We want to show that it is possible evaluate Z J ] as a path integral with arbitrary boundary conditions on q and q0. That is we want to prove the following relation iE0 (t0 ; t) 0 0 e hq  t jq tiJ (4.185) Z J ] = t!+i1lim  t !;i1 #?0 (q )#0 (q 0 ) wit q and q0 arbitrary and with the ground state wave functions taken at t = 0. To this end let us consider an external source J (t) vanishing outside the interval (t00  t000 ) with t0 > t000 > t00 > t. Then, we can write n

h0jT (q(t1)    q(tn))j0i = i

0

hq0 t0jq ti

J

=

Z

dq00dq000hq0 t0jq000  t000ihq000 t000jq00 t00 iJ hq00 t00 jq ti

(4.186)

with X 0 000 0 000 hq0 t0jq000 t000 i = hq0je;iH (t ; t )jq000i = #n (q0)#?n(q000)e;iEn (t ; t ) (4.187) n

Since E0 is the lowest eigenvalue we get X 0 ? 000 iEnt000 ;i(En ; E0)t0 0 lim eiE0 t hq0 t0jq000 t000i = lim #n (q )#n(q )e e = t !;i1

t !;i1

0

0

= Analogously

n 000 #0 (q0)#?0 (q000)eiE0 t

hq00 t00jq ti =

and

X n

00 #n (q00)#?n(q)e;iEn (t ; t)

lim e;iE0 t hq00 t00jq ti = t!lim t!+i1 +i1 = #0

And nally we get the result

= #?0(q000  t000)#0 (q0)

X n

(4.188) (4.189)

00 #n(q00 )#?n(q)e;iEn t ei(En ; E0 )t =

00 (q00)#? (q)e;iE0 t 0

= #0(q00  t00)#?0 (q)

(4.190)

iE0 (t0 ; t)hq0 t0 jq tiJ im e t ! + i 1 t !;1 Z = dq00dq000#?0 (q)#0 (q0)#?0 (q000 t000)hq000  t000jq00 t00iJ #0 (q00 t00) = = #?0 (q)#0(q0 )Z J ] (4.191) 0

114

Let us notice again that the values q and q0 are completely arbitrary, and furthermore that the coecient in front of the matrix element is J -independent. Therefore it is physically irrelevant. In fact, the relevant physical quantities are the ratios

 n h0jT (q(t1)    q(tn))j0i =  1 n 1 h0j0i i Z 0] J (t1 )    J (tn) Z J ]J =0

(4.192)

where the J -independent factor cancels out. Then we can write

Z q t 0

Z J ] = t!+i1lim N t !;i1

0

D(q( ))e

qt

0

i

Zt

0

t

(L + Jq)d

(4.193)

where N is a J -independent normalization factor. This expression suggests the denition of an euclidean generating functional ZE J ]. This is obtained by introducing an euclidean time  = it (Wick's rotation)

ZE J ] =  !+1lim !;1 N 0

Consider for instance the case

Z q  0

q

Z

0

D(q( ))e

0



(L(q i dq ) + Jq)d d

2 L = 12 m( dq dt ) ; V (q)

(4.194)

(4.195)

 

from which

dq ) = ; 1 m dq 2 ; V (q) L(q i d (4.196) 2 d It is then convenient to dene the euclidean lagrangian and euclidean action

 2

dq LE = 12 m d

Z

SE =

0



+ V (q)

(4.197)

LE d

(4.198)

Notice that the euclidean lagrangian coincides formally with the hamiltonian. More generally dq ) (4.199) LE = ;L(q i d Therefore

ZE J ] =  !+1lim !;1 N 0

Z q  0

q

0

D(q( ))e 115

;SE +

Z 

0

Jqd

(4.200)

Since e;SE is a positive denite functional, the path integral turns out to be well dened and convergent if SE is positive denite.the vacuum expectation values of the operators q(t) can be evaluated by using ZE and continuing the result for real times nZ J ]  = (i)n 1 nZE J ]  1 (4.201)  Z 0] J (t1 )    J (tn ) J =0 ZE 0] J (1 )    J (n) J =0i=iti Since the values of q and q0 are arbitrary a standard choice is to set them at zero.

4.9 The Green's functions for the harmonic oscillator It is interesting to evaluate the generating functional for the harmonic oscillator starting from both the equations (4.185) and (4.200). Let us start from eq. (4.183), where we do not need to specify the boundary conditions Z Z J ] = D(q( ))#?0(q0 t0)eiSJ #0 (q t)dqdq0 (4.202) with

 m!  ; 1 m!q2 ; i !t # (q t) = e 2 e 2 1 4

0

and

 m!  41 ; 1 m!q02 i !t0 ? 0 0 2 #0(q  t ) = e2

e

Z t 1



(4.203) (4.204)

1 m!2q2 + Jq d SJ = (4.205) 2 t 2 In this calculation we are not interested in the normalization factors but only on the J dependence. Then, let us consider the argument of the exponential in eq. (4.9 and, for the moment being, let us neglect the time dependent terms from the oscillator wave functions  Z t 1 1 1 2 2 0 2 2 2 mq_ ; 2 m! q + Jq d (4.206) arg = ; 2 m!(q + q ) + i t 2 let us perform the change of variable q( ) = x( ) + x0 ( ) (4.207) where we will try to arrange x0 ( ) in such a way to extract all the dependence from the source J out of the integral. We have arg = ; 1 m!x(t)2 + x2 (t0 )] ; 1 m!x0 (t)2 + x20 (t0 )] ; m!x(t)x0 (t) + x(t0 )x0 (t0)] 2 2 0

mq_2 ;

0

116

+ i + i

Z t 1 0

Zt t 2 0

t

mx_ 2 ;







1 m!2x2 d + i Z t 1 mx_ 2 ; 1 m!2x2 + Jx d 0 0 2 0 2 t 2 0



mx_ x_ 0 ; m!2xx0 + Jx d

(4.208)

Integrating by parts the last integral can be written as

imxx_ 0 ]tt

0

;i

Zt

0

t

mx0 + m!2x0 ; J ]xd

(4.209)

and choosing x0 as a solution of the wave equation in presence of the source J x0 + !2x0 = m1 J (4.210) we get imx(t0 )x_ 0 (t0 ) ; x(t)x_ 0 (t)] (4.211) Let us take the following boundary conditions on x0 :

ix_ 0 (t) = ;!x0(t)

ix_ 0 (t0 ) = !x0(t0 )

(4.212)

In this way, the term (4.211) cancels the mixed terms arg = ; 12 m!x2 (t) + x2 (t0)] ; 12 m!x20(t) + x20 (t0 )] Zt 1 + i  2 mx_ 2 ; 21 m!2x2 ]d 0

Zt t

 1 mx_ 20 ; 1 m!2x20 + Jx0 ]d 2 t 2 Integrating by parts the last term we get + i

0

(4.213)

Zt

 1 mx_ 20 ; 1 m!2x20 + Jx0 ]d = 2 Z tt 21 = ; 2 mx0 (x0 + !2x0 ; m1 J ) + 12 Jx0 ]d + 2i mx0 x_ 0 ]tt = t Zt 1 i 2 0 2 = m!x0 (t ) + x0(t)] + Jx0 d (4.214) 2 2 t Keeping in mind the boundary conditions (4.212), the term coming from the integration by parts cancels the second term in eq. (4.213), and therefore

i

0

0

0

0

Zt 1 Zt 1 1 i 2 2 0 2 2 2 arg = ; 2 m!x (t) + x (t )] + i  2 mx_ ; 2 m! x ]d + 2 Jx0 d (4.215) t t 0

0

117

The J dependence is now in the last term, and we can extract it from the integral. The rst two term give rise to the functional at J = 0. We obtain

i Z t J ( )x ( )d 0 Z 0] Z J ] = e 2 t 0

(4.216)

with x0( ) solution of the equations of motion (4.210) with boundary conditions (4.212). For t <  < t0 this solution can be written as

 d2

d 2

and

+ !2



0

( ; s)J (s)ds

(4.217)

( ; s) = ; i ( ; s) m

(4.218)

x0 ( ) = i with

Zt t

_ t0 ; s) = !(t0 ; s) i( _ t ; s) = ;!(t ; s) i( These equations are easily solved fort t0 > s and t < s ( ) = Ae;i!   >0 ( ) = Be+i!   <0 Therefore we have

( ) = A( )e;i! + B(; )e+i!

(4.219)

(4.220) (4.221)

The constants A e B are xed by the requirement that ( ) satises eq. (4.218). We have _  ) = (A ; B )( ) ; iA!( )e;i! + iB!(; )e+i! ( (4.222) and from which

( ) = (A ; B )_ ( ) ; i!(A + B )( ) ; !2( ) 1 A = 2!m

A = B

and nally

1 h( )e;i! + (; )e+i! i ( ) = 2!m The functional we were looking for is

Zt 1 ; 2 dsds0J (s)(s ; s0)J (s0 ) t Z 0] Z J ] = e

(4.223) (4.224) (4.225)

0

118

(4.226)

We see that (s ; s0) is the vacuum expectation value of the T -product of the position operators at the times s and s0 :

 2 0 (s ; s0) = ; 1  Z J ] 0  = h0jT (q(s)q(s ))j0i (4.227) Z 0] J (s)J (s ) J =0 h0j0i The analogue calculation in the euclidean version is simpler since the path integral is of the type already considered (quadratic action) (cl)

ZE J ]  e;SE

(4.228)

Here SE(cl) is th euclidean classical action. We have already noticed that here the boundary conditions are arbitrary and therefore we choose q( ) ! 0 for  ! 1. we have to solve the equations of motion in presence of the external source





 S ; Z +1 Jqd = ;mq ( ) + m!2q( ) ; J = 0 q( ) E ;1

that is

q ; !2q = ; m1 J Let us now dene the euclidean Green's function   d2 1 2 ; ! D E ( ) = ;  ( ) 2 d m with the boundary condition lim D ( ) = 0  !1 E

(4.229) (4.230) (4.231) (4.232)

then, the solution of the eq. (4.230) is

q ( ) =

Z +1 ;1

DE ( ; s)J (s)ds

(4.233)

We can evaluate DE ( ) starting from the Fourier transform

DE ( ) =

Z

de;i DE ( )

(4.234)

Substituting inside (4.231) we get

Z

from which

; de;i  2 + !2]DE ( ) = ; m1 ( )

(4.235)

1 1 DE ( ) = 2 m 2  + !2

(4.236)

119

Z

and

1 +1 d 1 e;i DE ( ) = 2 m (4.237)  2 + !2 ;1 To evaluate this integral we close the integral with the circle at the 1 in the lower  -plane for  > 0 or in the upper  -plane for  < 0. The result is 1 e;! (;2 i) = 1 e;! DE ( ) = 2 m (4.238) (;2i!) 2m! By doing the same calculation for  < 0 we nally get 1 e;!j j DE ( ) = 2m! (4.239) The classical action is given by

Z +1

Z +1 







Z

+1 SE ; Jqd = ; 21 m q ; !2q + m1 J q ; 21 Jq d = ; 12 Jqd ;1 ;1 ;1 (4.240) and therefore Z Z 1 +1 J (s)D (s ; s0)J (s0)dsds0 1 +1 Jqd E Z 0] = e 2 ;1 Z 0] (4.241) ZE J ] = e 2 ;1

It follows

1 2ZE J ]  = D (s ; s0 ) E ZE 0] J (s)J (s0 ) J =0 and using the eq. (4.201)

1 2 Z J ]  = ; 1 2ZE J ]  Z 0] J (t1)J (t2 ) J =0 ZE 0] J (1 )J (2 ) J =0i=iti

(4.242) (4.243)

we get

(t) = DE (it) (4.244) To compare these two expressions let us introduce the Fourier transform of (t). Since it satises eq. (4.218) we obtain (t) = with that is

Z

de;it ( )

(; 2 + !2)( ) = ; i 2 m i 1 ( ) = 2 m 2  ; !2 120

(4.245) (4.246) (4.247)

Whereas eq. (4.237) does not present any problem ( 2 + !2 > 0), in eq. (4.246) we need to dene the singularities present in the denominator. In fact, this expression has two poles at  = !, and therefore we need to specify how the integral is dened. In order to reproduce the euclidean result, it is necessary to specify the integration path as in Fig. 4.9.1. For instance, for t > 0 by closing the integral in the lower plane we get i (;2 i) 1 e;i!t = 1 e;i!t (4.248) 2 m 2! 2m! . Im ν



−ω

Re ν

Fig. 4.9.1 - The integration path for equation (4.245). We would get the same result by integrating along the real axis but translating the denominator of +i ( > 0) in such a way to move the pole at ! in the lower plane and the one at ;! in the upper plane. We get Z +1 e;it i d (4.249) (t) = lim !0+ 2 m ;1  2 ; ! 2 + i With this prescription we can safely rotate the integration path anti-clockwise by 900, since we do not encounter any singularity (see Fig. 4.9.2) Z +i1 e;it i (t) = 2 m d (4.250) ;i1  2 ; ! 2 We have omitted here the limit since the expression is now regular. Performing the change of variable  = i 0 Z +1 e+ 0t 1 0 (4.251) (t) = ; 2 m 2 ; ! 2 d = DE (it) 0 ;  ;1 Therefore we have shown how it works the analytic continuation implicit in eq. (4.244). By doing so we have also seen that the Feynman prescription (the +i term) is equivalent to the euclidean formulation. One could arrive to this connection also 121

. Im ν

− ω + iε ω − iε

Re ν

Fig. 4.9.2 - The Wick rotation in the  -plane. by noticing that the Feynman prescription is equivalent to substitute in the path integral the term Z i 2 ; 2 m! q2(t)dt e (4.252) with Z i 2 ; 2 m(! ; i) q2(t)dt e (4.253) Therefore the Feynman prescription is there to make the integral convergent. Z 1 ; 2 m q2(t)dt e (4.254) The same convergence is ensured by the euclidean formulation. Therefore we have shown that, in the case of quadratic actions, the path integral can be formulated as an euclidean integral, making it a well dened mathematical expression. This fact is enough to guarantee that all the manipulations that we make with the path integral in the perturbative expansion are well dened, since we will be expanding around the quadratic case.

122

Chapter 5 The path integral in eld theory 5.1 The path integral for a free scalar eld We will extend now the previous approach to the case of a scalar eld theory. We will show later how to extend the formulation to the case of Fermi elds. Let us start with a free scalar eld. We have denoted the quantum operator by (x). Here we will denote the classical eld by '(x). For a neutral particle this is a real function, and its dynamics is described by the lagrangian Z 1 Z t Z 1

(5.1) 4  2 2 S = d x 2 @ '@ ' ; m '  dt d3~x 2 @ '@  ' ; m2 '2 V t A free eld theory can be seen as a continuous collection of non-interacting harmonic oscillators. As it is well known this can be seen by looking for the normal modes. To this end, let us consider the Fourier transform of the eld Z 1 '(~x t) = (2 )3 d3~kei~k  ~x q(~k t) (5.2) By substituting inside eq. (5.1) we get Z Z d3~k d3~k0 1 S = 2 d4x (2 )3 (2 )3 q_(~k t)q_(~k0  t) V 0 + (~k  ~k0 ; m2)q(~k t)q(~k0 t)]ei~k  ~x + i~k  ~x Z t Z d3~k 1 = dt (2 )3 q_(~k t)q_(;~k t) ; (j~kj2 + m2 )q(~k t)q(;~k t)] (5.3) 2 t Since '(~x t) is a real eld q(~k t) = q(;~k t) (5.4) and Z t Z d3~k 1 h i 2 ~2 2 2 2 2 ~ ~ (5.5) S = dt (2 )3 2 jq_(k t)j ; !~k jq(k t)j  !~k = jkj + m 0

0

0

t

123

Therefore the system is equivalent to a continuous set of complex oscillators q(~k t) (or a pair of real oscillators)). The equations of motion are q(~k t) + !~k2q(~k t) = 0 (5.6) We are now in the position of evaluating the generating functional Z J ] by adding an external source term to the action  Z 1

 2 2 4 (5.7) S = d x 2 @ '@ ' ; m ' + J' V From the Fourier decomposition we get  Z t Z d3~k  1 h i 2 2 2 S = dt (2 )3 2 jq_(~k t)j ; !~k jq(~k t)j + J (;~k t)q(~k t) (5.8) t where we have also expanded J (x) in terms of its Fourier components. Let us separate q(~k t) and J (~k t) into their real and imaginary parts q(~k t) = x(~k t) + iy(~k t) J (~k t) = l(~k t) + im(~k t) (5.9) It follows from the reality conditions that the real parts are even functions of ~k whereas the imaginary parts are odd functions x(~k t) = x(;~k t) l(~k t) = l(;~k t) y(~k t) = ;y(;~k t) m(~k t) = ;m(;~k t) (5.10) Now we can use the result found in the previous Section for a single oscillator. it will be enough to take the mass equal to one and sum over all the degrees of freedom. In particular we get for the exponent in eq. (4.226) Zt 1 ; 2 dsds0J (s)(s ; s0)J (s0) ! t Zt Z d3~k h 1 0 ! ; 2 dsds (2 )3 l(~k s)(s ; s0 !~k )l(~k s0) t i + m(~k s)(s ; s0 !~k )m(~k s0) Zt Z d3~k 1 0 = ; dsds (2 )3 J (;~k s)(s ; s0 !~k )J (~k s0) (5.11) 2 t where we have made use of eq. (5.9). Going back to J (~x t), we get Z Z d3~k 1 4 4 ; 2 d xd yJ (x) (2 )3 (s ; s0 !~k )ei~k  (~x ; ~y)J (y) Z i  ; 2 d4 xd4yJ (x)F (x ; y m2)J (y) (5.12) 0

0

0

0

124

where we have dened x = (~x s), y = (~y s0) and Z d4k Z d3~k ~ i i k  ~ x ;ikx (5.13) 2 = lim iF (x m ) = (2 )3 (s !~k )e 4 2 + !0 (2 ) k ; m2 + i e Again we have put k = ( ~k). Therefore, the generating functional is Z i ; 2 d4xd4yJ (x)F (x ; y m2)J (y) Z J ] = e Z 0] (5.14) In particular we get 2 Z J ]  = i (x ; x  m2) (5.15) 1 h0jT ((x )(x ))j0i = ; 1 1 2 F 1 2 h0j0i Z 0] J (x1 )J (x2 ) J =0 We could repeat the exercise in the euclidean formulation, with the result 1 Z d4xd4yJ (x)G (x ; y m2)J (y) 0 ZE J ] = e 2 ZE 0] (5.16) where Z d4k 1 2 G0(x m ) = (2 )4 k2 + m2 e;ikx (5.17) We go from Minkowskian to the Euclidean description through the substitution ;iF ! G0 (5.18) For the future it will be convenient to use the following notation for the N -points Green's functions (5.19) G(N ) (x1      xN ) = h0jT ((x1h)0j0i(xN ))j0i In the free case we can get evaluate easily the generic G(N ) In fact G(0N ) (the index recalls that we are in the free case) is obtained by developing the generating functional 1 Z n Z J ] = X 4 x    d4 x J (x )    J (x ) (i) h0jT ((x1 )    (xn ))j0i  d 1 n 1 n Z 0] n=0 n! h0j0i Z 1 n X  d4x1    d4xnJ (x1 )    J (xn) (ni)! G(0n) (x1      xn ) (5.20) n=0 By comparison with the expansion of eq. (5.14) Z J ] = 1 ; i Z d4x d4x J (x )J (x ) (x ; x ) 1 2 1 2 F 1 2 Z 0] 2 Z ; 18 d4x1 d4x2 J (x1 )J (x2 )F (x1 ; x2 ) Z  d4x3 d4x4 J (x3 )J (x4 )F (x3 ; x4 ) +    (5.21) 125

Since Z J ] is even in J we have

G(20 k+1) (x1     x2k+1) = 0 (5.22) (4) For G(2) 0 we get back eq. (5.13). To evaluate G0 we have rst to notice that all the G(0N ) 's are symmetric in their arguments. therefore in order to extract the coecients we need to symmetrize. Therefore we write

Z

d4x1    d4x4 J (x1 )    J (x4 )F (x1 ; x2 )F (x3 ; x4)

Z h 1 4 x    d4 x J (x )    J (x )  (x ; x ) (x ; x ) d = 1 4 1 4 F 1 2 F 3 4 3 i + F (x1 ; x3 )F (x2 ; x4 ) + F (x1 ; x4 )F (x2 ; x3 )

(5.23)

It follows h G(4) ( x      x ) = ; F (x1 ; x2 )F (x3 ; x4 ) + F (x1 ; x3 )F (x2 ; x4 ) 1 4 0

i

+ F (x1 ; x4 )F (x2 ; x3 )

(5.24)

(4) If we represent G(2) 0 (x y ) by a line as in Fig. 5.1.1, G0 will be given by Fig. (2n) 5.1.2 since it is obtained in terms of products of G(2) 0 . Therefore G0 is given by a combination of products of n two point functions. We will call disconnected a diagram in which we can isolate two subdiagrams with no connecting lines. We see that in the free case all the non vanishing Green's functions are disconnected except for the two point function. Therefore it is natural to introduce a functional generating only the connected Green's functions. In the free case we can put .

y

x

Fig. 5.1.1 - The two point Green's function G(2) 0 (x y ). . x1

x1

x2

x2

+ x3

x4

x1

x2

x3

x4

+ x3

x4

Fig. 5.5.2 - The expansion of the four point Green's function G(4) 0 (x1  x2  x3  x4 ). Z J ] = eiW J ] 126

(5.25)

with

Z 1 W J ] = ; 2 d4x1 d4x2 J (x1 )J (x2)F (x1 ; x2 ) + W 0]

and we have 2Z J ]  = i 2 W J ]  = ;i (x ; x ) 1 F 1 2 Z 0] J (x1 )J (x2 ) J =0 J (x1 )J (x2 ) J =0

(5.26) (5.27)

Therefore the derivatives of W J ] give the connected diagrams with the proper normalization(that is divided by 1=Z 0]). In the next Section we will see that this property generalizes to the interacting case. That is W J ] is dened through eq. (5.25), and its Volterra expansion gives rise to the connected Green's functions

iW J ] = where

Z 1 X (i)n n=0

n!

d4x1    d4xn J (x1 )    J (xn)G(cn) (x1      xn)

G(cn) (x1     xn) = h01j0i h0jT ((x1)    (xn))j0iconn

(5.28) (5.29)

The index "conn" denotes the connected Green's functions. Notice that once we nd the connected Green's functions the theory is completely solved, since the generating functional is recovered by exponentiation of W J ].

5.2 The generating functional of the connected Green's functions Let us start by dening recursively the connected Green's functions. (1) G(1) (x1 ) = G(1) c (x1 )  Gc  (1) (1) (2) (1) 2 G(2) (x1  x2) = G(2) c (x1  x2 ) + Gc (x1 )Gc (x2 )  Gc + Gc  (1) (2) G(3) (x1  x2 x3 ) = G(3) c (x1  x2  x3 ) + Gc (x1 )Gc (x2  x3 ) (2) (1) (2) (1) (1) (1) +G(1) c (x2 )Gc (x1  x3 ) + Gc (x3 )Gc (x1  x2 ) + Gc (x1 )Gc (x2 )Gc (x3 )  (1) (2) (1) 3  G(3) (5.30) c + 3Gc Gc + Gc

On the right hand side we have used a symbolic notation by adding together the topological equivalent congurations. To dene G(cn) (x1     xn) we decompose the set (x1     xn) in all the possible subsets, and then we write

G(n) (x1      xn ) =

X

G(cn1 ) (xp1      xpn1 )G(cn2) (xq1      xqn2 )    G(cnk )(xr1      xrnk ) (5.31) where we sum over all the indices ni such that n1 +    + nk = n and over all the permutations from 1 to n of the indices in the set p1     pn1 , etc. A graphical 127

.

xxxxx xxxxx xxxxx xxxxx xxxxx xxxxx

=

xxxxx xxxxx xxxxx xxxxx xxxxx

c

=

c

+

c

c

c

c

xxxxx xxxxx xxxxx xxxxx xxxxx

=

+ 3

c

c

+ c

c

c

xxxxxx xxxxxx xxxxxx xxxxxx xxxxxx

=

c

+ 3

c

+

c

c

c

+ 6

+

4

c

c

c

c

+

c c

Fig. 5.2.1 - The connected Green's functions. description is given in Fig. 5.2.1. By using these denitions it is not dicult to check that W J ] is the generating functional of the connected Green's functions. It is enough to compare the expansion of Z J ] in series of G(cn) with the expansion from the G(n). This can be done by induction. We will limit ourselves to an explicit check up to the 4th order. By dening G(0) = 1 and using the same symbolic notation as before, we get

Z J ] = Z 0] =

1 X (i)n J nG(n) = ei(W J ] ; W 0]) n=0 n!  3 4 2 i i i (2) 2 (3) 3 (4) 4 (1) exp iGc J + 2! Gc J + 3! Gc J + 4! Gc J +   

i (2) 2 i (3) 3 i (4) 4  1 + iG(1) c J + 2! Gc J + 3! Gc J + 4! Gc J 2

3



4

4 2 2 i4 (2) 2 4 3 G(1) G(2) J 3 + i G(1) G(3) J 4 + 2!1 i2 G(1) J + G J + i c c c 4! c 3 c c   3 3 3 4 (1) 2 (2) 4 + 1 i3 G(1) J + 2 i Gc Gc J c 3! h 4 4i + 4!1 i4 G(1) c J +

128



= + +

h

i

i2 G(2) + G(1) 2 1 + iJG(1) + c c 2! c 3 i J 3 G(3) + 3G(1) G(2) + G(1) 3 c c c c 3! i4 J 4 G(4) + 3G(2) 2 + 4G(1) G(3) + 6G(1) 2G(2) + G(1) 4 c c c c c c c 4!

h h

i

i

+    (5.32)

5.3 The perturbative expansion for the theory '4 Let us consider an interacting scalar eld. The action will be of the type  Z 1

 2 2 4 (5.33) S = d x 2 @ '@ ' ; m ' ; V (') V where V (') is the interaction potential. A typical example is the theory '4 with the potential given by V (') = 4! '4 (5.34) The derivation of the perturbative expansion is very simple in this formalism. We take advantage of the following identity valid for any functional F

Z

Z

D('(x))F ']e

iS + i d4xJ (x)'(x)

Z

 1  Z

iS + i d4xJ (x)'(x)

= F i J D('(x))e (5.35) We separate in eq. (5.33) the quadratic part of the action, S0 , from the interacting part and then we use the previous identity

Z J ] = N = N

Z Z

Z

iS + i d4xJ (x)'(x) D('(x))e

;i

D('(x))e

Z

Z

d4xV (')

1  

Z

iS0 + i d4xJ (x)'(x) e

;i i J Z0J ] (5.36) = e Notice that we have suppressed the temporal limits in the expression for the generating functional. Z0J ] is the generating functional for the free case, evaluated in eq. (5.14). Therefore we get 1   i Z Z 4 ;i d xV i J ; 2 d4xd4yJ (x)F (x ; y)J (y) iW  J ] Z J ] = e = Z 0]e e (5.37) d4xV

129

In this expression we have suppressed the mass in the argument of F . For the following calculation it will be useful to introduce the following shorthand notation

Z

d4x1    d4xnF (x1      xn)  hF (x1     xn)i

(5.38)

Then, eq. (5.37) can be rewritten in the following way

1  

;ihV i J i ; i hJ (1)F (1 2)J (2)i iW  J ] Z J ] = e = e Z 0] 1   e 2 ;ihV i J i iW J ] e 0 = e

(5.39)

This gives rise to the following expression for the generating functional of the connected Green's functions 1   1 2 0 3 ; i h V i i J ; 1C W J ] = ;i log 64eiW0 J ] + B @e A eiW0J ]75 =

1   1 2 0 3 ; i h V i i J ; 1C W0J ] ; i log 641 + e;iW0 J ] B @e A eiW0J ]75(5.40)

This expression can be easily expanded in V . In fact, by dening 1   1 0 ;ihV i J i C iW J ] J ] = e;iW0 J ] B ; 1A e 0 @e

(5.41)

we see that we can expand W J ] in a series of . Notice also that the expansion is in term of the interaction lagrangian. At the second order in  we get  1  (5.42) W J ] = W0 J ] ; i  ; 2 2 +    Let us now consider the case of '4 . the functional J ] can be in turn expanded in a series of the dimensionless coupling 

 = 1 + 22 +    It follows

W J ] = W0 J ] ; i1

; i2

130



(5.43)



2 ; 21 12 +   

(5.44)

Then, from eq. (5.43) we get





4 1  i ; iW  J ] 0 h i J ieiW0 J ] 1 = ; 4! e  1  4  1  4 1 ; iW  J ] 0 2 = ; 2(4!)2 e h i J ih i J ieiW0 J ] (5.45) By using the following list of functional derivatives  eiW0 J ] = ;i(x 1)J (1)eiW0 J ] J (x) 2 eiW0 J ] = (;i(x x) ; (x 1)(x 2)J (1)J (2)) eiW0 J ] J (x)2 3 eiW0 J ] =  ; 3(x x)(x 1)J (1) J (x)3  + i(x 1)(x 2)(x 3)J (1)J (2)J (3) eiW0 J ] 4 eiW0 J ] = (;3(x x)2 + 6i(x x)(x 1)(x 2)J (1)J (2) J (x)4 +(x 1)(x 2)(x 3)(x 4)J (1)J (2)J (3)J (4))eiW0J ] (5.46)

where the integration on the variables 1 2 3 4 is understood, we obtain h i 1 = ; 4! h(y 1)(y 2)(y 3)(y 4)J (1)J (2)J (3)J (4)i i + 6ih(y y)(y 1)(y 2)J (1)J (2)i ; 3h2(y y)i The quantity 2 can be made in terms of 1 by writing









(5.47)

!

1 e;iW0 J ] h 1  4i eiW0 J ] e;iW0 J ] h 1  4ieiW0 J ] 2 = ; 2(4!) 2 i J i J  1  4   i ; iW  J ] iW  J ] 0 0 h i J i e 1 (5.48) = ; 2(4!) e Then we have  1  4   4 eiW0 J ] 3eiW0 J ] 1 i h i J i eiW0 J ] 1 = h  J i  + 4 h (x)4 1 J (x)3 J (x) 2 eiW0 J ]  2  eiW0J ] 3 1 i 1 + 6h  J i + 4 h (x)2 J (x)2 J (x) J (x)3 4 (5.49) + eiW0 J ]h J (x1)4 i 131

and therefore

h 3eiW0J ] 1 2 = 12 12 ; 2 i 4! e;iW0 J ] 4h  J (x)3 J (x) i 2 iW0 J ] 2 iW0J ] 31 + 6h  e 2  1 2 i + 4h e J (x) J (x) J (x) J (x)3 i i 4 + eiW0 J ]h J (x1)4 i

(5.50)

The 12 gives a disconnected contribution since there are no propagators connecting the two terms. In fact the previous expression shows that 2 contains a term which cancels precisely the term 12 in eq. (5.44). By using the expression for 1 and eqs. (5.46) we get 2 ; 21 12 =  i  i   = ; 2  4! ; 4! h4 ; 3(x x)(x 1)J (1)



+ i(x 1)(x 2)(x 3)J (1)J (2)J (3)

   4(y x)(y 4)(y 5)(y 6)J (4)J (5)J (6) + 12i(y y)(y x)(y 4)J (4)   + 6 ; i(x x) ; (x 1)(x 2)J (1)J (2) 122(y x)(y 3)(y 4)J (3)J (4)  + 12i(y y)2(y x)   3  4 + 4 ; i(x 1)J (1) 24 (y x)(y 2)J (2) + 24 (y x)i

(5.51)

By omitting the terms independent on the external source we have 2 ; 21 12 = 1 3  i 1 = hJ (1)(1 x)  (x y) + (x x)(x y)(y y) (y 2)J (2)i 2 6 4 i + 8 hJ (1)(1 x)2(x y)(y y)(x 2)J (2)i 1 hJ (1)(1 x)(x x)(x y)(y 2)(y 3)(y 4)J (2)J (3)J (4)i + 12 1 hJ (1)J (2)(1 x)(2 x)2 (x y)(y 3)(y 4)J (3)J (4)i + 16 ; 72i hJ (1)J (2)J (3)(1 x)(2 x)(3 x)(x y)(y 4)  (y 5)(y 6)J (4)J (5)J (6)i (5.52) 132

Now, by dierentiating functionally eq. (5.44) and putting J = 0 we can evaluate the connected Green's functions. For instance, recalling that 2 W J ]   (2) iGc (x1  x2) = J (x )J (x ) J =0 (5.53) 1 2 we get Z  (2) Gc (x1  x2 ) = iF (x1 ; x2 ) ; 2 d4xF (x1 ; x)F (x ; x)F (x ; x2) Z h1 3 2 4 4 ; i d xd yF (x1 ; x) 6 F (x ; y) i + 14 F (x ; y)F (x ; x)F (y ; y) F (y ; x2 ) 2Z  ; i 4 d4xd4 yF (x1 ; x)2F (x ; y)F (y ; y)F (x ; x2 ) (5.54) . xxxxxxxx xxxxxxxx xxxxxxxx xxxxxxxx xxxxxxxx xxxxxxxx xxxxxxxx xxxxxxxx

xx xx

x1

xx xx

x x2 = xx 1

x

xx xx

+

x x

x1

xxx xx

y xx xx xx xx + x2

xx xx

x1

x

+

xx 1

x2 +

xx xx

xx 2

xx xx

xx xx

x2 +

x

y

x x

x

xx xx

x x

x1 x

xx xx

xx xx

y x2

Fig. 5.3.1 - The graphical expansion for the two point connected Green's function (2)

Gc (x y). In Fig. 5.3.1) we give a graphical form to this expression. as we see the result has an easy interpretation. The diagrams of Fig. 5.3.1) are given in terms of two elements, the propagator F (x ; y), corresponding to the lines and the vertex '4 corresponding to a point joining four lines. In order to get G(cn) we have to draw all the possible connected diagrams containing a number of interaction vertices corresponding to the xed perturbative order. The analytic expression is obtained associating a factor iF (x ; y) to each line connecting the point x with the point y, and a factor (;i=4!) for each interaction vertex. The numerical factors appearing in eq. (5.54) can be obtained by counting the possible ways to draw a given diagram. For instance let us consider the diagram of Fig. 5.3.2. This diagram contains the two external propagators, the internal one, and the vertex at x. To get the numerical factor we count the number of ways to attach the external propagators to the vertex, after that we have only a possibility to attach the vertex at the internal propagator. There are four ways to attach the the vertex at the rst propagator and three for the second. Therefore 1 1  4  3 = (5.55) 4! 2 133

. xx xx xx xx

x1

xx xx

xx xx

xx x2 = x1

x

x

xx xx

xx xx

xx xx

x

x

x2

Fig. 5.3.2 - How to get the numerical factor for the rst order contributions to (2) Gc (x1  x2 ).

where the factor 1=4! comes from the vertex. We proceed in the same way for the diagram of Fig. 5.3.3. . x

xx xx xx xx

x1

y

xx xx xx xx

x x x

x

x2= xx 1

y

y

xx xx

x

xx xx

x2

Fig. 5.3.3 - How to get the numerical factor for one of the second order contributions to G(2) c (x1  x2 ). .

xx xx x 4 xxxxxxx xxxxxxx xxxxxxx xxxxxxx x xxxxxxx x xxxxxxx xxxxxxx xx x 2xx xx xx x

x1 xx xx

x1 xx

=

+ x 2xx xx

xx xxxx y

x

x

x 2xx xx

3

x1 xx xx

xx xxx 4 xx xx

xx xxx 4

x1 +

xx

x

y xx x 2 xx

xx xx x 3

xx x

xx xx x 4

xx xx

+

xxx

xx xxx 4

x xx

xx xx

y

x 2xx xx

+ x 2xx

x3

xxx

x xx xx

xx xx

xx

3

x 2xx xx y

xx xx

x4 +

xx xxx 3

xx x

x

4

xx xx xxy xx xx xxx 3

+

xx xx x 4

xx xx

+

+ x

x1 xx

x1 +

xx x1xx

x1 xx xx

3

x1 xx xxxx yxx xx xx + x x 2xx

xx xx xx x 2xx

x

xx xxx 3

4

y xx

x3

Fig. 5.3.4 - The graphical expansion up to the second order of G(4) c (x1  x2  x3  x4 ). There are four ways to attach the rst vertex to the rst propagator and four for attaching the second vertex to the second propagator. Three ways for attaching the second leg of the rst vertex to the second vertex and two ways of attaching the the third leg of the rst vertex to the second one. Therefore we get 1 1  4  4  3  2 = (5.56) 4! 6 134

The four point function can be evaluated in analogous way and the result is given in Fig. 5.3.4. Of course we can get it by functional dierentiation of W J ]. The rst order contribution is Z 4 G(4) c (x1  x2  x3  x4 ) = ;i d xF (x1 ; x)F (x2 ; x)F (x3 ; x)F (x4 ; x) (5.57) where the numerical coecient is one, since there are 4  3  2 of attaching the vertex to the four external lines.

5.4 The Feynman's rules in momentum space The rules for the perturbative expansion of the Green's functions in space-time that we got in the previous Section can be easily extended to the momentum space. We will consider again the two point function given in eq. (5.54) where we will take i) separately the various contributions in  that we will denote by G(2)( c , where i (2) stands for the power of . let us also dene the Fourier transform of Gc (x1  x2) as Z (2) ip x + ip2 x2 Gc (p1 p2) = d4x1 d4x2 G(2) (5.58) c (x1  x2 )e 1 1 Then we get and

i 4 4 G(2)(0) c (p1  p2 ) = (2 )  (p1 + p2 ) p2 ; m2 + i 1

(5.59)

G(2)(1) c Z(p1  p2 ) = Z d4p  i 4 4 4 ip x + ip x ;ip(x1 ; x) 2 2 = ;i d x e 1 d x2 d xe 1 1 (2 )4 p2 ; m2 + i Z Z 2d4q i d4k e;ik(x ; x2 ) i  4 2 2 4 2 (2 ) q ; m + i (2 ) k ; m2 + i Z 4 4 4 = ;i 2 (2d p)4 (2d q)4 (2d k)4 (2 )44 (p ; k)(2 )44 (p1 ; p)(2 )44(p2 + k) (5.60)  p2 ; mi 2 + i q2 ; mi 2 + i k2 ; mi 2 + i The product of the 4 functions may be rewritten in such a way to pull out the overall four-momentum conservation (2 )44(p1 + p2 )(2 )44(p1 ; p)(2 )44(p ; k) (5.61) By integrating over p and k we get  i  2 Z d4 q  i (2)(1) 4 4 Gc (p1  p2) = ;i 2 (2 )  (p1 +p2 ) p2 ; m2 + i 4 2 (2 ) q ; m2 + i (5.62) 1 Also in momentum space we can make use of a diagrammatic expansion with the following rules: 135

.

 For each propagator draw a line with associated momentum p (see Fig. 5.4.1). :

i ___________ _ 2 p m2 + iε

Fig. 5.4.1 - The propagator in momentum space.  For each factor (;i=4!) draw a vertex with the convention that the momentum .

&ux is zero (see Fig. 5.4.2). p1

p

4

λ , p +p +p + p = 0 : _ i __ 1 4 3 2 4!

p

p

2

3

Fig. 5.4.2 - The vertex in momentum space.  To get G(cn) draw all the topological inequivalent diagrams after having xed

the external legs. The number of ways of drawing a given diagram is its topological weight. The contribution of such a diagram is multiplied by its topological weight.  After the requirement of the conservation of the four-momentum at each vertex we need to integrate over all the independent internal four-momenta with weight Z d4q (5.63) (2 )4 A more systematic way is to associate a factor 4 X i 4 4 ; 4! (2 )  ( pi) i=1

(5.64)

to each vertex and integrate over all the momenta of the internal lines. This gives automatically a factor n X

(2 )44 (

i=1

pi) (n = numero linee esterne)

(5.65)

corresponding to the conservation of the total four-momentum of the diagram. By using these rules we can easily evaluate the 2nd order contribution of Fig. 5.4.3. 136

.

p1

q1 q2

p

2

q3

Fig. 5.4.3 - One of the second order contributions to G(2) c . We get

 i 2 4432 ;

Z i p22 ; m2 + i

Y3

d4 qi i 2 4 2 4! i=1 (2 ) qi ; m + i (2 )44(p1 ; q1 ; q2 ; q3 )(2 )44(q1 + q2 + q3 ; p2 ) p2 ; mi 2 + i 1 2  1 = 6 (p2 ; m2 + i)2 (2 )44(p1 + p2)

Z Y31 d4qi 

!

i 4 4 2 4 q ; m2 + i (2 )  (p1 ; q1 ; q2 ; q3 ) (2

) i i=1

!

(5.66)

5.5 Power counting in '4 We start by going to the euclidean formulation. Let us denote the time by tR . The relation with the euclidean time is tE = itR . Recalling from the Section 4.9 that for the harmonic oscillator of unit mass one has (we have dened xR = (tR  ~x), xE = (tE  ~x)) Z d3p~ Z d3p~ 2 i~ p  ~ x iF (xR ) = (2 )3 (tR  !k )e = D (t  !2)ei~p  ~x (2 )3 E E k Z d4pE e;ipE xE (5.67) = (2 )4 p2E + m2 where pE = (EE  ~p), and we have used eq. (4.237) for unit mass and with  = EE . From this we get the following modications to Feynman's rules in the euclidean version  Associate to each line a propagator 1 (5.68) 2 p + m2 137

 To each vertex associate a factor

; 4!

(5.69)

This rule follows because the weight in the euclidean path-integral is given by e;SE instead of eiS . The other rules of the previous Section are unchanged. The reason to use the euclidean version is because in this way the divergences of the integrals become manifest by simply going in polar coordinates. We will now in the position to make a general analysis of the divergences. For simplicity we will consider only scalar particles. First of all consider the number of independent momenta in a given diagram, or the number of "loops", L. This is given, by denition, by the number of momenta upon which we are performing the integration, since they are not xed by the four-momentum conservation at the vertices. Since one of this conservation gives rise to the overall four-momentum conservation we see that the number of loops is given by

L = I ; (V ; 1) = I ; V + 1

(5.70)

where I is the number of internal lines and V the number of vertices. As in Section 2.3 we will be interested in evaluating the supercial degree of divergence of a diagram. To this end notice that  The L independent integrations give a factor QLk=1 ddpk where d is the number of space-time dimensions.  Each internal line gives rise to a propagator containing two inverse powers in momenta YI 1 (5.71) 2 2 i=1 pi + m Given that, the supercial degree of divergence is dened as

D = dL ; 2I

(5.72)

Notice that D has to do with a common scaling of all the integration variables, therefore it may well arise the situation where D < 0 (corresponding to a convergent situation), but there are divergent sub-integrations. Therefore we will need a more accurate analysis that we will do at the end of this Section. For the moment being let us stay with this denition. We will try now to get a relation between D and the number of external legs in a diagram. By putting VN equal to the total number of vertices with N lines we have (see Section 2.3)

NVN = E + 2I 138

(5.73)

with E the number of external legs and I the number of internal ones (recall we are dealing with scalar particles only). Eliminating I between this relation and the previous one, we get I = 21 (NVN ; E ) (5.74) from which D = d(I ; VN +1) ; 2I = (d ; 2)I ; d(VN ; 1) = d ;2 2 (NVN ; E ) ; d(VN ; 1) (5.75) This relation can be rewritten as N ; 2  1 (5.76) D = d ; 2 (d ; 2)E + 2 d ; N VN In particular, for d = 4 we have

D = 4 ; E + (N ; 4)VN (5.77) For the '4 theory this relation becomes particularly simple since then D depends only on the number of external legs D =4;E (5.78) Therefore the only diagrams supercially divergent, D  0, are the ones with E = 2 and E = 4. As noticed before, this does not prove that a diagram with E > 4 or D < 0 is convergent. Let us discuss more in detail this point. Let us consider the diagram of Fig. 5.5.1, and let us suppose that the diagrams 1 and 2 have degrees of supercial divergence D1 and D2 respectively. Since they are connected by n lines, we have n ; 1 independent momenta, and therefore . 1 D1

2 n

D2

Fig. 5.5.1 - Two diagrams connected by n-lines. D = D1 + D2 + 4(n ; 1) ; 2n = D1 + D2 + 2n ; 4

(5.79) Therefore we may have D < 0 with D1 and D2 positive, that is both divergent. However for n > 1 we have D  D1 + D2 (5.80) 139

and therefore in order to have D < 0 with 1 or 2 being divergent diagrams, we need D1 or D2 to be negative enough to overcome the other. To exclude the case n = 1 means to consider only the one-particle irreducible (1PI) Feynman's diagrams. In Fig. 5.5.2 we give an example of a one-particle reducible diagram. . xxxxxxxxx xxxxxxxxx xxxxxxxxx xxxxxxxxx xxxxxxxxx xxxxxxxxx xxxxxxxxx xxxxxxxxx xxxxxxxxx

xxxxxxxxx xxxxxxxxx xxxxxxxxx xxxxxxxxx xxxxxxxxx xxxxxxxxx xxxxxxxxx xxxxxxxxx xxxxxxxxx

Fig. 5.5.2 - One-particle reducible Feynman's diagram. The reason of excluding these diagrams from our considerations is that the momentum of the internal line connecting the two diagrams is xed, and therefore it cannot give rise to divergences. Furthermore, the one-particle reducible diagrams can be easily reconstructed from the 1PI's. After that we look for the diagrams having D < 0 but yet divergent, in order to have a complete classication of the divergences. The idea is simply to look for the reducibility of the diagram. We will now stay with the '4 theory. Then, we can consider the 2-, 3-particle reducible diagrams. In fact, for the nature of the vertex it is impossible to have a connected 4-particle irreducible diagram. We begin with the 3-particle reducibility. In this case we look for diagrams of the type of Fig. 5.5.3. . 1 xxxxxxxxx xxxxxxxxx xxxxxxxxx xxxxxxxxx xxxxxxxxx xxxxxxxxx xxxxxxxxx xxxxxxxxx xxxxxxxxx

xxxxxxxxx xxxxxxxxx xxxxxxxxx xxxxxxxxx xxxxxxxxx xxxxxxxxx xxxxxxxxx xxxxxxxxx xxxxxxxxx

N

Fig. 5.5.3 - Three-particle reducible Feynman's diagram. In this case we have D1 = 0 and D2 = 1 ; N , therefore D = D1 + D2 + 2 = 3 ; N

(5.81)

In order the supercial degree of divergence is negative, we must have N > 4. In this diagram we have a primitively divergent four point function and a diagram with D < 0 which can be again analyzed in the same way. In analogous way we look for two-particle reducible diagrams with D < 0 and we extract contributions of the forms given in Fig 5.5.4. 140

. 1 xxxxxxxxx xxxxxxxxx xxxxxxxxx xxxxxxxxx xxxxxxxxx xxxxxxxxx xxxxxxxxx xxxxxxxxx

xxxxxxxx xxxxxxxx xxxxxxxx xxxxxxxx xxxxxxxx xxxxxxxx xxxxxxxx xxxxxxxx

N 1 xxxxxxxx xxxxxxxx xxxxxxxx xxxxxxxx xxxxxxxx xxxxxxxx xxxxxxxx xxxxxxxx xxxxxxxx

xxxxxxxxx xxxxxxxxx xxxxxxxxx xxxxxxxxx xxxxxxxxx xxxxxxxxx xxxxxxxxx xxxxxxxxx

N

Fig. 5.5.4 - Two-particle reducible Feynman's diagram. In the rst case we have D1 = 2 and D2 = 2 ; N , therefore

D = D1 + D2 = 4 ; N

(5.82)

Again this requires N > 4. The second diagram has again D2 < 0 and we can iterate the procedure. In the second case we have D1 = 0 and D2 = 2 ; N , therefore

D = D1 + D2 = 2 ; N

(5.83)

requiring N > 2. By proceeding in this way we see that truly convergent diagrams do not contain hidden two and four point divergent functions. What we have shown is that for the '4 theory in 4 dimensions a 1PI Feynman diagram is convergent if D < 0 and it cannot be split up into 3- or 2-particle reducible parts containing two-and four-point divergent functions. This turns out to be a general theorem due to Weinberg, valid in any eld theory and it says that a Feynman diagram is convergent if its supercial degree of divergence and that of all its subgraphs are negative. In the case of the 4 theory the Weinberg's theorem tells us that all the divergences rely in the two and four point functions. If it happens that all the divergent diagrams (in the primitive sense specied above) correspond to terms which are present in the original lagrangian, we say that the theory is renormalizable. In general we can look at this question by studying the equation for the supercial divergence that we got before N ; 2  1 D = d ; 2 (d ; 2)E + 2 d ; N VN (5.84) 141

For d = 4 we recall

D = 4 ; E + (N ; 4)VN (5.85) We see that D increases with VN for N > 4. Therefore we have innite divergent diagrams and the theory is not renormalizable. We can have a renormalizable scalar theory only for N  4, meaning that the dimension N of the vertex operator, 'N , must be less or equal to 4. An interesting case is the one of the theory '3 in 6 dimensions. For d = 6 we get D = 6 ; 2E + (2N ; 6)VN (5.86) and for N = 3 D = 6 ; 2E (5.87) and the only divergent functions are the one- two- and three-point ones. In d = 6 we have dim'] = 2 and again in order to have renormalizability we must have that the dimensions of the vertex operator must be less or equal to 6.

5.6 Regularization in '4 In this Section we will evaluate the one-loop contributions to the two- and fourpoint Green's functions by using the dimensional regularization. To this end, let us consider the action in dimensions d = 2! 1  Z 1  2 !  2 2 4 S2! = d x 2 @ '@ ' + 2 m ' + 4! ' (5.88) In our unit system we have (/h = c = 1), and therefore the action S2! should be dimensionless. As discussed in Section 2.3 this allows to evaluate the dimensions of the elds, and we recall that for the scalar eld we have (5.89) dim'] = d2 ; 1 = ! ; 1 From this it follows at once that dimm] = 1 dim] = 4 ; 2! (5.90) As we did in QED for the electric charge, here too it is convenient to introduce a dimensionful parameter  in order to be able to dene a dimensionless coupling new = old (2)!;2 (5.91) Then the action will be ( = new ) 1  Z 1  2 !  2 2 2 2 ; ! 4 S2! = d x 2 @ '@ ' + 2 m ' + 4! ( ) ' (5.92) The Feynman's rules are slightly modied 142

 The scalar product among two four-vectors becomes a sum over 2! compo-

nents.  In the loop integrals we will have a factor Z d2! p (2 )2! and the  function in 2!-dimensions will be dened by

Z

d2! p(2!) (

X i

pi ) = 1

(5.93) (5.94)

 The coupling  goes into (2)2;! .

Also, we will do all the calculation in the euclidean version. Remember that in Section 2.5 we gave the relevant integrals both in the euclidean and in the minkowskian version. We start our calculation from the rst order contribution to the two-point function corresponding to the diagram (tadpole) of Fig. 5.6.1. This is given by (2 )2! (2!) (p1 + p2) p2 +1 m2 T2 p2 +1 m2 (5.95) 1 1 .

p

p

p

1

2

Fig. 5.6.1 - The one-loop contribution to the two-point function in the '4 theory. with

Z d2! p 1 1  (2)2;! ! ;(1 ; !) 1 2 2 ; ! T2 = 2 (;)( ) = ; (2 )2! p2 + m2 2 (2 )2! (m2)1;!   2 2 2;! m 4

  ;(1 ; !) (5.96) = ; 2 (4 )2 m2 For ! ! 2 we get   1  2  2  m 4

 T2  ; 2 (4 )2 1 + (2 ; !) log m2 +     (;1) 2 ; ! + (2) +     1  2 4

  2 (5.97) = 32 2 m 2 ; ! + (2) + log m2 +    143

. p

p-k

p4

p

k

p

1

2

3

Fig. 5.6.2 - The one-loop contribution to the four-point function in the '4 theory. This expression has a simple pole at ! = 2 and a nite part depending explicitly on 2 . Let us consider now the one-loop contribution to the four-point function. The corresponding diagram is given in Fig. 5.6.2, with an amplitude (2 )2! (2!) (p1 + p2 + p3 + p4 )

Y4

1

2 2 T4 k=1 pk + m

(5.98)

with

Z d2! k 1 1 1 2 2 4 ; 2 ! T4 = 2 (;) ( ) (2 )2! k2 + m2 (k ; p)2 + m2 

p = p1 + p2

(5.99)

We reduce the two denominator to a single one, through the equation (see Section 2.6) 1 1 2 2 k + m (k ; p)2 + m2 = = =

Z1

Z0 1 Z0 1 0

1

dx

x(k2 + m2 ) + (1 ; x) ((k ; p)2 + m2 )]2

dx

x(k2 + m2 ) + (1 ; x) (k2 + m2 ; 2k  p + p2)]2

dx

1

1

(k2 + m2 ) + (1 ; x)(p2 ; 2k  p)]2

(5.100)

the denominator can be written as (k2 + m2 ) + (1 ; x)(p2 ; 2k  p) = k ; (1 ; x)p]2 + m2 + p2(1 ; x) ; p2(1 ; x)2 = k ; (1 ; x)p]2 + m2 + p2x(1 ; x) (5.101) Since our integral is a convergent one, it is possible to translate k into k ; p(1 ; x) obtaining

Z 1 Z d2! k 2  2 4 ; 2 ! dx (2 )2! 2 2 1 2 T4 = 2 ( ) k + m + p x(1 ; x)]2 0 144

(5.102)

By performing the momentum integration we get

Z 1 dx 2  1 2 4 ; 2 ! T4 = 2 ( ) ;(2 ; ! ) ! 2 2 m + p x(1 ; x)]2;! 0 (4 )

(5.103)

In the limit ! ! 2  Z 1 dx  1 2

 2 2 ; ! 2 T4  2 ( ) 1 + (2 ; !) log  2 2 ; ! +  (1) (4

) 0  2 + p2 x(1 ; x)  m  1 ; (2 ; !) log 4

 Z1 2 2 + p2 x(1 ; x)   1 m 2 2 ; !  ( ) 32 2 2 ; ! + (1) ; dx log (5.104) 4 2 0 Also the x integration can be done by using the formula

Z1





p

p dx log 1 + a4 x(1 ; x) = ;2 + 1 + a log p1 + a + 1  a > 0 1+a;1 0 The nal result is 2 h

2 T4  (2)2;! 32 2 2 ;1 ! + (1) + 2 + log 4m 2 q s i 1 + 4pm22 + 1 2 4 m ; 1 + p2 log q 4m2 +    1 + p2 ; 1

(5.105)

(5.106)

5.7 Renormalization in the theory '4 In order to renormalize the theory '4 we proceed as in QED. That is we split the original lagrangian, expressed in terms of bare couplings and bare elds in a part (5.107) Lp = 12 @ '@  ' + 12 m2 '2 + 4! 4;2! '4 depending on the renormalized parameters m and  and eld ', and in the counterterms contribution 4;2! 4 Lct = 21 Z@ '@  ' + 21 m2 '2 +  (5.108) 4!  ' The sum of these two expressions (5.109) L = 12 (1 + Z )@ '@  ' + 21 (m2 + m2)'2 + ( +4!) 4;2! '4 145

gives back the original lagrangian, if we redene couplings and elds as follows

'B = (1 + Z )1=2'  Z'1=2 '

(5.110)

2 + m2 m = 1 + Z = (m2 + m2 )Z';1  = 4;2! ( + )Z ;2 B = 4;2! (1++Z ' )2

m2B

In fact

(5.111) (5.112)

L = 21 @ 'B @  'B + 21 m2B '2B + 4!0 '4B

(5.113) The counter-terms are then evaluated by the requirement of removing the divergences arising in the limit ! ! 2, except for a nite part which is xed by the renormalization conditions. From the results of the previous Section, by summing up . xxxxxxxx xxxxxxxx xxxxxxxx xxxxxxxx xxxxxxxx xxxxxxxx xxxxxxxx xxxxxxxx

xx

+

xx xx

xx xx

xx

=

xx xx

xx

xx xx

xx

+

+

xx

xx

xx

+

xxxx xxxx xx xx

Fig. 5.7.1 - The contributions to the teo-point connected Green's function. all the 1PI contributions we get the following expression for the two-point connected Green's function (see Fig. 5.7.1)





1 + 1 T 1 + = 1 + p2 ) 2 p1 + m2 p21 + m2 2 p21 + m2 (5.114)  (2 )2! 2! (p1 + p2) p2 + m12 ; T 2 1 Therefore, the eect of the rst order contribution is to produce a mass correction determined by T2 . The divergent part can be taken in care by the counterterm m2 which produces an additional contribution to the one-loop diagram, represented in Fig. 5.7.2. We choose the counterterm as    1 m2 '2 =  m2 2 + F ! m2 '2 (5.115) 2 64 2  1 2 with  = 4 ; 2! and F1 an arbitrary dimensionless function Adding the contribution of the counterterm to the previous result we get  1  1 1 (2) 2 ! 2 ! 2 Gc (p1 p2) = (2 )  (p1 + p2) p2 + m2 ; T + p2 + m2 (;m ) p2 + m2 2 1 1 1 (5.116)

G(2) c (p1  p2 )

(2 )2! 2! (p

146

. xx

x

=

_ δm2

Fig. 5.7.2 - The Feynman'rule for the counterterm m2 . at the order  we have

1 2! 2! G(2) c (p1  p2 ) = (2 )  (p1 + p2 ) p2 + m2 + m2 ; T

(5.117)

2

1

Of course the same result could have been obtained by noticing that the counterterm modies the propagator by shifting m2 into m2 +m2 . Now the combination m2 ;T2 is nite and given by





 m2 F ; (2) + log m2 (5.118) 1 2= 32 2 4 2 The nite result at the order  is then 1 2! 2! h i (5.119)  G(2) c (p1  p2 ) = (2 )  (p1 + p2 ) 2 p1 + m2 1 + 32 2 F1 ; (2) + log 4m22 m2 ; T

This expression has a pole in Minkowski space, and we can dene the renormalization condition by requiring that the inverse propagator at the physical mass is

p2 + m2phys

(5.120)

This choice determines uniquely the F1 term. However we will discuss more in detail the renormalization conditions in the next Chapter when we will speak about the renormalization group. Notice that at one loop there is no wave function renormalization, but this comes about at two loops, as we shall see later. Consider now the four-point connected Green's function. Its diagrammatic expansion at one loop is given in Fig. 5.7.3 (again we have included only 1PI diagrams). From the results of the previous Section and introducing the Mandelstam invariants s = (p1 + p2 )2 t = (p1 + p3)2 u = (p1 + p4 )2 (5.121) we get 2! (2!) G(4) c (p1  p2  p3  p4 ) = (2 )  (p1 + p2 + p3 + p4 )



(;4;2! )





Y4

1

2 2 k=1 pk + m



2 1 A(s t u) (5.122) 1 ; 323 2 2 + (1) + 2 ; log 4m ; 2

 3

147

. xx xx xx xxxxxxx xxxxxxx xxxxxxx xxxxxxx xx xxxxxxx xx xxxxxxx xx xxxxxxx xxxxxxx xx xx

xx xx

xx

xx xx xx

=

xx

+ xx

xx

xx xx

xx xx x

xx xx

xx

xx

xx

xx xx

+

xx xx

xx xx

+

xx xx

xx xx xx

+

xx xx

xx xx

xx xx

Fig. 5.7.3 - The one-loop expansion for the four-point Green's function. with

A(s t u) =

q 4m 2 1+ z q 1 + 4m log z 4m

Xr

2

+1

1+ z ;1 The divergent part can be disposed o by the counterterm

(5.123)

z=stu

2







 4;2! '4 = 1 4;2! 32 2 + G ! m2 '4 (5.124) 1 4! 4! 32 2  2 with G1 an arbitrary dimensionless function. This counterrtem produces an additional contribution with a Feynman rule represented in Fig. 5.7.4. .

=

_ (µ2 )2



δλ

Fig. 5.7.4 - The Feynman's rule for the couterterm . By adding the counterterm we get 2! (2!) G(4) c (p1  p2  p3  p4 ) = (2 )  (p1 + p2 + p3 + p4 )





Y4

1

2 + m2 (; p k=1 k



2;4! )

2 (5.125)  1 ; 323 2 ;G1 + (1) + 2 ; log 4m 2 ; 31 A(s t u) A possible renormalization condition is to x the scattering amplitude at some value of the invariants (for instance, s = 4m2phys, t = u = 0) to be equal to the lowest order value 2! (2!) G(4) c (p1  p2  p3  p4 ) = (2 )  (p1 + p2 + p3 + p4 )

148

Y4

1

2 2 k=1 pk + m

 (;2;4! ) (5.126)

In any case G1 is completely xed by this condition. Without entering into the calculation we want now show how the renormalization program works at two loops. We will examine only the two-point function. Its complete expansion up to two loops, including also the counterterm contributions is given in Fig. 5.7.5. By omitting the external propagators and the delta-function of . xx xx

xxxxxxxx xxxxxxxx xxxxxxxx xxxxxxxx xxxxxxxx xxxxxxxx xxxxxxxx xxxxxxxx

xx xx

=

xx xx

xx xx

+

xx xx

xx xx

xx xx

+

xx xx

+

xx

xx

+

xx

xx

xx xx

+

xx xx

a)

+

xx

xx

+

xx

xx xx

x x

+

b)

x

x

d)

c)

Fig. 5.7.5 - The two-loop expansion for the two-point Green's function. conservation of the four-momentum, the various two-loop contributions are (notice that a counterterm inside a loop gives a second order contribution as a pure twoloops diagram):  Contribution from the diagram a) of Fig 5.7.5





2 2 2 2 ; 6(16 2)2 21 p2 + 6m2 + 6m 32 + (1) ; log 4m

2

 Contribution from the diagram b) of Fig 5.7.5





2 2 ; 4(16 2)2 42 + 2 (2) + (1) ; 2 log 4m 2

 Contribution from the diagram c) of Fig 5.7.5





32 m2 4 + 2 (2) + G ; log m2 1 4(16 2)2 2  4 2

 Contribution from the diagram d) of Fig 5.7.5





2 m2 4 + 2 (1) + F ; log m2 1 4(16 2)2 2  4 2 149





 

(5.127)

(5.128)

(5.129)

(5.130)

Adding together these terms we nd 2 p2 + m2 22 1 + m2 2 1 F + 3G ; 1] ; 12(16 (5.131) 1

2 )2  (16 2)2 2 2(16 2)2  1 where we have used (2) ; (1) = 1. This expression is still divergent but at the order 2 . Therefore we can make nite the diagram by modifying at the second order the m2 counterterm 1 ;m2 + m2 '2 (5.132) 2 2 1 with m21 given in eq. (5.115) and 4 1  m2  2  2 2 (5.133) m2 = m 2(16 2)2 2 +  (F1 + 3G1 ; 1) + F2 ! 2 and F2 a new dimensionless function. The further divergence in p2 can be eliminated through the wave function renormalization counterterm, that is 1 Z@ '@  ' (5.134) 2  with the choice 2 2 2   m Z = ; 24(16 2)2  + H2 (! 2 ) (5.135)

with H2 dimensionless. All the functions F1, F2 , G1 H2 are nite in the limit ! ! 2. Notice also that we are not introducing more and more freedom, because only particular combinations of these functions appear in the counterterms, and are precisely these combinations which are xed by the renormalization conditions. Also, the function H2 is xed by the renormalization condition on the inverse propagator, which are really two conditions: a) the position of the pole, b) the residuum at the pole. .

Fig. 5.7.6 - Insertion of counterterms in multi-loop diagrams. The previous procedure can be easily extended to higher and higher orders in the expansion in the coupling constant. What makes the theory renormalizable is the fact that the new divergences that one encounters can all be disposed o by modifying the counterterms m2 and . This depends from the fact that the divergent terms are constant or quadratic in the external momentum, and therefore they can be reproduced by operators as '2, @ '@  ' and '4. The non trivial part of 150

the renormalization program is just the last one. In fact consider diagrams of the type represented in Fig. 5.7.6, where we have taken out all the possible external lines. These diagrams contain 1= terms coming from the counterterms times log p2 terms from the loop integration. In the previous two-loop calculation we had no such terms but we got their strict relatives log(m2 =4 2). If present the log p2 terms would be a real disaster since they are non-local in conguration space and as such they cannot be absorbed by local operators. However if one makes all the required subtractions, that is one subtracts correctly also the diagrams of the type in Fig. 5.7.6, it is possible to show that the log p2 terms cancel out. For instance, in the expression for m22 there are not such terms. In fact they cancel in adding up the four two-loop contributions.

151

Chapter 6 The renormalization group 6.1 The renormalization group equations For the following considerations it will be convenient to introduce the one-particle irreducible truncated n-point functions ;(n) (p1  p2     pn) dened by removing the propagators on the external legs and omitting the delta-function expressing the conservation of the four-momentum (2 )2! (2!) ( where

n X i=1

pi);(n) (p1 p2     pn) = G(1nPI) (p1  p2     pn) DF (p) =

Z

Yn i=1

DF (pi);1

d2! xeipx h0jT ('(x)'(0))j0i

(6.1) (6.2)

is the exact euclidean propagator. We have two ways of evaluating these functions, we can start from the original lagrangian and evaluate them in terms of the bare couplings constants (in the case of '4, B and mB ). On the other hand we can express the bare parameters in terms of the renormalized ones, and, at the same time, renormalizing the operators through the wave function renormalization ('B = Z'1=2 '), obtaining the relation ;(Bn) (p1 p2     pn B  mB  !) = Z';n=2;(n) (p1 p2     pn  m  !)

(6.3)

The factor Z';n=2 originates from a factor Z'n=2 coming from the renormalization factors in G(cn) and from n factors Z';1 from the inverse propagators. Of course, in a renormalizable theory the ;(n) 's are nite by construction. In the previous relation we must regard the renormalized parameters as function of the bare ones. In the '4 case, this means  and m depending on B and mB . However the right hand side of the previous equation does not depend on the choice of the parameter . This implies that also the functions ;(n) do not depend on  except for the eld rescaling factor Z';n=2 . These considerations can be condensed into a dierential 152

equation obtained by requiring that the total derivative with respect to  of the right hand side of eq. (6.3) vanishes  @ @ @ @m @ n @ log Z   @ +  @ @ +  @ @m ; 2  @ ' ;(n)(p1  p2     pn  m  !) = 0 (6.4) The content of this relation is that a change of the scale  can be compensated by a convenient change of the couplings and of the eld normalization. Notice that since ;(n) is nite in the limit ! ! 2, it follows that also the derivative of log Z' is nite. It is convenient to dene the following dimensionless coecients  m  @  @ =     !   1  @ log Z' =   m  ! 2 @  d   @m =   m  ! (6.5) m @ m  obtaining  @  @ @  @ +  @ + mm @m ; nd ;(n) (p1 p2     pn  m  !) = 0 (6.6) We can try to eliminate from this equation the term @=@. To this end let us consider the dimensions of ;(n) . In dimensions d = 2! the delta function has dimensions ;2! and dim'] = ! ; 1, therefore

Z Yn

d2! xi ei(p1 x1 +    + pnxn)h0jT ('(x1)    '(xn )j0iconn: i=1 (6.7) has dimensions n(! ; 1) ; 2n! = ;n(! + 1). The propagator DF (p) has dimensions 2(! ; 1) ; 2! = ;2, and we get



dim ;(n) ; 2! = ;n(! + 1) + 2n ! dim ;(n) = n + !(2 ; n) (6.8) and putting  = 4 ; 2!

(6.9) dim ;(n) = 4 ; n + 2 (n ; 2) Therefore ;(n) must be a homogeneous function of degree 4 ; n + (n ; 2)=2 in the dimensionful parameters pi, m e . Introducing a common scale s for the momenta we get (from the Euler theorem)   @ @ @  (6.10)  @ + s @s + m @m ; (4 ; n + 2 (n ; 2)) ;(n) (spi  m  ) = 0 G(cn) (p1     pn) =

153

Since everything is nite for  ! 0 we can take the limit and eliminate @=@ between this equation and eq. (6.6)

 @ @  @ ;s @s +  @ + (m ; 1)m @m ; nd + 4 ; n ;(n)(spi  m  ) = 0

(6.11)

This equation tells us how the ;(n) scale with the momenta, and allows us to evaluate them at momenta spi once we know them at momenta pi. To this end one needs to know the coecients  , m and d. For this purpose one needs to specify the renormalization conditions since these coecients depend on the arbitrary functions F1  G1   . In order to use the renormalization group equations we need to know more about the structure of the functions  , d and m. We have seen that the counterterms are singular expressions in ,therefore it will be possible to express them as a Laurent series in the renormalized parameters

"

B =  a0 +

# 1 X ak

"

k=1

m2B = m2 b0 +

"

Z' = c0 +

(6.12)

k

1 # X bk

(6.13)

k k=1 

1 # X ck

(6.14) k  k=1 with the coecients functions of the dimensionless parameters  and m=. Notice that a0 , b0 and c0 may depend on  but only in an analytical way. Recalling from the previous Section the expressions for the counterterms at the second order in   2 2 3   = 32 2  + G1 (6.15)







2 = 32 2 m2 2 + F1 + 2(16 2)2 m2 42 + 1 (F1 + 3G1 ; 1) + F2 2  2  Z = ; 24(16 2)2  + H2 we get for the bare parameters  32  2   B =   + 32 2  + G1 + O(3 )

m2

154



(6.16) (6.17) (6.18)

m2B

h





= 1 + 32 2 2 + F1 4 1  2  + + (F + 3G1 ; 1) + F2 2(16 2)2 2  1 2 + H i + O(3 ) 2 + 24(16 2

2 )2 

m2





2 2 + H + O(3) Z' = 1 ; 24(16 2 2 2

)  Therefore the coecients are given by 32 G + O(3 ) a0 =  + 32

2 1 32 + O(3) a1 = 16

2 2 1 H ) + O(3) b0 = 1 + 32 2 F1 + 2(16 2)2 (F2 + 12 2

2 2 + O(3 ) b1 = 16 2 + 2(16 2)2 (F1 + 3G1 ; 1) + 12(16

2)2 2 b2 = (162 2)2 + O(3 )

2 H + O(3) c0 = 1 ; 24(16

2)2 2 2 + O(3 ) c1 = ; 12(16

2 )2

(6.19) (6.20) (6.21) (6.22) (6.23) (6.24) (6.25) (6.26) (6.27)

Notice that the dependence on m= comes only from the arbitrary functions dening the nite parts of the counterterms. This is easily understood since the divergent terms originate from the large momentum dependence where the masses can be neglected. This is an essential feature, since it is very dicult to solve the group renormalization equations when the coecients  , m e d depend on both variables  and m=. On the other hand since these coecients depend on the renormalization conditions, it is possible to dene a scheme where they do not depend on m=. Another option would be to try to solve the equations in the large momentum limit where the mass dependence can be neglected. Callan and Szymanzik have considered a dierent version of the renormalization group equations. They have considered the dependence of ;(n) on the physical mass. In that case one can show that  and d depend only on . Furthermore in their equation the terms m and =@ do not appear. In their place there is a inhomogeneous term in the large momentum limit. 155

6.2 Renormalization conditions Let us consider again the renormalization conditions. The renormalization procedure introduces in the theory a certain amount of arbitrariness in terms of the scale  and of the arbitrary functions F1  F2   . Recall that the lagrangian has been decomposed as L = Lp + Lct (6.28) therefore, the nite part of Lct can be absorbed into a redenition of the parameters appearing in L since both have the same operatorial structure. This is another way of saying that the nite terms in Lct are xed once we assign  and m. In turn these are xed by relating them to some observable quantity as a scattering cross-section. Furthermore we have to x a scale  at which we dene the parameters  and m. The renormalization group equations then tell us how to change the couplings when we change , in order to leave invariant the physical quantities. As we have said the procedure for xing m and  is largely arbitrary and we will exhibit various possibilities. To do that it is convenient to write explicitly the expression for the two- and four-point truncated amplitudes   2  (2) ; (p) = 1 ; 24(16 2)2 H2 p2    2  m 2 + m 1 + 32 2 F1 ; (2) + log 4 2 2 F +     + (6.29) 2(16 2)2 2  3   2 m 1 (4)  ; (p1 p2  p3 p4) = ;  1 ; 2 (1) + 2 ; log 2 ; A(s t u) ; G1 32

4  3 (6.30) In the two point function we have omitted the nite terms coming from the second order contributions for which we have given only the divergent part. let us now discuss various possibilities of renormalization conditions (sometimes one speaks of subtraction conditions)  A rst possibility is to renormalize at zero momenta, that is requiring  ;(2) (p) 2 = p2 + m2A (6.31) p 0



;(4) (p1  p2 p3 p4)

pi =0

= ;   A

(6.32)

where m2A and A are quantities to be xed by comparing some theoretical cross-section with the experimental data. Then, comparing with eqs. (6.29) and (6.30) and using A(s t u) = 6 for pi = 0, we get 2 2 H2A = 0 F1A = (2) ; log 4m A2  GA1 = (1) ; log 4m A2 (6.33) 156

Another possibility would be to x the momenta at some physical value in the Minkowski space. For instance we could require ;(;p2 = m! 2 ) = 0 where m! is the physical mass and then x ;(4) with all the momenta on shell.  To choose the subtraction point at zero momenta is generally dangerous for massless particles (in this way one would introduce spurious infrared divergences). As a consequence a popular choice is to subtract at an arbitrary value of the momenta. For instance one requires ;(2) (p) = p2 + m2B  p2 = M 2 (6.34) ;(4) (p1  p2 p3 p4) = ;B  s = t = u = M 2 (6.35) Notice that the point chosen for the four-point function is a non-physical one. In this case we get 2 2 H2B = 0 F1B = (2);log 4m B2  GB1 = (1)+2;log 4m B2 ; 13 A(M 2  M 2  M 2 ) (6.36) This way of renormalize has however the problem of leaving an explicit dependence on the mass in the coecients of the renormalization group equations.  The last way of renormalizing we will consider is due to Weinberg and 't Hooft and the idea is simply to put equal to zero all the arbitrary functions F1  F2    at each order in the expansion. In this way the coecients ai, bi and ci appearing in the Laurent series dening the bare couplings turn out to be mass independent. The Weinberg and 't Hooft prescription allows to evaluate the renormalization group equations coecients in a very simple way. For instance, let us start with the bare coupling # " X 1 a (  ) k (6.37) B =    + k k=1

Dierentiating with respect to  and recalling that B does not depend on this variable, we get " X # " X # 1 1 0 a (  ) @ a (  ) k k 0= + +  @ 1 + (6.38) k k   k=1 k=1 where

k () a0k () = dad (6.39) Since  = @=@ is analytic for  ! 0, for small  we can write  = A + B, obtaining " X # " X # 1 1 0 a (  ) a (  ) k k 0= + + (A + B) 1 + (6.40) k  k k=1

k=1

157

and identifying the coecients of the various powers in 

from which

We then get

(6.41)

a1 + A + Ba01 = 0 ak+1 + Aa0k + Ba0k+1 = 0

(6.42) (6.43)

B = ;

(6.44)



 d





A = ; 1 ;  d a1

(6.45)

d a ;   () = ; 1 ;  d 1

(6.46)

d a  () = ; 1 ;  d 1

(6.47)





and for  ! 0 and

B+=0









d a = dak 1 ;  d a 1 ;  d (6.48) k+1 d d 1 From the expression of the previous Section for a1 at the second order in , we get @ =  () = 32 (6.49)  @ 16 2 This equation can be integrated in order to know how we have to change  when we change the value of the scale . The function () is known as the running coupling constant. Its knowledge, together with the knowledge of m() allows us to integrate easily the renormalization group equations. In fact, let us dene t = log s

(6.50)

and let us consider the following partial dierential equation (we shall see that the renormalization group equations can be brought back to this form) @ F (x  t) =  (x ) @ F (x  t) (6.51) i i @t i @xi i The general solution to this equation is obtained by using the method of the "characteristics". That is one considers the integral curves (the characteristic curves) dened by the ordinary dierential equations dxi(t) =  (x (t)) (6.52) i i dt 158

with given boundary conditions xi(0) = x!i . Then, the general solution is

F (xi t) = (xi (t))

(6.53)

where

(!xi) = F (!xi  0) (6.54) is an arbitrary function to be determined by the condition at t = 0 required for F . In fact we can check that  satises the dierential equation @ (x (t)) = dxi(t) @ =  @ (6.55) @t i dt @xi i @xi In our case the equation contains a non-derivative term @ G(x  t) =  (x ) @ G(x  t) +  (x )G(x  t) (6.56) i i i i @t i @xi i with  (xi ) a given function. By putting

Zt

G(xi t) = e

0

dt0 (xi(t0 ))

F (xi t)

(6.57)

we see that F (xi t) satises eq. (6.51) and therefore the general solution will be

Zt

G(xi t) = e

0

dt0 (xi (t0 ))

(xi(t))

(6.58)

with xi (t) satisfying eq. (6.52). For the ;(n) we get (t = log s) ;(n)(spi  m  ) = ;(n)(pi  m(s) (s) )s4;ne with

;n

Zs 1

0 d((s0)) ds s0

(6.59)

s @@s(s) =  ((s)) (6.60) (6.61) s @m@s(s) = m(s)(m((s)) ; 1) and (s = 1) = , m(s = 1) = m. This result tells us that when we perform a re-scaling on the momenta, the amplitudes ;(n) do not scale only with the trivial factor s4;n due to their dimensions, but that they undergo also a non trivial scaling, due to d 6= 0, which is necessary to compensate the variations of  and m with the scale. 159

6.3 Application to QED In this Section we will apply the previous results to QED. In particular, we want to check that the running coupling obtained via the renormalization group equations is the same that we have dened in Section 2.2. First of all let us start with the QED lagrangian in d = 2! dimensions (neglecting the gauge xing terms)   Z 1  2 ! ^ ; F F (6.62) S2! = d x !(i@^ ; m) ; e!A 4 Since  1 ^ ] = ! ; 1 + 2 ! ; = 3! ; 2 dim!A (6.63) 2

it follows

Therefore we dene

dime] = 2 ; !

(6.64)

enew = eold ()!;2 = eold ;=2 (6.65) As before we write the lagrangian as Lp + Lct, with (we ignore here the gauge xing term) ^ ; 1 F F  Lp = !(i@^ ; m) ; e=2!A (6.66) 4 and ! ; eZ1=2!A ^ ; Z3 F F  (6.67) Lct = iZ2!@^ ; m 4 giving ! ; e(1 + Z1 )=2!A ^ ; 1 + Z3 F F  (6.68) L = i(1 + Z2)!@^ ; (m + m) 4 After re-scaling of the elds one gets back the original lagrangian via, in particular, the identication (1 + Z1) =2 e(1 ; Z =2) =  (6.69) eB = =2e (1 + Z 3 2 )(1 + Z3 =2) where we have used Z1 = Z2. From Section 2.7 we recall that 2 Z3 = C = ; 6 e 2  (6.70) and therefore  e2  = 2 eB =  e 1 + 12 2 (6.71) We proceed as in the previous Section by dening

eB = 

X ! ak =2 e+

160

k

k

(6.72)

with

3 a1 = 12e 2 (6.73) Requiring that eB is -independent, and in the limit  ! 0 we get  d @e 1  (e) =  @ = ; 2 1 ; e de a1 (6.74) from which 3 (6.75)  (e) = 12e 2 The dierential equation dening the running coupling constant is () =  (e) = e3  @e@ (6.76) 12 2 Integrating this equation we obtain 1 = ; 1 log 2 + K (6.77) 2 e (2) 12 2 where K is an integration constant which can be eliminated by evaluating e at another momentum scale Q2. We get 1 ; 1 = ; 1 log Q2 (6.78) (Q2) (2) 3 2 or ( 2 ) (6.79) (Q2 ) = 1 ; (32 ) log Q22 which agrees with the result obtained in Section 2.2.

6.4 Properties of the renormalization group equations Integrating eq. (6.49) we get

d = 32 + O(3 )  d 16 2

(6.80)

; (1) = 163 2 log  ; 1 s s

(6.81)

where s is a reference scale and s = (s). Therefore 1 () = s 3 1 ; 16s2 log s 161

(6.82)

From this expression we see that starting from  = s,  increases with . Suppose that we start with a small  in such a way that the perturbative expansion is sensible. However increasing  we will need to add more and more terms to the expansion due to the increasing of . Therefore the '4 theory allows a perturbative expansion only at small mass scales, or equivalently at large distances. In this theory we can give a denite sense to the asymptotic states. The increasing of  is related to the sign of the  () function. For the '4 theory and for QED the  function is positive, however if it would be negative the theory would become perturbative at large mass scales or at small distances. In this case one could solve the renormalization group equations at large momenta by using the perturbative expansion at a lower order to evaluate the  and the other coecients. However in this case the coupling is increasing at large distances and this would be a problem for our approach based on th in- and out- states. As we shall see this is the situation in QCD. here the elds that are useful to describe the dynamics at short distances (the quark elds) cannot describe the asymptotic states, which can be rather described by bound states of quarks as mesons and baryons. Going back to the eq. (6.82) we see that, assuming the validity of the expression for the  function up to large momenta, the coupling constant becomes innite at the value of  given by 16 2  = se 3s (6.83) which is a very big scale if s is small. The pole in eq. (6.82) is called the Landau pole (see Section 2.2 in the case of QED). Of course there are no motivations why eq. (6.82) should be valid for large values of . Let us now discuss several possible scenarios starting from a theory where  = 0 for  = 0.

 If  () > 0 for any , than the running coupling is always increasing, and it

will become innite at some value of . If this happens for a nite  we say that we have a Landau pole at that scale.  Suppose that  () is positive at small  and that it becomes negative going through a zero at  = F (see Fig. 6.4.1)). The point F is called a xed point since if we start with the initial condition  = F than  remains xed at that value. To study the behaviour of  around the xed point, let us expand the  () around its zero It follows

 ()  ( ; F ) 0(F )

(6.84)

d = ( ;  ) 0( ) +     d F F

(6.85)

162

.

β(λ)

λS

λS

λS λF

λ

Fig. 6.4.1 - The behaviour of  () giving rise to an ultraviolet xed point. from which

d =  0( ) d F  ; F  Integrating this equation we get

 

.

(6.86)

 ; F =   ( F ) (6.87) s ; F s Notice that the sign of  0(F ) plays here a very important role. In the present case the sign is negative since  () > 0 for  < F and  () < 0 for  > F . For large values of  we have that  ! F independently on the initial value s. For this reason F is referred to as a ultraviolet (UV) xed point (see Fig. 6.4.2)). The large scale (or small distance) behaviour of a eld theory of 0

λ

λF

µS

µ

Fig. 6.4.2 - The behaviour of  close to a UV xed point. this type would be dictated by the value of F . In particular if F  1 and s < F we would be always in the perturbative regime. Otherwise, starting from s > F the theory would become perturbative at large scales (see Fig. 6.4.1). About this point, notice that  = 0 is a xed point since  (0) = 0. However this is an infrared (IR) xed point, since the coupling goes to zero for 163

 ! 0 independently on the initial value s. We say also that the xed point is repulsive since  goes away from the xed point for increasing , whereas a UV xed point is said to be attractive.  Let us consider now the case of  () < 0 for small values of  and decreasing monotonically. For instance, suppose  () = ;a2 with a > 0. Integrating the equation for the running coupling we get () = s 1 + a1 log  (6.88) s s In this case  is a decreasing function of  and goes to zero for  ! 1 (that is  = 0 is a UV xed point). This property is known as "asymptoyic freedom" and it holds for non-abelian gauge theories that we will describe later on in the course. these are the only 4-dimensional theories to share this property. In higher dimensions other theories enjoy this property. For instance in d = 6 the theory '3 is asymptotically free. Notice that also this time the running coupling has a pole, but at a smaller scale than s 1  = e; as (6.89) s Therefore the coupling increases at large distances.  Finally let us consider the case of  () < 0 for small values of , becomes zero at F and then positive. Then  0(F ) > 0. By expanding  () around F we get (by the same analysis we did before)

 

 ; F =   ( F ) (6.90) s ; F s and we see that  ! F  ! 0 in a way independent on s, whereas it goes away from s for increasing . Therefore F is an IR xed point (see Fig. 6.4.3)). 0

Let us now derive the perturbative expressions for m and d using the same method followed for  (). *e possibile ricavare. We dierentiate the eq. (6.13) with respect to  (notice that in this renormalization scheme b0 = 1)

"

#

1 1 0 X X @m b @ bk k 2 0 = 2m 1+ +m k @ @ k=1 k k=1 

from which

" X 1 # bk

0 = 2m 1 +

k k=1 

164

+  ()

1 0 X bk k k=1 

(6.91) (6.92)

.

λ

λF

µS

µ

Fig. 6.4.3 - The behaviour of  close to an IR xed point. where we have made use of the denition m = (=m)(@m=@). Using eq. we nd 2m ;  db1 = 0 d   d dbk+1 db k 1; a 2mbk ; 1; d d d or 1 m = 21  db d and   db d db k k+1 1 =0  d bk ; d 1 ;  d a1 ;  dbd Using the eq. (6.24) for b1 we get for m



(6.46)

(6.93) (6.94) (6.95)



2  5  m() = 32 2 ; 12 16 2 + O(3 ) (6.96) Let us now consider Z' (recall that although this quantity is innite in the limit  ! 0, d is nite). By dierentiating eq. (6.14) we nd (c0 = 1)

or

1 0 1 0 X X 1 @Z 1 d c 1 ' k Z'd = 2  @ = 2  d k = 2  () ckk k=1 k=1

X 1 ! ck 1+

1 0 X d = 12 (A + B) ckk

k k=1  Using again the expression for  () (see eq. (6.46) 1 d = ; 12  dc d

165

k=1

(6.97) (6.98)

(6.99)





d a k+1 1 dck  dcd = ck dc ; 1 ;  d d d 1 At the lowest order, using eq. (6.27) we get



1  d = 12 16 2

2

+ O ( 3 )

(6.100)

(6.101)

To conclude this Section, let us study the behaviour of ;(n) when the theory has a UV xed point at  = F . Since for large scales  ! F we may suppose that m and d go to m(F ) e d(F ) respectively. Then, integrating the following equation (see eq. (6.61)) s dm (6.102) ds = m(m (F ) ; 1) we have m(s) = m(1)sm ( F );1 (6.103) Analogously Z s ds0 (6.104) s0 d(F ) = d(F ) log s 1

Then, for large values of s (see eq. (6.59)

;(n)(spi m  ) ! s(4;n;nd ( F )) ;(n) (pi msm ( F );1  F  )

(6.105)

Therefore, if 1 ; m(F ) > 0 we can safely neglect the mass on the right hand side of this equation. From this equation we understand also the reason why d is called the anomalous dimension. In non abelian gauge theories there is a UV xed point at  = 0 where m = 0 and therefore one can neglect the masses at large momenta. To justify why we have assumed that in the evaluation of m and d the only important contribution comes from  = F we notice that the integral Z s ds0 0  (6.106) m ((s )) 0 1 s can be rewritten (using ds=s = d= )

Z (s) m(0) d0

(6.107)  (0) Therefore, barring the case in which m and  have simultaneous zeroes, the integral is dominated by the points where there is a zero of the  function.

166

6.5 The coecients of the renormalization group equations and the renormalization conditions We have already noticed that the coecients of the renormalization group equations depend on the subtraction procedure. In particular, using the rst two renormalization conditions considered in Section 6.2 a mass dependence of the coecients is introduced. To study better this problem, let us consider, for generic renormalization conditions, the expression (6.12) for the bare coupling

B

= 

a0 +

Dierentiating with respect to  we nd

 a0 +

! 1 X ak k=1

k

!

! 1 X ak k=1

(6.108)

k

!

1 0 1 X X @ a @ ( m= ) 0 k +  @ a0 + k +  @ a_ 0 + a_ kk = 0 (6.109) k=1 k=1

where the dot stands for the derivative with respect to m=. Notice also that (see eq. (6.5) m m @  @  =  (m ; 1) (6.110) With a familiar procedure, we put  = A + B, obtaining

a0 + Ba00 = 0 a1 + Aa00 + Ba01 + m  (m ; 1)_a0 = 0 At the second order in   3  3 ;1  3  B = ; 1 + 32 2 G1 1 + 16 2 G1  ; 1 ; 32 2 G1 and  3  3 m 2 3  32 G_ 0 Aa0 = ; 16 2 +  1 ; 32 2 G1 8 2 ;  (m ; 1) 32

2 1

(6.111) (6.112) (6.113) (6.114)

Since the right hand side is at least of the second order in , neglecting higher order terms we get 32 + m 32 G_ (6.115) A = 16

2  32 2 1 and for  ! 0 32 + m 32 G_  = 16 (6.116)

2  32 2 1

We see directly how the  function depends on the subtraction procedure. Similar expression can be obtained for m e d. Therefore, when using subtraction procedure 167

such that the coecients of the renormalization group equations depend explicitly on the masses, it is convenient to look for solutions in a region of momenta where the masses can be neglected. Another possibility is to choose the arbitrary scale  such that m= can be neglected. Of course, changing the renormalization procedure will induce a nite renormalization in the couplings. For instance we get expressions of the type

0 = f

 m  =  + O(2 ) 

(6.117)

Therefore we obtain

0 @ f 0 +  @ (m=) f_ = f 0 + m ( ; 1)f_ =   0 =  @ @ @ @  m

(6.118)

Working in a region where we can neglect the mass we have

 0(0) =  ()f 0()

(6.119)

and we see that a xed point at  = F implies a xed point in 0. Therefore the presence of a xed point does not depend on the scheme used (at least in this approximation). It is possible to show that also the sign of the  function is schemeindependent.

168

Chapter 7 The path integral for Fermi elds 7.1 Fermionic oscillators and Grassmann algebras The Feynman's formulation of the path integral can be extended to eld theories for fermions. However, since the path integral is strictly related to the classical description of the theory, it could be nice to have such a "classical" description for fermionic systems. In fact it turns out that it is possible to have a "classical" (called pseudoclassical) description in a way such that its quantization leads naturally to anticommutation relations. In this theory one introduces anticommuting variables already at the classical level. This type of mathematical objects form the so called Grassmann algebras. Before proceeding further let us introduce some concepts about these algebras which will turn out useful in the following. First of all we recall that an algebra V is nothing but a vector space where it is dened a bilinear mapping V  V ! V (the algebra product). A Grassmann algebra Gn is a vector space of dimensions 2n and it can be described in terms of n generators i (i = 1     n) satisfying the relations

i j + j i  i  j ]+ = 0 i j = 1     n

(7.1)

Then, the elements constructed by all the possible products among the generators form a basis of Gn : 1 i ij  i j k      12    n (7.2) Notice that there are no other possible elements since

i2 = 0

(7.3)

from eq. (7.1). The subset of elements of Gn generated by the product of an even number of generators is called the even part of the algebra, Gn(0) , whereas the subset generated by the product of an odd number of generators is the odd part, Gn(1) . It follows Gn = Gn(0)  Gn(1) (7.4) 169

A decomposition of this kind is called a grading of the vector space, if it is possible to assign a grade to each element of the subspaces. In this case the grading is as follows 1. If x 2 Gn(0) , deg(x) = +1. 2. If x 2 Gn(1) , deg(x) = ;1. Notice that this grading is also consistent with the algebra product, since if x y are homogeneous elements of Gn (that is to say even or odd), then deg(xy) = deg(x)deg(y)

(7.5)

Since we have no time to enter into the details of the classical mechanics of the Grassmann variables, we will formulate the path integral starting directly from quantum mechanics. Let us recall that in the Feynman's formulation one introduces states which are eigenstates of a complete set of observables describing the conguration space of the system. In the case of eld theories one takes eigenstates of the eld operators in the Heisenberg representation

(~x t)j'(~x) ti = '(~x)j'(~x) ti (7.6) this is possible since the elds commute at equal times (~x t) (~y t)]; = 0 (7.7) and as a consequence it is possible to diagonalize simultaneously at xed time the set of operators (~x t). For Fermi elds such a possibility does not arise since they anticommute at equal times. Therefore it is not possible to dene simultaneous eigenstates of Fermi elds. To gain a better understanding of this point let us stay for the moment with free elds, and let us recall that this is nothing but a set of harmonic oscillators. This is true also for Fermi elds, except for the fact that these oscillators are of Fermi type, that is they are described by creation and annihilation operators satisfying the anticommutation relations

h y yi

ai  aj ]+ = ai  aj

+

= 0

h

a  ay

i

i j = ij +

(7.8)

For greater simplicity take the case of a single Fermi oscillator, and let us try to diagonalize the corresponding annihilation operator

aji = ji (7.9) Then, from a2 = 0, as it foillows from eqs. (7.8), we must have 2 = 0. Therefore the eigenvalue  cannot be an ordinary number. However it can be thought as an odd element of a Grassmann algebra. If we have n Fermi oscillators and we require aij1      ni = i j1      ni (7.10) 170

we get

ai (aj j1     ni) = j (ai j1     ni) = j i j1     ni (7.11) But aiaj = ;aj ai and therefore i j ]+ = 0 (7.12) We see that the equations (7.10) are meaningful if we identify the eigenvalues i with the generators of a Grassmann algebra Gn. In this case the equation (7.9) can be solved easily y ji = ea j0i (7.13) where j0i is the ground state of the oscillator. In fact aji = a(1 + ay)j0i = aayj0i = (1 ; aya)j0i = j0i = (1 + ayj0i = ji (7.14) where we have used y ea = 1 + ay (7.15) since 2 = a2 = 0. We have also @ (1 + ay)j0i = @ ji ayji = ay(1 + ay)j0i = ayj0i = @ (7.16) @ Here we have used the left-derivative on the Gn dened by @    =     ;     i1 i i2 i3 ik i2 i i1 i3 ik @i i1 i2 ik +    + (;1)k;1ik ii1 i3    ik 1 (7.17) Suppose we want to dene a scalar product in the vector space of the eigenstates of a (notice that these are vector spaces with coecients in a Grassmann algebra. mathematically such an object is called a modulus over the algebra). Then it is convenient to extend the algebra G1 generated by  to a comples Grassmann algebra, G2 , generated by  and  . The operation "?" is dened as the complex conjugation on the ordinary numbers and it is an involution (automorphism of the algebra with square one) with the property (i1 i2    ik ) = ik    i2 i1 (7.18) In the algebra G2 we can dene normalized eigenstates y ji = ea j0iN (  ) (7.19) ;

where N (  ) 2 G2(0) and the phase is xed by the convention

N  ( ) = N (  ) 171

(7.20)

The corresponding bra will be



hj = N (  )h0je a

(7.21)

the normalization factor N (  ) is determined by the condition 1 = = = =



y

hji = N 2 (  )h0je aea j0i N 2 ( )h0j(1 +  a)(1 + ay)j0i N 2 ( )h0j(1 +  aay)j0i  N 2 ( )(1 + ) = N 2 ( )e;

(7.22)

It follows

 N (  ) = e =2 (7.23) Let us recall some simple properties of a Fermi oscillator. the occupation number operator has eigenvalues 1 and 0. In fact (aya)2 = ayaaya = ay(1 ; aya)a = aya (7.24) from which aya(aya ; 1) = 0 (7.25) Starting from the state with zero eigenvalue2 for aya, we can build up only the eigenstate with eigenvalue 1, ayj0i, since ay j0i = 0. Therefore the space of the eigenstates of aya is a two-dimensional space. By dening the ground state as 0 j0i = 1 (7.26) one sees easily that 0 0 0 1 y a= 1 0  a = 0 0 (7.27)

(one writes down a as the most general 2  2 matrix and then requires aj0i = 0, a2 = 0 and a ay + = 1). The state with occupation number equal to 1 is

1

j1i = 0

In this basis the state ji is given by  y  y)j0i ji = e =2ea j0i = e =2(1 + a    =2  =2    = e (j0i + j1i) = e 1 whereas  hj = e =2   1 ] 172

(7.28)

(7.29) (7.30)

We see that in order to introduce eigenstates of the operator a has forced us to generalize the ordinary notion of vector space on the real or on the complex numbers, to a vector space on a Grassmann algebra (a modulus). In this space the linear combinations of the vectors are taken with coecients in G2 (or more generally in G2n ). Let us now come to the denition of the integral over a Grassmann algebra. To this end we recall that the crucial element in order to derive the path integral formalism from quantum mechanics is the completeness relation for the eigenstates of the position operator Z dqjqihqj = 1 (7.31) What we are trying to do here is to think to the operators a as generalized position operators, and therefore we would like its eigenstates to satisfy a completeness relation of the previous type. Therefore we will "dene" the integration over the Grassmann algebra G2 by requiring that the states dened in eqs. (7.29) and (7.30) satisfy 1 0 Z  d djihj = 0 1  12 (7.32)

where the identity on the right hand side must be the identity on the two-dimensional vector space. Using the explicit representation for the states   Z Z       d djihj = d de 1 ( 1)   Z      = d de      1   1 0  Z = d d  1 +  = 0 1 (7.33) Therefore the integration must satisfy

Z

Z

dd   =

Z

dd   = 1

Z

d d  1 = 0 (7.35) These relations determine the integration in a unique way, since they x the value of the integral over all the basis elements of G2 . The integral over a generic element of the algebra is then dened by linearity. the previous integration rules can be rewritten as integration rules over the algebra G1 requiring d d]+ =  d]+ =  d]+ =  d]+ =   d]+ = 0 (7.36) Then, we can reproduce the rules (7.34) and (7.35) by requiring

Z

d d   =

(7.34)

Z

d   = 1 173

d  1 = 0

(7.37)

For n Fermi oscillators we get

n X

j1      ni = e i=1

with

Z Yn

and integration rules

i=1

   =2 i i

!

n X

e i=1

iayi

j0i

di di j1      nih1     nj = 1

Z

di j = ij

(7.38) (7.39) (7.40)

Assuming also

di j ]+ = di  dj ]+ = 0 (7.41) Notice that the measure d is translationally invariant. In fact if f () = a + b and 2 G1(1) Z Z df () = df ( + ) (7.42) since' Z db = 0 (7.43)

7.2 Integration over Grassmann variables

Let us consider the Grassmann algebra Gn. Its generic element can be expanded over the basis elements as X X X f (1      n) = f0 + f1(i1 )i1 + f2 (11:i2)i1 i2 +    fn(i1     in)i1    in i1

i1 

in

i1 12

(7.44) Of course the coecients of the expansion are antisymmetric. If we integrate f over the algebra Gn only the last element of the expansion gives a non vanishing contribution. Noticing also that for the antisymmetry we have X fn(i1      in)i1    in = n!fn (1     n)1    n (7.45) it follows

i1 

in

Z

dn    d1 f (1     n) = n!fn(1     n) (7.46) An interesting property of the integration rules for Grassmann variables is the integration by parts formula. This can be seen either by the observation that a derivative destroys a factor i in each element of the basis and therefore Z dn    d1 @@ f (1      n ) = 0 (7.47) j 174

either by the property that the integral is invariant under translations (see the previous Section and Section 4.7). Let us now consider the transformation properties of the measure under a change of variable. For simplicity consider a linear transformation X i = aik k0 (7.48) k

The eq. (7.40) must be required to hold in each basis, since it can be seen easily that the previous transformation leaves invariant the multiplication rules. In fact,

i0 j0 = cik cjhk h = ;cik cjhhk == ;cjhcik hk = ;j0 i0 where cij is the inverse of the matrix aij . Therefore we require

Z

di j =

By putting we obtain

di =

Z

dij =

ZX hk

Z

di0 j0 = ij

X k

dk0 bkiajhh0 =

(7.50)

dk0 bki

X hk

(7.49)

(7.51)

bkiajhhk = (ab)ij = ij

(7.52)

from which

bij = (a;1)ij (7.53) The transformation properties of the measure are dn    d1 = detja;1jdn0    d10 (7.54) where we have used the anticommutation properties of di among themselves. We see also that  @  (7.55) detjaj = det  @0  and  @0  ; 1 0 0 dn    d1 = det  @  dn    d1 (7.56) Therefore the rule for a linear change of variables is

Z

d0

n

   d0 f (0 ) = 1

i

Z

dn    d1

 0   @  f (i0 (i))

 det;1  @

This rule should be compared with the one for commuting variables  @x0  Z Z 0 0 0 dx1    dxnf (xi) = dx1    dxndet  @x  f (x0i (xi)) 175

(7.57) (7.58)

We see that the Grassmann measure transforms according to the inverse jacobian rather than with the jacobian itself. As in the commuting case, the more important integral to be evaluated for the path integral approach is the gaussian one X Aij ij =2 Z Z T ij I = dn    d1e  dn    d1e A=2 (7.59) The matrix A is supposed to be real antisymmetric AT = ;A (7.60) As shown in the Appendix a real and antisymmetric matrix can be reduced to the form 2 0  0 0   3 66 ;1 01 0 0    77 66 0 0 0 2    77 As = 66 0 0 ;2 0    77 (7.61)

66  4  

  

    77    5    

  

by means of an orthogonal transformation S . Then, let us do the following change of variables X i = (S T )ij j0 (7.62) j

we get

T A = (S T 0)AS T 0 = 0T SAS T 0 = 0T As0 and using detjS j = 1 Z 0T 0 I = dn0    d10 e As =2 The expression in the exponent is given by 1 0 T A 0 =  0 0 +    +  n 0 0  for n even s 1 1 2 2 n;1 n 2 1 0 T A 0 =  0 0 +    +  n 1 0 0  for n odd s 1 1 2 n;2 n;1 2 2 ;

(7.63) (7.64)

(7.65)

From eq. (7.46) we get contribution nto the integral only from the term in the expansion of the exponential (0T As0) 2 for n even. In the case of n odd we get I = 0 since 0T As0 does not contain n0 . Therefore, for n even Z 0 T A  0 =2 Z 0 0 0  I = dn    d1 e s = dn    d10 ( n1)! ( 12 0T As0 ) n2 2 Z 1 n = dn0    d10 ( n )! ( 2 )!1     n2 10    n0

p2

= 1     n2 = detA

(7.66)

176

where we have used detjAsj = detjAj. Then

Z

p T dn    d1 e A=2 = detA

(7.67)

This result is valid also for n odd since in this case detjAj = 0. A similar result holds for commuting variables. In fact, let us dene

J=

Z

T dx1    dxne;x Ax=2

(7.68)

with A a real and symmetric matrix. A can be diagonalized by an orthogonal transformation with the result Z 0T 0 J = dx01    dx0ne;x Ad x =2 = (2 ) n2 p 1 (7.69) detA Again, the result for fermionic variables is opposite to the result of the commuting case. Another useful integral is

I (A ) =

Z

T T dn    d1 e A=2 +  

(7.70)

with i Grassmann variables. To evaluate this integral let us put

 = 0 + A;1 

(7.71)

We get 1 T A + T  = 1 (0 T ; T A;1)A(0 + A;1) + T (0 + A;1 ) 2 2 = 21 0 T A0 + 21 T A;1  (7.72) where we have used the antisymmetry of A and T 0 = ;0 T . Since the measure is invariant under translations we nd Z p 0T 0 T ;1 T ;1 I (A ) = dn0    d10 e A =2 +  A =2 = e A =2 detA (7.73) again to be compared with the result for commuting variables

Z

T T T ;1 I (A J ) = dx1    dxne;x Ax=2 + J x = eJ A J=2 p(2 )

n 2

detA

(7.74)

The previous equations can be generalized to complex variables

Z

y

d1 d1    dndn e A = detjAj 177

(7.75)

and

Z

y y y y ;1 d1 d1    dndn e A +   +   = e; A  detjAj

(7.76)

The second of these equations follows from the rst one as in the real case. we will prove the rst one for the case n = 1. Starting from

Z

and putting

T d2 d1e A=2 = A12

(7.77)

1 = p1 ( + ) 2 = ; pi ( ; ) 2 2

we get

(7.78)

1 T A = ;i A  (7.79) 12 2 and recalling that (remember that here one has to use the inverse of the jacobian)

d2 d1 = idd we get

Z

(7.80)

Z

T  d2d1 e A=2 = i dde;i A12 

Z

and

 dde (;iA12 ) = ;iA12

(7.81) (7.82)

7.3 The path integral for the fermionic theories In this Section we will discuss the path integral formulation for a system described by anticommuting variables. Considering a Fermi oscillator, one could start from the ordinary quantum mechanical description and then by using the completeness in terms of Grassmann variable, it would be possible to derive the corresponding path integral formulation. We will give here only the result of this analysis. It turns out that the amplitude among eigenstates of the Fermi annihilation operator is given by

h0 t0j ti =

Z  (t )= 

0

(t)=t

where and

0

t

0

0 0  D( )e (t )=2 +  (t)=2 + iS

D( ) =

S=

Z t i 0

t

2

Y

dd

( _ ; _) ; H ( ) 178

(7.83) (7.84)



dt

(7.85)

where, for the Fermi oscillator H =  . The expression is very similar to the one obtained in the commuting case if we realize that the previous formula is the analogue in the phase space using complex coordinates of the type q ip. It should be noticed the particular boundary conditions for the  variables originating from the fact that the equations of motion are of the rst order. Correspondingly we can assign only the coordinates at a particular time. In any case, since at the end we are interested in eld theory where only the generating functional is relevant and where the boundary conditions are arbitrary (for instance we can take the elds vanishing at t = 1), and in any case do not aect the dependence of the generating functional on the external source. The extension to the Fermi eld is done exactly as we have done in the bosonic case, and correspondingly the generating functional for the free Dirac theory will be Z Z !] i !(i@/ ; m) + ! +  Z  !] = D( !)e (7.86) where  and ! are the external sources (of Grassmann character). Using the integration formula (7.73) we get formally ;(i!) i@/ ;;i m (i) ;i! i@/ ;1 m  Z  !] = Z 0]e = Z 0]e (7.87) where the determinant of the Dirac operator has been included in the term at zero source. In this expression we have to specify how to take the inverse operator. We know that this is nothing but the Dirac propagator dened by (i@/ ; m)SF (x ; y) = 4 (x ; y) (7.88) It follows Z ;i d4xd4 y!(x)SF (x ; y)(y) Z  !] = Z 0]e (7.89) with Z d4k /k + m Z d4k 1 ; ikx e = lim e;ikx SF (x) = lim !0+ (2 )4 k2 ; m2 + i !0+ (2 )4 /k ; m + i (7.90) The Green's functions are obtained by dierentiating the generating functional with respect to the external sources (Grassmann dierentiation). In particular, the two point function is given by 1 Z  = 1 h0jT ((x)!(y))j0i = iSF (x ; y) (7.91)  Z 0] !(x)(y) ==0 h0j0i The T -product in this expression is the one for the Fermi elds, as it follows from the derivation made in the bosonic case, taking into account that the classical eld of this case are anticommuting. We will not insist here further on this discussion, but all we have done in the bosonic case can be easily generalized for Fermi elds. 179

Chapter 8 The quantization of the gauge elds 8.1 QED as a gauge theory Many eld theories possess global symmetries. These are transformations leaving invariant the action of the system and are characterized by a certain number of parameters which are independent on the space-time point. As a prototype we can consider the free Dirac lagrangian L0 = !(x)i@/ ; m](x) (8.1) which is invariant under the global phase transformation (x) ! 0(x) = e;iQ (x)

(8.2)

If one has more than one eld, Q is a diagonal matrix having as eigenvalues the charges of the dierent elds measured in unit e. For instance, a term as !2 1 , with  a scalar eld, is invariant by choosing Q(1 ) = Q() = 1, and Q(2 ) = 2. This is a so called abelian symmetry since e;i Q e;iQ = e;i( +  )Q = e;iQ e;i Q (8.3) It is also referred to as a U (1) symmetry. The physical meaning of this invariance lies in the possibility of assigning the phase to the elds in an arbitrary way, without changing the observable quantities. This way of thinking is in some sort of contradiction with causality, since it requires to assign the phase of the elds simultaneously at all space-time points. It looks more physical to require the possibility of assigning the phase in an arbitrary way at each space-time point. This invariance, formulated by Weyl in 1929, was called gauge invariance. The free lagrangian (8.1) cannot be gauge invariant due to the derivative coming from the kinetic term. The idea is 180

simply to generalize the derivative @ to a so called covariant derivative D having the property that D  transforms as , that is D (x) ! D (x)]0 = e;iQ (x) D (x) (8.4) In this case the term

!  D (8.5) will be invariant as the mass term under the local phase transformation. To construct the covariant derivative, we need to enlarge the eld content of the theory, by introducing a vector eld, the gauge eld A , in the following way D = @ + ieQA (8.6) The transformation law of A is obtained from eq. (8.4) (@ + ieQA )]0 = (@ + ieQA0 )0(x) = (@ + ieQA0 )e;iQ (x)    1 ; iQ ( x ) 0 = e @ + ieQ(A ; e @ )  (8.7) from which (8.8) A0 = A + 1e @ The lagrangian density !  A L = !iD/ ; m] = !i  (@ + ieQA ) ; m) = L0 ; eQ (8.9) is then invariant under gauge transformations, or under the local group U (1). We see also that by requiring local invariance we reproduce the electromagnetic interaction as obtained through the minimal substitution we discussed before. In order to determine the kinetic term for the vector eld A we notice that eq. (8.4) implies that under a gauge transformation, the covariant derivative undergoes a unitary transformation D ! D0 = e;iQ (x) D eiQ (x) (8.10) Then, also the commutator of two covariant derivatives D  D ] = @ + ieQA  @ + ieQA ] = ieQF

(8.11)

F = @ A ; @ A

(8.12)

F ! e;iQ (x) F eiQ (x) = F

(8.13)

with transforms in the same way

181

The last equality follows from the commutativity of F with the phase factor. The complete lagrangian density is then L = L + LA = !i  (@ + ieQA ) ; m] ; 41 F F  (8.14) The gauge principle has automatically generated an interaction between the gauge eld and the charged eld. We notice also that gauge invariance prevents any mass term, 21 M 2 A A . Therefore, the photon eld is massless. Also, since the local invariance implies the global ones, by using the Noether's theorem we nd the conserved current as @ L  =  ! (Q ) j = @ (8.15)  from which, eliminating the innitesimal parameter , ! Q J =  (8.16)

8.2 Non-abelian gauge theories The approach of the previous section can be easily extended to local non-abelian symmetries. We will consider the case of N Dirac elds. The free lagrangian

L0 =

N X ! a=1

a (i@/ ; m)a

(8.17)

is invariant under the global transformation +(x) ! +0(x) = A+(x)

(8.18)

where A is a unitary N  N matrix, and we have denoted by + the column vector with components a . In a more general situation the actual symmetry could be a subgroup of U (N ). For instance, when the masses are not all equal. So we will consider here the gauging of a subgroup G of U (N ). The elds a (x) will belong, in general, to some reducible representation of G. Denoting by U the generic element of G, we will write the corresponding matrix Uab acting upon the elds a as

U = e;i A T  A

U 2G

(8.19)

where T A denote the generators of the Lie algebra associated to G, Lie(G), (that is the vector space spanned by the innitesimal generators of the group) in the fermion representation. The generators T A satisfy the algebra T A T B ] = ifCAB T C 182

(8.20)

where fCAB are the structure constants of Lie(G). For instance, if G = SU (2), and we take the fermions in the fundamental representation,

 

+ = 1 2 we have

T A = 2  A

A = 1 2 3

(8.21) (8.22)

where A are the Pauli matrices. In the general case the T A's are N  N hermitian matrices that we will choose normalized in such a way that Tr(T AT B ) = 21 AB (8.23) To make local the transformation (8.19), means to promote the parameters A to space-time functions A ! A(x) (8.24) Notice, that now the group does not need to be abelian, and therefore, in general A A A A e;i A T e;iA T 6= e;iA T e;i A T

(8.25)

Let us now proceed to the case of the local symmetry by dening again the concept of covariant derivative

D +(x) ! D +(x)]0 = U (x)D (x)] We will put again

D = @ + igB where B is a N  N matrix acting upon +(x). In components

(8.26) (8.27)

Dab = ab @  + ig(B )ab

(8.28)

D + ! (@ + igB0 )U (x)+ = U (x)@ + + U (x)U ;1 (x)igB0 U (x)]+ + (@ U (x))+ = U (x)@ + U ;1 (x)igB0 U (x) + U ;1 (x)@ U (x)]+

(8.29)

U ;1 (x)igB0 U (x) + U ;1 (x)@ U (x) = igB

(8.30)

B0 (x) = U (x)B (x)U ;1 (x) + gi (@ U (x))U ;1 (x)

(8.31)

The eq. (8.26) implies

therefore and

183

For an innitesimal transformation we get

U (x)  1 ; i A(x)T A

(8.32)

B (x) = ;i A (x)T A B (x)] + g1 (@ A(x))T A

(8.33)

(B  )ab  AA(T A)ab

(8.34)

Since B (x) acquires a term proportional to T A, the transformation law is consistent with a B linear in the generators of the Lie algebra, that is The transformation law for A becomes

AC = fCAB AAB + g1 @  C

(8.35)

The dierence with respect to the abelian case is that the eld undergoes also a homogeneous transformation. The kinetic term for the gauge elds is constructed as in the abelian case. In fact the quantity D D ]+  igF + (8.36) in virtue of the eq. (8.26), transforms as + under gauge transformations, that is (D D ]+)0 = igF0 +0 = igF0 U (x)+ = U (x)(D  D ]+) = U (x) (igF ) +

(8.37)

This time the tensor F is not invariant but transforms homogeneously, since it does not commute with the gauge transformation as in the abelian case

F0 = U (x)F U ;1 (x)

(8.38)

The invariant kinetic term will be assumed as LA = ; 21 TrF F  ] Let us now evaluate F

(8.39)

or in components

igF = D D ] = @ + igB @ + igB ] = ig(@B ; @ B ) ; g2B  B ]

(8.40)

F = (@ B ; @ B) + igB  B ]

(8.41)

F  = FC T C

(8.42)

184

with

FC = @  AC ; @  AC ; gfCAB AAAB (8.43) The main feature of the non-abelian gauge theories is the bilinear term in the previous expression. Such a term comes because fCAB 6= 0, expressing the fact that G is not abelian. The kinetic term for the gauge eld, expressed in components, is given by X LA = ; 41 FA FA (8.44) A Therefore, whereas in the abelian case LA is a free lagrangian (it contains only quadratic terms), now it contains interaction terms cubic and quartic in the elds. The physical motivation lies in the fact that the gauge elds couple to everything which transforms in a non trivial way under the gauge group. Therefore they couple also to themselves (remember the homogeneous piece of transformation). To derive the equations of motion for the gauge elds, let us consider the total action Z ! i@/ ; m)+ ; g+! B  + + SA d4x +( (8.45) V

where

and the variation of SA

Z 1 SA = ; 2 d4x Tr(F F  ) V SA = ;

Z V

d4xTr(F F  )

(8.46) (8.47)

Using the denition (8.41) for the eld strength we get

F = @ B + ig(B)B + igB(B ) ; ( $  ) from which

SA = ;2

Z V

d4xTrF  (@ B + ig(B)B + igB (B ))]

(8.48) (8.49)

where we have taken into account the antisymmetry properties of F . Integrating by parts we obtain

SA = ;2 = 2 =

Z

Z

Z V

V

V

d4xTr;(@ F  )B ; igBF  B + igF  B B ]

d4xTr (@ F  + igB F  ]) B ]

d4 x (@ F  + igB F  ])A AA 185

(8.50)

where we used the cyclic property of the trace. By taking into account also the free term for the Dirac elds and the interaction we nd the equations of motion @ F A + igB F  ]A = g+!   T A+ (i@/ ; m)+ = gB  + (8.51) From the rst equation we see that the currents +! T A+ are not conserved. In fact the conserved currents turn out to be JA = +!  T A+ ; iB   F ]A (8.52) The reason is that under a global transformation of the symmetry group, the gauge elds are not invariant, said in dierent words they are charged elds with respect to the gauge elds. In fact we can verify immediately that the previous currents are precisely the Noether's currents. Under a global variation we have AC = fCAB AAB  + = ;i A T A+ (8.53) and we get j  = @@+L + + @A@ L AC (8.54) from which



 C

j  = +!   AT A+ ; FC fCAB AAB (8.55) In the case of simple compact Lie groups one can dene f ABC = fCAB with the property f ABC = f BCA. It follows FC f ABC AB T A = iB   F ]  iB   F ]AT A (8.56) Therefore j  = +!   AT A+ ; iB  F ]A A (8.57) After division by A we get the Noether's currents (8.52). The contribution of the gauge elds to the currents is also crucial in order they are conserved quantities. In fact, the divergence of the fermionic contribution is given by @  (+! T A+) = ;ig+! T AB + + ig+! B  T A+ = ;ig+! T A B ]+ (8.58) which vanishes for abelian gauge elds, whereas it is compensated by the gauge elds contribution in the non abelian case.

8.3 Path integral quantization of the gauge theories As we have seen at the beginning of this course that the quantization of the electromagnetic eld is far from being trivial, the diculties being related to the gauge 186

invariance of the theory. In this Section we would like to discuss how these diculties arise also in the path integral quantization. To this end let us recall the general scheme of eld quantization in this approach. First of all we will deal only with the perturbative approach, meaning that the only thing that one needs to evaluate is the generating functional for the free case. The free case is dened by the quadratic part of the action. Therefore the evaluation of the path integral reduces to calculate a Gaussian integral of the type given in eqs. (7.73) and (7.73) for the fermionic and bosonic case respectively. For instance, in the bosonic case one evaluates Z T Ax=2 + J T x T A;1 J=2 (2 ) n2 ; x J p (8.59) I (A J ) = dx1    dxne =e detA In our case the operator A appearing in the gaussian factor is nothing but the wave operator. The result of the integration is meaningful only if the operator A is nonsingular (it needs to have an inverse) otherwise the integral is not well dened. This is precisely the point where we run into problems in the case of a gauge theory. Remember from eq. (1.132) that the wave operator for the electromagnetic eld in momentum space is given by (;k2 g + k k )A (k) = 0

(8.60)

We see that the operator has a null eigenvector k , since (;k2 g + k k )k = 0

(8.61)

and therefore it is a singular operator. This is strictly related to the gauge invariance, which in momentum space reads

A (k) ! A (k) + k(k) (8.62) In fact, we see that the equations of motion are invariant under a gauge transformation precisely because the wave operator has k as a null eigenvector. Another way of seeing this problem is the following: in order to get physical results, as to evaluate S matrix elements, it is necessary to integrate over gauge-invariant functionals

Z

D(A)F A ]eiS

(8.63)

with F A ] = F A ] (here A is the gauge transformed of A). But being the action and the functional F gauge invariant, it follows that the integrand is invariant along the gauge group orbits. The gauge group orbits are dened as the set of points, in the eld space, which can be reached by a given A via a gauge transformation. In other words, given A , its orbit is given by all the elds A obtained by varying the parameters of the gauge group (that is varying '), see Fig. 8.3.1. For instance, in the abelian case A = A + @ ' (8.64) 187

.

AΩ µ Aµ

Ω A'µ A'µ

Fig. 8.3.1 - The orbits of the gauge group in the eld space. Since the orbits dene equivalence classes (or cosets) we can imagine the eld space as the set of all the orbits. Correspondingly we can divide our functional integral in a part parallel and in one perpendicular to the orbits. Then, the reason why our functional integral is not well dened depends from the fact that the integrand along the space parallel to the orbits is innite (the integrand being gauge invariant does not depend on the corresponding variables). However this observation suggests a simple solution to our problem. We could dene the integral by dividing it by the integral along the part parallel to the orbit (notice that is the same as the volume of the gauge group). In order to gain a clear understanding of this problem let us consider the following very simple example in two dimensions Z +1 ; 21 a2(x ; y)2 Z= dxdye (8.65) ;1

This is as reducing the space-time two two points with x and y the possible values of the eld in those points. At the same time the exponent can be seen as an euclidean action with the (gauge) symmetry

x = x + ' y = y + '

(8.66)

The integral (8.65) is innite due to this invariance. In fact this implies that the integrand depends only on x ; y, and therefore the integration over the remaining variable gives rise to an innite result. In order to proceed along the lines we have described above, let us partition the space of the eld, that is the two-dimensional space (x y) in the gauge group orbits. Given a point (x0  y0), the group orbit is given by the set of points (x = x + ' y = y + ') corresponding to the line x ; y = x0 ; y0, as shown in Fig. 8.3.2 We can perform the integral by integrating 188

. y

(x Ω, y Ω)

(x 0 , y 0) x

f(x,y) = 0 x+y=c

Fig. 8.3.2 - The eld space (x y) partitioned in the orbits of the gauge group. We show also the line x + y = c and the arbitrary line f (x y) = 0 (dashed).

along the perpendicular space, the line x + y = c (c is a constant), and then along the parallel space. It is this last integral which gives a divergent result, since we are adding up the same contribution an innite number of times. therefore the obvious way to do is to introduce as integration variables x ; y and x + y, integrating only on x ; y. Or, said in a dierent way, perform both integrals and dividing up by the integral over x + y, which cancels the divergence at the numerator. This can be done in a more systematic way by using the following identity

Z +1



1

d' 2 (x + y) + ' Then we multiply Z per by this factor obtaining 1=

Z=

Z

;1

2Z 1  ; 1 a2(x ; y)2 3 5 d' 4 dxdy 2 (x + y) + ' e 2

(8.67)

(8.68)

The integral inside the parenthesis is what we are looking for since it is evaluated along the line x + y = constant. Of course the result should not depend on the choice of this constant (it corresponds to translate the integration line in a way parallel to the orbits). In fact, by doing the change of variables

x ! x + '

y !y+' 189

(8.69)

we get

Z=

Z

2Z  ; 1 a2(x ; y)2 3 1 5 d' 4 dxdy 2 (x + y) e 2

(8.70)

As promised the innite factor is now factorized out and we see also clearly that this nothing but the volume of the gauge group (' parameterize the gauge group, which in the present case is isomorphic to R1). Then, we dene our integral as 1  ; 1 a2(x ; y)2 Z Z 0 Z = V = dxdy 2 (x + y) e 2 (8.71) G

with

VG =

Z

d'

(8.72)

In this way the integral is dened by integrating over the particular surface x + y = 0. Of course we would get the same result integrating over any other line (or surface in the general case) having the property of intersecting each orbit only once. For instance, let us choose the line (see Fig. 8.3.2)

f (x y) = 0

(8.73)

Then, proceeding as in eq. (8.67) we consider the following identity, which allows us to dene a gauge invariant function f (x y) 1 = f (x y)

Z

Z

  d' f (x  y ) = f (x y) d' f (x + ' y + ')]

(8.74)

In the previous case we had f (x y) = (x + y)=2 = 0 giving rise to f (x y) = (x + y)=2 + ' = 0, from which f (x y) = 1. Multiplying Z by the expression (8.74) we get 3 1 Z 2Z 2 2 ; a (x ; y) 5 Z = d' 4 dxdyf (x y) f (x + ' y + ')] e 2 (8.75) Let us show that f (x y) is gauge invariant. In fact by the transformation x ! x + '0 , y ! y + '0 we obtain 1 = f

(x + '0  y + '0)

Z

d' f (x + ' + '0  y + ' + '0 )]

(8.76)

and changing the integration variable ' ! ' + '0 1 = f

(x + '0  y + '0)

Z

190

d' f (x + ' y + ')]

(8.77)

Comparison with eq. (8.74) gives f (x y) = f (x + '0  y + '0)

(8.78)

The integral inside the parenthesis in eq. (8.75) does not depend on ' (that is is gauge invariant). This can be shown exactly as we did before in the case f (x y) = (x + y)=2. Therefore we can again factorize the integration over ', being the sum of innite gauge copies 3 1 Z 2Z 2 2 ; a (x ; y ) 5 Z = d' 4 dxdyf (x y) f (x y)] e 2 (8.79) and dene

1 Z = Z dxdy (x y) f (x y)] e; 2 a2 (x ; y)2 (8.80) f VG The function f (x y) (called the Faddeev-Popov determinant) can be calculated directly from its denition. To this end notice that in (8.79), f (x y) appears multiplied by f (x y)]. As a consequence, we need to know f only at points very close at the surface f (x y) = 0. But in this case, the equation f (x y) = 0 has the only solution ' = 0, since by hypothesis the surface f (x y) crosses the gauge orbits only once. Therefore we can write f (x y)] =  @f 1 '] (8.81) @ ' =0 giving rise to   (8.82) f (x y) = @f @ ' =0 where f  = f (x y). Clearly f represents the answer of the gauge xing f = 0 to an innitesimal gauge transformation. As a last remark, notice that our original integral can be written as Z ; 1 xiAij xj Z = d2 xe 2 (8.83)

Z0 =

with





1 ;1 (8.84) ;1 1 a singular matrix. As we have already noticed the singularity of A is related to the gauge invariance, and the solution we have given here solves also the problem of the singularity of A. In fact, integrating over x ; y only, means to integrate only on the eigenvectors of A corresponding to non zero eigenvalues.

A = a2

191

Since the solution we have found to our problem is of geometrical type, it is very easy to generalize it to the case of a gauge eld theory, abelian or not. We start by choosing a surface in the gauge eld space given by fB (AB ) = 0 B = 1     n (8.85) where n is the number of parameters of LieG. Also, this surface should intersect the gauge orbits only once. To write down the analogue of equation (8.74) we need also a measure on the gauge group. This can be obtained as follows. We have seen that we are interested in the solutions of fB (A ) = 0 (8.86) around fB (A) = 0, therefore it is enough to know the measure around the identity of the gauge group. In such a case the transformation ' can be written as '  1 + i A (x)T A (8.87) Since at the rst order in the group parameters ''0  1 + i( A + A0 )T A (8.88) we see that the invariant measure of the group, around the identity, is given by

d(') =

Y Ax

d A(x)

(8.89)

It follows that for innitesimal transformations d(''0 ) = d('0') = d(') next we dene the Faddeev-Popov determinant as in (8.74)

Z

1 = f A ] d(')fA(A )]

(8.90) (8.91)

In this expression we have introduced a functional delta-function, dened by the identity Z 1 = d(')'] (8.92) having the usual properties extended to the functional case. We can see that f is a gauge invariant functional, since ;1 A 

f

1

0;

]=

Z

d(')fA(A( 

0;

and changing variables '0 ' = '00 we get ;1(a f



1

0;

)=

Z

d('00'0 )f

 A (A

192

00

)] =

Z

1

) ]

d(')fA(A )]

(8.93) (8.94)

Consider now the naive path integral

Z=

Z

Z

D(A )F A]e

i LAd4x

(8.95)

where F is a gauge invariant functional. Since in the following F will not play any particular role we will discuss only the case F = 1, but all the following considerations are valid for an arbitrary gauge invariant F . We multiply Z by the identity (8.91) Z 2 3 4 Z Z i LAd x Z = d(') 64 D(A )e fA(A )]f A ]75 (8.96) The functional measure is invariant under gauge transformations A ! 'A ';1 + gi (@ ')';1 (8.97) since the homogeneous part has determinant one, and the inhomogeneous part gives rise to a translation. Then by performing the change of variables A ! A , and using the gauge invariance of D(A), LA and f A ], we get

Z=

Z

Z 2 3 4 Z i LAd x d(') 64 D(A)e fA(A)]f A ]75

Again, we dene the functional integral as

Z

(8.98)

Z

i LAd4x Z Z Z ! V = R d(') = D(A)e fA(A )]f A] (8.99) G The evaluation of f goes as before. Since in the previous equation it appears multiplied by fA(A )], the only solution of fA(A ) = 0 in this neighborhood is ' = 1, or A = 0. therefore   )  f ( A A     A] (8.100) fA(A )] = det;1    B A =0  from which   fA(A )    f A ] = det    =0 (8.101) B

A

Once again, the Faddeev-Popov determinant f A ] is the answer of the gauge-xing to an innitesimal gauge transformation. The nal expression that we get for Z is

Z=

Z

 i Z L d4x    A (A )   e A D(A )fA(A)]det  f  B A =0  193

(8.102)

This expression gives the right measure to be used in the path integral quantization of the gauge elds. In particular, the vacuum expectation value (v.e.v.) of a gauge eld functional will be dened as Z  h0jT (F A])j0if = D(A)f ]f eiS F A] (8.103) where the index f species the gauge in which we evaluate the v.e.v. of F . About this point, we can prove an important result, that is: if F A] is a gauge invariant functional F A] = F A] (8.104) than its v.e.v. does not depend on the gauge xing fA = 0. We will prove this result by showing the relation between the v.e.v.'s of F A] in due dierent gauges. Consider the identity

h0jT (F A])j0i

f1

=

Z

Z

D(A)f1 ]f1 eiS F A] d(')f2]f2

(8.105)

where we have used eq. (8.91)and dened f2A]  f2 A]. By change of variable A ! A , we get

h0jT (F A])j0i

f1

= =

Z Z d(') D(A)f1 Z Z

;

1

]f1 eiS F A 1 ]f2]f2 ;





d(') D(A)eiS f2 ]f2 f1]f1 F A] (8.106)

where in the second line we have made a further change of variables ' ! ';1 and used the invariance of d('). Then

h0jT (F A])j0i

f1

=

Z

d(')h0jT (f1]f1 F A])j0if2

(8.107)

This is the relation we were looking for. If F is gauge invariant we get at once

h0jT (F A])j0if = h0jT (F A])j0if 1

2

(8.108)

For the following we will dene a generating functional

Zf A] =

Z

Z

i d4xAAA iS D(A)f ]f e e

(8.109)

which is not gauge invariant, but in terms of which we can easily construct the v.e.v.'s of gauge invariant quantities, since it is built up in terms of the correct functional measure. For the purpose of building up the perturbative theory it is convenient to write the integrand of eq. (8.109) in the form exp(iSe ), where Se is an eective action 194

taking into account the gauge xing f = 0 and the Faddeev-Popov determinant. We will discuss later the exponentiation of f . As far as f ] it is concerned, we can exponentiate it by using its Fourier representation, or, more conveniently by choosing the gauge condition in the form fA(A) ; BA(x) = 0, with BA(x) a set of arbitrary functions independent on the gauge elds. Then

 (f A] ; B )    f A]   A A     A   = f A] (f ;B) A] = det  = det    B  B A =0 A =0 and therefore Z 1 = f A] d(')fA ; BA]

(8.110) (8.111)

We can further multiply this identity by the following expression which does not depend on the gauge elds Z X i ; 2 d4x BA2 (x) Z A (8.112) constant = D(B )e We get

Z X i ; 2 d4x (fA(x))2 Z A constant = f A] d(')D(B )e (fA ; BA) Z X ; 2i d4x (fA(x))2 Z A (8.113) = f A] d(')e

Multiplying this expression by the functional (8.95)we can show as before that the gauge volume factor factorizes, allowing us to dene a class of generating functionals Z X Z i d4x(LA ; 21 (fA)2 ) i d4xA AA Z A Z A] = D(A)f A]e e (8.114)

Z X 1 Sf = ; 2 d4x (fA(x))2 A we nd the analogue of eq. (8.107) is

Finally, dening

h0jT (F A])j0i

f1

=

Z



d(')h0jT (f1 eiSf1 F A])j0if2

(8.115)

(8.116)

showing again that the v.e.v. a gauge invariant functional gives rise to a gauge invariant v.e.v. for the corresponding operator. 195

8.4 Path integral quantization of QED Let us now discuss the path integral quantization of QED in a covariant gauge. We start from the lagrangian L = ! i  (@ + ieA ) ; m]  ; 41 F F  (8.117) with F = @ A ; @ A (8.118) invariant under (x) ! 0(x) = e;ie (x) (x) A (x) ! A0 (x) = A (x) + 1e @ (x) (8.119) The Lorentz gauge is dened by and with

f (A) = @  A (x) = 0

(8.120)

f (A ) = @  A (x) + 1e  (x)

(8.121)

f (A )  = 1 x 4(x ; y) (8.122)   (y) =0 e Then the Faddeev-Popov determinant turns out to be eld-independent and it can be absorbed into the normalization of the generating functional, which is given by Z  ! ] = N

Z

D( !)D(A )e

i

Z

d4xL

Z

@

! + A ]d4x i ! + 

A ]e

We extract the interaction term in the usual way Z  1    1   1   ;ie d4x ; i    i  i ! Z  !  ] = e Z0 !  ] with

Z0 !  ] = N Writing

Z

i

Z

D( !)D(A)e

SA = ; 14

Z

F

F  d4x =

d4xL 1 2

Z

196

0

(8.123) (8.124)

Z

@

! + A ]d4x i ! + 

A ]e

d4x A (g  ; @  @  )A

(8.125) (8.126)

using @ A] and integrating over the Fermi elds we get

Z0 ! ] = e;ih!(x)SF (x ; y)(y)i i hA (x)A (x)i Z   N D(A)@ A ]e 2  eih (x)A (x)i (8.127) Then we exponentiate the @ A ] through its Fourier transform

@

A ] =

Z

 D(C )eihC (x)@ A (x)i =

Z

 D(C )e;ih@ C (x)A (x)i

(8.128)

obtaining the following expression for Z0 Z0 !  ] = e;ih!(x)SF (x ; y)(y)i Z ih 12 A (x)A (x) + ( (x) ; @ C (x))A (x)i (8.129) N D(A)D(C )e Doing the gaussian integral over A we get

Z0 !  ] = e;ih!(x)SF (x ; y)(y)i i 1 Z hi( (x) ; @ C (x))  i( (y) ; @  C (y))i 2 xy N D(C )e i h 1  i ; = e;ih!(x)SF (x ; y)(y)ie 2   Z ih@ C 1  i ; 2i h@ C 1 @  C i (8.130)  D(C )e After integrating by parts, we can rewrite the integral over C (x) as Z ;ihC 1 @ i + 2i hCC i D(C )e

(8.131)

and doing again this gaussian integral we nd

    i @@ 1 h ; i @  (+i) ; i @  i = e 2 h  2  i    e2 

The nal result is





i h g ; @ @  i ;  2 Z0  ! ] = Z 0]e;ih!(x)SF (x ; y)(y)ie 2 197

(8.132)

(8.133)

Introducing the function

Z d4k  g G = lim ; (2 )4

!0+

we write



+ 2kk 2 e;ikx 2 k + i (k + i)

; 2i h(x)G (x ; y) (y)i ; i h  ! ( x ) S ( x ; y )  ( y ) F Z0  ! ] = Z 0]e e

(8.134)

(8.135)

The photon propagator in this gauge is given by 1 h0jT (A (x)A (x))j0i = iG (x ; y) (8.136)    h0j0i Notice that this propagator is transverse, that is it satises the Lorentz condition @  G (x) = 0 (8.137) The Feynman's rules in this gauge are the same discussed in Section 1.5.3, except for th photon propagator which is  g  k   k ;i k2 + i ; (k2 + i)2 (8.138) Of course, the additional term in the propagator proportional to kk does not aect the physics since the photon is coupled to a conserved current. We can perform an analogous calculation but using the gauge xing in the form f ; B discussed at the end of the previous Section, that is using the generating functional (8.114)  Z  1 4  2 Z i d x L ; 2 (@ A ) Z  !  ] = N D( !)D(A)e

Z

 e

! +  A

d4x ! +  

(8.139) In this case the only eect of the gauge xing is to change the lagrangian L to L0 = L ; (@ A )2=(2 ). Of course, the addition of this term removes the problem of the singular wave operator. In fact we can write   Z Z 1 1 4  4 0 (8.140) d xL = 2 d xA g  ; (1 ;  )@ @ A Therefore we have to evaluate the functional integral  Z 1    4 Z i d x 2 A K A +  A Z0 !  ] = Ne;ih!(x)SF (x ; y)(y)i D(A )e (8.141) 198

with obtaining

1@ @ K = g  ;  ;  

(8.142)

; 2i h (x)G (x ; y) (y)i ; i h  ! ( x ) S ( x ; y )  ( y ) i F Z  !  ] = Z 0]e e (8.143) where

K G(x) =  4(x) (8.144) We can see that K is non singular for  6= 1. Going to momentum space, eq. (8.144) gives  Z d4k Z d4k   ; 1 2  ; ikx  =  (2 )4 e;ikx (8.145) (2 )4 ;g k +  kk G (k)e or    ; 1 2 ;g k +  kk G(k) =  (8.146)

This can be solved by writing the most general second rank symmetric tensor function of k G (k) = Ag + Bk k (8.147) with the coecients such that  ; 1 A ; 1 Bk2 = 0 (8.148) ;Ak2 = 1   Solving these equations G (k) = ; gk2 + (1 ;  ) kkk4  (8.149) The singularities are dened by the usual i term, and we get  Z d4k  g k  k G (x) = lim ; k2 + i + (1 ;  ) (k2 + i)2 e;ikx (8.150) !0+ (2 )4 The discussion made at the beginning of this Section corresponds to  = 0 (the so called Landau gauge). In fact for  ! 0 Z i ; 2 d4x(@ A )2 ! @ A] (8.151) lim e  !0 The choice corresponding to the quantization made in Section 1.5.3 is  = 1. In this case G (x) = ;g F (x m2 = 0) (8.152) 199

This is called the Feynman's gauge. As we see the parameter  selects dierent covariant gauges. As already stressed the physics does not depend on  since the ! , which is conserved in the abelian case, photon couples to the fermionic current  and as a consequence the  dependent part of the propagator, being proportional to kk does not contribute. The situation is quite dierent in the non-abelian case where the fermionic current is not conserved since the gauge elds are not neutral. Therefore we need further contributions in order to cancel the  dependent terms. Such contributions arise from the Faddeev-Popov determinant.

8.5 Path integral quantization of the non-abelian gauge theories In the case of a non-abelian gauge theory there are several dierences with respect to QED. The pure gauge lagrangian is not quadratic, it contains three-linear and fourlinear interaction terms and furthermore the Faddeev-Popov determinant typically depends on the gauge elds. In this Section we will consider the pure gauge case, since the fermionic coupling is completely analogue to the abelian case. Separating the lagrangian in the quadratic part plus the rest, we get X LA = ; 41 FC FC C ; ;  1 = ; 4 @ AC ; @ AC ; gfCAB AA AB @  AC ; @  AC ; gfCDE AD AE 1 2 AB DE   AB   = L(2) A + gfC AA AB @ AC ; 4 g fC fC AA AB AD AE I  L(2) (8.153) A + LA (I ) where L(2) A and LA are the quadratic and the interacting part respectively. The interaction terms can be extracted by the functional integration by the usual trick   (I ) 1  iSA i A  Z0 A ] Z ] = e (8.154)

R

with SA(I ) = d4xL(AI ) and

Z

"

#

1 X(@ A )2 Z 4  A i d4 x Z 2 A  A i d xA A Z0 ] = N D(A)e e f A ] (8.155) Consider now f A ]. In the Lorentz gauge

L(2) A ;

fAA ] = @ AA 200

(8.156)

Under an innitesimal gauge transformation (for later convenience we have changed the denition of the gauge parameters, sending A ! g A) AC = gfCAB A AB + @  C (8.157) we have fC A ] = @ AC + gfCAB @ ( AAB ) +  C (8.158) Then fC A (x)]   2 4 BA 4  B (y) A =0 = BC x (x ; y) + gfC @ ( (x ; y)AA) = MCB (x ; y) (8.159) We see that f A] depends explicitly on the gauge elds. In order to get an expression for the generating functional which is convenient r deriving the Feynman rules, it is useful to give to it an exponential form. This can be done, recalling from eq. (7.76)) that a determinant can be written as a gaussian integral over Grassmann variables. Z   Z i cA(x)MAB (x ; y)cB (y)d4xd4 y fAA (x)]     f A ] = det   B (y) A =0 = N D(c c )e (8.160) The Grassmann elds cA(x) are called the ghost elds. It should be noticed that the ghost carry zero spin and therefore in their particle interpretation lead to a violation of the spin-statistics theorem. However this fact has no consequence, since our generating functional do not generate amplitudes with external ghosts. The action for the ghost elds can be read from the previous equation and we get

Z





SFPG = d4xcA (x) AB  ; gfBAC @ AC cB (x) (8.161) The ghosts are zero mass particles interacting with the gauge elds through the interaction term Z (I ) SFPG = ;g d4xcAfBAC @ AC cB (8.162) In order to deal with this interaction it turns out convenient to dene a new generating functional where we introduce also external sources for the ghost elds    1 1  (I ) 1  ( I ) iSA i  iSFPG ; i   i   e Z   ] = e Z0  ] (8.163) with Z  Z0   ] = D(A )D(c c)

Z

"

1 X(@ A )2 + AA i d4x L(2) ;  A A 2 A  A e Z

i d4 x cAcA + cAA + A cA  e 201

#

(8.164)

The integration over AA like in the abelian case. For the ghost eld we use eq. (7.76) with the result  i  A )i A Z A 1 Ai ;h ( i ) ; ( i    A A ; i h    D(c c)eihcAcA + cA +  cAi = e =e (8.165) Dening Z d4k e;ikx AB (x) = ;AB lim (8.166) !0+ (2 )4 k2 + i we get

; 2i hA(x)GAB (x ; y)B (y)i ;ihA (x)AB (x ; y)B (y)i e (8.167) ] = e

  

Z0 with

 G AB = AB G (x ; y ) It is then easy to read the Feynman's rules. For the propagators we get

(8.168)

. k µ, A

ν, B

Fig. 8.5.1 - Gauge eld propagator: ;iAB . .





1 g ; (1 ;  ) kk2 +ki k2 + i

(8.169)

k A

B

Fig. 8.5.2 - Ghost eld propagator: .

1 iAB k2 + i

(8.170)

The Feynman's rules for the vertices are obtained by taking the Fourier transform of the interaction parts (here we have written fCAB  fABC ). 202

. µ, A

k1 k3 λ, C

ν, B

k2

Fig. 8.5.3 - Three-linear gluon vertex: gfABC (2 )44 (k1 + k2 + k3) g (k1 ; k2) + g (k2 ; k3) + g (k3 ; k1) ] (8.171) . µ, A

ρ, D

k1

k4 k3 ν, B

λ, C

k2

Fig. 8.5.4 - Four-linear gluon vertex: 4 h X

;ig2 (2 )44 (

i=1

ki) fABE fCDE (g g ; g g)

i

+ fACE fBDE (g g  ; g  g) + fADE fCBE (g g ; g g )

(8.172)

. µ, A k

p

q

B

C

Fig. 8.5.5 - Ghost-gluon vertex: ;gfABC (2 )44(k + p ; q)p 203

(8.173)

When we have fermions coupled to the gauge elds we need to add the rule for the vertex (the propagator is the usual one). By a simple comparison with the coupling in QED we see that in this case the rule for the vertex is . µ, A k

p'

p

γ, c

β, b

Fig. 8.5.6 - Fermion-gluon vertex: ;igTbcA() (2 )44 (p ; p0 ; k)

(8.174)

8.6 The  -function in non-abelian gauge theories We will skip a careful treatment of the renormalization in non-abelian gauge theories. However, since all the interaction terms have dimensions  4, the theory is renormalizable by power counting. On the other hand we will evaluate the beta function, since in these theories there is the possibility of having  < 0, giving rise to asymptotic freedom. As it is known, QCD the theory of strong interactions is in fact a gauge theory based on the gauge group SU (3), and it is precisely ths feature of being an asymptotically free theory that has made it very attractive phenomenologically. In non-abelian gauge theories there are several ways to dene the coupling, in fact we can look at the fermionic coupling, at the trilinear or at the four-linear couplings. If the theory is gauge invariant (and the renormalization should preserve this feature), all these ways should give the same result. This is in fact ensured by a set of identities among the renormalization constants (Ward identities), which are the analogue of the equality Z1 = Z2 in QED. Therefore we will study the denition coming from the fermionic coupling. In this case the relation between the bare coupling and the renormalized one is the same as in QED (see Section 6.3)) gB = =2g Z11=2 (8.175) Z2Z3 where Z1 is the vertex renormalization constant, and Z@ and Z3 are the wave function renormalization constants for the fermion and the gauge particle respectively. 204

As we shall see, one of the dierences with QED is that now Z1 6= Z2. As a consequence we will need to evaluate all the three renormalization constants. Let us start with the fermion self-energy. The one-loop contribution is given in Fig. 8.6.1. . k p

p

p _k

Fig. 8.6.1 - One-loop fermion self-energy. We will work in the Feynman-'t Hooft gauge, corresponding to the choice  = 1 for the gauge eld propagator (see eq. (8.169)). The Feynman rules for this diagram are the same as in QED except for the group factors. In order to avoid group theory complications, we will consider the case of a gauge group SU (2) with fermions in the spinor representation, where all the group factors are easily evaluated. At the same time we will exhibit also the general results for a theory SU (N ) with fermions in the fundamental representation (that is the one of dimensions N ). Then, recalling the expression for the electron self-energy in QED (see eq. (2.166)), we get (p) =

X A

X

(T AT A)

For SU (2) we have

QED (p) =

X A

(T AT A)

A

ab

(T AT A)

=





1 (^p ; 4m) + f (p) QED 8 2

X A A  A

2 2

ab

= 34 ab

(8.176)

(8.177)

In the case of a group SU (N ) and fermions in the fundamental representation it is possible to show that X A A (T T )ab = C2(F )ab (8.178) with

A

2;1 N C2(F ) = 2N (8.179) P This result is justied by the observation that the operator A T AT A (called the Casimir operator) commutes with all the generators and therefore it must be proportional to the identity matrix. From the previous expression for  we nd immediately (see Section 2.7) 2 Z2 = 1 ; 8g 2 C2(F ) (8.180)

205

. =

+

+ a)

+

+

+ c)

b)

+ d)

+

+ e)

f)

Fig. 8.6.2 - One-loop expansion of the vacuum polarization diagram in non-abelian gauge theories.

The next task is the evaluation of Z3. To this end we need the one-loop expansion of the gauge particle propagator (the vacuum polarization diagram). This is given in Fig. 8.6.2. Since we are using dimensional regularization we can see immediately that the last three diagrams (diagrams d), e) and f)) of Fig. 8.6.2 are zero (this is not true in other regularizations). In fact all these diagrams involve the integral

Z d2! k

(8.181) k2 which, in dimensional regularization gives (including a mass term) Z d2! k ! 2 !;1 (8.182) k2 ; a2 = ;i ;(1 ; !)(a ) By taking the limit a ! 0, we see that for ! > 1 the integral vanishes. Let us begin our calculation from the gauge particle contributions to the vacuum polarization. The kinematics is specied in Fig. 8.6.3. We will evaluate the truncated Green's function which diers by a factor ig2 from the vacuum polarization vacuum  as dened in Section 2.1. We have a) ig2(AB

Z d2! k 1 ;i ;i 2 ADC DBC = 2 (;g) f f E  2 ! (2 ) (p + k)2 k2 206

(8.183)

The factor 1=2 is a symmetry factor associated to the diagram and the tensor E comes from the trilinear vertices. It is given by





E = g (p ; k) + g (2k + p) ; g (2p + k)  g (p ; k) ; g(2p + k ) + g (2k + p) ] = g (p ; k)2 + (2p + k)2 + 2!(2k + p) (2k + p) ; (2p + k) (2k + p) ; (2k + p)(p ; k) + (p ; k) (2p + k) ](8.184) . p+k C, ρ A, µ p

D, σ

p

B, ν

k

Fig. 8.6.3 - One-loop gauge particle contribution to the vacuum polarization. The group factor is easily evaluated for SU (2), since in this case f ABC = ABC . Then we get ADC DBC = ;2AB (8.185) The general result for SU (N ) is (taking into account that f ABC are completely antisymmetric) f ADC f DBC = ;C2(G)AB (8.186) with C2(G) = N (8.187) This result can be understood by noticing that (T A)BC = if ABC gives a representation of the Lie algebra of the group (the adjoint representation), and therefore C2(G) is nothing but the Casimir of the adjoint. By using the standard trick we reduce the two denominators in the integral to a single one 1 1 = k2 (k + p)2

Z1 0

dz (k02 + z(11 ; z)p2 )2

with

(8.188)

k = k0 ; pz (8.189) Operating this substitution into the tensor E , taking into account that the integration over the odd terms in k0 gives zero, and that for symmetry we can make the substitution k0 k0 ! g2! k02 (8.190) 207

we get (by calling k02 with k2 )

E







= g 1 ; 21! + g p2 (1 + z)2 + (2 ; z)2 ; 

+ p p 2!(1 ; 2z)2 ; 2 (1 + z)2 + (2 ; z)(1 ; 2z)  a k2 + b k2 6

(8.191)

Performing the integral over k with the formulas of Section 2.5 and putting everything together, we get

! Z 1

1 i

2 = ; g (;C2 (G))AB 2! dz !z(1 ; z)p2 ;(1 ; !)a + ;(2 ; !)b 2 (2 ) 0 (8.192) By taking only the 1= term ( = 4 ; 2!) in the expansion and integrating over z we get  19  2 11 g (a)  2   2 ig AB = i 16 2 C2 (G)AB 6 g p ; 3 p p (8.193) As we see this contribution is not transverse, that is it is not proportional to (p p ; g p2 ), as it should be. in fact, we will see that adding to this diagram the contribution of the ghost loop we recover the transverse factor. The ghost contribution is the part b) of the expansion of Fig. 8.6.2. The relevant kinematics is given in Fig. 8.6.4. . a) ig2(AB

p+k D A, µ p

p

B, ν

C k

Fig. 8.6.4 - One-loop ghost contribution to the vacuum polarization. We have b) ig2(AB

= (;1)g2f ACD f BDC

Z d2! k

i

i

(p + k) k (2 )2! (p + k)2 k2

(8.194)

By the standard procedure we get (k = k0 ; pz)) b) ig2(AB

Z 1 Z d2! k k0 k0 ; pp z(1 ; z) dz 2 (G)AB

= ;g 2 C

0

(2 )2! (k02 + z(1 ; z)p2 )2

208

(8.195)

where we have eliminated the terms linear in k0. Then, we perform the integral over k0 1  Z1 !

(b) 2 2 2 ig AB = ;ig (2 )2! C2(G)AB dz z(1 ; z) 2 p ;(1 ; !) ; pp ;(2 ; !) 0 (8.196) We perform now the integral in z. Then taking the limit ! ! 2, and keeping only the leading term, we get 1  2 1 g (b) 2 2 ig AB = i 16 2  C2(G)AB 6 p g + 3 p p (8.197) Putting together this contribution with the former one we get 2 5

g (a) (b) 2 ig (AB + AB ) = ;i 8 2 3 C2 (G)AB p p ; g p2 (8.198) The fermionic loop contribution is given in Fig. 8.6.5, and except for the group factor is the same evaluated in Section 2.6. . p+k

A, µ p

p

B, ν

k

Fig. 8.6.5 - One-loop fermion contribution to the vacuum polarization. The leading contribution is

2 c) = ig 2 Tr(T AT B )QED = i g n  p p ; g p2

ig2(AB  12 2 F AB   

(8.199)

where we have used the conventional normalization for the group generators in the fermionic representation (8.200) Tr(T AT B ) = 12 nF AB where nF is the number of fundamental representations. For instance, in the case of QCD, we have three dierent kind of quarks corresponding to dierent &avors. Then, the one-loop leading term contribution to the vacuum polarization is given by a) + (b) + (c) ) ig2AB = ig2((AB AB AB  2 5

2 g (8.201) = ;i 8 2  3 C2(G) ; 3 nF pp ; g p2 209

By comparison with Section 2.7 we get





2 g Z3 = 1 + 8 2 53 C2(G) ; 32 nF (8.202) We are now left with the vertex corrections. The fermion contribution is given by the diagram of Fig. 8.6.6. .

A p' + k

p+k

B

B

p'

k

p

Fig. 8.6.6 - Fermion one-loop vertex correction. Again, this is the same as in QED except for the group factors. the leading contribution is 3 g A 3 ;ig f = ;i 8 2  (T B T AT B )  (8.203) The group factor can be elaborated as follows

T B T AT B = T B (T B T A + if ABC T C ) = C2(F)T A + if ABC T B T C = C2(F )T A ; 1 f ABC f BCD T D = C2 (F ) ; 1 C2(G) T A(8.204) 2 2 Therefore we obtain





3 = ;i g 2 C2 (F ) ; 1 C2(G) T A  (8.205) 8  2 The nal diagram to be evaluated is the vertex correction from the trilinear gauge coupling given in Fig. 8.6.7. We get (q = p0 ; p) Z d2! k ik^ ;i 3 A B ;ig g = (;ig T ) (2 )2! k2 (p0 ; k)2  (;g)f ABC g (q + p0 ; k) ; g(p + p0 ; 2k) + g(p ; k ; q) ]  (p ;;ik)2 (;igT C ) (8.206)

;ig3A f

210

.

A, µ

p' - k

p-k

B

C k p

p'

Fig. 8.6.7 - One-loop vertex correction from the trilinear gauge coupling. Since we are interested only in the divergent piece, the calculation can be simplied a lot by the observation that the denominator goes as (k2 )3, whereas the numerator goes at most as k2 . Therefore the divergent piece is obtained by neglecting all the external momenta. Therefore the expression simplies to Z d2! k  k^(g k ; 2gk + gk ) 3 A 3 ABC B C ;ig g = ig f T T (2 )2! (8.207) (k2 )3 The group factor gives (8.208) f ABC T B T C = 2i f ABC f BCD T D = 2i C2(G)T A whereas the numerator gives simply ;3k2  . Then we have Z d2! k 3 3 3 g 3 A A  ;ig g = ; (2 )2! 2 C2(G)T  (k2 )2 = ;i 8 g 2  32 C2(G)T A  (8.209) Adding up the two vertex contributions we obtain 3 A ) = ;i g (C (F ) + C (G)) T A  (8.210) ;ig3A = ;ig3 (A +  2 g f 8 2  2 Again, comparison with Section 2.7 gives 2 Z1 = 1 ; 8g 2 (C2 (F ) + C2 (G)) (8.211) Using eq. (8.175) we have   (8.212) gB = =2g(1 + Z1 ; Z2 ; 12 Z3) = =2 g + a1 and collecting everything together we get  3  11 g 2 a1 = ; 16 2 3 C2(G) ; 3 nF (8.213) 211

The evaluation of  (g) goes as in the QED case (see Section 6.3), that is





() = ; 1 1 ; g d a = ; 1 (a ; 3a ) = a  (g) =  @g@ 1 1 2 dg 1 2 1 that is





3 2 g C  (g) = ; 16 2 11 2 (G) ; nF 3 3 In the case of QCD where the gauge group is SU (3) we have

C2(G) = 3



and therefore

3 QCD(g) = ; 16g 2 11 ; 23 nF We see that QCD is asymptotically free ( (g) < 0) for nF < 33 2

212

(8.214) (8.215) (8.216)



(8.217) (8.218)

Chapter 9 Spontaneous symmetry breaking 9.1 The linear -model In this Section and in the following we will study, from a classical point of view, some eld theory with particular symmetry properties. We will start examining the linear -model. This is a model for N scalar elds, with a symmetry O(N ). The lagrangian is given by N N X X 1 1  2 L = 2 @ i@ i ; 2  ii ; 4 i=1 i=1

X N i=1

ii

!2

(9.1)

This lagrangian is invariant under linear transformations acting upon the vector ~ = (1     N ) and leaving invariant its norm

X j~j2 = ii N

i=1

(9.2)

Consider an innitesimal transformation (from now on we will omit the index of sum over the indices which are repeated) i = ij j (9.3) The condition for letting the norm invariant gives j~ + ~j2 = j~j2 (9.4) from which ~  ~ = 0 (9.5) or, in components, iij j = 0 (9.6) This is satised by ij = ;ji (9.7) 213

showing that the rotations in N dimensions depend on N (N ; 1)=2 parameters. For a nite transformation we have j~0j2 = j~j2 (9.8) with 0i = Sij j (9.9) implying SS T = 1 (9.10) In fact, by exponentiating the innitesimal transformation one gets S = e (9.11) with

T = ; (9.12) implying that S is an orthogonal transformation. The matrices S form the rotation group in N dimensions, O(N ). The Noether's theorem implies a conserved current for any symmetry of the theory. In this case we will get N (N ; 1)=2 conserved quantities. It is useful for further generalizations to write the innitesimal transformation in the form i = ij j = ; 2i AB TijAB j  i j = 1     N A B = 1     N (9.13) which is similar to what we did in Section 3.3 when we discussed the Lorentz transformations. By comparison we see that the matrices T AB are given by TijAB = i(iAjB ; jAiB ) (9.14) It is not dicult to show that these matrices satisfy the algebra T AB  T CD ] = ;iAC T BD + iAD T BC ; iBD T AC + iBC T AD (9.15) This is nothing but the Lie algebra of the group O(N ), and the T AB are the innitesimal generators of the group. Applying now the Noether's theorem we nd the conserved current j = @ (@@L ) i = ; 2i iAB TijAB j (9.16) i since the N (N ; 1)=2 parameters AB are linearly independent, we nd the N (N ; 1)=2 conserved currents JAB = ;ii TijAB j (9.17) In the case of N = 2 the symmetry is the same as for the charged eld in a real basis, and the only conserved current is given by J12 = ;J21 = 1 2 ; 21 (9.18) 214

One can easily check that the charges associated to the conserved currents close the same Lie algebra as the generators T AB . More generally, if we have conserved currents given by jA = ;iiTijAj (9.19) with T A T B ] = if ABC T C (9.20) then, using the canonical commutation relations, we get QA QB ] = if ABC QC with

QA

=

Z

d3x j0A(x) = ;i

Z

d3x _ iTijAj

(9.21) (9.22)

A particular example is the case N = 4. We parameterize our elds in the form ~ = ( 1  2 3 ) = (~  ) (9.23) These elds can be arranged into a 2  2 matrix

M =  + i~  ~

(9.24)

where ~ are the Pauli matrices. Noticing that 2 is pure imaginary, 1 and 3 real, and that 2 anticommutes with 1 and 3, we get

M = 2 M  2

(9.25)

j~j2 = 2 + j~ j2 = 21 Tr(M yM )

(9.26)

M ! LMRy

(9.28)

Furthermore we have the relation

Using this it is easy to write the lagrangian for the -model in the form ;  L = 41 Tr(@M y @  M ) ; 14 2Tr(M yM ) ; 161  Tr(M yM ) 2 (9.27) This lagrangian is invariant under the following transformation of the matrix M where L and R are two special (that is with determinant equal to 1) unitary matrices, that is L R 2 SU (2). The reason to restrict these matrices to be special is that only in this way the transformed matrix satisfy the condition (9.25). In fact, if A is a 2  2 matrix with detA = 1, then

2AT 2 = A;1 215

(9.29)

Therefore, for M 0 = LMRy we get

2 M 0 2 = 2 L M  RT 2 = 2 L 2 (2M  2 )2 RT 2

(9.30)

2L 2 = 2 LyT 2 = Ly;1 = L 2 RT 2 = R;1 = Ry

(9.31)

and from (9.29)

since the L and R are independent transformations, the invariance group in this basis is SU (2)L SU (2)R. In fact this group and O(4) are related by the following observation: the transformation M ! LMRy is a linear transformation on the matrix elements of M , but from the relation (9.26) we see that M ! LMRy leaves the norm of the vector ~ = (~  ) invariant and therefore the same must be true for the linear transformation acting upon the matrix elements of M , that is on  and ~ . Therefore this transformation must belongs to O(4). This shows that the two groups SU (2) SU (2) and O(4) are homomorphic (actually there is a 2 to 1 relationship, since ;L and ;R dene the same S as L and R). We can evaluate the eect of an innitesimal transformation. To this end we will consider separately left and right transformations. We parameterize the transformations as follows L  1 ; 2i ~L  ~  R  1 ; 2i ~R  ~ (9.32) then we get LM = (; 2i ~L  ~ )M = (; 2i ~L  ~ )( + i~  ~ ) = 21 ~L  ~ + 2i (~L ^ ~ ; ~L )  ~ (9.33) where we have used i j = ij + iijk k (9.34) Since L M = L + iL~  ~ (9.35) we get L = 21 ~L  ~  L~ = 21 (~L ^ ~ ; ~L) (9.36) Analogously we obtain R  = ; 12 ~R  ~  R~ = 12 (~R ^ ~ + ~R ) (9.37) The combined transformation is given by  = 21 (~L ; ~R )  ~  ~ = 21 (~L + ~R ) ^ ~ ; (~L ; ~R )] (9.38) 216

and we can check immediately that  + ~  ~ = 0 (9.39) as it must be for a transformation leaving the form 2 + j~ j2 invariant. Of particular interest are the transformations with ~L = ~R  . In this case we have L = R and M ! LMLy (9.40) These transformations span a subgroup SU (2) of SU (2)L SU (2)R called the diagonal subgroup. In this case we have  = 0 ~ = ~ ^ ~

(9.41) We see that the transformations corresponding to the diagonal SU (2) are the rotations in the 3-dimensional space spanned by ~ . These rotations dene a subgroup O(3) of the original symmetry group O(4). From the Noether's theorem we get the conserved currents (9.42) jL = 21  ~L  ~ + 21 ~   (~L ^ ~ ; ~L ) and dividing by L =2 J~L = ~ ; ~  ; ~  ^ ~

(9.43) and analogously J~R = ;~ + ~   ; ~  ^ ~

(9.44) Using the canonical commutation relations one can verify that the corresponding charges satisfy the Lie algebra of SU (2)L SU (2)R QLi QLj] = iijk QLk  QRi  QRj ] = iijk QRk  QLi QRj ] = 0 (9.45) By taking the following combinations of the currents (9.46) J~V = 21 (J~L + J~R ) J~A = 12 (J~L ; J~R ) one has J~V = ~ ^ ~  (9.47) and J~A = ~ ; ~   (9.48) The corresponding algebra of charges is QVi  QVj ] = iijk QVk  QVi  QAj] = iijk QAk QAi QAj] = iijk QVk (9.49) These equations show that QVi are the innitesimal generators of a subgroup SU (2) of SU (2)L SU (2)R which is the diagonal subgroup, as it follows from QVi  j ] = iijk k  QVi  ] = 0 (9.50) 217

In the following we will be interested in treating the interacting eld theories by using the perturbation theory. As in the quantum mechanical case, this is well dened only when we are considering the theory close to a minimum the energy of the system. In fact if we are going to expand around a maximum the oscillation of the system can become very large leading us outside of the domain of perturbation theory. In the case of the linear -model the energy is given by

Z

Z

"X N

#

"

#

Z N X @ L 1 3 _ ~ ij2 ) + V (j~j2) i ; L = d x 2 (_ 2i + jr H= _ i=1 i=1 @ i (9.51) Since in the last member of this relation the rst two terms are positive denite, it follows that the absolute minimum is obtained for constant eld congurations, such that @V (j~j2) = 0 (9.52) @i Let us call by vi the generic solution to this equation (in general it could happen that the absolute minimum is degenerate). Then the condition for getting a minimum is that the eigenvalues of the matrix of the second derivatives of the potential at the stationary point are denite positive. In this case we dene new elds by shifting the original elds by i ! 0i = i ; vi (9.53) The lagrangian density becomes L = 21 @ 0i@  0i ; V (j~0 + ~vj2) (9.54) Expanding V in series of 0i we get 2 V  @ 1 2 (9.55) V = V (j~vj ) + 2 @ @  ~=~v 0i0j +    i j This equation shows that the particle masses are given by the eigenvalues of the second derivative of the potential at the minimum. In the case of the linear -model we have V = 21 2 j~j2 + 4 (j~j2)2 (9.56) Therefore @V = 2 +  j~j2 (9.57) i i @i In order to have a solution to the stationary condition we must have i = 0, or 2 (9.58) j~j2 = ;  d3x H =

d3 x

218

This equation has real solutions only if 2 = < 0. However, in order to have a potential bounded from below one has to require  > 0, therefore we may have non zero solutions to the minimum condition only if 2 < 0. But, notice that in this case, 2 cannot be identied with a physical mass, these are given by the eigenvalues of the matrix of the second derivatives of the potential at the minimum and they are positive denite by denition. We will study this case in the following Sections. In the case of 2 > 0 the minimum is given by i = 0 and one can study the theory by taking the term (j~j2)2 as a small perturbation (that is requiring that both  and the values of i, the &uctuations, are small). The free theory is given by the quadratic terms in the lagrangian density, and they describe N particles of common mass m. Furthermore, both the free and the interacting theories are O(N ) symmetric.

9.2 Spontaneous symmetry breaking In this Section we will see that the linear -model with 2 < 0, is just but an example of a general phenomenon which goes under the name of spontaneous symmetry breaking of symmetry. This phenomenon lies at the basis of the modern description of phase transitions and it has acquired a capital relevance in the last years in all eld of physics. The idea is very simple and consists in the observation that a theory with hamiltonian invariant under a symmetry group may not show explicitly the symmetry at the level of the solutions. As we shall see this may happen when the following conditions are realized:  The theory is invariant under a symmetry group G.  The fundamental state of the theory is degenerate and transforms in a non trivial way under the symmetry group. Just as an example consider a scalar eld described by a lagrangian invariant under parity P :  ! ; (9.59) The lagrangian density will be of the type L = 21 @ @   ; V (2) (9.60) If the vacuum state is non degenerate, barring a phase factor, we must have

P j0i = j0i

(9.61)

since P commutes with the hamiltonian. It follows

h0jj0i = h0jP ;1PP ;1P j0i = h0jPP ;1j0i = ;h0jj0i 219

(9.62)

from which

h0jj0i = 0

(9.63) Things change if the fundamental state is degenerate. This would be the case in the example (9.60), if 2 (9.64) V (2) = 2 2 + 4 4 with 2 < 0. In fact, this potential has two minima located at

r

2 v = ;  (9.65) By denoting with jRi e jLi the two states corresponding to the classical congurations  = v, we have P jRi = jLi 6= jRi (9.66) Therefore hRjjRi = hRjP ;1PP ;1P jRi = ;hLjjLi (9.67) which now does not imply that the expectation value of the eld vanishes. In the following we will be rather interested in the case of continuous symmetries. So let us consider two scalar elds, and a lagrangian density with symmetry O(2) L = 21 @ ~  @  ~ ; 21 2~  ~ ; 4 (~  ~)2 (9.68) where ~  ~ = 21 + 22 (9.69) For 2 > 0 there is a unique fundamental state (minimum of the potential) ~ = 0, whereas for 2 < 0 there are innite degenerate states given by j~j2 = 21 + 22 = v2 (9.70) with v dened as in (9.65). By denoting with R() the operator rotating the elds in the plane (1 2), in the non-degenerate case we have R()j0i = j0i (9.71) and h0jj0i = h0jR;1RR;1 Rj0i = h0j j0i = 0 (9.72) since  6= . In the case 2 < 0 (degenerate case), we have R()j0i = ji (9.73) where ji is one of the innitely many degenerate fundamental states lying on the circle j~j2 = v2. Then (9.74) h0jij0i = h0jR;1()R()iR;1()R()j0i = hji ji

 = v

220

with

i = R()iR;1() 6= i (9.75) Again, the expectation value of the eld (contrarily to the non-degenerate state) does not need to vanish. The situation can be described qualitatively saying that the existence of a degenerate fundamental state forces the system to choose one of these equivalent states, and consequently to break the symmetry. But the breaking is only at the level of the solutions, the lagrangian and the equations of motion preserve the symmetry. One can easily construct classical systems exhibiting spontaneous symmetry breaking. For instance, a classical particle in a double-well potential. This system has parity invariance x ! ;x, where x is the particle position. The equilibrium positions are around the minima positions, x0 . If we put the particle close to x0 , it will perform oscillations around that point and the original symmetry is lost. A further example is given by a ferromagnet which has an hamiltonian invariant under rotations, but below the Curie temperature exhibits spontaneous magnetization, breaking in this way the symmetry. These situations are typical for the so called second order phase transitions. One can describe them through the Landau free-energy, which depends on two dierent kind of parameters:  Control parameters, as 2 for the scalar eld, and the temperature for the ferromagnet.  Order parameters, as the expectation value of the scalar eld or as the magnetization. The system goes from one phase to another varying the control parameters, and the phase transition is characterized by the order parameters which assume dierent values in dierent phases. In the previous examples, the order parameters were zero in the symmetric phase and dierent from zero in the broken phase. The situation is slightly more involved at the quantum level, since spontaneous symmetry breaking cannot happen in nite systems. This follows from the existence of the tunnel eect. Let us consider again a particle in a double-well potential, and recall that we have dened the fundamental states through the correspondence with the classical minima x = x0 ! jRi x = ;x0 ! jLi

(9.76)

But the tunnel eect gives rise to a transition between these two states and as a consequence it removes the degeneracy. In fact, due to the transition the hamiltonian acquires a non zero matrix element between the states jRi and jLi. By denoting with H the matrix of the hamiltonian between these two states, we get

   H= 0 1 1 0

221

(9.77)

The eigenvalues of H are

(0 + 1  0 ; 1) (9.78) We have no more degeneracy and the eigenstates are (9.79) jS i = p1 (jRi + jLi) 2 with eigenvalue ES = 0 + 1 , and jAi = p1 (jRi ; jLi) (9.80) 2 with eigenvalue EA = 0 ; 1 . One can show that 1 < 0 and therefore the fundamental state is the symmetric one, jS i. This situation gives rise to the well known eect of quantum oscillations. We can express the states jRi and jLi in terms of the energy eigenstates jRi = p12 (jS i + jAi) jLi = p12 (jS i ; jAi) (9.81) Let us now prepare a state, at t = 0, by putting the particle in the right minimum. This is not an energy eigenstate and its time evolution is given by  ;iES t  1 ;iES t   1 ; iE t ; it  E A jR ti = p e jS i + e jAi = p e jS i + e jAi (9.82) 2 2 with E = EA ; ES . Therefore, for t = =E the state jRi transforms into the state jLi. The state oscillates with a period given by T = 2 E (9.83) In nature there are nite systems as sugar molecules, which seem to exhibit spontaneous symmetry breaking. In fact one observes right-handed and left-handed sugar molecules. The explanation is simply that the energy dierence E is so small that the oscillation period is of the order of 104 ; 106 years. The splitting of the fundamental states decreases with the height of the potential between two minima, therefore, for innite systems, the previous mechanism does not work, and we may have spontaneous symmetry breaking. In fact, coming back to the scalar eld example, its expectation value on the vacuum must be a constant, as it follows from the translational invariance of the vacuum h0j(x)j0i = h0jeiPx(0)e;iPx j0i = h0j(0)j0i = v (9.84) and the energy dierence between the maximum at  = 0, and the minimum at  = v, becomes innite in the limit of innite volume  2   4 Z Z 4 3 H ( = 0) ; H ( = v) = ; d x 2 v2 + 4 v4 = 4 d3x = 4 V (9.85) V V 222

9.3 The Goldstone theorem From our point of view, the most interesting consequence of spontaneous symmetry breaking is the Goldstone theorem. This theorem says that for any continuous symmetry spontaneously broken, there exists a massless particle (the Goldstone boson). The theorem holds rigorously in a local eld theory, under the following hypotheses  The spontaneous broken symmetry must be a continuous one.  The theory must be manifestly covariant.  The Hilbert space of the theory must have a denite positive norm. We will limit ourselves to analyze the theorem in the case of a classical scalar eld theory. Let us start considering the lagrangian for the linear -model with invariance O(N ) 2 (9.86) L = 21 @ ~  @  ~ ; 2 ~  ~ ; 4 (~  ~)2 The conditions that V must satisfy in order to have a minimum are @V = 2 +  j~j2 = 0 (9.87) l l @l with solutions r 2 2 2 (9.88) l = 0 j~j = v  v = ;

The character of the stationary points can be studied by evaluating the second derivatives @ 2 V =  (2 + j~j2) + 2  (9.89) l m @l @m lm We have two possibilities  2 > 0, we have only one real solution given by ~ = 0, which is a minimum, since @ 2 V =  2 > 0 (9.90) lm @ @ l

m

 2 < 0, there are innite solutions, among which ~ = 0 is a maximum. The points of the sphere j~j2 = v2 are degenerate minima. In fact, by choosing l = vlN as a representative point, we get @ 2 V = 2v2  > 0 lN mN @l @m 223

(9.91)

Expanding the potential around this minimum we get  2 V  (l ; vlN )(m ; vmN ) (9.92) V (~)  V minimum + 21 @@ @  l m minimum If we are going to make a perturbative expansion, the right elds to be used are l ; vlN , and their mass is just given by the coecient of the quadratic term 20 0  0 3 2 V  6 7 (9.93) @ 2 Mlm = @ @ minimo = ;22 lN mN = 64 0 0  0 75 l m 0 0  ;22 Therefore the masses of the elds a, a = 1     N ; 1 , and  = N ; v, are given by m2 a = 0 m2 = ;22 (9.94) By dening m2 = ;22 (9.95) we can write the potential as a function of the new elds

r

m4 + 1 m2 2 + m2   V = 16  2 2

NX;1 a=1

2a + 2

!

+ 4

NX;1 a=1

2a + 2

!2

(9.96)

In this form the original symmetry O(N ) is broken. However a P residual symmetry O(N ; 1) is left. In fact, V depends only on the combination Na=1;1 2a, and it is invariant under rotations around the axis we have chosen as representative for the fundamental state, (0     v). It must be stressed that this is not the most general potential invariant under O(N ; 1). In fact the most general potential (up to the fourth order in the elds) describing N scalar elds with a symmetry O(N ; 1) would depend on 7 coupling constants, whereas the one we got depends only on the two parameters m and . Therefore spontaneous symmetry breaking puts heavy constraints on the dynamics of the system. We have also seen that we have N ; 1 massless scalars. Clearly the rotations along the rst N ; 1 directions leave the potential invariant, whereas the N ; 1 rotations on the planes a ; N move away from the surface of the minima. This can be seen also in terms of generators. Since the eld we have chosen as representative of the ground state is ijmin = viN , we have Tijab j jmin = i(iajb ; ib ja)vjN = 0 (9.97) since a b = 1     N ; 1, and TijaN j jmin = i(ia jN ; iN ja )vjN = ivia 6= 0 (9.98) Therefore we have N ; 1 broken symmetries and N ; 1 massless scalars. The generators of O(N ) divide up naturally in the generators of the vacuum symmetry 224

(here O(N ; 1)), and in the so called broken generators, each of them corresponding to a massless Goldstone boson. In general, if the original symmetry group G of the theory is spontaneously broken down to a subgroup H (which is the symmetry of the vacuum), the Goldstone bosons correspond to the generators of G which are left after subtracting the generators of H . Intuitively one can understand the origin of the massless particles noticing that the broken generators allow transitions from a possible vacuum to another. Since these states are degenerate the operation does not cost any energy. From the relativistic dispersion relation this implies that we must have massless particles. One can say that Goldstone bosons correspond to &at directions in the potential.

9.4 The Higgs mechanism We have seen that the spontaneous symmetry breaking mechanism, in the case of continuous symmetry leads to massless scalar particles, the Goldstone bosons. Also gauge theories lead to massless vector bosons, in fact, as in the electromagnetic case, gauge invariance forbids the presence in the lagrangian of terms quadratic in the elds. Unfortunately in nature the only massless particles we know are the photon and perhaps the neutrinos, which however are fermions. But once one couples spontaneous symmetry breaking to a gauge symmetry, things change. In fact, if we look back at the hypotheses underlying a gauge theory, it turns out that Goldstone theorem does not hold in this context. The reason is that it is impossible to quantize a gauge theory in a way which is at the same time manifestly covariant and has a Hilbert space with positive denite metric. This is well known already for the electromagnetic eld, where one has to choose the gauge before quantization. What happens is that, if one chooses a physical gauge, as the Coulomb gauge, in order to have a Hilbert space spanned by only the physical states, than the theory looses the manifest covariance. If one goes to a covariant gauge, as the Lorentz one, the theory is covariant but one has to work with a big Hilbert space, with non-denite positive metric, and where the physical states are extracted through a supplementary condition. The way in which the Goldstone theorem is evaded is that the Goldstone bosons disappear, and, at the same time, the gauge bosons corresponding to the broken symmetries acquire mass. This is the famous Higgs mechanism. Let us start with a scalar theory invariant under O(2)

L = 21 @ ~  @  ~ ; 2 ~  ~ ; 4 (~  ~)2 2

(9.99)

and let us analyze the spontaneous symmetry breaking mechanism. If 2 < 0 the symmetry is broken and we can choose the vacuum as the state

r

2 v = ;

~ = (v 0) 225

(9.100)

After the translation 1 =  + v, with h0jj0i = 0, we get the potential (m2 = ;22 )

r

m4 + 1 m2 2 + m2 (2 + 2 ) ;  (2 + 2)2 V = ; 16 (9.101) 2  2 2 4 2 In this case the group O(2) is completely broken (except for the discrete symmetry 2 ! ;2). The Goldstone eld is 2 . This has a peculiar way of transforming under O(2). In fact, the original elds transform as 1 = ; 2

2 = 1

(9.102)

from which

 = ; 2 2 =  + v (9.103) We see that the Goldstone eld undergoes a rotation plus a translation, v. This is the main reason for the Goldstone particle to be massless. In fact one can have invariance under translations of the eld, only if the potential is &at in the corresponding direction. This is what happens when one moves in a way which is tangent to the surface of the degenerate vacuums (in this case a circle). How do things change if our theory is gauge invariant? In that case we should have invariance under a transformation of the Goldstone eld given by 2(x) = (x)(x) + (x)v

(9.104)

Since (x) is an arbitrary function of the space-time point, it follows that we can choose it in such a way to make 2(x) vanish. In other words it must be possible to eliminate the Goldstone eld from the theory. This is better seen by using polar coordinates for the elds, that is q2 2 = 1 + 2 (9.105) sin  = p 22 2 1 + 2 Under a nite rotation, the new elds transform as

! 

 !+

(9.106)

It should be also noticed that the two coordinate systems coincide when we are close to the vacuum, as when we are doing perturbation theory. In fact, in that case we can perform the following expansion q = 22 + 2 + 2v + v2  v +    v +2   v2 (9.107) Again, if we make the theory invariant under a local transformation, we will have invariance under (x) ! (x) + (x) (9.108) 226

By choosing (x) = ;(x) we can eliminate this last eld from the theory. The only remaining degree of freedom in the scalar sector is (x). Let us study the gauging of this model. It is convenient to introduce complex variables (9.109)  = p1 (1 + i2) y = p1 (1 ; i2) 2 2 The O(2) transformations become phase transformations on   ! ei  (9.110) and the lagrangian (9.99) can be written as

L = @ y@   ; 2y ; (y)2

(9.111)

We know that it is possible to promote a global symmetry to a local one by introducing the covariant derivative

@  ! (@ ; igA)

(9.112)

from which

L = (@ + igA)y(@  ; igA) ; 2y ; (y)2 ; 14 F F 

In terms of the polar coordinates (  ) we have  = p1 ei  y = p1 e;i 2 2 By performing the following gauge transformation on the scalars  ! 0 = e;i

(9.113) (9.114) (9.115)

and the corresponding transformation on the gauge elds A ! A0 = A ; g1 @  (9.116) the lagrangian will depend only on the elds and A0 (we will put again A0 = A )

L = 12 (@ + igA ) (@  ; igA) ; 2 2 ; 4 4 ; 14 F F  2

(9.117)

In this way the Goldstone boson disappears. We have now to translate the eld

h0jj0i = 0

=  + v 227

(9.118)

and we see that this generates a bilinear term in A , coming from the covariant derivative, given by 1 g 2 v 2 A A (9.119)  2 Therefore the gauge eld acquires a mass m2A = g2v2 (9.120) It is instructive to count the degrees of freedom before and after the gauge transformation. Before we had 4 degrees of freedom, two from the scalar elds and two from the gauge eld. After the gauge transformation we have only one degree of freedom from the scalar sector, but three degrees of freedom from the gauge vector, because now it is a massive vector eld. The result looks a little bit strange, but the reason why we may read clearly the number of degrees of freedom only after the gauge transformation is that before the lagrangian contains a mixing term A @   (9.121) between the Goldstone eld and the gauge vector which makes complicate to read the mass of the states. The previous gauge transformation realizes the purpose of making that term vanish. The gauge in which such a thing happens is called the unitary gauge. We will consider now the further example of a symmetry O(N ). The lagrangian invariant under local transformations is 2 (9.122) L = 21 (D)ij j (D)ik k ; 2 ii ; 4 (ii)2 where (D)ij = ij @ + i g (T AB )ij WAB (9.123) 2 where (T AB )lm = i(lAmB ; mA lB ). In the case of broken symmetry (2 < 0), we choose again the vacuum along the direction N , with v dened as in (9.100) i = viN (9.124) Recalling that





Tijabj min = 0 TijaN j min = ivia a b = 1     N ; 1

the mass term for the gauge eld is given by   ; 18 g2TijAB j min(T CD )ik k minWAB W CD   = ; 14 g2TijaN j  (T bN )ik k  WaN W bN min min 1 1 = g2v2ia ibWaN W bN = g2v2WaN W bN 4 4 228

(9.125)

(9.126)

Therefore, the elds WaN associated to the broken directions T aN acquire a mass g2v2=2, whereas Wab , associated to the unbroken symmetry O(N ; 1), remain massless. In general, if G is the global symmetry group of the lagrangian, H the subgroup of G leaving invariant the vacuum, and GW the group of local (gauge) symmetries, GW 2 G, one can divide up the broken generators in two categories. In the rst category fall the broken generators lying in GW  they have associated massive vector bosons. In the second category fall the other broken generators they have associated massless Goldstone bosons. Finally the gauge elds associated to generators of GW lying in H remain massless. From the previous derivation this follows noticing that the generators of H annihilate the minimum of the elds, leaving the corresponding gauge bosons massless, whereas the non zero action of the broken generators generate a mass term for the other gauge elds. The situation is represented in Fig. 9.4.1. G

H

GW

Fig. 6.1 -This gure shows the various groups, G, the global symmetry of the lagrangian, H 2 G, the symmetry of the vacuum, and GW , the group of local sym-

metries. The broken generators in GW correspond to massive vector bosons. The broken generators do not belonging to GW correspond to massless Goldstone bosons. Th unbroken generators in GW correspond to massless vector bosons.

We can now show how to eliminate the Goldstone bosons. In fact we can dene new elds a and  as  aN  i = e;iT a ( + v) (9.127) iN

where a = 1     N ; 1, that is the sum is restricted to the broken directions. The other degree of freedom is in the other factor. The correspondence among the elds 229

~ and (a ) can be seen easily by expanding around the vacuum  aN  e;iT a  iN ; i(T aN )iN a = iN + iaa from which

iN

(9.128)

i  (a   + v) (9.129) showing that the a's are really the Goldstone elds. The unitary gauge is dened through the transformation  aN  i ! eiT a j = iN ( + v) (9.130) ij





aN aN aN aN W ! eiT a We;iT a ; gi @ eiT a e;iT a (9.131) This transformation eliminates the Goldstone degrees of freedom and the resulting lagrangian depends on the eld , on the massive vector elds WaN and on the massless eld Wab. Notice again the counting of the degrees of freedom N +2N (N ; 1)=2 = N 2 in a generic gauge, and 1 + 3(N ; 1) + 2(N ; 1)(N ; 2)=2 = N 2 in the unitary gauge.

9.5 Quantization of a spontaneously broken gauge theory in the R -gauge In the previous Section we have seen that in the unitary gauge some of the gauge elds acquire a mass. The corresponding quadratic part of the lagrangian will be (we consider here the abelian case corresponding to a symmetry U (1) or O(2)) 2 L = ; 41 F F  + m2 W 2 (9.132) where W is the massive gauge eld (called also a Yang-Mills massive eld). The free equations of motion are ( + m2 )W ; @ (@  W ) = 0 (9.133) Let us study the solutions of this equation in momentum space. We dene Z W(x) = d4 W (k)e;ikx (9.134) It is convenient to introduce a set of four independent four-vectors. By putting k = (E ~k) we introduce (i) = (0~ ni) i =!1 2 ~ (9.135) (3) = m1 j~kj E ~k jk j 230

with

~k  ~ni = 0 j~nij2 = 1 Then we can decompose W (k) as follows

(9.136)

W (k) = ( ) a (k) + kb(k)

(9.137)

Since k( ) = 0, from the equation of motion we get (;k2 + m2 )(( ) a (k) + kb(k)) + k k2b(k) = 0

(9.138)

implying

(k2 ; m2 )a (k) = 0 m2 b(k) = 0 (9.139) Therefore, for m 6= 0 the eld has three degrees of freedom with k2 = m2 , corresponding to transverse and longitudinal polarization. When the momentum is on shell, we can dene a fourth unit vector  (9.140) (0) = km In this way we have four independent unit vectors which satisfy the completeness relation 3 X ( ) ( ) g = g (9.141) 0

0

 =0 0

Therefore the sum over the physical degrees of freedom gives 3 X

=1

  ( ) ( ) = ;g + kmk2

(9.142)

The propagator is dened by the equation

( + m2) ; @ @ G (x) = ig 4(x)  



(9.143)

Solving in momentum space we nd  Z d4k  k  i k  ; ikx  G (k) = (2 )4 e (9.144) k2 ; m2 + i ;g + m2 Therefore, the corresponding Feynman's rule for the massive gauge eld propagator is   k  k i  (9.145) ; k2 ; m2 + i g ; m2 Notice that the expression in parenthesis is nothing but the projector over the physical states given in eq. (9.142). This propagator has a very bad high-energy behaviour. In fact it goes to constant for k ! 1. As a consequence the argument 231

for the renormalizability of the theory based on power counting does not hold any more. Although in the unitary gauge a spontaneously broken gauge theory does not seem to be renormalizable, we have to take into account that the gauge invariance allows us to make dierent choices of the gauge. In fact we will show that there exist an entire class of gauges where the theory is renormalizable by power counting. We will discuss this point only for the abelian case, with gauge group O(2), but the argument cab be easily extended to the non-abelian case. Let us consider the O(2) theory as presented in Section 9.4. Let us recall that under a transformation of O(2) the two real scalar eld of the theory transform as

1 = ; 2 2 = ; 1 (9.146) Therefore, in order to make it a gauge theory we introduce covariant derivatives D 1 = @ 1 + gW2 D 2 = @ 2 ; gW1 (9.147) with W transforming under a local transformation as W(x) = g1 @ (x) (9.148) Assuming that the spontaneous breaking occurs along the direction 1 , we introduce the elds 1 = v +  2 =  (9.149) with 2 (9.150) v2 = ;  Remember that  is the Higgs eld, whereas  is the Goldstone eld. In terms of the new variables the lagrangian is L = 12 (@  + gW)2 + 21 (@  ; gW ; gvW)2 ; V ( phi) ; 14 F F  (9.151) where, the potential up to quadratic terms is given by (see eq. (9.101)) m 4 + 1 m 2 2 +    (9.152) V ( ) = ; 16  2 with m2 = ;22 . We see that the eect of the shift on 1 is to produce the terms 1 g2v2W 2 ; gvW @   + g2vW 2 (9.153)  2 The last term is an interaction term and it does not play any role in the following considerations. The rst term is the mass term for the W m2W = g2v2 (9.154) 232

whereas the second term is the mixing between the W and the Goldstone eld, that we got rid of by choosing the unitary gauge. However this choice is not unique. In fact let us consider a gauge xing of the form f ; B = 0 f = @  W + gv (9.155) where  is an arbitrary parameter and B (x) is an arbitrary function. By proceeding as in Section 8.3 we know that the eect of such a gauge xing is to add to the lagrangian a term 2 ; 21 f 2 = ; 21 (@  W + gv)2 = ; 21 (@  W)2 ;  gv@  W ; 2 g2v22 (9.156) The rst term is the usual covariant gauge xing. The second term, after integration by parts reads +  gvW@   (9.157)  and we see that with the choice = (9.158) it cancels the unwanted mixing term. Finally the third contribution ; 12 g2v22 (9.159) is a mass term for the Goldstone eld m2 = g2v2 = m2W (9.160) However this mass is gauge dependent (it depends on the arbitrary parameter  ) and this is a signal of the fact that in this gauge the Goldstone eld is ctitious. In fact remember that it disappears in the unitary gauge. We will see later, in an example, that evaluating a physical amplitude the contribution of the Goldstone disappears, as it should be. Finally we have to consider the ghost contribution from the Faddeev-Popov determinant. For this purpose we need to know the variation of the gauge xing f under an innitesimal gauge transformation (9.161) f (x) = g1  (x) + gv (x)((x) + v) giving f (x) = 1 4 (x ; y) + gv ((x) + v)4(x ; y) (9.162)  (y) g Therefore the ghost lagrangian can be written as c c + g2v2cc + gvcc (9.163) Notice that in contrast with QED, the ghosts cannot be ignored, since they are coupled to the Higgs eld. We are now in the position of reading the various propagators from the quadratic part of the lagrangian 233

 Higgs propagator (): k2 ;i m2  Goldstone propagator (): k2 ; im2  Ghost propagator (c): ; k2 ; im2

W

W

The propagator for the vector eld is more involved. let us collect the various quadratic terms in W (9.164) ; 41 F F  + 21 m2W W 2 ; 21 (@  W )2 In momentum space wave operator is (g k2 ; k k ) ; m2W g + 1 kk (9.165)  which can be rewritten as   k    k  g ; k2 (k2 ; m2W ) + kkk2 1 (k2 ; m2W ) (9.166) The propagator in momentum space is the inverse of the wave operator (except for a factor ;i). To invert the operator we notice that it is written as a combination of two orthogonal projection operators P1 and P2

P1 + P2 The inverse of such an operator is simply 1P + 1P 1  2 Therefore the W propagator is given by   k    i k  ; k2 ; m2 g ; k2 ; k2 ;im2 kkk2 W W  k i k = ; k2 ; m2 g ; k2 ; m2 (1 ;  ) W

W

(9.167) (9.168)

(9.169)

We see that for this class of gauges the asymptotic behaviour of the propagator is 1=k2 and we can apply the usual power counting. Notice that the unitary gauge is recovered in the limit  ! 1. In this limit the W -propagator becomes the one discussed at the beginning of the Section. Of course, the singularity of the limit destroys the asymptotic behaviour that one has for nite values of  . Also, for  ! 1 the ghost and the Goldstone elds decouple since their mass goes to innity. 234

To end this Section we observe that with this approach one is able to prove at the same time that the theory is renormalizable and that it does not contain spurious states. In fact the theory is renormalizable for nite values of  and it does not contain spurious states for  ! 1. But, since the gauge invariant quantities do not depend on the gauge xing, they do not depend in particular on  , and therefore one gets the wanted result.

9.6

 -cancellation

in perturbation theory

In this Section we will discuss the cancellation of the  dependent terms in the calculation of a simple amplitude at tree level. To this end we consider again the O(2) model of the previous Section, and we add to the theory a fermion eld. We will also mimic the standard model by making transform under the gauge group only the left-handed part of the fermion eld. That is we decompose  as

 = L + R where

L = 1 ;2 5  R = 1 +2 5  and we assume the following transformation under the gauge group L ! ei L R ! R

(9.170) (9.171) (9.172)

A fermion mass term would destroy the gauge invariance, since it couples left-handed and right-handed elds. In fact

!L = and therefore

1 ;   

2

5

 = y 1 ;2 5 0 = ! 1 +2 5

(9.173)



! = ! 1 + 5 + 1 ; 5  = !L R + !R L  (9.174) 2 2 Therefore we will introduce a Yukawa coupling between the scalar eld and the fermions, in such a way to be gauge invariant and to gives rise to a mass term for the fermion when the symmetry is spontaneously broken. We will write ^ L + !R i@^R ; f (!L R # + !R L # ) Lf = !L iD (9.175) with

D = @ ; igW (9.176) Here we have used again the complex notation for the scalar eld, as in eq. (9.109), and we remind that it transforms as # ! ei # (9.177) 235

Therefore Lf is gauge invariant. The lagrangian can be rewritten in terms of the eld  by writing explicitly the chiral projectors (1 5)=2. We get ! @^ 1 + 5  + g !  1 ; 5 W  ! @^ 1 ; 5  + i Lf = i 2 2  21 +    1 ;  f 5 5 ; p ! 2 (1 + i2) + ! 2 (1 ; i2) (9.178) 2 from which f ; ! (v + ) + i ! 5 ! @^ + g !  1 ; 5 W  ; p (9.179) L = i 2 2 where  + v = 1 and  = 2. This expression shows that the fermion eld acquires a mass through the Higgs mechanism given by mf = pf v (9.180) 2 The previous expression shows also that the Higgs eld is a scalar (parity +1), whereas the Goldstone eld is a pseudoscalar (parity= -1).Consider now the fermionfermion scattering in this model at tree level. The amplitude gets contribution from the diagrams of Fig. 9.6.1. . p'

k

φ

W +

χ +

q p

k'

Fig. 9.6.1 - The fermion-fermion scattering in the U (1) Higgs model. We want to show that the  contribution cancels out. The  dependence arises from the gauge eld and the Goldstone eld exchange. Therefore we will consider only these two contributions. Let us start with the Goldstone exchange. The Feynman's rule for the vertex is given in Fig. 9.6.2. . φ

Fig. 9.6.2 - Goldstone-fermion vertex: pf 5 2

236

The corresponding amplitude is

  2

Mf = pf

u!(p0)5u(p) q2 ; im2 u!(k0 )5u(k)

(9.181) 2 W where q = p ; p0 = k0 ; k. The Feynman rule for the W-fermion vertex is given in Fig. 9.6.3. .

Fig. 9.6.3 - W-fermion vertex: ig 1 ;2 5 The corresponding amplitude is MW = (ig)2u!(p0) 1 ;2 5 u(p)   q ; i q   q2 ; m2 g ; q2 ; m2 (1 ;  ) u!(k0 ) 1 ;2 5 u(k) (9.182) W

W

It is convenient to isolate the  dependence by rewriting the gauge eld propagator as ;i g ; qq + qq  1 ; 1 ;   q2 ; m2W  m2W  m2W q2 ; m2W     = = 2 ;i 2 g ; q q2 + 2 ;i 2 q q2 (9.183) q ; mW mW q ; mW mW The rst term is  independent and it is precisely the W propagator in the unitary gauge. The second term can be simplied by using the identity q u!(p0 ) 1 ;2 5 u(p) = (p ; p0 )u!(p0) 1 ;2 5 u(p) =  1 +  1 ;  5 5 0 0 p^ ; p^ 2 u(p) = mf u!(p0)5u(p) (9.184) = u!(p ) 2 By using this relation we have qq u!(p0) 1 ;2 5 u(p)!u(k0 ) 1 ;2 5 u(k) = = (mf )!u(p0)5u(p)(;mf )!u(k0)5u(k)   2 = ; pf v2u!(p0)5u(p)!u(k0)5u(k) (9.185) 2 237

where we have made use of eq. (9.180). Substituting this result in MW we get





  1 ; 5 u(p) ;i  ; q q u!(k0 ) 1 ; 5 u(k) MW = g   2 q2 ; m2W m2W 2   2 2 2 i g v pf u!(p0) u(p)!u(k0) u(k) (9.186) + q2 ;;m 5 5 2 m2 2 W W Finally, using m2W = g2v2 we see that the last term cancels exactly the M amplitude. Therefore the result is the same that we would have obtained in the unitary gauge where there is no Goldstone contribution. Now let us make some comment about the result. In QED we have used the fact that the q q terms in the propagator do not play a physical role,since they are contracted with the fermionic current which is conserved. In the present case this does not happens, but we get a term proportional to the fermion mass. In ! (1 ; 5)=2 it is not conserved because the fermion fact the fermionic current  acquires a mass through the spontaneous breaking of the symmetry. On the other hand, the term that one gets in this way it is just the one necessary to give rise to the cancellation with the Goldstone boson contribution. It is also worth to compare what happens for dierent choices of  . For  = 0 the gauge propagator is transverse ;i g ; kk  (9.187) k2 ; m2W k2

(ig)2u!(p0)

This corresponds to the Lorentz gauge. The propagator has a pole term at k2 = 0, which however it is cancelled by the Goldstone pole which also is at k2 = 0. For the choice  = 1 the gauge propagator has the very simple form ;ig (9.188) k2 ; m2W This is the Feynman-'t Hooft gauge. Here the role of the Goldstone boson, which has a propagator i (9.189) 2 k ; m2 W

it is to cancel the contribution of the time-like component of the gauge eld, which is described by the polarization vector (0)  . In fact the gauge propagator can be rewritten as 3 3 ;ig = ;i X i X ;i k k ( ) ( ) g = ( ) ( ) +   k2 ; m2W k2 ; m2W  =0   k2 ; m2W =1   k2 ; m2W m2W 0

0

0

(9.190) The rst piece is the propagator in the unitary gauge, whereas the second piece is cancelled by the Goldstone boson contribution. 238

Chapter 10 The Standard model of the electroweak interactions 10.1 The Standard Model of the electroweak interactions We will dscribe now the Weinberg Salam model for the electroweak interactions. Let us recall that the Fermi theory of weak interactions is based on a lagrangian involving four Fermi elds. This lagrangian can be written as the product of two currents which are the sum of various pieces. LF = GpF J(+) J (;) (10.1) 2 Limiting for the moment our considerations to the charged current for the electron and its neutrino (here   e)), the current is given by

J(+) = !e (1 ; 5) 

J(;) = J(+) y = ! (1 ; 5)e

(10.2)

To understand the symmetry hidden in this couplings it is convenient to introduce the following spinors with 2  4 components  ( )  1 ;     (10.3) L = ( )L = 2 5  e e L where we have introduced left-handed elds. The right-handed spinors, projected out with (1 + 5)=2, don't feel the weak interaction. This is, in fact, the physical meaning of the V ; A interaction. By using ;  ;  (10.4) L! = (! )L (!e)L = !  !e 1 +2 5 239

we can write the weak currents in the following way  0  ;  1 +  5 1 ; 5 (+) ! ! ! J = 2e 2  2  = 2 ( )L (e)L  ( )  L    ;  = 2 (! )L (!e)L  01 00 (( ))L = 2L!  ;L e L

where we have dened

(10.5)

(10.6)  = 1 2 i2 the i 's being the Pauli matrices. Analogously J(;) = 2L! + L (10.7) In the fties the hypothesis of the intermediate vector bosons was advanced. According to this idea the Fermi theory was to be regarded as the low energy limit of a theory where the currents were coupled to vector bosons, very much as QED. However, due to the short range of the weak interactions these vector bosons had to have mass. The interaction among the vector bosons and the fermions is given by

Lint = pg J(+) W ; + J(;) W + = pg L! (; W ; + +W +)L (10.8) 2 2 2 The coupling g can be related to the Fermi constant through the relation 2 G pF = 8mg 2 (10.9) 2 W Introducing real elds in place of W  iW2 W  = W1 p (10.10) 2 we get (10.11) Lint = g2 L! (1W1 + 2 W2)L Let us now try to write the electromagnetic interaction within the same formalism 1 +  1 ;  1 ;  1 +    Lem = e!e QeA = e!eQ 2 5  2 5 + 2 5  2 5 eA = e(!e )LQ(e)L + (!e)RQ (e)R ]A (10.12) where (e )R = 1 + 5 e  (!e )R = !e 1 ; 5 (10.13) 2 2 where Q = ;1 is the electric charge of the electron, in unit of the electric charge of the proton, e. It is also convenient to denote the right-handed components of the electron as R = (e)R (10.14) 240

We can write and





(!e)L(e)L = L!  00 01 L = L!  1 ;2 3 L

(10.15)

! R (!e)R (e)R = R

(10.16)

from which, using Q = ;1,  1   3 ! R + L!  L A = ejemA Lem = e ; 2 L! L ; R (10.17) 2 From this equation we can write jem = j3 + 12 jY (10.18) with j3 = L!  23 L (10.19) and ! R jY = ;L! L ; 2R (10.20) These equations show that the electromagnetic current is a mixture of j3 , which is a partner of the charged weak currents, and of jY , which is another neutral current. The following notations unify the charged currents and j3 (10.21) ji = L!  2i L Using the canonical anticommutators for the Fermi elds, it is not dicukt to prove that the charges associated to the currents

Qi

Z

d3~x j0i (x)

(10.22)

Qi Qj ] = iijk Qk

(10.23)

=

span the Lie algebra of SU (2), The charge QY associated to the current jY commutes with Qi , as it can be easily veried noticing that the right-handed and left-handed elds commute with each other. Therefore the algebra of the charges is Qi Qj ] = iijk Qk 

Qi QY ] = 0

(10.24)

This is the Lie algebra of SU (2) U (1), since the Qi's generate a group SU (2), whereas QY generates a group U (1), with the two groups commuting among themselves. To build up a gauge theory from these elements we have to start with an initial lagrangian possessing an SU (2) U (1) global invariance. This will produce 241

4 gauge vector bosons. Two of these vectors are the charged one already introduce, whereas we would like to identify one of the the other two with the photon. By evaluating the commutators of the charges with the elds we can read immediately the quantum numbers of the various particles. We have Z     i Lc Q ] = d3 x Lc Lya i Lb ] = i Lb (10.25) 2 ab 2 cb and R Qi] = 0 (10.26) Lc QY ] = ;Lc R QY ] = ;2R (10.27) These relations show that L transforms as an SU (2) spinor, or, as the representation 2, where we have identied the representation with its dimensionality. R belongs to the trivial representation of dimension 1. Putting everything together we have the following behaviour under the representations of SU (2) U (1)

L 2 (2 ;1)

R 2 (1 ;2)

(10.28)

The relation (10.18) gives the following relation among the dierent charges Qem = Q3 + 12 QY (10.29) As we have argued, if SU (2) U (1) has to be the fundamental symmetry of electroweak interactions, we have to start with a lagrangian exhibiting this global symmetry. This is the free Dirac lagrangian for a massless electron and a left-handed neutrino ! @^R L0 = L! i@^L + Ri (10.30) Mass terms, mixing left- and right-handed elds, would destroy the global symmetry. For instance, a typical mass term gives 1 +  1 ;  1 ;  1 +   5 5 5 5 y ! ! !  +  m = m 0 0 2 2 2 2  = m(R L + LR ) (10.31) We shall see later that the same mechanism giving mass to the vector bosons can be used to generate fermion masses. A lagrangian invariant under the local symmetry SU (2) U (1) is obtained through the use of covariant derivatives





0 !  (@ + ig0Y ) R @ ; ig 2i Wi + i g2 Y L + Ri The interaction term is  ~  g0 ! RY  Lint = gL!  2  W~  L ; 2 L! LY  ; g0R

L = L! i 

242

(10.32)

(10.33)

In this expression we recognize the charged interaction with W . We have also, as expected, two neutral vector bosons W 3 and Y . The photon must couple to the electromagnetic current which is a linear combination of j 3 and j Y . The neutral vector bosons are coupled to these two currents, so we expect the photon eld, A , to be a linear combination of W3 and Y . It is convenient to introduce two orthogonal combination of these two elds, Z , and A. The mixing angle can be identied through the requirement that the current coupled to A is exactly the electromagnetic current with coupling constant e. Let us consider the part of Lint involving the neutral couplings 0 (10.34) LNC = gj3 W3 + g2 jY Y  By putting ( is called the Weinberg angle) W3 = Z cos  + A sin  Y = ;Z sin  + A cos  (10.35) we get     1 1 3 0 Y  3 0 Y LNC = g sin  j + g cos  2 j A + g cos  j ; g sin  2 j Z  (10.36) The electromagnetic coupling is reproduced by requiring the two conditions g sin  = g0 cos  = e (10.37) These two relations allow us to express both the Weinberg angle and the electric charge e in terms of g and g0. In the low-energy experiments performed before the LEP era (< 1990) the fundamental parameters used in the theory were e (or rather the ne structure constant ) and sin2 . We shall see in the following how the Weinberg angle can be eliminated in favor of the mass of the Z , which is now very well known. By using eq. (10.37), and jem = j3 + 21 jY , we can write

LNC = ejemA + g cos j3 ; g0 sin (jem ; j3 )]Z 

(10.38) Expressing g0 through g0 = g tan , the coecient of Z can be put in the form g cos  + g0 sin ]j3 ; g0 sin  jem = cosg  j3 ; cosg  sin2  jem (10.39) from which LNC = ejemA + cosg  j3 ; sin2  jem]Z   ejemA + cosg  jZ Z  (10.40) where we have introduced the neutral current coupled to the Z jZ = j3 ; sin2  jem 243

(10.41)

The value of sin2  was evaluated initially from processes induced by neutral currents (as the scattering  ; e) at low energy. The approximate value is sin2   0:23

(10.42)

This shows that both g and g0 are of the same order of the electric charge. Notice that the eq. (10.37) gives the following relation among the electric charge and the couplings g and g0 1 = 1+ 1 (10.43) e2 g2 g02

10.2 The Higgs sector in the Standard Model According to the discussion made in the previous Section, we need three broken symmetries in order to give mass to W  and Z . Since SU (2) U (1) has four generators, we will be left with one unbroken symmetry, that we should identify with the group U (1) of the electromagnetism (U (1)em), in such a way to have the corresponding gauge particle (the photon) massless. Therefore, the group U (1)em must be a symmetry of the vacuum (that is the vacuum should be electrically neutral). To realize this aim we have to introduce a set of scalar elds transforming in a convenient way under SU (2) U (1). The simplest choice turns out to be a complex representation of SU (2) of dimension 2 (Higgs doublet). As we already noticed the vacuum should be electrically neutral, so one of the components of the doublet must also be neutral. Assume that this component is the lower one, than we will write

 + 

# = 0

To determine the weak hypercharge, QY , of #, we use the relation Qem = Q3 + 21 QY for the lower component of #. We get   1  Y 0 Q ( ) = 2  0 ; ; 2 = 1

(10.44)

(10.45) (10.46)

Since QY and Qi commute, both the components of the doublet must have the same value of QY , and using again eq. (10.45), we get Qem(+) = 12 + 21 = 1 (10.47) which justies the notation + for the upper component of the doublet. 244

The most general lagrangian for # with the global symmetry SU (2) U (1) and containing terms of dimension lower or equal to four (in order to have a renormalizable theory) is LHiggs = @ #y@  # ; 2#y# ; (#y#)2 (10.48) where  > 0 and 2 < 0. The potential has innitely many minima on the surface 2 2  v 2 j#jminimum = ; 2 = 2 (10.49) with 2  2 (10.50) v =; Let us choose the vacuum as the state  0  (10.51) h0j#j0i = v=p2 By putting with we get from which



 



+ # = v=0p2 + (h + i)=p2  #0 + #0

(10.52)

h0j#0j0i = 0

(10.53)

j#j2 = 12 v2 + vh + j+j2 + 21 (h2 + 2)

(10.54)









2 4  1 1 1 2 2 + 2 2 2 + 2 2 2 VHiggs = ; 4  + v h + 2vh j j + 2 (h +  ) +  j j + 2 (h +  ) (10.55) From this expression we read immediately the particle masses (; = (+)y)

m2 = m2 + = m2 = 0

(10.56)

;

and

m2h = 2v2 = ;22 (10.57) It is also convenient to express the parameters appearing in the original form of the potential in terms of m2h e v (2 = ;2m2h,  = 2m2h=v2). In this way we get









m2h h2 + m2h h j+j2 + 1 (h2 + 2) + m2h j+j2 + 1 (h2 + 2) 2 2 v 2 2v2 2 (10.58)  Summarizing we have three massless Goldstone bosons  and , and a massive scalar h. This is called the Higgs eld. VHiggs = ;2m2h +

245

As usual, by now, we promote the global symmetry to a local one by introducing the covariant derivative. Recalling that # 2 (2 1) of SU (2) U (1), we have 0 ~  ; i g Y DH = @ ; i g2 ~  W (10.59) 2 and LHiggs = (DH #)yDH # ; 2#y# ; (#y#)2 (10.60) To study the mass generation it is convenient to write # in a way analogous to the one used in eq. (9.127)  0  ~ i   ~  =v p #=e (10.61) (v + h)= 2

According to our rules the exponential should contain the broken generators. In fact there are 1 , and 2 which are broken. Furthermore it should contain the combination of 1 and 3 which is broken (remember that (1 + 3 )=2 is the electric charge of the doublet #, and it is conserved). But, at any rate, 3 is a broken generator, so the previous expression gives a good representation of # around the vacuum. In fact, expanding around this state  0   1 + i =v i( ; i )=v  0  ~ 3 1 2 i   ~  =v p p e (v + h)= 2  i(1 + i2 )=v 1 ; i3=v (v + h)= 2  i( ; i )  1 (10.62)  p (v +1h ; i2 3) 2 p and introducing real components for # (+ = (1 ; i2)= 2)   ; i  1 2 # = p v +1 h + i (10.63) 2 we get the relation among the two sets of coordinates (1 2  h)  (2 ;1 ;3 h) (10.64) By performing the gauge transformation # ! e;i~  ~ =v # (10.65)





~   ~ ei~  ~ =v + i @ e;i~  ~ =v ei~  ~ =v ~   ~ ! e;i~  ~ =v W (10.66) W 2 2 g Y ! Y (10.67) the Higgs lagrangian remains invariant in form, except for the substitution of # with p (0 (v + h)= 2). We will also denote old and new gauge elds with the same symbol. Dening 0 1 #d = 1  #u = 0 (10.68) 246

the mass terms for the scalar elds can be read by substituting, in the kinetic term, # with its expectation value # ! pv #d (10.69) 2 We get g g 0  v 0  v g g g 3 ; + ~ ;i 2 ~  W + 2 Y p #d = ;i p (; W + +W ) + 2 3 W + 2 Y p #d 2 2 2   = ;i pv pg W +#u ; 1 (gW 3 ; g0Y )#d (10.70) 2 2 2 Since #d and #u are orthogonals, the mass term is   v2 g2 jW +j2 + 1 (gW 3 ; g0Y )2 (10.71) 2 2 4 From eq. (10.37) we have tan  = g0=g, and therefore 0 sin  = p 2g 02  cos  = p 2g 02 (10.72) g +g g +g allowing us to write the mass term in the form





v2 g2 jW +j2 + g2 + g02 (W 3 cos  ; Y sin )2 (10.73) 2 2 4 From eq. (10.35) we see that the neutral elds combination is just the Z eld, orthogonal to the photon. In this way we get g2v2 jW +j2 + 1 g2 + g02 v2Z 2 (10.74) 4 2 4 Finally the mass of the vector bosons is given by MW2 = 14 g2v2 MZ2 = 41 (g2 + g02)v2 (10.75) Notice that the masses of the W  and of the Z are not independent, since their ratio is determined by the Weinberg angle, or from the gauge coupling constants MW2 = g2 = cos2  (10.76) MZ2 g2 + g02 Let us now discuss the parameters that we have so far in the theory. The gauge interaction introduces the two gauge couplings g and g0, which can also be expressed in terms of the electric charge (or the ne structure constant), and of the Weinberg angle. The Higgs sector brings in two additional parameters, the mass of the Higgs, 247

mh, and the expectation value of the eld #, v. The Higgs particle has not been yet discovered, and at the moment we have only an experimental lower bound on mh, from LEP, which is given by mh > 60 GeV . The parameter v can be expressed in terms of the Fermi coupling constant GF . In fact, from eq. (10.9) 2 G pF = 8Mg 2 (10.77) 2 W using the expression for MW2 , we get v2 = p 1  (246 GeV )2 (10.78) 2GF Therefore, the three parameters g, g0 and v can be traded for e, sin2  and GF . Another possibility is to use the mass of the Z

e2 =p (10.79) MZ2 = 14 p 1 2 2 2GF sin  cos  2GF sin2  cos2  to eliminate sin2  " s # 1 4

sin2  = 2 1 ; 1 ; p (10.80) 2GF MZ2 The rst alternative has been the one used before LEP. However, after LEP1, the mass of the Z is very well known MZ = (91:1863 0:0020) GeV (10.81) Therefore, the parameters that are now used as input in the SM are , GF and MZ . The last problem we have to solve is how to give mass to the electron, since it is impossible to construct bilinear terms in the electron eld which are invariant under the gauge group. The solution is that we can build up trilinear invariant terms in the electron eld and in #. Once the eld # acquires a non vanishing expectation value, due to the breaking of the symmetry, the trilinear term generates the electron mass. Recalling the behaviour of the elds under SU (2) U (1) (10.82) L 2 (2 ;1) R 2 (1 ;2) # 2 (2 +1) we see that the following coupling (Yukawian coupling) is invariant  +  ! ! ! LY = geL#R + h:c: = ge  ( )L (e)L ] 0 (e )R + h:c: (10.83) In the unitary gauge, which is obtained by using the transformation (10.65) both for # and for the lepton eld L, we get  0  LY = ge  (! )L (!e )L ] (v + h)=p2 (e )R + h:c: (10.84) 248

from which Since

LY = gpev (!e)L(e )R + pge (!e)L (e)Rh + h:c: 2

2

(10.85)

(!e)L(e)R ]y = (!e 1 + 5 e)y = ey 1 + 5 0e = !e 1 ; 5 e = (!e)R (e)L 2 2 2 (10.86) we get LY = gpev !ee + pge !e eh (10.87) 2 2 Therefore the symmetry breaking generates an electron mass given by me = ; gpev (10.88) 2 In summary, we have been able to reproduce all the phenomenological features of the V ; A theory and its extension to neutral currents. This has been done with a theory that, before spontaneous symmetry breaking is renormalizable. In fact, also the Yukawian coupling has only dimension 3. The proof that the renormalizability of the theory holds also in the case of spontaneous symmetry breaking (2 < 0) is absolutely non trivial. In fact the proof was given only at the beginning of the seventies by 't Hooft.

10.3 The electroweak interactions of quarks and the Kobayashi-Maskawa-Cabibbo matrix So far we have formulated the SM for an electron-neutrino pair, but it can be extended in a trivial way to other lepton pairs. The extension to the quarks is a little bit less obvious and we will follow the necessary steps in this Section. First of all we recall that in the quark model the nucleon is made up of three quarks, with composition, p  uud, n  udd. Then, the weak transition n ! p + e; + !e is obtained through d ! u + e; + !e (10.89) and the assumption is made that quarks are point-like particles as leptons. Therefore, it is natural to write quark current in a form analogous to the leptonic current (see eqs. (10.5) and (10.7))

Jd() = 2+! L  +L with

+L =

u  L dL

 

= 1 ;2 5 ud

249

(10.90) (10.91)

Phenomenologically, from the hyperon decays it was known that it was necessary a further current involving the quark s, given by Js() = 2+! 0L  +0L (10.92) with

u 

 

(10.93) = 1 ; 5 us L= 2 Furthermore the total current contributing to the Fermi interaction had to be of the form Jh = Jd cos C + Js sin C (10.94) where C is the Cabibbo angle, and +0

L sL

sin C = 0:21 0:03

(10.95)

The presence of the angle in the current was necessary in order to explain why the weak decays strangeness violating (that is the ones corresponding to the transition u $ s, as K + ! +) where suppressed with respect to the decays strangeness conserving. Collecting together the two currents we get Jh() = 2Q! L   QL (10.96) with and



  

QL = d cos  u+L s sin   duCL L C L C L

(10.97)

dCL = dL cos C + sL sin C (10.98) This allows us to assign QL to the representation (2 1=3) of SU (2) U (1). In fact, from Qem = Q3 + 21 QY (10.99) we get (10.100) QY (uL) = 2( 23 ; 21 ) = 13 and (Qem(s) = Qem(d) = ;1=3) (10.101) QY (dCL ) = 2(; 31 + 21 ) = 13 As far as the right-handed components are concerned, we assign them to SU (2) singlets, obtaining uR 2 (1 34 ) dCR 2 (1 ; 23 ) (10.102) 250

Therefore, the hadronic neutral currents contribution from quarks u d s are given by (10.103) j3 = Q! L  23 QL jY = 13 Q! L QL + 43 u!R uR ; 32 d!CR dCR (10.104) Since the Z is coupled to the combination j3 ; sin2 jY , it is easily seen that the coupling contains a bilinear term in the eld dC given by 2 (10.105) ; cos2  Z d!CL dCL + 13 sin2 Z d!CR dCR ! and s!d which produce strangeness changing This gives rise to terms of the type ds neutral current transitions . However the experimental result is that these transitions are strongly suppressed. In a more general way we can get this result from the simple observation that W3 is coupled to the current j3 which has an associated charge Q3 which can be obtained by commuting the charges Q1 and Q2 Q1  Q2 ]

=

Z Z y y 3 3 d ~xd ~y QL1 QL QL 2QL ] = d3~x QyL 1 2 ]QL Z Z

= i

d3~x QyL 3 QL = i

d3~x(uyLuL ; dCL ydCL )

(10.106)

The last term is just the charge associated to a Flavour Changing Neutral Current (FCNC). We can compare the strength of a FCNC transition, which from

dC ydC = dyd cos2 C + sys sin2 C + (dys + syd) sin C cos C (10.107) is proportional to sin C cos C , with the strength of a &avour changing charged current transition as u ! s, which is proportional to sin C . From this comparison we expect the two transitions to be of the same order of magnitude. But if we compare K 0 ! +; induced by the neutral current , with K + ! + induced by the charged current (see Fig. 2.3), using BR(K 0 ! +;), we nd ;(K 0 ! +;)  6  10;5 (10.108) ;(K + ! + ) This problem was solved in 1970 by the suggestion (Glashow Illiopoulos and Maiani, GIM) that another quark with charge +2/3 should exist, the charm. Then, one can form two left-handed doublets u  c  L 1 2 QL = dC  QL = sCL (10.109) L L where sCL = ;dL sin C + sL cos C (10.110) 251

d K

Z

0



s

µ

u

+

µ

W

+

K –



s

+

µ

+

νµ

Fig. 10.3.1 - The neutral current avour changing process K 0 ! +; and he charged current avour changing process K + ! + . is the combination orthogonal to dCL . As a consequence the expression of Q3 gets modied Z 3 Q = d3~x (uyLuL + cyLcL ; dCL ydCL ; sCL ysCL ) (10.111) But

dCL ydCL + sCL ysCL = dyLdL + syLsL since the transformation matrix  dC   cos  sin   d  C C L L = C sL ; sin C cos C sL

(10.112) (10.113)

is orthogonal. The same considerations apply to QY . This mechanism of cancellation turns out to be eective also at the second order in the interaction, as it should be, since we need a cancellation at least at the order G2F in order to cope with the experimental bounds. In fact, one can see that the two second order graphs contributing to K 0 ! +; cancel out except for terms coming from the mass dierence between the quarks u and c. One gets

A(K 0 ! +;)  G2F (m2c ; m2u)

(10.114)

allowing to get a rough estimate of the mass of the charm quark, mc  1  3 GeV . The charm was discovered in 1974, when it was observed the bound state c!c, with J PC = 1;;, known as J=. The mass of charm was evaluated to be around 1:5 GeV . In 1977 there was the observation of a new vector resonance, the -, interpreted as a bound state !bb, where b is a new quark, the bottom or beauty, with mb  5 GeV . Finally in 1995 the partner of the bottom, the top, t, was discovered at Fermi Lab. The mass of the top is around 175 GeV . We see that in association with three 252

leptonic doublets

  e

 

 

 (10.115) e; L  ; L  ; L there are three quark doublets u c t (10.116) d L s L b L We will show later that experimental evidences for the quark b to belong to a doublet came out from PEP and PETRA much before the discovery of top. Summarizing we have three generations of quarks and leptons. Each generation includes two left-handed doublets   u  A  A A = 1 2 3 (10.117) e; d A L



A L

and the corresponding right-handed singlets. Inside each generation the total charge is equal to zero. In fact 2 1 X tot (10.118) Qi = Qf = 0 ; 1 + 3  3 ; 3 = 0 f 2gen where we have also taken into account that each quark comes in three colors. This is very important because it guarantees the renormalizability of the SM. In fact, in general there are quantum corrections to the divergences of the currents destroying their conservation. However, the conservation of the currents coupled to the YangMills elds is crucial for the renormalization properties. In the case of the SM for the electroweak interactions one can show that the condition for the absence of these corrections (Adler, Bell, Jackiw anomalies) is that the total electric charge of the elds is zero. We will complete now the formulation of the SM for the quark sector, showing that the mixing of dierent quarks arises naturally from the fact that the quark mass eigenstates are not necessarily the same elds which couple to the gauge elds. We will denote the last ones by  0   u0  0 0 A `AL = e0  qAL = d0A  A = 1 2 3 (10.119) A L A L and the right-handed singlets by e0AR  u0AR d0AR (10.120) We assume that the neutrinos are massless and that they do not have any righthanded partner. The gauge interaction is then Y (fL )  X!  ~  Q 0 ~+g L = fAL i@ + g 2  W Y   fAL 2 fL Y (fR )  X!  Q 0 + fAR i@ + g 2   fAR (10.121) fR 253

0 , and fAR = e0  u0  d0 . We recall also the weak hyperwhere fAL = `0AL qAL AR AR AR charge assignments 0 )= 1 QY (`0AL) = ;1 QY (qAL 3 QY (e0AR ) = ;2 QY (u0AR) = 43 (10.122) QY (d0AR ) = ; 23 Let us now see how to give mass to quarks. We start with up and down quarks, and a Yukawian coupling given by

gdq!L #dR + h:c:

(10.123)

wherep# is the Higgs doublet. When this takes its expectation value, h#i = (0 v= 2), a mass term for the down quark is generated gpd v ;d! d + d! d  = gpdv dd ! (10.124) 2 LR R L 2

p

corresponding to a mass md = ;gd v= 2. In order to give mass to the up quark we need to introduce the conjugated Higgs doublet dened as #~ = i2

#?

 (0)?  = + ?

(10.125)

; ( )

Observing that QY (#~ ) = ;1, another invariant Yukawian coupling is given by guq!L #~ uR + h:c: (10.126)

p

giving mass to the up quark, mu = ;guv= 2. The most general Yukawian coupling among quarks and Higgs eld is then given by

LY =

X e AB

u q!0 # 0 d 0 0 gAB `!0AL#e0BR + gAB AL ~ uBR + gAB q!AL #dBR



(10.127)

i , with i = e u d, are arbitrary 3  3 matrices. When we shift # by its where gAB vacuum expectation value we get a mass term for the fermions given by X ; e !0 0 u u!0 u0 + g d d!0 d0  Lfermion mass = pv gAB `ALeBR + gAB (10.128) AL BR AB AL BR 2 AB

Therefore we obtain three 3  3 mass matrices i = ; pv g i  i = e u d MAB 2 AB 254

(10.129)

These matrices give mass respectively to the charged leptons and to the up and down of quarks. They can be diagonalized by a biunitary transformation

Mdi = S yM i T

(10.130)

where S and T are unitary matrices (depending on i) and Mdi are three diagonal matrices. Furthermore one can take the eigenvalues to be positive. This follows from the polar decomposition of an arbitrary matrix

p

M = HU

(10.131)

where H y = H = MM y is a hermitian denite positive matrix and U y = U ;1 is a unitary matrix. Therefore, if the unitary matrix S diagonalizes H we have

Hdi = S yHS = S yMU ;1 S = S yMT

(10.132)

with T = U ;1 S is a unitary matrix. Each of the mass terms has the structure !L0 MR0 . Then we get !L0 MR0 = !L0 S (S yMT )T yR0  !LMd R (10.133) where we have dened the mass eigenstates

L0 = SL

R0 = TR

(10.134)

We can now express the charged current in terms of the mass eigenstates. We have 0   q 0 = 2!u0  d0 = 2! J+(h) = 2!qAL uAL((S u);1S d )AB dBL  ; AL AL  AL

Dening we get with

V = (S u);1S d

V y = V ;1

V) Jh(+) = 2!uALd(AL

(10.135) (10.136) (10.137)

V) = V d d(AL (10.138) AB BL The matrix V is called the Cabibbo-Kobayashi-Maskawa (CKM) matrix, and it describes the mixing among the down type of quarks (this is conventional we could have chosen to mix the up quarks as well). We see that its physical origin is that, in general, there are no relations between the mass matrix of the up quarks, M u, and the mass matrix of the down quarks, M d . The neutral currents are expressions which are diagonal in the "primed" states, therefore their expression is the same also in the basis of the mass eigenstates, due to the unitarity of the various S matrices. In fact jh(3)  u!0Lu0L ; d!0Ld0L = u!L(S u);1S uuL ; d!L(S d );1S ddL = u!LuL ; d!LdL (10.139)

255

For the charged leptonic current, since we have assumed massless neutrinos we get 0  e J`(+) = 2!AL(S e)AB eBL  2!AL (10.140)  AL where L0 = (S e)yL is again a massless eigenstate. Therefore, there is no mixing in the leptonic sector, among dierent generations. However this is tied to our assumption of massless neutrinos. By relaxing this assumption we may generate a mixing in the leptonic sector producing a violation of the dierent leptonic numbers. It is interesting to discuss in a more detailed way the structure of the CKM matrix. In the case we have n generations of quarks and leptons, the matrix V , being unitary, depends on n2 parameters. However one is free to choose in an arbitrary way the phase for the 2n up and down quark elds. But an overall phase does not change V . Therefore the CKM matrix depends only on n2 ; (2n ; 1) = (n ; 1)2 (10.141) parameters. We can see that, in general, it is impossible to choose the phases in such a way to make V real. In fact a real unitary matrix is nothing but an orthogonal matrix which depends on n(n ; 1) (10.142) 2 real parameters. This means that V will depend on a number of phases given by number of phases = (n ; 1)2 ; n(n ; 1) = (n ; 1)(n ; 2) (10.143) 2 2 Then, in the case of two generations V depends only on one real parameter (the Cabibbo angle). For three generations V depends on three real parameters and one phase. Since the invariance under the discrete symmetry CP implies that all the couplings in the lagrangian must be real, it follows that the SM, in the case of three generations, implies a CP violation. A violation of CP has been observed experimentally by Christensen et al. in 1964. These authors discovered that the eigenstate of CP , with CP = ;1 ;  jK20i = p1 jK 0i ; jK! 0i (10.144) 2 decays into a two pion state with CP = +1 with a BR BR(K20 ! + ;)  2  10;3 (10.145) It is not clear yet, if the SM is able to explain the observed CP violation in quantitative terms. To complete this Section we give the experimental values for some of the matrix elements of V . The elements Vud and Vus are determined through nuclear  -, K ; and hyperon-decays, using the muon decay as normalization. One gets jVudj = 0:9736 0:0010 (10.146) 256

and

jVusj = 0:2205 0:0018

(10.147) The elements Vcd and Vcs are determined from charm production in deep inelastic scattering  + N !  + c + X . The result is

jVcdj = 0:224 0:016 and

(10.148)

jVcsj = 1:01 0:18

(10.149) The elements Vub and Vcb are determined from the b ! u and b ! c semi-leptonic decays as evaluated in the spectator model, and from the B semi-leptonic exclusive decay B ! D! ?`` . One gets jVubj = 0:08 0:02 (10.150) jVcbj and jVcbj = 0:041 0:003 (10.151) More recently from the branching ratio t ! Wb, CDF (1996) has measured jVtb j

jVtb j = 0:97 0:15 0:07

(10.152)

A more recent comprehensive analysis gives the following 90 % C.L. range for the various elements of the CKM matrix 0 0:9745 ; 0:9757 0:219 ; 0:224 0:002 ; 0:005 1 V = @ 0:218 ; 0:224 0:9736 ; 0:9750 0:036 ; 0:046 A (10.153) 0:004 ; 0:014 0:034 ; 0:046 0:9989 ; 0:9993

10.4 The parameters of the SM It is now the moment to count the parameters of the SM. We start with the gauge sector, where we have the two gauge coupling constants g and g0. Then, the Higgs sectorpis specied by the parameters  and the self-coupling . We will rather use v = ;2 = and m2h = ;22. Then we have the Yukawian sector, that is that part of the interaction between fermions and Higgs eld giving rise to the quark and lepton masses. If we assume three generations and that the neutrinos are all massless we have three mass parameters for the charged leptons, six mass parameters for the quarks (assuming the color symmetry) and four mixing angles. In order to test the structure of the SM the important parameters are those relative to the gauge and to the Higgs sectors. As a consequence one assumes that the mass matrix for the fermions is known. This was not the case till two years ago, before the top discovery, and its mass was unknown. The masses of the vector bosons can be expressed in terms of the previous parameters. However, the question arises about 257

the better choice of the input parameters. Before LEP1, the better choice was to use quantities known with great precision related to the parameters (g g0 v). The choice was to use the ne structure constant 1 (10.154) = 137:0359895(61) and the Fermi constant

GF = 1:166389(22)  10;5 GeV ;2

(10.155)

as measured from the  -decay of the muon. Then, most of the experimental research of the seventies and eighties was centered about the determination of sin2 . The other parameters as (mt mh) aect only radiative corrections, which, in this type of experiments can be safely neglected due to the relatively large experimental errors. Therefore, in order to relate the set of parameters (  GF  sin2 ) to (g g0 v) we can use the tree level relations. The situation has changed a lot after the beginning of running of LEP1. In fact, the mass of the Z has been measured with great precision, MZ =MZ  2  10;5. In this case the most convenient set is (  GF  MZ ), with

MZ = (91:1863 0:0020) GeV

(10.156)

Also the great accuracy of the experimental measurements requires to take into account the radiative corrections. We will discuss this point in more detail at the end of these lectures. However we notice that one gets informations about parameters as (mt  mh) just because they aect the radiative corrections. In particular, the radiative corrections are particularly sensitive to the top mass. In fact, LEP1 was able to determine the top mass, in this indirect way, before the top discovery.

258

Appendix A A.1 Properties of the real antisymmetric matrices

Given a real and antisymmetric matrix n  n A, we want to show that it is possible to put it in the form (7.61) by means of an orthogonal transformation. To this end, let us notice that iA is a hermitian matrix and therefore it can be diagonalized through a unitary transformation U : U (iA)U y = Ad (A.1) where Ad is diagonal and real. The eigenvalues of iA satisfy the equation detjiA ;   1j = 0 (A.2) Since AT = ;A it follows detjiA ;   1j = detj(iA ;   1)T j = detj ; iA ;   1j = 0 (A.3) Therefore, if  is an eigenvalue, the same is for ;. lo e'. It follows that Ad can be written in the form 2 0 0 0   3 66 01 ;1 0 0    77 66 0 0 2 0    77 (A.4) Ad = 66 0 0 0 ;2    77

66  4 

  

  

  

   77   5   

Notice that if n is odd, Ad has necessarily a zero eigenvalue, that we will write as (Ad )nn. We will assume also to order the matrix in such a way that all the i are positive.Each submatrix of the type  0  i (A.5) 0 ;i 259



can be put in the form through the unitary 2  2 matrix

In fact Dening

V2

0 ii ;ii 0





V2 = p1 1i 1i 2





(A.6)



(A.7)



0 V y = 0 ii 0 ;i 2 ;ii 0 i

2V 0   66 02 V2   V = 66     4    

with Vnn = 0 for n odd, we get



(A.8)

3

  77  77 5    

2 0 i 0 0 66 ;i1 01 0 0 66 0 0 0 i20 y V Ad V = 66 0 0 ;i 0 66    1  4     







      

(A.9)

      

3

  77  77  77 = iAs  77 5 

(A.10)

with As dened in (7.61). Therefore (V U )(iA)(V U )y = iAs or

(A.11)

(V U )(A)(V U )y = As (A.12) But A and As are real matrices, and therefore also V U must be real. However being V U real and unitary it must be orthogonal.

260

Related Documents