Skip to main content

Near-field Propagation

· 14 min read
Yuri R. Tonin
Physics Engineer

This post is a (hopefully quick) guide on the most important equations for computing free space propagation of a wavefront in the Fresnel regime, and how to do it properly.

Helmholtz equation and elementary waves

First, some background. Consider a monochromatic wave ψ\psi in vacuum. It is governed by Helmholtz equation:

2ψ+k2ψ=0\nabla^2\psi + k^2 \psi = 0

where k=2π/λk = 2 \pi / \lambda is the wavenumber. The solutions to this equation are called elementary waves, which can either be

  • plane waves:
ψ(r)=Aexp(jkr)=Aexp[j(kxx+kyy+kzz)]\psi(\mathbf{r}) = A \exp{(-j \mathbf{k} \cdot \mathbf{r})} = A \exp{[-j (k_x x + k_y y + k_z z)]}
  • or spherical waves:
ψ(r)=Arexp(jkr)\psi(\mathbf{r}) = \frac{A}{r} \exp{(-j k r)}

The names indicate that the wavefronts (i.e. the surfaces of constant phase) are either planes or spheres. The term before the exponential is referred to as the amplitude of the wave, whereas the imaginary argument of the exponent is named the phase. For a spherical wave, the amplitude is modulated by the distance from the source due to the 1/r1/r term.

The figure below illustrates the propagation of a spherical wave. As we will understand ahead, near the source, the wavefronts are spherical. At small angles with the propagation axis, it can eventually be approximated by paraboloids. For large distances, one may approximate them by planes waves.

propagationplanes

In the real world, though, there is no free lunch. We usually encounter waves that are way more complex than these elementary waves. Fortunately, we can make things easier using the Angular Spectrum of waves.

Angular Spectrum Method (ASM)

An arbitrary wavefront ψ(x1,y1)\psi(x_1,y_1) at plane z=0z=0 can be decomposed into plane wave components using the Fourier transform (FT):

ψ(x1,y1)=12πψ^(kx,ky)e[j(kxx1+kyy1)]dkxdky\psi(x_1,y_1) = \frac{1}{2 \pi} \iint \hat{\psi}(k_x,k_y) e^{[-j (k_x x_1 + k_y y_1)]} dk_x dk_y

In other words, the wavefront is composed by a sum of plane waves with phase ϕ=kxx1+kyy1\phi = k_x x_1 + k_y y_1, each with amplitude ψ^(kx,ky)\hat{\psi}(k_x,k_y). Any wavefront can be decomposed in such fashion, which makes things easier.

Now, how can one obtain the wavefront at a different plane (x2,y2,z)(x_2,y_2,z) in space? Well, instead of working directly with ψ(x1,y1)\psi(x_1,y_1), one can propagate each plane wave component separately and then sum the contribution of all these elementary waves at zz. This is done using the free space propagator:

H(kx,ky)=exp(jzk2kx2ky2)H(k_x,k_y)=\exp{(j z \sqrt{k^2 - k_x^2 - k_ y^2})}

such that the resulting wave is

ψ(x2,y2,z)=12πψ^1(kx,ky)exp(jzk2kx2ky2)free space propagator multiplies each componentej(kxx2+kyy2)dkxdky\psi(x_2,y_2,z) = \frac{1}{2 \pi} \iint \underbrace{\hat{\psi}_1(k_x,k_y) \exp{(j z \sqrt{k^2 - k_x^2 - k_ y^2}})}_{\text{free space propagator multiplies each component}} e^{-j (k_x x_2 + k_y y_2)}dk_x dk_y

This expression can be written in operator form as

ψ(x2,y2,z)=F1{F{ψ(x1,y1)}H(kx,ky)}\psi(x_2,y_2,z) = \mathcal{F}^{-1}\{ \mathcal{F}\{\psi(x_1,y_1)\} H(k_x,k_y) \}

The above expression arises directly from the Raleigh-Sommerfeld diffraction solution (see reference [4] for a more detailed explanation). The only assumption here is that the distance between source and observation points be much greater than the wavelength λ\lambda.

Important approximations

We can apply approximations to get simpler forms of the propagator. For that, we make use of the paraxial approximation. It is common to hear that such approximation means the wave makes small angles with the optical axis. However, a wave may have plane wave components spreading in all directions. Hence, a more correct way of saying this is that the non-negligible plane wave components make a small angle with the optical axis [2]. A paraxial wave can also be thought as one that varies much more in the trasversal x,yx,y plane than in the longitudinal zz direction.

The paraxial Helmholtz equation governs paraxial waves, and it presents two important solutions:

  • Paraboloidal wave

This is also known as the Fresnel diffraciton integral. It is the paraxial approximation to the spherical wave:

ψ(x2,y2)=eikziλz+ψ(x1,y1)eik2z[(x2x1)2+(y2y1)2]dx1dy1\psi(x_2,y_2) = \frac{e^{ikz}}{i \lambda z} \iint_{-\infty}^{+\infty} \psi(x_1,y_1) e^{ \frac{i k}{2 z}\left[{(x_2-x_1)^2+(y_2-y_1)^2} \right]} dx_1 dy_1

In this case, we have a plane wave component from the eikze^{ikz} term which gets modulated by the phase term inside the integral. Note that (x2+y2)/z=const(x^2+y^2)/z = \text{const} is the equation of the paraboloid of revolution. Hence, the plane waves get "distorted" into paraboloids. For large distances zz, this phase term become negligible. Also, the variation of the amplitude with the term 1/z1/z becomes less relevant, which justifies the approximation of a spherical wave by a plane wave.

  • Gaussian beam:

The Gaussian beam is a more complex, but extremely useful solution which appears in lasers, optical communication and many optical instruments. It is described by:

ψ(x,y,z)=ψ0w0w(z)exp(x2+y2w(z)2)exp(ikzikx2+y22R(z)+iζ(z))\psi(x, y, z) = \psi_0 \frac{w_0}{w(z)} \exp\left(-\frac{x^2 + y^2}{w(z)^2}\right) \exp\left(-ikz - ik\frac{x^2 + y^2}{2R(z)} + i\zeta(z)\right)

If we calculate the intensity I=ψ(x,y,z)2I=|\psi(x,y,z)|^2, we get

I(x,y,z)=ψ02w02w(z)2exp(2x2+y2w(z)2)I(x,y,z) = |\psi_0|^2 \frac{w_0^2}{w(z)^2} \exp\left(-2\frac{x^2 + y^2}{w(z)^2}\right)

The xyxy dependence of the intensity is Gaussian, which explains the name. We can see the transversal slice I(x,y)I(x,y) at z=0z=0 in the figure below, which shows exactly such Gaussian distribution.

gaussianbeam
Transversal and longitudional cuts to a Gaussian beam

As the equation shows, there are a few parameters to model the bahavior of a Gaussian beam. First, it is useful to define the Rayleigh length zRz_R

zR=πw02λz_R = \frac{\pi w_0^2}{\lambda}

where w0w_0 is called the beam waist, which corresponds to the minimum radius of the beam, chosen to be at z=0z=0. The Rayleigh length is used to write the other quantities of interest, namely:

  • Beam radius:
w(z)=w01+(zzR)2w(z) = w_0 \sqrt{1 + \left(\frac{z}{z_R}\right)^2}
  • Radius of curvature:
R(z)=z[1+(zRz)2]R(z) = z \left[1 + \left(\frac{z_R}{z}\right)^2\right]
  • Gouy phase:
ζ(z)=arctan(zzR)\zeta(z) = \arctan\left(\frac{z}{z_R}\right)
propagationplanes

A few comments on these equations to help us better understand the Gaussian beam:

  • The transversal size of the beam is indeed minimum at w(z=0)=w0w(z=0) = w_0. This size increases in both ±z\pm z directions. The length from zR-z_R to zRz_R is defined to be the depth of focus bb of the beam.

  • The Gouy phase represents a phase delay from π/2toπ/2-\pi/2 to \pi/2. If we combine two of the phase terms, we have i(kzζ(z))i(kz-\zeta(z) ). Hence, it can be understood as a phase deviation from the plane wave scenario.

  • The third phase term relates to the radius of curvature, which causes the wavefront to curve. R(z)R(z) is infinite at z=0z=0, meaning we have a plane wave. The curvature then decreases up to a minimum at z=zRz=z_R. It then increases again to assimptoticaly reach R(z)zR(z)\approx z for big zz. A good enough approximation is to use R(z>2zR)=zR(z>2z_R) = z.

Using the Fresnel diffraction integral

Let us rewrite the Fresnel diffraction integral:

ψ(x2,y2)=eikziλz+ψ(x1,y1)eik2z[(x2x1)2+(y2y1)2]dx1dy1(1)\psi(x_2,y_2) = \frac{e^{ikz}}{i \lambda z} \iint_{-\infty}^{+\infty} \psi(x_1,y_1) e^{ \frac{i k}{2 z}\left[{(x_2-x_1)^2+(y_2-y_1)^2} \right]} dx_1 dy_1 \tag{1}

We can expand the exponent inside the integral and move some terms out, arriving at the following form:

ψ(x2,y2)=eikziλzeik2z[x22+y22]+ψ(x1,y1)eik2z[x12+y12]eik2z(x2x1+y2y1)dx1dy1(2)\psi(x_2,y_2) = \frac{e^{ikz}}{i \lambda z} e^{ \frac{i k}{2 z}\left[{x_2^2+y_2^2} \right]} \iint_{-\infty}^{+\infty} \psi (x_1,y_1) e^{ \frac{i k}{2 z} \left[{x_1^2+y_1^2} \right] } e^{ \frac{i k}{2 z}{(x_2 x_1+y_2 y_1)} }dx_1 dy_1 \tag{2}

The two approaches are named the Convolution and Fourier Transform forms of the Fresnel Diffraction integral. Although mathematically identical, they are numerically different. One must be careful to choose between them depending on the problem at hand.

Convolution form

- Impulse Response method

A convolution between two functions f and g is defined by

fg=+f(t)g(tτ)dτf \ast g = \int_{-\infty}^{+\infty} f(t) g(t-\tau) d\tau

Equation (1) is nothing more than a convolution of ψ\psi with the following impulse response function h(x,y)h(x,y):

h(x,y)=eikziλzeik2z[x2+y2](3) h(x,y) = \frac{e^{ikz}}{i \lambda z} e^{ \frac{i k}{2 z}\left[{x^2+y^2} \right]} \tag{3}

Hence

ψ(x2,y2)=eikziλz+ψ(x1,y1)h(x2x1,y2y1)dx1dy1\psi(x_2,y_2) = \frac{e^{ikz}}{i \lambda z} \iint_{-\infty}^{+\infty} \psi(x_1,y_1) h(x_2-x_1,y_2-y_1) dx_1 dy_1
- Transfer function method

Making use of the convolution theorem, it can be calculated in Fourier space as

ψ(x2,y2)=F1{F{ψ(x1,y1)}H(fx1,fy1)}(4) \psi(x_2,y_2) = \mathcal{F}^{-1}\{ \mathcal{F}\{ \psi(x_1,y_1) \} H(f_{x_1},f_{y_1}) \} \tag{4}

where

H(fx1,fy1)=F{h(x1,y1)}=ejkzejπλz(fx2+fy2)(5) H(f_{x1},f_{y1}) = \mathcal{F}\{h(x_1,y_1)\} = e^{jkz} e^{j \pi \lambda z (f_{x}^2+f_{y}^2)} \tag{5}

Note that in computing equation (4), one should pre-computes H(fx1,fy1)H(f_{x_1},f_{y_1}) and spares one calculation of the Fourier Transform.

Chirp functions

Equations (3) and (5) define what we call a chirp function. For the IR method, the chirp function is sampled in real space:

C1=eik2z(x12+y12)C_1 = e^{ \frac{i k}{2 z} \left({x_1^2+y_1^2} \right) }

whereas for the SFT it is samples in reciprocal space:

C2=ejπλz(fx2+fy2)C_2 = e^{j \pi \lambda z (f_{x}^2+f_{y}^2)}

This is the reason why they are numerically different. Correctly evaluation of the Fast Fourier transform requires proper sampling of the functions to avoid aliasing, according to the Nyquist sampling theorem. Now, the discrete Fourier Transform imposes that

Δx=1NΔf\Delta x = \frac{1}{N \Delta f}

where NN is the number of points in your matrix. In other words, for a given N there is an inverse relationship between the sampling Δx\Delta x in real-space and the sampling Δf\Delta f in reciprocal space. This indicates that, the best one chirp function is sampled, the worst the other one will be. This is as strong indication that one method might be superior to the other depending on the situation.

How to choose the best method?

Now, to the practical side of things. For selecting the best method for your application, one may use a rule of thumb in the table below (although for details I recommend taking a closer look at the references!). LL represents the size of the array and DD is width of the source field inside of it (D<LD<L).

Criteria       Sampling                      Comment
Δx>λzL\Delta x > \frac{\lambda z}{L}TF: oversampled.     IR: undersampledIR method causes periodic copies. TF is the method of choice. Observation plane width will be limited to width D+λz/ΔxD + \lambda z / \Delta x
Δx=λzL\Delta x = \frac{\lambda z}{L}both TF and IR: critical samplingTF and IR are identical
Δx<λzL\Delta x < \frac{\lambda z}{L}TF: undersampled. IR: oversampledIt depends. Needs to evaluate if source bandwidth satisfies BL2λzB \le \frac{L}{2 \lambda z}. If yes, use TF. Otherwise, IR may be better.

Fourier Transform form

- Single Fourier Transform

Equation (2), on the other hand, can be seen as a Fourier Transform:

ψ(x2,y2)=eikziλzeik2z[x22+y22]F{ψ(x1,y1)eik2z[x12+y12]} \psi(x_2,y_2) = \frac{e^{ikz}}{i \lambda z} e^{ \frac{i k}{2 z}\left[{x_2^2+y_2^2} \right]} \mathcal{F}\{ \psi(x_1,y_1) e^{ \frac{i k}{2 z} \left[{x_1^2+y_1^2} \right] } \}

Notice that it can also be rewritten using h(x,y)h(x,y), such that

ψ(x2,y2)=eik2z[x22+y22]F{ψ(x1,y1)h(x1,y1)}}(6) \psi(x_2,y_2) = e^{ \frac{i k}{2 z}\left[{x_2^2+y_2^2} \right]} \mathcal{F}\{ \psi(x_1,y_1) h(x_1,y_1) \} \} \tag{6}

In contrast to equation (4), the above expression requires a single Fourier Transform computation.

Grid spacing

In practice, we will compute the propagation on, well, a computer. In other words, we shall use discrete matrices. An important difference between the Single Fourier Trasform and Convolution methods is that the latter always involve an inverse Fourier transform, whereas the first does not. This has consequences on the final grid spacing of the resulting matrix.

If you perform an inverse Fourier transform at the end, the pixel sizes p1p_1 in the input and p2p_2 in the output fields will be the same. No problem there. However, if only forward Fourier transform is applied, the pixel sizes relate according to:

p2=λzNp1p_2 = \frac{\lambda z}{N p_1}
- Fresnel propagator in 2 steps

One alternative to select the pixel size p2p_2 in the output plane is to perform the propagation in two steps. The idea is to first propagate the wavefront from z1z_1 to an intermediary plane ziz_i, and then from ziz_i to the final desired plane z2z_2. We introduce a scaling parameter mm such that

p2=mp1p_2 = m p_1

It turns out that this scaling paremeter relates to the positions in the zz axis as

m=z2ziziz1=Δz2Δz1m = \left\lvert\frac{z_2 - z_i}{z_i - z_1} \right\rvert = \left\lvert\frac{\Delta z_2}{\Delta z_1} \right\rvert

In that case, the wavefront ψ\psi is obtained via

ψ(x2,y2,z2)=z2z1ej2πz/λexp(jπλzz1z2z2(x22+y22))F1{exp(jπλzz1z2(fx12+fy12))F{ψ1(x1,y1,z1)exp(jπλzz1z2z1(x12+y12))}}\begin{align*} \psi(x_2,y_2,z_2) & = \frac{z_2}{z_1} e^{j 2 \pi z/\lambda} \exp{\left( -j \frac{\pi}{\lambda z} \frac{z_1-z_2}{z_2} (x_2^2+y_2^2) \right)} \\ & \mathcal{F}^{-1}\{\exp{\left( -j\pi \lambda z \frac{z_1}{z_2}(f_{x1}^2+f_{y_1}^2)\right)} \mathcal{F}\{ \psi_1(x_1,y_1,z_1) \exp{\left( j \frac{\pi}{\lambda z}\frac{z_1-z_2}{z_1}(x_1^2+y_1^2)\right)}\}\} \end{align*}

But caution: the above expression presents an issue. It requires computing both chirp functions in real and reciprocal spaces. Therefore, unless working with critical sampling, the expression above will always be partly wrong due to undersampling of one of the two expressions.

Fresnel scaling theorem

A particularly common scenario is that of divergent or cone-beam sources. In these cases, the beam divergence is quite large for the lengths of interest, such that the expansion of the beam becomes relevant.

The Fresnel diffraction integral is, naturally, still valid. Nonetheless, if we are dealing with large propagation distances and a large divergence, a beam with only microns in size may expand hundreds of times in size. Therefore, it may be necessary to work with matrix with dozens of thousands of pixels, making you calculation infeasible due to computer memory limitations.

Fortunately, there exists a clever workaround in this case, called the Fresnel Scaling Theorem (FST). Essentially, the FST states that a divergent spherical wave has an equivalent plane-wave equivalent with scaled pixels and distances.

In the cone-beam case, the focus to sample distance z1z_1 and the sample to output plane distance z2z_2 define the magnification MM of the system:

M=z1+z2z1M = \frac{z_1 + z_2}{z_1}

This quantity is used then to convert from the cone to the equivalent parallel beam geometry. Note that we recover the parallel case (M=1M=1) when z1+z_1 \rightarrow + \infty.

Equivalent geometry #1

The most well know geometry puts the reference on the pixel size dd of the input plane. In this case, the distance between input and output planes and the output pixel size are downscaled according to

z2z2/Mz_2 \rightarrow z_2/M

and

DD/MD \rightarrow D/M

Equivalent geometry #2

A less well-known equivalent geometry [6] puts instead the reference on te pixel size DD of the output plane. In that case, an upscaling happens:

z2Mz2z_2 \rightarrow M z_2

and

DMDD \rightarrow M D

Which geometry to use will depend on your specific application, whichever make your calculations easier. For a particuarly nice implementation of geometry #2, see reference [6].

fstgeometries
Original (left) and the two equivalent geometries (#1, center. #2, right).

Validity of the FST

I would like to emphasize here that the Fresnel Scaling theorem assumes a point source, i.e., spherical waves. Therefore, to apply the FST the waves reaching our object in the input plane must be spherical. In many practical situations that may not be the case.

Referring back to the Gaussian beam equations, the phase is not yet spherical within the Rayleigh length. A minimum distance 2zR2z_R from the beam waist is required for application of the FST, since only then the phase wavefront can be approximatted by a spherical wavefront again [7].

References