Near-field Propagation
This post is a (hopefully quick) guide on the most important equations for computing free space propagation of a wavefront in the Fresnel regime, and how to do it properly.
Helmholtz equation and elementary waves
First, some background. Consider a monochromatic wave in vacuum. It is governed by Helmholtz equation:
where is the wavenumber. The solutions to this equation are called elementary waves, which can either be
- plane waves:
- or spherical waves:
The names indicate that the wavefronts (i.e. the surfaces of constant phase) are either planes or spheres. The term before the exponential is referred to as the amplitude of the wave, whereas the imaginary argument of the exponent is named the phase. For a spherical wave, the amplitude is modulated by the distance from the source due to the term.
The figure below illustrates the propagation of a spherical wave. As we will understand ahead, near the source, the wavefronts are spherical. At small angles with the propagation axis, it can eventually be approximated by paraboloids. For large distances, one may approximate them by planes waves.
data:image/s3,"s3://crabby-images/430a4/430a4c927061998c5ea6f138b58a96c5995aefe6" alt="propagationplanes"
In the real world, though, there is no free lunch. We usually encounter waves that are way more complex than these elementary waves. Fortunately, we can make things easier using the Angular Spectrum of waves.
Angular Spectrum Method (ASM)
An arbitrary wavefront at plane can be decomposed into plane wave components using the Fourier transform (FT):
In other words, the wavefront is composed by a sum of plane waves with phase , each with amplitude . Any wavefront can be decomposed in such fashion, which makes things easier.
Now, how can one obtain the wavefront at a different plane in space? Well, instead of working directly with , one can propagate each plane wave component separately and then sum the contribution of all these elementary waves at . This is done using the free space propagator:
such that the resulting wave is
This expression can be written in operator form as
The above expression arises directly from the Raleigh-Sommerfeld diffraction solution (see reference [4] for a more detailed explanation). The only assumption here is that the distance between source and observation points be much greater than the wavelength .
Important approximations
We can apply approximations to get simpler forms of the propagator. For that, we make use of the paraxial approximation. It is common to hear that such approximation means the wave makes small angles with the optical axis. However, a wave may have plane wave components spreading in all directions. Hence, a more correct way of saying this is that the non-negligible plane wave components make a small angle with the optical axis [2]. A paraxial wave can also be thought as one that varies much more in the trasversal plane than in the longitudinal direction.
The paraxial Helmholtz equation governs paraxial waves, and it presents two important solutions:
- Paraboloidal wave
This is also known as the Fresnel diffraciton integral. It is the paraxial approximation to the spherical wave:
In this case, we have a plane wave component from the term which gets modulated by the phase term inside the integral. Note that is the equation of the paraboloid of revolution. Hence, the plane waves get "distorted" into paraboloids. For large distances , this phase term become negligible. Also, the variation of the amplitude with the term becomes less relevant, which justifies the approximation of a spherical wave by a plane wave.
- Gaussian beam:
The Gaussian beam is a more complex, but extremely useful solution which appears in lasers, optical communication and many optical instruments. It is described by:
If we calculate the intensity , we get
The dependence of the intensity is Gaussian, which explains the name. We can see the transversal slice at in the figure below, which shows exactly such Gaussian distribution.
data:image/s3,"s3://crabby-images/65bd8/65bd8430876f3ed2e684591e255864de9c2a4d6a" alt="gaussianbeam"
As the equation shows, there are a few parameters to model the bahavior of a Gaussian beam. First, it is useful to define the Rayleigh length
where is called the beam waist, which corresponds to the minimum radius of the beam, chosen to be at . The Rayleigh length is used to write the other quantities of interest, namely:
- Beam radius:
- Radius of curvature:
- Gouy phase:
A few comments on these equations to help us better understand the Gaussian beam:
-
The transversal size of the beam is indeed minimum at . This size increases in both directions. The length from to is defined to be the depth of focus of the beam.
-
The Gouy phase represents a phase delay from . If we combine two of the phase terms, we have . Hence, it can be understood as a phase deviation from the plane wave scenario.
-
The third phase term relates to the radius of curvature, which causes the wavefront to curve. is infinite at , meaning we have a plane wave. The curvature then decreases up to a minimum at . It then increases again to assimptoticaly reach for big . A good enough approximation is to use .
Using the Fresnel diffraction integral
Let us rewrite the Fresnel diffraction integral:
We can expand the exponent inside the integral and move some terms out, arriving at the following form:
The two approaches are named the Convolution and Fourier Transform forms of the Fresnel Diffraction integral. Although mathematically identical, they are numerically different. One must be careful to choose between them depending on the problem at hand.
Convolution form
- Impulse Response method
A convolution between two functions f and g is defined by
Equation (1) is nothing more than a convolution of with the following impulse response function :
Hence
- Transfer function method
Making use of the convolution theorem, it can be calculated in Fourier space as
where
Note that in computing equation (4), one should pre-computes and spares one calculation of the Fourier Transform.
Chirp functions
Equations (3) and (5) define what we call a chirp function. For the IR method, the chirp function is sampled in real space:
whereas for the SFT it is samples in reciprocal space:
This is the reason why they are numerically different. Correctly evaluation of the Fast Fourier transform requires proper sampling of the functions to avoid aliasing, according to the Nyquist sampling theorem. Now, the discrete Fourier Transform imposes that
where is the number of points in your matrix. In other words, for a given N there is an inverse relationship between the sampling in real-space and the sampling in reciprocal space. This indicates that, the best one chirp function is sampled, the worst the other one will be. This is as strong indication that one method might be superior to the other depending on the situation.
How to choose the best method?
Now, to the practical side of things. For selecting the best method for your application, one may use a rule of thumb in the table below (although for details I recommend taking a closer look at the references!). represents the size of the array and is width of the source field inside of it ().
Criteria | Sampling | Comment |
---|---|---|
TF: oversampled. IR: undersampled | IR method causes periodic copies. TF is the method of choice. Observation plane width will be limited to width | |
both TF and IR: critical sampling | TF and IR are identical | |
TF: undersampled. IR: oversampled | It depends. Needs to evaluate if source bandwidth satisfies . If yes, use TF. Otherwise, IR may be better. |
Fourier Transform form
- Single Fourier Transform
Equation (2), on the other hand, can be seen as a Fourier Transform:
Notice that it can also be rewritten using , such that
In contrast to equation (4), the above expression requires a single Fourier Transform computation.
Grid spacing
In practice, we will compute the propagation on, well, a computer. In other words, we shall use discrete matrices. An important difference between the Single Fourier Trasform and Convolution methods is that the latter always involve an inverse Fourier transform, whereas the first does not. This has consequences on the final grid spacing of the resulting matrix.
If you perform an inverse Fourier transform at the end, the pixel sizes in the input and in the output fields will be the same. No problem there. However, if only forward Fourier transform is applied, the pixel sizes relate according to:
- Fresnel propagator in 2 steps
One alternative to select the pixel size in the output plane is to perform the propagation in two steps. The idea is to first propagate the wavefront from to an intermediary plane , and then from to the final desired plane . We introduce a scaling parameter such that
It turns out that this scaling paremeter relates to the positions in the axis as
In that case, the wavefront is obtained via
But caution: the above expression presents an issue. It requires computing both chirp functions in real and reciprocal spaces. Therefore, unless working with critical sampling, the expression above will always be partly wrong due to undersampling of one of the two expressions.
Fresnel scaling theorem
A particularly common scenario is that of divergent or cone-beam sources. In these cases, the beam divergence is quite large for the lengths of interest, such that the expansion of the beam becomes relevant.
The Fresnel diffraction integral is, naturally, still valid. Nonetheless, if we are dealing with large propagation distances and a large divergence, a beam with only microns in size may expand hundreds of times in size. Therefore, it may be necessary to work with matrix with dozens of thousands of pixels, making you calculation infeasible due to computer memory limitations.
Fortunately, there exists a clever workaround in this case, called the Fresnel Scaling Theorem (FST). Essentially, the FST states that a divergent spherical wave has an equivalent plane-wave equivalent with scaled pixels and distances.
In the cone-beam case, the focus to sample distance and the sample to output plane distance define the magnification of the system:
This quantity is used then to convert from the cone to the equivalent parallel beam geometry. Note that we recover the parallel case () when .
Equivalent geometry #1
The most well know geometry puts the reference on the pixel size of the input plane. In this case, the distance between input and output planes and the output pixel size are downscaled according to
and
Equivalent geometry #2
A less well-known equivalent geometry [6] puts instead the reference on te pixel size of the output plane. In that case, an upscaling happens:
and
Which geometry to use will depend on your specific application, whichever make your calculations easier. For a particuarly nice implementation of geometry #2, see reference [6].
data:image/s3,"s3://crabby-images/e3e65/e3e65d1ef9b864372861e14a2fa09c1c28d838e9" alt="fstgeometries"
Validity of the FST
I would like to emphasize here that the Fresnel Scaling theorem assumes a point source, i.e., spherical waves. Therefore, to apply the FST the waves reaching our object in the input plane must be spherical. In many practical situations that may not be the case.
Referring back to the Gaussian beam equations, the phase is not yet spherical within the Rayleigh length. A minimum distance from the beam waist is required for application of the FST, since only then the phase wavefront can be approximatted by a spherical wavefront again [7].
References
- [1] Saleh, B. E. A., & Teich, M. C. (1991). Fundamentals of Photonics. Wiley. https://doi.org/10.1002/0471213748
- [2] Paganin, David, Coherent X-Ray Optics, Oxford Series on Synchrotron Radiation (2006), https://doi.org/10.1093/acprof:oso/9780198567288.001.0001
- [3] Goodman, J. W., & Cox, M. E. (1969). Introduction to Fourier Optics.
- [4] Voelz, D. G. (2011). Computational Fourier Optics: A MATLAB Tutorial. SPIE. https://doi.org/10.1117/3.858456
- [5] Schmidt, J. D. (2010). Numerical Simulation of Optical Wave Propagation with Examples in MATLAB. SPIE. https://doi.org/10.1117/3.866274
- [6] Hu Ziyang, et. al. (2023) “Near-Field Multi-Slice Ptychography: Quantitative Phase Imaging of Optically Thick Samples with Visible Light and X-Rays.”. https://doi.org/10.1364/OE.487002.
- [7] Robisch, Anna-Lena. “Phase Retrieval for Object and Probe in the Optical Near-Field.” Universitätsverlag Göttingen, 2016.