Spatio-temporal characteristics of light and how to model them Spatio-temporal distortions Ray matrices The Gaussian beam Complex q and its propagation Ray-pulse “Kostenbauder” matrices The prism pulse compressor Optical system ↔ 4x4 Ray-pulse matrix ⎡ xin ⎤ ⎢θ ⎥ ⎢ in ⎥ ⎢ tin ⎥ ⎢ ⎥ ⎢⎣ν in ⎥⎦ ⎡A ⎢C ⎢ ⎢G ⎢ ⎣0 B 0 D 0 H 1 0 0 E⎤ F ⎥⎥ I⎥ ⎥ 1⎦ Gaussian beam in space and time and the complex Q matrix ⎡ xout ⎤ ⎢θ ⎥ ⎢ out ⎥ ⎢ tout ⎥ ⎢ ⎥ ⎣⎢ν out ⎦⎥ Spatio-temporal distortions Ordinarily, we assume that the pulse-field spatial and temporal factors (or their Fourier-domain equivalents) separate: E ( x, y, z , t ) = Exyz ( x, y, z ) Et (t ) Eˆ% (k x , k y , k z , ω ) = Eˆ xyz (k x , k y , k z ) E% t (ω ) where the tilde and hat mean FTs with respect to t and x, y, z In pulses with spatio-temporal distortions, they don’t: We’ll use dt0 coordinates xt : E ( x, y, z, t ) → E[ x, y, z, t + ( x − x0 )] such as x – x0, dx that are (small) dx0 deviations from % % xω : E ( x, y, z , ω ) → E[ x + (ω − ω0 ), y, z, ω ] their means. dω dk x 0 ˆ% ˆ% kω : E (k x , k y , k z , ω ) → E0 [k x + (ω − ω0 ), k y , k z , ω ] dω Spatio-temporal distortions can be useful or inconvenient. Good: They allow pulse compression. They help to measure pulses (tilted pulse fronts). They allow pulse shaping. They can increase bandwidth in nonlinear-optical processes. Bad: They usually increase the pulse length. They reduce intensity. They can be hard to measure. Angular dispersion is an example of a spatio-temporal distortion. dk x 0 dω In the presence of angular dispersion, the off-axis k-vector component kx depends on ω: dk x 0 ˆ ˆ % % (ω − ω0 ), k y , k z , ω ] E (k x , k y , k z , ω ) = E0 [k x + dω where kx0(ω) is the mean kx vs. frequency ω. x z Input pulse Prism Angularly dispersed output pulse Note that the mean off-axis k-vector component, kx0, depends on ω. Spatial chirp is a spatio-temporal distortion in which the color varies spatially across the beam. Propagation through a prism pair produces a beam with no angular dispersion, but with spatial dispersion, often called spatial chirp. Prism pair Input pulse Spatially chirped output pulse Prism pairs are inside nearly every ultrafast laser. A third and fourth prism undo this distortion, but must be aligned carefully. Spatial chirp is difficult to avoid. Simply propagating through a tilted window causes spatial chirp! Tilted window Input pulse Spatially chirped output pulse Because ultrashort pulses are so broadband, this distortion is very noticeable—and often problematic! How to think about ∂x0 spatial chirp ∂ω x x0(ω9) x0(ω8) x0(ω7) x0(ω6) x0(ω5) x0(ω4) x0(ω3) x0(ω2) x0(ω1) z Suppose we send the pulse through a set of monochromatic filters and find the beam center position, x0, for each frequency, ω. E% ( x, y, z, ω ) → dx0 % E[ x + (ω − ω0 ), y, z , ω ] dω where x0 is the center of the beam component of frequency ω. Pulse-front tilt is another common spatio-temporal distortion. Phase fronts are perpendicular to the direction of propagation. Because the group velocity is usually less than phase velocity, pulse fronts tilt when light traverses a prism. Undistorted input pulse Angularly dispersed pulse with pulse-front tilt Prism Angular dispersion causes pulse-front tilt. Angular dispersion causes pulse-front tilt even when group velocity is not involved. Diffraction gratings also yield pulse-front tilt. Angularly dispersed pulse with pulsefront tilt Undistorted input pulse Diffraction grating The path is simply shorter for rays that impinge on the near side of the grating. Of course, angular dispersion and spatial chirp occur, too. Gratings have about ten times the dispersion of prisms, and they yield about ten times the tilt. Modeling pulse-front tilt dt0 dx Pulse-front tilt involves coupling between the space and time domains: dt0 E ( x, y, z, t ) → E[ x, y, z, t + ( x − x0 )] dx For a given transverse position in the beam, x, the pulse mean time, t0, varies in the presence of pulse-front tilt. Pulse-front tilt occurs after pulse compressors that aren’t aligned properly. Angular dispersion always causes pulsefront tilt! Angular dispersion means that the off-axis k-vector depends on ω: ˆ ˆ E% (k x , k y , k z , ω ) = E% 0 [k x + γ (ω − ω0 ), k y , k z , ω ] where γ = dkx0 /dω Inverse Fourier-transforming with respect to kx, ky, and kz yields: i γ (ω −ω0 ) x % % E ( x, y, z , ω ) = E0 ( x, y, z , ω ) e using the shift theorem Inverse Fourier-transforming with respect to ω−ω0 yields: ⇒ E ( x, y, z , t ) = E0 ( x, y, z , t + γ x) using the shift theorem again which is just pulse-front tilt! The combination of spatial and temporal chirp also causes pulse-front tilt. The theorem we just proved assumed no spatial chirp, however. So it neglects another contribution to the pulse-front tilt. Dispersive medium Spatially chirped input pulse Spatially chirped pulse with pulse-front tilt, but no angular dispersion vg(red) > vg(blue) The total pulse-front tilt is the sum of that due to dispersion and that due to this effect. Xun Gu, Selcuk Akturk, and Erik Zeek A pulse with temporal chirp, spatial chirp, and pulse-front tilt. Suppressing the y-dependence, we can plot such a pulse: 4.5 mrad ψ = 11.3 mrad 797 803 nm nm where the pulsefront tilt angle is: ∂t 0 ψ ≡c ∂x xx [[m mm m] ] ] tt [[ffss] 775 nm 777 nm We’ll need a nice formalism for calculating these distortions! Ray Optics axis We'll define light rays as directions in space, corresponding, roughly, to k-vectors of light waves. Each optical system will have an axis, and all light rays will be assumed to propagate at small angles to it. This is called the Paraxial Approximation. The Optic Axis A mirror deflects the optic axis into a new direction. This “ring laser” has an optic axis that scans out a rectangle. Optic axis A ray propagating through this system We define all rays relative to the relevant optic axis. The Ray Vector xin, θin xout, θout A light ray can be defined by two co-ordinates: its position, x ray l a c i t op its slope, θ θ x Optical axis These parameters define a ray vector, which will change with distance and as the ray propagates through optics. ⎡x⎤ ⎢θ ⎥ ⎣ ⎦ Ray Matrices For many optical components, we can define 2 x 2 ray matrices. An element’s effect on a ray is found by multiplying its ray vector. Ray matrices can describe simple and complex systems. Optical system ↔ 2 x 2 Ray matrix ⎡ xin ⎤ ⎢θ ⎥ ⎣ in ⎦ ⎡A ⎢C ⎣ B⎤ D ⎥⎦ ⎡ xout ⎤ ⎢θ ⎥ ⎣ out ⎦ These matrices are often called "ABCD Matrices." Ray matrices as derivatives Since the displacements and angles are assumed to be small, we can think in terms of partial derivatives. spatial magnification ∂xout ∂xin xout ∂xout ∂xout = xin + θin ∂xin ∂θin θout ∂θout ∂θout = xin + θin ∂xin ∂θin ∂xout ∂θin ⎡ xout ⎤ ⎡ A B ⎤ ⎡ xin ⎤ ⎢θ ⎥ = ⎢C D ⎥ ⎢θ ⎥ ⎦ ⎣ in ⎦ ⎣ out ⎦ ⎣ ∂θout ∂xin ∂θout ∂θin angular magnification We can write these equations in matrix form. For cascaded elements, we simply multiply ray matrices. ⎡ xin ⎤ ⎢θ ⎥ ⎣ in ⎦ O2 O1 ⎡ xout ⎤ ⎪⎧ ⎢θ ⎥ = O3 ⎨O2 ⎣ out ⎦ ⎪⎩ ⎛ ⎜ O1 ⎝ O3 ⎡ xout ⎤ ⎢θ ⎥ ⎣ out ⎦ ⎡ xin ⎤ ⎞ ⎪⎫ ⎡ xin ⎤ ⎢θ ⎥ ⎟ ⎬ = O3 O2 O1 ⎢θ ⎥ ⎣ in ⎦ ⎠ ⎪⎭ ⎣ in ⎦ Notice that the order looks opposite to what it should be, but it makes sense when you think about it. Ray matrix for free space or a medium If xin and θin are the position and slope upon entering, let xout and θout be the position and slope after propagating from z = 0 to z. xout θout xin, θin xout = xin + z θin θ out = θin Rewriting these expressions in matrix notation: z z=0 Ospace ⎡1 z ⎤ = ⎢ ⎥ 0 1 ⎣ ⎦ ⎡xout ⎤ ⎡1 z⎤ ⎡xin ⎤ ⎢θ ⎥ = ⎢0 1⎥ ⎢θ ⎥ ⎣ ⎦ ⎣ in ⎦ ⎣ out ⎦ Ray Matrix for an Interface At the interface, clearly: θin xout = xin. xin xout n1 Now calculate θout. Snell's Law says: θout n2 n1 sin(θin) = n2 sin(θout) which becomes for small angles: n1 θin = n2 θout ⇒ θout = [n1 / n2] θin Ointerface 0 ⎤ ⎡1 =⎢ ⎥ n n 0 / ⎣ 1 2⎦ Ray matrix for a curved interface At the interface, again: xout = xin. To calculate θout, we must calculate θ1 and θ2. If θs is the surface-normal slope at the height xin, then θs θin R θ2 θ1 xin = xout n1 n2 θout θs z θ1 = θin+ θs and θ2 = θout+ θs If R is the surface radius of curvature, the surface z coordinate will be: z = R − R 2 − xin2 = R − R 1 − ( xin / R) 2 ≈ R − R ⎡⎣1 − 12 ( xin / R) 2 ⎤⎦ = 12 ( xin2 / R) dz ⇒ θs ≈ ≈ xin / R dxin Ray matrix for a curved interface (cont’d) θ1 = θin+ xin / R and θ2 = θout+ xin / R Snell's Law: n1 θ1 = n2 θ2 θ1 θ2 ⇒ n1 (θin + xin / R) ≈ n2 (θ out + xin / R) ⇒ θ out ≈ (n1 / n2 )(θin + xin / R) − xin / R n1 ⇒ θ out ≈ (n1 / n2 )θin + (n1 / n2 − 1) xin / R Ocurved interface 1 0 ⎤ ⎡ =⎢ ⎥ n n R n n − ( / 1) / / ⎣ 1 2 1 2⎦ Now the output angle depends on the input position, too. n2 A thin lens is just two curved interfaces. We’ll neglect the glass in between (it’s a really thin lens!), and we’ll take n1 = 1. Ocurved interface 1 0 ⎤ ⎡ =⎢ ⎥ ( / 1) / / n n − R n n 1 2⎦ ⎣ 1 2 R1 n=1 R2 n≠1 n=1 1 0⎤ ⎡ 1 0 ⎤ ⎡ Othin lens = Ocurved Ocurved = ⎢ ⎥ ⎢ (1/ n − 1) / R 1/ n ⎥ ( 1) / − n R n interface 2 interface 1 2 1 ⎣ ⎦⎣ ⎦ 1 0 ⎤ ⎡ 1 0⎤ ⎡ =⎢ =⎢ ⎥ ⎥ ( n 1) / R n (1/ n 1) / R n (1/ n ) ( n 1) / R (1 n ) / R 1 − + − − + − ⎣ 2 1 ⎦ ⎣ 2 1 ⎦ 1 0⎤ ⎡ =⎢ ⎥ ( n 1)(1/ R 1/ R ) 1 − − ⎣ 2 1 ⎦ This can be written: where: 1/ f = (n − 1)(1/ R1 − 1/ R2 ) ⎡ 1 ⎢ −1/ f ⎣ 0⎤ 1 ⎥⎦ The Lens-Maker’s Formula Ray matrix for a lens Olens 1/ f = (n − 1)(1/ R1 − 1/ R2 ) ⎡ 1 = ⎢ ⎣-1/f 0⎤ ⎥ 1⎦ The quantity, f, is the focal length of the lens. It’s the single most important parameter of a lens. It can be positive or negative. In a homework problem, you’ll extend the Lens Maker’s Formula to lenses of greater thickness. R1 > 0 R2 < 0 f>0 If f > 0, the lens deflects rays toward the axis. R1 < 0 R2 > 0 f<0 If f < 0, the lens deflects rays away from the axis. A lens focuses parallel rays to a point one focal length away. For all rays A lens followed by propagation by one focal length: xout = 0! f ⎤⎡xin ⎤ ⎡ 0 ⎤ ⎡xout ⎤ ⎡1 f ⎤⎡ 1 0⎤⎡xin ⎤ ⎡ 0 ⎢θ ⎥ = ⎢0 1⎥⎢−1/ f 1⎥⎢ 0 ⎥ = ⎢−1/ f 1⎥⎢ 0 ⎥ = ⎢−x / f ⎥ ⎦⎣ ⎦⎣ ⎦ ⎣ ⎦⎣ ⎦ ⎣ in ⎦ ⎣ out ⎦ ⎣ f f Assume all input rays have θin = 0 At the focal plane, all rays converge to the z axis (xout = 0) independent of input position. Parallel rays at a different angle focus at a different xout. Types of lenses Lens nomenclature Which type of lens to use (and how to orient it) depends on the aberrations and application. Ray Matrix for a Curved Mirror Consider a mirror with radius of curvature, R, with its optic axis perpendicular to the mirror: θ1 = θin − θ s θ s ≈ xin / R R θout θs θin θ out = θ1 − θ s = (θin − θ s ) − θ s ≈ θin − 2 xin / R θ1 θ1 xin = xout z 0⎤ ⎡ 1 ⇒ Omirror = ⎢ ⎥ − 2 / 1 R ⎣ ⎦ Like a lens, a curved mirror will focus a beam. Its focal length is R/2. Note that a flat mirror has R = ∞ and hence an identity ray matrix. Laser Cavities Mirror curvatures matter in lasers. Two flat mirrors, the “flat-flat” laser cavity, is difficult to align and maintain aligned. Two concave curved mirrors, the “stable” laser cavity, is easy to align and maintain aligned. Two convex mirrors, the “unstable” laser cavity, is impossible to align! A system images an object when B = 0. When B = 0, all rays from a point xin arrive at a point xout, independent of angle. xout = A xin A is the magnification. The Lens Law From the object to the image, we have: 1) A distance d0 2) A lens of focal length f 3) A distance di ⎡1 di ⎤ ⎡ 1 O=⎢ ⎥ ⎢ −1/ f 0 1 ⎣ ⎦⎣ ⎡1 d i ⎤ ⎡ 1 =⎢ ⎥ ⎢ −1/ f 0 1 ⎣ ⎦⎣ B ⎡ A =⎢ ⎣ −1/ f 1 − d 0 / 0 ⎤ ⎡1 d 0 ⎤ 1 ⎥⎦ ⎢⎣ 0 1 ⎥⎦ d0 ⎤ 1 − d 0 / f ⎥⎦ ⎤ f ⎥⎦ B = d 0 + di − d 0 di / f = d 0 di [1/ d 0 + 1/ di − 1/ f ] = 0 if 1 1 1 + = d 0 di f ⎡1 1⎤ A = 1 − di / f = 1 − di ⎢ + ⎥ ⎣ d 0 di ⎦ d ⇒ Mag = − i d0 Lenses can also map angle to position. From the object to the image, we have: 1) A distance f 2) A lens of focal length f 3) A distance f 0⎤ ⎡1 f ⎤ ⎡xin ⎤ ⎡xout ⎤ ⎡1 f ⎤ ⎡ 1 ⎢θ ⎥ = ⎢0 1 ⎥ ⎢−1/ f 1⎥ ⎢0 1 ⎥ ⎢θ ⎥ ⎦⎣ ⎦⎣ ⎦ ⎣ in ⎦ ⎣ out ⎦ ⎣ f ⎤ ⎡xin ⎤ ⎡1 f ⎤ ⎡ 1 =⎢ ⎥ ⎢−1/ f 1 ⎥ ⎢θ ⎥ 0 1 ⎣ ⎦⎣ ⎦ ⎣ in ⎦ f ⎤ ⎡xin ⎤ ⎡ f θin ⎤ ⎡ 0 =⎢ =⎢ ⎢ ⎥ ⎥ ⎥ x f θ / − f 1/ 0 − ⎣ ⎦ ⎣ in ⎦ ⎣ in ⎦ So xout ∝ θ in And this arrangement maps position to angle: θ out ∝ xin If an optical system lacks cylindrical symmetry, we must analyze its x- and ydirections separately: Cylindrical lenses A "spherical lens" focuses in both transverse directions. A "cylindrical lens" focuses in only one transverse direction. When using cylindrical lenses, we must perform two separate ray-matrix analyses, one for each transverse direction. Large-angle reflection off a curved mirror also destroys cylindrical symmetry. The optic axis makes a large angle with the mirror normal, and rays make an angle with respect to it. Optic axis before reflection tangential ray Optic axis after reflection Rays that deviate from the optic axis in the plane of incidence are called "tangential.” Rays that deviate from the optic axis ⊥ to the plane of incidence are called "sagittal.“ (We need a 3D display to show one of these.) Ray Matrix for Off-Axis Reflection from a Curved Mirror If the beam is incident at a large angle, θ, on a mirror with radius of curvature, R: tangential ray Optic axis θ ⇔ R where Re = R cosθ for tangential rays and Re = R / cosθ for sagittal rays ⎡ 1 ⎢ −2 / R e ⎣ 0⎤ 1 ⎥⎦ But lasers are Gaussian beams, not rays. Real laser beams are localized in space at the laser and hence must diffract as they propagate away from the laser. The beam has a waist at z = 0, where the spot size is w0. It then expands to w = w(z) with distance z away from the laser. The beam radius of curvature, R(z), also increases with distance far away. Gaussian beam math The expression for a real laser beam's electric field is given by: E ( x, y , z ) ∝ % exp [ −ikz − iψ ( z ) ] w( z ) ⎡ x2 + y 2 π x2 + y 2 ⎤ exp ⎢ − 2 −i ⎥ λ w ( z ) R ( z ) ⎣ ⎦ where: w(z) is the spot size vs. distance from the waist, R(z) is the beam radius of curvature, and ψ(z) is a phase shift. This equation is the solution to the wave equation when we require that the beam be well localized at some point (i.e., its waist). Gaussian beam spot size, radius, and phase The expressions for the spot size, radius of curvature, and phase shift: w( z ) = w0 1 + ( z / z R ) 2 R ( z ) = z + z R2 / z ψ ( z ) = arctan( z / z R ) where zR is the Rayleigh Range (the distance over which the beam remains about the same diameter), and it's given by: z R ≡ π w0 / λ 2 Gaussian beam collimation Twice the Rayleigh range is the distance over which the beam remains about the same size, that is, remains “collimated.” 2 z R = 2π w02 / λ Collimation Collimation Waist spot Distance Distance λ = 10.6 µm λ = 0.633 µm size w0 _____________________________________________ .225 cm 0.003 km 0.045 km 2.25 cm 0.3 km 5 km Longer wavelengths expand faster than shorter ones. 22.5 cm 30 km 500 km _____________________________________________ Tightly focused laser beams expand quickly. Weakly focused beams expand less quickly, but still expand. As a result, it's very difficult to shoot down a missile with a laser. Gaussian beam divergence Far away from the waist, the spot size of a Gaussian beam will be: w( z ) = w0 1 + ( z / z R ) ≈ w0 2 ( z / zR ) 2 = w0 z / z R The beam 1/e divergence half angle is then w(z) / z as z → ∞ : θ1/ e ⇒ w0 z w0 w0 = = = z R z z R π w02 / λ θ1/ e = λ / (π w0 ) The smaller the waist and the larger the wavelength, the larger the divergence angle. Focusing a Gaussian beam winput f wfocus f A lens will focus a collimated Gaussian beam to a new spot size: wfocus ≈ λ f / πwinput So the smaller the desired focus, the BIGGER the input beam should be! The Guoy phase shift The phase factor yields a phase shift relative to the phase of a plane wave when a Gaussian beam goes through a focus. ⎧ π / 2 when z → +∞ ψ ( z ) = arctan( z / z R ) → ⎨ ⎩−π / 2 when z → −∞ ψ(z) Phase relative to a plane wave: π/2 -zR zR −π/2 Recall the i in front of the Fresnel integral, which is a result of the Guoy phase shift. The Gaussian-beam complex-q parameter ⎡ π x2 + y2 ⎤ ⎡ x2 + y2 ⎤ E ( x, y, z ) ∝ exp ⎢ −i exp ⎢ − 2 ⎥ ⎥ λ R z w z ( ) ( ) ⎣ ⎦ ⎣ ⎦ We can combine these two factors (they’re both Gaussians): ⎡ π x2 + y 2 ⎤ E ( x, y, z ) ∝ exp ⎢ −i ⎥ ( ) λ q z ⎣ ⎦ where: 1 1 λ ≡ −i q( z ) R ( z ) π w2 ( z ) q completely determines the Gaussian beam. Ray matrices and the propagation of q We’d like to be able to follow Gaussian beams through optical systems. Remarkably, ray matrices can be used to propagate the q-parameter. ⎡A B⎤ O=⎢ ⎥ C D ⎣ ⎦ Optical system This relation holds for all systems for which ray matrices hold: qout Aqin + B = Cqin + D Just multiply all the matrices first and use this result to obtain qout for the relevant qin! Important point about propagating q Use qout Aqin + B = Cqin + D to compute qout. But use matrix multiplication for the various components to compute the total system ray matrix. Don’t compute qout Aqin + B = Cqin + D for each component. You’d get the right answer, but you’d work much harder than you need to! Propagating q: an example qout Aqin + B = Cqin + D Free-space propagation through a distance z: The ray matrix for free-space propagation is: Ospace 1q(0) + z = q(0) + z Then: q(z) = 0 q(0) + 1 ⎡1 z ⎤ =⎢ ⎥ 0 1 ⎣ ⎦ Propagating q: an example (cont’d) Does q(z) = q0 + z? LHS: This is equivalent to: 1/q(z) = 1/(q0 + z). 1 1 λ ≡ − i 2 πw (z) R(z) q(z) 1 1 1 1 1 ≡ − i = − i 2 2 2 2 2 q(z) z + zR / z zR (1+ z / zR ) z + zR / z zR + z /zR Now: RHS: π w20 q(0) = i = i zR λ so q(0) + z = izR + z 1 z − iz z 1 iz R = = 2 R2 = 2 − q(0) + z z + zR z + z2R z 2 + zR2 z + izR = 1 1 − i z + z2R / z z2 / zR + zR 1 = q(z) So: which is just this. q(z) = q(0) + z Propagating q: another example winput Focusing a collimated beam (i.e., a lens, f, followed by a distance, f ): wfocus f f A collimated beam has a big spot size (w) and Rayleigh range (zR), and an infinite radius of curvature (R), so: qin = i zR OspaceOlens ⎡1 =⎢ ⎣0 Cqin + D 1 = qout Aqin + B So: [ Im 1/ q focus But: [ ] Im 1/ q focus ≡ − ⇒ ] f ⎤⎡ 1 1 ⎥⎦ ⎢⎣ −1/ f 1 qafter lens = 0⎤ ⎡ 0 =⎢ ⎥ 1 ⎦ ⎣ −1/ f f⎤ 1 ⎥⎦ (−1/ f )(iz R ) + 1 1 − iz R / f = 0 (izR ) + f f π winput zR = − 2 =− f λf 2 2 λ πw 2 focus ⇒ w focus = λf π winput The well-known result for the focusing of a Gaussian beam Now consider the time and frequency of a light pulse in addition We’d like a matrix formalism to predict such effects as the: group-delay dispersion ∂t/∂ω angular dispersion ∂kx /∂ω or ∂θ /∂ω spatial chirp ∂x/∂ω pulse-front tilt ∂t/∂x time vs. angle ∂t/∂θ. This pulse has all of these distortions! where we’ve dropped “0” subscripts for simplicity. We’ll need to consider, not only the position (x) and slope (θ ) of the ray, but also the time (t) and frequency (ω ) of the pulse. Propagation in space and time: Ray-pulse “Kostenbauder” matrices Kostenbauder matrices are 4x4 matrices that multiply 4-vectors comprising the position, slope, time (group delay), and frequency. Optical system ↔ 4x4 Ray-pulse matrix ⎡ xin ⎤ ⎢θ ⎥ ⎢ in ⎥ ⎢ tin ⎥ ⎢ ⎥ ⎣⎢ν in ⎦⎥ ⎡A B 0 E⎤ ⎢C D 0 F ⎥ ⎢ ⎥ ⎢G H 1 I ⎥ ⎢ ⎥ ⎣0 0 0 1⎦ ⎡ xout ⎤ ⎢θ ⎥ ⎢ out ⎥ ⎢ tout ⎥ ⎢ ⎥ ⎣⎢ν out ⎦⎥ where each vector component corresponds to the deviation from a mean value for the ray or pulse. A Kostenbauder matrix requires five additional parameters, E, F, G, H, I. Kostenbauder matrix elements As with 2x2 ray matrices, consider each element to correspond to a small deviation from its mean value (xin = x – x0 ). So we can think in terms of partial derivatives. the usual 2x2 ray matrix angular dispersion ⎡ xout ⎤ ⎡A ⎢θ ⎥ ⎢C ⎢ out ⎥ = ⎢ ⎢ tout ⎥ ⎢G ⎢ ⎥ ⎢ ⎢⎣ν out ⎥⎦ ⎣0 pulsefront tilt ∂tout ∂xin B D H 0 ∂θ out ∂ν in E ⎤ ⎡ xin ⎤ ∂xout ∂ν in ⎢ ⎥ ⎥ 0 F ⎥ ⎢θ in ⎥ 1 I ⎥ ⎢ tin ⎥ ⎥⎢ ⎥ 0 1 ⎦ ⎢⎣ν in ⎥⎦ 0 time vs. angle ∂tout ∂θ in spatial chirp GDD ∂tout ∂ν in Some Kostenbauder matrix elements are always zero or one. ⎡ xout ⎤ ⎡A B ⎢θ ⎥ ⎢C D out ⎢ ⎥ = ⎢ ⎢ tout ⎥ ⎢G H ⎢ ⎥ ⎢ ⎢⎣ν out ⎥⎦ ⎣0 0 0 E ⎤ ⎡ xin ⎤ 0 F ⎥⎥ ⎢⎢θ in ⎥⎥ 1 I ⎥ ⎢ tin ⎥ ⎥⎢ ⎥ 0 1 ⎦ ⎢⎣ν in ⎥⎦ Kostenbauder matrix for propagation through free space or material The ABCD elements are always the same as the ray matrix. Here, the only other interesting element is the GDD: So: ^ material ⎡1 L / n ⎢0 1 ⎢ = ⎢0 0 ⎢ 0 ⎣0 ⎤ ⎥ ⎥ 1 2π Lk ′′⎥ ⎥ 0 1 ⎦ 0 0 0 0 I = ∂tout/∂νin The 2π is due to the definition of K-matrices in terms of ν, not ω. where L is the thickness of the medium, n is its refractive index, and k” is the GVD: d 2k k ′′ ≡ = 2 dω ω 0 λ3 d 2n 2πc 2 dλ 2 Example: Using the Kostenbauder matrix for propagation through free space Apply the free-space propagation matrix to an input vector: ⎡1 ⎢0 ⎢ ⎢0 ⎢ ⎣0 ⎤ ⎡ xin ⎤ ⎡ xin + Lθ in ⎤ ⎥ ⎥ ⎢θ ⎥ ⎢ θ in ⎥ ⎥ ⎢ in ⎥ = ⎢ 0 1 2π Lk ′′⎥ ⎢ tin ⎥ ⎢tin + 2π Lk ′′ν in ⎥ ⎥ ⎥⎢ ⎥ ⎢ ν in 0 0 1 ⎦ ⎢⎣ν in ⎥⎦ ⎢⎣ ⎥⎦ L 0 1 0 0 0 The position varies in the usual way, and the beam angle remains the same. The group delay increases by k”Lωin The frequency remains the same. Because the group delay depends on frequency, the pulse broadens. This approach works in much more complex situations, too. Kostenbauder matrix for a lens The ABCD elements are always the same as the ray matrix. Everything else is a zero or one. So: ^ lens ⎡ 1 ⎢ −1/ f = ⎢ ⎢ 0 ⎢ ⎣ 0 0 1 0 0 0 0 1 0 0⎤ 0 ⎥⎥ 0⎥ ⎥ 1⎦ where f is the lens focal length. The same holds for a curved mirror, as with ray matrices. While chromatic aberrations can be modeled using a wavelengthdependent focal length, other lens imperfections cannot be modeled using Kostenbauder matrices. Kostenbauder matrix for a diffraction grating Gratings introduce magnification, angular dispersion and pulse-front tilt: So: angular magnification spatial magnification cos( β ′) ⎡ − ⎢ cos( β ) ⎢ ⎢ 0 ⎢ ^ grating = ⎢ ⎢ sin( β ) − sin( β ′) ⎢ c cos( β ) pulse⎢ front ⎢ 0 ⎣ tilt 0 − cos( β ) cos( β ′) 0 0 0 1 0 0 no spatial chirp (yet) ⎤ ⎥ ⎥ λ[sin( β ') − sin( β )] ⎥ ⎥ c cos( β ') ⎥ ⎥ 0 ⎥ ⎥ ⎥ 1 ⎦ 0 time is independent of angle angular dispersion no GDD (yet) where β is the incidence angle, and β’ is the diffraction angle. The zero elements (E, H, I) will become nonzero when propagation follows. Kostenbauder matrix for a general prism ϑin ϑout L ψ in ψ out angular dispersion angular magnification ^ = spatial magnification ⎡ ⎢ ⎢ ⎢ −2π ⎢ ⎢⎣ min mout Lmout / nmin 0 1/ min mout dn dω min (tanψ in + tanψ out ) / λ0 0 −2π dn dω L tanψ out / nmin λ0 0 pulse-front tilt d 2k where k ′′ ≡ dω 2 All new elements are nonzero. ω0 Lmout tanψ in / n ⎤ ⎥ 0 −2π ddnω (tanψ in + tanψ out ) / mout ⎥ 2 dn 2 1 4π ( dω ) L tanψ in tanψ out / nλ0 + 2π Lk ′′⎥ ⎥ 0 1 ⎥⎦ 0 time vs. angle is the GVD, min ≡ −2π spatial chirp dn dω GDD cosψ in cos ϑin and mout cos ϑout ≡ cosψ out Kostenbauder matrix for a Brewster prism Brewster angle incidence and exit If the beam passes through the apex of the prism: ψ in = ψ out ≡ ψ min = 1/ mout L→0 (this simplifies the calculation a lot!) ^ prism where W = − 4π (dn /d ω ) tanψ / mout = −4π min (dn /d ω ) tanψ 0 1 0 0 0 0 ⎤ 0 ±W ⎥⎥ 1 0 ⎥ ⎥ 0 1 ⎦ Use + if the prism is oriented as above; use – if it’s inverted. ⎡ ⎢ = ⎢ ⎢ ±W / λ0 ⎢ ⎣ 0 1 0 Just angular dispersion and pulse-front tilt. No GDD etc. Using the Kostenbauder matrix for a Brewster prism Brewster angle incidence and exit This matrix takes into account all that we need to know for pulse compression. ⎡ 1 ⎢ 0 ⎢ ⎢ ±W / λ0 ⎢ ⎣ 0 xin 0 0 0 ⎤ ⎡ xin ⎤ ⎡ ⎤ 1 0 ±W ⎥⎥ ⎢⎢θin ⎥⎥ ⎢⎢ θ in ± W ν in ⎥⎥ = 0 1 0 ⎥ ⎢ tin ⎥ ⎢tin ± xinW / λ0 ⎥ ⎥ ⎥⎢ ⎥ ⎢ ν in 0 0 1 ⎦ ⎣ν in ⎦ ⎣ ⎦ When the pulse reaches the two inverted prisms, this effect becomes very important, yielding longer group delay for longer wavelengths (D < 0; and use the minus sign for inverted prisms). Dispersion changes the beam angle. Pulse-front tilt yields GDD. Modeling a prism pulse compressor using Kostenbauder matrices Use only Brewster prisms 1 ^prism 2 ^air 7 ^prism ^prism 6 ^prism ^air 3 ^air 4 5 K = ^7 ^6 ^5 ^4 ^3 ^2 ^1 Free space propagation in a pulse compressor There are three distances in this problem. L1 L3 L2 ^ free − space ⎡1 Li ⎢0 1 =⎢ ⎢0 0 ⎢ ⎣0 0 0 0 1 0 0⎤ 0 ⎥⎥ 0⎥ ⎥ 1⎦ n = 1 in free space K-matrix for a prism pulse compressor K = ^7 ^6 ^5 ^4 ^3 ^2 ^1 Spatial chirp unless L1 = L3. ^ = L1 + L2 + L3 ⎡1 ⎢0 1 ⎢ ⎢ W ( L1 − L3 ) ⎢0 − λ0 ⎢ ⎢⎣0 0 Time vs. angle unless L1 = L3. W ( L1 − L3 ) ⎤ ⎥ 0 0 ⎥ ⎥ W 2 1 − ( L1 + L3 ) ⎥ λ0 ⎥ ⎥⎦ 0 1 Negative GDD! 0 The GDD is negative and can be tuned by changing the amount of extra glass in the beam (which we haven’t included yet, but which is easy). Propagating spot size, radius of curvature, pulse length, and chirp To follow beams that are Gaussian in both space and time: Qout Qin We could propagate Gaussian beams in space because they’re quadratic in space (x and y): E ( x, y ) ∝ exp[ −(1/ w2 + iπ / λ R) ( x 2 + y 2 )] A Gaussian pulse is quadratic in time. And the real and imaginary parts also have important meanings (pulse length and chirp): E(t ) ∝ exp[−(1/τ G + i β )t ] 2 2 The complex-Q matrix We define the complex Q-matrix so that the space and time dependence of the pulse can be written: ⎧ π E ( x, t ) ∝ exp ⎨−i [ x, −t ] Q −1 ⎩ λ ⎡Q11−1 + Q12−1 ⎤ ⎡ x ⎤ ⎫⎪ ⎡ x⎤⎫ ⎪⎧ π ⎢ t ⎥ ⎬ = exp ⎨−i λ [ x, −t ] ⎢ −1 −1 ⎥ ⎢ ⎥ ⎬ ⎣ ⎦⎭ ⎪⎩ ⎣Q21 + Q22 ⎦ ⎣ t ⎦ ⎪⎭ ⎧⎪ π ⎡Q11−1 x + Q12−1t ⎤ ⎫⎪ = exp ⎨−i [ x, −t ] ⎢ −1 −1 ⎥ ⎬ λ + Q x Q ⎪ 22 t ⎦ ⎭ ⎣ 21 ⎩⎪ −1 −1 Note : Q21 = − Q12 ⎧ π −1 2 ⎫ = exp ⎨−i [Q11 x + 2Q12−1 xt − Q22−1t 2 ]⎬ ⎩ λ ⎭ These complex matrix elements contain all the parameters of beams/pulses that are Gaussian in space and time. The complex-Q matrix (cont’d) −1 When the off-diagonal elements, Q12 , are zero: ⎧ π −1 2 ⎫ E ( x, t ) ∝ exp ⎨−i [Q11 x − Q22−1t 2 ]⎬ ⎩ λ ⎭ Q11−1 ≡ 1/ qx = −1 22 Q 1 λ −i π w2 ( z ) R( z ) λ ≡ 1/ qt ≡ ( β + i /τ G2 ) π spatial complex-q parameter for Gaussian beams temporal complex-q parameter: the pulse length and chirp parameter for Gaussian pulses When the off-diagonal components are not zero, there is pulse-front tilt: Pulse-front tilt = Im ⎡⎣Q12−1 ⎤⎦ off-diagonal Q-1 element Im ⎡⎣Q22−1 ⎤⎦ temporal component of Q-1 K-matrices and the propagation of Q Kostenbauder matrices can be used to propagate the Q-matrix. Qout Qin This relation holds for all systems (multiply the component matrices together first and then use this complex result): Qout ⎡A ⎢G = ⎣ ⎡C ⎢0 ⎣ 0⎤ ⎡B E / λ⎤ Qin + ⎢ ⎥ ⎥ 1⎦ ⎣H I / λ ⎦ 0⎤ ⎡D F / λ⎤ Qin + ⎢ ⎥ ⎥ 0⎦ 1 ⎦ ⎣0 Division means multiplication by the inverse. This is actually more elegant than it looks... [A] Propagating the Q-matrix ⎡A ⎢C ^ =⎢ ⎢G ⎢ ⎣0 Qout B 0 D H 0 1 0 0 E /λ⎤ ⎥ F /λ⎥ I /λ ⎥ ⎥ 1 ⎦ In terms of these 2x2 matrices: 0⎤ ⎡ B E / λ⎤ Qin + ⎢ ⎥ ⎥ / 1⎦ λ H I ⎣ ⎦ 0⎤ ⎡D F / λ ⎤ Qin + ⎢ ⎥ ⎥ 0⎦ 0 1 ⎣ ⎦ ⎡A ⎢G = ⎣ ⎡C ⎢0 ⎣ [C] [D] Notice the symmetry in the 2x2 matrices in the Qpropagation equation. Qout = [ A] Qin + [ B ] [C ] Qin + [ D ] [B] Important point about propagating Q Use Qout = [ A] Qin + [ B ] [C ] Qin + [ D ] to compute Qout. But use matrix multiplication for the various components to compute the total system ray matrix. Don’t compute Qout = [ A] Qin + [ B ] [C ] Qin + [ D ] for each component. As with q, you’d get the right answer, but you’d work much harder than you need to! How to compute pulse distortions The K-matrix elements are derivatives of the output parameters with respect to the input parameters, and the spatio-temporal distortions are derivatives of means with respect to the output pulse parameters. But because νout = νin, the E = dxout/dvin, F = dθout/dvin, and I = dtout/dvin elements clearly indicate the (added) spatial chirp, angular dispersion, and group-delay dispersion. This doesn’t work for pulse-front tilt, G = dtout/dxin, however. Spatial chirp: E element of the K-matrix Angular dispersion: F element of the K-matrix Group-delay dispersion: I element of the K-matrix assuming there’s zero distortion to begin with because xout may not be xin Pulse-front tilt: You must look for an off-diagonal Q-1 element Kostenbauder matrices can model very general systems using the pulse Wigner Distribution. Okay, the complex-Q matrix tells us what happens to Gaussian pulse/beams. But what about more complex pulse/beams? Ordinarily, you’d have to numerically solve the Fresnel diffraction integral, which can yield a very complex computer computation. But there’s a simpler approach. It uses the Wigner Distribution, a different way to represent a pulse’s dependence on time: ∞ t t WEt (ω,τ ) ≡ Et (τ + ) Et *(τ − ) exp(−iω t ) dt 2 2 −∞ ∫ The Wigner Distribution converts the pulse into a plot of intensity vs. time (delay) and frequency. We can invert the Wigner Distribution to obtain the pulse field. Inverse-Fourier transform the Wigner Distribution with respect to ω : Y { -1 } 1 WEt (ω,τ ) = 2π ∞ t t Et (τ + ) Et *(τ − )exp(−iω t ) dt exp(iω t′) dω 2 2 −∞ ∫∫ −∞ ∞ t t ⎧1 = Et (τ + ) Et *(τ − ) ⎨ 2 2 ⎩ 2π −∞ ∫ ∞ ⎫ ′ exp[−iω( t − t )] dω ⎬ dt −∞ ⎭ ∫ ∞ ∞ t t = Et (τ + ) Et *(τ − ) δ ( t − t′) dt 2 2 −∞ t′ t′ = Et (τ + ) Et *(τ − ) 2 2 ∫ Setting τ = t’/2: = Et (t′) Et *(0) So we can’t determine the absolute phase, but simple inverse Fourier transforming yields the rest of E(t). Examples of Wigner Distributions The Wigner Distribution is always real, but it usually goes negative. Linearly chirped Gaussian Quadratically chirped Gaussian τ ω τ ω τ ω Double pulse Properties of the Wigner Distribution The marginals (integrals) of the Wigner Distribution yield the pulse intensity vs. time and the spectrum vs. frequency. I (τ ) = ∫ ∞ −∞ WE (ω ,τ ) dω I(τ) S (ω ) = ∫ ∞ −∞ WE (ω ,τ ) dτ S(ω) τ ω The Wigner Distribution has many other nice properties. We can define spatial and space-time Wigner Distributions, too. A spatial Wigner Distribution is in terms of x and kx: dropping the x subscript on k x′ x′ WEx (k , x) ≡ Ex ( x + ) Ex *( x − ) exp(−ikx′) dx′ 2 2 −∞ ∫ ∞ The Wigner Distribution converts the beam into a plot of intensity vs. space and spatial frequency. Then we can define a space-time Wigner Distribution: WE (k , x, ω,τ ) ≡ ∞ x′ t x′ t E ( x + ,τ + ) E *( x − ,τ − ) 2 2 2 2 −∞ exp(−ikx′ − iωt ) dx′ dt ∫∫ −∞ ∞ In terms of Wigner Distributions, Kostenbauder matrices can describe a general optical system! If the K-matrix of the system and the input-pulse Wigner Distribution are known: WEout ( xout , kout , ωout ,τ out ) = m WEin ( xin , kin , ωin ,τ in ) where the output Wigner Distribution and its parameters are determined from the input parameters, the K-matrix of the system, and the input pulse Wigner Distribution. The quantity, m, is a magnification. J. Paye and A. Migus, “Space–time Wigner functions and their application to the analysis of a pulse shaper,” J. Opt. Soc. Am. B, 12, #8, p 1480, August 1995.
© Copyright 2024