Hamiltonian optics and Lagrangian optics are two formulations of geometrical optics which share much of the mathematical formalism with Hamiltonian mechanics and Lagrangian mechanics.
In physics, Hamilton's principle states that the evolution of a system described by generalized coordinates between two specified states at two specified parameters ÃÂ<sub>A</sub> and ÃÂ<sub>B</sub> is a stationary point (a point where the variation is zero) of the action functional, or
where and is the Lagrangian. Condition is valid if and only if the Euler-Lagrange equations are satisfied, i.e.,
with .
The momentum is defined as
and the EulerâÂÂLagrange equations can then be rewritten as
where .
A different approach to solving this problem consists in defining a Hamiltonian (taking a Legendre transform of the Lagrangian) as
for which a new set of differential equations can be derived by looking at how the total differential of the Lagrangian depends on parameter ÃÂ, positions and their derivatives relative to ÃÂ. This derivation is the same as in Hamiltonian mechanics, only with time t now replaced by a general parameter ÃÂ. Those differential equations are the Hamilton's equations
with . Hamilton's equations are first-order differential equations, while Euler-Lagrange's equations are second-order.
The general results presented above for Hamilton's principle can be applied to optics. In 3D euclidean space the generalized coordinates are now the coordinates of euclidean space.
Fermat's principle states that the optical length of the path followed by light between two fixed points, A and B, is a stationary point. It may be a maximum, a minimum, constant or an inflection point. In general, as light travels, it moves in a medium of variable refractive index which is a scalar field of position in space, that is, in 3D euclidean space. Assuming now that light travels along the x<sub>3</sub> axis, the path of a light ray may be parametrized as starting at a point and ending at a point . In this case, when compared to Hamilton's principle above, coordinates and take the role of the generalized coordinates while takes the role of parameter , that is, parameter ÃÂ =x<sub>3</sub> and N=2.
In the context of calculus of variations this can be written as
where is an infinitesimal displacement along the ray given by and
is the optical Lagrangian and .
The optical path length (OPL) is defined as
where n is the local refractive index as a function of position along the path between points A and B.
The general results presented above for Hamilton's principle can be applied to optics using the Lagrangian defined in Fermat's principle. The Euler-Lagrange equations with parameter ÃÂ =x<sub>3</sub> and N=2 applied to Fermat's principle result in
with and where L is the optical Lagrangian and .
The optical momentum is defined as
and from the definition of the optical Lagrangian this expression can be rewritten as
or in vector form
where is a unit vector and angles ñ<sub>1</sub>, ñ<sub>2</sub> and ñ<sub>3</sub> are the angles p makes to axis x<sub>1</sub>, x<sub>2</sub> and x<sub>3</sub> respectively, as shown in figure "optical momentum". Therefore, the optical momentum is a vector of norm
where n is the refractive index at which p is calculated. Vector p points in the direction of propagation of light. If light is propagating in a gradient index optic the path of the light ray is curved and vector p is tangent to the light ray.
The expression for the optical path length can also be written as a function of the optical momentum. Having in consideration that the expression for the optical Lagrangian can be rewritten as
and the expression for the optical path length is
Similarly to what happens in Hamiltonian mechanics, also in optics the Hamiltonian is defined by the expression given above for corresponding to functions and to be determined
Comparing this expression with for the Lagrangian results in
And the corresponding Hamilton's equations with parameter ÃÂ =x<sub>3</sub> and k=1,2 applied to optics are
with and .
It is assumed that light travels along the x<sub>3</sub> axis, in Hamilton's principle above, coordinates and take the role of the generalized coordinates while takes the role of parameter , that is, parameter ÃÂ =x<sub>3</sub> and N=2.
If plane x<sub>1</sub>x<sub>2</sub> separates two media of refractive index n<sub>A</sub> below and n<sub>B</sub> above it, the refractive index is given by a step function
and from Hamilton's equations
and therefore or for .
An incoming light ray has momentum p<sub>A</sub> before refraction (below plane x<sub>1</sub>x<sub>2</sub>) and momentum p<sub>B</sub> after refraction (above plane x<sub>1</sub>x<sub>2</sub>). The light ray makes an angle ø<sub>A</sub> with axis x<sub>3</sub> (the normal to the refractive surface) before refraction and an angle ø<sub>B</sub> with axis x<sub>3</sub> after refraction. Since the p<sub>1</sub> and p<sub>2</sub> components of the momentum are constant, only p<sub>3</sub> changes from p<sub>3A</sub> to p<sub>3B</sub>.
Figure "refraction" shows the geometry of this refraction from which . Since and , this last expression can be written as
which is Snell's law of refraction.
In figure "refraction", the normal to the refractive surface points in the direction of axis x<sub>3</sub>, and also of vector . A unit normal to the refractive surface can then be obtained from the momenta of the incoming and outgoing rays by
where i and r are unit vectors in the directions of the incident and refracted rays. Also, the outgoing ray (in the direction of ) is contained in the plane defined by the incoming ray (in the direction of ) and the normal to the surface.
A similar argument can be used for reflection in deriving the law of specular reflection, only now with n<sub>A</sub>=n<sub>B</sub>, resulting in ø<sub>A</sub>=ø<sub>B</sub>. Also, if i and r are unit vectors in the directions of the incident and refracted ray respectively, the corresponding normal to the surface is given by the same expression as for refraction, only with n<sub>A</sub>=n<sub>B</sub>
In vector form, if i is a unit vector pointing in the direction of the incident ray and n is the unit normal to the surface, the direction r of the refracted ray is given by:
with
If iâ n<0 then âÂÂn should be used in the calculations. When , light suffers total internal reflection and the expression for the reflected ray is that of reflection:
From the definition of optical path length
with k=1,2 where the Euler-Lagrange equations with k=1,2 were used. Also, from the last of Hamilton's equations and from above
combining the equations for the components of momentum p results in
Since p is a vector tangent to the light rays, surfaces S=Constant must be perpendicular to those light rays. These surfaces are called wavefronts. Figure "rays and wavefronts" illustrates this relationship. Also shown is optical momentum p, tangent to a light ray and perpendicular to the wavefront.
Vector field is conservative vector field. The gradient theorem can then be applied to the optical path length (as given above) resulting in
and the optical path length S calculated along a curve C between points A and B is a function of only its end points A and B and not the shape of the curve between them. In particular, if the curve is closed, it starts and ends at the same point, or A=B so that
This result may be applied to a closed path ABCDA as in figure "optical path length"
for curve segment AB the optical momentum p is perpendicular to a displacement ds along curve AB, or . The same is true for segment CD. For segment BC the optical momentum p has the same direction as displacement ds and . For segment DA the optical momentum p has the opposite direction to displacement ds and . However inverting the direction of the integration so that the integral is taken from A to D, ds inverts direction and . From these considerations
or
and the optical path length S<sub>BC</sub> between points B and C along the ray connecting them is the same as the optical path length S<sub>AD</sub> between points A and D along the ray connecting them. The optical path length is constant between wavefronts.
Figure "2D phase space" shows at the top some light rays in a two-dimensional space. Here x<sub>2</sub>=0 and p<sub>2</sub>=0 so light travels on the plane x<sub>1</sub>x<sub>3</sub> in directions of increasing x<sub>3</sub> values. In this case and the direction of a light ray is completely specified by the p<sub>1</sub> component of momentum since p<sub>2</sub>=0. If p<sub>1</sub> is given, p<sub>3</sub> may be calculated (given the value of the refractive index n) and therefore p<sub>1</sub> suffices to determine the direction of the light ray. The refractive index of the medium the ray is traveling in is determined by .
For example, ray r<sub>C</sub> crosses axis x<sub>1</sub> at coordinate x<sub>B</sub> with an optical momentum p<sub>C</sub>, which has its tip on a circle of radius n centered at position x<sub>B</sub>. Coordinate x<sub>B</sub> and the horizontal coordinate p<sub>1C</sub> of momentum p<sub>C</sub> completely define ray r<sub>C</sub> as it crosses axis x<sub>1</sub>. This ray may then be defined by a point r<sub>C</sub>=(x<sub>B</sub>,p<sub>1C</sub>) in space x<sub>1</sub>p<sub>1</sub> as shown at the bottom of the figure. Space x<sub>1</sub>p<sub>1</sub> is called phase space and different light rays may be represented by different points in this space.
As such, ray r<sub>D</sub> shown at the top is represented by a point r<sub>D</sub> in phase space at the bottom. All rays crossing axis x<sub>1</sub> at coordinate x<sub>B</sub> contained between rays r<sub>C</sub> and r<sub>D</sub> are represented by a vertical line connecting points r<sub>C</sub> and r<sub>D</sub> in phase space. Accordingly, all rays crossing axis x<sub>1</sub> at coordinate x<sub>A</sub> contained between rays r<sub>A</sub> and r<sub>B</sub> are represented by a vertical line connecting points r<sub>A</sub> and r<sub>B</sub> in phase space. In general, all rays crossing axis x<sub>1</sub> between x<sub>L</sub> and x<sub>R</sub> are represented by a volume R in phase space. The rays at the boundary âÂÂR of volume R are called edge rays. For example, at position x<sub>A</sub> of axis x<sub>1</sub>, rays r<sub>A</sub> and r<sub>B</sub> are the edge rays since all other rays are contained between these two. (A ray parallel to x1 would not be between the two rays, since the momentum is not in-between the two rays)
In three-dimensional geometry the optical momentum is given by with . If p<sub>1</sub> and p<sub>2</sub> are given, p<sub>3</sub> may be calculated (given the value of the refractive index n) and therefore p<sub>1</sub> and p<sub>2</sub> suffice to determine the direction of the light ray. A ray traveling along axis x<sub>3</sub> is then defined by a point (x<sub>1</sub>,x<sub>2</sub>) in plane x<sub>1</sub>x<sub>2</sub> and a direction (p<sub>1</sub>,p<sub>2</sub>). It may then be defined by a point in four-dimensional phase space x<sub>1</sub>x<sub>2</sub>p<sub>1</sub>p<sub>2</sub>.
Figure "volume variation" shows a volume V bound by an area A. Over time, if the boundary A moves, the volume of V may vary. In particular, an infinitesimal area dA with outward pointing unit normal n moves with a velocity v.
This leads to a volume variation . Making use of Gauss's theorem, the variation in time of the total volume V volume moving in space is
The rightmost term is a volume integral over the volume V and the middle term is the surface integral over the boundary A of the volume V. Also, v is the velocity with which the points in V are moving.
In optics coordinate takes the role of time. In phase space a light ray is identified by a point which moves with a "velocity" where the dot represents a derivative relative to . A set of light rays spreading over in coordinate , in coordinate , in coordinate and in coordinate occupies a volume in phase space. In general, a large set of rays occupies a large volume in phase space to which Gauss's theorem may be applied
and using Hamilton's equations
or and which means that the phase space volume is conserved as light travels along an optical system.
The volume occupied by a set of rays in phase space is called etendue, which is conserved as light rays progress in the optical system along direction x<sub>3</sub>. This corresponds to Liouville's theorem, which also applies to Hamiltonian mechanics.
Figure "conservation of etendue" shows on the left a diagrammatic two-dimensional optical system in which x<sub>2</sub>=0 and p<sub>2</sub>=0 so light travels on the plane x<sub>1</sub>x<sub>3</sub> in directions of increasing x<sub>3</sub> values.
Light rays crossing the input aperture of the optic at point x<sub>1</sub>=x<sub>I</sub> are contained between edge rays r<sub>A</sub> and r<sub>B</sub> represented by a vertical line between points r<sub>A</sub> and r<sub>B</sub> at the phase space of the input aperture (right, bottom corner of the figure). All rays crossing the input aperture are represented in phase space by a region R<sub>I</sub>.
Also, light rays crossing the output aperture of the optic at point x<sub>1</sub>=x<sub>O</sub> are contained between edge rays r<sub>A</sub> and r<sub>B</sub> represented by a vertical line between points r<sub>A</sub> and r<sub>B</sub> at the phase space of the output aperture (right, top corner of the figure). All rays crossing the output aperture are represented in phase space by a region R<sub>O</sub>.
Conservation of etendue in the optical system means that the volume (or area in this two-dimensional case) in phase space occupied by R<sub>I</sub> at the input aperture must be the same as the volume in phase space occupied by R<sub>O</sub> at the output aperture.
In imaging optics, all light rays crossing the input aperture at x<sub>1</sub>=x<sub>I</sub> are redirected by it towards the output aperture at x<sub>1</sub>=x<sub>O</sub> where x<sub>I</sub>=m x<sub>O</sub>. This ensures that an image of the input is formed at the output with a magnification m. In phase space, this means that vertical lines in the phase space at the input are transformed into vertical lines at the output. That would be the case of vertical line r<sub>A</sub> r<sub>B</sub> in R<sub>I</sub> transformed to vertical line r<sub>A</sub> r<sub>B</sub> in R<sub>O</sub>.
In nonimaging optics, the goal is not to form an image but simply to transfer all light from the input aperture to the output aperture. This is accomplished by transforming the edge rays âÂÂR<sub>I</sub> of R<sub>I</sub> to edge rays âÂÂR<sub>O</sub> of R<sub>O</sub>. This is known as the edge ray principle.
Above it was assumed that light travels along the x<sub>3</sub> axis, in Hamilton's principle above, coordinates and take the role of the generalized coordinates while takes the role of parameter , that is, parameter ÃÂ =x<sub>3</sub> and N=2. However, different parametrizations of the light rays are possible, as well as the use of generalized coordinates.
A more general situation can be considered in which the path of a light ray is parametrized as in which ÃÂ is a general parameter. In this case, when compared to Hamilton's principle above, coordinates , and take the role of the generalized coordinates with N=3. Applying Hamilton's principle to optics in this case leads to
where now and and for which the Euler-Lagrange equations applied to this form of Fermat's principle result in
with k=1,2,3 and where L is the optical Lagrangian. Also in this case the optical momentum is defined as
and the Hamiltonian P is defined by the expression given above for N=3 corresponding to functions , and to be determined
And the corresponding Hamilton's equations with k=1,2,3 applied optics are
with and .
The optical Lagrangian is given by
and does not explicitly depend on parameter ÃÂ. For that reason not all solutions of the Euler-Lagrange equations will be possible light rays, since their derivation assumed an explicit dependence of L on ÃÂ which does not happen in optics.
The optical momentum components can be obtained from
where . The expression for the Lagrangian can be rewritten as
Comparing this expression for L with that for the Hamiltonian P it can be concluded that
From the expressions for the components of the optical momentum results
The optical Hamiltonian is chosen as
although other choices could be made. The Hamilton's equations with k = 1, 2, 3 defined above together with define the possible light rays.
As in Hamiltonian mechanics, it is also possible to write the equations of Hamiltonian optics in terms of generalized coordinates , generalized momenta and Hamiltonian P as
where the optical momentum is given by
and , and are unit vectors. A particular case is obtained when these vectors form an orthonormal basis, that is, they are all perpendicular to each other. In that case, is the cosine of the angle the optical momentum makes to unit vector .