Radon–Nikodym theorem

In mathematics, the RadonÃ¢ÂÂNikodym theorem, named after Johann Radon and Otto M. Nikodym, is a result in measure theory that expresses the relationship between two measures defined on the same measurable space. A measure is a set function that assigns a consistent magnitude to the measurable subsets of a measurable space. Examples of a measure include area and volume, where the subsets are sets of points; or the probability of an event, which is a subset of possible outcomes within a wider probability space.

One way to derive a new measure from one already given is to assign a density to each point of the space, then integrate over the measurable subset of interest. This can be expressed as

where is the new measure being defined for any measurable subset and the function is the density at a given point. The integral is with respect to an existing measure , which may often be the canonical Lebesgue measure on the real line or the -dimensional Euclidean space (corresponding to our standard notions of length, area and volume). For example, if represented mass density and was the Lebesgue measure in three-dimensional space , then would equal the total mass in a spatial region .

The RadonÃ¢ÂÂNikodym theorem essentially states that, under certain conditions, any measure can be expressed in this way with respect to another measure on the same space. The function is then called the RadonÃ¢ÂÂNikodym derivative and is denoted by . An important application is in probability theory, leading to the probability density function of a random variable.

The theorem is named after Johann Radon, who proved the theorem for the special case where the underlying space is in 1913, and for Otto Nikodym who proved the general case in 1930. In 1936, Hans Freudenthal generalized the RadonÃ¢ÂÂNikodym theorem by proving the Freudenthal spectral theorem, a result in Riesz space theory; this contains the RadonÃ¢ÂÂNikodym theorem as a special case.

A Banach space is said to have the RadonÃ¢ÂÂNikodym property if the generalization of the RadonÃ¢ÂÂNikodym theorem also holds (with the necessary adjustments made) for functions with values in . All Hilbert spaces have the RadonÃ¢ÂÂNikodym property.

Formal description

RadonÃ¢ÂÂNikodym theorem

The RadonÃ¢ÂÂNikodym theorem involves a measurable space on which two ÃÂ-finite measures are defined, and It states that, if (that is, if is absolutely continuous with respect to ), then there exists a -measurable function such that for any measurable set

RadonÃ¢ÂÂNikodym derivative

The function satisfying the above equality is , that is, if is another function which satisfies the same property, then . The function is commonly written and is called the . The choice of notation and the name of the function reflects the fact that the function is analogous to a derivative in calculus in the sense that it describes the rate of change of density of one measure with respect to another (the way the Jacobian determinant is used in multivariable integration).

Extension to signed or complex measures

A similar theorem can be proven for signed and complex measures: namely, that if is a nonnegative ÃÂ-finite measure, and is a finite-valued signed or complex measure such that that is, is absolutely continuous with respect to then there is a -integrable real- or complex-valued function on such that for every measurable set

Examples

In the following examples, the set is the real interval , and is the Borel sigma-algebra on .

Let be the length measure on , and let assign to each subset of twice the length of . Then .
Let be the length measure on , and let assign to each subset of the number of points from the set that are contained in . Then is not absolutely continuous with respect to since it assigns non-zero measure to zero-length points. Indeed, there is no derivative : there is no finite function that, when integrated e.g. from to , gives for all .
, where is the length measure on and is the Dirac measure on 0 (it assigns a measure of 1 to any set containing 0 and a measure of 0 to any other set). Then, is absolutely continuous with respect to , and Ã¢ÂÂ the derivative is 0 at and 1 at .

Properties

Let ÃÂ½, ÃÂ¼, and ÃÂ» be ÃÂ-finite measures on the same measurable space. If ÃÂ½ Ã¢ÂÂª ÃÂ» and ÃÂ¼ Ã¢ÂÂª ÃÂ» (ÃÂ½ and ÃÂ¼ are both absolutely continuous with respect to ÃÂ»), then
If ÃÂ½ Ã¢ÂÂª ÃÂ¼ Ã¢ÂÂª ÃÂ», then
In particular, if ÃÂ¼ Ã¢ÂÂª ÃÂ½ and ÃÂ½ Ã¢ÂÂª ÃÂ¼, then
If ÃÂ¼ Ã¢ÂÂª ÃÂ» and is a ÃÂ¼-integrable function, then
If ÃÂ½ is a finite signed or complex measure, then

Applications

Probability theory

The theorem is very important in extending the ideas of probability theory from probability masses and probability densities defined over real numbers to probability measures defined over arbitrary sets. It tells if and how it is possible to change from one probability measure to another. Specifically, the probability density function of a random variable is the RadonÃ¢ÂÂNikodym derivative of the induced measure with respect to some base measure (usually the Lebesgue measure for continuous random variables).

For example, it can be used to prove the existence of conditional expectation for probability measures. The latter itself is a key concept in probability theory, as conditional probability is just a special case of it.

Financial mathematics

Amongst other fields, financial mathematics uses the theorem extensively, in particular via the Girsanov theorem. Such changes of probability measure are the cornerstone of the rational pricing of derivatives and are used for converting actual probabilities into those of the risk neutral probabilities.

Information divergences

If ÃÂ¼ and ÃÂ½ are measures over , and ÃÂ¼ Ã¢ÂÂª ÃÂ½

The KullbackÃ¢ÂÂLeibler divergence from ÃÂ½ to ÃÂ¼ is defined to be
For ÃÂ± > 0, ÃÂ± Ã¢ÂÂ 1 the RÃÂ©nyi divergence of order ÃÂ± from ÃÂ½ to ÃÂ¼ is defined to be

The assumption of ÃÂ-finiteness

The RadonÃ¢ÂÂNikodym theorem above makes the assumption that the measure ÃÂ¼ with respect to which one computes the rate of change of ÃÂ½ is ÃÂ-finite.

Negative example

Here is an example when ÃÂ¼ is not ÃÂ-finite and the RadonÃ¢ÂÂNikodym theorem fails to hold.

Consider the Borel ÃÂ-algebra on the real line. Let the counting measure, , of a Borel set be defined as the number of elements of if is finite, and otherwise. One can check that is indeed a measure. It is not -finite, as not every Borel set is at most the union of countably many finite sets. Let be the usual Lebesgue measure on this Borel algebra. Then, is absolutely continuous with respect to , since for a set one has only if is the empty set, and then is also zero.

Assume that the RadonÃ¢ÂÂNikodym theorem holds, that is, for some measurable function one has

for all Borel sets. Taking to be a singleton set, , and using the above equality, one finds

for all real numbers . This implies that the function , and therefore the Lebesgue measure , is zero, which is a contradiction.

Positive result

Assuming the RadonÃ¢ÂÂNikodym theorem also holds if is localizable and is accessible with respect to , i.e., for all

Proof

This section gives a measure-theoretic proof of the theorem. There is also a functional-analytic proof, using Hilbert space methods, that was first given by von Neumann.

For finite measures and , the idea is to consider functions with . The supremum of all such functions, along with the monotone convergence theorem, then furnishes the RadonÃ¢ÂÂNikodym derivative. The fact that the remaining part of is singular with respect to follows from a technical fact about finite measures. Once the result is established for finite measures, extending to -finite, signed, and complex measures can be done naturally. The details are given below.

For finite measures

Constructing an extended-valued candidate First, suppose and are both finite-valued nonnegative measures. Let be the set of those extended-value measurable functions such that:

, since it contains at least the zero function. Now let , and suppose is an arbitrary measurable set, and define:

Then one has

and therefore, .

Now, let be a sequence of functions in such that

By replacing with the maximum of the first functions, one can assume that the sequence is increasing. Let be an extended-valued function defined as

By Lebesgue's monotone convergence theorem, one has

for each , and hence, . Also, by the construction of ,

Proving equality Now, since ,

defines a nonnegative measure on . To prove equality, we show that .

Suppose ; then, since is finite, there is an such that . To derive a contradiction from , we look for a positive set for the signed measure (i.e. a measurable set , all of whose measurable subsets have non-negative measure), where also has positive -measure. Conceptually, we're looking for a set , where in every part of . A convenient approach is to use the Hahn decomposition for the signed measure .

Note then that for every one has , and hence,

where is the indicator function of . Also, note that as desired; for if , then (since is absolutely continuous in relation to ) , so and

contradicting the fact that .

Then, since also

and satisfies

This is impossible because it violates the definition of a supremum; therefore, the initial assumption that must be false. Hence, , as desired.

Restricting to finite values Now, since is -integrable, the set is -null. Therefore, if a is defined as

then has the desired properties.

Uniqueness As for the uniqueness, let be measurable functions satisfying

for every measurable set . Then, is -integrable, and

(Recall that we can split the integral into two as long as they are measurable and non-negative)

In particular, for or . It follows that

and so, that -almost everywhere; the same is true for , and thus, -almost everywhere, as desired.

For -finite positive measures

If and are -finite, then can be written as the union of a sequence of disjoint sets in , each of which has finite measure under both and . For each , by the finite case, there is a -measurable function such that

for each -measurable subset of . The sum of those functions is then the required function such that .

As for the uniqueness, since each of the is -almost everywhere unique, so is .

For signed and complex measures

If is a -finite signed measure, then it can be HahnÃ¢ÂÂJordan decomposed as where one of the measures is finite. Applying the previous result to those two measures, one obtains two functions, , satisfying the RadonÃ¢ÂÂNikodym theorem for and respectively, at least one of which is -integrable (i.e., its integral with respect to is finite). It is clear then that satisfies the required properties, including uniqueness, since both and are unique up to -almost everywhere equality.

If is a complex measure, it can be decomposed as , where both and are finite-valued signed measures. Applying the above argument, one obtains two functions, , satisfying the required properties for and , respectively. Clearly, is the required function.

The Lebesgue decomposition theorem

Lebesgue's decomposition theorem shows that the assumptions of the RadonÃ¢ÂÂNikodym theorem can be found even in a situation which is seemingly more general. Consider a ÃÂ-finite positive measure on the measure space and a ÃÂ-finite signed measure on , without assuming any absolute continuity. Then there exist unique signed measures and on such that , , and . The RadonÃ¢ÂÂNikodym theorem can then be applied to the pair .

Notes

References

Contains a proof for vector measures assuming values in a Banach space.
Contains a lucid proof in case the measure ÃÂ½ is not ÃÂ-finite.
Contains a proof of the generalisation.