Yale University

PHYS 381/504: Random Variables, Correlation Functions and Power Spectral Density

Modules:

Optical Tweezers
Computer simulations/Phase transitions
Scanning tunnelling microscope
Superconducting tunneling
Weak localization of light
Nuclear magnetic resonance
Compton scattering
Least-mean-squares
Random Variables
COURSE GRADE
Questions
PHYS 381 HOME
Classes Page
Yale Graduate School
Yale University
Publications
© 2005 Yale University, New Haven, Connecticut 06520

Random variables

 

In experimental science, often one encounters a signal that seems to fluctuate versus time. An example such a signal is shown below. Several features seem to characterize this signal:

  1. The signal is a random variable.
  2. tweezers simulation

  3. It has zero mean.
  4. Its average magnitude seems relatively well-defined with values far greater than this magnitude unlikely.
  5. The statistical properties of the signal do not change in time. (It is an ergodic process.)
  6. The signals separated by large time intervals are statistically independent.
  7. However, for small time intervals, the signal does not change appreciably.

Correlation functions

In dealing with random variables, we are lead to the probability densities for those variables. Therefore, let's start by trying to describe mathematically the properties listed above by writing down a joint probability density [p (x sub 1, x sub 2)] which specifies the probability density that the signal at time t sub 1 is x sub 1 and that the signal at time t sub 2 is x sub 2. (Be sure that you understand the difference between a probability density and a probability.)

A joint probability density that seems to yield the properties listed above is a so-called jointly Gaussian (or bivariate Gaussian) probability density:

                     bivariate Gaussian                         (1)

sigma and rho are parameters. sigma is a constant, independent of t. It will turn out that rho equals rho (t) is a function of t equals the absolute value of t sub 1 minus t sub 2 that is always less than 1.

It is straightforward to show that p (x sub 1)--- the probability density for x sub 1, irrespective of x sub 2 --- is Gaussian with zero mean, and variance sigma squared, independent of rho, or equivalently time, as suggested in our list. (Do it --- use Mathematica to carry out the integral.)

We noted above that x sub 1 and x sub 2 are statistically independent for large the absolute value of t sub 1 minus t sub 2 . Statistical independence implies that p (x sub 1, x sub 2) equals p (x sub 1) p (x sub 2). This is only the case, if rho equals zero. Therefore, Eq. 1 can only be applicable to our case, if rho goes to zero as the absolute value of t sub 1 minus t sub 2 goes to infinity. (Another way to say that x sub 1 and x sub 2 are statistically independent is to say that they are uncorrelated.)

On the other hand, it is straightforward to show that as rho goes to 1 , the variance of x sub 1 minus x sub 2 goes to zero, i.e. x sub 1 goes to x sub 2 for rho goes to 1. Thus, we recover the observation that the signal does not change appreciably for small time intervals only if rho goes to 1 for the absolute value of t sub 1 minus t sub 2 goes to zero .

Thus, our candidate joint probability density seems consistent with the properties of our signal provided rho behaves as described. It is plausible then that our random process is characterized by two quantities sigma and rho (t). Our goal now is to figure out how to measure these quantities.

Using Eq. 1, we may show that

                                                                                             (2)

and that

                                                                               (3)

where the angle brackets denote an average. (Do this --- again use Mathematica.) These results inform us how, in principle, to determine sigma and rho from an experimental determination of x (t). How to determine sigma is straightforward. To determine rho, Eq. 3 tells us to take the signal at time t sub 1 and the signal at time t sub 2 equals t plus t sub 1 and multiply them together. We do this for every value of t sub 1, while keeping t fixed, and then average all these products together. This quantity is the correlation function or autocorrelation function of the random process in question. The correlation function concept is very important in both experimental and theoretical condensed matter physics. Correlation functions are important experimentally, because usually, if something can be measured, it is a correlation function. Correlation functions are important theoretically, because usually, if something can be calculated, it is a correlation function. And that's what physics is about: making and testing quantitative predictions.

Power spectral density

So far, our discussion has occurred in the time domain. However, time and frequency domains are related by Fourier transformation. Given a random variable as a function of time acquired at a sampling frequency f sub s equals 1 over (Delta t) for a total time T, we (or LabVIEW, at least) can calculate the discrete Fourier transform

                                   Fourier transform                                (4)

with f sub k equal to k over T, t sub j equal to j Delta t, and k equal to minus (N over 2) plus 1, ..., N over 2 is an integer, and N equals T over (Delta t). Note that the maximum frequency for which x (f sub k) is available is f sub (N over 2) equal to (N over (2T)) equal to f sub s over 2. This maximum frequency, equal to one half of the sampling frequency, is called the Nyquist frequency (f sub Nyquist).

A little thought will convince you that a finite sampling frequency means that fluctuations with an actual frequency of f sub actual greater than f sub Nyquist will appear in this measurement to have a frequency f sub apparent equals f sub actual minus m f sub Nyquist, where m is the integer that puts f sub apparent in the frequency range sampled. This is the phenomenon of aliasing. Aliasing will NOT be important if your PSD does NOT have much weight at frequencies above f sub Nyquist . Whether this is the case must be determined on a case-by-case basis.

Rather than outputing x (f sub k) itself, in fact, LabVIEW outputs the so-called power spectral density (PSD): S (f sub k) equals the absolute value of x (f sub k) squared. Note that the units of PSD as measured are volts squared over Hz, if x (t) is in volts. Sometimes people take the square-root and the units then would be volts over the square-root of Hz.

One reason for outputing the PSD rather than x (f sub k), is that the Fourier transform of the PSD is the correlation function, S (t), discussed above. Because of their Fourier transform relationship, an experimental determination of either S (t) or S (f) as a function of its argument is sufficient for the determination of the other.

Acknowledgement: Much of the discussion of this section is based on unpublished lecture notes by Tom Greytak (MIT) originally given in MIT's sophomore stat. mech. class, 8.044.