Algorithms

The algorithms currently implemented for computing periodograms from light curves are Lomb-Scargle (Scargle 1982), Box-fitting Least Squares or "BLS" (Kovacs et al. 2002), and Plavchan (Plavchan et al. 2008).

Skip to a section:

Lomb-Scargle
Box-fitting Least Squares (BLS)
Plavchan

Lomb-Scargle

How it works

The Lomb-Scargle (L-S) algorithm (Scargle, 1982) is a variation of the Discrete Fourier Transform (DFT), in which a time series is decomposed into a linear combination of sinusoidal functions. The basis of sinusoidal functions transforms the data from the time domain to the frequency domain. DFT techniques often assume evenly spaced data points in the time series, but this is rarely the case with astrophysical time-series data. Scargle has derived a formula for transform coefficients that is similar to the DFT in the limit of evenly spaced observations. In addition, an adjustment of the values used to calculate the transform coefficients makes the transform invariant to time shifts.
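As an illustration of the description above, the following is a minimal sketch of Scargle's (1982) periodogram for unevenly sampled data. It is not the archive's code: the function name `lomb_scargle_power`, the use of NumPy, and the choice of the sample variance (`ddof=1`) for normalization are assumptions made for this example. The offset `tau` is the adjustment that makes the result invariant to time shifts.

```python
import numpy as np

def lomb_scargle_power(t, y, freqs):
    """Scargle (1982) periodogram of (t, y) at the given frequencies (freqs > 0).

    Illustrative only: power is divided by the sample variance of y,
    which is an assumption about the normalization convention.
    """
    t = np.asarray(t, dtype=float)
    y = np.asarray(y, dtype=float)
    y = y - y.mean()              # work with mean-subtracted data
    var = y.var(ddof=1)           # sample variance used for normalization
    power = np.empty(len(freqs))
    for k, f in enumerate(freqs):
        w = 2.0 * np.pi * f       # angular frequency
        # tau is chosen so that the sine and cosine terms are orthogonal;
        # it is what makes the power invariant to a shift of all times
        tau = np.arctan2(np.sin(2 * w * t).sum(),
                         np.cos(2 * w * t).sum()) / (2 * w)
        c = np.cos(w * (t - tau))
        s = np.sin(w * (t - tau))
        power[k] = 0.5 * ((y @ c) ** 2 / (c @ c)
                          + (y @ s) ** 2 / (s @ s)) / var
    return power
```

For evenly spaced times the sums of the cross terms vanish at the Fourier frequencies, and the expression takes the form of the classical DFT periodogram.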

How to Use the Algorithm

The Lomb-Scargle periodogram is optimized to identify sinusoidal-shaped periodic signals in time-series data. Particular applications include radial velocity data and searches for pulsating variable stars. L-S is not optimal for detecting signals from transiting exoplanets, where the shape of the periodic light curve is not sinusoidal.

Symbol	Definition
N	number of points in the input file
df	step size in frequency space
dp	step size in period space
p(i)	output period value

Statistical Distribution

In the NASA Exoplanet Archive's implementation, the periodogram power is scaled by the inverse of the variance of the original signal data values. Horne and Baliunas (1986) showed that this scaled power has an exponential distribution for Gaussian noise data values and a large number of observations N_obs. The probability, p, of observing a power less than or equal to P₀ in one sample when the time series is a noise signal is then given by:

$$p = 1 - e^{-P_0}$$

The probability of seeing at least one sample exceeding this value is then given by

$$1 - p^{M} = 1 - \left(1 - e^{-P_0}\right)^{M}$$

where M is the number of periods sampled.
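The two probabilities described above can be sketched in a few lines. This is an illustrative example rather than the archive's code: the function name `false_alarm_probability` is hypothetical, and it assumes the exponential distribution of the scaled power that holds for Gaussian noise and large N_obs. `expm1` and `log1p` are used so that very small probabilities are not rounded to zero.

```python
import numpy as np

def false_alarm_probability(p0, m):
    """Chance that at least one of m samples exceeds power p0 (p0 > 0).

    Assumes exponentially distributed scaled power, so the chance that a
    single sample is <= p0 is 1 - exp(-p0).
    """
    # log of the single-sample probability, log(1 - exp(-p0))
    log_p_single = np.log1p(-np.exp(-p0))
    # 1 - p_single**m, evaluated in a numerically stable way
    return -np.expm1(m * log_p_single)

# Example: a peak of scaled power 12 among 1000 sampled periods
fap = false_alarm_probability(12.0, 1000)
```

A small value of `fap` indicates that a peak of that height is unlikely to arise from noise alone under these assumptions.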

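The above expression is invalid in the limit of a small number of observations, N_obs. When N_obs is less than 50, the following formula, given by Zechmeister and Kürster (2009) for power normalized by the sample variance, is applied:

$$p = 1 - \left(1 - \frac{2 P_0}{N_{obs} - 1}\right)^{(N_{obs} - 3)/2}$$

and, again, the probability of seeing at least one sample exceeding P₀ is

$$1 - p^{M}$$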

where M is now the number of independent frequencies. The theoretical number of independent frequencies for a given data set lies between N and N(N-1)/2 (that is, N choose 2). The effective number of independent frequencies is approximately equal to

$$M \approx \frac{f_{\max} - f_{\min}}{df}$$

where f_max - f_min is the frequency range searched and df is the width (in frequency) of a peak (Zechmeister and Kürster, 2009), taken here as the width of the highest peak in the periodogram. The beginning and ending points of a peak are defined as the frequencies at which the power is half of the peak's maximum.
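The peak-width definition above can be sketched as follows. This is a hedged example, not the archive's implementation: the function names are hypothetical, the half-power crossings are located by linear interpolation between grid points (an assumption), and the effective number of independent frequencies is taken to be the searched frequency range divided by the half-maximum width of the highest peak.

```python
import numpy as np

def _half_power_crossing(freqs, power, peak, step):
    """Frequency where the power falls to half the peak value,
    walking from the peak in direction step (+1 or -1)."""
    half = 0.5 * power[peak]
    i = peak
    while 0 <= i + step < len(power) and power[i + step] > half:
        i += step
    j = i + step
    if j < 0 or j >= len(power):
        return freqs[i]           # peak runs off the edge of the grid
    # linear interpolation between i (above half) and j (at or below half)
    frac = (power[i] - half) / (power[i] - power[j])
    return freqs[i] + frac * (freqs[j] - freqs[i])

def effective_independent_frequencies(freqs, power):
    """Return (M, df) for a periodogram with a positive maximum,
    sampled on an increasing frequency grid."""
    freqs = np.asarray(freqs, dtype=float)
    power = np.asarray(power, dtype=float)
    peak = int(np.argmax(power))  # index of the highest peak
    f_left = _half_power_crossing(freqs, power, peak, -1)
    f_right = _half_power_crossing(freqs, power, peak, +1)
    df = f_right - f_left         # width of the peak at half its maximum
    return (freqs[-1] - freqs[0]) / df, df
```

The returned M can then be used as the exponent when converting a single-sample probability into the probability of at least one sample exceeding P₀.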

References

Horne, J.H., Baliunas, S.L. "A prescription for period analysis of unevenly sampled time series." Astrophysical Journal, 302:757-763 (1986)

Scargle, J.D. "Studies in Astronomical Time Series Analysis II: Statistical Aspects of Spectral Analysis of Unevenly Spaced Data." Astrophysical Journal, 263:835-853 (1982)

Zechmeister, M., Kürster, M. "The Generalised Lomb-Scargle Periodogram. A new Formalism for the Floating-mean and Keplerian Periodograms." Astronomy and Astrophysics, 496:577-584 (2009)