Simulated Data for Completeness and Reliability

The completeness and reliability of the Kepler pipeline must be quantified in order to calculate robust planet occurrence rates. The Kepler project generated multiple kinds of simulated data to challenge the pipeline and examine the results; these data are served below to facilitate generation of custom completeness and reliability calculations. For the Kepler project completeness and reliability calculations, see the Kepler Completeness and Reliability page.

Skip to:

Modified Light Curve Products

Five sets of simulated data were created during the production of the Q1-Q17 DR25 KOI catalog:

  • Injected Group 1 (INJ1) contains a single on-target injection for 146,294 observed targets occupying 64 channels in the Kepler focal plane. Each injection mimics a planet transiting the specified target.
  • Injected Group 2 (INJ2) contains a single off-target injection for 33,978 observed targets occupying 16 channels in the Kepler focal plane. Each injection mimics a foreground or background transiting planet or eclipsing binary along the line of sight to the specified target.
  • Injected Group 3 (INJ3) contains two on-target injections for 9,856 observed targets occupying 4 channels in the Kepler focal plane. Each injection pair has a common period so as to simulate an eclipsing binary on the specified target.
  • Inverted Group (INV) contains 198,640 targets that have had their observed light curves inverted immediately before the planet search to produce a set of TCEs that are likely to be false positives.
  • Scrambled Group (SCR1) contains 198,640 targets that have had their observed light curves scrambled by rearranging the data in yearly chunks immediately before the planet search to produce a set of TCEs that are likely to be false positives.

The relevant data products are listed in the table below for each group:

  • Light curves with injected transits: The signals were injected at the pixel level (i.e., into the CAL output) and the normal photometric pipeline (i.e., modules PA and PDC) was run to create standard light curve products.
  • DV summaries and DV reports: A normal search was conducted on the injected light curves with pipeline modules TPS and DV to produce standard DV summaries and reports for each TCE identified.
  • Average Detection Efficiency: The results of the full pipeline run on the injected light curves (i.e., PA, PDC, TPS, DV) can be used to compute the average detection efficiency of the SOC 9.3 pipeline for an arbitrary subset of targets.
  • TCERT Reports: The DR25 Robovetter metrics were produced for all simulated (and observed) data sets and these standard reports were generated to allow manual evaluation of any simulated (or observed) TCE.
  • Robovetter Results: The DR25 Robovetter was run and the dispositions and flags preserved for all simulated TCEs for use in determining completeness and effectiveness of the dispositioning process.
  • RV input/output: The Robovetter metrics are formatted as input files and the Robovetter outputs are preserved for all simulated data sets.
INJ1 INJ2 INJ3 INV SCR1 Documentation
Light curves with injected transits wget script
(625 GB)†
wget script
(149 GB)†
wget script
(41 GB)†
N/A N/A KSCI-19110-001.pdf
DV Summaries wget script
(216 GB)†
wget script
(74 GB)†
wget script
(32 GB)†
N/A  N/A  KSCI-19110-001.pdf
DV Reports wget script
(492 GB)†
wget script
(172 GB)†
wget script
(76 GB)†
N/A N/A KSCI-19110-001.pdf
Average Detection Efficiency download table download table download table N/A   N/A   KSCI-19110-001.pdf
TCERT Reports wget script
(110 GB)†
wget script
(45 GB)†
wget script
(23 GB)†
wget script
(47 GB)†
wget script
(50 GB)†
KSCI-19114-001.pdf
Robovetter Results download table download table download table download table download table KSCI-19114-001.pdf
RV input file at Github file at Github file at Github file at Github file at Github KSCI-19114-001.pdf
RV output file at Github file at Github file at Github file at Github file at Github KSCI-19114-001.pdf

† The file size refers to the destination data set, which requires an initial download of a wget script that is much smaller (~15 MB). For more information, see the wget help page. For best performance, do not run more than 4 wget scripts in parallel. Running additional parallel scripts will not decrease the total download duration.



Flux-level Transit Injection Products

The per-target detection efficiency was generated by injecting thousands of signals into Kepler light curves. These files are divided into five runs, as described in the documentation (KSCI-19109-002.pdf).

  # Kepler Light Curves Injections Per Target Download File
KSOC-5004 40 ~600,000 tar file (1.9 GB)
KSOC-5006 32316 ~2000 tar file (5.3 GB)
KSOC-5008 25 ~600,000 tar file (5.3 GB)
KSOC-5104 5436 ~2000 tar file (999 MB)
KSOC-5125 50 ~600,000 tar file (2.5 GB)


Last updated: 14 June 2017