-
Notifications
You must be signed in to change notification settings - Fork 4
/
Copy pathms.tex
1899 lines (1683 loc) · 87.9 KB
/
ms.tex
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
% Copyright 2015-2016 Dan Foreman-Mackey and the co-authors listed below.
\documentclass[manuscript, letterpaper]{aastex6}
\pdfoutput=1
\include{vc}
\include{numbers}
\include{completenessfit}
\usepackage{microtype}
\usepackage{url}
\usepackage{amssymb,amsmath}
\usepackage{natbib}
\usepackage{multirow}
\bibliographystyle{aasjournal}
% ----------------------------------- %
% start of AASTeX mods by DWH and DFM %
% ----------------------------------- %
\setlength{\voffset}{0in}
\setlength{\hoffset}{0in}
\setlength{\textwidth}{6in}
\setlength{\textheight}{9in}
\setlength{\headheight}{0ex}
\setlength{\headsep}{\baselinestretch\baselineskip} % this is 2 lines in ``manuscript''
\setlength{\footnotesep}{0in}
\setlength{\topmargin}{-\headsep}
\setlength{\oddsidemargin}{0.25in}
\setlength{\evensidemargin}{0.25in}
\linespread{0.54} % close to 10/13 spacing in ``manuscript''
\setlength{\parindent}{0.54\baselineskip}
\hypersetup{colorlinks = false}
\makeatletter % you know you are living your life wrong when you need to do this
\long\def\frontmatter@title@above{
\vspace*{-\headsep}\vspace*{\headheight}
\noindent\footnotesize
{\noindent\footnotesize\textsc{\@journalinfo}}\par
{\noindent\scriptsize Preprint typeset using \LaTeX\ style AASTeX6\\
With modifications by David W. Hogg and Daniel Foreman-Mackey
}\par\vspace*{-\baselineskip}\vspace*{0.625in}
}%
\makeatother
% Section spacing:
\makeatletter
\let\origsection\section
\renewcommand\section{\@ifstar{\starsection}{\nostarsection}}
\newcommand\nostarsection[1]{\sectionprelude\origsection{#1}}
\newcommand\starsection[1]{\sectionprelude\origsection*{#1}}
\newcommand\sectionprelude{\vspace{1em}}
\let\origsubsection\subsection
\renewcommand\subsection{\@ifstar{\starsubsection}{\nostarsubsection}}
\newcommand\nostarsubsection[1]{\subsectionprelude\origsubsection{#1}}
\newcommand\starsubsection[1]{\subsectionprelude\origsubsection*{#1}}
\newcommand\subsectionprelude{\vspace{1em}}
\makeatother
\widowpenalty=10000
\clubpenalty=10000
\sloppy\sloppypar
% ------------------ %
% end of AASTeX mods %
% ------------------ %
% Projects:
\newcommand{\project}[1]{\textsl{#1}}
\newcommand{\kepler}{\project{Kepler}}
\newcommand{\KT}{\project{K2}}
\newcommand{\tess}{\project{TESS}}
\newcommand{\plato}{\project{PLATO}}
\newcommand{\gaia}{\project{Gaia}}
\newcommand{\pdc}{\project{PDC}}
\newcommand{\bls}{\project{BLS}}
\newcommand{\emcee}{\project{emcee}}
\newcommand{\exosyspop}{\project{exosyspop}}
\newcommand{\foreign}[1]{\emph{#1}}
\newcommand{\etal}{\foreign{et\,al.}}
\newcommand{\etc}{\foreign{etc.}}
\newcommand{\True}{\foreign{True}}
\newcommand{\Truth}{\foreign{Truth}}
\newcommand{\dfmfigref}[1]{\ref{fig:#1}}
\newcommand{\dfmFig}[1]{Figure~\dfmfigref{#1}}
\newcommand{\dfmfig}[1]{\dfmFig{#1}}
\newcommand{\dfmfiglabel}[1]{\label{fig:#1}}
% \newcommand{\Tab}[1]{Table~\ref{tab:#1}}
% \newcommand{\tab}[1]{\Tab{#1}}
\newcommand{\tablabel}[1]{\label{tab:#1}}
\renewcommand{\eqref}[1]{\ref{eq:#1}}
\newcommand{\Eq}[1]{Equation~(\eqref{#1})}
\newcommand{\eq}[1]{\Eq{#1}}
\newcommand{\eqalt}[1]{Equation~\eqref{#1}}
\newcommand{\eqlabel}[1]{\label{eq:#1}}
\newcommand{\sectionname}{Section}
\newcommand{\sectref}[1]{\ref{sect:#1}}
\newcommand{\Sect}[1]{\sectionname~\sectref{#1}}
\newcommand{\sect}[1]{\Sect{#1}}
\newcommand{\sectalt}[1]{\sectref{#1}}
\newcommand{\App}[1]{Appendix~\sectref{#1}}
\newcommand{\app}[1]{\App{#1}}
\newcommand{\sectlabel}[1]{\label{sect:#1}}
\newcommand{\T}{\ensuremath{\mathrm{T}}}
\newcommand{\dd}{\ensuremath{\,\mathrm{d}}}
\newcommand{\unit}[1]{{\ensuremath{\,\mathrm{#1}}}}
\newcommand{\bvec}[1]{{\ensuremath{\boldsymbol{#1}}}}
\newcommand{\appropto}{\mathrel{\vcenter{
\offinterlineskip\halign{\hfil$##$\cr
\propto\cr\noalign{\kern2pt}\sim\cr\noalign{\kern-2pt}}}}}
\newcommand{\densityunit}{{\ensuremath{\mathrm{nat}^{-2}}}}
% TO DOS
\newcommand{\todo}[3]{{\color{#2}\emph{#1}: #3}}
\newcommand{\dfmtodo}[1]{\todo{DFM}{red}{#1}}
\newcommand{\tdmtodo}[1]{\todo{TDM}{blue}{#1}}
\newcommand{\hoggtodo}[1]{\todo{HOGG}{blue}{#1}}
% Response
\newcommand{\response}[1]{{#1}}
% \newcommand{\response}[1]{{\color{blue}#1}}
% Helpers for this paper:
\newcommand{\license}{MIT License}
\newcommand{\paper}{paper}
% Notation for this paper:
\newcommand{\meanpars}{{\ensuremath{\bvec{\theta}}}}
\newcommand{\kernpars}{{\ensuremath{\bvec{\alpha}}}}
\newcommand{\params}{{\ensuremath{\bvec{w}}}}
\newcommand{\poppars}{{\ensuremath{\bvec{\beta}}}}
\newcommand{\rate}{{\ensuremath{\Gamma}}}
\newcommand{\modelname}[1]{{\textsf{#1}}}
\newcommand{\datareleaseurl}{\url{http://dx.doi.org/10.5281/zenodo.58273}}
\shorttitle{The population of long-period transiting exoplanets}
\shortauthors{Foreman-Mackey, Morton, Hogg, \etal}
% \submitted{Submitted to \textit{The Astrophysical Journal}}
\begin{document}
\title{%
\vspace{\baselineskip}
The population of long-period transiting exoplanets
\vspace{-2\baselineskip} % OMG AASTEX6 IS SO BROKEN
}
\newcounter{affilcounter}
\altaffiltext{1}{\textsf{danfm@uw.edu}; Sagan Fellow}
\setcounter{affilcounter}{2}
\edef \uw {\arabic{affilcounter}}\stepcounter{affilcounter}
\altaffiltext{\uw} {Astronomy Department, University of Washington,
Seattle, WA, 98195, USA}
\edef \princeton {\arabic{affilcounter}}\stepcounter{affilcounter}
\altaffiltext{\princeton}{Department of Astrophysics, Princeton University,
Princeton, NJ, 08544, USA}
\edef \scda {\arabic{affilcounter}}\stepcounter{affilcounter}
\altaffiltext{\scda} {Simons Center for Data Analysis, 160 Fifth Avenue,
7th floor, New York, NY 10010, USA}
\edef \nyu {\arabic{affilcounter}}\stepcounter{affilcounter}
\altaffiltext{\nyu} {Center for Cosmology and Particle Physics,
New York University,
4 Washington Place, New York, NY, 10003, USA}
\edef \mpia {\arabic{affilcounter}}\stepcounter{affilcounter}
\altaffiltext{\mpia} {Max-Planck-Institut f\"ur Astronomie,
K\"onigstuhl 17, D-69117 Heidelberg, Germany}
\edef \cds {\arabic{affilcounter}}\stepcounter{affilcounter}
\altaffiltext{\cds} {Center for Data Science, New York University,
726 Broadway, 7th Floor, New York, NY, 10003, USA}
\edef \mpis {\arabic{affilcounter}}\stepcounter{affilcounter}
\altaffiltext{\mpis} {Max Planck Institute for Intelligent Systems
Spemannstrasse 38, 72076 T\"ubingen, Germany}
\author{%
Daniel~Foreman-Mackey\altaffilmark{1,\uw},
Timothy~D.~Morton\altaffilmark{\princeton},
David~W.~Hogg\altaffilmark{\scda,\nyu,\mpia,\cds},
Eric~Agol\altaffilmark{\uw}, and
Bernhard~Sch\"olkopf\altaffilmark{\mpis}
\vspace{\baselineskip}
}
\begin{abstract}
The \kepler\ Mission has discovered thousands of exoplanets and revolutionized
our understanding of their population.
This large, homogeneous catalog of discoveries has enabled rigorous studies of
the occurrence rate of exoplanets and planetary systems as a function of their
physical properties.
However, transit surveys like \kepler\ are most sensitive to planets with
orbital periods much shorter than the orbital periods of Jupiter and Saturn,
the most massive planets in our Solar System.
To address this deficiency, we perform a fully automated search for
long-period exoplanets with only one or two transits in the archival \kepler\
light curves.
When applied to the $\sim 40,000$ brightest Sun-like target stars, this search
produces \numcands\ long-period exoplanet candidates.
Of these candidates, 6 are novel discoveries and 5 are in systems with inner
short-period transiting planets.
Since our method involves no human intervention, we empirically characterize
the detection efficiency of our search.
Based on these results, we measure the average occurrence rate of exoplanets
smaller than Jupiter with orbital periods in the range 2--25~years to be
$2.0\pm0.7$ planets per Sun-like star.
\end{abstract}
\keywords{%
methods: data analysis
---
methods: statistical
---
catalogs
---
planetary systems
---
stars: statistics
}
\clearpage
\section{Introduction}
Data from the \kepler\ Mission \citep{Borucki:2011} have been used to discover
thousands of transiting exoplanets.
The systematic nature of these discoveries and careful quantification of
survey selection effects, search completeness, and catalog reliability has
enabled many diverse studies of the detailed frequency and distribution of
exoplanets \citep[for example,][]{Howard:2012, Petigura:2013,
Foreman-Mackey:2014, Dressing:2015, Burke:2015, Mulders:2015}.
So far, these results have been limited to relatively short orbital periods
because existing transit search methods impose the requirement of the detection of at least
three transits within the baseline of the data.
For \kepler, with a baseline of about four years, this sets an absolute upper
limit of about two years on the range of detectable periods.
In the Solar System, Jupiter~--~with a period of 12 years~--~dominates the
planetary dynamics and, since it would only exhibit at most one transit in the
\kepler\ data, an exo-Jupiter would be missed by most existing transit search
procedures.
Before the launch of the \kepler\ Mission, it was predicted that the nominal
mission would discover at least 10 exoplanets with only one or two observed
transits \citep{Yee:2008}, yet subsequent searches for these signals have
already been more fruitful than expected \citep{Wang:2015, Uehara:2016}.
However, the systematic study of the population of long-period exoplanets
found in the \kepler\ data to date has been hampered due to the
substantial technical challenge of implementing a search, as well as the
subtleties involved in interpreting the results. For example,
false alarms in the form of uncorrected systematics in the data and background
eclipsing binaries can make single-transit detections ambiguous.
Any single transit events discovered in the \kepler\ light curves are
interesting in their own right, but the development of a general and
systematic method for the discovery of planets with orbital periods longer
than the survey baseline is also crucial for the future of exoplanet research
with the transit method.
All future transit surveys have shorter observational baselines than the
\kepler\ Mission (\KT, \citealt{Howell:2014}; \tess, \citealt{Ricker:2015};
\plato, \citealt{Rauer:2014}) and given suitable techniques, single transit
events will be plentiful and easily discovered.
The methodological framework presented in the following pages is a candidate
for this task.
A study of the population of long-period transiting planets complements other
planet detection and characterization techniques, such as radial velocity
\citep[for example][]{Cumming:2008, Knutson:2014, Bryan:2016}, microlensing
\citep[for example][]{Gould:2010, Cassan:2012, Clanton:2014,
Shvartzvald:2016}, direct imaging \citep[for example][]{Bowler:2016}, and
transmission spectroscopy \citep[for example][]{Dalba:2015}.
The marriage of the radial velocity and transit techniques is particularly
powerful as exoplanets with both mass and radius measurements can be used to
study planetary compositions and the formation of planetary systems \citep[for
example][]{Weiss:2014, Rogers:2015, Wolfgang:2016}.
Unfortunately the existing catalog of exoplanets with measured densities is sparsely
populated at long orbital periods; this makes discoveries with the transit method
at long orbital period compelling targets for follow-up observations.
Furthermore, even at long orbital periods, the \kepler\ light curves should be
sensitive to planets at the detection limits of the current state-of-the-art
radial velocity surveys.
There are two main technical barriers to a systematic search for single
transit events.
The first is that the transit probability for long-period planets is very low;
scaling as $\propto P^{-5/3}$ for orbital periods $P$ longer than the
baseline of contiguous observations.
Therefore, even if long-period planets are intrinsically common, they will
be underrepresented in a transiting sample.
The second challenge is that there are many signals in the observed light
curves caused by stochastic processes~--~both instrumental and
astrophysical~--~that can masquerade as transits.
Even when the most sophisticated methods for removing this variability are
used, false signals far outnumber the true transit events in any traditional
search.
At the heart of all periodic transit search procedures is a filtering step
based on ``box least squares'' \citep[\bls;][]{Kovacs:2002}.
This step produces a list of candidate transit times that is then vetted to
remove the substantial fraction of false signals using some combination of
automated heuristics and expert curation.
In practice, the fraction of false signals can be substantially reduced by
requiring that at least three self-consistent transits be observed
\citep{Petigura:2013, Burke:2014, Rowe:2015, Coughlin:2016}.
Relaxing the requirement of three transits requires a higher signal-to-noise
threshold per transit for validating candidate planets that display only one
to two transits.
Higher signal-to-noise allows matching the candidate transit to the expected
shape of a limb-darkened light curve, as well as ruling out various false
alarms. This is analagous to microlensing surveys, for which a planet can only be
detected once, thus requiring high signal-to-noise for a reliable detection
\citep{Gould:2004}.
Recent work has yielded discoveries of long-period transiting planets with
only one or two high signal-to-noise transits identified in archival \kepler\
and \KT\ light curves by visual inspection \citep{Wang:2013, Kipping:2014a,
Wang:2015, Osborn:2016, Kipping:2016, Uehara:2016}.
These discoveries have already yielded some tantalizing insight into the
population of long-period transiting planets but, since these previous results
rely on human interaction, it is prohibitively expensive to reliably measure
the completeness of these catalogs.
As a result, the existing catalogs of long-period transiting planets cannot be
used to rigorously constrain the occurrence rate of long-period planets.
In this \paper, we develop a systematic method for reliably discovering the
transits of large, long-period companions in photometric time series
\emph{without human intervention}.
The method is similar in character to the recently published fully automated
method used to generate the official DR24 exoplanet candidate catalog from
\kepler\ \citep{Mullally:2016, Coughlin:2016}.
Since the search methodology is fully automated, we can robustly measure the
search completeness~--~using injection and recovery tests~--~and use these
products to place probabilistic constraints on the occurrence rate of
long-period planets.
We apply this method to a subset of the archival data from the \kepler\
Mission, present a catalog of exoplanet candidates, and estimate the
occurrence rate of long-period exoplanets.
We finish by discussing the potential effects of false positives, evaluating the
prospects for follow-up, and comparing our results to other studies
based on different planet discovery methods.
\section{A fully automated search method}\sectlabel{search}
To find long-period exoplanets in the \kepler\ light curves, we search for
individual, high signal-to-noise transit signals using a fully automated
procedure that can be broken into three main steps:
\begin{enumerate}
{\item an initial candidate search using a box-shaped matched filter,}
{\item light curve-level vetting (using automated model comparison) to remove
signals that don't have a convincing transit shape, and}
{\item pixel-level vetting to remove some astrophysical false positives.}
\end{enumerate}
The following sections describe each of these steps in more detail.
The model comparison step (step 2) is the key component of our method that
enables robust automation but it is also computationally expensive because we
must estimate the marginalized likelihoods of several different models
describing a transit and other processes that ``look'' like transits but are
actually caused by noise.
This step is conservative: unless a signal is a very convincing transit, it
won't pass the test.
In practice, this means that all but the highest signal-to-noise events will
be rejected at this step.
Therefore, in the inexpensive first step~--~the initial candidate search~--~we
can restrict the candidate list to high signal-to-noise events without a
substantial loss in detection efficiency.
\subsection{Step 1 -- Initial candidate events}\sectlabel{stepone}
It is not computationally feasible to run a full model comparison at every
conceivable transit time in the light curve so we must first find potentially
interesting events.
For our purposes, ``interesting'' means high signal-to-noise and previously
unknown.
To generate this list, we use a method much like the standard ``box least
squares'' \citep[\bls;][]{Kovacs:2002} procedure with a single (non-periodic)
box.
After masking any known transits, we filter the \pdc\ light curves
\citep{Stumpe:2012, Smith:2012} using a running windowed median with a
half-width of 2~days to remove stellar variability.
We then compute the signal-to-noise of the depth of a 0.6~day-long top hat on
a grid of times spanning the full baseline of observations.
In detail, at each proposal time $t_0$, we hypothesize a box-shaped transit
with duration $\tau$
\begin{eqnarray}
m(t) &=& \left\{\begin{array}{ll}
\mu - \delta, & \mbox{\,if $|t - t_0| < \tau/2$} \\
\mu, & \mbox{\,otherwise}
\end{array}\right. \quad.
\end{eqnarray}
Assuming that the uncertainties on the observed fluxes $f(t)$ are Gaussian
with known variance ${\sigma_f}^2$, the likelihood function for the mean flux
$\mu$ and transit depth $\delta$ can be analytically computed to be a
two-dimensional Gaussian with mean and covariance given by linear
least-squares.
This likelihood function provides a natural scalar objective: the
signal-to-noise of the measured depth computed as a function of time.
In principle this scalar is also a function of duration but we only use a
single transit duration because the following steps in this procedure are only
sensitive to transits with very high signal-to-noise, and in practice, the
final results are insensitive to the specific choice of duration.
To avoid edge effects, we apodize this detection scalar near any large gaps in
the time series using a logistic function with width equal to one transit
duration.
Finally, we estimate the background noise level in the detection scalar time
series using a robust running windowed variance estimate of the detection
scalar. We accept peaks that are more than 25-times this background noise
level as candidates.
For the \kepler\ light curves, this procedure yields at least one candidate
event in about 1~percent of targets. For these targets, we investigate
the three highest signal-to-noise events in the following step.
\subsection{Step 2 -- Light curve vetting}\sectlabel{light-curve-vetting}
In this step of the method, the goal is to discard any signals that are not
sufficiently ``transit-like'' in shape.
This step is similar to the method independently developed and recently
published by the \kepler\ team \citep{Mullally:2016}.
To quantify the quality of a candidate, we perform a model comparison between
a physical transit model and a set of other parameterized models for
systematics.
In order for a candidate to pass this vetting step, the transit model must be
``preferred'' to any other model as measured using the Bayesian Information
Criterion (BIC).
The BIC is not the optimal choice for this model comparison, but it is more
computationally tractable than the alternatives, such as computing thousands
of precise marginalized likelihoods or expected utilities for each model.
The BIC can be efficiently computed and it exhibits the desired
behavior~--~decreasing with increasing likelihood but flexible models are
penalized~--~and we find that it performs sufficiently well in practice.
For up to three candidate transit times per light curve, we select a
contiguous chunk of \pdc\ light curve approximately centered on the proposed
transit with no more than 500 cadences (about 10 days) and compute the BIC of
each model for this data set.
The BIC for a model $k$ in the set of $K$ models is given by
\begin{eqnarray}
\mathrm{BIC}_k &=& -2\,\ln \mathcal{L}^* + J\,\ln N
\end{eqnarray}
where the likelihood function $\mathcal{L}$ is evaluated at its maximum, $J$
is the number of free parameters in the model, and $N$ is the number of
data points in the data set.
For each model, we describe the data using a Gaussian Process
\citep[GP;][]{Rasmussen:2006} with a Mat\'ern-3/2 covariance and mean given by
the chosen model $m_k(t;\,\meanpars)$ parameterized by the parameter vector
\meanpars.
We consider the following mean models (this list provides a qualitative
justification for each model):
\begin{itemize}
{\item \modelname{transit} -- a limb-darkened, exposure-time integrated
transit light curve,}
{\item \modelname{variability} -- a pure GP model to capture
stellar variability,}
{\item \modelname{outlier} -- a single outlier to account for a bad data
point,}
{\item \modelname{step} -- a step function to describe sudden pixel
sensitivity dropouts \citep[SPSDs; for example][]{Christiansen:2013}, and}
{\item \modelname{box} -- a box to catch signals that are well fit by the
search scalar but insufficiently transit-like to be convincing.}
\end{itemize}
The functional forms of these models are given in \app{model-details} and the
details of the technical methodology of GP fitting are described in
\app{gp-regression}.
\dfmfig{model-comp} shows representative events that fall into different
classes and the corresponding maximum likelihood model.
For each candidate event, the BIC of each of these models is computed and the
event is only passed as a candidate if the \modelname{transit} model is
preferred to all the other models.
The \modelname{box} model is the most restrictive comparison, vetoing about
half of the candidate events in the \kepler\ light curves, followed by the
\modelname{variability} model.
To further restrict to non-grazing transits, we also reject events where the
maximum likelihood impact parameter is greater than $1 - R_\mathrm{P} /
R_\star$.
Since the search procedure described here was tuned to discover transit
signals, we do not consider the distribution or potential astrophysical nature
of any models besides the \modelname{transit} model.
In the future, it would be interesting to relax this goal and investigate the
other model classes; in particular, the \modelname{box} model is
sensitive to astrophysical phenomena, notably occultations of white dwarfs.
In a cursory investigation it is clear that the majority of signals labeled
\modelname{box} in our analysis are noise; however, a subset are likely to be
astrophysical in nature.
The reliability of this method of automated vetting is limited by the specific
models selected in this step.
We find that these are sufficient for the targets discussed below but
different target lists or data sets might require additional models to be
included for robust selection.
\begin{figure*}[p]~\\
\begin{center}
\includegraphics[width=\textwidth]{figures/model_comp.pdf}
\end{center}
\caption{%
Representative examples of candidate events flagged by the initial search.
Each example falls into a different model category and the figure shows the
data as black points and the best fit mean model prediction.
The examples represent the following model categories:
\emph{(a)} step \emph{(b)} variability, \emph{(c)} box, and \emph{(d)}
transit.
\dfmfiglabel{model-comp}}
\end{figure*}
\subsection{Step 3 -- Pixel-level vetting}
To minimize contamination from background eclipsing binary systems, we require
candidate events to pass a centroid shift test similar to the one used in the
official \kepler\ transit search pipeline \citep{Bryson:2013}.
To measure the centroid shift, we model the flux-weighted centroid traces
independently in each coordinate as a multiple of the best-fit transit model
and a GP noise model.
By properly normalizing the transit model, we measure the in-transit centroid
shift $\Delta_\mathrm{centroid}$ in pixels.
We reject any candidate event where the estimated transit location is more
than half a pixel from the out-of-transit centroid
\begin{eqnarray}
\Delta_\mathrm{centroid}\,\left(\frac{1}{\delta} - 1\right) &>& 0.5
\end{eqnarray}
where $\delta$ is the observed transit depth \citep{Bryson:2013}.
\section{Results: a catalog of long-period transiting exoplanet candidates}
To limit the scope of this paper while still demonstrating the applicability
of our method, we search the \kepler\ archival light curves of the brightest
and quietest Sun-like stars for long-period transiting exoplanets.
In this section, we describe the target selection process and the parameter
estimation procedure.
\subsection{Target selection}\sectlabel{target-selection}
We select the $\sim40,000$ brightest and quietest G and K dwarfs from the
\kepler\ catalog using the most recent catalog of stellar parameters%
\footnote{Parameters from the \textsf{q1\_q17\_dr24\_stellar} table from the
NASA Exoplanet Archive \citep[][with updates]{Huber:2014}.} and the cuts used
by \citet{Burke:2015}:
\begin{itemize}
{\item $4200\unit{K} \le T_\mathrm{eff} \le 6100\unit{K}$,}
{\item $R_\star \le 1.15\,R_\odot$,}
{\item $\mathrm{data\,span} \ge 2\,\mathrm{years}$,}
{\item $\mathrm{duty\,cycle} \ge 0.6$,}
{\item $K_p \le 15\unit{mag}$, and}
{\item $\mathrm{CDPP}_{7.5\unit{hrs}} \le 1000\unit{ppm}$.}
\end{itemize}
We continue by excluding the light curves of known eclipsing
binaries\footnote{\url{http://keplerebs.villanova.edu/}} \citep{Kirk:2016},
other known false positives \citep{Coughlin:2016}, a planet with known transit
timing variations (Kepler-9), and four especially noisy stars (KIC~4482348,
KIC~4450472, KIC~5438845, and KIC~10068041).
The final catalog contains \numtargets\ targets and the parameter distribution
is shown in \dfmfig{targets}.
Since these data have already been searched for short-period planets, we
assume that all high signal-to-noise candidates with three or more transits
have been previously found \citep{Coughlin:2016}.
To remove these candidates from consideration, we mask the cadences within two
transit durations of the time when a short-period planet candidate is known to
transit\footnote{We specifically use the \textsf{q1\_q17\_dr24\_koi} from the
NASA Exoplanet Archive \url{http://exoplanetarchive.ipac.caltech.edu/}.}.
\begin{figure}~\\
\begin{center}
\includegraphics{figures/targets.pdf}
\end{center}
\caption{%
The distribution of stellar parameters for \kepler\ targets selected for this
search (orange) compared to the distribution of the full \kepler\ target
catalog (black).
\dfmfiglabel{targets}}~\\
\end{figure}
\subsection{Parameter estimation}
For each transit candidate, we constrain the physical parameters of the system
by fitting a section of light curve around each transit using an exposure-time
integrated Keplerian orbit with a quadratic limb darkening law for the
central body \cite{Foreman-Mackey:2016a}.
It has previously been established that the orbital period of a transiting
planet with only one transit can still be constrained given a measurement of
the stellar density and an assumption about the orbital eccentricity \cite[for
example][]{Wang:2015, Osborn:2016}.
Qualitatively this works because the transit of a bound body cannot have an
arbitrary period for a given duration.
This is the same argument used to justify the ``photoeccentric effect''
\citep{Dawson:2012} and the method of ``asterodensity profiling''
\citep{Kipping:2014b}.
In particular, this suggests that the periods of single transits in systems
with multiple inner planets will be especially well constrained
\citep{Kipping:2012}.
In this \paper, we do not take advantage of the extra constraints provided by
the inner planets, instead treating each long-period transiting system in
isolation, but this would be a good follow-up project.
In the following paragraphs, we describe the components of the probabilistic
model used to infer the planet candidates' properties.
To perform parameter estimation under this model, we use the Markov Chain
Monte Carlo (MCMC) package \emcee\footnote{\url{http://dfm.io/emcee}}
\citep{Foreman-Mackey:2013} with an ensemble of 40 walkers.
We run each chain until at least 750 independent samples~--~in most cases, we
actually produce thousands of independent samples~--~are obtained\footnote{The
integrated autocorrelation time is estimated using a robust iterative method
as suggested by Alan Sokal:
\url{http://www.stat.unc.edu/faculty/cji/Sokal.pdf}.} and discard the first
third of the chain as burn-in.
The posterior constraints on a few physical parameters for the single transit
candidate in the light curve of KIC~8505215 are shown in \dfmfig{corner} and
all the chains are made available online\footnote{\datareleaseurl}.
\begin{figure}~\\
\begin{center}
\includegraphics[width=\textwidth]{figures/params/8505215.pdf}
\end{center}
\caption{%
The posterior constraints on the physical parameters for the single transit
candidate found in the light curve of KIC~8505215.
The contour plots show estimates of the two-dimensional marginalized
probability densities and the histograms show the marginalized density for
each parameter.
This figure was generated using \project{corner.py}
\citep{Foreman-Mackey:2016}.
\dfmfiglabel{corner}}~\\
\end{figure}
\paragraph{Priors}
For each candidate in our sample, we take the constraints on the stellar
parameters from the \kepler\ DR24 stellar properties catalog and assume an
empirical beta function prior on the eccentricities based on the observed
eccentricity distribution of long-period planets discovered using radial
velocities \citep{Kipping:2013}.
Table~\ref{tab:parameters} lists all the fit parameters and their prior
distributions.
Besides these listed priors, we add the extra constraint that no other
transits can occur in the baseline of the \kepler\ observations.
This constraint is overly conservative because there is some probability that
a second transit could occur in a data gap but we find that, in practice, most
of the posterior mass is at longer periods and the period inferences are not
significantly affected.
\paragraph{Likelihood function}
As above, we model the light curve as a Gaussian Process (GP) with a physical
transit model as the mean, and a covariance matrix described by a Mat\'ern-3/2
kernel function.
The full likelihood function and some details of GP regression are given in
\app{gp-regression}.
For computational efficiency, we first perform a joint optimization of the
physical parameters and GP hyperparameters to find the maximum \foreign{a
posteriori} model then keep the hyperparameters fixed and run MCMC sampling
for the 11 physical parameters alone.
\begin{floattable}
\begin{deluxetable}{lccc}
\tabletypesize{\footnotesize}
\caption{The inferred parameters and priors used in the inference
\label{tab:parameters}}
\tablehead{%
\colhead{name} & \colhead{symbol} & \colhead{units} &
\colhead{prior}
}
\startdata
mean flux & $\log f_\star$ & \nodata &
$\log f_\star \sim \mathcal{U}(-1,\,1)$ \\
stellar mass\tablenotemark{a} & $M_\star$ & $M_\odot$ &
$M_\star \sim \mathcal{N}(M_{\star,\mathrm{cat}},\,
\sigma_{M,\star,\mathrm{cat}})$ \\
stellar radius\tablenotemark{a} & $R_\star$ & $R_\odot$ &
$R_\star \sim \mathcal{N}(R_{\star,\mathrm{cat}},\,
\sigma_{R,\star,\mathrm{cat}})$ \\
\multirow{2}{*}{limb darkening} & $q_1$ & \nodata &
$q_1 \sim \mathcal{U}(0,\,1)$ \\
& $q_2$ & \nodata & $q_2 \sim \mathcal{U}(0,\,1)$ \\
\hline
planet radius & $\log R_\mathrm{P}$ & $R_\odot$ &
$\log R_\mathrm{P} \sim \mathcal{U}(-10,\,2)$ \\
reference time & $t_0$ & days &
$t_0 \sim \mathcal{U}(t_\mathrm{cand}-0.5,\,t_\mathrm{cand}+0.5)$%
\tablenotemark{b} \\
\multirow{2}{*}{\begin{minipage}{1in}semi-major axis \ \& inclination\end{minipage}}
& $\sqrt{a}\sin i$ & ${R_\odot}^{1/2}$ &
$\sqrt{a}\sin i \sim \mathcal{U}(-10^3,\,10^3) / \sqrt{a}$ \\
& $\sqrt{a}\cos i$ & ${R_\odot}^{1/2}$ &
$\sqrt{a}\cos i \sim \mathcal{U}(0,\,10^3) / \sqrt{a}$ \\
\multirow{2}{*}{eccentricity}
& $\sqrt{e}\sin \omega$ & \nodata & $e \sim \beta(1.12,\,3.09)$%
\tablenotemark{c} \\
& $\sqrt{e}\cos \omega$ & \nodata & $\omega \sim \mathcal{U}(-\pi,\,\pi)$\\
\enddata
\tablenotetext{a}{Stellar parameters and uncertainties taken from the \kepler\
catalog \citep{Huber:2014}}
\tablenotetext{b}{The reference time is constrained to be within half a day of
the candidate transit time}
\tablenotetext{c}{\citet{Kipping:2013a}}
\tablecomments{There is one further constraint that complicates these priors:
the period of the orbit must be longer than some minimum period
$P_\mathrm{min}$ set by the transit time and the full baseline of \kepler\
observations.}
\end{deluxetable}
\end{floattable}
\section{Catalog of transit candidates}\sectlabel{catalog}
Applying the search procedure described in \sect{search} to the \kepler\ light
curves of the \numtargets\ targets selected in \sect{target-selection}, we
find \numcands\ convincing transit candidates.
Visual inspection of each candidate confirms the reliability of the
classification and no candidates are manually removed from the catalog.
Of these, three candidates have two transits in the \kepler\ baseline and the
remainder have only one observable transit.
The candidates and their inferred physical parameters are listed in
Table~\ref{tab:catalog} and the light curves are plotted in
\dfmfig{light-curves}.
The inferred radius and orbital periods of the candidates are compared to the
short-period \kepler\ sample and the Solar System in \dfmfig{full-sample}.
Two of the shortest period candidates~--~both with two observed
transits~--~have previously been studied in detail \citep[KIC~8800954 and
KIC~3239945;][]{Kipping:2014a, Kipping:2016}.
Table~\ref{tab:catalog} indicates the candidates that were also discovered by
earlier searches for long-period transiting systems using visual inspection
\citep{Wang:2015, Uehara:2016}.
The consistency between our results and the earlier catalogs is reassuring.
In the light curves of targets with previously known short-period planets, our
automated search did not find any candidates that weren't previously detected
by visual inspection \citep{Uehara:2016} and one candidate (KIC~3230491)
reported by the human analysis was discarded as grazing by our search.
The Planet Hunters citizen science project \citep{Fischer:2012} reported five
long-period candidates with one or two observed transits in our target list
\citep{Wang:2015}.
Of these, we also find two (KIC~8410697 and KIC~10842718) although we find a
second transit in the KIC~8410697 system that was previously missed.
We do not recover the three other candidates reported by \citet{Wang:2015}:
KIC~5536555, KIC~9662267, and KIC~12454613.
The transits of these candidates are all low signal-to-noise and they do not
pass our initial signal-to-noise threshold.
Six of the candidates in Table~\ref{tab:catalog} have not been previously
published.
Of the \numcands\ candidates, \numinner\ have known inner planets with three
or more observable transits \citep{Coughlin:2016}.
Given the fact that only \numkois\ of the \numtargets\ targets had previously
known planets, this means that systems with short-period transiting planets
are nearly a factor of 20 more likely to host long-period planets accessible
by our method than systems with no known inner transiting planets.
This difference cannot be accounted for by differences in completeness between
targets with known planets and without because the average detection
efficiency for both populations is consistent within sampling uncertainty.
Qualitatively, this suggests that these long-period planets occur with a
higher frequency in multi-planet systems or are preferentially aligned with
the plane of any inner planets but a more detailed analysis would be needed to
make a quantitative statement \citep[see, for example,][]{Tremaine:2012,
Fang:2012, Ballard:2016, Moriarty:2015}.
The candidate in the light curve of KIC~4754460 is an individual transit
candidate but another deeper eclipse can be found at a \kepler\ Barycentric
Julian Date (KBJD) of 1587.13; right at the beginning of Quarter 17.
This eclipse was missed by the automated search because only the second half
of the eclipse is observed.
The most likely explanation of this system is that the listed candidate is the
secondary eclipse of a binary system but we will keep the candidate in the
list and treat this effect statistically in \sect{false-positives}.
Five candidate transit events in the light curves of four targets were
rejected because of a significant centroid shift or a large impact parameter.
These events are probably astrophysical eclipses from binary star systems that
were not found by previous studies of long-period eclipsing binary systems.
We do not consider these events further in the following analysis but
Table~\ref{tab:rejects} lists these events and their properties for posterity.
\begin{floattable}
\begin{deluxetable}{cccccccccccl}
\tabletypesize{\scriptsize}
\caption{The inferred parameters for the long-period transiting exoplanet
candidates \label{tab:catalog}}
\include{paramtab}
\tablenotetext{*}{The equilibrium temperature is computed assuming zero
albedo.}
\tablenotetext{\dagger}{The KOI number and, if applicable, the \kepler\ number
for the target.}
\tablenotetext{a}{Included in the \citet{Wang:2015} catalog.}
\tablenotetext{b}{Included in the \citet{Uehara:2016} catalog.}
\tablenotetext{c}{Candidate has two observed transits.}
\tablecomments{The values and uncertainties indicate the 16-th, 50-th, and
84-th percentiles of the posterior samples for each parameter.}
\end{deluxetable}
\end{floattable}
\begin{figure*}[p]~\\
\begin{center}
\include{lcfig}
\end{center}
\caption{%
Sections of \pdc\ light curve centered on each candidate (black) with the
posterior-median transit model over-plotted (orange).
The y-axis shows the relative apparent flux of the light curve in parts per
thousand (ppt).
Candidates with two transits are folded on the posterior-median period.
The plots are ordered by increasing planetary radius from the top-left to the
bottom-right.
\dfmfiglabel{light-curves}}
\end{figure*}
\begin{figure*}~\\
\begin{center}
\includegraphics{figures/full_sample_plus_cands.pdf}
\end{center}
\caption{%
The catalog of long-period transiting exoplanet candidates (green points with
error bars) compared to the \kepler\ candidates (blue points) and confirmed
planets \citep[black points;][]{Morton:2016} found in our target sample, and
the Solar System (orange squares).
The thin black error bars to the left of each candidate indicate the minimum
period allowed for each candidate by the prior assumption that no other
transit occurred during the baseline of \kepler\ observations of the target.
The vertical solid line shows the absolute maximum period accessible to
transit searches that require at least three transits in the \kepler\ data.
\dfmfiglabel{full-sample}}~\\
\end{figure*}
\begin{floattable}
\begin{deluxetable}{ccccl}
\tabletypesize{\footnotesize}
\caption{The signals rejected with a centroid shift or large impact parameter
\label{tab:rejects}}
\include{rejecttab}
\end{deluxetable}
\end{floattable}
\section{Empirical search completeness}\sectlabel{completeness}
To measure the completeness of the search procedure described in \sect{search}, we
exploit the fact that transit signals are sparse and rare.
Therefore, most light curves contain no transits and we can reliably measure
the recovery rate of our method on synthetic transit signals~--~with known
properties~--~injected into real light curves.
This procedure is standard practice in the transit literature and it has been
used to determine the completeness of the KOI catalog
\citep{Christiansen:2013, Christiansen:2015} and other independent transit
searches \citep{Petigura:2013, Dressing:2015, Foreman-Mackey:2015}.
To reliably capture the full structure of the search completeness function,
the simulations must sample the (high-dimensional) space of all properties
that affect the probability of detecting a transit: the stellar properties
(including variability amplitudes and time scales), the planet's physical
properties and orbital elements, and any observational effects (noise,
spacecraft pointing variations, \etc).
For the modest goals of this paper, we only need a robust constraint on the
transit detection efficiency \emph{integrated} across the target sample but,
even so, many simulations per star are required.
The procedure for measuring the recovery rate of simulated transits is as
follows:
\begin{enumerate}
{\item First, a star is randomly selected from the target list, and the \pdc\
light curve and stellar properties for that star are loaded.}
{\item Planetary properties are sampled from the distributions listed in
Table~\ref{tab:simulations} with phase uniformly distributed across the
baseline of observations. These properties are re-sampled until the transit is
visible in at least one non-flagged cadence.}
{\item The transit signal induced by this planet is computed and multiplied
into the \pdc\ light curve.}
{\item The transit search method described in \sect{search}~--~including
de-trending and \emph{all automated vetting}~--~is applied to this light
curve with the injected transit signal.}
{\item This candidate is flagged as recovered if at least one transit within
one transit duration passes all the cuts imposed by the automated vetting.}
\end{enumerate}
The fraction of recovered simulations as a function of the relevant parameters
gives an estimate of the probability of detecting an exoplanet transit with a
given set of parameters, \emph{conditioned on the fact that it transits the
star during a time when the star was being observed by Kepler}.
We will call this function $Q_{\mathrm{det},k}(\params)$ where \params\ is the
set of all parameters affecting the transit detectability and $k$ is an index
running over target stars.
\dfmfig{completeness} shows the fraction of recovered simulations as a
function of planet radius and orbital period based on \numinjs\ injected
signals.
This figure shows the transit detection efficiency falling with decreasing
planet radius.
This is the expected behavior because the depth (and signal strength) of a
transit scales with the area ratio between the planet and the star.
There is also a slight decrease in the completeness to larger planet radius.
This trend is introduced in steps~2 and~3 of the search procedure where the
tuning parameters were chosen to maximize the yield of convincing small
transit discoveries.
The decreasing completeness with orbital period is less intuitive because, on
average, the signal strength should increase as the duration of the transit
increases.
In this case, this simplistic treatment misses two important factors.
First, in step 1 of the search procedure (\sect{stepone}) only a single
transit duration is used and second, longer transits are less easily
distinguished from stellar variability and they will, therefore, be discarded
in the conservative light curve vetting step (\sect{light-curve-vetting}).
This detection efficiency must then be combined with the geometric transit
probability function and the window function.
For the star $k$, the geometric transit probability is given by
\citep{Winn:2010}
\begin{eqnarray}\eqlabel{geom-comp}
Q_{\mathrm{geom},k} (\params) &=& \frac{R_{\star,k} + R}{a_k}
\, \frac{1 + e\,\sin\omega}{1-e^2} \\
&=& \left[\frac{4\,\pi^2}{G\,M_{\star,k}}\right]^{1/3}
\, \left[\frac{1 + e\,\sin \omega}{1-e^2}\right]\,(R_{\star,k}+R)
\, P^{-2/3}
\end{eqnarray}
where $R$ is the planet radius, $P$ is the orbital period, $e$ is the orbital
eccentricity, $\omega$ is the argument of periastron, $R_{\star,k}$ is
the radius of star $k$, and $M_{\star,k}$ is the star's mass.
All of these parameters are included in \params.
\response{%
In \eq{geom-comp}, the term $(R_{\star,k}+R)$ takes grazing transits into
account.
This might seem counter intuitive because, as part of the search procedure, we
rejected candidates where the maximum likelihood model had a grazing transit.
However, since the measurement of $Q_{\mathrm{det},k}$ included a cut on the
measured impact parameter, the $Q_{\mathrm{det},k}$ term already takes this
effect into account.
In other words, $Q_{\mathrm{det},k}$ quantifies the probability that a transit
of a given shape will be detected \emph{given that it transits at all} and
$Q_{\mathrm{geom},k}$~--~the way it is written in \eq{geom-comp}~--~is the
marginalized probability that the system will transit given its physical
parameters.
}
Approximating the window function using a binomial probability of observing
at least one transit, we find \citep[following][]{Burke:2014a}
\begin{eqnarray}
Q_{\mathrm{win},k} (\params) &=& \left\{\begin{array}{ll}
1 - (1 - f_{\mathrm{duty},k})^{T_k/P} & \quad\mbox{if $P \le T_k$} \\
T_k\,f_{\mathrm{duty},k} / P & \quad\mbox{otherwise}
\end{array}\right.
\end{eqnarray}
where $f_{\mathrm{duty},k}$ is the duty cycle and $T_k$ is the full
observation baseline for target $k$.
Combining these detection effects, the total detection efficiency is given by
\begin{eqnarray}
Q_k(\params) &=& Q_{\mathrm{det},k}(\params) \,
Q_{\mathrm{win},k} (\params) \,
Q_{\mathrm{geom},k} (\params) \quad.
\end{eqnarray}
So that our planet candidate catalog can be easily used for other projects, we
also provide an analytic approximation to the relevant integrated detection
efficiency function
\begin{eqnarray}
Q_\mathrm{det}(P,\,R) &=& \sum_{k=1}^{K} \int Q_\mathrm{det,k}(\params)\,
p(\params_{\{P,\,R\}}) \dd\params_{\{P,\,R\}}
\end{eqnarray}
where $p(\params_{\{P,\,R\}})$ is the prior distribution of all the parameters
except the period and radius.
We find that a good fit to this integrated completeness is given by the
function
\begin{eqnarray}
Q_\mathrm{det}(P,\,R) &\approx&
\frac{\mathrm{min}[\mathrm{max}[a(P)\,b(R),\,0],\,1]}
{1+\exp\left[-k(P)\,(\ln R / R_\mathrm{J}-x(P))\right]}