NEW TECHNIQUES FOR HIGH-CONTRAST IMAGING WITH ADI: THE ACORNS-ADI SEEDS DATA REDUCTION PIPELINE*

Timothy D. Brandt; Michael W. McElwain; Edwin L. Turner; L. Abe; W. Brandner; J. Carson; S. Egner; M. Feldt; T. Golota; M. Goto; C. A. Grady; O. Guyon; J. Hashimoto; Y. Hayano; M. Hayashi; S. Hayashi; T. Henning; K. W. Hodapp; M. Ishii; M. Iye; M. Janson; R. Kandori; G. R. Knapp; T. Kudo; N. Kusakabe; M. Kuzuhara; J. Kwon; T. Matsuo; S. Miyama; J.-I. Morino; A. Moro-Martín; T. Nishimura; T.-S. Pyo; E. Serabyn; H. Suto; R. Suzuki; M. Takami; N. Takato; H. Terada; C. Thalmann; D. Tomono; M. Watanabe; J. P. Wisniewski; T. Yamada; H. Takami; T. Usuda; M. Tamura

doi:10.1088/0004-637X/764/2/183

1. INTRODUCTION

Since 1992, more than 700 confirmed exoplanets and 2000 additional candidates have been discovered.²⁰ Ground-based surveys have confirmed hundreds of exoplanets by measuring the periodic radial velocity shifts they induce in their host stars (e.g., Vogt et al. 2000; Queloz et al. 2000; Tinney et al. 2001; Mayor et al. 2003) or by measuring photometric variations as they transit their host stars (e.g., Alonso et al. 2004; Bakos et al. 2004; McCullough et al. 2005; Pollacco et al. 2006; Charbonneau et al. 2009). In space, NASA's Kepler satellite (Borucki et al. 2010) has identified more than 2000 candidate transiting exoplanets. These indirect methods are sensitive to short-period exoplanets: the magnitude of a radial velocity signal and the probability of a transit both decrease with separation. These methods also generally require observations over several orbital periods, making them impractical for detecting exoplanets with periods of more than a few years.

Direct imaging surveys, made possible by advances in adaptive optics (AO), infrared detectors, and image processing algorithms, are now complementing transit and radial velocity surveys, identifying giant exoplanets tens of astronomical units (AU) from their host stars. Ground-based high-contrast imaging surveys have shown that these giant exoplanets are rare (e.g., Biller et al. 2007; Lafrenière et al. 2007a), and are beginning to constrain models of exoplanet and exoplanetary system formation and evolution (Janson et al. 2012).

Large-scale direct-imaging surveys rely on sophisticated image processing to search for faint companions around bright stars. In addition to the usual bias, flat-field, and distortion corrections, these surveys must model and subtract the stellar point-spread function (PSF). Most surveys use Angular Differential Imaging (ADI) to make this task easier. As the Earth rotates, the orientation of the field of view (FOV) of an altitude-azimuth telescope (and thus of the PSF on the detector) changes relative to the celestial north. Features of the PSF due to the instrument, such as telescope spiders and the diffraction pattern, appear to rotate relative to any faint companion.

The first algorithms to take advantage of ADI used simple techniques to model the PSF, like taking the median of a sequence of exposures (Marois et al. 2006). More recently, algorithms like the Locally Optimized Combination of Images (LOCI; Lafrenière et al. 2007b) model the PSF locally, while principal component analysis (PCA) based techniques (Soummer et al. 2012; Amara & Quanz 2012) model it globally. These more sophisticated algorithms can offer a factor of ∼2 or more improvement in sensitivity over simple ADI reductions.

In this paper, we present Algorithms for Calibration, Optimized Registration, and Nulling the Star in Angular Differential Imaging (ACORNS-ADI), a software package to analyze ADI data for the SEEDS survey (see Tamura 2009), a five-year direct imaging survey using the HiCIAO instrument (Hayano et al. 2008) on the Subaru Telescope. We discuss each non-trivial step of the reduction process, from the bias and flat-field corrections to PSF modeling to the final sensitivity analysis. ACORNS-ADI is parallelized, open-source, and freely available for download at www.github.com/t-brandt/acorns-adi under a BSD license.

2. ADI DATA REDUCTION IN SEEDS

In order to take advantage of the field rotation in ADI, a series of short exposures is taken, with minimal field rotation during each individual exposure. The central star is usually allowed to saturate in order to increase the observing efficiency and limit the amount of read noise for a given number of companion photons. A typical high-contrast ADI data set thus consists of a series of short, sequential exposures with a saturated central star. The data reduction process searches for point sources in the image sequence. While it is also possible to analyze extended structures, like disks, using ADI reduction techniques (e.g., Liu 2004; Thalmann et al. 2011), it is far more difficult to interpret the final processed images (see Section 5.1). A detailed discussion of extended sources in ADI is beyond the scope of this paper.

A reduction of a high-contrast ADI image sequence proceeds in several steps.

1.
Correct for the bias, flat-field each image.
2.
Interpolate over hot pixels.
3.
Correct the image distortion (if necessary).
4.
Register the frames (if necessary).
5.
Model and subtract the PSF of the central star.
6.
Rotate each frame to align it to the celestial north.
7.
Combine the sequence of images.
8.
Search for point sources.
9.
Produce a sensitivity map.

Each step in the reduction impacts the sensitivity of the final combined image. Optimizing and characterizing this sensitivity is critical for understanding the incidence and properties of substellar companions.

Some of the steps listed above, such as the distortion correction (once the distortion map is known!) and the rotation to a common frame, are trivial. Others are surprisingly difficult, or can be optimized to give significant improvements in sensitivity. For example, a one pixel root-mean-square (rms) scatter in the image registration can degrade sensitivity by ∼20% (Section 5.2), as can the use of the median intensity, rather than a trimmed mean, to combine a sequence of images (Section 6.1).

Figure 1 shows a sample SEEDS data set through the above sequence of steps as processed by ACORNS-ADI. The first frame shows the central 5'' × 5'' of a 20'' × 20'' sample raw image, while the second frame shows the effect of correcting for the bias (Section 3.1), flat-fielding, and hot pixel masking (Section 3.2). The third frame shows the same image after correcting for field distortion (Section 3.3) and registering to the PSF centroid (Section 4). The first frame on the second line shows the residuals after subtracting the stellar PSF using the LOCI algorithm (Sections 5.2 and 5.3; Appendix A); the next frame shows the combined image from an ADI sequence (Section 6.1; Appendix B) convolved with a circular 0 farcs 05 aperture (Section 6.2) and normalized by the radial profile of its standard deviation. Finally, we show a radial profile of the data set's sensitivity to point sources (Section 6.3; Appendix A).

farcs — **Figure 1.** Step-by-step depiction of the ACORNS-ADI data reduction process, as discussed in Section 2. The first image shows the central 5'' × 5'' of a 20'' × 20'' frame of raw data. The second frame has been bias-corrected, flat-fielded, and had its hot pixels masked (Sections 3.1 and 3.2), while the third frame has been corrected for field distortion (Section 3.3) and registered to the PSF centroid (Section 4). The first frame on the second row shows the residuals in a single frame after applying the LOCI algorithm (Section 5.2; Appendix A), while the next frame shows the combined image from an ADI sequence (Section 6.1; Appendix B) convolved with a circular 005 aperture (Section 6.2) and normalized by the standard deviation at each radius. The final frame shows a radial profile, with shading to represent azimuthal scatter, of the final sensitivity map (Section 6.3; Appendix A).
Download figure:
Standard image High-resolution image

In the following sections, we describe each non-trivial step listed above. Some steps, like the subtraction of the stellar PSF and the combination of an image sequence, apply generally to data from any survey, while other steps, like the distortion correction and bias correction, have features specific to SEEDS data. Our discussions of two of these steps, the computation of a sensitivity map and the statistics of a combination of images, contain calculations that we relegate to appendices.

3. CALIBRATION

The first step in the data reduction is calibration: finding the count rate corresponding to zero intensity, flat-fielding, and applying a distortion correction. The zero point correction and flat-fielding are complex and interrelated for SEEDS data, and we handle them with a single routine. We then apply a distortion correction to the intensity-calibrated data. Because an ADI sequence consists of many short exposures, its sensitivity far from the central star can be limited by calibration uncertainties and read noise. Typical SEEDS observations are read noise limited at separations ≳2''.

The algorithms and discussions in this section are mostly specific to data from a 2048×2048 pixel Hawaii2-RG (H2RG) detector (Blank et al. 2011). These HgCdTe detectors are becoming more common on major instruments and telescopes. In addition to HiCIAO on Subaru, Calar Alto, Canada–France–Hawaii Telescope, Very Large Telescope, Infrared Telescope Facility, Keck, and SALT all use H2RG detectors. Future space-based missions such as JWST, JDEM, EUCLID, and Prime Focus Spectrograph and CHARIS, the next-generation spectrograph and camera for Subaru, will all contain H2RG or similar, but larger, 4096×4096 pixel H4RG arrays. To reduce data from another detector, the user would need to supply his or her own flat frame and bad pixel mask, and configure ACORNS-ADI to skip the steps listed below. This can be done during ACORNS' interactive configuration without any modification of the source code.

3.1. Removing the Bias

As configured for SEEDS, HiCIAO's H2RG detector reads out data in 32 channels at a pixel rate of 100 kHz with correlated double sampling (CDS). Reading out all 2048 × 2048 pixels thus takes about 1.3 s. H2RG detectors also feature a non-destructive read mode, so that up-the-ramp sampling could be implemented in the future on longer exposures. As discussed by Moseley et al. (2010), each of the 32 readout channels has its own reference voltage, which appears as a bias—a non-zero count level corresponding to zero intensity—in raw HiCIAO data. Superimposed on this is a time-varying reference voltage that is largely shared between the 32 channels. To calibrate the count level corresponding to zero intensity, we therefore need to fit for 32 stable reference voltages and at least one function. The H2RG detector provides four rows of pixels at each detector edge, for a total of 32,704 reference pixels (the "R" in H2RG). These pixels are not light-sensitive, but are subject to the same reference voltages as the rest of the array, and can be used to estimate the pixel-by-pixel bias. For some observations, SEEDS has taken data with the guide window capability (the "G" in H2RG), reading out only a subarray of the detector. In this mode, there is a single readout channel (and reference voltage) and there are no reference pixels.

For SEEDS data taken in the normal 32-channel readout mode, the 512 reference pixels at the ends of the channel are sufficient to determine the stable reference voltages. The H2RG read noise is very nearly Gaussian, so we use a sigma-reject technique to calculate their offset from zero. ACORNS-ADI iteratively rejects 3σ outliers, takes the mean of the remaining reference pixels, and subtracts this value from all of the pixels in each readout channel. This calibration is good to approximately the read noise over $\sqrt{512}$ (the number of reference pixels), or about 1 e⁻ per frame. The first row of Table 1 shows the read noise as measured in a series of 30 dark frames taken in 2010 December with no bias corrections. The difference between the root variance within a readout channel (27 e⁻) and over the entire array (52 e⁻) is almost completely removed by a mean subtraction using only the 512 reference pixels, as shown in the second row of Table 1.

Table 1. Average Residual Read Noise After Bias Correction in 30 Dark Frames

Zero Point Method	Average Read Noise (e⁻ pixel⁻¹)
Zero Point Method	Single Channel	Entire FOV
None	27.2	51.8
One voltage per channel	27.2	27.4
Reference pixels, ν < 300 Hz	26.1	26.2
All pixels, ν < 3 kHz	25.0	25.1
Half of the pixels, median	23.4	23.8
All pixels, median	22.3	22.5

Download table as: ASCII Typeset image

The time-varying reference voltage, largely shared among the 32 readout channels, is more difficult to subtract. Moseley et al. (2010) describe two techniques. One of these relies on reference pixels interspersed throughout the detector array, which would be difficult to implement in an imaging survey like SEEDS. The other technique saves the reference voltage in place of one of the 32 channels of science pixels. Appropriate weighting of the Fourier components of this reference voltage then provides a good estimate of the bias to be subtracted from each readout channel.

The H2RG detector has 1/f noise that extends from the frame rate, ∼1 Hz, to a knee at ∼3 kHz, where the noise becomes uncorrelated and the optimal weighting of the Fourier components falls to zero (Moseley et al. 2010). SEEDS does not currently save the reference voltage, which would require the sacrifice of 1/32 of the FOV; we have therefore estimated the possible improvement in read noise by applying a high-pass filter, removing all read noise with a frequency ≲ 3 kHz from each channel. Removing all of the low-frequency noise reduces the total read noise by about 10%, from 27 e⁻ to 25 e⁻ (fourth row of Table 1).

Without the reference voltage, ACORNS-ADI implements two techniques to estimate and subtract the reference voltage; the user must select which to use. The first technique uses the reference pixels at the edges of channels 1 and 32, which provide eight measurements of the reference voltage every 64 pixels. We first remove outliers with sigma-rejection, and then subtract the convolution of the time series of reference pixels with a masked Gaussian. The Gaussian is normalized to unit area, and is zero where no reference voltage is available. We choose its width to optimize the reference subtraction as follows.

While the H2RG detector has a 1/f noise that extends up to a knee at ∼3 kHz, sparse sampling (and read noise) limits our ability to measure the high frequency noise. As shown in Table 1, a perfect suppression of the noise up to ∼3 kHz would decrease the read noise by ∼10%, or the variance by just under 20%. However, a poor estimate of the reference voltage can increase the noise. We can then estimate our fractional suppression of the read noise as

$\begin{equation} \frac{0.2}{\ln 3000} \int _{1\,\rm Hz}^{\nu _{\rm max}} \frac{d\nu }{\nu } - \frac{1}{N} , \end{equation} \tag{ 1 }$

where N is the effective number of pixels used to estimate the reference voltage at each point, ν_max∝N⁻¹, and ln 3000 is the value of the integral with ν_max = 3 kHz. Maximizing Equation (1), we find that N ∼ 40. Since only one in eight pixels has a corresponding reference pixel, this is equivalent to an effective smoothing length of ∼300 science pixels. We therefore cannot suppress a noise of ≳300 Hz, and achieve a ∼5% reduction in read noise (third row of Table 1) rather than a ∼10% reduction (fourth row). Like Moseley et al. (2010), we optimally suppress low-frequency read noise by scaling the reference signal; we find a best-fit scaling of 0.87.

ACORNS-ADI also implements a technique to remove correlated read noise using some or all of the science pixels. To give an accurate estimate of the bias, these pixels must be uniformly illuminated. SEEDS generally observes a saturated central source whose seeing halo extends out to several hundred pixels in radius (several arcseconds); pixels beyond this halo can be used to better estimate the high frequency components of the reference voltage. ACORNS-ADI estimates the (uniform) illumination using the difference between the reference and science pixels, and takes the median of each set of up to 32 simultaneous readouts. Using all of the science pixels, this procedure reduces the read noise by ∼10% over the perfect subtraction of all of the read noise of frequency ≲3 kHz (last line of Table 1), much more than the ∼2% expected from self-subtraction. The result is nearly as good when using only half of the science pixels, those, at least, 800 pixels from the center (fifth line of Table 1). This level, ∼23 e⁻, indicates the read noise that is not shared between readout channels. We recommend using unilluminated science pixels for read noise suppression if they are available. We do not recommend a bias correction using only reference pixels unless the field is crowded.

3.2. Flat-fielding and Hot Pixel Masking

For a typical exposure of ∼1–10 s, the dark current is ∼0.05–0.5 e⁻, a factor of ∼50–500 lower than the read noise. Pixel-to-pixel variations in the dark current will be still lower. A perfect suppression of the dark current would reduce the read noise by ≲0.01%; we therefore do not attempt to correct for it, but treat the dark current as part of the background.

HiCIAO shares the problem of "hot," indium-contaminated pixels with other H2RG detectors. As a result, ∼2% of HiCIAO's pixels are unusable. These hot pixels are stable from night-to-night, though because the detector slowly degrades with time, new hot pixel maps must be made periodically. ACORNS-ADI corrects these pixels by taking the median of all uncontaminated pixels in a 5 × 5 box centered on the hot pixel. Because the data are oversampled, with a PSF core that is typically ∼6 pixels in diameter, this will not significantly bias the intensity. We mask hot pixels throughout the bias calculation described above.

Because of the short exposure times in most SEEDS images, cosmic rays are rarely a problem. We do implement an algorithm to reject isolated discrepant pixels or clusters of pixels similar to the one used by Lafrenière et al. (2007b). We aggressively smooth the image with a large median filter, identify pixels that are above a certain threshold in the difference between the original and the smoothed maps, and mask them.

Flat-field images are stable to ∼2% from month to month, and are even more stable from night to night. We therefore construct one flat-field for each observing run from nine dithered dome flats. We correct the bias of each frame with the method described in Section 3.1 using only the reference pixels, and then median-combine the nine dithered images. ACORNS-ADI divides each science frame by this master flat after performing the bias correction. We do not attempt to correct for the detector's nonlinearity as it approaches saturation.

3.3. Distortion Correction

The field distortion is the difference between positions on the detector and positions on-sky. We model the distortion using a third-order two-dimensional (2D) polynomial in pixel coordinates relative to the center of the FOV; the distortion at the image center is thus zero by definition. We use Hubble Space Telescope images of the globular clusters M5 and M15 for our reference images, extracting stellar positions using SExtractor (Bertin & Arnouts 2010). We use the same tool to extract stellar positions in HiCIAO images. We then fit for the polynomial coefficients, using Markov Chain Monte Carlo (Press et al. 2007) to minimize the difference between Hubble positions and corrected HiCIAO positions. This technique gives best-fit values and errors on all parameters. On a good night, we measure the horizontal and vertical pixel scales to 0.01%, and the direction of true north to a precision of ∼0 fdg 005; poor conditions degrade our precision by a factor of up to ∼5. Finally, we use bilinear interpolation to transform our image to the new coordinate system, which has a pixel scale of 9.5 mas, similar to the pixel scale of the raw data. The distortion correction is described in more detail by Suzuki et al. (2010).

Our corrected images appear to be free of systematics, with the distribution of offsets between Hubble and HiCIAO positions being Gaussian in both the horizontal and vertical coordinates, and random over the FOV. The orientation of true north has been stable to ∼0 fdg 03 since the SEEDS survey began in 2009, but the plate scale has changed by up to 2% due to small changes in the optical setup and a new camera lens which was installed in 2011 April. Apart from changes in the overall plate scale, the field distortion has been extremely stable over the duration of the SEEDS survey. It is dominated by a ∼3% difference between the horizontal and vertical pixel scales and a ∼0 fdg 3 offset between the vertical axis on the detector and the celestial north.

4. IMAGE REGISTRATION

Image registration is important both to optimize the subtraction of the stellar PSF and to maximize companion flux in the final de-rotated, co-added image. Unfortunately, it is difficult to centroid saturated stars, and most SEEDS data have no bright but unsaturated stars in the 20'' × 20'' FOV. Here, we present a new algorithm to register isolated, saturated HiCIAO PSFs. This algorithm performs well, with a residual scatter of ∼1–2 mas (0.1–0.2 pixels) for observations made in good conditions, and could easily be applied to data from other instruments.

4.1. Developing a PSF Template

To accurately centroid images, we need to build a model of the HiCIAO PSF. Unfortunately, HiCIAO's AO system must be re-tuned at the beginning of each observation, and its PSF varies accordingly. Figure 2 shows this variation in a representative sample of SEEDS images. The top-left PSF is unsaturated, while the other panels each show a typical PSF of a different ADI sequence. The PSF varies strongly with atmospheric conditions and the performance of AO188, HiCIAO's 188-actuator AO system (Hayano et al. 2008), which itself depends on stellar brightness. To capture this variation, we proceed empirically, ultimately building a set of three PSF templates from nearly 3000 individual exposures of saturated stars observed as part of the SEEDS survey.

**Figure 2.** Sample HiCIAO PSFs. The top-left panel is unsaturated and includes the scale, while the other panels each represent a typical PSF of a different object observed as part of the SEEDS survey. The color scale is linear with white representing zero; because HiCIAO reads each pixel twice, areas that saturate rapidly can appear white. Using the algorithm described in Section 4, we can centroid each PSF to an accuracy of better than 0.5 pixels. For the better-behaved PSFs, like that shown in the bottom left, our accuracy improves to ∼0.1–0.2 pixels, or ∼1–2 mas.
Download figure:
Standard image High-resolution image

We extract three templates from 3000 images by iteratively registering the frames and performing PCA. We initially centroid the frames by fitting Moffat profiles. We manually remove outliers—some of which result from bad data, others from the failure of the algorithm—and use a scaled, unsaturated PSF to flag saturated pixels. We then rescale the data to a common flux and perform PCA on the sequence of images. We use weighted PCA, estimating the noise at each unsaturated pixel as the sum of read noise and photon noise, and ignore saturated pixels.

We now use the mean PSF and the first two principal components from this first pass to refine our centroids. We first re-estimate the centroid of each image by flagging saturated pixels (those with at least 70% of the maximum intensity on the detector) and centroiding the greatest concentration of such pixels. We then compute the rms distance r_rms of the saturated pixels from the provisional center, and mask all pixels within 1.5r_rms. We model the variance of the intensity at each remaining pixel as shot noise plus read noise, and fit the frame with a linear combination of the mean PSF and the first two principal components by minimizing

$\begin{equation} \chi ^2 = \sum _{{\rm pix}\, i} \frac{1}{\sigma _i^2} \left(I_i - \sum _{{\rm tmpl}\, j} \alpha _j T_{ij} \right)^2, \end{equation} \tag{ 2 }$

where j indexes the templates T (T₀ represents the mean PSF), i indexes the pixels, and α_j are free parameters. We then shift the template PSF by an integer number of pixels relative to the image (thereby avoiding any artifacts from interpolation), and compute the best-fit χ² as a function of positional offset. Finally, we centroid the χ² map by fitting a 2D quadratic, and take the location of this peak to be the centroid of the image. We then register all of the frames, scale them to a common flux, mask saturated pixels, weight the other pixels, and perform PCA. We repeat this entire sequence of steps one more time to build our final set of PSF templates, which we take to be the mean PSF and the first two principal components.

4.2. Centroiding Saturated Frames

We register each sequence of saturated frames using the same method described in the previous section. The algorithm initially proceeds in four steps.

1.
Flag saturated pixels, centroid the greatest concentration of such pixels to compute a provisional center.
2.
Mask pixels near the provisional center, weight all other pixels by photon and read noise.
3.
Fit the PSF templates using χ² at an integer-pixel grid of offsets.
4.
Centroid the map of χ² merit statistics.

We then recenter each frame and compute the average PSF of the ADI sequence. As Figure 2 shows, the average PSF in a given sequence of images can be significantly asymmetric. We therefore compute relative centroids according to the procedure just described, and separately determine the centroid of the average PSF. We do not re-compute a separate PSF model for each ADI sequence. Doing so offers no improvement in performance, and appears to introduce slight interpolation artifacts.

A peculiar feature of HiCIAO makes it possible to visually estimate the centroid of a saturated frame. The HgCdTe H2RG chip is reset pixel-by-pixel and then read out twice; the interface computer records the difference between the two readings. As a result, pixels that saturate rapidly tend to show no difference between the two readings, and hence, are recorded as having zero intensity. We therefore allow the user to interactively select the absolute centroid of the average of an ADI sequence. Figure 3 shows a sample centroid verification image; the innermost circle has a radius of 4 HiCIAO pixels, or about 38 mas. As Figure 3 indicates, it is usually possible to absolutely centroid a sequence of saturated images to an accuracy of at least ∼0.5 pixels, around 5 mas. We then add this user-determined offset to each frame's centroid. In this way, ACORNS-ADI determines the relative centroids automatically, while the user selects the absolute centroid, in the form of a single offset for the sequence of images.

**Figure 3.** Sample average HiCIAO PSF from an ADI sequence. The user interactively chooses the absolute centroid in the average frame; the resulting offset is applied to each frame. The yellow cross shows the center of the image while the innermost circle has a radius of 4 HiCIAO pixels, or about 38 mas; the image is interpolated between points and therefore appears smooth. The absolute centroids in a typical sequence may be estimated to an accuracy of ≲0.5 pixels ≈ 5 mas; the relative centroids, as discussed in Section 4.3, are better.
Download figure:
Standard image High-resolution image

Unfortunately, it is difficult to independently verify the accuracy of the absolute centroid. The SEEDS survey does have a few image sequences with a bright, unsaturated star, and those taken under favorable conditions confirm an accuracy of ≲0.5 pixels. Image sequences with poor AO performance appear to have larger errors of ∼1–2 pixels. These errors remain impossible to verify on most sequences. We recommend that the user choose a conservative error σ_cen, and scale the sensitivity by the factor

$\begin{equation} \left(1 + \frac{\sigma _\phi \sigma _{\rm cen}}{R_{\rm PSF}} \right)^{-1}, \end{equation} \tag{ 3 }$

where σ_ϕ is the standard deviation in parallactic angle and R_PSF is the diameter of the PSF core. Any error in the absolute centroid will smear out a companion by a fractional amount roughly proportional to the field rotation, and inversely proportional to the size of the PSF.

4.3. Performance of the Centroiding Algorithm

We measure the performance of our new centroiding algorithm using ADI image sequences of single stars. The performance consists of the centroids' relative accuracy, which is completely determined by ACORNS-ADI, and their absolute accuracy, which is determined by the user and is much more difficult to assess. The accuracy of the relative centroids affects the quality of the data reduction, but not its astrometry. Poor image registration will smear out speckles and point sources but without introducing systematic offsets. The astrometric error in the reduced data is thus equal to the error in the absolute centroid, and must be estimated by the user. This varies from data set to data set but for data with good AO performance, is generally ≲0.5 pixels, ∼5 mas.

We can assign upper limits to the errors in image registration using the scatter in fitted centroid positions. This scatter will be due both to tracking errors and PSF fitting errors, which we assume to be uncorrelated. We further assume that tracking errors dominate the slow drift in the PSF position. Before the installation of an atmospheric dispersion corrector (ADC; Egner et al. 2010) in AO188, the PSF would drift over the course of an observation due to differential atmospheric refraction between the visible, in which the AO system guides, and the near-infrared, in which HiCIAO observes. This effect was mostly, though not entirely, eliminated by the installation of the ADC. More recent observations at high airmass (Thalmann et al. 2011) indicate that these slow drifts occur at the level of at most a few pixels (tens of mas) over the course of an ADI sequence.

As we wish to measure only the frame-to-frame fluctuations in the fitted centroids, we fit and subtract a low-order polynomial from the centroid positions in each ADI sequence. An alternative approach, measuring the positional difference between successive frames and dividing by $\sqrt{2}$ , gives nearly identical results. We then compute the rms scatter of the residual centroid positions, excluding the most discrepant 1% of points to remove outliers. For a true Gaussian distribution, this outlier exclusion reduces the variance by 10%; we correct our measured variances for this effect. The worst data sets, those in which the PSF varies most and positional jitter is most likely to be real, show an rms scatter of ∼0.3 pixels (3 mas) in both the horizontal and vertical directions, or ∼0.4 pixels overall. Most of the data are much better; 12 of the 21 ADI sequences we used to build the template PSFs showed overall residual rms scatters of less than 0.2 pixels (2 mas), and 17 of 21 had rms scatters of less than 0.3 pixels (3 mas).

Given the variation and asymmetries in HiCIAO's PSF, we allow the user to interactively determine the absolute centroid of an image sequence. However, the algorithm presented above can register a series of images to a typical precision of ∼0.2 pixels, or 2 mas. By centroiding a map of χ², which itself is computed only at integer pixel offsets, our new method avoids interpolating images or PSF templates. This makes it relatively fast and free of systematics.

5. ADI REDUCTION

The goal of an ADI reduction process is to subtract the stellar PSF in a way that maximizes sensitivity to point sources in the residual images. In practice, this means finding an algorithm that produces Gaussian residuals and an optimal signal-to-noise ratio (S/N) for single point sources.

We implement two basic techniques to model the PSF for each frame; these may be used alone or in conjunction with one another. The first method is to take the median of all of the frames to be the model PSF, subtract this from each individual image, and finally de-rotate and co-add the sequence. The second technique is the Locally Optimized Combination of Images (LOCI), described by Lafrenière et al. (2007b). We discuss several modifications of the basic LOCI algorithm, some of which have been described elsewhere. Unfortunately, the AO system on the Subaru Telescope does not perform well enough in the H-band to take advantage of most of these techniques.

We characterize each data reduction algorithm by its effective PSF, which we define to be the difference between a reduced image with and without a faint point source. The effective PSF varies with the choice of data reduction algorithm and with position on the detector; it is a product both of hardware (the PSF itself) and of software. For most SEEDS data sets, we find that the basic LOCI algorithm offers the best compromise between sensitivity and simplicity. When SCExAO (Guyon et al. 2011), the new higher-order AO system for the Subaru Telescope, is fully operational, other algorithms may offer significant sensitivity improvements.

5.1. Median PSF Subtraction

A simple way to model the PSF is to use the median of all frames in an ADI sequence (see Marois et al. 2006). The model PSF will then include all structures and companions in the FOV averaged over all position angles in the image sequence. Azimuthally extended sources will appear in the model PSF much as they do in individual exposures, while point sources will be smeared out by field rotation.

In this simple technique, the same model PSF is subtracted from each frame. The frames are then de-rotated to a common reference position and co-added. Variations in the PSF, such as a changing Strehl ratio, will strongly degrade the sensitivity of the final, processed image to point sources. A point source itself will suffer from a fractional flux loss roughly proportional to the ratio of the size of the PSF core to the amount of field rotation at its location,

$\begin{equation} f_{\rm loss} \sim \frac{\lambda }{D} \frac{1}{r_{\rm sep} \Delta \phi }, \end{equation} \tag{ 4 }$

where Δϕ is the total field rotation and r_sep is the angular separation between the point source and the central star. For a companion at r_sep ∼ 1'' with an H-band PSF core λ/D ∼ 0 farcs 05 and a total field rotation of 60°, f_loss ∼ 5%.

Azimuthally extended sources, like disks, will be suppressed by a factor

$\begin{equation} f_{\rm loss}(\phi) \sim \int _{\phi - \Delta \phi /2}^{\phi + \Delta \phi /2} \frac{I(\theta)\,d\theta }{I(\phi)}. \end{equation} \tag{ 5 }$

If we expand I as a Taylor series about ϕ, all of the odd terms vanish due to the symmetry of the integral. In other words, azimuthal gradients, as well as azimuthally symmetric sources, are completely suppressed; only higher-order features survive a median PSF subtraction.

The left two panels of Figure 4 show the original PSFs in annuli, smeared out by the field rotation of the data set, and the effective PSFs after a mean PSF subtraction. The latter are, on average, identical to the effective PSFs produced by a median PSF subtraction. The data set shown had 155 exposures, with a total field rotation of ∼30°, and is typical of a SEEDS observation. The integrated flux in the mean PSF-subtracted image is 0 to within 0.1% of the flux in the raw PSF image.

Median PSF subtraction is simple both conceptually and computationally, and has been successfully used to measure the geometry of circumstellar disks (Thalmann et al. 2011). However, as discussed above, this technique preserves only high-azimuthal-order disk features; as a result, it is only effective on a small subset of the SEEDS disk sample. Furthermore, other techniques such as LOCI, described in the following section, are much more sensitive to point sources. We include median subtraction as an option in ACORNS-ADI but generally recommend against its use. We use it here mainly as a baseline against which to measure the performance of other algorithms.

5.2. LOCI

LOCI, a technique for empirical PSF modeling, was introduced by Lafrenière et al. (2007b). The LOCI algorithm models the PSF in an ADI frame as a local linear combination of other frames in the sequence, with the coefficients calculated using simple least-squares. In each region of frame i, LOCI takes the other frames {j} and fits coefficients {α_ij}, eventually producing an image of residual intensity,

$\begin{equation} {\cal R}_i = I_i - \sum _j \alpha _{ij} I_j. \end{equation} \tag{ 6 }$

LOCI fits for the {α_ij} over optimization regions that are typically several hundred PSF footprints—several thousand pixels—in size. The subtraction regions are generally at least a factor of 10 smaller. We give more details about the calculation of the LOCI coefficients in Appendix A. Because LOCI uses least-squares fitting, the solution for the {α_ij} is a linear problem, with the size of the resulting linear system set by the number of frames in an ADI sequence.

To interpret a LOCI-processed ADI sequence, artificial point sources are added and the data are re-reduced (Lafrenière et al. 2007b). Here, we define the LOCI effective PSF to be the difference between the final, reduced image with and without a faint point source. The right panels of Figure 4 show the effective PSF after reducing a sample SEEDS data set with LOCI. The data set, with 155 frames and a total field rotation of ∼30°, represents a typical SEEDS observation. To ensure that each effective PSF is independent from the others, we add and reduce the faint companions one at a time. We use optimization regions 200 PSF footprints in size and a minimum field rotation of 70% of the PSF full width at half-maximum (FWHM) as our fiducial LOCI parameters.

As with median PSF subtraction (Section 5.1), LOCI subtracts azimuthally displaced copies of a faint source, producing negative "wings" in the effective PSF with an integrated flux approximately equal to the flux in the core. Indeed, the integrated flux in each LOCI panel is zero to within 0.5% of the flux in the original PSFs (top-left panel). The bottom-right panel shows the effect of a random positional jitter on the effective PSFs; such a jitter could be due to unmodeled instabilities in the PSF or image registration errors. These jitters, even if only 1/6 of the PSF FWHM in each coordinate, smear out the PSF cores and significantly degrade the sensitivity of observations (see Figure 4), emphasizing the need for reliable, sub-pixel image registration.

5.3. Calibrating LOCI

Especially at small separations from the central star, LOCI suppresses companion flux more than does median PSF subtraction. This is partly because the best reference frames tend to be nearby in time (and hence have little relative field rotation). Panels (a)–(c) of Figure 5 demonstrate this effect. Panel (b) shows azimuthally displaced PSF copies weighted by the LOCI coefficients obtained without adding a faint companion; panel (c), which closely resembles the mean-subtracted (lower left) panel of Figure 4, shows the effective PSFs after this step. Because of LOCI's angular protection criterion, there is little intensity suppression in the PSF cores in panel (b). However, this does not account for the full amount of flux loss (panel (e)).

**Figure 5.** Decomposition of LOCI image processing. Panel (a) shows the original PSFs, panel (b) shows the effect of subtracting angularly displaced copies of the PSF weighted by the LOCI subtraction coefficients, and panel (c) is the sum of panels (a) and (b). Panel (d) presents an additional source of flux suppression, which we demonstrate in Appendix A: if we perturb the flux in an image, then LOCI will, on average, fit this perturbed flux with a coefficient between 0 and 1. In panel (d), this coefficient varies from ∼0.5 at 03 to ∼0.1 at 15. We measure both effects as a function of spatial position and combine them to produce our sensitivity maps. Panel (e), the sum of panels (c) and (d), shows the effective PSFs (see Figure 4).
Download figure:
Standard image High-resolution image

As we show in Appendix A, the addition of a faint source perturbs the LOCI coefficients themselves. Rather than minimizing the least-squares equation for the companion-free PSF (Equation (6) squared and summed over pixels), we instead minimize

$\begin{equation} \sum _{\rm pixels} \left((I_i + I^{\prime }_i) - \sum _j \alpha ^{\prime }_{ij} (I_j + I^{\prime }_j ) \right)^2, \end{equation} \tag{ 7 }$

where I'_i is the perturbing (companion) intensity in frame i from a faint companion, and {α'_ij} are the perturbed LOCI coefficients. We use an approximation for I'_i, minimize Equation (7), and linearize it about the unperturbed coefficients {α_ij} to derive the fractional flux suppression. Appendix A gives the full derivation. This additional effect is the reason why adding too many reference frames can actually degrade LOCI's performance. To a certain degree, which varies according to AO performance and radial separation (and the number of reference frames), LOCI can fit anything. Panel (d) shows this effect in the same SEEDS data sequence; at small separations, it can suppress companion flux by an additional factor of ∼2.

The product of these two effects, the subtraction of azimuthally displaced PSF copies and the perturbation of the LOCI equations, accounts for the suppression of companion flux in a LOCI reduction. Both vary as a function of position but are nearly independent of companion flux. Figure 6 shows the results of aperture photometry on actual LOCI PSFs like those in the right panels of Figure 4. The error bars on individual points indicate the scatter in relative photometry as a function of source flux; that these are zero for the blue points (those with no positional jitter) indicates that the LOCI effective PSFs are linear in source flux. However, positional jitter, whether from AO tracking errors or from poor image registration, can introduce significant systematic errors into the recovered sensitivities.

**Figure 6.** Aperture photometry of reduced point sources normalized to the photometry of the original PSF. The error bars are the standard deviations of sources of various intensities at fixed position, while the vertical scatter of points represents azimuthal variation. The effective PSF is linear in source flux (Appendix A), but varies by up to ∼20% with azimuthal position. The orange curve shows the radial profile of a map of fractional flux suppression computed as described in Appendix A, while the hatched region indicates ±2σ of azimuthal variation. Though it does not capture the full range of azimuthal variation, our estimated flux suppression matches the simulated sources (blue points) with no systematics. Fluctuations in the position and shape of the PSF core and image registration errors, even with standard deviations (σ_x, σ_y) that are small compared with the PSF FWHM (006), can introduce large systematic errors in the recovered photometry.
Download figure:
Standard image High-resolution image

To avoid adding test sources everywhere on the FOV, we produce a map of flux suppression using the method derived in Appendix A. As an alternative, we could add faint sources to densely populate the FOV. However, these sources would have to be reduced independently of one another. If there is more than one source in an optimization region, the effect of the sources on the LOCI subtraction coefficients will change (see Equation (A6)). In general, because the linear system will be more heavily constrained, the residual intensity will be larger, and the user will overestimate his or her sensitivity.

The orange curve in Figure 6 indicates the radial profile of our map of simulated flux loss, with the hatched region covering a spread of ±2σ. Because we do not compute the perturbation of the LOCI coefficients self-consistently, and because we neglect asymmetries in the companion PSF, we do not capture the full range of positional variability in relative photometry. For this reason, we recommend using the mean flux suppression at a companion's separation to calibrate its flux. We also recommend adding the azimuthal variance in the partial flux subtraction to the usual annular variance in intensity. Our model is computationally simple and generally does an excellent job of reproducing the typical relative photometry at all angular separations; it is also free of the systematic error that we would introduce by adding and reducing many point sources simultaneously.

5.4. LOCI Refinements

Several authors (e.g., Marois et al. 2010; Soummer et al. 2011; Pueyo et al. 2012; Currie et al. 2012) have recently introduced refinements to the LOCI algorithm discussed above. These include reducing the size of subtraction regions to a single pixel, masking a small area around each subtraction region, preconditioning the design matrix, and selecting relatively correlated subsets of the full data as reference frames. We find that for SEEDS data, none of these steps offer a significant improvement in sensitivity. The ineffectiveness of masking an area around each subtraction region is particularly surprising. While this refinement does reduce flux suppression, it does so at the cost of additional noise. It appears that, for SEEDS data, LOCI is often as effective at fitting and removing sources as it is at fitting and removing noise. Even the subtraction of a radial profile from each image, a component of the original LOCI algorithm (Lafrenière et al. 2007b), does not improve sensitivity in typical SEEDS data.

One refinement, suggested by Marois et al. (2010) and Pueyo et al. (2012), does improve the sensitivity of some SEEDS data. Because LOCI can fit sources and noise equally well when given enough reference frames, it is preferable to reduce groups of ∼100 frames at a time; a number somewhat smaller than the size (in PSF footprints) of the optimization regions. We implement this refinement by adding the capability to process data in an integer number of groups of frames, with a single group being equivalent to a normal LOCI reduction.

We introduce three refinements of our own in addition to those listed above.

1.
Performing PCA on an ADI sequence and subtracting the first n components before applying LOCI, similar to the methods suggested by Soummer et al. (2012) and Amara & Quanz (2012).
2.
Including principal components as reference frames in the LOCI process.
3.
Applying LOCI twice, to overcorrect the residuals in the first application.

While we expect these refinements to be useful with a higher Strehl ratio, they seem to suppress noise and sources equally well in SEEDS data with its ∼30% Strehl ratio in the H-band. Soummer et al. (2012) and Amara & Quanz (2012) describe algorithms in which they use a library of PSF components like those we use for image registration (Section 4). Unfortunately, while these components are sufficiently good to register SEEDS images, they do not improve the sensitivity of LOCI. For SEEDS data, they are not even good enough to perform absolute centroiding to better than ∼1 pixel. Unlike a space telescope, HiCIAO's AO system must be re-tuned before each observation. As a result, the PSF variation from one observation to the next is generally much larger than the variation within a single ADI sequence. Applying an initial PCA subtraction also makes it much more difficult to understand the fractional flux loss, and hence the sensitivity to point sources. In other words, it makes the flux suppression in panel (d) of Figure 5 significantly larger and harder to estimate.

We implement all of these refinements as optional features of ACORNS-ADI. With the improved performance of SCExAO, Subaru's next-generation AO system, or when applied to data from other instruments, these refinements may become much more powerful.

6. SEARCHING FOR POINT SOURCES

After reducing each frame in an ADI sequence, we combine them into a single image and search for point sources. We now discuss each step in turn. In Section 6.1, we introduce a new algorithm, intermediate between the mean and the median, to combine an image sequence. This new algorithm improves the standard deviation by up to 20% relative to taking the median of the images. In Section 6.2, we test several filters to search for point sources, settling on a 0 farcs 05 diameter circular aperture as the best choice.

6.1. Combining an Image Sequence

The optimal method to combine a sequence of N frames (N ≫ 1) into a final, reduced image depends on the properties of the errors in each frame. For example, if the errors are independent and normally distributed, then taking the mean of all of the frames gives a combined data point with only 4N/(π(2N − 1)) ≈ 64% of the variance obtained by taking their median (Kenney & Keeping 1962). However, using the mean is not robust to outliers, and is a poor choice at small angular separations where speckle residuals may be highly non-Gaussian between frames.

The original LOCI algorithm (Lafrenière et al. 2007b) and various refinements (e.g., Soummer et al. 2011) simply use the median of their LOCI-processed frames, which may not be optimal for HiCIAO data, particularly in regions far from the central star where we expect read noise to dominate. We therefore use the trimmed mean, which is continuous between the mean and median: we sort the image sequence at each spatial location, and take the mean of the middle n points, discarding (N − n)/2 values each at the high and low end. When n = 1, this is equivalent to taking the median; when n = N, the number of images in the sequence, it is equivalent to taking the mean. We derive the efficiency of this estimator for data drawn from a normal distribution in Appendix B.

The top panel of Figure 7 shows this estimator as applied to a sample HiCIAO image sequence of 155 frames. At small angular separations, outliers are relatively common and the mean provides a poor estimator, with a standard deviation ∼20% higher than that of the median. Far from the central star, however, the picture is reversed; using the mean gives a large improvement in sensitivity. At nearly all separations, the optimal solution is somewhere in between, close to the median at small separations and close to the mean further away. We expect (and Figure 7 confirms) that the relationship between the optimal n in the trimmed mean and angular separation from the central star is essentially monotonic.

We implement a trimmed mean estimator iteratively. We begin with the median of our image sequence and calculate its noise profile. We then calculate the noise profile for an image created by averaging more frames (or trimming fewer), and replace data points in the median image at annuli where the new estimator reduces the variance. We repeat this step, using more frames (larger n) in each successive estimator, until we use nearly all of the frames. We always trim at least 5% of the data to guard against cosmic rays and rare outliers; Figure 7 shows that this approximation incurs at most a 0.5% penalty in noise.

The bottom panel of Figure 7 shows the results of our iterative trimmed mean relative to a simple median of the frames in an image sequence. The new image represents an improvement in noise at all angular separations, with a ∼20% improvement at large separations where the frame-to-frame noise is very nearly Gaussian.

While the frame-to-frame noise at a given pixel may be significantly non-Gaussian, the distribution of trimmed means, due to the central limit theorem, is Gaussian. Our new final image thus retains the noise properties of the original LOCI algorithm; in our sample data set, it produced zero single-pixel false positives (pixels with >5σ fluctuations).

6.2. Filtering the Image

The optimal filter for detecting an object depends on both the object and the character of the noise. For noise that is Gaussian and independent at adjacent pixels, the optimal filter to search for point sources is the normalized PSF, referred to as a matched filter. In this section, we measure the performance of three filters for point sources in the HiCIAO data.

1.
A matched filter.
2.
A circular aperture.
3.
A "truncated" matched filter set to zero outside an aperture.
4.
A 2D median filter.

Figure 8 shows the relative performance of these filters. We truncate the matched filter at twice the radius of our fiducial 0 farcs 05 diameter aperture to limit the impact of the outer wings, which can depend strongly on azimuthal position (see Figure 4). We do not show the full matched filter, which fails to outperform aperture photometry even when assuming perfect knowledge of the effective PSF.

Perhaps surprisingly, a 0 farcs 05 diameter (1.2λ/D at 1.6 μm) circular aperture seems to offer the best performance of all of the filters. A 2D median filter is not optimal, especially far from the central star, because the noise in the reduced frame is approximately Gaussian. The poor performance of the matched filter, on the other hand, results from the strong correlation of the residual intensity in neighboring pixels. Figure 9 demonstrates this correlation in Fourier space; it results from averaging adjacent pixels when interpolating onto a new spatial array. We interpolate each frame three times during the data reduction process: when applying the distortion correction, when recentering, and finally when de-rotating each image to a common orientation on-sky. We have verified that we can closely reproduce the power spectrum of noise at large separations by smoothing white noise.

In addition to the suppression of noise at high spatial frequency, Figure 9 shows an increase in power at a few λ/D (a few PSF diameters), particularly in regions close to the central star. This is an artifact of the LOCI algorithm, which tends to give zero flux averaged over a spatial region larger than a PSF core. As a result, LOCI introduces an anti-correlation in intensity over scales larger than the size of the PSF. This also helps explain the poor performance of the full matched filter, which would otherwise take advantage of the negative wings in the effective PSF: large random fluctuations will also tend to be surrounded by regions of negative intensity.

6.3. Producing a Sensitivity Map

To date, the vast majority of high-contrast direct imaging observations have not detected substellar companions. However, non-detections may still be used to test models of planet and brown dwarf frequency, separation, and luminosity (e.g., Lafrenière et al. 2007b; Bonavita et al. 2012). Such analyses rely on the accuracy of sensitivity maps. As we show in Figure 6, it is easy to systematically overestimate sensitivity by failing to include effects such as PSF fluctuations and image registration errors.

It has become widely accepted within the community to estimate sensitivity by computing the standard deviation in the final, combined image in annuli around the central star (e.g., Lafrenière et al. 2007b; McElwain et al. 2008; Metchev & Hillenbrand 2009; Vigan et al. 2012). At separations of more than a few λ/D, these annuli are already at least several tens of PSF footprints in size. To account for LOCI's suppression of companion flux, the data set is re-reduced after adding faint sources to compute the fractional flux loss as a function of radial separation. We begin in the same way, computing the standard deviation of the residual intensity after convolving with our chosen 0 farcs 05 aperture. We correct for companion flux suppression using our own method, which we describe in Section 5.3 and Appendix A.

The result of our sensitivity analysis is not simply a radial profile, but a full 2D sensitivity map, obtained with modest additional computational cost. As a final step, we multiply this map by the scaled aperture photometry of the central star in a sequence of reference images, producing a contrast map.

7. USE AND PERFORMANCE

ACORNS-ADI is easy to use, and requires user interaction at only two points.

1.
To start the program, select the data and calibration files, and the reduction parameters.
2.
To interactively set and verify the absolute image centroid.

The total human time to perform a reduction is thus a couple of minutes, and is independent of the data set.

The amount of computer time required scales with the size of the data set and the number of processors available. ACORNS-ADI is efficiently parallelized, running more than 12 times as fast on 16 processors as on a single processor. A full reduction from raw data on an ADI sequence of 155 frames takes about 40 minutes on our three-year-old 16-core machine; this compares with ∼2 hr of computer time and 8 user interactions to process 1/3 as many optimization regions with serial IDL software adapted from Lafrenière et al. (2007b). The LOCI step scales differently depending on the number of frames. For data sets of ≳100 frames, it scales with N⁴_frames, while for much smaller data sets it scales with N²_frames. The other steps in the data reduction process, with the exception of combining the images (which is computationally cheap), each scale linearly with N_frames.

7.1. Extension to Other Instruments

Most of the algorithms presented above apply to ADI data taken by any instrument. ACORNS-ADI can reduce these data with minimal modification by the user, who must supply the following.

1.
A flat-field correction and hot pixel mask.
2.
An (optional) field distortion correction.
3.
A set of PSF templates.
4.
A curve describing the integrated overlap of PSF cores.

The most difficult item to compute is the set of PSF templates. We hope to supply a set of images for each of several instruments over the coming months. The last item simply refers to the flux in an aperture displaced a certain number of pixels from the PSF centroid, and is used to compute the fractional flux loss in LOCI.

While it is easy to extend ACORNS-ADI to other instruments, we offer several notes of caution when doing so. Different instruments have different conventions for header data such as the exposure time, the number of co-adds in a frame, and the image orientation with the image rotator off. For example, in HiCIAO, the keyword "EXPTIME" refers to the total integration time of the frame, while for NIRI, "EXPTIME" refers to the integration time for each co-add. This makes it difficult to write a fully general software package. Unfortunately, ACORNS-ADI cannot detect these differing conventions automatically, and the user must be cautious.

8. CONCLUSIONS

We have described ACORNS-ADI, a new, parallel, open-source software package for reducing ADI data from the SEEDS survey. Most of its modules apply equally well to non-SEEDS ADI data, and the entire package could easily be adapted to analyze data from other instruments. We have introduced three new algorithms.

1.
A new method of performing image registration, which is accurate to ∼0002, ∼0.2 HiCIAO pixels.
2.
A new method for combining images in an ADI sequence, which reduces noise by up to 20%.
3.
A new method for calculating the flux loss in the LOCI algorithm without adding and reducing artificial point sources.

These new algorithms may be applied to any ADI data set, improving performance and decreasing run time.

We have described and characterized each step of the ADI data reduction process for SEEDS data. With ACORNS-ADI, we will be able to process data much more quickly and efficiently, taking advantage of the SEEDS survey's design as a large strategic observing program. In the future, we will modify ACORNS-ADI to process data from other surveys and instruments, providing a large set of uniformly reduced data with which to perform statistical analyses of substellar companion frequencies and luminosities.

The authors thank the anonymous referee for many helpful comments and suggestions that clarified this manuscript. This research is based on data collected at the Subaru Telescope, which is operated by the National Astronomical Observatories of Japan. This material is based upon work supported by the National Science Foundation Graduate Research Fellowship under grant No. DGE-0646086. Part of this research was carried out at the Jet Propulsion Laboratory, California Institute of Technology, under a contract with the National Aeronautics and Space Administration. The authors wish to recognize and acknowledge the very significant cultural role and reverence that the summit of Mauna Kea has always had within the indigenous Hawaiian community. We are most fortunate to have the opportunity to conduct observations from this mountain.

APPENDIX A: PARTIAL SUBTRACTION IN LOCI

In LOCI, we build a model PSF for each frame I_i in an ADI sequence from the other frames {I_j} satisfying LOCI's angular displacement criterion. Denoting the intensity at pixel k in frame j by I_jk, we calculate the coefficients {α_j} that minimize

$\begin{equation} R_i^2 = \sum _{{\rm pixels}\, k} \left(I_{ik} - \sum _{{\rm frames}\, j} \alpha _j I_{jk} \right)^2, \end{equation} \tag{ A1 }$

where the first sum is over the pixels k in the optimization region. The coefficients {α_j} are the solution to the linear system

$\begin{equation} \mathbf {A} \cdot \boldsymbol{\alpha } = \mathbf {b}, \end{equation} \tag{ A2 }$

with

$\begin{equation} {\rm A}_{jl} = \sum _{{\rm pixels}\, k} I_{jk} I_{lk} \quad {\rm and} \; {\rm b}_j = \sum _{{\rm pixels}\, k} I_{ik} I_{jk}. \end{equation} \tag{ A3 }$

We can perturb this problem by adding a faint source of intensity I' to frame i, and azimuthally displaced copies of it to the other frames. We approximate the azimuthally displaced copies by adding a source of intensity

$\begin{equation} I^{\prime }_{\rm eff} = I^{\prime }-\sum _j \alpha _j I^{\prime }(\delta \phi _j) \end{equation} \tag{ A4 }$

to frame i. We then solve the perturbed problem by minimizing Equation (A1) again, this time with the faint effective source of Equation (A4) added to each frame. Note that we use the unperturbed coefficients {α_j} in Equation (A4) rather than iteratively solving for the exact perturbed solution. The perturbations {β_j} in the LOCI coefficients will then be given by the solution to the linear system

$\begin{equation} \mathbf {A} \cdot \boldsymbol{\beta } = \mathbf {b^{\prime }}, \end{equation} \tag{ A5 }$

with

$\begin{equation} {\rm b}^{\prime }_j = \sum _{{\rm pixels}\, k} \left(I^{\prime }_{k} - \sum _{{\rm frames}\, l} \alpha _l I^{\prime }_{k}(\delta \phi _l) \right) I_{k}(\delta \phi _j). \end{equation} \tag{ A6 }$

Note that the companion intensity I' and the coefficients β_j are both linear in the companion flux. The residual intensity in pixel k of frame i, $\mathcal {R}_{ik}$ , is then

$\begin{eqnarray} \mathcal {R}_{ik} &= I_{ik} + I^{\prime }_{ik} - \sum _j \left(\alpha _j I_{jk} + \alpha _j I^{\prime }_k (\delta \phi _j) + \beta _j I_{jk} + \beta _j I^{\prime }_k (\delta \phi _j) \right)\nonumber\\ \end{eqnarray} \tag{ A7 }$

$\begin{eqnarray} &\approx I_{ik} - \sum _j \alpha _j I_{jk} + I^{\prime }_{ik} - \sum _j \left(\alpha _j I^{\prime }_k (\delta \phi _j) + \beta _j I_{jk} \right). \quad \end{eqnarray} \tag{ A8 }$

Because the source is faint, we drop the quadratic term β_jI'_k(δϕ_j). The first two terms in Equation (A8) give the residual intensity without the additional faint source, while the latter three terms give the residual intensity in the LOCI-processed image. These are all proportional to I'; hence, the LOCI effective PSFs are linear in the source flux.

We compute the relative photometry of a LOCI effective PSF by evaluating the latter two terms of Equation (A8), multiplying by an aperture, and summing over pixels (as described in Section 6.2, we use aperture photometry for the SEEDS data). We use a source of unit flux,

$\begin{equation} \sum _{{\rm pixels}\, k} I^{\prime }_{ik} a_{k} = 1, \end{equation} \tag{ A9 }$

where a is the aperture. Thus, ∑_kI'_k(δϕ_j) is the flux in an aperture displaced from the PSF center by the relative field rotation between frames i and j. We pre-compute these fluxes as a function of position, then multiply by the LOCI coefficients {α_j}.

We estimate the last term in Equation (A8) by first approximating I'_eff (Equation (A4)) as a Gaussian central peak, with two pairs of wider, negative Gaussians representing the wings (see Figure 4). The angular separations of the negative Gaussians from the central peak are 1.5 and 3 times LOCI's angular protection zone, or 1 and 2 times the standard deviation in parallactic angle, whichever are less. Our approximation for I'_eff has zero integrated flux, and accurately recovers the LOCI flux suppression (see Figure 6). By using the same approximate I'_eff for each frame, we avoid the need to recompute b' (Equation (A6)) for each frame, and save a factor of nearly N_frames in execution time. We then compute {β_j} using Equation (A5). Because we have already performed LU decomposition on A to solve Equation (A2), this step takes little computational effort. We finally compute the total flux loss within the aperture,

$\begin{equation} \sum_{{\rm pixels}\, k} a_k \sum _{{\rm frames} \,j} (\alpha _j I^{\prime }_k (\delta \phi _j) + \beta _j I_{jk}). \end{equation} \tag{ A10 }$

This allows us to compute the fractional flux loss everywhere on the FOV for little additional cost relative to LOCI itself.

APPENDIX B: EFFICIENCY OF THE TRIMMED MEAN ESTIMATOR

We briefly derive the efficiency of the trimmed mean estimator used in Section 6.1 when applied to data with normal errors. The efficiency is inverse of the variance of an estimator relative to the minimum possible variance of any estimator. For Gaussian data, the mean provides the minimum possible variance, which is equal to σ²/N. In the trimmed mean, we take the mean of the middle n out of a total of N points; for simplicity, we will assume N and n to be odd. We will work mostly in the space of the cumulative distribution function (CDF), in which each realization of the distribution is drawn from a uniform distribution between 0 and 1.

As a first step, we write down the probability that the middle n realizations (in CDF space) are all between x₁ and x₂, with one realization each at x₁ and x₂ (within dx); that is, that we have (N − n)/2 − 1 realizations <x₁ and (N − n)/2 − 1 realizations >x₂. We denote (N − n)/2 − 1, the number of data points trimmed at each end, by q. We have

$\begin{eqnarray} p(x_1, x_2) dx^2 &=& x_1^q (1 - x_2)^q (x_2 - x_1)^n \times {}_NC_q \times {}_{N-q}C_q \nonumber\\ &&\times (n + 2) \times (n + 1) \end{eqnarray} \tag{ B1 }$

$\begin{eqnarray} &\quad\ = x_1^q (1 - x_2)^q (x_2 - x_1)^n \frac{N!}{(q!)^2 n!}. \end{eqnarray} \tag{ B2 }$

Now, given x₁ and x₂, we wish to calculate the variance of the mean of n realizations of the truncated normal distribution. We will assume without loss of generality that the normal distribution has zero mean. We denote these limits as CDF⁻¹(x₁) = σy₁ and CDF⁻¹(x₂) = σy₂, where CDF⁻¹ is the quartile function and y₂ > y₁ are both drawn from a normal distribution with unit variance. Assuming the n realizations to be independent, the expectation value of the square of their mean is

$\begin{eqnarray} &&\sum _{y_1,y_2} p(y_1,y_2) \left[ \frac{n}{x_2 - x_1} \int _{\sigma y_1}^{\sigma y_2} \frac{t^2\,dt}{n^2 \sqrt{2\pi \sigma ^2}} e^{-t^2/2\sigma ^2}\right.\nonumber\\ &&\left.\quad +\, \frac{n (n - 1)}{(x_2 - x_1)^2} \left(\int _{\sigma y_1}^{\sigma y_2} \frac{t\,dt}{n \sqrt{2\pi \sigma ^2}} e^{-t^2/2\sigma ^2} \right)^2 \right]. \end{eqnarray} \tag{ B3 }$

We integrate the first term by parts and then integrate the second term. The inverse of the efficiency is then σ²/N times Equation (B3), equal to

$\begin{eqnarray} && \frac{N}{n} \sum _{y_1,y_2} \frac{p(y_1,y_2)}{x_2 - x_1} \left[ \frac{y_1}{\sqrt{2\pi }} e^{-y_1^2/2} - \frac{y_2}{\sqrt{2\pi }} e^{-y_2^2/2}\right.\nonumber\\ &&\left.\quad +\, \int _{y_1}^{y_2} \frac{dt}{\sqrt{2\pi }} e^{-t^2/2} + \frac{n - 1}{2\pi (x_2 - x_1)} \left(e^{-y_1^2/2} - e^{-y_2^2/2} \right)^2 \right]\qquad \end{eqnarray} \tag{ B4 }$

$\begin{eqnarray} &&= \frac{N}{n} \int _0^1 dx_1 \int _{x_1}^1 dx_2\, \frac{p(x_1,x_2)}{x_2 - x_1} \left[ \frac{y_1}{\sqrt{2\pi }} e^{-y_1^2/2} - \frac{y_2}{\sqrt{2\pi }} e^{-y_2^2/2}\right.\nonumber\\ &&\left.\quad +\, x_2 - x_1 + \frac{n - 1}{2\pi (x_2 - x_1)} \left(e^{-y_1^2/2} - e^{-y_2^2/2} \right)^2 \right]. \end{eqnarray} \tag{ B5 }$

We then substitute for p(x₁, x₂) using Equation (B2) and evaluate the integral. In the limit of the median (n = 1), Equation (B5) reduces to π(2n − 1)/4n, for an asymptotic efficiency of 2/π ≈ 64% (Kenney & Keeping 1962).

NEW TECHNIQUES FOR HIGH-CONTRAST IMAGING WITH ADI: THE ACORNS-ADI SEEDS DATA REDUCTION PIPELINE^{^*}

Article metrics

Permissions

Author affiliations

Dates

ABSTRACT

1. INTRODUCTION

2. ADI DATA REDUCTION IN SEEDS