Astronomical Data Analysis Software and Systems IV
ASP Conference Series, Vol. 77, 1995
R. A. Shaw, H. E. Payne, and J. J. E. Hayes, eds.
Robust Estimation and Image Combining
C. Y. Zhang
Space Telescope Science Institute, 3700 San Martin Dr., Baltimore, MD
21218
Abstract. A task GCOMBINE in the STSDAS package, for combining images, is briefly reviewed. With a stack of input images, one can identify bad pixels by comparing the deviation of a pixel from the mean with a user-specified threshold times the clipping sigma. It is crucial that estimation of the mean and sigma be robust against unidentified outliers, in order to reject cosmic ray events effectively. A variety of weighting schemes and means for storing noise characteristics are discussed. Information in neighboring pixels can also be used in a global way to identify cosmic rays.
1. Introduction
In images taken with CCD cameras, cosmic ray (CR) events are a common and annoying problem. Space-based CCD cameras, such as the Wide Field/Planetary Camera (WFPC) of the Hubble Space Telescope (HST), are especially subject to a large flux of CR events. The problem is even more prominent for the new WFPC2. In these cases, it is recommended that long science observations be broken into at least two exposures (CR-SPLIT) to ensure that CR events can later be identified and removed (see, e.g., Wide Field and Planetary Camera 2 Instrument Handbook, 1994).
2. Overview of GCOMBINE
The task GCOMBINE, in STSDAS, is designed mainly for HST images in GEIS format, but is not restricted to them. A parameter, groups, allows a user to select a range of groups to combine. A combined image is either a weighted average or a median of a set of input images with bad pixels, including CR events, removed. As in the IRAF task IMCOMBINE, bad pixels can be rejected using masking, threshold levels, and various rejection algorithms.
A list of input error images can be used by the errclip algorithm to reject bad pixels and to compute the pixel-wise weight when the parameter weight is set to pixelwise.
There are a variety of rejection algorithms from which to choose for rejecting bad pixels after possible masking and thresholding operations: the minmax, ccdclip, rsigclip, avsigclip, and errclip algorithms. In the following, we discuss some of these algorithms in more detail.
3. Scaling and Weighting Schemes
In order to combine images with different exposures, one must first bring them to a common level. For the $i$th image, let $D_i$ denote the original pixel value, $Z_i$ the zero level, $\langle Z \rangle$ the average of $Z_i$, $S_i$ the divisive scaling factor, $\langle S \rangle$ the average of $S_i$, and $d_i$ the scaled pixel value. We then write
$$d_i = (D_i - Z_i + \langle Z \rangle)\,\langle S \rangle / S_i. \eqno(1)$$
This brings the images to the mean sky level $\langle Z \rangle$ and the common exposure level $\langle S \rangle$. In the code, Equation (1) is translated into
$$d_i = D_i/s_i - z_i, \eqno(2)$$
where $s_i = S_i/\langle S \rangle$ and $z_i = (Z_i - \langle Z \rangle)/s_i$. The normalized scaling factors $s_i$ and $z_i$ are stored in an output log file.
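The normalization of Equation (2) can be sketched as follows (a minimal illustration in Python with NumPy, not the actual STSDAS code; the function name and list-based interface are assumptions for this sketch):

```python
import numpy as np

def scale_images(images, zeros, scales):
    """Bring a stack of images to a common zero and exposure level.

    images : list of 2-D arrays, original pixel values D_i
    zeros  : per-image zero (sky) levels Z_i
    scales : per-image divisive scaling factors S_i (e.g. exposure times)

    Implements d_i = (D_i - Z_i + <Z>) * <S>/S_i of Equation (1),
    via the normalized form d_i = D_i/s_i - z_i of Equation (2).
    """
    zeros = np.asarray(zeros, dtype=float)
    scales = np.asarray(scales, dtype=float)
    s = scales / scales.mean()          # s_i = S_i / <S>
    z = (zeros - zeros.mean()) / s      # z_i = (Z_i - <Z>) / s_i
    scaled = [img / si - zi for img, si, zi in zip(images, s, z)]
    return scaled, s, z
```

The two forms are algebraically identical: dividing $(D_i - Z_i + \langle Z \rangle)$ by $s_i$ splits into $D_i/s_i$ minus $(Z_i - \langle Z \rangle)/s_i$.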
When the combined image is defined as a weighted average, it is useful to distinguish between uniform and pixelwise weighting schemes. By pixelwise weighting, we mean that the weight is calculated on a pixel-by-pixel basis. It requires an input "error map" associated with each input image; the weight at a given pixel is then the reciprocal of the square of the error at that pixel. By uniform weighting, we mean that the weight is a constant for all pixels in an input image. In the case of uniform weighting, one has a variety of choices, such as the exposure time, or the mean, median, or mode of the input image. For a relatively flat image, one may choose the reciprocal of an averaged variance as a uniform weight.
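Both weighting schemes are simple to state in code (an illustrative sketch; the helper names are assumptions, and the task itself offers more uniform-weight choices than shown here):

```python
import numpy as np

def pixelwise_weights(error_maps):
    """Pixel-by-pixel weights from per-image error maps: w = 1/sigma^2."""
    return [1.0 / np.square(e) for e in error_maps]

def uniform_weight(image):
    """One possible uniform weight for a relatively flat image:
    the reciprocal of the averaged variance."""
    return 1.0 / np.var(image)
```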
4. Robust Estimates of Means and Clipping Sigmas - RSIGCLIP
After a mean is computed, one computes the deviation of each pixel from the mean and compares it with a "clipping sigma". If the deviation exceeds a user-specified threshold, $\kappa$, times the clipping sigma, the pixel under test is regarded as bad. After the bad pixel is rejected, the process is iterated, with the mean and the clipping sigma re-calculated excluding the bad pixels rejected in previous iterations. The iteration terminates when no more pixels are rejected or when the number of retained pixels falls below the user-specified minimum.
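The iteration just described can be sketched as follows (a simplified, unweighted stand-in for the task's internal loop; the function name and parameters are illustrative, and the robust mean and sigma estimates discussed below are replaced by plain sample statistics):

```python
import numpy as np

def sigma_clip_stack(stack, kappa=3.0, nkeep=2, max_iter=10):
    """Iteratively reject values deviating more than kappa * sigma
    from the mean along an image stack.

    stack : (n_images, ny, nx) array.
    Returns a boolean mask of retained pixels. Iteration stops when no
    pixel is rejected or when only `nkeep` values remain in a column.
    """
    keep = np.ones(stack.shape, dtype=bool)
    for _ in range(max_iter):
        n = keep.sum(axis=0)
        mean = (stack * keep).sum(axis=0) / np.maximum(n, 1)
        var = ((stack - mean) ** 2 * keep).sum(axis=0) / np.maximum(n - 1, 1)
        sigma = np.sqrt(var)
        # Flag retained values whose deviation exceeds kappa * sigma,
        # but never clip a column down past nkeep retained values.
        bad = keep & (np.abs(stack - mean) > kappa * np.maximum(sigma, 1e-12)) & (n > nkeep)
        if not bad.any():
            break
        keep &= ~bad
    return keep
```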
It is obvious that if there are outliers to be identified, the simple mean of all measurements, not excluding the unidentified outliers, will be significantly biased by the often very large values of the outliers. The effect of unidentified outliers on the estimate of the clipping sigma can be even more severe, since the deviation is squared in computing the sigma. We discuss the robust sigma clipping algorithm, rsigclip, in detail.
4.1. Effects of Poisson Scaling
Assuming the Poisson noise dominates over the read noise and, for simplicity, ignoring the effects of the zero flux level offset, Equation (2) reduces to $d_i = D_i/s_i$. The sigma in the scaled image becomes $\sigma_i^s = \sigma_i^u/s_i$, where $\sigma_i^s$ and $\sigma_i^u$ are the sigmas corresponding to the scaled and unscaled data. For the Poisson distribution, $(\sigma_i^u)^2 = \langle D_i \rangle / g$, where $g$ is the gain. The optimal mean of the unscaled data is given by $\langle D_i \rangle = \langle d_i \rangle s_i$, where $\langle d_i \rangle$ is the proper mean of the scaled data. We thus have $(\sigma_i^s)^2 = \langle d_i \rangle/(s_i g)$. Therefore, even if the gain is unknown, the relative variance, $v_i$, in the scaled data is given by $\langle d_i \rangle / s_i$. Taking the weights $w_i = 1/v_i = s_i/\langle d_i \rangle$, the average sample sigma is given by
$$\langle \sigma \rangle^2 = \frac{\sum w_i (d_i - \langle d_i \rangle)^2}{\sum w_i}\,\frac{n}{n-1}. \eqno(3)$$
Noting that $w_i = s_i/\langle d_i \rangle$, the estimated absolute value of the sigma in the scaled data is then given (see, e.g., Bevington 1969) by
$$(\sigma_i^s)^2 = \frac{\xi_i}{n-1} \sum \frac{(d_i - \langle d_i \rangle)^2}{\xi_i}, \eqno(4)$$
where the Poisson scaling factor is given by $\xi_i = \langle d_i \rangle / s_i$.
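Equation (4) can be evaluated numerically as follows (an illustrative helper with assumed names; a plain mean stands in for the robust trimmed mean described in the next subsection):

```python
import numpy as np

def poisson_clipping_sigma(d, s):
    """Clipping sigma per Equation (4) under Poisson scaling.

    d : (n,) scaled pixel values across the image stack at one position
    s : (n,) normalized scale factors s_i

    The Poisson scaling factor is xi_i = <d>/s_i (the reciprocal of
    the weight w_i = s_i/<d>); returns sigma for each image.
    """
    d = np.asarray(d, dtype=float)
    s = np.asarray(s, dtype=float)
    n = d.size
    mean = d.mean()                 # stand-in for the trimmed mean
    xi = mean / s                   # Poisson scaling factor xi_i
    sig2 = xi / (n - 1) * np.sum((d - mean) ** 2 / xi)
    return np.sqrt(sig2)
```

With equal scale factors this reduces to the ordinary sample standard deviation, as expected.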
4.2. Robust Estimates of the Mean and Sigma
It is emphasized that no matter how accurate Equation (4) is when the data follow exactly Poisson distributions, the presence of unidentified outliers, which do not follow the same Poisson statistics at all, will inevitably make the estimates of Equation (4) unrealistic. This is where the "robust" estimation of the mean and sigma comes into play.
The method for estimating the mean used in rsigclip is robust. We exclude the maximum and minimum values before computing the mean, because they are the most vulnerable to being bad. The effect is to assign zero weight to the two extremal points, while assigning normal weights to the rest of the data. This so-called trimmed mean, used in the GCOMBINE and IMCOMBINE tasks, is an estimate of the mean that is robust against unidentified outliers.
To minimize the influence of unidentified outliers on the clipping sigma, Equation (3) is modified. We set a "normal" range of data values to exclude the most deviant points on both sides of the mean. For points within the range, the normal weights, $s_i/\langle d_i \rangle$, are used, as in Equation (3). Much smaller weights are assigned to points outside the range, farther from the mean: a weight of $s_i/d_i$ is used for points on the high side of the mean, and a weight of $0.001\,s_i/\langle d_i \rangle$ is applied to points on the low side. An outlier, if present, receives a negligible weight and therefore has minimal influence on the clipping $\sigma$. This scheme works well for identifying CR events, even when only a few images are combined.
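The trimmed mean and the asymmetric down-weighting scheme can be sketched as (illustrative helper names; the "normal" range endpoints `lo` and `hi` are taken as inputs here, whereas the task derives them internally):

```python
import numpy as np

def trimmed_mean(d):
    """Mean after excluding the minimum and maximum values,
    as used by the GCOMBINE and IMCOMBINE tasks."""
    d = np.sort(np.asarray(d, dtype=float))
    return d[1:-1].mean()

def robust_weights(d, s, mean, lo, hi):
    """Weights for the modified Equation (3): s_i/<d> inside the
    'normal' range [lo, hi], s_i/d_i on the high side of the mean,
    and 0.001 * s_i/<d> on the low side."""
    d = np.asarray(d, dtype=float)
    s = np.asarray(s, dtype=float)
    w = s / mean                         # normal weight s_i/<d>
    w = np.where(d > hi, s / d, w)       # high-side outliers
    w = np.where(d < lo, 0.001 * s / mean, w)  # low-side outliers
    return w
```

A high outlier such as a cosmic ray thus gets a weight smaller by roughly the ratio of the mean to its own (large) value, keeping its influence on the clipping sigma negligible.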
5. CCDCLIP and ERRCLIP Algorithms
If the noise in the $i$th unscaled image can be obtained from the CCD noise model, then using the error propagation principle and the relation $\langle D_i \rangle = \langle d_i \rangle s_i$, we find that the optimal clipping sigma in the scaled image should be
$$(\sigma_i^s)^2 = \frac{1}{s_i^2}\left\{ \frac{\sigma_{rd}^2}{g^2} + \frac{\langle d_i \rangle s_i}{g} + s^2 \langle d_i \rangle^2 s_i^2 \right\}, \eqno(5)$$
where $\sigma_{rd}$ is the readout noise in number of electrons, $g$ is the gain factor in units of $e^-/\mathrm{DN}$, and $s$ is the fractional "sensitivity noise."
In the code, the trimmed mean or median is used. Substituting the trimmed mean or median into Equation (5) gives an estimate of the clipping sigma. This clipping sigma does not involve any computation of squared deviations and is thus robust against unidentified outliers. If error maps exist for the input images, one does not have to compute the clipping sigma, because the sigma is available from the input error map for each individual pixel. Testing shows that errclip works well for WFPC images if one makes error maps using, e.g., the imcalc task based on the CCD noise model. In doing so, it is important that the errors provided in the input error maps be scaled in the same way as the input images.
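Equation (5) amounts to a short computation (an illustrative helper; the name and argument order are assumptions, and `snoise` denotes the fractional sensitivity noise $s$):

```python
import numpy as np

def ccd_clipping_sigma(mean_d, s_i, rdnoise, gain, snoise=0.0):
    """Clipping sigma in the scaled image from the CCD noise model,
    per Equation (5).

    mean_d  : robust (trimmed) mean or median of the scaled data <d_i>
    s_i     : normalized scale factor of the image
    rdnoise : readout noise in electrons
    gain    : gain factor in e-/DN
    snoise  : fractional sensitivity noise s
    """
    var = (rdnoise / gain) ** 2 \
          + mean_d * s_i / gain \
          + (snoise * mean_d * s_i) ** 2
    return np.sqrt(var) / s_i
```

Because only the robust mean enters, no squared deviation of individual pixels is ever computed, which is what makes this estimate insensitive to unidentified outliers.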
6. Information in Neighboring Pixels
In the above discussion, only the information along the stack of input images at a given output pixel is used. It is conceivable that information in neighboring pixels, such as the patterns of various astronomical objects, can be used to distinguish CR events. In the case of, say, only two images to combine, it becomes mandatory to use as much of the information in the image plane as possible. In the GCOMBINE task, the information contained in neighboring pixels is taken into account globally for the case of two images to combine.
The pixel values of two images of the same target, taken through the same filter, should be closely correlated. If one plots the pixel values of one image against those of the other, more than 99.5% of the pixels lie between two envelope curves, along which the pixel values of one image equal those of the other plus or minus three sigma. The outliers are almost exclusively located outside the enveloped region. The locations of the outliers relative to the neighboring "normal" pixels show, in a global way, that it is possible to identify outliers from the direct difference of the pixel values of the two images at a given pixel of the combined image. If the difference exceeds the user-specified $\kappa$ times the sigma, the larger value is rejected, provided that very negative pixels are excluded first. The clipping sigma in this case can only be obtained from either the CCD noise model or the input error maps. Tests show that this method is very effective in removing cosmic rays.
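The two-image rejection rule can be sketched as (an illustrative function, not the task's actual implementation; masking of very negative pixels is assumed to have been done beforehand):

```python
import numpy as np

def reject_cr_pair(a, b, sigma, kappa=3.0):
    """Two-image CR rejection: where |a - b| exceeds kappa * sigma,
    flag the larger of the two values as a cosmic ray hit.

    a, b  : the two aligned images (very negative pixels already masked)
    sigma : clipping sigma from the CCD noise model or error maps
    Returns boolean CR masks for a and b.
    """
    diff = a - b
    hit = np.abs(diff) > kappa * sigma
    cr_a = hit & (diff > 0)   # a holds the larger, deviant value
    cr_b = hit & (diff < 0)   # b holds the larger value
    return cr_a, cr_b
```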
Acknowledgments. I am grateful to F. Valdes, R. White, and K. Ratnatunga for very stimulating discussions.
References
Bevington, P. R. 1969, Data Reduction and Error Analysis for the Physical Sciences (New York: McGraw-Hill)
Burrows, C. J., ed. 1994, Wide Field and Planetary Camera 2 Instrument Handbook (Baltimore: Space Telescope Science Institute)