. : http://www.naic.edu/~phil/mbeam/hipass.pdf
: Fri Mar 14 17:44:01 2003
: Tue Oct 2 01:06:14 2012
:
Mon. Not. R. Astron. Soc. 322, 486 498 (2001)

The H i Parkes All Sky Survey: southern observations, calibration and robust imaging
D. A. M. R. M. D. M. R.
1 2

Australia Telescope National Facility, CSIRO, PO Box 76, Epping, NSW 1710, Australia Centre for Astrophysics and Supercomputing, Swinburne University of Technology, PO Box 218, Hawthorn, VIC 3122, Australia 3 Netherlands Foundation for Research in Astronomy, PO Box 2, 7990 AA Dwingeloo, the Netherlands 4 Department of Physics and Astronomy, University of Leicester, Leicester LE1 7RH 5 Department of Physics and Astronomy, University of Wales, Cardiff CF24 3YB 6 Department of Physics, University of Western Sydney Macarthur, Campbelltown, NSW 2560, Australia 7 Department of Physics, University of Bristol, Tyndall Avenue, Bristol BS8 1TL 8 School of Physics, University of Melbourne, VIC 3010, Australia 9 Research School of Astronomy and Astrophysics, Australian National University, Weston Creek, ACT 2611, Australia 10 School of Physics, University of Sydney, NSW 2006, Australia 11 Institute for Astrophysics, University of New Mexico, Albuquerque, NM 87131, USA 12 Space Telescope Science Institute, 3700 San Martin Drive, Baltimore, MD 21218, USA 13 Departamento de Astronomia, Universidad de Guanajuato, Guanajuato, GTO 36000, Mexico 14 Anglo Australian Observatory, PO Box 296, Epping, NSW 1710, Australia 15 Observatoire de la Cote d'Azur, Nice, 06304 Nice Cedex 4, France

G. Barnes,1,2w L. Staveley-Smith,1 W. J. G. de Blok,1 T. Oosterloo,3, I. M. Stewart,4 E. Wright,1 G. D. Banks,5 R. Bhathal,6 P. J. Boyce,7 M. R. Calabretta,1 M. J. Disney,5 J. Drinkwater,8 R. D. Ekers,1 K. C. Freeman,9 B. K. Gibson,2 A. J. Green,10 F. Haynes,1 P. te Lintel Hekkert,1 P. A. Henning,11 H. Jerjen,9 S. Juraszek,10 J. Kesteven,1 V. A. Kilborn,8 P. M. Knezek,12 B. Koribalski,1 R. C. Kraan-Korteweg,13 F. Malin,14 M. Marquarding,1 R. F. Minchin,5 J. R. Mould,9 R. M. Price,11 E. Putman,9 S. D. Ryder,14 E. M. Sadler,10 A. Schroder,15 F. Stootman,6 8 1 1 L. Webster, W. E. Wilson and T. Ye

Accepted 2000 October 2. Received 2000 September 28; in original form 2000 June 1

The acquisition of H i Parkes All Sky Survey (HIPASS) southern sky data commenced at the Australia Telescope National Facility's Parkes 64-m telescope in 1997 February, and was completed in 2000 March. HIPASS is the deepest H i survey yet of the sky south of declination 128, and is sensitive to emission out to 170 h21 MpcX The characteristic root 75 mean square noise in the survey images is 13.3 mJy. This paper describes the survey observations, which comprise 23 020 eight-degree scans of 9-min duration, and details the techniques used to calibrate and image the data. The processing algorithms are successfully designed to be statistically robust to the presence of interference signals, and are particular to imaging point (or nearly point) sources. Specifically, a major improvement in image quality is obtained by designing a median-gridding algorithm which uses the median estimator in place of the mean estimator. Key words: instrumentation: detectors methods: observational methods: statistical techniques: image processing surveys radio lines: galaxies.
1
E-mail: dbarnes@swin.edu.au Present address: Centre for Astrophysics and Supercomputing, Swinburne University of Technology, PO Box 218, Hawthorn, VIC 3122, Australia.
w

A B STRA CT

I N T R O DU CTI O N All Sky Survey (HIPASS) is a survey for neutral emission from extragalactic objects in the radial 21280 , cz , 12 700 km s21 Y over the entire south of declination 128, with an effective
q 2001 RAS

The H i Parkes hydrogen (H i) velocity range southern sky


HIPASS: southern observations, calibration and robust imaging
Table 1. Parameters for the H i Parkes All Sky Survey. The H i mass limit is calculated for a galaxy profile width of 200 km s21. Sky coverage Integration time per beam Average system temperature Central beam efficiency Central beam FWHM Average beam FWHM Velocity range Channel separation Velocity resolution 3s positional accuracy 3s detection limit 3s H i mass limit

487

d , 28 450 s 19 K (35 Jy) 63 per cent 14.0 arcmin 14.3 arcmin 21280 , cz , 12 700 km s2 13.2 km s21 at z 0 18.0 km s21 3 arcmin 40 mJy beam21 6 10 d2 M( Mpc

1

integration time of 450 s per beam. The survey parameters are shown in Table 1. HIPASS observations utilize the new 21-cm Multibeam receiver installed at the prime focus of the Parkes 64-m radio telescope,1 began in 1997 February, and were completed in 2000 March. The scientific potential of the survey is tremendous: the H i mass function for the nearby Universe will be determined better than ever before. HIPASS will provide completely new information on the distribution of galaxies, the cosmological density parameter, the space density of rare and optically invisible galaxies, and on group and supercluster dynamics. New results from the all-sky survey have included the discovery of a leading arm to the Magellanic system (Putman et al. 1998) and the identification of 10 new members of the Centaurus A group of galaxies, adding to the 28 already known within the area surveyed (Banks et al. 1999). Present work is directed towards the generation of a complete catalogue of southern sky H i sources. This paper describes the observing, calibration and imaging techniques used for the survey. In Section 2 the Multibeam hardware and the survey observing technique are described. In Section 3 the algorithms designed for the on-line processing of the data are described; these algorithms are directed at robust bandpass removal and flux calibration, residual baseline subtraction, spectral smoothing and Doppler tracking. Technical details on the implementation of some of the algorithms are given in Barnes (1998). Images of the H i sky are generated using the median-gridding technique introduced in Section 4, and are postprocessed into final survey images using the techniques given in Section 5. The primary survey product is a set of data cubes, measuring 88 88 on the sky, and having 1024 individual maps of H i emission at mean increments of 13.4 km s21 covering the velocity range given in Table 1.

Figure 1. Multibeam receiver configuration on the sky. The radii of the inner and outer rings of beams are 29.1 and 50.8 arcmin respectively.

Fig. 1; see also Staveley-Smith et al. 1996, Wilson et al. 1997 and Sinclair et al. 1997). Physically, the 13 feed horns are identical, having diameters of 240 mm at the focal plane, narrowing in steps towards the receiver end. The 26 receivers (two per feed horn) are sensitive to orthogonal linear polarizations of radiation in the frequency range 1.27 1.47 GHz. The dewar is cooled to ,80 K surrounding the feed horns, and to less than 20 K at the low-noise amplifiers. At 1.42 GHz, the mean beamwidth (full width at half power) is 14.3 arcmin, and the peak responses of the 13 beams are projected on to the sky with typical separations of 30 arcmin, or just over two beamwidths. The Multibeam correlator has an instantaneous bandwidth of 64 MHz divided into 1024 channels for all 26 receivers. This wide bandwidth offers a velocity range of 21280 to 12 700 km s21 (when the receivers are tuned to 1394.5 MHz), and a mean channel spacing of 13.4 km s21. The correlator chip is identical to that used in the new Arecibo system (Canaris 1993). 2.2 Scanning the sky Observations for HIPASS commenced on 1997 February 27, and are taken by scanning the telescope in Dec. strips of length 88. The Parkes telescope has an alt-azimuth mount, and while in principle the receiver could be rotated during each scan to compensate for the apparent rotation of the beam pattern on the sky, this is not done, as it has a detrimental effect on spectral baseline stability. Therefore, prior to each scan, the receiver assembly is rotated by the parallactic angle at the scan mid-point, plus another 158, to obtain approximately uniform coverage of the sky (see Fig. 1 and also Staveley-Smith 1997). The footprint of the receiver on the sky is ,18 7, so that each scan maps out an 88 1X78 area of sky with X reasonably uniform coverage. The exact coverage depends on the rate of change of parallactic angle (h ) during the scan. This is highest near the zenith dhadd sin h cot zY where d is Dec. and z is zenith angle). Generally, observations within 208 of the zenith have been avoided, as significant changes in parallactic angle jDhj . 108 become possible. To obtain full coverage of the sky at full sensitivity, subsequent scans are displaced by 7 arcmin in RA, as shown in Fig. 2. Since the Nyquist rate is la2D 5X7 arcmin (where l is the observing wavelength and D is the telescope diameter, in the same units), this scan displacement ensures that the sky is mapped at close to the Nyquist rate by each of the 13 beams, and gives sufficient redundancy that data can be edited by purely automated, statistically robust procedures. The scan rate is 18 min21, and the correlator signal is recorded every 5 s. Measurements of the widths

2 2.1

O B S ER VING The Multibeam system

The Parkes 21-cm Multibeam system comprises a cooled, 13 beam receiver and a digital correlator. The receiver was installed at the prime focus of the Parkes 64-m radio telescope on 1997 January 21. The 13 circular feed horns of the Multibeam are positioned in a hexagonal arrangement on the focal plane, with a single central feed, and inner and outer rings of six horns each (see
1

The Parkes telescope is part of the Australia Telescope which is funded by the Commonwealth of Australia for operation as a National Facility managed by CSIRO.

q 2001 RAS, MNRAS 322, 486 498


488

D. G. Barnes et al.
Coordinate information from the telescope is generally not synchronized with data from the correlator. As a result, coordinates in the data files refer to the position of the telescope a few seconds before the mean time-stamps applied to the data. A linear interpolation of telescope positions has been applied throughout the survey, typically resulting in corrections of a few arcmin in the Dec. direction. A comparison of the positions of more than 200 sources selected from the PKSCAT90 2700-MHz continuum source catalogue (Wright & Otrupcek 1990) with positions measured from the final survey images exhibits a standard deviation of 0.8 arcmin in RA, and slightly less than this in Dec. There appear to be no significant systematic position offsets. 3 3.1 S PECT RAL PR OC ESSING Requirements

Figure 2. HIPASS scan pattern on the sky for an 88 88 field. Five sets of 15 scans each are acquired, with adjacent scans within each set separated by 35 arcmin, and each set displaced from the previous set by 7 arcmin in RA. The solid lines with arrows depict one set of 15 scans; the arrows indicate the direction of each scan. The final scan density, for a single beam of the Multibeam receiver, is shown by the dot-dashed lines near the left-hand edge of the field.

of the beams give full widths at half power of 14.0 arcmin for the central beam, 14.1 arcmin for beams in the inner ring, and 14.5 arcmin for beams in the outer ring. The mean observing beamwidth is therefore 14.3 arcmin, but is extended to 14.7 arcmin in the Dec. direction because data are acquired while the telescope is scanning. The total integration time of the survey is 7 103 s deg22 Y or 450 s beam21. Although the Dec. scans are separated by 7 arcmin, the observations are made in five separate sets, with the scans in each set spaced by 35 arcmin. Care was taken to make the observations of a given part of the sky at well-separated times so that interference, if present, did not corrupt all data for that direction. This seems to have worked well, in general, with little sign of any solar or narrowband interference in the final images, except in a few unfortunate cases where more than ,60 per cent of the data have been acquired during the daytime, over a period of just a few days. In terms of narrowband interference, only the 11th harmonic of the 128-MHz sampler clock at 1408 MHz cz . 2640 km s21 consistently appears at moderate levels in the final cubes. This harmonic was removed in early 1999 by enclosing the correlator in a Faraday cage. 2.3 Data acquisition

Spectra generated by the Multibeam correlator contain the usual structure seen in spectra from other H i instruments: a bandpass spectrum which is the sum of the sky, ground and receiver temperatures multiplied by the product of the filters in the receiver chain; superimposed on this is a noise whose amplitude depends inversely on the square root of integration time and channel bandwidth; occasional baseline ripple whose amplitude is a complicated function of the telescope geometry and the location and strength of 20-cm continuum sources in the sky; and other internal effects such as ringing due to strong, sharply peaked sources. Any H i line emission whose amplitude is typically a few thousandths of that of the system bandpass spectrum lies on top of the structure described above. For HIPASS, algorithms were designed: to remove the system bandpass spectrum, to calibrate the residual spectrum, to remove any remaining baseline (DC) offset, to suppress strong ringing effects, and to shift the resultant spectrum from the topocentric observing frame to a fixed frame of reference. Since radio-frequency interference (RFI) had the potential to seriously contaminate a substantial fraction of the survey data, the algorithms were developed to be statistically robust to a moderate fraction of bad data. Furthermore, as the Parkes beam is significantly larger on the sky than the typical extragalactic H i source, the algorithms were designed specifically to identify compact emission regions. 3.2 Robust bandpass estimation

The Multibeam correlator is programmed to write data directly into a disk file. The correlator cycle time is 5 s, and spectra are written for each beam and polarization at the end of each cycle. There are two polarizations per beam, so a total of 26 spectra are written each cycle, each with 1024 channels. Each channel value is stored as a single-precision floating point number, occupying four bytes of storage. Thus the HIPASS raw data rate is 104 kb cycle21, or 1.2 Mb min21. The correlator file is closed and reopened each cycle to ensure that the very latest data can be read for near real-time processing. All HIPASS data are archived in their unprocessed state.

The dominant component in a raw correlator spectrum is the system bandpass spectrum, which varies with time because of, e.g., slow fluctuations in the physical temperature of the receivers, and external influences, such as atmospheric conditions which of course change with pointing direction. Traditionally, bandpass removal is accomplished by observing in a signal/reference mode. In this mode, an extended integration (e.g., 200 s) is acquired while the telescope tracks the target position, yielding a signal spectrum. A second integration, usually of the same duration, is then acquired while the telescope is pointed toward a nearby position (hopefully) free of line or continuum emission, yielding a reference spectrum. The reference spectrum is assumed to be a good estimate of the bandpass spectrum, and is subsequently removed from the signal spectrum by division. For longer onsource integrations times, this process is repeated many times, and the quotient spectra averaged. It is normal to spend as much time off-source as on-source for this observing mode, since division by the (noisy) reference spectrum increases the noise in the resultant
q 2001 RAS, MNRAS 322, 486 498


HIPASS: southern observations, calibration and robust imaging
quotient spectrum accordingly. It is also normal to select a reference position such that the sidereal telescope track with respect to the ground is similar to the track for the signal observation, in order to re-create the ground spill-over contribution to the bandpass spectrum as well as possible. For HIPASS, where the telescope is actively scanned across the sky at 18 min21, the signal/reference method of bandpass removal needs to be modified. The basic method for removing the bandpass from a selected (target) HIPASS spectrum is first to estimate the shape of the bandpass spectrum at the time the spectrum was acquired, and then to divide the target spectrum by the bandpass estimate. The bandpass estimate is determined from a set of earlier and/or later spectra observed by the same feed of the Multibeam receiver. These reference spectra are selected individually for the target spectrum, and must satisfy a number of criteria to be suitable. Suitable reference spectra should be independent measures of the H i sky to that of the target spectrum, but should be acquired nearby in time so that temporal variations of the bandpass are kept to a minimum. Since the bandpass estimate is made from a number of spectra, the increase in spectral noise caused by bandpass removal can be made negligible. Actual estimation of the bandpass is done by taking a channelby-channel median of the reference spectra, i.e., the bandpass estimate is the median reference spectrum, not the mean reference spectrum. Despite the standard error on the median statistic being 25.3 per cent greater than that on the mean statistic (for normal distributions; Kendall & Stuart 1963; Freund 1971), the median statistic is robust to a high fraction of outlying data points, and is independent of the magnitude of deviation of the outlying points. The mean statistic, on the other hand, has the characteristic that no data point can be arbitrarily distant from the general trend without affecting the statistic. Formally, the HIPASS bandpass estimate is determined independently for all 26 feeds (13 beams, two polarizations). Since the 13 beams track out nearly parallel paths on the sky, it is sufficient to generate a single list of cycle numbers, or equivalently integration time-stamps, which refer to integrations which contain spectra that are valid for estimating the bandpass spectrum at the instant of the target spectrum. This set of cycle numbers, wT, is calculated once for the central beam, and assumed to be good for all other beams. Only in the case of unusual observing circumstances would this assumption be invalid, e.g., spiral scanning, or rapid rotation of the receiver while acquiring data. The suitability of a particular cycle for inclusion in the set wT is determined as follows. (i) Only spectra from the same Dec. scan as the target spectrum are used as reference spectra. Between scans, the receiver is rotated slightly, thus changing the observing geometry, and hence the sidelobe radiation pattern. (ii) The displacement on the sky from the target integration position must be at least 15 arcmin, in order to measure an independent part of the sky to that viewed by the target integration. Even though the beam half-width is only 7 8 arcmin, depending on the position of the beam on the focal plane, this minimum distance is set conservatively to prevent a particularly strong point source, or instead a nearby and extended source, from being partially removed during bandpass removal. (iii) The displacement in time from the integration time-stamp of the target cycle must be less than 120 s. This is principally a restriction to enable near real-time processing of the data, and provide immediate feedback to the user on the data quality.
q 2001 RAS, MNRAS 322, 486 498

489

However, this criterion also ensures that the spectra used to generate the bandpass estimate are not drawn from a long period of time over which the bandpass may have varied significantly. Once the set of cycle numbers (wT) is determined, the bandpass estimate is generated channel by channel, for each polarization of each beam of the receiver. For beam b 113 and polarization p (1 or 2) of the target cycle cT, the bandpass estimate as a function of channel number n 11024 is: BcT
YbYp

n median{ScYbYp n : c [ w T }Y

1

where Sc, b, p is the raw correlator spectrum for cycle c, beam b and polarization p. There are two points worth special mention regarding equation (1): first, in this formalism the bandpass estimate for the nth channel is independent of the bandpass estimate for any other channel and, secondly, an immediate consequence of the choice of the median statistic is that spatially unresolved spectral line sources will be absent from all bandpass estimates, provided that the set wT is sufficiently large (compared to the source density on the sky), independent of (ii) above. In particular cases, though, the median estimate may be slightly biased by the presence of a strong source in a subset of wT ; see Fig. 3. 3.3 Bandpass removal and calibration Calibration of the spectra is done concurrently with bandpass removal, since system temperatures are recorded for every

Figure 3. An example of negative bandpass sidelobes generated by bandpass correction north and south of a strong H i line source: the source at the centre is ESO 214 2 G017Y and the fainter source to the north east is ESO 264 2 G035X The contours are at 250 and 220 mJy (dashed), and 20, 50 and 200 mJy (solid).


490

D. G. Barnes et al.
3.4.1 Spectral smoothing A characteristic of digital correlators is that their spectra often suffer from what is commonly known as the Gibbs ripple. The spectral response function of a digital correlator is the Fourier transform of the time-lag weighting function used in the correlator. In the Multibeam correlator, the lag weighting is nearly constant as a function of lag so that the spectral response function is, to first order, a sinc function. When narrow lines are present in the data, this spectral response causes strong ringing in the spectra that decays roughly as n21, where n is the number of channels away from the narrow line. Given that in many positions on the sky the Galactic H i emission is strong and contains (in terms of the spectral resolution of the Multibeam receiver) narrow emission lines, this ringing can seriously affect the data over a large velocity range (see Fig. 4 for an example). A standard procedure to suppress this ringing is to Hanning-smooth the data. Such a smoothing very effectively suppresses the sidelobe level (the first spectral sidelobe decreases from 26.7 to 215.9 dB) and more importantly, the sidelobe level decays as n23. Because of these effects, the region of the spectrum affected by the ringing is much smaller, and a larger fraction of the spectrum can be used.
Unfiltered

spectrum written to the correlator files. The system temperature, T cT YbY p , of the raw target spectrum, in Jy, indicates the total power measured by the correlator, calibrated against a calibration diode which is constantly switched in and out of the signal path. The diode itself is occasionally calibrated against an extragalactic radio source of known amplitude, e.g., 19342638 (14.9 Jy at 1420 MHz) or Hydra A (40.6 Jy at 1395 MHz). T cT YbYp is read directly from the correlator file. The system temperature of the bandpass estimate EcT YbYp must be estimated. This is accomplished by treating the system temperature in exactly the same way as a single channel, and taking the median of the system temperatures for the cycles in wT, i.e. E
cT YbYp

median{T

cYbYp

: c [ w T }X

2

Then the bandpass-removed, calibrated target spectrum ScT YbYp is obtained by comparing the raw correlator spectrum to the bandpass estimate, scaled by the ratio of the system temperatures: ScT YbYp EcT YbYp ScT YbYp T cT YbYp 21 BcT YbYp T cT YbYp ScT YbYp 3 2 T cT YbYp X EcT YbYp BcT YbYp In practice, some H i sources are sufficiently bright that bandpass estimates for target cycles just prior to and just after the cycle when the source makes its closest approach to a beam axis are elevated by the source itself. This results in bandpass sidelobes which are depressions in the spectra north and south of strong H i sources. The depressions are generated during the division of the target cycle by a bandpass estimate containing positive H i flux. Note, though, that depressions do not occur in the sources themselves, unless they are spatially extended (compared to the Parkes beam and the restrictions on wT) or brighter than ,1 Jy: this is prevented for all but the strongest sources by the second criterion in generating wT. In Fig. 3 an example of the sidelobes associated with a strong source, as they appear in a gridded sky image (see Section 4), is given. HIPASS bandpass sidelobes are generated during the bandpass removal stage of the HIPASS spectral processing. Thus the H i emission from extended H i sources, including the Galaxy, the high-velocity clouds (HVCs), the Magellanic Clouds and Stream, and other nearby galaxies such as Circinus, is corrupted in the standard HIPASS calibrated data, and therefore also in the HIPASS sky images (see Section 4). However, since all unprocessed HIPASS spectra are archived, the spectra can be reprocessed using different techniques, e.g., algorithms which preserve spatially extended emission. Such an approach has already been applied to the data in order to successfully image HVCs and the Magellanic Stream (Putman et al. 1998). 3.4 Residual baseline removal

30

S (arbitrary units)

20

10

0

500

0

500 Vhel (km s )
1

1000

1500

Tukey 25%

30

S (arbitrary units)

20

10

Whilst bandpass removal and calibration ordinarily yields spectra that are flat, excluding receiver noise and spectral line sources, there are cases where the baseline is not flat. The two predominant causes of non-flat baselines for the HIPASS project are ringing associated with strong Galactic H i emission, and continuum emission extending over the wavelength range 20 22 cm which can produce standing wave patterns in the telescope structure, thereby causing residual ripple after bandpass removal. The first of these spectral ringing is relatively straightforward to correct, by smoothing every spectrum with a well-chosen kernel.

0

500

0 V

500
hel

1000
1

1500

(km s )

Figure 4. An unfiltered (top) and Tukey 25 per cent smoothed (bottom) HIPASS spectrum containing strong Galactic emission near 0 km s21 and extragalactic emission at 1000 km s21. The ringing in the unfiltered spectrum is essentially removed in the smoothed spectrum. q 2001 RAS, MNRAS 322, 486 498


HIPASS: southern observations, calibration and robust imaging
However, the spectral resolution after Hanning smoothing is degraded by 67 per cent, and some detail in the signal is lost. In order to suppress the Gibbs ringing in Multibeam spectra, while retaining most of the information in the signal, a Tukey filter was selected for smoothing purposes. This filter is applied in the lag domain, and can be written as V for jxj , f xmax b1 ` T f x 1 1 b 1 cos p jxj 2 fxmax for fxmax # jxj # xmax Y X xmax 2 fxmax 22 4 where xmax is the maximum lag, and 1 2 f is the fraction of the lag spectrum that is tapered. The Tukey filter is very similar to a Hanning filter (in fact, it is identical for f 0Y and the sidelobe level of a filtered spectrum also decays as n23. Hence the region of the spectrum affected by the ringing is limited. The main difference with a Hanning filter is that the tapering is performed only on part of the lag spectrum as controlled by the parameter f. This means that the sidelobe level is higher, but also that the spectral resolution is degraded much less. For HIPASS spectra, 1 2 f is set to 0.25, and the resolution is degraded by only 15 per cent compared to the full resolution (and the first sidelobe is at 26.9 dB). Fig. 4 also shows an example smoothed spectrum, illustrating that the ringing is much reduced and only a very small part of the spectral is not usable because of the ringing. Fig. 5 shows that the spectral resolution is much less affected by this Tukey filter than by the Hanning filter, and that more detail in the signal is retained. 3.5 Doppler tracking

491

yields a spectrum in the selected frame. The velocity frame conversion is applied after bandpass removal, since the bulk of the bandpass spectrum is internally generated in the observing frame. 4 I M AG I N G

4.1 Requirements The result of the spectral processing of a single HIPASS scan is a set of calibrated H i spectra for 1300 unique positions on the sky. The scanning approach delivers typically 1500 spectra per square degree of sky, ,55 of which are acquired by each Multibeam feed. For the most part, these spectra are free of continuum ripple, of baseline curvature, and residual bandpass effects. Indeed, they are mostly free of H i line emission too, given the sparsity of galaxies on the sky. Furthermore, since the integration time is only 5 s, the root mean square (rms) noise level in the individual spectra is typically 72 mJy (see Section 4.3.2), and so only the very brightest H i sources are ever seen in single spectra. Some operation is therefore required to compile the individual spectra into high signal-to-noise images of the H i sky. As well as dramatically improving the visibility of low-flux sources, such an operation produces a more compact and natural representation of the H i sky. This operation is the primary task of the Multibeam imaging software. Projection centres have been defined for 388 images covering the southern sky. The projection used is the orthographic projection (see Kellaway 1946) with reference (tangent) point at the centre of each image. Spatially the images measure 88 88Y where the width in RA is measured at the centre of the image. For the HIPASS pixel size of 4 H 4 H (see Section 4.2.2), the images have dimensions of 170 160X Spectrally, each image comprises 1024 planes (channels), extending in cz from 21280 to 12 700 km s21. Each image is generated from 75 HIPASS scans, and has typically a 10 per cent sky overlap (by area) with adjacent images. 4.2 Robust gridding Conversion of the individual spectra into position-positionvelocity cubes requires placing the spectra on a regular grid, i.e., gridding. For each pixel in the map, the gridding process must: (1) determine which spectra will contribute to the pixel; (2) reject those spectra which appear to contain corrupt data; (3) ascribe a weight to each remaining spectrum, and (4) calculate the value of the pixel based on the input data and weights. The gridding algorithm used for the survey is optimized for point sources: more than ,90 per cent of HIPASS sources are expected to be small in angular size compared to the Multibeam resolution. 4.2.1 Algorithm The objective of gridding is to reconstruct the flux at a certain position (pixel) on the sky, given an arbitrary number of single beam spectra measured at irregularly distributed positions near the target pixel. Fig. 6 shows a representative example of the problem: two observations A and B have been made near the target pixel P. H Observation A measures a flux FA at a distance rA from the pixel H centre, and B measures a flux FB at a distance rB from the pixel centre. Two possible sources C and D are shown in Fig. 6, having

Observations for HIPASS are made in topocentric mode, wherein the observing frequencies for each channel remain fixed throughout the survey. Consequently, the velocity range observed varies with time and position, and some correction must be made to the spectra to align them in a fixed reference frame. This is accomplished in the Fourier domain. First, for each spectrum, the velocity shift required to place the spectrum in the frame of the Solar System barycentre is calculated: this depends on observing epoch, sky position and antenna location. This shift is converted to a phase gradient across the spectrum, and is applied to the Fourier transform of the spectrum. The subsequent inverse transform

0.2 Unfiltered Tukey 25% Hanning

0.15 S (arbitrary units)

0.1

0.05

0

850

900

950 Vhel (km s )
1

1000

1050

1100

Figure 5. Comparison of the effects of Tukey 25 per cent and Hanning smooths for a strong, resolved signal. q 2001 RAS, MNRAS 322, 486 498


492

D. G. Barnes et al.
admirably. Thus Fe medianF H Y medianw 8

B P A D C

which we define as median gridding. Note that the mean estimator, which we have replaced with the median, was only one of many that could be chosen. Furthermore, alternative weighting schemes are possible, e.g., Fe

qw n F H Y qw n11

9

Figure 6. Diagram showing the positions of two observations A and B near a target pixel P, with two sources C and D in the field. The dashed circles indicate the half-power radii of the observations A and B.

intrinsic fluxes FC and FD respectively. It is, of course, generally H true that F H , F C 1 F D and FB , F C 1 F D X A Provided that the instrumental beam shape is known (or can be estimated), the flux in the pixel P can be estimated as follows: the beam shape (beam sensitivity) is introduced as a weighting function w(r) which depends only on the distance from the beam centre r, with w0 ; 1 and w1 ; 0X Then F A F H awr A Y A
H F B FB awr B Y

where q is some statistical estimator (e.g., mean, median, etc.), and n is a measure of the importance of the weights. Generally, q mean and n 1 will give the minimum variance result for interference-free data. It is actually possible to estimate the image error per pixel by slightly modifying the gridding algorithm. Equation (9) can be even more generally written as Fe

ww n F H Y qw n11

10

5

where FA and FB are the reconstructed summed fluxes of the sources C and D as measured by A and B.As r A . r B and wr A , wr B Y FA is a less certain measurement of the flux in sky pixel P than is FB. Then the weighted mean of FA and FB yields a (simple) statistical estimate of the flux Fe which may be assigned to the pixel P, with the weights (again) provided by the beam sensitivity w(r): Fe wr A F A 1 wr B F B X wr A 1 wr B 6

Substituting the reconstructed fluxes into this equation, and generalizing to N pointings, we find: N N H H Fi Fi aN meanF H F e i1 i1 X 7 N N meanw wi wi aN
i1 i1

Unfortunately, for real data, mean (F H ) can be severely corrupted by artefacts in the data like interference. In principle, this could be corrected by explicitly removing bad data, either manually or automatically, using some pre-defined quality criteria. In practice, however, manual data removal is tedious work subject to human error, and automatic data removal often depends on arbitrarily chosen criteria. A far better way is simply to replace the mean statistic used in equation (7) with a more statistically robust estimator the median in keeping with the philosophy applied elsewhere in the Multibeam processing. An estimator like the median is robust to a substantial presence of outliers, for example, measurements corrupted by interference. The statistic itself acts to identify the unusual elements in the sample, and requires no arbitrary criteria to do so. As long as the fraction of corrupt data in the sample is less than ,30 40 per cent (as is the case for nearly all samples of the survey data) the median statistic performs

where different statistical estimators w and q are used in the flux and weight calculations respectively. In this form, with w selected as a measure of dispersion, and q selected as a mean-like estimator (e.g., mean, median, ), the resultant gridded image describes the error present in each pixel according to the selected dispersion measure. Though not used for the purposes of this paper, images constructed with robust dispersion measures are proving to be extremely important for assessing the significance of source detections in the HIPASS images. Equation (7) (and therefore equations 8 and 9) neglects the displacements between the observation A and B and the sources C and D in Fig. 6. These observation source displacements cannot be known without adopting an iterative sky reconstruction technique. Despite this, equation (8) works well for the HIPASS data, which is highly redundant and over-sampled. While the source C in Fig. 6 is within the half-power radius of observation B, it will only be measured at a fraction of its intrinsic strength, because the observation itself applies a beam-weighting to the sky source distribution. Source C will be invisible to observation A, thereby weighting its contribution to pixel P even lower. Generalizing to the case of highly over-sampled data, and applying a nearness-to-P requirement, the majority of measurements considered in equation (8) for pixel P will not detect source C. On the other hand, source D is likely to be detected by most observations clustered around P, and will therefore be reconstructed at a modest fraction of its intrinsic flux in pixel P. For a sky containing a single point source centred on pixel P, equation (8) will reconstruct exactly the flux of the point source. 4.2.2 HIPASS-specific parameters The value of median (w) depends primarily on the weighting function w(r) used and the radius rmax out to which spectra will be included for the calculation of a single-pixel value. For the HIPASS, the telescope beam is assumed to be a two-dimensional Gaussian function, so the weighting function takes the following form: V 2 ! b ` exp 2 r a2 for r # r max s wr 11 b X 0 for r . r max X
q 2001 RAS, MNRAS 322, 486 498


HIPASS: southern observations, calibration and robust imaging
The mean beamwidth of 14.3 arcmin is used, corresponding to s 6X1 arcminX The generalization of median gridding to use a weighting function per beam, while simple, has not been made, since it is unlikely to affect the images in any significant way. The choice of smoothing radius (rmax) determines how many spectra contribute to each pixel value, and therefore partly determines the final image noise level. The smoothing radius also ultimately decides the final gridded beamsize because of quantization noise. Additionally, the choice of pixel size affects the correlation between pixels and, consequently, the apparent image noise. Tests have shown that smoothing radii less than 4 arcmin do not provide sufficient robustness; this is simply because the number of HIPASS spectra per pixel is too low (typically 30 40). Large smoothing radii (.10 arcmin) cause loss of resolution and positional accuracy. Following extensive testing, it was evident that a compromise was necessary between obtaining the lowest possible image noise and obtaining the highest possible resolution images. The HIPASS smoothing radius was selected to be 6 arcmin, and the pixel size to be 4 arcmin. For the selected imaging parameters, the distribution of w(r) was investigated. For a random distribution of observations, and a smoothing radius of rmax, the median of the weights [median (w)] is simply the beam (weighting) function evaluated at the radius which divides the smoothing area equally in two, i.e., p wr max a 2X For r max 6 arcminY median w 1X28X Tests during gridding HIPASS images show the distribution around this value to be very narrow, indicating that the input spectra are very nearly randomly distributed on the sky. 4.3 4.3.1 Properties of median-gridded images Robustness

493

The motivation for the median-gridding algorithm was the potential for automated statistical robustness to corrupt data arising from externally generated interference or an observing system failure. This objective has been achieved, as Fig. 7 exhibits. The Global Positioning System (GPS) L3 beacon near 1380 MHz is powerful (typically more than 10 Jy is measured when the offending satellite is in the distant sidelobes of the telescope beams!) and is visible in up to ,5 per cent of HIPASS spectra. Without robust gridding, the GPS signals are preserved in the HIPASS sky images, manifested as severe declination striping for instances when the GPS signal persisted for several minutes, and as beam imprints on the sky when the GPS signal is brief (see Fig. 7, top panel). Median gridding, by comparison, is mostly impervious to the GPS signal, which corrupts only a small fraction of the spectra taken at that position on the sky (Fig. 7, lower panel). Only in cases where the GPS signal is present for an extended period of time will the median gridding fail to remove the offending signal completely, thus GPS signals are seen in a small but non-negligible number of HIPASS images. 4.3.2 Image noise

Figure 7. Planes from mean (top) near 1380 MHz. The powerful L3 corrupted the mean gridded image almost completely removed by the

and median-gridded HIPASS images beacon of the GPS has completely (in this and nearby channels), yet is robust median-gridding algorithm.

The expected rms noise in single-polarization HIPASS spectra is 72 mJy. This is calculated assuming a mean system temperature of 19 K (measured for 1350 spectra for each feed, sampled from 18 months of data), a correlator efficiency of 82 per cent, and a noise equivalent bandwidth of 1X10 62X5 68X8 kHzY where this adjustment is made for the Tukey 25 per cent smooth that is applied to the spectra. In order to determine the rms noise present
q 2001 RAS, MNRAS 322, 486 498

in the images, the number of spectra contributing to each pixel is required. The total integration time for the survey is 450 s beam21, and for the r max 6 arcmin smoothing radius, this corresponds to 220 s pixel21 in the gridded images. The number of singlepolarization spectra contributing to the calculation of any pixel therefore is ,88, since each spectrum takes 5 s to gather, and there are two independent linear polarizations. Thus the rms noise expected in mean gridded HIPASS images is p 72a 88 7X7 mJyY assuming that the rms noise in the individual spectra is close to a normal distribution. For median gridding though (see equation 8), the noise level is increased due to the variance on the median estimator being higher than that on the mean estimator by 25.3 per cent (Kendall & Stuart 1963; Freund 1971). Additionally, the (narrow) scatter in the weighting factor in equation (8) increases the noise in the images by a further 3.2 per cent (measured from the distribution of weights), giving an image


494

D. G. Barnes et al.
Table 2. Image beam dimensions and noise as a function of peak flux, for point sources. See text for a description of the columns. Source peak flux (mJy) (1) 3.0 5.0 1.0 1.5 1.0 1.0 101 101 102 102 103 104 Image noise (mJy) (2) 12.5 12.6 12.7 13.0 31.2 350 measured image beam Shape Error arcmin arcmin (arcmin) (3) (4) 19 18 17 17.6 15.6 15.3 17 16 16 16.0 14.9 14.6 6 4 2 0.6 0.3 0.2 intrinsic image beam Shape Area arcmin arcmin (arcmin2) (5) (6) 16.6 16.2 16.4 16.3 15.1 14.8 16.0 16.1 16.2 15.9 14.9 14.6 300 295 300 292 255 244

rms noise level of 9.9 mJy. Finally, the median weight term in equation (8) (i.e., the denominator) effectively globally scales the images by a factor of 1.28; thus the rms noise level measured in median-gridded HIPASS images should be 12.7 mJy at best. Simulations of point sources placed at various offsets from the centre of a gridding top-hat, with input noise 72 mJy, and 88 unique observations per top-hat, confirm rms noise levels of 12X7 ^ 0X2 mJy [see Table 2, columns (1) and (2)]. The rms noise is elevated for sources stronger than ,1 Jy, because there is a much wider variation in flux in the samples made within the gridding top-hat; note that the signal-to-noise ratio SaN is typically around 30 or greater for such strong sources. For the HIPASS images themselves, measurements show the canonical rms noise level to be 13.3 mJy. Careful selection of a region devoid of H i sources, continuum sources, and residual solar ripple (see Section 5.1) enables the measurement of rms values approaching ,12.8 mJy. Thus, in places, the theoretical noise limit of the survey is attained. 4.3.3 Beam properties The HIPASS image beam is manufactured from 26 unique telescope beams, each of which has been elongated by the active scan (effectively convolved with a 5 arcmin top-hat in the Dec. direction), and then convolved with the gridding kernel, which is a 12 arcmin top-hat function. However, since the median gridding is a markedly non-linear process, the actual HIPASS image beam is statistical rather than definite. The beam depends not only on the smoothing radius and pixel size, but also on the sky sampling (and therefore position in the image), source strength (akin to S/N), and source shape. The only way to determine canonical values for the image beam, therefore, has been to construct and analyse simulations. Dependence on signal-to-noise ratio. A simulation to assess the dependence of the beamwidth on S/N for point sources was designed as follows: a circle of radius rmax 6 arcmin was laid down having its centre at the origin of a Cartesian plane. A source of strength S mJy was placed at (x,0), with x measured in arcmin along the X-axis. 44 random positions within the circle were generated, and the measured flux calculated twice (once per polarization, each measuring the same flux, but having different noise contributions) for each position assuming a Gaussian beam of width 14.3 arcmin (the mean ungridded beam). The smearing of the beam in the dec. direction by the active scanning was introduced by actually taking the average flux over a set of positions displaced from each random position by displacements in the interval [22.5 H ,2.5 H ]. Normally distributed noise with rms

72 mJy was added to the 88 flux measurements, and the median flux measurement calculated and scaled by the median of the weight values for all positions, according to equation (8). Varying x and S, and repeating the simulation a large number of times, yields the gridded beam shape for various peak flux image signalto-noise ratios S/13.3. The results of these simulations are presented in Table 2. In column (3) the measured image beam dimensions are given for a series of input source peak flux values. The data provided are the mean widths for Gaussian fits to 50 independent simulations, and the standard deviations of the width distributions are given in column (4). Some component of the beam widths measured in this way, however, will be due to problems inherent in fitting Gaussians to relatively noisy data. Therefore a more meaningful estimate of the image beam has been obtained for each source by adding the 50 simulation images together and making one Gaussian fit to the resultant image. This yields the beam dimensions given in column (5), and the beam area in column (6). While columns (3) and (4) give the range of width measurements which in practice will be determined from Gaussian fits to HIPASS images,2 columns (5) and (6) are more representative of the intrinsic image beam, and should be used for image analysis purposes. Point sources. Simulations which inject false sources into genuine survey data prior to imaging show directly that median gridding produces correct peak fluxes (typically within 1 per cent of the expected value) for point sources. This is expected, since equation (8) has been constructed with this in mind. To calibrate the integrated flux scale, it is useful to consider the area of the image beam (Ab), which in arcmin2 is defined to be Ab ; 1X13 Si px py X S 12

S is the integrated flux of the input (simulated) source, Si is the summed flux in the output image not corrected for the beam, and px and py are the image pixel dimensions in arcmin. The premultiplier 1.13 is equal to pa4ln2Y the constant term in the area of a Gaussian beam. Inspection of the integrated flux in the simulated point sources gives Si 1X59 Jy pixel for S 1JyY and thus Ab 243 arcmin2 X The beam area calculated for a 1.00 Jy source from Table 2 is 255 arcmin2; thus the image beam for a 1.00-Jy source deviates from Gaussian by ,5 per cent.
2

Actually, the situation is slightly better than this, since the values in Table 2 are appropriate for fits made to a single plane of an image; in practice, source fits are normally made to intensity maps integrated over several channels. q 2001 RAS, MNRAS 322, 486 498


HIPASS: southern observations, calibration and robust imaging
Table 3. Flux corrections to be applied to measurements made on HIPASS images. See text for a description of the columns. Source width (arcmin) (1) 0 5 10 15 20 30 40 Peak flux scale (2) 1.00 0.97 0.92 0.86 0.83 0.79 0.78 Flux-weighted beam area (arcmin2) (3) 244 256 268 275 283 289 293

495

are not intrinsically Gaussian, it is not possible to provide generic beam shape or flux calibration tables. In such cases, simulations pertinent to the particular source geometry must be constructed. Note also that sources extended much beyond ,40 arcmin can be imaged poorly in HIPASS images because of the bandpass sidelobe effect described in Section 3.3. 5 P OST - PR OCESSING

5.1 Continuum ripple removal As is the case for observations made with most H i observing instruments, continuum sources moving through any of the beams or sidelobes of the Multibeam generally manifest themselves as sinusoidal ripple in the spectral baseline. The main reflection path in the Parkes telescope, from the receiver directly to the dish centre and returning to the receiver, i.e., twice the focal length, is 52 m in length, which corresponds to a time displacement of 0.17 ms. This time delay (appearing in the lag spectrum as a spike) gives rise to a sine wave in the Fourier transform spectrum having this wave number or, equivalently, a period of 5.7 MHz in frequency space or 1X2 103 km s21 in velocity space. Solar emission is the most common cause of ripple, and is problematic for a high fraction (say ,40 60 per cent) of spectra acquired during daylight hours. However, the apparent movement of the Sun against the extragalactic sky means that solar ripple conditions vary over the course of a year, and consequently do not persistently affect spectra of a particular part of the sky. All other continuum sources which lie at fixed positions on the sky can inflict baseline ripple on all spectra within a few arcminutes of the sources. In particular, sightlines directed towards continuum sources, or near strong, extended sources like the Galactic plane or Cen A, always suffer from baseline ripple. When most or all of the spectra from a particular part of the sky are plagued by baseline ripple, the ripple will be embedded in the gridded images, and it therefore deteriorates image quality to a much greater degree than does solar ripple. An additional baseline distortion seen in calibrated HIPASS spectra taken near strong continuum sources is a gradual rise in flux towards the high-velocity (low-frequency) end of the observing band. There is a clear dependence of the slope of this deviation on the strength of the underlying continuum source (see below). This distortion suggests that the response of the Multibeam receivers to strong continuum sources and therefore mildly increased system temperature is a noise spectrum whose amplitude is elevated at lower frequencies. The multiplication of this spectrum by the correlator bandpass, and subsequent division by a weak source bandpass spectrum generates a baseline which curves upward at low frequencies. Inspection of the HIPASS cube data base shows that spectra taken from HIPASS cubes are well described by the expression Sn N n 1 Rn 1 k Bn 1 H nY 14

Extended sources. The median weight [median (w) in equation 8] can be interpreted as a correction for the case where a set of observations are made on and around a point source centred on a pixel in the output image. Without the median weight correction, the value assigned to the pixel will be consistently lower than the true peak flux, as the few observations made directly on-source are at the extremes of the distribution, and consequently are ignored by the median operation. In the case of an extended source, however, the median weight correction over-corrects, resulting in peak fluxes that are too high. For the extreme case of an infinitely extended source (i.e., any source larger than say 60 arcmin), the over-correction is by exactly the factor median (w) in equation (8), or typically 1X0a0X78 1X28X Simulations have been used to provide a table of peak flux corrections for sources of various sizes, where the peak flux measured in the image is scaled by a factor between 1.0 (for point sources) and 0.78 (for infinitely extended sources). These corrections are given in Table 3, where column (1) gives the simulated source width, and column (2) gives the factor by which to multiply peak fluxes measured in HIPASS images to obtain correct source peak fluxes. For integrated fluxes of extended sources, simulations have been used to measure the flux-weighted beam area as a function of source size, for a fixed peak flux of 1.00 Jy. The input integrated flux S is given by S 14X32 1 u 2 Y 14X32 13

where u is the full width at half power of the simulated (Gaussian) source. The results are given in column (3) of Table 3, which gives the flux-weighted beam area to use when calculating integrated fluxes of point and extended sources in HIPASS images. In summary, to measure the H i flux of a source found in a HIPASS image, it is necessary to first deduce (by fitting the source of interest and comparing the result to Table 2) whether the source is a point or extended source. For the first case (corresponding to probably more than 90 per cent of extragalactic H i sources visible in the HIPASS images), the source peak flux is exactly that measured in the HIPASS image, and the source integrated flux is the summed flux in the region of the source, divided by the appropriate flux-weighted beam area from Table 3 (in this case, 243 arcmin2), multiplied by the pixel area (i.e., 16 arcmin2). For a source which appears to be extended, provided that the assumption that the source is intrinsically Gaussian is valid, the source size must be estimated by deconvolving the fitted image size with a suitable beam from Table 2, and then the corresponding flux scale and flux-weighted beam area from Table 3 used in place of the values for the point source case. For extended sources which
q 2001 RAS, MNRAS 322, 486 498

where S(n) is the flux in channel n, N(n) is the image noise, R(n)is residual continuum ripple associated with solar and spill-over radiation, B(n) is the bandpass curvature and continuum source ripple, and H(n) is any H i signal present. Typically N(n) approximates 1/f-noise (Voss 1988) with an rms value of 13.3 mJy, and R(n) is all but eliminated by robust statistics operating on spectra acquired under different observing conditions. For positions in HIPASS images coincident with strong continuum sources, the component B(n) dominates, and to first order is constant in shape


496

D. G. Barnes et al.

for all sources. This important feature makes it possible to remove the bulk of spectral ripple and curvature from HIPASS images. The coefficient k is predominantly proportional to the increased system temperature (compared to the average system temperature) of the (ungridded) spectra which were combined to produce that part of the image. That is, the magnitude of k is correlated with the strength of any continuum sources at that position. The principal steps in the method used to accomplish the removal of B(n) from S(n) hereafter the scaled template method are the formation of a canonical baseline [i.e., a template for B(n)], and the statistically robust excision of k Bn from all spectra (pixel by pixel) across a HIPASS image. Both parts make use of the same basic algorithm, and at present are applied directly to the gridded images. Future reprocessing may be able to make use of the information gathered on B(n) and apply it to spectra prior to gridding. The method by which the canonical baseline Sc is constructed is to form the weighted sum Sc
M wi Si i1

15

of the continuum source spectra exhibiting the most severe spectral ripple and curvature in a HIPASS image. These spectra Si are found by selecting those with the largest positive fluxes at the low-frequency end of the spectrum. The maximum signal-to-noise ratio of this weighted sum over terms which include uncorrelated noise of uniform variance is accomplished by setting the weights wi proportional to the amplitude of the terms themselves. In the present case, that means setting the weights equal to the value of k for each spectrum in the sum. To determine k, note that, in the absence of significant contributions from the R(n)or H(n) terms in equation (14), Si a Sj 1 N H Y 16

Figure 8. Moderate spectral baseline ripple and curvature as seen in a spectrum at the position of the 3.9-Jy continuum source NGC 4696 (top), and a spectrum at the same position in the image after cleaning with the scaled template method (bottom).

where Si and Sj are two arbitrary spectra, a ki akj Y and N H is a noise term given by N Ni 2 a NjX
H

formed as follows: S Hi Si 2 aiYc Sc N 1 R 1 H X 18

17

Hence a scatter plot of Si against Sj should give a straight line (plus noise) with a slope equal to ki/kj. Provided that the maximum excursion of that member of the pair with greater amplitude is significantly greater than the standard deviation of the noise, and that ki is not very large compared to kj, a linear regression can be performed to give the slope ai,j. The need to avoid contamination of the canonical baseline by H i signals is addressed by an iterative process that flags any points which depart by more than 5s from the straight-line fit to the scatter plots, and then performs the regression without these points; this cycle continues until no further points are being flagged. This procedure works only if an H i signal appears (for a given channel) in one spectrum but not in the others. This is not true for the contribution from the Milky Way: these channels (less than 5 per cent of the total) are therefore not included in any of the scatter plots. After formation via equation (15), the canonical baseline Sc is passed through a narrow median filter, typically nine channels wide, to further reduce the high spatial frequency noise component. Sc can then be used to remove continuum source distortion components k Bn from the whole cube. This is done in exactly the same way that Sc is formed, namely, by making scatter plots of each spectrum Si against Sc and performing a linear regression to derive the slopes aiYc ki akc X The cleaned spectra S iH are then

The noise variance is increased only slightly, since Sc is relatively noiseless compared to Si, and the H i contribution H is undisturbed. A typical example of ripple and curvature seen in a HIPASS image, and the result of processing that image to remove the canonical baseline, is given in Fig. 8. 5.2 Mosaicking

There are 388 HIPASS images, each measuring 88 88 on the sky. Because of the significant overlap in RA and Dec. of adjacent cubes, each discrete source can be examined with optimum sensitivity and with sufficient surrounding blank field in a single cube. However, for slightly extended sources, for removing cube edge-effects, and for making a large field for display purposes, it is desirable to mosaic several cubes together. The alternative to mosaicking, which is to regrid the original scan data, is often impractical due to the disk space and processing time needed. For the ith data cube, a weight image Wi(x, y) is defined to be the number of data points contributing to the flux density image Fi(x, y) (which itself is the scaled median of these data points). The mosaicked image is then: M xY y
i

W i xY yF i xY y Y i W i xY y

19
q 2001 RAS, MNRAS 322, 486 498


HIPASS: southern observations, calibration and robust imaging
where it is assumed that all cubes and weight images have identical projections. Points with a weight W i , 0X15kW i l are excluded from the calculation to preserve robustness. In practice, HIPASS cubes have different tangent points, so are usually reprojected (e.g. with the miriad task regrid) to a common tangent point near the intended centre of the mosaic. The weight for each pixel is nominally inversely proportional to variance, so equation (19) is an optimum estimator for normal data. It does not give an identical cube to one produced by, say, regridding the calibrated scan data on to the same large output grid. This is because of the non-linear median estimator used in the gridder. Nevertheless, in practice, similar data quality is obtained, and it is much faster and more convenient to use equation (19) on already existing cubes than to start again from the scan data. An example wide-field HIPASS image is shown in Fig. 9. The image is a 168 258 map of integrated H i emission in the Sculptor region. The moment map has been generated over a velocity range of 200 550 km s21, which includes the main members of the Sculptor galaxy group, numerous other galaxies, and a large number of high-velocity clouds in the south, most of which comprise part of the Magellanic Stream. A detailed analysis of the neutral hydrogen distribution in the Sculptor group is in preparation (Koribalski et al., in preparation). 6 D A T A AVA I L A B IL ITY

497

One of the important goals of the HIPASS project is to deliver to the astronomical community its images of the southern H i sky. In 1999 November, a preliminary release of data from 11 of the total 388 HIPASS images was made. This release, which provided web access to H i spectra in the Centaurus A region, was extended in 2000 May to the entire southern sky. The release pages can be accessed via the `Parkes 21-cm Multibeam Project' page at the address: www.atnf.csiro.au/research/multibeam/ multibeam.html The data in the Centaurus region is the subject of a paper by Banks et al. (1999). Downloaded spectra are extracted from HIPASS images which have been cleaned using the procedure described in Section 5.1, and smoothed spatially to a pixel size of 8 arcmin. Users of the data release are requested to acknowledge the Australia Telescope National Facility as follows: The Parkes telescope is part of the Australia Telescope which is funded by the Commonwealth of Australia for operation as a National Facility managed by CSIRO. Full-resolution H i images will be released in late 2000. 7 S O F TW ARE AVAILABILITY

The software which implements the spectral processing algorithms described in Section 3 is called livedata, and is available in the atnf package of AIPS11. The software which implements the median gridding algorithm is known as gridzilla, and can also be found in AIPS11. A C KNO W LEDGMENTS The Multibeam system was funded by the Australia Telescope National Facility (ATNF) and an Australian Research Council grant. The collaborating institutions are the Universities of Melbourne, Western Sydney, Sydney and Cardiff, Mount Stromlo Observatory, Jodrell Bank Observatory and the ATNF. The Multibeam receiver and correlator was designed and built by the ATNF with assistance from the Australian Commonwealth Scientific and Industrial Research Organisation Division of Telecommunications and Industrial Physics. The low noise amplifiers were provided by Jodrell Bank Observatory. The Multibeam Survey Working Group is acknowledged for its role in planning and executing the HIPASS project. The HIPASS algorithms are implemented in code built upon AIPS11, and the authors are especially grateful for the extensive support provided by various AIPS11 consortium members, and especially for travel funding provided by the ATNF. We express particular thanks to the following AIPS11 personnel: Tim Cornwell, Ger van Diepen, Bob Garwood, Brian Glendenning, Neil Killeen and Darrell Schiebel. We are grateful to Richard Gooch for providing visualisation services in the on-line software implementation. Finally, we express our sincere gratitude to the staff of the Parkes Observatory who have provided magnificent observing support for the survey since the very first HIPASS observations in early 1997.

Figure 9. A sample of HIPASS data: a moment map of the Sculptor region, showing the H i line intensity integrated over a small velocity range. The seven (optically) brightest galaxies in the field are marked; H i emission is easily detected in these galaxies, and also in fainter galaxies throught the region, and in high-velocity cloud complexes towards the south of the image. q 2001 RAS, MNRAS 322, 486 498


498

D. G. Barnes et al.
Sinclair M. W., Wilson W. E., Mooney G. G., Sykes P. B., Bolton R. J., 1997, Proceedings of the 4th International Symposium on Low Temperature Electronics and High Temperature Superconductivity, Proc. Electrochem. Soc., 97, 336 Staveley-Smith L., 1997, Proc. Astron. Soc. Aust., 14, 111 Staveley-Smith L. et al., 1996, Proc. Astron. Soc. Aust., 13, 243 Voss R. F., 1988, in Peitgen H.-O., Saupe D., eds, ASP Conf. Ser. Vol. 145, The Science of Fractal Images. Springer-Verlag, New York Wilson W. E. et al., 1997, Proceedings of the Workshop on Applications of Radio Science. Australian Academy of Science National Committee for Radio Science, Canberra, p. 254 Wright A. E., Otrupcek R., 1990, Parkes Catalogue, Australia Telescope National Facility
A This paper has been typeset from a TEX/L TEX file prepared by the author.

REFER E NC ES
Banks G. D. et al., 1999, ApJ, 524, 612 Barnes D. G., 1998, in Albrecht R., Hook R. N., Bushouse H. A., eds, ASP Conf. Ser. Vol. 145, Astronomical Data Analysis Software and Systems VII. Astron. Soc. Pac., San Francisco, p. 32 Canaris J., 1993, in Dowd A., ed., Proceedings of a Workshop on New Generation Digital Correlators. National Radio Astronomy Observatory, Tucson, p. 117 Freund J. E., 1971, Mathematical Statistics. Prentice-Hall, Englewood Cliffs Kellaway G. P., 1946, Map Projections. Methuen & Co. Ltd., London Kendall M. G., Stuart A., 1963, The Advanced Theory of Statistics. Griffin, London Putman M. E. et al., 1998, Nat, 394, 752

q 2001 RAS, MNRAS 322, 486 498