Difference between revisions of "Compact Source catalogues"

From Planck Legacy Archive Wiki
Jump to: navigation, search
m
m
Line 29: Line 29:
  
 
==== Internal validation ====
 
==== Internal validation ====
The PCCS is validated through an internal Monte-Carlo quality assessment (MCQA) process that uses large numbers of source injection and detection loops on realistic simulated maps to constrain detection characteristics.
+
The catalogues for the HFI channels have primarily been validated through an internal Monte-Carlo quality assessment process that uses large numbers of source injection and detection loops to characterize their properties. For each channel, we calculate statistical quantities describing the quality of detection, photometry and astrometry of the detection code. The detection is described by the completeness and reliability of the catalogue: completeness is a function of intrinsic flux, the selection threshold applied to detection (S/N) and location, while reliability is a function only of the detection S/N. The quality of photometry and astrometry is assessed through direct comparison of detected position and flux density parameters with the known inputs of matched sources. An input source is considered to be detected if a detection is made within one beam FWHM of the injected position.
 
 
Source injection consists on the introduction of fake sources in the real maps. The positions of the injected sources are chosen in order to avoid existing detections in the real maps and previously injected sources. By analysing the recovered injected sources we can determine several statistical properties of the detection process: completeness, photometry accuracy, variation of sensitivity along the sky, etc.
 
 
 
On the other hand, in order to study the reliability, we need to perform realistic all-sky simulations. <span style="color:red">[Description or reference of the simulation used]</span>.  
 
  
 
==== External validation ====
 
==== External validation ====
At the lowest frequencies of Planck, it is possible to validate PCCS source identifications using external data sets, particularly large-area radio surveys. This kind of validation allow us also to characterize the detection process, i.e. to determine the Completeness, Purity and positional accuracy. Moreover, the external validation offers the opportunity for an absolute validation of the different photometries, directly related with the calibration and the knowledge of the beams.
+
At the three lowest frequencies of Planck, it is possible to validate the PCCS source identifications, completeness, reliability, positional accuracy and flux density accuracy using external data sets, particularly large-area radio surveys. Moreover, the external validation offers the opportunity for an absolute validation of the different photometries, directly related with the calibration and the knowledge of the beams.
  
 
At higher frequencies, surveys as the South-Pole Telescope (SPT), the Atacama Cosmology Telescope (ACT) and H-ATLAS or HERMES form Herschel will also be very important, although only for limited regions of the sky. In particular, the Herschel synergy is crucial to study the possible contamination of the catalogues caused by the Galactic cirrus at high frequencies.
 
At higher frequencies, surveys as the South-Pole Telescope (SPT), the Atacama Cosmology Telescope (ACT) and H-ATLAS or HERMES form Herschel will also be very important, although only for limited regions of the sky. In particular, the Herschel synergy is crucial to study the possible contamination of the catalogues caused by the Galactic cirrus at high frequencies.
 
This statistical characterization of the detection process is used to choose the best signal-to-noise threshold for each channel in order to maximize the Completeness without penalizing the Purity.
 
  
 
=== Cautionary notes ===
 
=== Cautionary notes ===

Revision as of 17:49, 11 March 2013

Planck Catalogue of Compact Sources[edit]

The Planck Catalogue of Compact Sources (PCCS) is a sample of reliable sources, both Galactic and extragalactic, extracted directly from the Planck nominal maps. The first public version of the PCCS is derived from the data acquired by Planck between August 13 2009 and November 26 2010. The PCCS consists of nine lists of sources, extracted independently from each of Planck's nine frequency channels. It is fully described in #planck2013-p05.

The whole PCCS can be downloaded here [1].

Detection procedure[edit]

The Mexican Hat Wavelet 2 (MHW2; Gonzalez-Nuevo et al., 2006) is the base algorithm used to produce the single channel catalogues of the PCCS. Although each DPC has is own implementation of this algorithm (IFCAMEX and HFI-MHW), the results are compatible at least at the statistical uncertainty level. Additional algorithms are also implemented, like the multi-frequency Matrix Multi-filters (MTXF; Herranz et al., 2009) and the Bayesian PowellSnake (Carvalho et al. 2009), but for the current version of the PCCS they are used just for the validation of the results obtained by the MHW2.

The full-sky maps are divided into a sufficient number of overlapping flat patches in such a way that 100% of the sky is covered. Each patch is then filtered by the MHW2 with a scale that is optimised to provide the maximum signal-to-noise ratio in the filtered maps. A sub-catalogue of objects is produced for each patch and then, at the end of the process, all the sub-catalogues are merged together, removing repetitions.

The driving goal of the ERCSC was reliability greater than 90 %. In order to increase completeness and explore possibly interesting new sources at fainter flux density levels, however, the initial overall reliability goal of the PCCS was reduced to 80 %. The S/N thresholds applied to each frequency channel have been determined, as far as possible, to meet this goal. The reliability of the catalogues has been assessed using the internal and external validation described below.

At 30, 44, and 70 GHz, the reliability goal alone would permit S/N thresholds below 4. A secondary goal of minimizing the upward bias on flux densities led to the imposition of an S/N threshold of 4.

At higher frequencies, where the confusion caused by the Galactic emission starts to become an issue, the sky has been divided into two zones, one Galactic (52 % of the sky) and one extragalactic (48 % of the sky). At 100, 143, and 217 GHz, the S/N threshold needed to achieve the target reliability is determined in the extragalactic zone, but applied uniformly on sky. At 353, 545, and 857 GHz, the need to control confusion from Galactic cirrus emission led to the adoption of different S/N thresholds in the two zones. The extragalactic zone has a lower threshold than the Galactic zone. The S/N thresholds are given in Table 1.

Bandfilling is the process by which flux density estimates at specific bands are generated based on source positions defined in another band. For the current PCCS release we compute the flux density at 217, 353, and 545 GHz at the positions of each source detected at 857 GHz, using aperture photometry. Bandfilling is not attempted at other frequencies due to the variation in spatial resolution across the bands, which makes multifrequency associations challenging, especially in crowded regions such as the Galactic Plane.

Photometry[edit]

In addition of the native flux density estimation provided by the detection algorithm, three additional measurements are obtained for each of the source in the parent samples. These additional flux density estimations are based on aperture photometry, PSF fitting and Gaussian fitting (see #planck2013-p05 for a detailed description of these additional photometries). The native flux density estimation is the only one that is obtained directly from the filtered maps while for the others the flux density estimates has a local background subtracted. The flux density estimations have not been colour corrected. Colour corrections are available in LFI Appendix and HFI spectral response pages.

Validation process[edit]

The PCCS, its sources and the four different estimates of the flux density, have undergone an extensive internal and external validation process to ensure the quality of the catalogues. The validation of the non-thermal radio sources can be done with a large number of existing catalogues, whereas the validation of thermal sources is mostly done with simulations. These two approaches will be discussed below. Detections identified with known sources have been appropriately flagged in the catalogues.

Internal validation[edit]

The catalogues for the HFI channels have primarily been validated through an internal Monte-Carlo quality assessment process that uses large numbers of source injection and detection loops to characterize their properties. For each channel, we calculate statistical quantities describing the quality of detection, photometry and astrometry of the detection code. The detection is described by the completeness and reliability of the catalogue: completeness is a function of intrinsic flux, the selection threshold applied to detection (S/N) and location, while reliability is a function only of the detection S/N. The quality of photometry and astrometry is assessed through direct comparison of detected position and flux density parameters with the known inputs of matched sources. An input source is considered to be detected if a detection is made within one beam FWHM of the injected position.

External validation[edit]

At the three lowest frequencies of Planck, it is possible to validate the PCCS source identifications, completeness, reliability, positional accuracy and flux density accuracy using external data sets, particularly large-area radio surveys. Moreover, the external validation offers the opportunity for an absolute validation of the different photometries, directly related with the calibration and the knowledge of the beams.

At higher frequencies, surveys as the South-Pole Telescope (SPT), the Atacama Cosmology Telescope (ACT) and H-ATLAS or HERMES form Herschel will also be very important, although only for limited regions of the sky. In particular, the Herschel synergy is crucial to study the possible contamination of the catalogues caused by the Galactic cirrus at high frequencies.

Cautionary notes[edit]

  • Statistical Character : warnings about the statistical analysis of this catalogue... Completeness levels for statistical analysis as number counts, usage of sensitivity maps at fixed completeness level, etc
  • Variability: At radio frequencies, many of the extragalactic sources are highly variable. A small fraction of them vary even on time scales of a few hours based on the brightness of the same source as it passes through the different Planck horns (refs?). Follow-up observations of these sources might show significant differences in flux density compared to the values in the data products. Although the maps used for the PCCS are based on ~2.5 sky coverages, the current version of the PCCS provides only a single average flux density estimate over all Planck data samples that were included in the all sky maps and does not contain any measure of the variability of the sources.
  • Contamination from CO: At infrared/submillimetre frequencies (100GHz and above), the Planck bandpasses straddle energetically significant CO lines (refs?). The effect is the most significant at 100GHz, where the line might contribute more than 50% of the measured flux density. Follow-up observations of these sources, especially those associated with Galactic star-forming regions, at a similar frequency but different bandpass, should correct for the potential contribution of line emission to the continuum flux density of the source. See Planck HFI Core Team (2011b) for details.
  • Photometry: Each source has multiple measures of photometry APERFLUX, GAUFLUX, PSFFLUX and DETFLUX (or native) as defined above. The appropriate photometry to be used depends on the nature of the source. For sources which are unresolved at the spatial resolution of Planck, APERFLUX and DETFLUX are most appropriate. Even in this regime, PSF fits of faint sources fail and consequently these have a PSFFLUX value of NaN (Not a Number). For bright resolved sources, GAUSFLUX might be most appropriate although GAUSFLUX appears to overestimate the flux of sources close to the Galactic plane due to an inability to fit for the contribution of the Galactic background at the spatial resolution of the data.
  • Cirrus/ISM: A significant fraction of the sources detected in the upper HFI bands could be associated with Galactic interstellar medium features or cirrus. The IRAS 100μm surface brightness in MJy sr1 for each of the sources, which is commonly used as a proxy for cirrus, is available through a search of the ERCSC with IRSA. Candidate ISM features can also be selected by choosing objects with EXTENDED=1 although nearby Galactic and extragalactic sources which are extended at Planck spatial resolution will meet this criterion. Alternately, the value of CIRRUS in the catalogue can be utilised to flag sources which might be clustered together and thereby associated with ISM structure.

Planck Sunyaev-Zeldovich catalogue[edit]

The Planck SZ catalogue is a nearly full-sky list of SZ detections obtained from the Planck data. It is fully described in #planck2013-p05a. The catalogue is derived from the HFI frequency channel maps after masking and filling the bright point sources (SNR >= 10) from the PCCS catalogues in those channels. Three detection pipelines were used to construct the catalogue: two implementations of the Matched Multi-Filter (MMF) algorithm and PowellSnakes (PwS), a Bayesian algorithm. A union catalogue is constructed from the detections by all three pipelines. A Galactic dust mask (leaving 85% of the sky), a nearby Galaxy mask and a point source mask are applied a posteriori to remove detections in regions of the sky where foregrounds are likely to cause spurious detections.

Early Release Compact Source Catalogue[edit]

The ERCSC is a list of high reliability (>90%) sources, both Galactic and extragalactic, derived from the data acquired by Planck between August 13 2009 and June 6 2010. The ERCSC consists of:

  • nine lists of sources, extracted independently from each of Planck's nine frequency channels
  • two lists extracted using multi-channel criteria: the Early Cold Cores catalogue (ECC), consisting of Galactic dense and cold cores, selected mainly on the basis of their temperature ; and the Early Sunyaev-Zeldovich catalogue (ESZ), consisting of galaxy clusters selected by the spectral signature of the Sunyaev-Zeldovich effect.

The whole ERCSC can be downloaded here [2].

The ERCSC is also accessible via the NASA/IPAC Infrared Science Archive [3].

References[edit]

<biblio force=false>

  1. References

</biblio>

Data Processing Center

(Planck) High Frequency Instrument

Early Release Compact Source Catalog

Full-Width-at-Half-Maximum

Sunyaev-Zel'dovich