Difference between revisions of "TOI processing LFI"
Revision as of 15:24, 4 February 2015
- 1 Overview
- 2 ADC Correction
- 3 Spikes Removal
- 4 Filling Gaps in the Data
- 5 Gain Modulation Factor
- 6 Diode Combination
- 7 Planet Flagging
- 8 Photometric Calibration
- 9 Noise
- 10 References
Overview
The LFI Level 2 pipeline analyzes data from each horn of the instrument separately, one pointing period at a time, and stores the results in an object the length of an OD. Each diode of the horn is corrected for systematic effects. Next, measurements of the sky and the 4K load are differenced, then the signals from one diode are combined with signals from the complementary diode in the same radiometer. Finally, photometric calibration is applied for each horn.
Before the Level 2 pipeline is run, the Mission information and data sampling divisions are stored in the database, in order to improve the analysis.
The Mission information is a set of objects, one for each Operational Day (OD, as defined in the glossary), in which are stored pointing period data: the DPC pointing ID (where 1 is the first pointing of the nominal mission), PSO pointing ID, start OBT of the pointing maneuver, start OBT of the stable pointing, OBT of the end of the stable pointing, and spin axis ecliptic longitude and latitude.
The sampling information is a set of objects, one for each LFI frequency, in which are stored for each pointing ID: start OBT of the pointing maneuver, start OBT of the stable pointing, end OBT of the pointing, number of samples of the pointing, number of stable samples of the pointing, start sample of the stable pointing and sample number from the start of the nominal mission. Valid samples and OBTs are defined to be those where any of the radiometers in a frequency band contain valid data.
ADC Correction
During the analysis of LFI data, we discovered a systematic effect common to both white noise and calibration. It turned out to be a non-linearity in the on-board Analogue-to-Digital Converter (ADC). More detail is supplied in Planck-2013-II, Planck-2013-III, Planck-2015-A03 and Planck-2015-A04.
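The mathematical model represents the digital ADC output as:
$$X = (V - \Delta)\,\gamma + x_0$$
where $V$ is the voltage input, $\gamma$ is the DAE gain, $\Delta$ is the DAE offset and $x_0$ is the DAE $T_{\rm zero}$.
We can model the non-linearity as a function of the input voltage, $R(V)$. So we have the apparent inferred voltage $V'$ and we can link it to the actual input voltage with:
$$R(V) = \frac{(V' - \Delta)\,\gamma + x_0}{(V - \Delta)\,\gamma + x_0}$$
Since $V \gg \Delta$ and $V\gamma \gg x_0$ we can use the much simpler relation:
$$V' = V\,R(V)$$
and we expect $R(V)$ to be very near to unity for all $V$.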
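To find the response curve we have only the apparent voltage $V'$ to work with, so we had to use the inverse response function $R'(V')$ and replace the real input voltage $V$ with the system temperature $T_{\rm sys}$ times the time-varying gain factor $G(t)$:
$$V = V'\,R'(V') = G(t)\,T_{\rm sys}$$
If we introduce a small signal $\delta T$ on top of $T_{\rm sys}$, this leads to an increased detected voltage and a corresponding apparent voltage increment:
$$V + \delta V = (V' + \delta V')\,R'(V' + \delta V') = G(t)\,(T_{\rm sys} + \delta T)$$
so carrying out the differentiation with respect to $V'$ on the relation between true and apparent signal voltage leads to:
$$\delta V = \left(V'\,\frac{dR'(V')}{dV'} + R'(V')\right)\delta V' = G(t)\,\delta T$$
We now assume $T_{\rm sys}$ and $\delta T$ are fixed and that the variations are due to slow drifts in the gain. So we can isolate the terms:
$$G(t) = \frac{V'\,R'(V')}{T_{\rm sys}}, \qquad G(t) = \left(V'\,\frac{dR'(V')}{dV'} + R'(V')\right)\frac{\delta V'}{\delta T}$$
Combining the equations through the gain factor to remove it:
$$\frac{V'\,R'(V')}{T_{\rm sys}} = \left(V'\,\frac{dR'(V')}{dV'} + R'(V')\right)\frac{\delta V'}{\delta T}$$
Rearranging and putting $a = \delta T / T_{\rm sys}$:
$$\frac{dR'(V')}{dV'} = \left(\frac{a}{\delta V'} - \frac{1}{V'}\right)R'(V')$$
So there is the expected direct proportionality of $\delta V'$ to $V'$, based on the assumption that the variations in voltage are due to overall gain drift, so the amplitude of voltage and signal will vary together. Then there is the additional differential term, which pulls the signal amplitude away from the linear relationship. So if we plot measured white noise or dipole gain factor against recovered voltage, we should see a linear curve with variations due to local slope changes at particular voltages. The linear part can be taken out and the differential part fitted. This was numerically integrated to get the inverse response curve, which we then used to convert the measured voltages to corrected voltages.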
For each of the 44 LFI diodes there is the corresponding corrected object in the Database. Each object contains 4 columns: the input voltages coming from the sky channel and the corresponding linearized output, the input voltages coming from the 4K reference channel and the corresponding linearized output.
The data loaded by the module are used to initialize two different interpolators, using the CSPLINE (cubic spline) functions of the GNU Scientific Library (GSL). The interpolators are then used to correct each sample.
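The table-driven correction described above can be sketched as follows. This is only an illustration under stated assumptions: it uses plain linear interpolation from the Python standard library instead of the GSL cubic spline used by the pipeline, and the table values and the names `make_interpolator`, `correct_sky` and `correct_ref` are hypothetical.

```python
from bisect import bisect_right

def make_interpolator(x, y):
    """Return f(v) that linearly interpolates the table (x, y); x must be increasing."""
    def f(v):
        # Clamp to the tabulated range, then interpolate inside the bracketing interval.
        if v <= x[0]:
            return y[0]
        if v >= x[-1]:
            return y[-1]
        i = bisect_right(x, v) - 1
        t = (v - x[i]) / (x[i + 1] - x[i])
        return y[i] + t * (y[i + 1] - y[i])
    return f

# Hypothetical 4-column correction table for one diode (values invented for illustration):
# sky input voltage, sky linearized output, reference input voltage, reference linearized output.
sky_in, sky_out = [0.0, 1.0, 2.0], [0.0, 1.01, 2.0]
ref_in, ref_out = [0.0, 1.0, 2.0], [0.0, 0.99, 2.0]

# One interpolator per channel, as in the text.
correct_sky = make_interpolator(sky_in, sky_out)
correct_ref = make_interpolator(ref_in, ref_out)

# Correct each sample of the two streams.
sky_corrected = [correct_sky(v) for v in [0.5, 1.5]]
ref_corrected = [correct_ref(v) for v in [0.5, 1.5]]
```

A cubic spline gives a smoother response curve than this piecewise-linear stand-in, but the flow (build one interpolator per channel, then apply it sample by sample) is the same.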
Spikes Removal
Some of the LFI receivers exhibit a small artifact with a period of exactly 1 second, which produces effects visible in the power spectra. The effect is a set of spikes at 1 Hz and harmonics. The spurious signal is very well modeled and is removed from the timelines. More information can be found in Planck-2013-II, Planck-2013-III, Planck-2015-A03 and Planck-2015-A04.
The cause of the spikes at 1 Hz and harmonics is a tiny 1 second square wave embedded in signals from the affected channels. The method to estimate the 1 Hz signal is to build a template in the time domain synchronized with the spurious signal. The first step is dividing each second of data into time bins using OBT. The number of bins is computed using:
$$n_{\rm bins} = f_{\rm samp} \times \mathrm{template\ resolution}$$
where $f_{\rm samp}$ is the sampling frequency, giving 136 bins at 70 GHz, 80 at 44 GHz and 56 at 30 GHz. Then the bins vector is initialized with time intervals. To avoid aliasing effects the template resolution is $\sqrt{3}$. We can write the process by adding indices to the time sample: the lower index denotes the particular time sample, while the upper index labels the bin into which the sample falls. The linear filter can be written as:
$$s(t_i^j) = a_j\left[1 - \Delta x(t_i^j)\right] + a_{j+1}\,\Delta x(t_i^j)$$
Here $\Delta x(t_i^j)$ is the filter weight, which is determined by where within the bin the sample lies. If we use $t^j$ with only an upper index to denote the start of each bin, then we can write the filter weight as follows:
$$\Delta x(t_i^j) = \frac{t_i^j - t^j}{t^{j+1} - t^j}$$
In other words, the filter weight is the time sample value minus the start of the bin divided by the width of the bin.
We must estimate the parameters $a_j$ from the data. With the assumption that the instrument has stable noise properties, we can use a least squares algorithm to estimate the bin values:
$$\frac{\partial \chi^2}{\partial a_k} = 0, \qquad \chi^2 = \sum_{i,j}\left[s(t_i^j) - d_i^j\right]^2$$
where $d_i^j$ is the data sample at time $t_i^j$. This can be represented as a matrix equation:
$$\sum_k M_{jk}\,a_k = b_j$$
with the following definitions:
$$M_{j,j-1} = \sum_i \left[1 - \Delta x(t_i^{j-1})\right]\Delta x(t_i^{j-1})$$
$$M_{j,j} = \sum_i \left[1 - \Delta x(t_i^j)\right]^2 + \sum_i \left[\Delta x(t_i^{j-1})\right]^2$$
$$M_{j,j+1} = \sum_i \left[1 - \Delta x(t_i^j)\right]\Delta x(t_i^j)$$
$$M_{j,k} = 0 \quad \text{otherwise}$$
$$b_j = \sum_i d_i^j\left[1 - \Delta x(t_i^j)\right] + \sum_i d_i^{j-1}\,\Delta x(t_i^{j-1})$$
With these definitions we have to make use of periodic boundary conditions to obtain the correct results, such that if $j = 0$ then $j - 1 = n_{\rm bins} - 1$, and if $j = n_{\rm bins} - 1$ then $j + 1 = 0$. Once this is done, we have a symmetric tridiagonal matrix with additional values at the upper right and lower left corners of the matrix. The matrix is solved with LU decomposition. In order to be certain of the numerical accuracy of the result, we can perform a simple iteration. The solving of the linear system and the iterative improvement of the solution are implemented as suggested in Numerical Recipes.
For each of the 44 LFI diodes there is a corresponding object in the database. After studying the amplitude of the spikes at other LFI frequencies, we chose to apply the correction only to the 44 GHz radiometers. Each object contains 3 columns: the bins start time vector, the sky amplitudes and the reference amplitudes.
For each sample the value to be subtracted is computed using:
$$s(t) = a_k\left[1 - \Delta x(t)\right] + a_{k+1}\,\Delta x(t)$$
where $k$ is the index of the bin at a given time.
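The template estimation and subtraction can be illustrated with a small standard-library sketch. It assumes times expressed in seconds with the spurious signal locked to integer seconds, equal-width bins, and plain Gaussian elimination in place of the LU decomposition with iterative improvement mentioned in the text. All function names are hypothetical.

```python
def bin_and_weight(t, nbins):
    """Return (bin index j, filter weight dx) for a time t in seconds."""
    pos = (t % 1.0) * nbins            # position within the 1 s period, in bin units
    j = min(int(pos), nbins - 1)
    return j, pos - j

def solve(M, b):
    """Solve M x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    A = [row[:] + [rhs] for row, rhs in zip(M, b)]
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(A[r][c]))
        A[c], A[p] = A[p], A[c]
        for r in range(c + 1, n):
            f = A[r][c] / A[c][c]
            for q in range(c, n + 1):
                A[r][q] -= f * A[c][q]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        x[r] = (A[r][n] - sum(A[r][q] * x[q] for q in range(r + 1, n))) / A[r][r]
    return x

def fit_template(times, data, nbins):
    """Least-squares estimate of the bin values of the periodic template."""
    M = [[0.0] * nbins for _ in range(nbins)]
    b = [0.0] * nbins
    for t, d in zip(times, data):
        j, dx = bin_and_weight(t, nbins)
        k = (j + 1) % nbins                      # periodic boundary condition
        terms = ((j, 1.0 - dx), (k, dx))
        for p, wp in terms:                      # accumulate the normal equations
            b[p] += wp * d
            for q, wq in terms:
                M[p][q] += wp * wq
    return solve(M, b)

def template_value(t, a):
    """Value of the template at time t: linear interpolation between bin values."""
    j, dx = bin_and_weight(t, len(a))
    return a[j] * (1.0 - dx) + a[(j + 1) % len(a)] * dx

def remove_spikes(times, data, a):
    """Subtract the template from every sample."""
    return [d - template_value(t, a) for t, d in zip(times, data)]
```

With noise-free synthetic data generated from a known set of bin values, `fit_template` should return those values and `remove_spikes` should leave a residual at the level of rounding error.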
Filling Gaps in the Data
During the mission, a small number of data packets were lost (Planck-2013-II, Planck-2015-A03). Moreover, in two different and very peculiar situations, LFI was shut down and restarted, giving inconsistencies in data sampling. None of these data are used for scientific purposes, but to avoid discrepancies in the data analysis all of the radiometers at the same frequency must have the same samples.
To ensure this, we compare the length of the data stream to be reduced in a specific pointing period to the data stored in the sample information object. If the length is not the same, the OBT vector is filled with missing sample times, the data vector is filled with zeros, and in the flag column the bit for gap is entered.
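A minimal sketch of this gap-filling step is given below. It assumes that OBTs are integer counters that can be matched exactly, and the bit position `GAP_FLAG` and the function name `fill_gaps` are hypothetical.

```python
GAP_FLAG = 1 << 0  # hypothetical bit position; the real flag layout is not given here

def fill_gaps(expected_obt, obt, data, flags):
    """Pad a data stream so that it covers every expected OBT of the pointing period."""
    present = {t: (d, f) for t, d, f in zip(obt, data, flags)}
    filled_data, filled_flags = [], []
    for t in expected_obt:
        # Missing samples get a zero value and the gap bit set in the flag column.
        d, f = present.get(t, (0.0, GAP_FLAG))
        filled_data.append(d)
        filled_flags.append(f)
    return list(expected_obt), filled_data, filled_flags

# Example: the sample at OBT 11 was lost.
obt, data, flags = fill_gaps([10, 11, 12, 13], [10, 12, 13], [1.5, 2.5, 3.5], [0, 0, 0])
```

After this step all radiometers at the same frequency share the same OBT vector, and later stages can skip the padded samples by testing the gap bit.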
Gain Modulation Factor
The pseudo-correlation design of the LFI radiometers allows a dramatic reduction of noise when the $V_{\rm sky}$ and $V_{\rm load}$ outputs are differenced (see LFI instrument description). The two streams are slightly unbalanced, as one looks at the 2.7 K sky and the other looks at the ~4.5 K reference load. To force the mean of the difference to zero, the load signal is multiplied by the Gain Modulation Factor (R). For each pointing period this factor is computed using (see eq. (3) in LFI description):
$$R = \frac{\langle V_{\rm sky}\rangle}{\langle V_{\rm load}\rangle}$$
Then the data are differenced using:
$$\mathrm{TOI}_{\rm diff}^{\,i} = V_{\rm sky}^{\,i} - R\,V_{\rm load}^{\,i}$$
This value for R minimizes both the 1/f noise and the white noise in the difference timestream. The $i$ index represents the diode and can be 0 or 1.
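As an illustration, the factor and the differenced stream for one diode and one pointing period could be computed as in the sketch below. The function names are hypothetical and the voltages are invented.

```python
def gain_modulation_factor(sky, load):
    """R = <V_sky> / <V_load> over one pointing period."""
    mean_sky = sum(sky) / len(sky)
    mean_load = sum(load) / len(load)
    return mean_sky / mean_load

def difference(sky, load):
    """Return R and the differenced stream V_sky - R * V_load."""
    r = gain_modulation_factor(sky, load)
    return r, [s - r * l for s, l in zip(sky, load)]

# Invented sky and reference-load voltages for one diode.
r, diff = difference([2.0, 2.3, 1.9], [4.0, 4.1, 4.5])
```

By construction the mean of `diff` is zero up to rounding error, which is the balancing condition described in the text.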
At this point, we also set the maneuver flag bit to identify which samples have missing data, using the information stored in the sampling information object. This identifies which data to ignore in the next step of the Pipeline.
The R values are stored in the database. At the same time the mean values of $V_{\rm sky}$ and $V_{\rm load}$ are stored so they can be used in other steps of the analysis.
Diode Combination
The two complementary diodes of each radiometer are combined. The relative weights of the diodes in the combination are chosen for optimal noise. We assign relative weights to the uncalibrated diode streams based on their first order calibrated noise.
From first order calibration we compute absolute gains $G_0$ and $G_1$, subtract an estimated sky signal and calculate the calibrated white noise $\sigma_0$ and $\sigma_1$ for the pair of diodes (see eq. (6) in LFI description). The weights for the two diodes ($i$ = 0 or 1) are:
where the weighted calibration constant is given by:
The weights are fixed to a single value per diode for the entire dataset. Small variations in the relative noise of the diodes would in principle suggest recalculating the weights on shorter timescales. However, we decided that a time-varying weight could induce more significant subtle systematics, so we chose a single best estimate of the weights for each diode pair.
- Weights used in combining diodes
The weights in the table above are used in the formula:
Planet Flagging
Measurements of planets have been formed from samples containing flux from the planet, plus a surrounding region, projected onto a grid (microstripes), assuming an elliptical Gaussian beam with parameters from the instrument database.
Microstripes are a way to extract and store relevant samples for planet detection. Relevant samples are samples affected by the planet plus samples in the neighborhood (to establish a background level). The search radius to select samples as relevant is 5 deg around the planet's position, computed at the pointing period mid time. For each sample we store SCET (Spacecraft Event Time), pointing directions and calibrated temperature. Destriping is applied.
Random errors are estimated by taking the variance of samples entering each micromap pixel. This is fast but not exact, since the variance is larger near a bright source. This can cause the noise to be overestimated by a factor of. Given the large S/N for planetary observations, that is not a major drawback.
The apparent position of a planet as seen from Planck at a given time is derived from JPL Horizons. Positions are tabulated in steps of 15 minutes and then linearly interpolated at the sampling frequency of each detector. The JPL Horizons tables also allow other quantities to be derived, such as the planet-Planck distance, the planet-Sun distance and the planet angular diameter, which affects the apparent brightness of the planet.
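The interpolation of the tabulated positions onto detector sample times can be sketched as below. The function name `interpolate_table` and the numbers are hypothetical, and the sketch assumes the tabulated coordinate does not wrap around (for example across 360 degrees of longitude) inside the interval being interpolated.

```python
def interpolate_table(t, t0, step, values):
    """Linearly interpolate values tabulated at t0, t0 + step, ... at time t."""
    x = (t - t0) / step
    # Pick the bracketing interval, clamping to the first or last one.
    i = min(max(int(x), 0), len(values) - 2)
    frac = x - i
    return values[i] + frac * (values[i + 1] - values[i])

# Invented ecliptic longitudes (degrees) tabulated every 15 minutes (900 s).
table = [10.0, 10.4, 10.8]
sample_times = [450.0, 1350.0]
positions = [interpolate_table(t, 0.0, 900.0, table) for t in sample_times]
```

Times outside the tabulated range are linearly extrapolated from the nearest interval, so in practice the table should cover the whole pointing period.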
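The antenna temperature is a function of the dilution factor $D$, according to:
$$T_{A,\rm obs} = T_{A,\rm red}\,D, \qquad D \propto \left(\frac{\theta}{b_{\rm fwhm}}\right)^2$$
where $T_{A,\rm obs}$ and $T_{A,\rm red}$ are the observed and reduced $T_A$, $\theta$ is the instantaneous angular diameter of the planet and $b_{\rm fwhm}$ is the beam full width at half maximum.
With the above definition $T_{A,\rm red}$ could be considered as the $T_{A,\rm obs}$ for a planet with $D = 1$, but a more convenient view is to take a reference dilution factor $D_0$, as the dilution factor for a standardized angular diameter $\theta_0$ for the planet and a fiducial beam FWHM $b_{\rm fwhm,0}$, to have:
$$T_{A,\rm obs} = T_{A,\rm red}\,D_0\left(\frac{\theta}{\theta_0}\right)^2\left(\frac{b_{\rm fwhm,0}}{b_{\rm fwhm}}\right)^2$$
leading to the following definition of a standardized $T_A$:
$$T_{A,\rm std} = T_{A,\rm obs}\left(\frac{\theta_0}{\theta}\right)^2\left(\frac{b_{\rm fwhm}}{b_{\rm fwhm,0}}\right)^2$$
with the advantage of removing variations among different detectors and transits while keeping the value of $T_{A,\rm std}$ similar to that seen by the instrument, thus allowing a prompt comparison of signals and sensitivities.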
The OBT vectors found by the search are saved in a set of objects, one for each horn. In the Level 2 pipeline those OBTs are compared with the OBT vector of the data to set the planet bit flag where needed.
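The comparison of OBT vectors can be sketched as follows, assuming OBTs are integer counters that match exactly. The bit position `PLANET_FLAG` and the function name `flag_planet` are hypothetical.

```python
PLANET_FLAG = 1 << 1  # hypothetical bit position for the planet flag

def flag_planet(obt, flags, planet_obt):
    """Set the planet bit on every sample whose OBT appears in the planet OBT list."""
    hits = set(planet_obt)
    return [f | PLANET_FLAG if t in hits else f for t, f in zip(obt, flags)]

# Example: samples at OBT 101 and 102 fall within the planet search radius.
new_flags = flag_planet([100, 101, 102, 103], [0, 0, 1, 0], [101, 102])
```

Using a bitwise OR keeps any flag bits that were already set, such as a gap or maneuver bit.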
Photometric Calibration
Photometric calibration is the procedure used to convert data from volts to kelvin. The source of the calibration is the well-known CMB dipole, caused by the motion of the Solar System with respect to the CMB reference frame. To this signal we add the modulation induced by the orbital motion of Planck around the Sun. The resulting signal is then convolved with the horn beam to get the observed dipole.
Beam Convolved Dipole
In computing the beam-convolved dipole we used an elegant algorithm to save time and computing power. In computing the cosmological dipole signal it is common to assume a pencil-like beam acting as a Dirac delta function. In this case a dipole timeline is defined as:
$$\Delta T_{D,\delta}(t) = \mathbf{P}_E(t) \cdot \mathbf{D}_E$$
where $\mathbf{P}_E(t)$ is the pointing direction in the observer reference frame and $\mathbf{D}_E$ is the dipole axis scaled by the dipole amplitude, again in the same reference frame.
In general the true signal would have to be convolved with the beam pattern of the given radiometer, usually described as a fixed map in the beam reference frame or as a time-dependent map in the observer reference frame. In this case it is easiest to describe the convolution in the beam reference frame, since the function to be convolved can then be described by a single vector.
Denoting with $\mathbf{U}(t)$ the matrix converting from the observer to the beam reference frame, so that:
$$\mathbf{P}_B(t) = \mathbf{U}(t)\,\mathbf{P}_E(t)$$
the instantaneous dipole direction in the beam reference frame is:
$$\mathbf{D}_B(t) = \mathbf{U}(t)\,\mathbf{D}_E$$
By denoting with $\mathbf{P}$ a pointing direction in the beam reference frame and with $B(\mathbf{P})$ the beam pattern, then:
$$\Delta T(t) = N \int B(\mathbf{P})\,\mathbf{P} \cdot \mathbf{D}_B(t)\,d\Omega$$
where $N$ is a normalization constant.
Denoting with $P_x$, $P_y$, $P_z$ the three cartesian components of $\mathbf{P}$, the integral of the dot product can be decomposed into three independent integrals:
$$S_x = N \int B(\mathbf{P})\,P_x\,d\Omega, \qquad S_y = N \int B(\mathbf{P})\,P_y\,d\Omega, \qquad S_z = N \int B(\mathbf{P})\,P_z\,d\Omega$$
These integrals define a time-independent vector $\mathbf{S}_B = (S_x, S_y, S_z)$, characteristic of each radiometer and constant over the mission.
By using this characteristic vector, the calculation of the convolved dipole is simply defined by a dot product of the vector $\mathbf{S}_B$ and the dipole axis rotated in the beam reference frame:
$$\Delta T(t) = \mathbf{S}_B \cdot \left[\mathbf{U}(t)\,\mathbf{D}_E\right]$$
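The idea can be sketched numerically with a beam sampled on a discrete set of directions. The sketch below assumes the normalization is one over the sum of the beam weights, which the text does not specify, and all function names are hypothetical.

```python
def beam_vector(directions, weights):
    """Characteristic vector: beam-weighted mean of unit pointing directions in the beam frame.

    Assumes the normalization constant is 1 / sum(weights).
    """
    norm = sum(weights)
    return [sum(w * p[c] for p, w in zip(directions, weights)) / norm for c in range(3)]

def rotate(U, v):
    """Apply the 3x3 rotation matrix U (observer frame to beam frame) to the vector v."""
    return [sum(U[r][c] * v[c] for c in range(3)) for r in range(3)]

def convolved_dipole(S, U, dipole):
    """Beam-convolved dipole: dot product of S with the dipole rotated into the beam frame."""
    d_beam = rotate(U, dipole)
    return sum(s * d for s, d in zip(S, d_beam))

# A pencil beam along the beam-frame z axis reproduces the delta-function result.
identity = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
S_pencil = beam_vector([(0.0, 0.0, 1.0)], [1.0])
signal = convolved_dipole(S_pencil, identity, [0.0, 0.0, 3.0e-3])
```

The vector is computed once per radiometer, so each time sample costs only one rotation and one dot product instead of a full integral over the beam.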
In order to simplify the computation and to reduce the amount of data used in the calibration procedure, the data are phase-binned into a map with $N_{\rm side}$ = 256. During phase binning all the data flagged for maneuvers, planets and gaps, as well as the ones flagged in Level 1 analysis as not recoverable, are ignored.
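The first order calibration values are given by a least squares fit between the signal and the dipole. For each pointing period $k$, gain ($g_k$) and offset ($b_k$) values are computed by minimizing:
$$\chi^2 = \sum_{i \in k}\left[V(t_i) - g_k\,D(t_i) - b_k\right]^2$$
where $V(t_i)$ is the measured signal and $D(t_i)$ is the beam-convolved dipole.
The sum includes only samples outside a Galactic mask.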
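A minimal sketch of such a fit for one pointing period is shown below. It assumes an unweighted straight-line model (signal equals gain times dipole plus offset) and a boolean mask that is True for samples outside the Galactic mask; the function name and the numbers are hypothetical.

```python
def fit_gain_offset(volts, dipole, mask):
    """Unweighted least-squares fit of volts = g * dipole + b over samples where mask is True."""
    pts = [(d, v) for v, d, m in zip(volts, dipole, mask) if m]
    n = len(pts)
    sd = sum(d for d, _ in pts)
    sv = sum(v for _, v in pts)
    sdd = sum(d * d for d, _ in pts)
    sdv = sum(d * v for d, v in pts)
    g = (n * sdv - sd * sv) / (n * sdd - sd * sd)   # slope of the straight-line fit
    b = (sv - g * sd) / n                           # intercept
    return g, b

# Invented dipole values (K) and voltages; the third sample is contaminated and masked out.
dipole = [-3.0e-3, -1.0e-3, 0.0, 2.0e-3, 3.0e-3]
volts = [20.0 * d + 0.5 for d in dipole]
volts[2] += 1.0
g, b = fit_gain_offset(volts, dipole, [True, True, False, True, True])
```

Because the contaminated sample is excluded by the mask, the fit recovers the gain and offset used to build the synthetic data.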
The largest source of error in the fit arises from unmodeled sky signal from CMB anisotropy.
The procedure adopted to correct this effect is described below.
The uncalibrated TOI $V_i$ belonging to a pointing period $k$ is modeled as:
$$V_i = g_k\,(T_i + D_i) + b_k$$
where the unknowns $T_i$, $g_k$ and $b_k$ are respectively the signal per OBT, the gain per PID and the offset per PID, and $D_i$ is the total dipole.
This is a non-linear $\chi^2$ minimization problem. It can be linearized with an iterative procedure: for each iteration step a linearized model TOI is built as:
$$V_i^{\rm lin} = g_k\,(D_i + T_i^{0}) + g_k^{0}\,(T_i - T_i^{0}) + b_k$$
where $g_k^{0}$ and $T_i^{0}$ are the gain and sky signal computed at the previous step of iteration.
A linear fit is performed between $V_i$ and $V_i^{\rm lin}$ to get the new $g_k$ and $T_i$. The procedure is iterated until convergence.
To reduce the impact of the noise during the iterative procedure the sky estimation is built using data from both radiometers of the same horn.
To improve the accuracy of the iterative solution and remove noise from it, a smoothing algorithm is applied.
OSGTV is a 3-step smoothing algorithm, implemented in C++.
The gain reconstructed with DaCapo can be expressed as
where thefunction represents temperature fluctuations of the Focal Plane. The time dependence of the “real” gain is modeled as the superposition of a “slow” component, with a time scale of ~3 months, and a “fast” component with a time scale of few PIDs:
The slow component takes into account the seasonal variations of the thermal structure of the spacecraft due to the orbital motion, while the “fast” component describes the thermal effects of the electronics and compressors, as well as single events like the sorption cooler switchover.
To disentangle the components of the gain, we need a parametric “hybrid” approach in three steps.
Step 1. For each PID, we used the 4K total-power and the signal from temperature detectors in the focal plane , subsampled at 1 sample/PID, to track gain changes. This is implemented through a linear fit between and :
where the window lengthof the moving average is proportional to the variance of the dipole in the considered PID. The resulting gain is:
Step 2. The "fast" componentis recovered as follows: we define a maximum moving average window length ,and for each PID we compute the variance of in this window. We define a percentile on the ordered variance array and we compute the corresponding value of the variance . The window length for each PID is then computed as
Ifwe impose . With these window lengths a moving window average is performed on . The averaged gain vector is subtracted from the raw hybrid gain to get .
Step 3. We perform a moving window average onand we compute the variances of the smoothed array. The window length is computed with a linear interpolation between a minimum length defined in the dipole minima and a maximum defined in the dipole maxima. The array of gain variances is weighted with the variance of the dipole. We set a percentile on the variance array and we find the corresponding variance value . Around this value we search for local maxima of the variance array, and we split the domain of the gain in subsets between consecutive maxima. For each subset we perform a moving average with the corresponding window length. The "slow" component is given by the union of these subsets.
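All three steps rely on moving-window averages whose window length changes from PID to PID. The sketch below shows only that building block, with the window lengths supplied as input. The rules for choosing the windows (percentiles of the variance, dipole minima and maxima) are not reproduced, and the function name is hypothetical.

```python
def moving_average(values, windows):
    """Centered moving average with a separate window length for each sample.

    Windows are truncated at the ends of the array.
    """
    n = len(values)
    out = []
    for i, w in enumerate(windows):
        half = w // 2
        lo, hi = max(0, i - half), min(n, i + half + 1)
        out.append(sum(values[lo:hi]) / (hi - lo))
    return out

# One gain value per PID, smoothed with a 3-PID window everywhere.
smoothed = moving_average([0.0, 0.0, 6.0, 0.0, 0.0], [3, 3, 3, 3, 3])
```

A window of length 1 leaves a sample unchanged, so short windows can be used where the gain is expected to change quickly and long windows where it is expected to be stable.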
The last step in TOI processing is the creation of the calibrated stream. For each sample we have:
where t is the time and k is the pointing period; the remaining terms are the CMB dipole convolved with the beam and the straylight.
Noise
This pipeline step aims at the reconstruction of the noise parameters from calibrated flight TOI. The goal is two-fold. On the one hand we need to know the actual noise properties of our instrument in order to properly take them into account, especially during later processing and analysis steps like map-making and power spectrum estimation. On the other hand, evaluation of noise properties during the instrument lifetime is a way to track down possible variations, anomalies and general deviations from the expected behaviour.
Noise estimation is performed on calibrated data; since we would like to track possible noise variations during the mission lifetime, we select data in chunks of 5 ODs (Operational Days). These data are processed by the ROMA Iterative Generalized Least Square (IGLS) map-making algorithm, which includes a noise estimation tool. In general IGLS map-making is quite costly in terms of time and resources required. However, the length of the data is such that it can run on the DPC cluster in a very short time (~1-2 minutes).
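The method implemented can be summarized as follows. We model the calibrated TOI as:
$$\mathbf{y} = \mathbf{P}\,\mathbf{m} + \mathbf{n}$$
where $\mathbf{n}$ is the noise vector and $\mathbf{P}$ is the pointing matrix that links a pixel in the map $\mathbf{m}$ with a sample in the TOI $\mathbf{y}$. The zeroth-order estimation of the signal is obtained by simply rebinning the TOI into a map. Then an iterative approach follows in which both signal and noise are estimated according to:
$$\hat{\mathbf{n}}_i = \mathbf{y} - \mathbf{P}\left(\mathbf{P}^T\hat{\mathbf{N}}_i^{-1}\mathbf{P}\right)^{-1}\mathbf{P}^T\hat{\mathbf{N}}_i^{-1}\,\mathbf{y}$$
where $\hat{\mathbf{N}}_i$ is the noise covariance matrix in the time domain resulting from iteration $i$. After three iterations, convergence is achieved.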
We then perform an FFT (Fast Fourier Transform) on the noise time stream from the iterative approach and fit the resulting spectrum.
As already done in the 2013 release, we estimate the parameters of the noise properties (i.e. white noise level, knee-frequency and slope of the low-frequency noise component) by means of an MCMC approach. On the spectra just described we first compute the white noise level, taking the mean of the last 10% of data (at 30 GHz, due to the higher value of the knee-frequency, this quantity is reduced to 5%). Once the white noise level has been determined we proceed with the actual fitting of knee-frequency and slope. The resulting values reported are the medians of the fitted values for our 5-OD chunks along the whole mission lifetime.
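The white noise estimate from the high-frequency tail of a spectrum can be sketched as follows. The model function assumes the usual white-plus-power-law form, sigma2 * (1 + (f / f_knee) ** slope) with a negative slope, which is not spelled out in the text, and the MCMC fit of knee-frequency and slope is not reproduced. Function names are hypothetical.

```python
def white_noise_level(spectrum, tail_fraction=0.10):
    """Mean of the highest-frequency tail of a spectrum ordered by increasing frequency."""
    n_tail = max(1, int(len(spectrum) * tail_fraction))
    tail = spectrum[-n_tail:]
    return sum(tail) / len(tail)

def noise_model(f, sigma2, f_knee, slope):
    """Assumed noise spectrum model: white level plus a power-law low-frequency component."""
    return sigma2 * (1.0 + (f / f_knee) ** slope)

# Synthetic spectrum with white level 2.0, knee frequency 0.05 Hz and slope -1.
freqs = [0.01 * i for i in range(1, 1001)]
spectrum = [noise_model(f, 2.0, 0.05, -1.0) for f in freqs]
wn = white_noise_level(spectrum)                    # use tail_fraction=0.05 at 30 GHz
```

On this synthetic spectrum the tail mean is slightly above the true white level, because the low-frequency component has not fully died out at the highest frequencies.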
|Horn||White Noise M [K s ]||White Noise S [K s ]||Knee-frequency M [mHz]||Knee-frequency S [mHz]||Slope M||Slope S|
Time variations of noise parameters are a good tracer of possible modifications in the instrument behaviour, and we know that some events capable of affecting the instrument behaviour happened during the mission. Both the variations of the physical temperature of the instrument due to the transition in operations from the first sorption cooler to the second one, and the observed degradation in the performance of the first cooler, are events that clearly show their fingerprints in the variation of the noise spectra.
In the following figure we report a representative sample of noise spectra, one for each frequency channel (from left to right LFI 18M, LFI 25S and LFI 27M), covering the whole mission lifetime. The white noise is extremely stable at the level of 0.3%. As already noted in the 2013 release, knee-frequencies and slopes are stable until OD 326, after which they show clear variations and deviations from the simple one knee-frequency, one slope model. This is a sign of the degradation of the first cooler, inducing thermal variations with a characteristic knee-frequency (different from the radiometric one) and with a steeper slope. Once the second cooler became operational and performed as expected, the noise spectra gradually returned to the original shape seen at the beginning of the mission. This behaviour is visible, although not identical, at each frequency, for several reasons, e.g. intrinsic thermal susceptibility and position on the focal plane, which determines the actual thermal transfer function.
References
- Planck 2013 results. II. Low Frequency Instrument data processing, Planck Collaboration, 2014, A&A, 571, A2.
- Planck 2013 results. III. Low Frequency Instrument systematic uncertainties, Planck Collaboration, 2014, A&A, 571, A3.
- Planck 2015 results. II. LFI processing, Planck Collaboration, 2016, A&A, 594, A2.
- Planck 2015 results. III. LFI systematics, Planck Collaboration, 2016, A&A, 594, A3.
- LFI: (Planck) Low Frequency Instrument
- OD: Operational Day. Its definition is driven by geometric visibility, as it runs from the start of a DTCP (satellite Acquisition Of Signal) to the start of the next DTCP. Because the ground station used and the length of each contact vary, the OD duration varies, but it is basically once a day.
- DPC: Data Processing Center
- PSO: Planck Science Office
- ADC: analog to digital converter
- DAE: LFI Data Acquisition Electronics
- CMB: Cosmic Microwave Background