# TOI processing LFI

## Overview

The LFI Level2 Pipeline analyzes each horn of the instrument separately, one pointing period at a time, and stores the results in objects the length of an OD. Each diode of the horn is corrected for systematic effects, differenced, and then combined with its complementary diode in the same radiometer. The horn is then calibrated and the photometric calibration is applied.

### Pre-processing

Before the Level2 pipeline is run, and to improve the analysis, the mission information and the data sampling divisions are stored in the database.

The mission information is a set of objects, one for each Operational Day (OD, as defined in the glossary), which store the pointing period data: DPC pointing ID (where 1 is the first pointing of the nominal mission), PSO pointing ID, start OBT of the pointing maneuver, start OBT of the stable pointing, end OBT of the pointing, and spin axis ecliptic longitude and latitude.

The sampling information is a set of objects, one for each LFI frequency, which store for each pointing ID: start OBT of the pointing maneuver, start OBT of the stable pointing, end OBT of the pointing, number of samples of the pointing, number of stable samples of the pointing, start sample of the stable pointing, and sample number from the start of the nominal mission. Valid samples and OBTs are defined where any of the radiometers from that frequency cohort contains valid data.

## ADC Correction

During the analysis, the white noise and the calibration appeared to be affected by a common effect. It turned out to be a non-linearity in the on-board Analogue-to-Digital Converter (ADC). More details are given in [1] (Planck-2013-II, Planck-2013-III).

### Evaluation

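The mathematical model represents the digital ADC output as:

$X = (V - \Delta)\,\gamma + x_0$

where $V$ is the voltage input, $\gamma$ is the DAE gain, $\Delta$ is the DAE offset and $x_0$ is the DAE $T_\mathrm{zero}$.

We can model the non-linearity as a function of the input voltage, $R(V)$. So we have the apparent inferred voltage $V'$ (with apparent ADC output $X'$) and we can link it to the actual input voltage with:

$(V' - \Delta)\,\gamma + x_0 = X' = X\,R(V) = \left[(V - \Delta)\,\gamma + x_0\right] R(V)$

so that:

$R(V) = \frac{(V' - \Delta)\,\gamma + x_0}{(V - \Delta)\,\gamma + x_0}$

Since $V \gg \Delta$ and $V\gamma \gg x_0$ we can use the much simpler relation:

$R(V) = \frac{V'}{V}$

and we expect it to be very near to unity for all $V$.

To find the response curve we have only the apparent voltage to work with, so we had to use the inverse response function $R'(V')$ and replace the real input voltage with $T_{sys}$ times the time-varying gain factor $G(t)$:

$V = V'\,R'(V') = G(t)\,T_{sys}$

If we introduce a small signal $\delta T$ on top of $T_{sys}$, this leads to an increased detected voltage and a corresponding apparent voltage increment:

$V + \delta V = (V' + \delta V')\,R'(V' + \delta V') = G(t)\,(T_{sys} + \delta T)$

Carrying out the differentiation with respect to $V'$ of the relation between true and apparent signal voltage leads to:

$\delta V = \left[V'\,\frac{dR'(V')}{dV'} + R'(V')\right]\delta V' = G(t)\,\delta T$

We now assume that $T_{sys}$ and $\delta T$ are fixed and that the variations are due to slow drifts in the gain. So we can isolate the terms:

$G(t) = \frac{V'\,R'(V')}{T_{sys}}, \qquad G(t) = \left[V'\,\frac{dR'(V')}{dV'} + R'(V')\right]\frac{\delta V'}{\delta T}$

Combining the equations through the gain factor to remove it:

$\frac{V'\,R'(V')}{T_{sys}} = \left[V'\,\frac{dR'(V')}{dV'} + R'(V')\right]\frac{\delta V'}{\delta T}$

Rearranging and putting $a = \delta T / T_{sys}$:

$\frac{dR'(V')}{dV'} = \left(\frac{a}{\delta V'} - \frac{1}{V'}\right) R'(V')$

So there is the expected direct proportionality of $\delta V'$ to $V'$ (for $dR'/dV' = 0$ the relation reduces to $\delta V' = a\,V'$), which follows from the assumption that the variations in voltage are due to an overall gain drift, so the amplitudes of voltage and signal vary together. Then there is the additional differential term, which pulls the signal amplitude away from the linear relationship. So if we plot the measured white noise or dipole gain factor against the recovered voltage, we should see this linear curve with variations due to local slope changes at particular voltages. The linear part can be taken out and the differential part fitted. The fitted differential part was then numerically integrated to obtain the inverse response curve, which is what we need to convert the measured voltages to corrected voltages.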

### Application

For each of the 44 LFI diodes there is a corresponding object in the database. Each object contains 4 columns: the input voltages coming from the sky channel and the corresponding linearized output, and the input voltages coming from the reference channel and the corresponding linearized output.

The data loaded by the module are used to initialize two different interpolators (one for the sky channel and one for the reference channel) using the CSPLINE interpolation type and the functions from the GSL (GNU Scientific Library). The interpolators are then used to correct each sample.
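The correction step can be illustrated with a minimal pure-Python sketch. It is not the pipeline code: the table values are invented, and `natural_cubic_spline` is a hypothetical helper that implements a natural cubic spline, used here as a stand-in for the GSL CSPLINE interpolator.

```python
from bisect import bisect_right


def natural_cubic_spline(xs, ys):
    """Return a callable natural cubic spline through the points (xs, ys).

    xs must be strictly increasing and contain at least three points.
    """
    n = len(xs)
    h = [xs[i + 1] - xs[i] for i in range(n - 1)]
    # Tridiagonal system for the second derivatives m; the natural boundary
    # conditions fix m[0] = m[n-1] = 0.
    sub, diag, sup, rhs = [0.0] * n, [1.0] * n, [0.0] * n, [0.0] * n
    for i in range(1, n - 1):
        sub[i] = h[i - 1]
        diag[i] = 2.0 * (h[i - 1] + h[i])
        sup[i] = h[i]
        rhs[i] = 6.0 * ((ys[i + 1] - ys[i]) / h[i] - (ys[i] - ys[i - 1]) / h[i - 1])
    # Forward elimination and back substitution (Thomas algorithm).
    for i in range(1, n):
        w = sub[i] / diag[i - 1]
        diag[i] -= w * sup[i - 1]
        rhs[i] -= w * rhs[i - 1]
    m = [0.0] * n
    m[n - 1] = rhs[n - 1] / diag[n - 1]
    for i in range(n - 2, -1, -1):
        m[i] = (rhs[i] - sup[i] * m[i + 1]) / diag[i]

    def evaluate(x):
        # Locate the interval containing x (end intervals are used outside the table).
        i = min(max(bisect_right(xs, x) - 1, 0), n - 2)
        t0, t1 = xs[i + 1] - x, x - xs[i]
        return ((m[i] * t0 ** 3 + m[i + 1] * t1 ** 3) / (6.0 * h[i])
                + (ys[i] / h[i] - m[i] * h[i] / 6.0) * t0
                + (ys[i + 1] / h[i] - m[i + 1] * h[i] / 6.0) * t1)

    return evaluate


# Invented 4-column table for one diode: input voltage -> linearized voltage.
sky_in, sky_out = [0.0, 1.0, 2.0, 3.0], [0.0, 1.01, 1.98, 3.0]
ref_in, ref_out = [0.0, 1.0, 2.0, 3.0], [0.0, 2.0, 4.0, 6.0]

correct_sky = natural_cubic_spline(sky_in, sky_out)
correct_ref = natural_cubic_spline(ref_in, ref_out)

corrected_sky = [correct_sky(v) for v in [0.5, 1.0, 2.5]]
corrected_ref = [correct_ref(v) for v in [0.5, 1.5]]
```

Building the two interpolators once per diode and then evaluating them sample by sample mirrors the description above; the real tables are much denser than this toy example.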

## Spikes Removal

Some of the LFI receivers exhibit a small artifact with exactly 1 second repetition, which is visible in the power spectra. The effect is a set of spikes at 1 Hz and harmonics. The spurious signal is very well modeled and is removed from the timelines. More information can be found in [1] (Planck-2013-II, Planck-2013-III).

### Modeling

The cause of the spikes at 1 Hz and harmonics is a tiny 1 second square wave embedded in the affected channels. The method to estimate the 1 Hz signal is to build a template in the time domain, synchronized with the spurious signal. The first step is dividing each second of data into time bins using the OBT. The number of bins is computed using:

$nbins = fsamp \times template\_resolution$

where $fsamp$ is the sampling frequency and $nbins$ is 136 at 70 GHz, 80 at 44 GHz and 56 at 30 GHz. Then the bins vector is initialized with time intervals. To avoid aliasing effects the template resolution is $\sqrt{3}$. We can write the process by adding an index to the time sample: the lower index denotes the particular time sample, while the upper index labels the bin into which the sample falls. The linear filter can be written as:

$s(t_i^j) = a_j\left[1 - \Delta x(t_i^j)\right] + a_{j+1}\,\Delta x(t_i^j)$

Here $\Delta x(t_i^j)$ is the filter weight, which is determined by where within the bin the sample lies. If we use $t^j$ with only an upper index to denote the start of each bin, then we can write the filter weight as follows:

$\Delta x(t_i^j) = \frac{t_i^j - t^j}{t^{j+1} - t^j}$

In other words, the filter weight is the time sample value minus the start of the bin, divided by the width of the bin.

We must estimate the parameters $a_j$ from the data. With the assumption that the instrument has stable noise properties, we can use a least-squares algorithm to estimate the bin values:

$\frac{\partial \chi^2}{\partial a_j} = 0, \qquad \chi^2 = \sum_j \sum_i \left[d(t_i^j) - s(t_i^j)\right]^2$

where $d(t_i^j)$ are the data samples. This can be represented as the matrix equation:

$\sum_k M_{jk}\,a_k = b_j$

with the following definitions:

$M_{j,j-1} = \sum_i \Delta x(t_i^{j-1})\left[1 - \Delta x(t_i^{j-1})\right]$

$M_{j,j} = \sum_i \left[1 - \Delta x(t_i^j)\right]^2 + \sum_i \Delta x(t_i^{j-1})^2$

$M_{j,j+1} = \sum_i \left[1 - \Delta x(t_i^j)\right]\Delta x(t_i^j)$

$b_j = \sum_i \left[1 - \Delta x(t_i^j)\right] d(t_i^j) + \sum_i \Delta x(t_i^{j-1})\,d(t_i^{j-1})$

With these definitions we have to make use of periodic boundary conditions to obtain the correct results, such that if $j = 0$, $j-1 = n-1$, and if $j = n-1$, $j+1 = 0$, where $n$ is the number of bins. Once this is done, we have a symmetric tridiagonal matrix with additional values at the upper right and lower left corners of the matrix. The matrix is solved with LU decomposition. In order to be certain of the numerical accuracy of the result, we can perform a simple iteration. The solving of the linear system and the iterative improvement of the solution are implemented as suggested in Numerical Recipes.
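The template estimation can be sketched as follows. This is an illustrative sketch under stated assumptions, not the pipeline implementation: bins are taken as equal-width divisions of each second, the function names are hypothetical, and a plain Gaussian elimination with partial pivoting stands in for the LU decomposition with iterative improvement.

```python
def solve_linear_system(matrix, rhs):
    """Solve matrix * x = rhs by Gaussian elimination with partial pivoting."""
    n = len(rhs)
    aug = [row[:] + [r] for row, r in zip(matrix, rhs)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            f = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= f * aug[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(aug[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (aug[r][n] - s) / aug[r][r]
    return x


def fit_spike_template(times, data, nbins):
    """Least-squares bin amplitudes of a 1 s periodic, piecewise-linear template."""
    m = [[0.0] * nbins for _ in range(nbins)]
    b = [0.0] * nbins
    for t, d in zip(times, data):
        phase = t % 1.0                       # position inside the 1 s period
        j = min(int(phase * nbins), nbins - 1)
        dx = phase * nbins - j                # filter weight within bin j
        k = (j + 1) % nbins                   # periodic boundary: last bin wraps to 0
        m[j][j] += (1.0 - dx) ** 2
        m[k][k] += dx ** 2
        m[j][k] += (1.0 - dx) * dx
        m[k][j] += (1.0 - dx) * dx
        b[j] += (1.0 - dx) * d
        b[k] += dx * d
    return solve_linear_system(m, b)


# Synthetic check: noiseless data generated from a known template.
true_a = [1.0, -1.0, 0.5, 0.0]
nbins = len(true_a)
times = [i * 0.013 for i in range(400)]
data = []
for t in times:
    phase = t % 1.0
    j = min(int(phase * nbins), nbins - 1)
    dx = phase * nbins - j
    data.append(true_a[j] * (1.0 - dx) + true_a[(j + 1) % nbins] * dx)

estimated = fit_spike_template(times, data, nbins)
```

The wrap-around index `k` is what produces the extra corner entries of the otherwise tridiagonal matrix. A dense solver is adequate for a toy case; a cyclic tridiagonal solver would be the natural choice for the real bin counts.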

### Application

For each of the 44 LFI diodes there is a corresponding object in the database. Because of the amplitude of the spikes, we chose to apply the correction only to the 44 GHz radiometers. Each object contains 3 columns: the bin start time vector, the sky amplitudes and the reference amplitudes.

For each sample the value to be subtracted is computed using:

$s(t) = a_k\left[1 - \Delta x(t)\right] + a_{k+1}\,\Delta x(t)$

where $k$ is the index of the bin containing time $t$, $a_k$ is the template amplitude stored for that bin, and $\Delta x(t)$ is the fractional position of $t$ within the bin.
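As a rough illustration, the subtraction for one channel could look like the sketch below. The column contents are invented, `spike_value` is a hypothetical name, and the sketch assumes the bin start times are offsets within the 1 second period, beginning at 0.

```python
from bisect import bisect_right


def spike_value(t, bin_starts, amplitudes):
    """Template value at time t (seconds).

    bin_starts are increasing offsets in [0, 1) within the 1 s period, and
    bin_starts[0] is assumed to be 0.0.
    """
    n = len(bin_starts)
    phase = t % 1.0
    k = bisect_right(bin_starts, phase) - 1          # bin containing this sample
    start = bin_starts[k]
    end = bin_starts[k + 1] if k + 1 < n else 1.0    # last bin ends at the period
    dx = (phase - start) / (end - start)             # fractional position in bin
    return amplitudes[k] * (1.0 - dx) + amplitudes[(k + 1) % n] * dx


# Invented template columns for one diode (sky channel only).
bin_starts = [0.0, 0.25, 0.5, 0.75]
sky_amplitudes = [0.0, 2.0, 0.0, -2.0]

sample_times = [10.125, 3.875]
sky_samples = [5.0, 5.0]
cleaned = [v - spike_value(t, bin_starts, sky_amplitudes)
           for t, v in zip(sample_times, sky_samples)]
```

The same function would be called with the reference amplitudes for the reference channel.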

## Gaps Filling

During the mission some of the data packets were lost (see [2], Planck-2013-II). Moreover, in two different and very peculiar situations the LFI was shut down and restarted, giving inconsistencies in the data sampling. None of these data are used for scientific purposes, but to avoid discrepancies in the data analysis all of the radiometers at the same frequency must have the same samples.

To accomplish this, the length of the data stream to be reduced in a specific pointing period is compared with the data stored in the sampling information object. If the length is not the same, the OBT vector is filled with the missing sample times, the data vector is filled with zeros, and the gap bit is raised in the flag column.
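A minimal sketch of this step is shown below. The bit position `GAP_FLAG`, the function name and the integer OBT ticks are assumptions made for the example; the expected OBT vector stands for what the sampling information object would provide.

```python
GAP_FLAG = 1 << 0  # hypothetical bit position for the gap flag


def fill_gaps(expected_obt, obt, data, flags):
    """Return (obt, data, flags) aligned to expected_obt, zero-filling gaps."""
    present = {t: i for i, t in enumerate(obt)}
    out_data, out_flags = [], []
    for t in expected_obt:
        i = present.get(t)
        if i is None:
            out_data.append(0.0)          # missing sample: data set to zero
            out_flags.append(GAP_FLAG)    # and the gap bit is raised
        else:
            out_data.append(data[i])
            out_flags.append(flags[i])
    return list(expected_obt), out_data, out_flags


expected_obt = [100, 101, 102, 103, 104]
obt = [100, 101, 104]
data = [1.5, 1.6, 1.9]
flags = [0, 0, 0]

filled_obt, filled_data, filled_flags = fill_gaps(expected_obt, obt, data, flags)
```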

## Gain Modulation Factor

The pseudo-correlation design of the LFI radiometers allows a dramatic reduction of $1/f$ noise when the $V_{sky}$ and $V_{load}$ outputs are differenced (see LFI instrument description). The two streams are slightly unbalanced, as one looks at the 2.7 K sky and the other looks at the ~4.5 K reference load. To force the mean of the difference to zero, the load signal is multiplied by the Gain Modulation Factor (R). For each pointing period this factor is computed using (see eq. (3) in LFI description):

$R = \frac{\langle V_{sky} \rangle}{\langle V_{load} \rangle}$

Then the data are differenced using:

$\mathrm{TOI}_{diff}^{i} = V_{sky}^{i} - R^{i}\,V_{load}^{i}$

This value for R minimizes the 1/f and the white noise in the difference timestream. The $i$ index represents the diode and can be 0 or 1.
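A short sketch of the computation for one diode and one pointing period follows, with invented voltages and hypothetical function names. R is taken as the ratio of the mean sky voltage to the mean load voltage, so that the differenced stream has zero mean.

```python
def gain_modulation_factor(sky, load):
    """R = <V_sky> / <V_load> over one pointing period."""
    return (sum(sky) / len(sky)) / (sum(load) / len(load))


def difference(sky, load, r):
    """Differenced stream V_sky - R * V_load, sample by sample."""
    return [s - r * l for s, l in zip(sky, load)]


# Invented sky and load voltages for one diode in one pointing period.
v_sky = [1.25, 1.5, 0.25, 1.0]
v_load = [2.0, 3.0, 1.0, 2.0]

r = gain_modulation_factor(v_sky, v_load)
toi_diff = difference(v_sky, v_load, r)
mean_diff = sum(toi_diff) / len(toi_diff)
```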

At this point the maneuver flag bit is set to identify which samples have missing data, using the information stored in the sampling information object. This identifies which data to ignore in the next step of the Pipeline.

The R values are stored in the database. At the same time the mean values of the sky and load voltages, $\langle V_{sky} \rangle$ and $\langle V_{load} \rangle$, are stored in order to be used in other steps of the analysis.

## Diode Combination

The two complementary diodes of each radiometer are combined. The relative weights of the diodes in the combination are chosen for optimal noise. We assign relative weights to the uncalibrated diode streams based on their first order calibrated noise.

### Evaluation

From first-order calibration we compute the absolute gains $G_0$ and $G_1$, subtract an estimated sky signal and calculate the calibrated white noise $\sigma_0$ and $\sigma_1$ for the pair of diodes (see eq. (6) in LFI description). The weights for the two diodes ($i$ = 0 or 1) are:

$W_i = \frac{\sigma_i^2}{G_{01}}\,\frac{1}{\sigma_0^2 + \sigma_1^2}$

where the weighted calibration constant is given by:

$G_{01} = \frac{1}{\sigma_0^2 + \sigma_1^2}\left[G_0\,\sigma_1^2 + G_1\,\sigma_0^2\right]$

The weights are fixed to a single value per diode for the entire dataset. Small variations in the relative noise of the diodes would in principle suggest recalculating the weights on shorter timescales. However, we decided that a time-varying weight could induce more significant subtle systematics, so we chose a single best estimate of the weights for each diode pair.

| Horn | Weight M-00 | Weight M-01 | Weight S-10 | Weight S-11 |
|------|-------------|-------------|-------------|-------------|
| 18 | 0.567304963 | 0.432695037 | 0.387168785 | 0.612831215 |
| 19 | 0.502457723 | 0.497542277 | 0.55143474 | 0.44856526 |
| 20 | 0.523020094 | 0.476979906 | 0.476730576 | 0.523269424 |
| 21 | 0.500324722 | 0.499675278 | 0.563712153 | 0.436287847 |
| 22 | 0.536283158 | 0.463716842 | 0.553913461 | 0.446086539 |
| 23 | 0.508036034 | 0.491963966 | 0.36160661 | 0.63839339 |
| 24 | 0.602269189 | 0.397730811 | 0.456037835 | 0.543962165 |
| 25 | 0.482050606 | 0.517949394 | 0.369618239 | 0.630381761 |
| 26 | 0.593126369 | 0.406873631 | 0.424268188 | 0.575731812 |
| 27 | 0.519877701 | 0.480122299 | 0.484831449 | 0.515168551 |
| 28 | 0.553227696 | 0.446772304 | 0.467677355 | 0.532322645 |

### Application

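The weights in the table above are used in the formula:

$\mathrm{TOI}_{rad} = w_0\,\mathrm{TOI}_{diff}^{0} + w_1\,\mathrm{TOI}_{diff}^{1}$

where $w_0$ and $w_1$ are the weights of the two diodes of the radiometer and $\mathrm{TOI}_{diff}^{i}$ are the corresponding differenced diode streams.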
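As an illustration, the weighted combination of the two differenced diode streams of one radiometer can be sketched as below. The weights are those of horn 18, main radiometer (M-00 and M-01), from the table; the stream values and the function name are invented for the example.

```python
def combine_diodes(diode0, diode1, w0, w1):
    """Weighted sum of the two differenced diode streams of a radiometer."""
    return [w0 * a + w1 * b for a, b in zip(diode0, diode1)]


# Weights for horn 18, main radiometer (M-00 and M-01), from the table above.
W_M00, W_M01 = 0.567304963, 0.432695037

# Invented differenced streams for the two diodes.
toi_diode0 = [1.0, 2.0]
toi_diode1 = [3.0, 2.0]

toi_radiometer = combine_diodes(toi_diode0, toi_diode1, W_M00, W_M01)
```

Since the two tabulated weights of each radiometer sum to one, samples on which the two diodes agree are left unchanged by the combination.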

## Planet Flagging

### Extraction Method

The planet temperatures have been estimated from the chunks of samples affected by the planet, plus a surrounding region, projected onto a grid (microstripes), assuming an elliptical Gaussian beam with parameters taken from the instrument database.

Microstripes are a way to extract and store the samples relevant for planet detection. Relevant samples are the samples affected by the planet plus the samples in its neighbourhood. The search radius used to select samples as relevant is 5 deg around the planet position, computed at the mid time of the pointing period. For each sample we store the SCET (Spacecraft Event Time), the pointing directions and the calibrated temperature. Destriping is applied during application.

Random errors are estimated by taking the variance of the samples entering each micromap pixel. This is fast. Its main problems are that near a bright source the estimated noise is larger and that it is difficult to extract the correlation matrix; as a result the noise is overestimated by about a factor of two, which in this situation is not a major drawback.

The apparent position of the planets as seen from Planck at a given time is derived from JPL Horizons. Positions are sampled in tables at steps of 15 minutes and then linearly interpolated at the sampling frequency of each detector. The JPL Horizons tables also allow other quantities to be derived, such as the Planet-Planck distance, the Planet-Sun distance and the planet angular diameter, which affect the apparent brightness of the planet.

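The antenna temperature is a function of the dilution factor, according to:

$T_{ant,obs} = 4 \ln 2\; T_{ant,1} \left(\frac{\theta}{b_{fwhm}}\right)^2$

where $T_{ant,obs}$ and $T_{ant,1}$ are the observed and reduced $T_{ant}$, $\theta$ is the instantaneous planet angular diameter and $b_{fwhm}$ is the beam full width at half maximum.

With the above definition $T_{ant,1}$ could be considered as the $T_{ant}$ for a planet with $b_{fwhm} = \theta$, but a more convenient view is to take a Reference Dilution factor $D_0$, defined as the dilution factor for a standardized planet angular diameter and beam FWHM, $\theta_0$ and $b_{fwhm,0}$, to have:

$D_0 = \left(\frac{\theta_0}{b_{fwhm,0}}\right)^2$

leading to the following definition of a standardized $T_{ant}$:

$T_{ant,std} = T_{ant,obs} \left(\frac{b_{fwhm}}{b_{fwhm,0}}\right)^2 \left(\frac{\theta_0}{\theta}\right)^2$

with the advantage of removing variations among different detectors and transits while keeping the value of $T_{ant}$ similar to that seen by the instrument, thus allowing a prompt comparison of signals and sensitivities.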

### Application

The OBT vectors found by the search are saved in a set of objects, one for each horn. In the Level2 Pipeline those OBTs are compared with the OBT vector of the data to raise the planet bit flag where needed.
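The comparison can be sketched as a simple set lookup. The bit position `PLANET_FLAG`, the function name and the integer OBT values are assumptions made for the example.

```python
PLANET_FLAG = 1 << 3  # hypothetical bit position for the planet flag


def raise_planet_flag(obt, flags, planet_obt):
    """Set the planet bit on every sample whose OBT is in the planet OBT list."""
    affected = set(planet_obt)
    return [f | PLANET_FLAG if t in affected else f for t, f in zip(obt, flags)]


obt = [10, 11, 12, 13]
flags = [0, 1, 0, 0]          # sample 11 already carries another flag bit
planet_obt = [11, 12]         # OBTs saved by the planet search for this horn

new_flags = raise_planet_flag(obt, flags, planet_obt)
```

Using a bitwise OR keeps any bits already raised by earlier steps (for example the gap or maneuver bits).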

## Photometric Calibration

Photometric calibration is the procedure used to convert data from volts to kelvin. The source of the calibration is the well-known CMB dipole, caused by the motion of the Solar System with respect to the CMB reference frame. To this signal we add the modulation induced by the orbital motion of Planck around the Sun. The resulting signal is then convolved with the horn beam to get the observed dipole.

### Beam Convolved Dipole

In computing the beam-convolved dipole we used an efficient algorithm to save time and computing power. When computing the cosmological dipole signal it is common to assume a pencil-like beam acting as a Dirac delta function. In this case a dipole timeline is defined as:

$\Delta T_{D,\delta}(t) = \mathbf{P}_E(t) \cdot \mathbf{D}_E$

where $\mathbf{P}_E(t)$ is the pointing direction in the observer reference frame and $\mathbf{D}_E$ is the dipole axis scaled by the dipole amplitude, again in the same reference frame.

In general the true signal would have to be convolved with the beam pattern of the given radiometer, usually described as a fixed map in the beam reference frame or as a time-dependent map in the observer reference frame. In this case it is easiest to describe the convolution in the beam reference frame, since the function to be convolved is described by a single vector.

Denoting with $\mathbf{U}(t)$ the matrix converting from the observer to the beam reference frame, so that:

$\mathbf{U}(t)\,\mathbf{P}_E(t) = \mathbf{e}_z$

where $\mathbf{e}_z$ is the unit vector along the beam axis, the instantaneous dipole direction in the beam reference frame is:

$\mathbf{D}_B(t) = \mathbf{U}(t)\,\mathbf{D}_E$

By denoting with $\mathbf{P}$ a pointing direction in the beam reference frame, then:

$\Delta T(t) = N \int_{4\pi} B(\mathbf{P})\,\mathbf{P} \cdot \mathbf{D}_B(t)\,d\Omega$

where $B(\mathbf{P})$ is the beam pattern and $N$ is a normalization constant, $N^{-1} = \int_{4\pi} B(\mathbf{P})\,d\Omega$.

Denoting with $P_x$, $P_y$, $P_z$ the three Cartesian components of $\mathbf{P}$, the integral of the dot product can be decomposed into three independent integrals:

$S_x = N \int_{4\pi} B(\mathbf{P})\,P_x\,d\Omega$

$S_y = N \int_{4\pi} B(\mathbf{P})\,P_y\,d\Omega$

$S_z = N \int_{4\pi} B(\mathbf{P})\,P_z\,d\Omega$

These integrals define a time-independent vector $\mathbf{S}$, characteristic of each radiometer and constant over the mission.

| Detector ID | $S_x$ | $S_y$ | $S_z$ |
|-------------|-------|-------|-------|
| LFI18S | 1.4105692317321994e-03 | -3.7689062388084022e-04 | 9.9999893412338192e-01 |
| LFI18M | 1.1200251268914613e-03 | -3.2838598619563524e-04 | 9.9999931885294768e-01 |
| LFI19S | 1.7861136968831050e-03 | -4.4036975450455066e-04 | 9.9999830793473898e-01 |
| LFI19M | 1.4292780457919835e-03 | -4.7454175238335579e-04 | 9.9999886598655352e-01 |
| LFI20S | 1.7008692096818349e-03 | -6.1036624911600191e-04 | 9.9999836724715374e-01 |
| LFI20M | 1.5548897911626446e-03 | -5.9289001736737262e-04 | 9.9999861539862389e-01 |
| LFI21S | 1.6975720932854463e-03 | 6.0961185087824777e-04 | 9.9999837330986663e-01 |
| LFI21M | 1.5486274949897787e-03 | 5.9228926426513112e-04 | 9.9999862547220986e-01 |
| LFI22S | 1.7861136968831245e-03 | 4.4036975450366470e-04 | 9.9999830793473898e-01 |
| LFI22M | 1.4292780457920242e-03 | 4.7454175238250377e-04 | 9.9999886598655352e-01 |
| LFI23S | 1.4105692317321714e-03 | 3.7689062387997129e-04 | 9.9999893412338203e-01 |
| LFI23M | 1.1200251268914476e-03 | 3.2838598619481239e-04 | 9.9999931885294757e-01 |
| LFI24S | 3.4636411743209074e-04 | -2.8530917087092225e-07 | 9.9999994001590664e-01 |
| LFI24M | 4.3939553230170735e-04 | -2.9414231975370517e-07 | 9.9999990346573508e-01 |
| LFI25S | -1.0428719495964051e-04 | 1.9328051933678115e-04 | 9.9999997588341061e-01 |
| LFI25M | -1.1004766833423990e-04 | 2.7656488668259429e-04 | 9.9999995570068612e-01 |
| LFI26S | -1.0428719495970346e-04 | -1.9328051933760877e-04 | 9.9999997588341061e-01 |
| LFI26M | -1.1004766833430009e-04 | -2.7656488668343130e-04 | 9.9999995570068612e-01 |
| LFI27S | 1.6613273546973915e-03 | 6.6518363019636186e-04 | 9.9999839875979735e-01 |
| LFI27M | 1.5583345016298123e-03 | 6.4183510236962536e-04 | 9.9999857981963269e-01 |
| LFI28S | 1.6633788116048607e-03 | -6.6629002345089925e-04 | 9.9999839461297824e-01 |
| LFI28M | 1.5571200481047094e-03 | -6.4144198187461837e-04 | 9.9999858196366442e-01 |

By using this characteristic vector the calculation of the convolved dipole is simply defined by a dot product of the vector by the dipole axis rotated in the beam reference frame.
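A minimal sketch of this dot product is given below, using the characteristic vector listed for LFI18S. The rotation matrix (an identity, i.e. a beam frame aligned with the observer frame), the dipole vector of 3.35 mK along the beam axis and the function names are illustrative assumptions only.

```python
def mat_vec(matrix, vector):
    """Multiply a 3x3 matrix (list of rows) by a 3-vector."""
    return [sum(row[c] * vector[c] for c in range(3)) for row in matrix]


def dot(a, b):
    """Dot product of two 3-vectors."""
    return sum(x * y for x, y in zip(a, b))


def convolved_dipole(s_vector, observer_to_beam, dipole_observer):
    """Beam-convolved dipole: S . (U * D), with D rotated into the beam frame."""
    dipole_beam = mat_vec(observer_to_beam, dipole_observer)
    return dot(s_vector, dipole_beam)


# Characteristic vector of LFI18S from the table above.
S_LFI18S = [1.4105692317321994e-03, -3.7689062388084022e-04, 9.9999893412338192e-01]

# Illustrative inputs: identity rotation and a dipole of 3.35e-3 K along the beam axis.
identity = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
dipole = [0.0, 0.0, 3.35e-3]

convolved = convolved_dipole(S_LFI18S, identity, dipole)
pencil_beam = dot([0.0, 0.0, 1.0], mat_vec(identity, dipole))
```

In this configuration the convolved value is slightly smaller than the pencil-beam value, because the third component of the characteristic vector is just below one.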

### Binning

In order to simplify the computation and to reduce the amount of data used in the calibration procedure, the data are phase-binned into a map with Nside 256. During phase binning, all the data flagged for maneuvers, planets or gaps, and the data flagged in the Level1 analysis as not recoverable, are discarded.

### Fit

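The first-order calibration values are given by a least-squares fit between the signal and the dipole. For each pointing period $k$, a gain ($g_k$) and an offset ($b_k$) are computed by minimizing:

$\chi^2 = \sum_{i \in k} \frac{\left[\Delta V(t_i) - \Delta V_{model}(t_i \mid g_k, b_k)\right]^2}{rms_i^2}$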

The sum includes samples outside a Galactic mask.
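The fit for a single pointing period can be sketched as an ordinary straight-line regression. This is a sketch under assumptions: the model is taken as signal = dipole / gain + offset with the gain expressed in K/V, all samples have the same rms (so the weights drop out), and the function name and numbers are invented.

```python
def fit_gain_offset(signal, dipole, use=None):
    """Least-squares fit of signal = dipole / gain + offset for one pointing period.

    signal is in volts, dipole in kelvin; `use` marks samples outside the mask.
    Returns (gain in K/V, offset in V).
    """
    if use is None:
        use = [True] * len(signal)
    s = [v for v, ok in zip(signal, use) if ok]
    d = [v for v, ok in zip(dipole, use) if ok]
    n = len(s)
    mean_s, mean_d = sum(s) / n, sum(d) / n
    cov = sum((x - mean_d) * (y - mean_s) for x, y in zip(d, s))
    var = sum((x - mean_d) ** 2 for x in d)
    slope = cov / var                     # volts per kelvin
    offset = mean_s - slope * mean_d      # volts
    return 1.0 / slope, offset


# Invented example: true gain 20 K/V, offset 0.5 V, last sample inside the mask.
dipole = [-0.003, -0.001, 0.0, 0.002, 0.003, 0.001]
signal = [0.5 + 0.05 * d for d in dipole]
signal[-1] += 1.0                         # contaminated sample, excluded by the mask
outside_mask = [True, True, True, True, True, False]

gain, offset = fit_gain_offset(signal, dipole, outside_mask)
```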

The largest source of error in the fit arises from unmodeled sky signal from CMB anisotropy. To correct this we iteratively project the calibrated data (without the dipole) onto a map, scan this map to produce a new TOD with astrophysical signal removed, and finally run a simple destriping algorithm to find the corrections to the gain and offset factors.

To reduce the impact of the noise during the iterative procedure the sky estimation is built using data from both radiometers of the same horn.

### Smoothing

To improve the accuracy of the iterative algorithm and to remove noise from the solution, a smoothing algorithm must be applied. We used two different algorithms: OSG for the 44 and 70 GHz radiometers, and DV/V Fix for the 30 GHz radiometers. The reasons behind this choice can be found in [3] (Planck-2013-V).

#### OSG

OSG is a Python code that performs smoothing with a three-step algorithm.

The first step is a Moving Average Window: the gain and offset factors are streams containing one value for each pointing period, that we call dipole fit raw streams. The optimized window has a length of 600 pointing periods.
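The first step can be illustrated with a centred moving average. The handling of the stream edges (the window is simply truncated) is an assumption of this sketch, and a window of 3 is used instead of 600 to keep the example small.

```python
def moving_average(values, window):
    """Centred moving average; the window is truncated at the stream edges."""
    half = window // 2
    out = []
    for i in range(len(values)):
        lo = max(0, i - half)
        hi = min(len(values), i + half + 1)
        out.append(sum(values[lo:hi]) / (hi - lo))
    return out


# Invented raw gains, one value per pointing period.
raw_gains = [1.0, 2.0, 6.0, 2.0, 1.0]
averaged_gains = moving_average(raw_gains, 3)
```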

The second step is a wavelet algorithm, using the pywt (Discrete Wavelet Transform in Python) library. Both the dipole fit raw streams and the averaged streams are denoised using wavelets of the Daubechies family, extending the signals with symmetric padding.

The third step is the combination of dipole fit raw and averaged denoised signal using knowledge about the instrument performance during the mission.

#### 4 K total-power and Fix

For the 30 GHz channels we used the 4 K total-power signal to track gain changes. The theory and the explanation of this choice can be found in [3] (Planck-2013-V).

The algorithm uses the mean values computed during differencing and the raw gains as they are after the iterative calibration, performing a linear weighted fit between the two streams, with the dipole variance in each pointing period as the weight. The fit is a single-parameter fit, so the offsets are set to zero in this smoothing method. It uses the GSL library.

In addition to the smoothing, a fix algorithm is implemented to better follow sudden gain changes due to changes in the instrument configuration. The first step is the application of the 4 K total-power smoothed gains to the data and the production of single-radiometer maps in the periods between events. The resulting maps are then fitted with dipole maps covering the same period of time, producing two factors for each radiometer: one from the fit using the main radiometer and one from the side radiometer. The correction to be applied to the gain values is then computed from these two factors.

### Gain Application

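The last step in TOI processing is the creation of the calibrated stream. For each sample we have:

$\mathrm{TOI}_{cal}(t) = \left[\mathrm{TOI}_{diff}(t) - \mathrm{offset}(k)\right] g(k) - \mathrm{convDip}(t)$

where $t$ is the time and $k$ is the pointing period. $\mathrm{convDip}(t)$ is the CMB dipole convolved with the beam.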
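A sketch of this last step is shown below. It assumes that each sample has the offset of its pointing period subtracted, is multiplied by the gain of that pointing period (taken here in K/V), and then has the beam-convolved dipole removed; the names and numbers are invented.

```python
def calibrate(toi_diff, pointing_id, gain, offset, conv_dipole):
    """Calibrated stream: (sample - offset[k]) * gain[k] - convolved dipole."""
    return [(v - offset[k]) * gain[k] - d
            for v, k, d in zip(toi_diff, pointing_id, conv_dipole)]


# Invented inputs: two samples belonging to two different pointing periods.
toi_diff = [0.75, 1.0]            # volts
pointing_id = [0, 1]
gain = {0: 20.0, 1: 10.0}         # K/V, one value per pointing period
offset = {0: 0.5, 1: 0.5}         # volts, one value per pointing period
conv_dipole = [0.5, 1.0]          # kelvin, one value per sample

toi_cal = calibrate(toi_diff, pointing_id, gain, offset, conv_dipole)
```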

## Noise

This pipeline step aims at the reconstruction of the noise parameters from the calibrated flight TOI. The goal is two-fold. On the one hand, we need to know the actual noise properties of our instrument in order to take them properly into account, especially during the following processing and analysis steps such as map-making and power spectrum estimation. On the other hand, the evaluation of the noise properties along the instrument lifetime is a way to track down possible variations, anomalies and general deviations from the expected behaviour.

### Operations

Noise estimation is performed on calibrated data and, since we would like to track possible noise variations along the mission lifetime, we select data in chunks of 5 ODs (Operational Days). These data are processed by the ROMA Iterative Generalized Least Square (IGLS) map-making algorithm, which includes a noise estimation tool. In general, IGLS map-making is quite demanding in terms of time and resources. However, the length of the data is such that it runs on the DPC cluster in a very short time (~1-2 minutes).

The method implemented can be summarized as follows. We model the calibrated TOI as

$\mathbf{\Delta T} = \mathbf{P} \mathbf{m} + \mathbf{n}$

where $\mathbf{n}$ is the noise vector and $\mathbf{P}$ is the pointing matrix that links a pixel in the map $\mathbf{m}$ with a sample in the TOI $\mathbf{\Delta T}$. The zero-th order estimation of the signal is obtained by simply rebinning the TOI into a map. Then an iterative approach follows in which both signal and noise are estimated according to

$\mathbf{\hat{n}_i} = \mathbf{\Delta T} - \mathbf{P\hat{m}_i}$
$\mathbf{\hat{m}_{i+1}} = \mathbf{(P^T\hat{N}_i^{-1}P)^{-1}P^T\hat{N}_i^{-1}\Delta T}$

where $\mathbf{\hat{N}_i}$ is the noise covariance matrix in the time domain obtained from iteration $i$. Convergence is achieved after three iterations.

We then perform an FFT (Fast Fourier Transform) on the noise time stream obtained from the iterative approach and fit the resulting spectrum.
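The zero-th order step and the first noise estimate can be sketched as follows. This is only the starting point of the iteration (a binned map is the generalized least-squares solution for uncorrelated noise of equal variance); it is not the ROMA code, and the pixel indices and function names are invented.

```python
def bin_map(toi, pixels, npix):
    """Zero-th order map: average of the TOI samples falling in each pixel."""
    total = [0.0] * npix
    hits = [0] * npix
    for value, pix in zip(toi, pixels):
        total[pix] += value
        hits[pix] += 1
    return [t / h if h else 0.0 for t, h in zip(total, hits)]


def noise_estimate(toi, pixels, sky_map):
    """Noise time stream: TOI minus the map scanned back into a timeline."""
    return [value - sky_map[pix] for value, pix in zip(toi, pixels)]


# Invented TOI with the pixel hit by each sample (the pointing matrix P).
toi = [1.0, 2.0, 3.0, 4.0, 5.0]
pixels = [0, 1, 0, 1, 2]

map_0 = bin_map(toi, pixels, 3)
noise_0 = noise_estimate(toi, pixels, map_0)
```

In the iterative scheme, the spectrum of this noise stream would be used to build the noise covariance for the next map estimate.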

### Fitting Pipeline

In the very first release of Planck data, once the noise spectra were extracted, a simple log-periodogram fitting approach was applied to derive the most important noise parameters (white noise level, knee frequency and slope of the low-frequency noise component). However, during the mission lifetime there were some specific events (e.g. the switch-over of the sorption coolers) that we expect to have caused variations in the instrument behaviour and hence in its noise properties. For this reason we have improved our fitting pipeline by adding a Markov Chain Monte Carlo (MCMC) approach to estimate the noise parameters.

#### MCMC approach

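This new approach allows us to improve our noise model. Indeed, the noise can be parametrized by the usual combination of white plus $1/f$ noise:

$P(f) = \sigma^2\left[1 + \left(\frac{f}{f_{knee}}\right)^{\beta}\right]$

with three basic noise parameters: the white noise level $\sigma^2$, the knee frequency $f_{knee}$ and the slope $\beta$. However, it is also possible to work with a functional form with two more parameters:

$P(f) = \sigma^2\left[1 + \left(\frac{f}{f_{knee,1}}\right)^{\beta_1} + \left(\frac{f}{f_{knee,2}}\right)^{\beta_2}\right]$

The latter can be useful when there are clearly two different behaviours in the low-frequency part of the spectrum, where, besides the usual radiometric noise, a signature of noise induced by thermal fluctuations appears.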

The white noise level is, as before, computed as a simple average of the noise spectrum over the last 10% of the frequency bins. This percentage works well for almost all radiometers at 44 and 70 GHz, but it is quite delicate for the 30 GHz radiometers, which show typical knee-frequency values around 100 mHz and therefore require a smaller percentage to get an unbiased white noise estimate. Once the white noise is computed, the code creates Markov chains for the other parameters. After discarding the burn-in period of the chains, we obtain the expected value and variance of each sampled noise parameter directly from the distribution of the chain samples.
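The white noise estimate and the single-slope noise model can be sketched as below. The sketch assumes the spectrum is ordered by increasing frequency, so that the last bins are the highest frequencies, and that the model is white noise plus a single power law with a negative slope; the function names and the spectrum values are invented.

```python
def white_noise_level(spectrum, fraction=0.10):
    """Mean of the spectrum over the last `fraction` of the frequency bins."""
    n = max(1, int(len(spectrum) * fraction))
    tail = spectrum[-n:]
    return sum(tail) / len(tail)


def noise_model(f, sigma2, f_knee, beta):
    """White plus 1/f model: sigma2 * (1 + (f / f_knee) ** beta), with beta < 0."""
    return sigma2 * (1.0 + (f / f_knee) ** beta)


# Invented spectrum with 20 bins, flattening to a white noise plateau.
spectrum = [10.0, 6.0, 4.0, 3.0] + [2.0] * 16

sigma2 = white_noise_level(spectrum)
power_at_knee = noise_model(0.05, sigma2, 0.05, -1.0)
```

At the knee frequency the model returns twice the white noise level, which is the usual definition of the knee. For the 30 GHz case described above, a smaller `fraction` would be passed.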

The left panel of the following figure shows a typical spectrum at 70 GHz, with the simple log-periodogram fit (purple line) and the new MCMC-derived spectrum (blue line) superimposed. The right panel shows the distributions of the knee frequency and slope derived from the example spectrum.

### The final noise parameters

As already reported, we know that during nominal operations there was a quite dramatic change in the LFI, induced by the switch-over of the two sorption coolers. In particular, we expect to see the effect of the degradation of the performance of the first sorption cooler and of the onset of the redundant one.

In the following figure we report a set of noise frequency spectra for three LFI radiometers (LFI28M, LFI24S and LFI18M) from the beginning of operations until the time of the current data release. Some comments are in order. First of all, the white noise level is extremely stable in all three cases (and this is also true for all the LFI radiometers). The knee frequency and the low-frequency slope are also quite stable until OD 326. After that period the spectra show a noise increase and two slopes in the low-frequency part, which become more evident for the spectra around OD 366 and OD 466, when the first cooler starts to be less effective and produces low-frequency thermal noise. After the switch-over to the redundant cooler, the data (the very last spectrum) still show thermal noise at very low frequency. This behaviour is present in almost all radiometers, with different trends ranging from the small effect shown by LFI24S to the more prominent effect shown by LFI28M and LFI18M.

## References

1. Planck 2013 results: The Low Frequency Instrument data processing, Planck Collaboration 2013 II, A&A, in press, (2014).
2. Planck 2013 results: LFI Calibration, Planck Collaboration 2013 V, A&A, in press, (2014).

## Glossary

- LFI: (Planck) Low Frequency Instrument.
- OD (Operational Day): the definition is driven by geometric visibility, as an OD runs from the start of a DTCP (satellite Acquisition Of Signal) to the start of the next DTCP. Because different ground stations are used, and each tracks the spacecraft for a different length of time, the OD duration varies, but it is roughly one day.
- DPC: Data Processing Center.
- PSO: Planck Science Office.
- OBT: On-Board Time.
- ADC: analog to digital converter.
- DAE: LFI Data Acquisition Electronics.
- CMB: Cosmic Microwave Background.
- Calibration [LFI meaning]: absolute calibration refers to the 0th order calibration for each channel (a single number), while relative calibration refers to the component of the calibration that varies from pointing period to pointing period.