1. Introduction
A software package is being developed to investigate the method of Bayesian Reconstruction through Adaptive Image Notion (BRAIN) [1] in support of and to enhance the interferometric (IF) imaging procedure. Extending the works of [2,3], the software kit makes use of Gaussian statistics for joint source detection and background estimation through a probabilistic mixture model technique. The background is defined as the small-scale perturbation of the wanted signal, i.e., the celestial sources. As shown in [3], statistics is rigorously applied throughout the algorithm, so that pixels with low intensity can be handled optimally and accurately, without binning and loss of resolution. Following the work of [2], a 2-D adaptive kernel deconvolution method is employed to prevent spurious signal arising from the dirty beam. Detection of continuum, emission, and absorption lines is foreseen to occur without explicit continuum subtraction; instead, the information acquired on the continuum is propagated for line detection.
The novel algorithm aims at improving the CASA [4,5] imaging and deconvolution methods currently in use, especially for ALMA, but it is also applicable to next-generation instruments such as the ngVLA [6] and the SKA [7]. The software under development is designed to be as compatible with CASA as possible, making use of CASA tasks, data formats, and Python scripts. BRAIN aims to provide an advancement on current issues such as stopping thresholds, continuum subtraction, proper detection of extended emission, separation of point-like sources from diffuse emission, weak signals, and mosaics.
In the following sections, a brief description of the ALMA observatory, the acquired data, and radio synthesis imaging is provided. The most used software package for ALMA data analysis, i.e., CASA, is introduced, and the complex data transformation for the formation of interferometric images is shown. Last, the technique under development is outlined, highlighting the main features that differ from [3].
2. ALMA
Located at 5000 m altitude on the Chajnantor plateau (Chile), ALMA is an aperture synthesis telescope operating over a broad range of observing frequencies in the mm and submm regime of the electromagnetic spectrum, covering most of the wavelength range from 3.6 to 0.32 mm (84–950 GHz).
ALMA is characterized by 66 high-precision antennas and a flexible design: several pairs of antennas (baselines) build a single filled-aperture telescope whose spatial resolution depends on the configuration and observing frequency used. Two main antenna arrangements are identifiable: the 12-m Array and the Atacama Compact Array (ACA). First, with fifty relocatable antennas on 192 stations, the 12-m Array allows for baselines ranging from 15 m to 16 km. Second, the ACA is composed of twelve closely spaced 7-m antennas (7-m Array) and four 12-m antennas for single-dish observations, named the Total Power (TP) Array. The 7-m and TP arrays sample baselines in the (9–30) m and (0–12) m ranges, respectively. The recombination of data from the 7-m and TP arrays overcomes the well-known "zero spacing" problem [8], which is crucial during the observation of extended sources. The final resolution reached by these arrays is given by the ratio between the observational wavelength and the maximum baseline of a given configuration. Therefore, the 12-m Array is used for sensitive and high-resolution imaging, while the ACA is preferable for wide-field imaging of extended structures. The 12-m Array can reach a fine resolution of up to 20 mas at 230 GHz in the most extended configuration, while the ACA reaches a resolution of a few arcsec at 870 GHz: these numbers refer to the point spread function (PSF), or Full Width at Half Maximum (FWHM), of the synthesized beam, which is the inverse Fourier transform of the (weighted) $u$-$v$ sampling distribution.
ALMA delivers continuum images and spectral line cubes with frequency characterizing the third axis. The data cubes are provided with up to 7680 frequency channels, with the channel width indicating the spectral resolution (from 3.8 kHz to 15.6 MHz).
2.1. ALMA Images
The sky brightness distribution, collected by each antenna, is correlated for given baselines. The measurements obtained from the correlators in an array allow one to reconstruct the complex visibility function of the celestial source. The Van Cittert–Zernike theorem [9] provides the Fourier transform relation between the sky brightness distribution and the array response (or primary beam) [10].
In simplified form, the complex visibility can be stated as follows:

$$ V(u,v) = A\, e^{i\varphi} = \iint A_N(l,m)\, I(l,m)\, e^{-2\pi i (ul + vm)}\, \mathrm{d}l\, \mathrm{d}m \quad (1) $$

In aperture synthesis analysis, the goal is to solve Equation (1) for $I(l,m)$ in the image plane by measuring the complex-valued and calibrated $V(u,v)$. The coordinates $(u,v)$, measured in units of wavelength, characterize the spatial frequency domain. The $u$-$v$ plane is effectively composed of the projections of the baselines onto a plane perpendicular to the source direction. The calibrated $V$ and $I$ are given in units of flux density (1 Jy $= 10^{-26}$ W m$^{-2}$ Hz$^{-1}$) and of surface brightness (Jy/beam area), respectively. The antenna reception pattern $A_N(l,m)$ is relevant for the primary beam correction. The amplitude ($A$) and the phase ($\varphi$) inform us about the source brightness and its position relative to the phase center at spatial frequencies $u$ and $v$.
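To make the meaning of amplitude and phase concrete, the following minimal Python sketch (an illustration only, not part of BRAIN or CASA) evaluates Equation (1) for a single point source of flux density $S$ at offset $(l_0, m_0)$ from the phase center, for which the integral collapses to $V(u,v) = S\, e^{-2\pi i (u l_0 + v m_0)}$: the amplitude carries the source flux, while the phase encodes its position.

    import numpy as np

    # Point source of flux density S [Jy] at offset (l0, m0) [rad]
    # from the phase center; offsets of about +1 and -0.5 arcsec.
    S, l0, m0 = 2.0, 4.85e-6, -2.4e-6

    # One baseline's spatial frequencies (u, v), in units of wavelength.
    u, v = 5.0e4, 1.2e5

    # Equation (1) for a point source, with A_N ~ 1 near the center:
    V = S * np.exp(-2j * np.pi * (u * l0 + v * m0))

    amplitude, phase = np.abs(V), np.angle(V)  # [Jy], [rad]
    print(f"amplitude = {amplitude:.2f} Jy, phase = {phase:.3f} rad")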
2.2. The Data
A direct application of the inverse Fourier transform of $V(u,v)$ (Equation (1)) is desirable, but it is suitable only in the case of a complete sampling of the spatial frequency domain $(u,v)$. For a given configuration, the array provides a discrete number of baselines, the number of baselines being $N(N-1)/2$, where $N$ is the number of antennas. The sampling of the $u$-$v$ plane is consequently limited. Hence, the measured complex visibility function is corrupted by the sampling function: $S(u,v)\,V(u,v)$.
The sampling function $S(u,v)$, composed of sinusoids of various amplitude and phase, is characterized as an ensemble of delta functions accounting for Hermitian symmetry. The inverse Fourier transform of this ensemble of visibilities provides the dirty image $I^{D}$:

$$ I^{D}(l,m) = \mathcal{F}^{-1}\left\{ S(u,v)\, V(u,v) \right\} \quad (2) $$

Following the convolution theorem, Equation (2) can be written as follows:

$$ I^{D} = B \ast (A_N \cdot I), \qquad B = \mathcal{F}^{-1}\{ S \} \quad (3) $$

The dirty image is the convolution of the sky brightness, modified by the antenna primary beam, with the dirty beam $B$, whose central lobe defines the synthesized beam (FWHM). The true sky brightness reconstruction depends on the image fidelity, since the coverage of the $u$-$v$ plane is by definition incomplete. For this reason, the reconstruction of $I$ is challenging due to the artefacts introduced by the dirty beam $B$, the sparsity in the data, and the large data volume.
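A toy numerical illustration of Equations (2) and (3) can be obtained on a gridded $u$-$v$ plane (a sketch under simplifying assumptions: unit weights, no primary beam, an idealized annular sampling function): the dirty image is the inverse FFT of the masked visibilities, and the dirty beam is the inverse FFT of the mask itself.

    import numpy as np

    n = 256
    sky = np.zeros((n, n))
    sky[n // 2, n // 2] = 5.0        # point source [Jy]
    sky[90:100, 150:160] = 0.2       # patch of extended emission

    # Fully sampled visibilities: FFT of the sky (Equation (1), A_N = 1).
    V = np.fft.fftshift(np.fft.fft2(np.fft.ifftshift(sky)))

    # Incomplete u-v coverage: an annulus of measured spatial frequencies,
    # symmetric about the origin (Hermitian symmetry).
    uu, vv = np.meshgrid(np.arange(n) - n // 2, np.arange(n) - n // 2)
    S_uv = ((uu**2 + vv**2 > 10**2) & (uu**2 + vv**2 < 80**2)).astype(float)

    # Dirty image: inverse FFT of the sampled visibilities (Equation (2)).
    dirty = np.fft.fftshift(np.fft.ifft2(np.fft.ifftshift(S_uv * V))).real

    # Dirty beam: inverse FFT of the sampling function alone, so that
    # dirty = B * sky holds by the convolution theorem (Equation (3)).
    B = np.fft.fftshift(np.fft.ifft2(np.fft.ifftshift(S_uv))).real
    B /= B.max()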
3. Common Astronomy Software Applications
CASA is a suite of tools allowing for calibration and image analysis of radio astronomy data [11]. It is especially designed for the investigation of data observed with ALMA and the VLA [12].
The calibration process determines the net complex correction factors to be applied to each visibility. Corrections are applied to the data due to, e.g., temperature effects, atmospheric effects, and antenna gain-elevation dependencies. It includes solvers for basic gain ($A$, $\varphi$), bandpass, polarization effects, and antenna-based and baseline-based solutions. After calibration, image formation follows in three steps: weighting, convolutional resampling, and a Fourier transform. This is the general procedure employed by most systems over the last 30 years. In order to produce images with improved thermal noise and to customize the resolution, the calibrated visibilities undergo a variety of weighting schemes. The commonly used robust weighting [13] has the property of varying smoothly from natural (equal weight to all samples) to uniform (equal weight to each measured spatial frequency irrespective of sample density) based on the signal-to-noise ratio of the measurements and a pre-defined noise threshold parameter. The robust weighting scheme has the advantage of making the effective $u$-$v$ coverage as smooth as possible while allowing for sparse sampling, and of modifying the resolution and theoretical signal-to-noise of the image. Nonetheless, care has to be taken in the choice of the weighting scheme due to sidelobe effects. The weighted visibilities are resampled onto a regular $u$-$v$ grid (convolutional resampling) employing a gridding convolution function. A fast Fourier transform is applied to the resampled data, and corrections are introduced to transform the image into units of sky brightness. The resulting dirty image $I^{D}$ is deconvolved from the well-known dirty beam $B$. A model of the sky brightness distribution is obtained, allowing for the creation of the residual image, since the dirty image is the result of a convolution of the true sky brightness with the PSF of the instrument. The dirty image arises either from a single pointing or from mosaics. Last, CASA offers facilities for simulating observations.
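The following sketch illustrates the idea behind robust weighting, using the commonly quoted form of Briggs' formula (function and variable names are illustrative; CASA's internal implementation differs in its details): densely sampled $u$-$v$ cells are down-weighted by an amount controlled by the robust parameter $R$.

    import numpy as np

    def briggs_weights(w, cell, robust=0.5):
        """Illustrative Briggs/robust re-weighting of natural weights `w`.
        `cell` maps each visibility to its u-v grid cell; robust ~ -2
        approaches uniform weighting, robust ~ +2 approaches natural."""
        W = np.bincount(cell, weights=w)            # gridded weight density W_k
        f2 = (5.0 * 10.0**(-robust))**2 / (np.sum(W**2) / np.sum(w))
        return w / (1.0 + W[cell] * f2)             # down-weight crowded cells

    # Example: 1000 unit-weight visibilities crowded into a few cells.
    rng = np.random.default_rng(0)
    cell = rng.integers(0, 16, size=1000)
    w_robust = briggs_weights(np.ones(1000), cell, robust=0.5)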
3.1. Deconvolution
Most interferometric imaging in CASA is done using the tclean task. This task comprises several operating modes, allowing for the generation of images from visibilities and the reconstruction of a sky model. Both continuum images and spectral line cubes are handled. Image reconstruction employs an outer loop of major cycles and an inner loop of minor cycles, following the Cotton-Schwab CLEAN style [14]. The major cycle accounts for the transformation between the data and image domains. The minor cycle is designed to operate in the image domain. An iterative weighted $\chi^2$ minimization is implemented to solve the measurement equations.
Several algorithms for image reconstruction (deconvolvers) in the minor cycle are available, e.g., Hogbom [15], Clark [16], Multi-Scale [17], and Multi-Term [18]. Each minor cycle algorithm is characterized by its own optimization scheme, framework, and task interface, with the prerequisite of producing a model image as output. BRAIN has the potential to become a deconvolution algorithm within the tclean task.
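As an illustration of a minor cycle, the sketch below implements a simplified Hogbom-style loop operating purely in the image domain (a pedagogical sketch, not the tclean implementation): the residual peak is repeatedly found, a scaled copy of the dirty beam is subtracted there, and the corresponding flux is accumulated in the model image.

    import numpy as np

    def hogbom_minor_cycle(dirty, beam, gain=0.1, niter=200, threshold=0.0):
        """Simplified Hogbom CLEAN; `beam` is the dirty beam with odd
        side length and its peak at the central pixel."""
        model = np.zeros_like(dirty)
        res = dirty.copy()
        ny, nx = res.shape
        c = beam.shape[0] // 2                       # beam center index
        for _ in range(niter):
            y, x = np.unravel_index(np.argmax(np.abs(res)), res.shape)
            peak = res[y, x]
            if np.abs(peak) <= threshold:
                break
            model[y, x] += gain * peak               # accumulate model flux
            # Subtract the shifted, scaled beam (clipped at the edges).
            y0, y1 = max(0, y - c), min(ny, y + c + 1)
            x0, x1 = max(0, x - c), min(nx, x + c + 1)
            res[y0:y1, x0:x1] -= gain * peak * beam[
                c - (y - y0):c + (y1 - y), c - (x - x0):c + (x1 - x)]
        return model, res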
3.2. Simulated ALMA Data
The CASA simulator takes as input a sky model to create customized ALMA interferometric or total power observations, including multiple configurations of the 12-m Array. The CASA simulator is characterized by two main tasks: simobserve and simanalyze. The task simobserve is used to create a model image (or component list), i.e., a representation of the simulated sky brightness distribution, and the data. The dirty image is created with the task simanalyze.
In Figure 1, 13 point-like and extended sources are simulated at 97 GHz with the 12-m Array for an integration time of 30 s and a total observing time of 1 h. The simulated sources are characterized by a Gaussian flux distribution in the range 0.2–5.0 Jy. Low thermal noise is introduced. The image covers 10 arcsec on a side. These and more sophisticated simulated data are planned to be used for the feasibility study.
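A setup along these lines can be scripted with the two simulator tasks; the sketch below (with illustrative file names and parameter values, to be adapted to the actual configuration) creates a simulated 12-m Array observation at 97 GHz and images it, run from within a CASA session:

    # Run inside a CASA session, where simobserve/simanalyze are defined.
    simobserve(project='brain_sim',
               skymodel='sources_model.fits',     # illustrative sky model image
               incenter='97GHz', inwidth='2GHz',
               integration='30s', totaltime='3600s',
               antennalist='alma.cycle8.3.cfg',   # a 12-m Array configuration
               thermalnoise='tsys-atm')           # adds (low) thermal noise

    simanalyze(project='brain_sim', image=True,
               imsize=[256, 256], niter=0)        # niter=0 keeps the dirty image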
4. BRAIN
An ALMA dirty image (Equation (3)) is characterized either by continuum or by line detection for a given bandwidth. The detected celestial source intensities can be in absorption or emission, with negative or positive values, respectively. We assume that the data went through a robust calibration process. Gaussian statistics describes the data distribution. Nonetheless, efforts are also sought to account for glitches in the calibration process. For those cases, BRAIN is planned to implement an improvement in the data modelling, accounting for a Gaussian distribution with a longer tail. The data set in image space is $\{d_{ij}\}$, given in Jansky per pixel cell $(i,j)$. Following the work of [3], an astronomical image consists of a composition of background and celestial signals. The background is defined as the small-scale perturbation of the wanted celestial signal. Two complementary hypotheses are introduced:
Hypothesis $H_{ij}$ specifies that the data $d_{ij}$ consist only of background intensity $b_{ij}$ spoiled with noise $\epsilon_{ij}$, i.e., the (statistical) uncertainty associated with the measurement process. Hypothesis $\overline{H}_{ij}$ specifies the case where additional source intensity $s_{ij}$ contributes to the background:

$$ H_{ij}:\; d_{ij} = b_{ij} + \epsilon_{ij}, \qquad \overline{H}_{ij}:\; d_{ij} = b_{ij} + s_{ij} + \epsilon_{ij}. $$

Additional assumptions are that negative and positive values for source and background amplitudes are allowed, and that the background is on average smoother than the source signal.
The background signal is propagated from the noise on the visibilities, taking into account that the Fourier transform is a linear combination of the visibilities with some rotation (phase factor) applied. The noise on the visibilities is mainly introduced by, e.g., cosmic, sky, and instrumental (due to receiver, single baseline or antenna, total collecting area, autocorrelations) signals. It is expected that the real and imaginary parts of the visibility noise are uncorrelated, with the phase factor not affecting the noise. Moreover, the individual visibilities are combined at the phase center and weighted; gridding correction and primary beam correction increase the noise in the image. The noise in ALMA images is not uniform, increasing towards the edge. Mosaics of images are particularly cumbersome.
The background amplitude is modelled with a thin-plate spline, with the support points chosen sparsely in order not to fit the sources [3]. The background model takes into account the dirty beam information, in order to deconvolve the dirty beam from the dirty image. The spline model is under further development to allow for a more flexible design. The supporting points are analysed to account for a dynamic setting, where the number and positions of the supporting points are chosen on the basis of the data. This work follows [2], where this approach was successfully developed for experimental spectra.
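As a self-contained illustration of a thin-plate-spline background model (a sketch using SciPy's RBFInterpolator rather than BRAIN's own spline implementation; here the sparse support points are taken on a simple regular grid rather than chosen dynamically):

    import numpy as np
    from scipy.interpolate import RBFInterpolator

    def tps_background(image, step=32, smoothing=1.0):
        """Fit a smooth thin-plate-spline background to `image` using a
        sparse regular grid of support points (so sources are not fitted)."""
        ny, nx = image.shape
        yy, xx = np.mgrid[step // 2:ny:step, step // 2:nx:step]
        pts = np.column_stack([yy.ravel(), xx.ravel()])
        vals = image[yy, xx].ravel()
        tps = RBFInterpolator(pts, vals, kernel='thin_plate_spline',
                              smoothing=smoothing)
        grid = np.mgrid[0:ny, 0:nx].reshape(2, -1).T
        return tps(grid).reshape(ny, nx)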
Estimates of the hypotheses $H_{ij}$ and $\overline{H}_{ij}$ are the direct effort of this analysis. The likelihood probability for the hypothesis $H_{ij}$ within Gaussian statistics is:

$$ p(d_{ij} \mid b_{ij}, H_{ij}) = \frac{1}{\sqrt{2\pi}\,\sigma_{ij}} \exp\left\{ -\frac{(d_{ij} - b_{ij})^2}{2\sigma_{ij}^2} \right\} \quad (4) $$

where $\sigma_{ij}$ is the noise standard deviation in pixel $(i,j)$.
For the alternative hypothesis, a similar equation is applied, with the signal contribution included. Similar to what was done in [2,3], the signal is considered a nuisance parameter. Following the Maximum Entropy principle, a two-sided exponential function is chosen to describe the mean source intensity in the field ($s_{ij}$):

$$ p(s_{ij}) = \frac{1}{\beta_1 + \beta_2} \begin{cases} \exp\left(-s_{ij}/\beta_1\right) & s_{ij} \geq 0 \\ \exp\left(s_{ij}/\beta_2\right) & s_{ij} < 0 \end{cases} \quad (5) $$
The parameters $\beta_1$ and $\beta_2$ are positive values, allowing one to introduce two different scales for the signal depending on its sign. The likelihood for the hypothesis $\overline{H}_{ij}$ is given following the marginalization rule:

$$ p(d_{ij} \mid b_{ij}, \overline{H}_{ij}) = \int p(d_{ij} \mid b_{ij}, s_{ij})\, p(s_{ij})\, \mathrm{d}s_{ij} \quad (6) $$
The data are modelled by a two-component mixture distribution in the parameter space within the Bayesian framework. In this way, background and sources with their respective uncertainties are jointly detected. The likelihood for the mixture model combines the probability distributions for the two hypotheses, $H_{ij}$ and $\overline{H}_{ij}$:

$$ p(\boldsymbol{d} \mid \boldsymbol{b}, \beta_1, \beta_2) = \prod_{i,j} \left[ p(H_{ij})\, p(d_{ij} \mid b_{ij}, H_{ij}) + p(\overline{H}_{ij})\, p(d_{ij} \mid b_{ij}, \overline{H}_{ij}) \right] \quad (7) $$

where $\boldsymbol{d}$ and $\boldsymbol{b}$ collect the data and background amplitudes, and the product over $(i,j)$ corresponds to the pixels of the complete field. The prior pdfs $p(H_{ij})$ and $p(\overline{H}_{ij})$ for the two complementary hypotheses describe the prior knowledge of having background only or an additional signal contribution, respectively, in a pixel. These prior pdfs are chosen to be constant, independent of $i$ and $j$: $p(H_{ij}) = p$ and $p(\overline{H}_{ij}) = 1 - p$. The likelihood for the mixture model allows us to estimate the parameters entering the models from the data. The ultimate goal is to detect sources independently of their shape and intensity and to provide a robust uncertainty quantification. Therefore, Bayes' theorem is used to estimate the probability of the hypothesis $\overline{H}_{ij}$ for detecting celestial sources.
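Putting Equations (4)–(7) together, the per-pixel source probability follows from Bayes' theorem. The sketch below (an illustration assuming a known background $b$ and noise level $\sigma$, with the marginalization of Equation (6) carried out numerically on a grid; practical for small fields only) returns the probability map of the hypothesis $\overline{H}_{ij}$:

    import numpy as np

    def source_probability(d, b, sigma, beta1, beta2, p=0.5):
        """Posterior probability of source presence per pixel,
        p(Hbar_ij | d_ij), for the two-component mixture model."""
        # Grid over the nuisance source intensity s (must resolve sigma).
        s = np.linspace(-10 * beta2, 10 * beta1, 2001)
        ds = s[1] - s[0]
        # Two-sided exponential prior, Equation (5).
        prior_s = np.empty_like(s)
        pos = s >= 0
        prior_s[pos] = np.exp(-s[pos] / beta1)
        prior_s[~pos] = np.exp(s[~pos] / beta2)
        prior_s /= beta1 + beta2
        r = d - b
        norm = np.sqrt(2 * np.pi) * sigma
        # Gaussian likelihood of background only, Equation (4).
        like_H = np.exp(-r**2 / (2 * sigma**2)) / norm
        # Marginal likelihood with source contribution, Equation (6).
        gauss = np.exp(-(r[..., None] - s)**2 / (2 * sigma**2)) / norm
        like_Hbar = np.sum(gauss * prior_s, axis=-1) * ds
        # Bayes' theorem on the mixture, Equation (7), with p(H_ij) = p.
        num = (1 - p) * like_Hbar
        return num / (p * like_H + num)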
5. Concluding Remarks
Work on this project is ongoing. The novel technique, applied to ALMA interferometric data, is designed to create probability maps of source detection, allowing for a joint estimate of background and sources. A 2-D adaptive kernel deconvolution method will strengthen the deconvolution of the dirty beam from the dirty image, reducing contamination. BRAIN aims to provide an alternative technique for CASA image analysis. It is foreseen that a robust solution is produced as the model image. The next effort is directed at the simulated data. Moreover, due to the statistical approach employed, BRAIN is also foreseen to be applied to data re-combination. In fact, advanced methods employing Bayesian probability theory have the potential to address at best the analysis of short-spacing single-dish data combined with those from an interferometer, as provided by the ALMA design.