The busy function: a new analytic function for describing the integrated 21-cm spectral profile of galaxies

Previous attempts to describe the integrated H i spectra of galaxies. Left: example of a spectral profile consisting of two Hermite functions as introduced by Saintonge (2007) and defined in equation (1). The parameters used in creating this example are a₀ = 1, c = 0.3 and σ = 2. Right: example of the profile shape introduced by Obreschkow et al. (2009a,b) as specified in equation (2). The parameters in this case are k₁ = 4.5, k₂ = 1, k₃ = 0.9, k₄ = 25 and k₅ = 2.

While providing a fairly accurate description of the sharp and narrow peaks and broad troughs of typical galaxy spectra, there are a few disadvantages with this approach. First, the profile gets broken up into two separate functions that would have to be fitted separately. Secondly, precise adjustment of the parameters is required to avoid creating large discontinuities at the boundary between the two functions. Another complication arises from the function used to create the central trough of the profile, which is undefined for x² ≥ k₄; this may pose a problem for fitting algorithms. In contrast to the profile used by Saintonge (2007), the profile of Obreschkow et al. (2009a,b) can reproduce the steep flanks and sharp peaks of observed galaxy spectra by decoupling the width of the Gaussian component from the overall profile width.

THE BUSY FUNCTION

Definition

In order to improve on previous attempts to describe the double-horn profile of galaxies, we looked for a function that would allow us to describe the steep flanks often seen in the spectra of galaxies while also recovering the characteristic trough and sharp, narrow peaks of the spectrum. In addition, we require the function to be continuous and differentiable for the purpose of least-squares fitting, for which calculation of the partial derivatives with respect to the function's free parameters, |$\mathrm{\partial} f(x, \boldsymbol {p}) / \mathrm{\partial} p_{i}$|⁠, is required. Such a function can indeed be constructed in a relatively simple fashion by multiplying two error functions and a parabola,

\begin{eqnarray} B_{0}(\xi ) &=& \displaystyle\frac{a{}}{4} \times (\mathrm{erf}[b{} \lbrace w + \xi \rbrace ] + 1) \nonumber \\ && \times (\mathrm{erf}[b{} \lbrace w - \xi \rbrace ] + 1) \times \left( c{} \, \xi ^{2} + 1 \right) \!, \end{eqnarray}

(3)

with ξ ≡ x − x₀. Here, the variable x represents the spectral axis of the data, e.g. frequency or radial velocity. The two error functions, erf(x), generate the profile flanks, while multiplication with a parabola produces the central trough of the profile. Given its characteristics and versatility, we chose to call this function the ‘busy function’. Examples of the busy function with different parameter values are shown in Fig. 2. In this section, we will first discuss the fundamental properties of the basic form of the busy function as specified in equation (3) before introducing a more general, asymmetric form of the function in Section 4.

Figure 2.

The busy function, B0, for different values of c (left) and b (right). The left-hand panel shows the situation for w = 5, a = 0.32, b = 2 and varying c. The right-hand panel depicts w = 3, a = 0.9, c = 0 and varying b.

The busy function, B₀, for different values of c (left) and b (right). The left-hand panel shows the situation for w = 5, a = 0.32, b = 2 and varying c. The right-hand panel depicts w = 3, a = 0.9, c = 0 and varying b.

Free parameters

The busy function is characterized by five free parameters: the centroid of the profile, x₀; the half-width of the profile, w; the total amplitude scaling factor, a; and two additional parameters, b and c. The parameter b controls the steepness of the two error functions constituting the flanks of the spectrum. In the case of b → ∞, the flanks will become infinitely steep, whereas for b → 0, the slope of the flanks will approach zero. The parameter c controls the emphasis of the parabola and hence the amplitude of the central trough. Values of c > 0 imply increasing amplitudes of the trough, whereas for c = 0, the trough will disappear altogether. Negative values of a, b, c and w, while mathematically allowed, are not physically meaningful in the case of H i emission lines, but may be useful in other situations not considered here, such as absorption lines.

General properties

Some of the analytic solutions to the busy function are elaborated and presented in Appendix A. For profiles with bw ≫ 1 (i.e. flat-topped or double-horn profiles), the value at the centre of the profile is simply given by B₀(0) = a. The profile's half-width, w, is equal to half the separation of the two error functions, and hence the full width at half-maximum of the profile is equal to 2w in the case of c = 0 and bw ≫ 1.

An advantage of the busy function, and the motivation for its name, is its versatility when it comes to fitting spectral profiles of different shape. Examples of the busy function mimicking double-peaked profiles of different shape are presented in the left-hand panel of Fig. 2. By carefully choosing appropriate values for b, c and w, almost any shape of (symmetric) double-peaked profile can be reproduced by the function. By using error functions to represent the flanks of the spectrum, we can reproduce the characteristic, steep rise of the spectral profile typically observed in the integrated H i spectra of disc galaxies.

The flexibility of the busy function goes well beyond the fitting of double-peaked profiles, as is illustrated in the right-hand panel of Fig. 2. Here, the parameter c was set to zero to entirely remove the parabolic component and, in combination with different values of b, produces profiles of varying shape ranging from a steep top-hat function (with large b) to a Gaussian function with gradual slopes on both sides (using smaller values of b). In fact, with the right choice of parameters, the product of two error functions will take almost the exact shape of a Gaussian function (see Appendix B). Hence, the busy function is capable of fitting both Gaussian and double-peaked line profiles.

MODIFICATIONS OF THE BUSY FUNCTION

Generalized busy function

In its original form of equation (3), the busy function is symmetric, and the shape of its central trough is determined by the parabolic component. However, at the expense of additional free parameters, the busy function can be generalized to describe a wider range of spectral profiles, e.g. profiles that are not symmetric or have a differently shaped trough. In a more general form, the busy function can be written as

\begin{eqnarray} B_{1}(x) &=& \displaystyle\frac{a{}}{4} \times (\mathrm{erf}[b{}_{1} \lbrace w + x - x_{\rm e} \rbrace ] + 1) \nonumber \\ && \times\, (\mathrm{erf}[b{}_{2} \lbrace w - x + x_{\rm e} \rbrace ] + 1) \times \left( c{} \, |x - x_{\rm p}|^{n} + 1 \right) \!. \end{eqnarray}

(4)

The number of free parameters in this case has increased from 5 to 8, including separate slopes, b₁ and b₂, for the two error functions, separate offsets, x_e and x_p, for the error functions and the polynomial, and a variable degree, n, of the polynomial. The properties of the generalized busy function are similar to those of the original busy function and discussed in Appendix A.

An example of an asymmetric, generalized busy function with a fourth-degree polynomial trough (n = 4) is depicted in the left-hand panel of Fig. 3. The significantly broader trough more closely resembles those typically found in the spectra of many spiral galaxies.

Figure 3.

Left: example of the asymmetric, generalized busy function, B1(x), for b1 = 1, b2 = 1.5, c = 0.0015, xe = 0, xp = −0.2 and n = 4. Right: example of the simplified busy function, B2(ξ), for b = 0.2 and c = 0.045. In both cases, a = 0.4 and w = 6.

Left: example of the asymmetric, generalized busy function, B₁(x), for b₁ = 1, b₂ = 1.5, c = 0.0015, x_e = 0, x_p = −0.2 and n = 4. Right: example of the simplified busy function, B₂(ξ), for b = 0.2 and c = 0.045. In both cases, a = 0.4 and w = 6.

Simplified busy function

A simplification of the busy function can be achieved by replacing the two error functions, erf(ξ), with just a single error function with the argument, ξ, squared. While the shape and properties of the resulting function are very similar to the original busy function, the expressions for the simplified busy function, and in particular its derivatives, are less complex and significantly shorter:

\begin{equation} B_{2}(\xi ) = \frac{a{}}{2} \times \left(\mathrm{erf} \! \left[ b{} \left\lbrace w^{2} - \xi ^{2} \right\rbrace \right] + 1 \right) \times \left( c{} \, \xi ^{2} + 1 \right). \end{equation}

(5)

As before, we define ξ ≡ x − x₀ for simplicity. This function, as depicted in the right-hand panel of Fig. 3, is very similar to the original busy function as specified in equation (3). Like B₀, the simplified busy function can be generalized by introducing an independent x₀ for the parabola or by replacing the parabolic trough with a different function, e.g. a fourth-degree polynomial. The slopes of the two flanks, however, will always be the same.

Some of the analytic solutions to the simplified busy function are discussed in Appendix A. Unlike B₀, there is no combination of parameters for which the profile would closely resemble a Gaussian function. In the best approximation, B₂ will appear more compact than a Gaussian, having steeper flanks and a slightly smaller amplitude.

EXAMPLES

In this section, we present a few examples of fitting the busy function to the H i spectra of observed galaxies to illustrate its usefulness and flexibility.

Symmetric profiles

In Fig. 4, we show integrated H i spectra of the two spiral galaxies NGC 300 (Westmeier, Braun & Koribalski 2011) and NGC 3351/M95 (Walter et al. 2008). All three versions of the busy function, B₀, B₁ and B₂, have been fitted to the spectra using a χ² minimization algorithm. In the case of the generalized busy function, B₁, we assumed symmetry (b₁ = b₂, x_e = x_p), but used a fourth-degree polynomial (n = 4) to generate a broader trough.

Figure 4.

Top panels: fits of symmetric versions of the busy functions B0, B1 and B2 to the integrated H i spectra of NGC 300 (left) and NGC 3351 (right). In the case of B1, we assumed a polynomial trough of degree n = 4. The bottom panels show the residuals between the data and the fits.

Top panels: fits of symmetric versions of the busy functions B₀, B₁ and B₂ to the integrated H i spectra of NGC 300 (left) and NGC 3351 (right). In the case of B₁, we assumed a polynomial trough of degree n = 4. The bottom panels show the residuals between the data and the fits.

NGC 300 is an example of a medium-sized spiral galaxy with a symmetric H i spectrum and a relatively sharp, almost V-shaped trough that is equally well described by either the original busy function, B₀, or the simplified busy function, B₂, both of which use a second-degree polynomial to describe the trough. In contrast to these, the much wider fourth-degree polynomial trough chosen for the generalized busy function, B₁, does not describe the appearance of NGC 300 very well. This is obvious from the residuals between the data and model, which are significantly larger (σ_rms = 0.46 Jy) for B₁ as compared to the other two fits (σ_rms ≈ 0.32 Jy).

Note that the residuals in this and all following examples are due to real structures and asymmetries in the galaxies themselves and much larger than expected from the baseline noise of the integrated spectra alone.

NGC 3351 (M95) is an example of a galaxy with a broad trough. In this case, the generalized busy function, B₁, with its fourth-degree polynomial provides a much better fit (σ_rms = 8.8 mJy) to the integrated H i spectrum than the busy functions B₀ and B₂ (σ_rms ≈ 21.2 mJy).

Asymmetric profiles

As discussed in Section 4, the busy function can be easily generalized to fit asymmetric profiles by simply introducing a separate offset for the polynomial component describing the trough. Two examples of asymmetric busy functions fitted to the integrated H i spectra of the two galaxies NGC 55 (Westmeier, Koribalski & Braun 2013) and NGC 4826 (M64; Walter et al. 2008) are presented in Fig. 5. Here, we fitted two versions of the generalized busy function, B₁, to the spectrum, this time including separate offsets, x_e and x_p, for the error functions and the polynomial trough. This allows the trough to shift with respect to the flanks of the spectrum, producing an asymmetric profile with peaks of different height. The two functions use polynomial degrees of n = 2 and 4, respectively, while adopting a single slope for the two flanks (b₁ = b₂).

Figure 5.

Top panels: fits of two asymmetric versions (xe ≠ xp, but b1 = b2) of the generalized busy function, B1, with different polynomial troughs of degree n = 2 and 4 to the integrated H i spectra of NGC 55 (left) and NGC 4826 (right). The bottom panels show the residuals between the data and the fits.

Top panels: fits of two asymmetric versions (x_e ≠ x_p, but b₁ = b₂) of the generalized busy function, B₁, with different polynomial troughs of degree n = 2 and 4 to the integrated H i spectra of NGC 55 (left) and NGC 4826 (right). The bottom panels show the residuals between the data and the fits.

The integrated spectrum of NGC 55 resembles that of NGC 300 with the exception of a noticeable asymmetry that is well fitted by the generalized busy function. As in the case of NGC 300, the parabolic trough (n = 2) fits better (σ_rms = 0.41 Jy) than the fourth-degree polynomial (σ_rms = 0.62 Jy). Within their uncertainties, the offsets significantly differ between the overall spectrum described by the error functions, x_e = 119.4 ± 0.4 km s⁻¹, and the parabolic trough, x_p = 110.6 ± 1.3 km s⁻¹, thereby quantitatively confirming the intrinsic asymmetry of the spectrum.

The spectrum of NGC 4826 is clearly asymmetric, too. As before, both versions of the generalized busy function provide a good fit to the spectrum, although the fourth-degree polynomial is capable of fitting the broad trough and sharp peaks in the spectrum much better (σ_rms = 14.0 mJy) than the parabolic trough (σ_rms = 17.3 mJy). Again, there is a significant difference between the location of the overall spectrum, x_e = 409.3 ± 0.7 km s⁻¹, and that of the fourth-degree polynomial trough, x_p = 391.5 ± 2.4 km s⁻¹, confirming the intrinsic asymmetry of the spectrum.

PRACTICAL APPLICATION

In this section, we present the practical application of busy function fitting to the integrated spectra of the HIPASS BGC sources (Koribalski et al. 2004). We present both the methodology and the results. Most of the sources in the BGC are individual galaxies with a unique optical counterpart, although 68 sources are flagged as confused, 44 identified as pairs and 11 identified as compact groups. 91 detections do not have an optical identification, mostly as a result of Galactic foreground extinction.

Method

We developed our own software to fit the busy function to all 1000 galaxies in the BGC. The technical details of our implementation are presented in Appendix C. Our software attempts to fit six variants of the busy function with a varying number of free parameters, using the Levenberg–Marquardt algorithm (LMA; Levenberg 1944; Marquardt 1963). The software then selects the best of these fits based on the Akaike information criterion (AIC; Akaike 1974) in an attempt to determine the optimal number of free parameters needed to describe the data. A key feature of our method is the use of multiple (several thousand), short duration (tens of iterations) LMA attempts for each model, each attempt starting from a randomly chosen position in parameter space. This implementation is available in the form of c and c++ libraries and as a python module (see Appendix C for details). The advantages of our implementation are as follows.

Model fits do not suffer from parameter discretization.
The method ensures rapid and efficient exploration of multidimensional parameter spaces.
A covariance matrix is produced for the model fit, thus providing parameter uncertainties as well as correlations.
The likelihood of finding the best global fit is higher.
The code takes advantage of multicore CPUs, distributed systems, and GPUs.
The use of model variants allows us to set model components to exactly zero (true zeroing). This is otherwise impossible in practice because of the limited numerical precision of computers.
The algorithm takes into account the uncertainties of individual data points, resulting in data points with large uncertainties to be effectively excluded from fitting.
The software can easily be made to fit any analytic model or function.

Our implementation of the busy-function-fitting algorithm can be obtained from a dedicated website¹ as a c library, a c++ template library and a python module.

Results

We successfully fitted the busy function to all 1000 HIPASS BGC sources. We accomplished this by using an iterative strategy. In each iteration, we fitted the busy function to all BGC sources that were not successfully fitted in the previous iteration, each time using different random LMA starting positions in parameter space. The details of this procedure are explained in Appendix C. Success or failure of a fit was assessed by checking whether the reduced χ² of the fit was reasonable. In addition, the quality and accuracy of the fit was visually confirmed. The success rate was about 85 per cent for each iteration, and in the fourth and fifth iterations we were only processing two and a single BGC source, respectively. The success of this iterative approach confirms that the failures in any iteration are purely a result of poor LMA starting positions, because they are randomly chosen in each iteration. This also confirms our expectation that we could improve the success rate of each iteration, by increasing the number of LMA starting positions (at an increased computational cost). Alternatively, initial estimates of the free parameters, e.g. from a preceding source finding run, can be used instead of random starting positions in parameter space, thereby avoiding the need for multiple iterations altogether.

To test the performance of the busy function in the case of spectra with lower signal-to-noise ratio, we injected additional noise into the HIPASS BGC spectra to generate spectra with a peak signal-to-noise ratio of 3 and 5. We then refitted the busy function, using the same iterative procedure based on visual inspection. The success rate for both sets of noisier BGC spectra was approximately 90 per cent. After six iterations, we were able to fit a ‘sensible’ busy function (as qualitatively judged by us) to every noisy BGC spectrum.

In Table 1 and Fig. 6, we compare the catalogued observational properties of BGC sources against those derived from the busy function fits, allowing us to infer the quality of the fits. The catalogued parameters simply serve as a reference and are not necessarily unbiased or more accurate than our fitting results. We compare the integrated H i flux (F_int), peak H i flux density (F_peak) and the spectral linewidths at 20 and 50 per cent of the peak flux density (w₂₀ and w₅₀, respectively). The comparison was carried out using both a busy function sampled at the same spectral resolution as the data (SD) and a high-definition (HD) busy function sampled at 100 times the data's spectral resolution. It turns out that there is no discernible benefit to using the HD busy function over the SD busy function, and hence only the SD results are listed in Table 1. We believe that this is a result of the catalogued properties being derived directly from the data. We also recalculated the observational properties directly from the data, following the same approach as described in section 3.3 of the BGC paper (Koribalski et al. 2004). These ‘direct’ properties are used as a sanity check and are calculated in the same manner as the catalogued properties, but using the channel range within which the fitted busy function is ≥1 per cent of its peak value. Any differences between the direct and catalogued properties should solely be due to differences in the channel range used in the measurement.

Figure 6.

Comparison of the integrated flux, peak flux density, and w50/w20 linewidths of the HIPASS BGC galaxies as derived from the busy function fit (ordinate) with the original values listed in the BGC (abscissa; parameters marked with tilde). The solid, grey line represents the identity, while the dashed, red line is the result of a linear regression carried out in linear space (see Table 1).

Comparison of the integrated flux, peak flux density, and w₅₀/w₂₀ linewidths of the HIPASS BGC galaxies as derived from the busy function fit (ordinate) with the original values listed in the BGC (abscissa; parameters marked with tilde). The solid, grey line represents the identity, while the dashed, red line is the result of a linear regression carried out in linear space (see Table 1).

Table 1.

Comparison of the catalogued and calculated HIPASS BGC properties for direct extraction of parameters from the spectrum as well as parametrization based on busy function fitting. For each parameter and signal-to-noise ratio, the table lists two different components of the comparison: the result of a linear regression with the catalogued parameter as the independent variable (best-fitting slope and intercept; see Fig. 6) and the fraction of parameters within a certain percentage of the original, catalogued values.

	Original		S/N = 5		S/N = 3
	Direct	BF fit	Direct	BF fit	Direct	BF fit
	Integrated flux (F_int)
Best-fitting slope	1.02	1.03	0.96	1.05	0.92	1.07
Best-fitting intercept (Jy km s⁻¹)	−1.08	−1.35	2.01	−1.02	4.49	0.14
Within 5 per cent of cat. (per cent)	89.8	83.3	35.8	37.1	22.3	23.2
Within 10 per cent of cat. (per cent)	96.9	95.5	65.6	66.9	42.5	43.2
Within 25 per cent of cat. (per cent)	98.9	99.1	93.2	91.4	77.4	76.4
	Peak flux density (F_peak)
Best-fitting slope	1.00	0.99	1.14	1.04	1.27	1.09
Best-fitting intercept (Jy)	0.00	0.00	−0.01	−0.01	0.01	0.00
Within 5 per cent of cat. (per cent)	99.0	68.6	21.9	28.3	5.4	18.1
Within 10 per cent of cat. (per cent)	99.2	88.0	43.3	54.5	14.7	32.8
Within 25 per cent of cat. (per cent)	99.7	99.1	86.3	91.9	47.1	72.2
	Linewidth (w₅₀)
Best-fitting slope	0.98	0.99	0.94	0.95	0.86	0.87
Best-fitting intercept (km s⁻¹)	0.64	1.47	12.9	6.39	60.2	21.7
Within 5 per cent of cat. (per cent)	99.4	86.6	46.4	50.3	25.1	29.0
Within 10 per cent of cat. (per cent)	99.4	94.0	69.4	72.2	42.4	47.8
Within 25 per cent of cat. (per cent)	99.6	97.7	86.0	89.3	63.2	72.1
	Linewidth (w₂₀)
Best-fitting slope	0.96	0.96	0.90	0.87	0.89	0.80
Best-fitting intercept (km s⁻¹)	8.7	5.6	62.2	39.4	94.9	83.7
Within 5 per cent of cat. (per cent)	94.0	86.0	34.6	47.7	20.7	26.5
Within 10 per cent of cat. (per cent)	94.6	93.2	53.0	68.1	36.8	45.4
Within 25 per cent of cat. (per cent)	96.7	97.9	72.2	82.0	56.7	65.0

	Original		S/N = 5		S/N = 3
	Direct	BF fit	Direct	BF fit	Direct	BF fit
	Integrated flux (F_int)
Best-fitting slope	1.02	1.03	0.96	1.05	0.92	1.07
Best-fitting intercept (Jy km s⁻¹)	−1.08	−1.35	2.01	−1.02	4.49	0.14
Within 5 per cent of cat. (per cent)	89.8	83.3	35.8	37.1	22.3	23.2
Within 10 per cent of cat. (per cent)	96.9	95.5	65.6	66.9	42.5	43.2
Within 25 per cent of cat. (per cent)	98.9	99.1	93.2	91.4	77.4	76.4
	Peak flux density (F_peak)
Best-fitting slope	1.00	0.99	1.14	1.04	1.27	1.09
Best-fitting intercept (Jy)	0.00	0.00	−0.01	−0.01	0.01	0.00
Within 5 per cent of cat. (per cent)	99.0	68.6	21.9	28.3	5.4	18.1
Within 10 per cent of cat. (per cent)	99.2	88.0	43.3	54.5	14.7	32.8
Within 25 per cent of cat. (per cent)	99.7	99.1	86.3	91.9	47.1	72.2
	Linewidth (w₅₀)
Best-fitting slope	0.98	0.99	0.94	0.95	0.86	0.87
Best-fitting intercept (km s⁻¹)	0.64	1.47	12.9	6.39	60.2	21.7
Within 5 per cent of cat. (per cent)	99.4	86.6	46.4	50.3	25.1	29.0
Within 10 per cent of cat. (per cent)	99.4	94.0	69.4	72.2	42.4	47.8
Within 25 per cent of cat. (per cent)	99.6	97.7	86.0	89.3	63.2	72.1
	Linewidth (w₂₀)
Best-fitting slope	0.96	0.96	0.90	0.87	0.89	0.80
Best-fitting intercept (km s⁻¹)	8.7	5.6	62.2	39.4	94.9	83.7
Within 5 per cent of cat. (per cent)	94.0	86.0	34.6	47.7	20.7	26.5
Within 10 per cent of cat. (per cent)	94.6	93.2	53.0	68.1	36.8	45.4
Within 25 per cent of cat. (per cent)	96.7	97.9	72.2	82.0	56.7	65.0

Table 1.

Comparison of the catalogued and calculated HIPASS BGC properties for direct extraction of parameters from the spectrum as well as parametrization based on busy function fitting. For each parameter and signal-to-noise ratio, the table lists two different components of the comparison: the result of a linear regression with the catalogued parameter as the independent variable (best-fitting slope and intercept; see Fig. 6) and the fraction of parameters within a certain percentage of the original, catalogued values.

	Original		S/N = 5		S/N = 3
	Direct	BF fit	Direct	BF fit	Direct	BF fit
	Integrated flux (F_int)
Best-fitting slope	1.02	1.03	0.96	1.05	0.92	1.07
Best-fitting intercept (Jy km s⁻¹)	−1.08	−1.35	2.01	−1.02	4.49	0.14
Within 5 per cent of cat. (per cent)	89.8	83.3	35.8	37.1	22.3	23.2
Within 10 per cent of cat. (per cent)	96.9	95.5	65.6	66.9	42.5	43.2
Within 25 per cent of cat. (per cent)	98.9	99.1	93.2	91.4	77.4	76.4
	Peak flux density (F_peak)
Best-fitting slope	1.00	0.99	1.14	1.04	1.27	1.09
Best-fitting intercept (Jy)	0.00	0.00	−0.01	−0.01	0.01	0.00
Within 5 per cent of cat. (per cent)	99.0	68.6	21.9	28.3	5.4	18.1
Within 10 per cent of cat. (per cent)	99.2	88.0	43.3	54.5	14.7	32.8
Within 25 per cent of cat. (per cent)	99.7	99.1	86.3	91.9	47.1	72.2
	Linewidth (w₅₀)
Best-fitting slope	0.98	0.99	0.94	0.95	0.86	0.87
Best-fitting intercept (km s⁻¹)	0.64	1.47	12.9	6.39	60.2	21.7
Within 5 per cent of cat. (per cent)	99.4	86.6	46.4	50.3	25.1	29.0
Within 10 per cent of cat. (per cent)	99.4	94.0	69.4	72.2	42.4	47.8
Within 25 per cent of cat. (per cent)	99.6	97.7	86.0	89.3	63.2	72.1
	Linewidth (w₂₀)
Best-fitting slope	0.96	0.96	0.90	0.87	0.89	0.80
Best-fitting intercept (km s⁻¹)	8.7	5.6	62.2	39.4	94.9	83.7
Within 5 per cent of cat. (per cent)	94.0	86.0	34.6	47.7	20.7	26.5
Within 10 per cent of cat. (per cent)	94.6	93.2	53.0	68.1	36.8	45.4
Within 25 per cent of cat. (per cent)	96.7	97.9	72.2	82.0	56.7	65.0

	Original		S/N = 5		S/N = 3
	Direct	BF fit	Direct	BF fit	Direct	BF fit
	Integrated flux (F_int)
Best-fitting slope	1.02	1.03	0.96	1.05	0.92	1.07
Best-fitting intercept (Jy km s⁻¹)	−1.08	−1.35	2.01	−1.02	4.49	0.14
Within 5 per cent of cat. (per cent)	89.8	83.3	35.8	37.1	22.3	23.2
Within 10 per cent of cat. (per cent)	96.9	95.5	65.6	66.9	42.5	43.2
Within 25 per cent of cat. (per cent)	98.9	99.1	93.2	91.4	77.4	76.4
	Peak flux density (F_peak)
Best-fitting slope	1.00	0.99	1.14	1.04	1.27	1.09
Best-fitting intercept (Jy)	0.00	0.00	−0.01	−0.01	0.01	0.00
Within 5 per cent of cat. (per cent)	99.0	68.6	21.9	28.3	5.4	18.1
Within 10 per cent of cat. (per cent)	99.2	88.0	43.3	54.5	14.7	32.8
Within 25 per cent of cat. (per cent)	99.7	99.1	86.3	91.9	47.1	72.2
	Linewidth (w₅₀)
Best-fitting slope	0.98	0.99	0.94	0.95	0.86	0.87
Best-fitting intercept (km s⁻¹)	0.64	1.47	12.9	6.39	60.2	21.7
Within 5 per cent of cat. (per cent)	99.4	86.6	46.4	50.3	25.1	29.0
Within 10 per cent of cat. (per cent)	99.4	94.0	69.4	72.2	42.4	47.8
Within 25 per cent of cat. (per cent)	99.6	97.7	86.0	89.3	63.2	72.1
	Linewidth (w₂₀)
Best-fitting slope	0.96	0.96	0.90	0.87	0.89	0.80
Best-fitting intercept (km s⁻¹)	8.7	5.6	62.2	39.4	94.9	83.7
Within 5 per cent of cat. (per cent)	94.0	86.0	34.6	47.7	20.7	26.5
Within 10 per cent of cat. (per cent)	94.6	93.2	53.0	68.1	36.8	45.4
Within 25 per cent of cat. (per cent)	96.7	97.9	72.2	82.0	56.7	65.0

http://code.google.com/p/busy-function-fitting/

Fig. 6 presents a comparison of our parametrization of the HIPASS BGC spectra (without additional noise) with the original measurement of each parameter in the BGC. We accurately recover the fluxes and linewidths of the galaxies from the fitted busy function, with differences of usually well under 10 per cent relative to the original, catalogued parameters. The results illustrate the great accuracy with which galaxy parameters can be derived from busy function fits. The few outliers seen in Fig. 6 are mostly due to artefacts in the data, including interference and variations in the spectral baseline. Such effects can have a strong impact on the accuracy of the parametrization and will potentially affect any parametrization method. Some of the outliers in Fig. 6 in particular could be due to the additional baseline subtraction carried out for the BGC, but not for the busy function fitting, potentially resulting in discrepancies for individual galaxies affected by baseline artefacts.

In Table 1, the calculated and catalogued HIPASS BGC observational properties are compared in two ways. First, a linear regression is carried out to test if a 1:1 correlation exists. The result of the linear regression is shown as the dashed, red curve in Fig. 6 (note that for a non-vanishing intercept the linear fit appears curved in logarithmic space). Secondly, we measure the fraction of calculated values that lie within 5, 10 and 25 per cent of the catalogued values. In doing so, we implicitly ignore the uncertainties in the original BGC parameters, although these would contribute to the measured differences as well. The uncertainties published by Koribalski et al. (2004) are of the order of 10 per cent for F_peak, w₅₀ and w₂₀, and about 15 per cent for F_int.

Two main conclusions can be drawn from Table 1. First, as the signal-to-noise ratio decreases the busy function fits recover the catalogued properties more reliably than direct measurement of the properties does. This is most evident for the peak flux density and, to a lesser degree, for the velocity widths. The reason for the busy function to perform better is that, at low signal-to-noise ratio, the direct parametrization method may be strongly affected by noise, whereas the fit is much less affected by individual noise peaks. There is little advantage, however, when measuring the integrated flux. It should be noted that this is to be expected with Gaussian noise, provided that the chosen channel range contains all of the signal, because the emission is integrated over the entire source. Secondly, despite the differences in the catalogued and calculated properties, we find that the majority of properties derived from the busy function fits are within 5, 10 and 25 per cent of the catalogued properties at signal-to-noise ratios of ≳10, 5 and 3, respectively.

Our analysis demonstrates that the catalogued HIPASS BGC properties can be reliably recovered in a fully automated approach for the majority of sources across a range of signal-to-noise ratios. Potential applications include the parametrization of a large sample of galaxies as well as the construction of realistic H i profiles for simulations such as the S³–SAX simulation (Obreschkow et al. 2009b). Additionally, storing the fitted busy function parameters in addition to the full galaxy spectra will be of particular benefit to large future H i surveys such as WALLABY and Deep Investigation of Neutral Gas Origins (DINGO; Meyer 2009; Duffy et al. 2012), allowing them to include a simplified representation of every integrated H i spectrum in their source catalogue using a maximum of just eight parameter values.

Another great advantage of galaxy parametrization through the method of busy function fitting is the possibility to determine the statistical uncertainties of derived observational parameters from the uncertainties of the fit. This will enable a full error analysis based on the fitting results alone and without the need to modify the input data for that purpose. In Appendix D, we present the detailed description and analysis of two methods to determine uncertainties of observational parameters from the covariance matrix provided by least-squares-fitting algorithms.

CONCLUSIONS

We present a continuous and differentiable analytic function, called the busy function, B₀, and two modifications, B₁ and B₂, designed to describe the typical double-horn profile commonly observed in the integrated H i spectra of spiral galaxies. With a set of five to eight free parameters, the busy function accurately describes a wide range of spectral profiles of galaxies, including symmetric and asymmetric double-horn profiles, simple Gaussian profiles and flat-topped profiles with steep flanks.

The most promising application of the busy function, and the main focus of this paper, is the possibility to automatically fit the integrated H i spectra of a large sample of galaxies. This will allow common observational parameters of galaxies, such as the linewidth or the integrated flux, to be measured with great accuracy and in an automated fashion. In addition, a simple functional representation of each galaxy's integrated spectrum with a maximum of just eight parameters can be stored in a source catalogue. Another potential application of the busy function, although not investigated in this paper, is the generation of a sample of analytic mock profiles of galaxy spectra to be used as templates, either for the purpose of modelling or in matched-filtering algorithms of source-finding pipelines.

A great advantage of parametrizing galaxies by fitting a busy function to the integrated spectrum is the possibility to determine statistical uncertainties of the derived observational parameters. In Appendix D, we present two methods that allow the uncertainties of observational parameters to be determined without the need to modify the input data. This will enable a proper error analysis even in situations where classical Monte Carlo or bootstrap methods cannot be applied, e.g. when a single spectrum of low signal-to-noise ratio needs to be parametrized.

In order to test the suitability of the busy function for automated spectral-line fitting, we implemented an LMA in c/c++ (Appendix C) to fit busy functions to the 1000 galaxies of the HIPASS BGC. Our results demonstrate that it is possible to fit the busy function to a large number of galaxy spectra in a fully automated way without any human intervention. A comparison of several measured galaxy parameters (integrated flux, peak flux density, and w₂₀ and w₅₀ linewidths) with those listed in the HIPASS BGC reveals that we accurately recover the observational parameters of the galaxies from the fit.

For the original spectra, almost all of our measured parameters lie within 25 per cent of the catalogued ones. Even when reducing the peak signal-to-noise ratio of the spectra to 5 and 3, that fraction still remains at about 90 and 70 per cent, respectively. In addition, our measurement based on the busy function fit is often more accurate than the direct parameter measurement carried out on the spectrum. This result illustrates another great strength of the busy function: parametrization based on fitting a busy function to the spectrum is less strongly affected by the noise in individual channels and thus produces more accurate results than any direct measurement. As a consequence, the number of galaxies in an observational sample that can be successfully parametrized would potentially increase compared to conventional parametrization methods, thereby improving the accuracy of scientific studies such as the measurement of the Tully–Fisher relation.

While originally designed to describe the integrated H i emission spectra of galaxies, the busy function's versatility will allow it to be used in many other areas, for instance in the fitting of H i absorption spectra (see Allison et al. 2013 for an actual example), stacked H i spectra of galaxies (e.g. Fabello et al. 2011; Delhaize et al. 2013) and integrated CO spectra of galaxies (e.g. Saintonge et al. 2011; Young et al. 2011; Tacconi et al. 2013).

We thank the members of the H i source-finding collaboration for stimulating discussions that led to the development of the busy function. The Australia Telescope is funded by the Commonwealth of Australia for operation as a National Facility managed by CSIRO. This work made use of THINGS, ‘The H i Nearby Galaxy Survey’ (Walter et al. 2008).

1

2

http://code.google.com/p/busy-function-fitting/

REFERENCES

Akaike

H.

. ,

IEEE Trans. Autom. Control

,

1974

, vol.

19

pg.

716

Allison

J. R.

,

Curran

S. J.

,

Sadler

E. M.

,

Reeves

S. N.

. ,

MNRAS

,

2013

, vol.

430

pg.

157

Baldwin

J. E.

,

Lynden-Bell

D.

,

Sancisi

R.

. ,

MNRAS

,

1980

, vol.

193

pg.

313

Box

G. E. P.

,

Muller

M. E.

. ,

Ann. Math. Stat.

,

1958

, vol.

29

pg.

610

Casertano

S. P. R.

,

Shostak

G. S.

. ,

A&A

,

1980

, vol.

81

pg.

371

Deboer

D. R.

, et al. ,

Proc. IEEE

,

2009

, vol.

97

pg.

1507

Delhaize

J.

,

Meyer

M.

,

Staveley-Smith

L.

,

Boyle

B.

. ,

MNRAS

,

2013

, vol.

433

pg.

1398

Dewdney

P. E.

,

Hall

P. J.

,

Schilizzi

R. T.

,

Lazio

T. J. L. W.

. ,

Proc. IEEE

,

2009

, vol.

97

pg.

1482

Donley

J. L.

, et al. ,

AJ

,

2005

, vol.

129

pg.

220

Duffy

A. R.

,

Meyer

M. J.

,

Staveley-Smith

L.

,

Bernyk

M.

,

Croton

D. J.

,

Koribalski

B. S.

,

Gerstmann

D.

,

Westerlund

S.

. ,

MNRAS

,

2012

, vol.

426

pg.

3385

Fabello

S.

,

Catinella

B.

,

Giovanelli

R.

,

Kauffmann

G.

,

Haynes

M. P.

,

Heckman

T. M.

,

Schiminovich

D.

. ,

MNRAS

,

2011

, vol.

411

pg.

993

Giovanelli

R.

, et al. ,

AJ

,

2005

, vol.

130

pg.

2598

Heald

G.

, et al. ,

A&A

,

2011

, vol.

526

pg.

A118

Hong

T.

, et al. ,

MNRAS

,

2013

, vol.

432

pg.

1178

Koribalski

B. S.

, et al. ,

AJ

,

2004

, vol.

128

pg.

16

Levenberg

K.

. ,

Q. Appl. Math.

,

1944

, vol.

2

pg.

164

Marquardt

D.

. ,

SIAM J. Appl. Math.

,

1963

, vol.

11

pg.

431

Meyer

M.

.

Heald

G.

,

Serra

P.

. ,

Proc. Panoramic Radio Astronomy: Wide-field 1–2 GHz Research on Galaxy Evolution. PoS(PRA2009)015

,

2009

Google Scholar

Google Preview

OpenURL Placeholder Text

WorldCat

Obreschkow

D.

,

Croton

D.

,

De Lucia

G.

,

Khochfar

S.

,

Rawlings

S.

. ,

ApJ

,

2009a

, vol.

698

pg.

1467

Obreschkow

D.

,

Klöckner

H.-R.

,

Heywood

I.

,

Levrier

F.

,

Rawlings

S.

. ,

ApJ

,

2009b

, vol.

703

pg.

1890

Oosterloo

T.

,

Verheijen

M. A. W.

,

van Cappellen

W.

,

Bakker

L.

,

Heald

G.

,

Ivashina

M.

.

Torchinsky

S. A.

,

van Ardenne

A.

,

van den Brink-Havinga

T.

,

van Es

A. J. J.

,

Faulkner

A. J.

. ,

Proc. Wide Field Astronomy & Technology for the Square Kilometre Array, PoS(SKADS 2009)070

,

2009

Google Scholar

Google Preview

OpenURL Placeholder Text

WorldCat

Paturel

G.

,

Petit

C.

,

Prugniel

P.

,

Theureau

G.

,

Rousseau

J.

,

Brouty

M.

,

Dubois

P.

,

Cambrésy

L.

. ,

A&A

,

2003

, vol.

412

pg.

45

Roberts

M. S.

. ,

AJ

,

1962

, vol.

67

pg.

437

Saintonge

A.

. ,

AJ

,

2007

, vol.

133

pg.

2087

Saintonge

A.

, et al. ,

MNRAS

,

2011

, vol.

415

pg.

32

Sancisi

R.

. ,

A&A

,

1976

, vol.

53

pg.

159

Swaters

R. A.

,

Sancisi

R.

,

van der Hulst

J. M.

. ,

ApJ

,

1997

, vol.

491

pg.

140

Tacconi

L. J.

, et al. ,

ApJ

,

2013

, vol.

768

pg.

74

Tully

R. B.

,

Fisher

J. R.

. ,

A&A

,

1977

, vol.

54

pg.

661

Walter

F.

,

Brinks

E.

,

de Blok

W. J. G.

,

Bigiel

F.

,

Kennicutt

R. C.

Jr

,

Thornley

M. D.

,

Leroy

A.

. ,

AJ

,

2008

, vol.

136

pg.

2563

Westmeier

T.

,

Braun

R.

,

Koribalski

B. S.

. ,

MNRAS

,

2011

, vol.

410

pg.

2217

Westmeier

T.

,

Koribalski

B. S.

,

Braun

R.

. ,

MNRAS

,

2013

, vol.

434

pg.

3511

Young

L. M.

, et al. ,

MNRAS

,

2011

, vol.

414

pg.

940

Zwaan

M. A.

,

Briggs

F. H.

,

Sprayberry

D.

,

Sorar

E.

. ,

ApJ

,

1997

, vol.

490

pg.

173

APPENDIX A: ANALYTIC SOLUTIONS OF THE BUSY FUNCTION

In this section, we describe some of the analytic solutions for the evaluation of the three busy functions, B_n, with n ∈ {0, 1, 2}, as defined in equations (3)–(5). For simplicity, we will only consider symmetric functions here (i.e. b₁ = b₂ and x_e = x_p for B₁).

It is straightforward to calculate the derivatives of the busy function with respect to x or any of the free parameters, and we will refrain from presenting the rather lengthy analytic expressions here. By setting dB_n/dx = 0, we would in principle be able to calculate the positions of the extrema of B_n, but we have been unable to find a solution to this equation for any of the variants of the busy function, and there may not be an analytic solution other than the trivial solution of x = x₀ (i.e. ξ = 0). Hence, numerical methods will need to be used to determine the positions of the peaks of B_n as well as the resulting galaxy parameters, such as peak flux density or profile width.

For all three versions of the busy function, we can calculate the value at the position of the trivial extremum, ξ = 0, which corresponds to the centre of the profile. The value at the centre of the original busy function, B₀, and the generalized busy function, B₁, is given by

\begin{equation} B_{0}(0) = B_{1}(0) = \frac{a}{4} \, (\mathrm{erf}[bw] + 1)^{2}, \end{equation}

(A1)

which will converge to a for bw ≫ 1 (i.e. broad profiles with steep flanks). Note that this only applies to the symmetric version of B₁ where b₁ = b₂ and x_e = x_p. The value at the centre of the simplified busy function, B₂, is given by

\begin{equation} B_{2}(0) = \frac{a}{2} \left( \mathrm{erf} \! \left[ bw^{2} \right] + 1 \right) \!, \end{equation}

(A2)

which will again converge to a for bw² ≫ 1.

While we cannot produce general solutions for the profile width, there is a simple solution for cases where the polynomial component disappears, i.e. c = 0, and the flanks of the spectrum are sufficiently steep, i.e. large values of bw (for B₀ and B₁) or bw² (for B₂). In this case of ‘boxy’ spectra, the separation between the two error functions of 2w is equal to the full width at half-maximum (or w₅₀) of the profile. In other cases of double-horn profiles with steep flanks, although not exact, this solution may still provide a first-order estimate of w₅₀.

APPENDIX B: RELATION BETWEEN THE BUSY FUNCTION AND THE GAUSSIAN FUNCTION

The Gaussian function is commonly used for the purpose of fitting spectral lines, including the H i spectra of galaxies and gas clouds. In this appendix, we demonstrate that the original busy function, B₀, without the polynomial trough (i.e. c = 0) possesses the flexibility to closely approximate a Gaussian function and hence can describe the generally simple line profiles of dwarf galaxies and face-on galaxies just as well. In order to achieve this, let us first define a normalized Gaussian function of the form

\begin{equation} G(\xi ) = \frac{1}{\sqrt{2 \pi } \, \sigma } \exp \! \left( \! -\frac{\xi ^{2}}{2 \sigma ^2} \right) \end{equation}

(B1)

and a busy function similar to equation (3), but in a simplified form, assuming a width of w = 0 and no polynomial component (c = 0), hence

\begin{equation} B(\xi ) = a(\mathrm{erf}[b\xi ] + 1) (\mathrm{erf}[-b\xi ] + 1). \end{equation}

(B2)

For simplicity, we define ξ ≡ x − x₀, as before. Both functions can be expanded into a Taylor series at the origin, ξ = 0:

\begin{eqnarray} G(\xi ) & =& \displaystyle\frac{1}{\sqrt{2 \pi } \, \sigma } \xi ^{0} - \frac{1}{\sqrt{8 \pi } \, \sigma ^{3}} \xi ^2 + \frac{1}{\sqrt{128 \pi } \, \sigma ^{5}} \xi ^{4} + \cdots \end{eqnarray}

(B3)

\begin{eqnarray} B(\xi ) & =\displaystyle a\xi ^{0} - \frac{4 ab^{2}}{\pi } \xi ^{2} + \frac{8 ab^{4}}{3 \pi } \xi ^{4} + \cdots . \end{eqnarray}

(B4)

Apparently, the Taylor series of the busy function and the Gaussian function are very similar. Both only contain terms of even order, and a comparison of the respective coefficients in each order should thus allow us to derive approximate relations between the parameters of the busy function and the standard deviation, σ, of the Gaussian function.

From a comparison of the zeroth-order and second-order terms of the two Taylor series in equations (B3) and (B4), we obtain the following solutions for the parameters a and b of the busy function as defined in equation (B2):

\begin{eqnarray} a&=\displaystyle \frac{1}{\sqrt{2 \pi } \, \sigma }, \end{eqnarray}

(B5)

\begin{eqnarray} b&=\displaystyle \frac{\sqrt{\pi }}{\sqrt{8} \, \sigma } = \frac{\pi }{2} \, a. \end{eqnarray}

(B6)

The two parameters above describe a family of busy functions that closely resemble a Gaussian function of width σ, although they are approximations only.

An example is shown in Fig. B1, where the red, solid curve shows a Gaussian function, G(ξ), with a width of σ = 3, while the blue, dashed curve shows the busy function, B(ξ), with parameters |$a= 1 / \sqrt{18 \pi }$| and |$b= \sqrt{\pi / 72}$| according to equations (B5) and (B6). Both functions match almost perfectly, in particular, near the origin of ξ = 0. For larger values of |ξ|, the relative difference between B(ξ) and G(ξ) increases, but the absolute difference remains small across the entire domain of the two functions. The busy function's remarkable resemblance of a Gaussian function adds to its versatility when it comes to fitting the wide range of different H i profiles found in galaxies.

Figure B1.

Comparison of the busy function, B(ξ), with a Gaussian function, G(ξ), of dispersion σ = 3. The parameters of the two functions are linked according to equations (B5) and (B6).

APPENDIX C: IMPLEMENTATION OF A BUSY FUNCTION FITTING ALGORITHM

We implemented a busy-function-fitting program using c++ and the c libraries cfitsio and cpgplot. Our fitting program is based on a combination of the LMA (Levenberg 1944; Marquardt 1963) and the AIC (Akaike 1974). We use the LMA to carry out a χ² minimization for six variants of the busy function as listed in Table C1. We use those variants to fix various parameters of the busy function. Note that we use a slight reformulation of the busy function, because we found it to be faster to fit with the LMA. We use the AIC to penalize each busy function variant's χ² value for the number of free parameters and then choose the model with the best resultant χ².

Table C1.

The six busy function variants used in our c/c++/python implementation, their dimensionality/complexity and the associated χ² penalty imposed by the AIC.

No. of free	AIC χ²	Busy function variant
parameters	penalty
4	+ 8	\|$(\alpha / 4) \times (1 + \mathrm{erf}[\beta _{\phantom{0}} \lbrace x - \gamma _{1} \rbrace ]) \times (1 + \mathrm{erf}[\beta _{\phantom{0}} \lbrace \gamma _{2} - x \rbrace ])$\|
5	+10	(α/4) × (1 + erf[β₁{x − γ₁}]) × (1 + erf[β₂{γ₂ − x}])
5	+10	\|$(\alpha / 4) \times (1 + \mathrm{erf}[\beta _{\phantom{0}} \lbrace x - \gamma _{1} \rbrace ]) \times (1 + \mathrm{erf}[\beta _{\phantom{0}} \lbrace \gamma _{2} - x \rbrace ]) \times \left( 1 + \phi \, \| x - [0.5 \lbrace \gamma _{1} + \gamma _{2} \rbrace ] \|^{4} \right)$\|
6	+12	\|$(\alpha / 4) \times (1 + \mathrm{erf}[\beta _{\phantom{0}} \lbrace x - \gamma _{1} \rbrace ]) \times (1 + \mathrm{erf}[\beta _{\phantom{0}} \lbrace \gamma _{2} - x \rbrace ]) \times \left( 1 + \phi \, \| x - [0.5 \lbrace \gamma _{1} + \gamma _{2} \rbrace ] \|^{n} \right)$\|
7	+14	\|$(\alpha / 4) \times (1 + \mathrm{erf}[\beta _{\phantom{0}} \lbrace x - \gamma _{1} \rbrace ]) \times (1 + \mathrm{erf}[\beta _{\phantom{0}} \lbrace \gamma _{2} - x \rbrace ]) \times \left( 1 + \phi \, \| x - \theta \|^{n} \right)$\|
8	+16	(α/4) × (1 + erf[β₁{x − γ₁}]) × (1 + erf[β₂{γ₂ − x}]) × (1 + ϕ \|x − θ\|ⁿ)

No. of free	AIC χ²	Busy function variant
parameters	penalty
4	+ 8	\|$(\alpha / 4) \times (1 + \mathrm{erf}[\beta _{\phantom{0}} \lbrace x - \gamma _{1} \rbrace ]) \times (1 + \mathrm{erf}[\beta _{\phantom{0}} \lbrace \gamma _{2} - x \rbrace ])$\|
5	+10	(α/4) × (1 + erf[β₁{x − γ₁}]) × (1 + erf[β₂{γ₂ − x}])
5	+10	\|$(\alpha / 4) \times (1 + \mathrm{erf}[\beta _{\phantom{0}} \lbrace x - \gamma _{1} \rbrace ]) \times (1 + \mathrm{erf}[\beta _{\phantom{0}} \lbrace \gamma _{2} - x \rbrace ]) \times \left( 1 + \phi \, \| x - [0.5 \lbrace \gamma _{1} + \gamma _{2} \rbrace ] \|^{4} \right)$\|
6	+12	\|$(\alpha / 4) \times (1 + \mathrm{erf}[\beta _{\phantom{0}} \lbrace x - \gamma _{1} \rbrace ]) \times (1 + \mathrm{erf}[\beta _{\phantom{0}} \lbrace \gamma _{2} - x \rbrace ]) \times \left( 1 + \phi \, \| x - [0.5 \lbrace \gamma _{1} + \gamma _{2} \rbrace ] \|^{n} \right)$\|
7	+14	\|$(\alpha / 4) \times (1 + \mathrm{erf}[\beta _{\phantom{0}} \lbrace x - \gamma _{1} \rbrace ]) \times (1 + \mathrm{erf}[\beta _{\phantom{0}} \lbrace \gamma _{2} - x \rbrace ]) \times \left( 1 + \phi \, \| x - \theta \|^{n} \right)$\|
8	+16	(α/4) × (1 + erf[β₁{x − γ₁}]) × (1 + erf[β₂{γ₂ − x}]) × (1 + ϕ \|x − θ\|ⁿ)

Table C1.

The six busy function variants used in our c/c++/python implementation, their dimensionality/complexity and the associated χ² penalty imposed by the AIC.

No. of free	AIC χ²	Busy function variant
parameters	penalty
4	+ 8	\|$(\alpha / 4) \times (1 + \mathrm{erf}[\beta _{\phantom{0}} \lbrace x - \gamma _{1} \rbrace ]) \times (1 + \mathrm{erf}[\beta _{\phantom{0}} \lbrace \gamma _{2} - x \rbrace ])$\|
5	+10	(α/4) × (1 + erf[β₁{x − γ₁}]) × (1 + erf[β₂{γ₂ − x}])
5	+10	\|$(\alpha / 4) \times (1 + \mathrm{erf}[\beta _{\phantom{0}} \lbrace x - \gamma _{1} \rbrace ]) \times (1 + \mathrm{erf}[\beta _{\phantom{0}} \lbrace \gamma _{2} - x \rbrace ]) \times \left( 1 + \phi \, \| x - [0.5 \lbrace \gamma _{1} + \gamma _{2} \rbrace ] \|^{4} \right)$\|
6	+12	\|$(\alpha / 4) \times (1 + \mathrm{erf}[\beta _{\phantom{0}} \lbrace x - \gamma _{1} \rbrace ]) \times (1 + \mathrm{erf}[\beta _{\phantom{0}} \lbrace \gamma _{2} - x \rbrace ]) \times \left( 1 + \phi \, \| x - [0.5 \lbrace \gamma _{1} + \gamma _{2} \rbrace ] \|^{n} \right)$\|
7	+14	\|$(\alpha / 4) \times (1 + \mathrm{erf}[\beta _{\phantom{0}} \lbrace x - \gamma _{1} \rbrace ]) \times (1 + \mathrm{erf}[\beta _{\phantom{0}} \lbrace \gamma _{2} - x \rbrace ]) \times \left( 1 + \phi \, \| x - \theta \|^{n} \right)$\|
8	+16	(α/4) × (1 + erf[β₁{x − γ₁}]) × (1 + erf[β₂{γ₂ − x}]) × (1 + ϕ \|x − θ\|ⁿ)

No. of free	AIC χ²	Busy function variant
parameters	penalty
4	+ 8	\|$(\alpha / 4) \times (1 + \mathrm{erf}[\beta _{\phantom{0}} \lbrace x - \gamma _{1} \rbrace ]) \times (1 + \mathrm{erf}[\beta _{\phantom{0}} \lbrace \gamma _{2} - x \rbrace ])$\|
5	+10	(α/4) × (1 + erf[β₁{x − γ₁}]) × (1 + erf[β₂{γ₂ − x}])
5	+10	\|$(\alpha / 4) \times (1 + \mathrm{erf}[\beta _{\phantom{0}} \lbrace x - \gamma _{1} \rbrace ]) \times (1 + \mathrm{erf}[\beta _{\phantom{0}} \lbrace \gamma _{2} - x \rbrace ]) \times \left( 1 + \phi \, \| x - [0.5 \lbrace \gamma _{1} + \gamma _{2} \rbrace ] \|^{4} \right)$\|
6	+12	\|$(\alpha / 4) \times (1 + \mathrm{erf}[\beta _{\phantom{0}} \lbrace x - \gamma _{1} \rbrace ]) \times (1 + \mathrm{erf}[\beta _{\phantom{0}} \lbrace \gamma _{2} - x \rbrace ]) \times \left( 1 + \phi \, \| x - [0.5 \lbrace \gamma _{1} + \gamma _{2} \rbrace ] \|^{n} \right)$\|
7	+14	\|$(\alpha / 4) \times (1 + \mathrm{erf}[\beta _{\phantom{0}} \lbrace x - \gamma _{1} \rbrace ]) \times (1 + \mathrm{erf}[\beta _{\phantom{0}} \lbrace \gamma _{2} - x \rbrace ]) \times \left( 1 + \phi \, \| x - \theta \|^{n} \right)$\|
8	+16	(α/4) × (1 + erf[β₁{x − γ₁}]) × (1 + erf[β₂{γ₂ − x}]) × (1 + ϕ \|x − θ\|ⁿ)

The LMA was used to carry out the χ² minimization for three reasons. First, the LMA produces a χ² covariance matrix that can be used to calculate both the uncertainties and correlations of the model parameters. Secondly, the LMA does not rely upon a grid of model parameter values, and the model parameters therefore do not suffer discretization (such as in brute-force χ² minimization). Finally, the LMA uses the χ² covariance matrix to ensure that it simultaneously adjusts the model parameter values in a way that achieves the maximum decrement in χ². As model dimensionality increases, this aspect of the LMA becomes increasingly powerful.

We used variable remapping, singular value decomposition and a modified power law to fit the busy function with the LMA. Variable remapping is necessary because the LMA cannot impose any range limits on model parameters. We used the variable remappings in Table C2 to limit the model parameters to sensible ranges and to avoid unphysical parameter values. This includes positive definite values for scaling/normalization parameters and a user-specified range for the error function and power-law origins. We used singular value decomposition to ensure that degenerate model parameter values and extreme noise do not cause the LMA to fail.

Table C2.

The variable mappings used to impose parameter range constraints during fitting. For a given user-specified range, the values mid and amp are the mid-point and distance from either end of the range to the mid-point, respectively.

Variable(s)	Mapping	Range	Reason
α	exp (α)	>0	Meaningful normalization; avoids normalization trade offs.
β, β₁, β₂	exp (β)	>0	Prevents inversion of error function origins (γ, γ₁, γ₂).
γ, γ₁, γ₂	mid + amp sin (γ)	User specified	Avoids irrelevant solutions.
ϕ	exp (ϕ)	>0	Meaningful normalization; avoids normalization trade offs.
θ	mid + amp sin (θ)	User specified	Avoids irrelevant solutions.
n	5 + 3 sin (n)	2 ≤ n ≤ 8	Avoids unphysical solutions.

Variable(s)	Mapping	Range	Reason
α	exp (α)	>0	Meaningful normalization; avoids normalization trade offs.
β, β₁, β₂	exp (β)	>0	Prevents inversion of error function origins (γ, γ₁, γ₂).
γ, γ₁, γ₂	mid + amp sin (γ)	User specified	Avoids irrelevant solutions.
ϕ	exp (ϕ)	>0	Meaningful normalization; avoids normalization trade offs.
θ	mid + amp sin (θ)	User specified	Avoids irrelevant solutions.
n	5 + 3 sin (n)	2 ≤ n ≤ 8	Avoids unphysical solutions.

Table C2.

The variable mappings used to impose parameter range constraints during fitting. For a given user-specified range, the values mid and amp are the mid-point and distance from either end of the range to the mid-point, respectively.

Variable(s)	Mapping	Range	Reason
α	exp (α)	>0	Meaningful normalization; avoids normalization trade offs.
β, β₁, β₂	exp (β)	>0	Prevents inversion of error function origins (γ, γ₁, γ₂).
γ, γ₁, γ₂	mid + amp sin (γ)	User specified	Avoids irrelevant solutions.
ϕ	exp (ϕ)	>0	Meaningful normalization; avoids normalization trade offs.
θ	mid + amp sin (θ)	User specified	Avoids irrelevant solutions.
n	5 + 3 sin (n)	2 ≤ n ≤ 8	Avoids unphysical solutions.

Variable(s)	Mapping	Range	Reason
α	exp (α)	>0	Meaningful normalization; avoids normalization trade offs.
β, β₁, β₂	exp (β)	>0	Prevents inversion of error function origins (γ, γ₁, γ₂).
γ, γ₁, γ₂	mid + amp sin (γ)	User specified	Avoids irrelevant solutions.
ϕ	exp (ϕ)	>0	Meaningful normalization; avoids normalization trade offs.
θ	mid + amp sin (θ)	User specified	Avoids irrelevant solutions.
n	5 + 3 sin (n)	2 ≤ n ≤ 8	Avoids unphysical solutions.

Using the LMA does not solve the problem common to all model-fitting methods based on χ² minimization: there is no guarantee that the LMA will find the global χ² minimum when starting from an arbitrary position in parameter space. We solve this problem by using the LMA to find the nearest χ² minimum for 1000 randomly chosen starting positions. These LMA starting positions are created in two steps. First, α is fixed at 10⁻³ and every other parameter is randomly chosen (with replacement) from the values in Table C3. The corresponding curve is then used to calculate an α and adjusted ϕ that ensure that the model peak is equal to a randomly chosen (with replacement) multiple of the peak data value (also listed in Table C3). We then assume that the model-fitting values of the smallest of these resultant χ² values is a good proxy for the model-fitting values of the global χ² minimum. An additional benefit of this approach is that we do not have to use too many iterations for each LMA process. The number of LMA iterations and starting positions can be traded against each other. This is because as the number of LMA starting positions is increased, the probability of choosing an LMA starting position close to the global χ² minimum increases. For this application, we are using 30 iterations for each LMA process compared to the O(1000) iterations typically used.

Table C3.

The parameter list used to generate LMA starting positions, using random selection with replacement.

Property	LMA seed values
Model peak	1/8, 1/4, 3/8, 1/2, 5/8, 3/4, 7/8, 1 × data maximum
β⁽¹⁾, \|$\beta _{1}^{(1)}$\|⁠, \|$\beta _{2}^{(1)}$\|	0.1, 0.2, 0.4, 0.8, 1.6, 3.2, 6.4, 10, g/1.25, g/2.5, g/5, g/10, g/15, g/20
\|$\gamma _1^{(2)}$\|	min, +r/8, +r/4, +3r/8, +r/2, +5r/8, +3r/4, +7r/8, max
\|$\gamma _2^{(2,3)}$\|	min, +r/8, +r/4, +3r/8, +r/2, +5r/8, +3r/4, +7r/8, max provided γ₂ ≥ γ₁
ϕ for α = 10⁻³	10⁻⁹, 10⁻⁸, 10⁻⁷, 10⁻⁶, 10⁻⁵, 10⁻⁴, 10⁻³, 0.01, 0.1, 1, 10, 100, 10³, 10⁴
θ	γ₁, (γ₂ − γ₁)/6, (γ₂ − γ₁)/3, (γ₂ − γ₁)/2, 2(γ₂ − γ₁)/3, 5(γ₂ − γ₁)/6, γ₂
n	2, 4, 6

Property	LMA seed values
Model peak	1/8, 1/4, 3/8, 1/2, 5/8, 3/4, 7/8, 1 × data maximum
β⁽¹⁾, \|$\beta _{1}^{(1)}$\|⁠, \|$\beta _{2}^{(1)}$\|	0.1, 0.2, 0.4, 0.8, 1.6, 3.2, 6.4, 10, g/1.25, g/2.5, g/5, g/10, g/15, g/20
\|$\gamma _1^{(2)}$\|	min, +r/8, +r/4, +3r/8, +r/2, +5r/8, +3r/4, +7r/8, max
\|$\gamma _2^{(2,3)}$\|	min, +r/8, +r/4, +3r/8, +r/2, +5r/8, +3r/4, +7r/8, max provided γ₂ ≥ γ₁
ϕ for α = 10⁻³	10⁻⁹, 10⁻⁸, 10⁻⁷, 10⁻⁶, 10⁻⁵, 10⁻⁴, 10⁻³, 0.01, 0.1, 1, 10, 100, 10³, 10⁴
θ	γ₁, (γ₂ − γ₁)/6, (γ₂ − γ₁)/3, (γ₂ − γ₁)/2, 2(γ₂ − γ₁)/3, 5(γ₂ − γ₁)/6, γ₂
n	2, 4, 6

Notes. (1) The constant g = 0.747 806 is used to calculate β values corresponding to Gaussian roll-offs of the form, β = g/σ. (2) min and max are the user-specified roll-off range; +r denotes min plus the additional quantity, with r ≡ max − min. (3) γ₁ must be chosen first so that it can be used to limit the number of possible γ₂ values.

Table C3.

The parameter list used to generate LMA starting positions, using random selection with replacement.

Property	LMA seed values
Model peak	1/8, 1/4, 3/8, 1/2, 5/8, 3/4, 7/8, 1 × data maximum
β⁽¹⁾, \|$\beta _{1}^{(1)}$\|⁠, \|$\beta _{2}^{(1)}$\|	0.1, 0.2, 0.4, 0.8, 1.6, 3.2, 6.4, 10, g/1.25, g/2.5, g/5, g/10, g/15, g/20
\|$\gamma _1^{(2)}$\|	min, +r/8, +r/4, +3r/8, +r/2, +5r/8, +3r/4, +7r/8, max
\|$\gamma _2^{(2,3)}$\|	min, +r/8, +r/4, +3r/8, +r/2, +5r/8, +3r/4, +7r/8, max provided γ₂ ≥ γ₁
ϕ for α = 10⁻³	10⁻⁹, 10⁻⁸, 10⁻⁷, 10⁻⁶, 10⁻⁵, 10⁻⁴, 10⁻³, 0.01, 0.1, 1, 10, 100, 10³, 10⁴
θ	γ₁, (γ₂ − γ₁)/6, (γ₂ − γ₁)/3, (γ₂ − γ₁)/2, 2(γ₂ − γ₁)/3, 5(γ₂ − γ₁)/6, γ₂
n	2, 4, 6

Property	LMA seed values
Model peak	1/8, 1/4, 3/8, 1/2, 5/8, 3/4, 7/8, 1 × data maximum
β⁽¹⁾, \|$\beta _{1}^{(1)}$\|⁠, \|$\beta _{2}^{(1)}$\|	0.1, 0.2, 0.4, 0.8, 1.6, 3.2, 6.4, 10, g/1.25, g/2.5, g/5, g/10, g/15, g/20
\|$\gamma _1^{(2)}$\|	min, +r/8, +r/4, +3r/8, +r/2, +5r/8, +3r/4, +7r/8, max
\|$\gamma _2^{(2,3)}$\|	min, +r/8, +r/4, +3r/8, +r/2, +5r/8, +3r/4, +7r/8, max provided γ₂ ≥ γ₁
ϕ for α = 10⁻³	10⁻⁹, 10⁻⁸, 10⁻⁷, 10⁻⁶, 10⁻⁵, 10⁻⁴, 10⁻³, 0.01, 0.1, 1, 10, 100, 10³, 10⁴
θ	γ₁, (γ₂ − γ₁)/6, (γ₂ − γ₁)/3, (γ₂ − γ₁)/2, 2(γ₂ − γ₁)/3, 5(γ₂ − γ₁)/6, γ₂
n	2, 4, 6

Notes. (1) The constant g = 0.747 806 is used to calculate β values corresponding to Gaussian roll-offs of the form, β = g/σ. (2) min and max are the user-specified roll-off range; +r denotes min plus the additional quantity, with r ≡ max − min. (3) γ₁ must be chosen first so that it can be used to limit the number of possible γ₂ values.

To avoid local χ² minima, we have also added an additional criterion to the typical definition of a χ² minimum. A χ² minimum is typically defined as a negligible decrease in χ² for consecutive iterations. In addition to this, we require that the χ² value must not have increased for five consecutive iterations. This avoids χ² minima that are ‘noise troughs’ in unstable regions of parameter space.

It should be noted that there is also a powerful, inherent advantage to model fitting via χ² minimization. Model fitting with χ² minimization takes into account the uncertainty of each individual data point. Our implementation includes this capability, although it was not required for fitting the HIPASS BGC spectra (which we assume to have a constant noise level of 13 mJy). We expect that this feature will be useful for data sets with channel/frequency-dependent noise. Alternatively, this capability can also be exploited by assigning large uncertainties to channels/frequencies affected by radio frequency interference. This will down-weight the significance of such channels/frequencies when parametrizing affected galaxies.

Our implementation of the busy-function-fitting algorithm can be obtained from the project's website² or by sending an e-mail to one of the authors, RJ (russell.jurek@gmail.com). The implementation is available as a c library, a c++ template library and a python module, all of which use openmp to take advantage of systems with multicore CPUs.

APPENDIX D: DETERMINATION OF UNCERTAINTIES

A crucial aspect in the parametrization of spectral profiles of galaxies is the determination of uncertainties. Several different methods have been used in the parametrization of H i lines in the past, including Monte Carlo methods (e.g. Donley et al. 2005) and bootstrap or jackknife methods (e.g. Hong et al. 2013). These methods generally attempt to emulate repeated measurements by either adding artificial noise to the data or by looking at different subsets of a data set. While providing the most accurate assessment of uncertainties (apart from actually repeating the measurement), Monte Carlo and bootstrap methods can be relatively slow due to the need to repeatedly alter the original data. In addition, not all data sets lend themselves to this type of procedure.

In this section, we introduce two different methods that rely entirely on the covariance matrix of the busy function fit and can therefore be applied in situations where Monte Carlo and bootstrap methods will fail, e.g. when only a single spectrum of low signal-to-noise ratio is available. In addition, the introduced methods are much faster than the former because they do not require the original data to be altered in any way.

The classical approach: Monte Carlo and bootstrap methods

As Monte Carlo and bootstrap methods operate on the input data themselves, they are independent of the actual parametrization method used and can therefore be applied in combination with busy function fitting as well. As a demonstration of a Monte Carlo approach, we created 10 000 realizations of the integrated spectrum of the galaxy NGC 3351 (Walter et al. 2008), as shown in Fig. 4, by adding random Gaussian noise with a standard deviation of 50 mJy to the original spectrum. Next, we fitted the generalized busy function, B₁, with the parameter n fixed to 4, to each of the 10 000 spectra, and extracted and analysed the resulting parameters. The results are summarized in Fig. D1 and Table D1. The resulting χ² of the fits follows the expected probability density function with a mean of 〈χ²〉 = 54.6 and |$\langle \chi _{\rm red}^{2} \rangle = 1.03$|⁠. The mean χ² is slightly larger than the number of degrees of freedom of 53 (60 spectral channels less 7 free parameters), but this small discrepancy can be readily explained by intrinsic structure and noise in the original spectrum of NGC 3351.

Figure D1.

$Distribution of busy function parameters and derived observational parameters after fitting the generalized busy function (with n = 4 fixed) to 10 000 realizations of the integrated spectrum of NGC 3351 with artificial noise added. The red curve shows the result of a Gaussian fit to the parameter distribution, with the exception of the $\chi _{\rm red}^{2}$ distribution where we fitted the χ2 probability density function instead of a Gaussian. The unit of flux density is Jy, while spectral parameters are specified in channels.$

Distribution of busy function parameters and derived observational parameters after fitting the generalized busy function (with n = 4 fixed) to 10 000 realizations of the integrated spectrum of NGC 3351 with artificial noise added. The red curve shows the result of a Gaussian fit to the parameter distribution, with the exception of the |$\chi _{\rm red}^{2}$| distribution where we fitted the χ² probability density function instead of a Gaussian. The unit of flux density is Jy, while spectral parameters are specified in channels.

Table D1.

Busy function parameters (upper section) and derived observational parameters (lower section) after fitting the generalized busy function (with n = 4 fixed) to 10 000 realizations of the integrated spectrum of NGC 3351 with artificial noise added. The first two columns list the parameter name and its value derived from fitting the original, high signal-to-noise spectrum. The uncertainties in the upper section were taken from the covariance matrix of the fit, while those in the lower section have been calculated with the new method introduced in Appendix D2. Columns 3 and 4 show the mean, |$\bar{P}$|⁠, and median, |$\tilde{P}$|⁠, of all 10 000 realizations, columns 5 and 6 list the centroid, P₀, and standard deviation, σ_P, of a Gaussian function fitted to the parameter distribution (see Fig. D1) and column 7 shows the skewness, γ₁, of the distribution. The unit of flux density is Jy, while spectral parameters are specified in channels.

Parameter	Original value	\|$\bar{P}$\|	\|$\tilde{P}$\|	P₀	σ_P	γ₁
a	0.121 ± 0.002	0.121	0.121	0.120	0.009	0.00
b₁	0.604 ± 0.036	0.705	0.617	0.585	0.164	7.63
b₂	0.572 ± 0.034	0.651	0.585	0.558	0.152	9.32
c × 10⁶	6.660 ± 0.235	6.860	6.760	6.711	1.151	0.63
x_e	30.183 ± 0.040	30.178	30.176	30.176	0.194	−0.14
x_p	30.206 ± 0.098	30.198	30.200	30.200	0.470	−0.02
w	25.387 ± 0.045	25.365	25.369	25.375	0.215	−0.39
Centroid	30.164 ± 0.122	30.164	30.168	30.161	0.614	0.00
w₅₀	51.612 ± 0.085	51.515	51.510	51.501	0.462	0.17
w₂₀	53.485 ± 0.130	53.432	53.399	53.377	0.736	0.32
F_peak	0.348 ± 0.005	0.364	0.363	0.363	0.023	0.25
F_int	9.618 ± 0.075	9.633	9.634	9.632	0.382	0.00

Parameter	Original value	\|$\bar{P}$\|	\|$\tilde{P}$\|	P₀	σ_P	γ₁
a	0.121 ± 0.002	0.121	0.121	0.120	0.009	0.00
b₁	0.604 ± 0.036	0.705	0.617	0.585	0.164	7.63
b₂	0.572 ± 0.034	0.651	0.585	0.558	0.152	9.32
c × 10⁶	6.660 ± 0.235	6.860	6.760	6.711	1.151	0.63
x_e	30.183 ± 0.040	30.178	30.176	30.176	0.194	−0.14
x_p	30.206 ± 0.098	30.198	30.200	30.200	0.470	−0.02
w	25.387 ± 0.045	25.365	25.369	25.375	0.215	−0.39
Centroid	30.164 ± 0.122	30.164	30.168	30.161	0.614	0.00
w₅₀	51.612 ± 0.085	51.515	51.510	51.501	0.462	0.17
w₂₀	53.485 ± 0.130	53.432	53.399	53.377	0.736	0.32
F_peak	0.348 ± 0.005	0.364	0.363	0.363	0.023	0.25
F_int	9.618 ± 0.075	9.633	9.634	9.632	0.382	0.00

Table D1.

Busy function parameters (upper section) and derived observational parameters (lower section) after fitting the generalized busy function (with n = 4 fixed) to 10 000 realizations of the integrated spectrum of NGC 3351 with artificial noise added. The first two columns list the parameter name and its value derived from fitting the original, high signal-to-noise spectrum. The uncertainties in the upper section were taken from the covariance matrix of the fit, while those in the lower section have been calculated with the new method introduced in Appendix D2. Columns 3 and 4 show the mean, |$\bar{P}$|⁠, and median, |$\tilde{P}$|⁠, of all 10 000 realizations, columns 5 and 6 list the centroid, P₀, and standard deviation, σ_P, of a Gaussian function fitted to the parameter distribution (see Fig. D1) and column 7 shows the skewness, γ₁, of the distribution. The unit of flux density is Jy, while spectral parameters are specified in channels.

Parameter	Original value	\|$\bar{P}$\|	\|$\tilde{P}$\|	P₀	σ_P	γ₁
a	0.121 ± 0.002	0.121	0.121	0.120	0.009	0.00
b₁	0.604 ± 0.036	0.705	0.617	0.585	0.164	7.63
b₂	0.572 ± 0.034	0.651	0.585	0.558	0.152	9.32
c × 10⁶	6.660 ± 0.235	6.860	6.760	6.711	1.151	0.63
x_e	30.183 ± 0.040	30.178	30.176	30.176	0.194	−0.14
x_p	30.206 ± 0.098	30.198	30.200	30.200	0.470	−0.02
w	25.387 ± 0.045	25.365	25.369	25.375	0.215	−0.39
Centroid	30.164 ± 0.122	30.164	30.168	30.161	0.614	0.00
w₅₀	51.612 ± 0.085	51.515	51.510	51.501	0.462	0.17
w₂₀	53.485 ± 0.130	53.432	53.399	53.377	0.736	0.32
F_peak	0.348 ± 0.005	0.364	0.363	0.363	0.023	0.25
F_int	9.618 ± 0.075	9.633	9.634	9.632	0.382	0.00

Parameter	Original value	\|$\bar{P}$\|	\|$\tilde{P}$\|	P₀	σ_P	γ₁
a	0.121 ± 0.002	0.121	0.121	0.120	0.009	0.00
b₁	0.604 ± 0.036	0.705	0.617	0.585	0.164	7.63
b₂	0.572 ± 0.034	0.651	0.585	0.558	0.152	9.32
c × 10⁶	6.660 ± 0.235	6.860	6.760	6.711	1.151	0.63
x_e	30.183 ± 0.040	30.178	30.176	30.176	0.194	−0.14
x_p	30.206 ± 0.098	30.198	30.200	30.200	0.470	−0.02
w	25.387 ± 0.045	25.365	25.369	25.375	0.215	−0.39
Centroid	30.164 ± 0.122	30.164	30.168	30.161	0.614	0.00
w₅₀	51.612 ± 0.085	51.515	51.510	51.501	0.462	0.17
w₂₀	53.485 ± 0.130	53.432	53.399	53.377	0.736	0.32
F_peak	0.348 ± 0.005	0.364	0.363	0.363	0.023	0.25
F_int	9.618 ± 0.075	9.633	9.634	9.632	0.382	0.00

As shown in Fig. D1, most parameters obey an approximately normal distribution, although some are significantly skewed, in particular the two profile slope parameters, b₁ and b₂. Such non-Gaussian distributions imply that, strictly speaking, the parameter uncertainties cannot be expressed in terms of a single number, such as the standard deviation usually reported by least-squares-fitting algorithms. However, as a first-order approximation, the parameters of the busy function can be assumed to follow a normal distribution. The same appears to be true for numerically derived observational parameters (line centroid, w₅₀ and w₂₀ linewidths, peak flux density and integrated flux) as presented in the lower sections of Fig. D1 and Table D1.

A different approach: variation of busy function parameters

While Monte Carlo and bootstrap methods provide a robust way of determining parameter uncertainties, it might not be possible to apply them in certain situations, e.g. when only a single spectrum with low signal-to-noise ratio is available. In such situations, one of the great advantages of the busy function over direct H i parametrization methods comes into play: under the assumption of a Gaussian statistic of the busy function parameters, we can determine the uncertainties of derived parameters, including linewidths and fluxes, from the covariance matrix provided by least-squares-fitting algorithms such as the LMA (Levenberg 1944; Marquardt 1963).

As concluded in Appendix A, the lack of analytical solutions for derived parameters implies that we need to determine the uncertainties of derived parameters numerically. The simplest approach would be to randomly vary the busy function parameters coming out of a fit and recalculate derived parameters such as w₅₀ or F_int. This can be repeated many times, and the uncertainties of the derived parameters can then be determined by either simply taking the standard deviation across all iterations or by fitting a Gaussian to the resulting parameter distribution. This approach implicitly assumes that all busy function parameters are normally distributed (see Appendix D1) and that the individual parameters are entirely uncorrelated.

Most parameters, however, have some degree of correlation (as shown in Fig. D2 for the integrated spectrum of NGC 3351), and it is necessary to take this effect into consideration when randomly varying the fit parameters. This can be achieved with the following method based on the parameter covariance matrix provided by the LMA.

Using the LMA, the busy function is fitted to the integrated spectrum of the galaxy to be parametrized, providing us with values for the free parameters of the busy function, |$\boldsymbol {p}$|⁠, as well as the full covariance matrix, |${\bf C}_{\boldsymbol {p}}$|⁠.
Figure D2.
Open in new tab Download slide
Correlation coefficients of generalized busy function parameters as derived from the covariance matrix of a fit to the integrated spectrum of NGC 3351.
Using the method of Box & Muller (1958), we can then generate M sets of independent, random busy function parameters, |${\boldsymbol p}_{\it m}$|⁠, each following a normal distribution centred on |$\boldsymbol{\boldsymbol {p}}$| with a standard deviation as derived from the square root of the diagonal elements of |${\bf C}_{\boldsymbol{p}}$|⁠.
Next, we need to transform the vectors of random, uncorrelated parameters, |$\boldsymbol {p}_{m}$|⁠, into vectors of random parameters with the correct correlations, |$\boldsymbol {p}_{m}^{\prime }$|⁠, as described by the non-diagonal elements of |${\bf C}_{\boldsymbol {p}}$|⁠. This can be achieved by performing a Cholesky decomposition of the parameter correlation matrix,
\begin{equation} {\bf K}_{\boldsymbol {p}} = {\bf L} {\bf L}^{\rm T}, \end{equation}
(D1)
where |$K_{\boldsymbol{ p}}^{ij} = C_{\boldsymbol{ p}}^{ij} / (\sigma _{i} \sigma _{j})$| and |$\sigma _{i}^{2} = C_{\boldsymbol{ p}}^{ii}$|⁠, and then multiplying each parameter vector with the lower triangular form of the resulting matrix,
\begin{equation} \boldsymbol {p}_{m}^{\prime } = {\bf L} \boldsymbol {p}_{m}. \end{equation}
(D2)
In this step, we need to take into account that each parameter in |$\boldsymbol {p}_{m}$| must be of zero mean and unit variance for the transform to work. Hence, each parameter may need to be scaled and translated before and after the transform.
Lastly, we can numerically derive the desired observational parameters (including centroid, w₅₀, w₂₀, F_peak and F_int) for each of the M correlated busy function parameter sets, |$\boldsymbol {p}_{m}^{\prime }$|⁠. We can then calculate the mean and standard deviation for each parameter across all M iterations.

We tested this parameter variation method on the integrated spectrum of NGC 3351 (Fig. 4), fitting the generalized busy function, B₁, with the parameter n = 4 fixed. The results, listed in the lower section of Table D1, are in good agreement with our expectations, suggesting that the method produces accurate uncertainties. For example, when numerically measuring the flux density at the position of the central trough of the fitted busy function, we derive a value of 0.1210 ± 0.001 839. Both the value and the uncertainty are identical (within the numerical accuracy) with those of the busy function parameter a as derived from the least-squares fit. A similar comparison can be made between the measured linewidth, w₅₀/2 = 25.81 ± 0.042 26, and the parameter w = 25.39 ± 0.045 01 of the busy function. Again, the values and uncertainties agree very well, even though we do not expect an exact identity of w₅₀/2 and w (see Appendix A). The results suggest that the error analysis method based on varying the initial busy function parameters produces accurate measurements of the uncertainty of derived parameters.

A faster approach: linear propagation of the covariance matrix

While the parameter variation method presented in Appendix D2 provides an accurate way of estimating uncertainties, it is relatively slow and inefficient due to the large number of iterations required to achieve sufficient numerical accuracy. However, under the assumption of a linear approximation of the function that translates between the free parameters of the busy function and the derived observational parameters of the spectral profile, we can instead use the error propagation law to determine not just the uncertainties of the derived parameters, but in fact the full covariance matrix.

Let us assume that |$\boldsymbol {p}$| is the parameter vector and |${\bf C}_{\boldsymbol{ p}}$| the covariance matrix of the busy function's free parameters, as before. Let us further assume that there is a differentiable function, |$\boldsymbol {f}$|⁠, that translates between the busy function parameters, |$\boldsymbol {p}$|⁠, and the derived observational parameters, |$\boldsymbol {q} = \boldsymbol {f}(\boldsymbol {p})$|⁠. We can then numerically approximate the Jacobian matrix of |$\boldsymbol {f}$| by varying each input parameter, p_i, by a small amount, ε_i, such that

\begin{equation} J_{ji} \equiv \frac{\mathrm{\partial} f_{j}}{\mathrm{\partial} p_{i}} \approx \frac{f_{j}(\boldsymbol {p} + \varepsilon _{i} \hat{\boldsymbol {e}}_{i}) - f_{j}(\boldsymbol {p})}{\varepsilon _{i}}, \end{equation}

(D3)

where |$\hat{\boldsymbol {e}}_{i}$| is the unit vector in the direction of the ith component of |$\boldsymbol {p}$|⁠. With the Jacobian matrix determined, we can now use the error propagation law to calculate the full covariance matrix, |${\bf C}_{\boldsymbol{ q}}$|⁠, of the derived observational parameters:

\begin{equation} {\bf C}_{\boldsymbol{ q}} = {\bf J} {\bf C}_{\boldsymbol{ p}} {\bf J}^{\rm T}. \end{equation}

(D4)

The uncertainties of the individual parameters, q_i, are then given by the square root of the diagonal elements of the covariance matrix. Note that, for this method to work, it is not necessary to know the analytic expression of |$\boldsymbol {f}(\boldsymbol {p})$|⁠, as the function can be evaluated numerically.

As before, we tested the method of propagating the covariance matrix on the integrated spectrum of NGC 3351 (Fig. 4), using relative offsets of ε_i = |p_i| × 10⁻⁵. The results, including a comparison with the method introduced in Appendix D2, are presented in Table D2. Both the method of parameter variation as well as the method of linear propagation of the covariance matrix yield comparable estimates of the uncertainties of the derived observational parameters of NGC 3351. The largest discrepancy is observed for the peak flux density, F_peak, for which the error propagation method yields an uncertainty that is by about 10 per cent higher than that of the parameter variation method. These small discrepancies are likely due to the linear approximation made in the error propagation method.

Table D2.

Comparison of the uncertainties of the observational parameters of the galaxy NGC 3351 as determined by the methods of parameter variation (Appendix D2) and error propagation (Appendix D3). The last column lists the relative difference between the two methods. Flux parameters are specified in Jy, spectral parameters in channels.

Parameter	Value	Uncert.	Uncert.	Difference
		par. var.	err. prop.	(per cent)
Centroid	30.164	0.1216	0.1213	−0.25
w₅₀	51.612	0.084 52	0.086 48	2.32
w₂₀	53.485	0.1303	0.1245	−4.45
F_peak	0.348	0.004 646	0.005 157	11.00
F_int	9.618	0.074 91	0.074 85	−0.08

Parameter	Value	Uncert.	Uncert.	Difference
		par. var.	err. prop.	(per cent)
Centroid	30.164	0.1216	0.1213	−0.25
w₅₀	51.612	0.084 52	0.086 48	2.32
w₂₀	53.485	0.1303	0.1245	−4.45
F_peak	0.348	0.004 646	0.005 157	11.00
F_int	9.618	0.074 91	0.074 85	−0.08

Table D2.

Comparison of the uncertainties of the observational parameters of the galaxy NGC 3351 as determined by the methods of parameter variation (Appendix D2) and error propagation (Appendix D3). The last column lists the relative difference between the two methods. Flux parameters are specified in Jy, spectral parameters in channels.

Parameter	Value	Uncert.	Uncert.	Difference
		par. var.	err. prop.	(per cent)
Centroid	30.164	0.1216	0.1213	−0.25
w₅₀	51.612	0.084 52	0.086 48	2.32
w₂₀	53.485	0.1303	0.1245	−4.45
F_peak	0.348	0.004 646	0.005 157	11.00
F_int	9.618	0.074 91	0.074 85	−0.08

Parameter	Value	Uncert.	Uncert.	Difference
		par. var.	err. prop.	(per cent)
Centroid	30.164	0.1216	0.1213	−0.25
w₅₀	51.612	0.084 52	0.086 48	2.32
w₂₀	53.485	0.1303	0.1245	−4.45
F_peak	0.348	0.004 646	0.005 157	11.00
F_int	9.618	0.074 91	0.074 85	−0.08

The error propagation method is generally much faster than the parameter variation method described in Appendix D2, as it does not require a large number of numerical iterations. Another advantage of the error propagation method is that it will produce a full parameter covariance matrix ‘for free’ (i.e. without the need for computationally expensive numerical iterations), thus providing information about correlations between observational parameters. This is illustrated in Fig. D3, where the correlation coefficients derived from the covariance matrix, |${\bf C}_{\boldsymbol{ q}}$|⁠, of the fit to the spectrum of NGC 3351 are presented.

Figure D3.