The following article is Open access

The Stellar Mass versus Stellar Metallicity Relation of Star-forming Galaxies at 1.6 ≤ z ≤ 3.0 and Implications for the Evolution of the α-enhancement

, , , , , , , , , , , , and

Published 2022 January 26 © 2022. The Author(s). Published by the American Astronomical Society.
, , Citation Daichi Kashino et al 2022 ApJ 925 82 DOI 10.3847/1538-4357/ac399e

Download Article PDF
DownloadArticle ePub

You need an eReader or compatible software to experience the benefits of the ePub3 file format.

0004-637X/925/1/82

Abstract

We measure the relationship between stellar mass and stellar metallicity for 1336 star-forming galaxies at 1.6 ≤ z ≤ 3.0 using rest-frame far-ultraviolet spectra from the zCOSMOS-deep survey. High signal-to-noise ratio composite spectra containing stellar absorption features are fit with stellar population synthesis model spectra of a range of stellar metallicity. We find stellar metallicities, which mostly reflect instantaneous iron abundances, scaling as $[\mathrm{Fe}/{\rm{H}}]=-(0.81\pm 0.01)+(0.32+0.03)\mathrm{log}({M}_{* }/{10}^{10}{M}_{\odot })$ across the stellar mass range of 109M*/M ≲ 1011. The instantaneous oxygen-to-iron ratio (α-enhancement) inferred using the gas-phase mass–metallicity relation is on average found to be $\left[{\rm{O}}/\mathrm{Fe}\right]\approx 0.47$, being higher than the local $\left[{\rm{O}}/\mathrm{Fe}\right]\approx 0$. The observed changes in [O/Fe] and [Fe/H] are reproduced in simple gas-regulator models with steady star formation histories. Our models show that the [O/Fe] is determined almost entirely by the instantaneous specific star formation rate alone while being independent of the mass and the characteristic of the gas regulation. We also find that the locations of ∼ 1010 M galaxies at z ∼ 2 in the [O/Fe]–metallicity planes are in remarkable agreement with the sequence of low-metallicity thick-disk stars in our own Galaxy. This manifests a beautiful concordance between the results of Galactic archeology and observations of high-redshift Milky Way progenitors. There remains, however, a question of how and when the old metal-rich, low α/Fe stars seen in the bulge had formed by z ∼ 2 because such a stellar population is not seen in our data and is difficult to explain in the context of our models.

Export citation and abstract BibTeX RIS

Original content from this work may be used under the terms of the Creative Commons Attribution 4.0 licence. Any further distribution of this work must maintain attribution to the author(s) and the title of the work, journal citation and DOI.

1. Introduction

The metallicity of galaxies can be measured for either stars or the interstellar medium (ISM) through analysis of galaxy spectra, either the stellar absorption lines in the integrated light of the stellar population(s) or the nebular emission lines from gaseous H ii regions (see Maiolino & Mannucci 2019 for a recent review). The gas-phase and stellar metallicities reflect different aspects of the evolutionary history of the galaxies.

The gas-phase metallicity refers usually to the abundance of oxygen relative to hydrogen, which is often measured using emission lines in the rest-frame optical wave band that are produced by the ionized gas in star-forming regions. The so-called "direct method" is based on the detection of faint auroral lines (e.g., [O iii] λ4363) and determines the electron temperature and metallicity with high accuracy (e.g., Andrews & Martini 2013; Ly et al. 2014; Kashino & Inoue 2019; Kojima et al. 2020; Sanders et al. 2020). The so-called "strong-line" methods, which use empirical relations between the metallicity and the ratios of strong optical emission lines (e.g., ([O ii]+[O iii])/Hβ), have been widely applied to estimate metallicities from spectra with low or moderate signal-to-noise ratio (S/N; e.g., Pettini & Pagel 2004; Nagao et al. 2006; Maiolino et al. 2008; Curti et al. 2017). The gas-phase metallicities reflect the "instantaneous" oxygen abundance in star-forming regions at the time of observation.

The stellar metallicity can be measured through absorption lines caused by metal ions, such as iron and magnesium, in the photospheres of stars. Measurement is generally carried out by comparing observed spectra with synthetic spectra from stellar population synthesis models. Some standardized indices, which represent the absorption depths for particular, relatively strong absorption features, or a combination of absorption features, have been conventionally used for estimating the average stellar metallicity (e.g., the 1978 Å index; Rix et al. 2004; see also Halliday et al. 2008; Onodera et al. 2015). More recently, full spectral fitting that uses all the information contained in the spectra has been employed (Onodera et al. 2015; Steidel et al. 2016; Zahid et al. 2017; Leethochawalit et al. 2018; Kriek et al. 2019; Harikane et al. 2020; Topping et al. 2020a, 2020b). In any case, these measurements require a high-S/N detection of the continuum emission and are thus generally more expensive than the gas-phase metallicity measurements using the strong-line methods.

An important point is that, in contrast to the gas-phase metallicity, the stellar metallicity is measured as the luminosity-weighted average value across all the different stellar populations that contribute to the integrated light of the galaxy at a particular wavelength. The inferred metallicity may therefore also depend on which portion of the spectrum is used. For example, the metallicity derived from the rest-frame optical light reflects the light from older (i.e., possibly lower-metallicity) populations, whereas that from the far-ultraviolet (FUV) spectrum is more weighted toward younger (i.e., possibly higher-metallicity) populations and should thus be closer to the abundance in the gas phase.

Another key aspect is that oxygen and iron, usually traced by gas-phase and stellar metallicities, respectively, form through different channels: oxygen or the α-elements are supplied mainly through core-collapse supernovae (CCSNe), while the Type Ia supernovae (SNe Ia) are the main supplier of the iron-peak elements. Therefore, the past star formation history (SFH) of the galaxies is imprinted in the abundance pattern between these species, often called α-enhancement, due to the time delay of SNe Ia (from 40 Myr to several Gyr) since the formation of their progenitor stars.

The overall relationship between galaxy stellar mass (M*) and metallicity, often called the mass–metallicity relation (MZR), has long been thought to be a fundamental measurement to constrain models of galaxy evolution (e.g., Lequeux et al. 1979). In the local universe, a tight correlation between these two quantities has been robustly established both for the gas-phase metallicity (Tremonti et al. 2004; Andrews & Martini 2013; Curti et al. 2020) and for the metallicity of the stellar component (Gallazzi et al. 2005; Zahid et al. 2017) using the Sloan Digital Sky Survey (SDSS; York et al. 2000). At high redshifts, the gas-phase MZR has been measured back to z ∼ 4 by many authors, mostly using strong-line methods (e.g., Erb et al. 2006; Zahid et al. 2011; Yabe et al. 2012; Zahid et al. 2014a, 2014b; Sanders et al. 2015; Kashino et al. 2017) with only a few cases where the direct method has been used (Ly et al. 2016; Sanders et al. 2020). The evolution of the gas MZR is established with the metallicity monotonically decreasing at fixed (observed) M* with redshift.

In contrast, the measurement of the stellar mass–stellar metallicity (Z*) relation (hereafter stellar MZR) beyond the local universe is to date very limited (Cullen et al. 2019; Calabrò et al. 2021). A notable work was recently carried out by Cullen et al. (2019), who presented an M*Z* correlation over 108.5M*/M ≲ 1010.5 using a large statistical sample of star-forming galaxies at z = 2.5–5.0 (see also Cullen et al. 2020). We are, however, still a long way from being able to constrain the evolution of the stellar MZR through cosmic time. Given the limited number of the existing measurements, independent measurements based on a different data set are highly desired.

In this work, we measure the stellar MZR for a large sample of star-forming galaxies at 1.6 ≤ z ≤ 3.0 by utilizing the rest-frame FUV spectra obtained with the VIsible Multi-Object Spectrograph (VIMOS) mounted on the Very Large Telescope (VLT) UT3 in the zCOSMOS-deep survey (Lilly et al. 2007; S. J. Lilly et al., in preparation). We then explore the evolution of the oxygen-to-iron abundance pattern that is inferred from the comparison with the gas-phase metallicity measurements.

The paper is organized as follows. Section 2 presents an overview of the observations and describes the sample selection. Section 3 describes our spectral analysis for estimating the stellar metallicities. The results are presented in Section 4. Section 5 presents further attempts for interpreting the observations by using gas-regulated chemical evolution models to track the iron and oxygen chemical enrichment. We then compare our results and models with data of the Galactic stars to explore the link with the Galactic archeology in Section 6. Section 7 provides a summary of the paper.

We adopt the solar metallicity values of $12+\mathrm{log}{({\rm{O}}/{\rm{H}})}_{\odot }=8.69$ and Z = 0.0142 (Asplund et al. 2009). Here Z denotes the overall metal mass fraction. We use ZFe and ZO when specifying the element, either iron or oxygen. The notation [A/B] denotes the logarithmic elemental abundance ratio normalized to solar, i.e., $[{\rm{A}}/{\rm{B}}]=\mathrm{log}{({N}_{{\rm{A}}}/{N}_{{\rm{B}}})-\mathrm{log}({N}_{{\rm{A}}}/{N}_{{\rm{B}}})}_{\odot }$. The solar oxygen and iron mass fractions are ZO,⊙ = 0.00561 and ZFe,⊙ = 0.00126, respectively. Magnitudes are quoted on the AB system. The Chabrier (2003) initial mass function (IMF) is used throughout. This paper uses a standard flat cosmology (h = 0.7, ΩM = 0.3, ΩΛ = 0.7).

2. Data and Galaxy Sample

2.1. Observations

The zCOSMOS-deep redshift survey has observed around 104 galaxies in the central ∼0.8 deg2 of the COSMOS field (Scoville et al. 2007). Here we provide a brief description and refer the reader to Lilly et al. (2007) and Kashino et al. (2021) for more details.

The observations were carried out using VLT/VIMOS (Le Fèvre et al. 2003) with the low-resolution blue grism with 1farcs0 slits, yielding a spectral resolution of R ∼ 200 and a spectral coverage of ≈3600–6700 Å. The selection of the targets was performed based on a then-current version of the COSMOS photometric catalog. All of the objects were color-selected through a BzK (Daddi et al. 2004) or ugr (Steidel et al. 2004) method with a blue magnitude cut BAB < 25.25. These selection criteria isolate star-forming galaxies in a range 1.4 ≲ z ≲ 3.0 (Lilly et al. 2007). Redshifts were visually inspected in 2D and 1D reduced spectra by identifying multiple prominent spectral features in the rest-frame FUV window or Lyα emission line and break.

Figure 1 shows six representative examples of individual galaxy spectra spanning a range of redshifts and the confidence class. Note that the additional flux calibration described in Section 3.1 has been applied to these spectra. The wavelength regions without data are those impacted by strong sky lines or zeroth-order contamination. Those of class = 3 or 4 present robust redshift identification based on several strong absorption lines and/or a strong Lyα emission line. The identification of such spectral features becomes less secure for class = 2 spectra.

Figure 1.

Figure 1. Examples of representative individual galaxy spectra, after the application of the flux calibration that is described in Section 3.1. Typical spectra are shown for each of the redshift confidence classes (2, 3, and 4). The bottom right panel presents an example of a strong Lyα emitter. The gray lines indicate the observed spectra. The black lines present spectra smoothed with an arbitrary Gaussian kernel of σ = 5 pixel. The brown lines indicate the 1σ error spectra. The blue squares indicate the photometric fluxes within the 12 bands that are used for flux calibration. Positions of some strong spectral features are marked by horizontal lines.

Standard image High-resolution image

2.2. Sample Selection

We constructed the sample used in this paper from the full catalog of the zCOSMOS-deep survey (S. J. Lilly et al., in preparation). The sample is limited to those having a clear photometric counterpart in the COSMOS2015 photometric catalog (Laigle et al. 2016). Galaxies detected in X-rays (Civano et al. 2016) are also excluded to remove possible active galactic nuclei (AGNs) from the sample.

The redshift range for the current analysis is limited to 1.6 ≤ z ≤ 3.0 so that the VIMOS spectrum covers the range of λrest ≈ 1400–1700 Å for all the galaxies. We adopt all objects with a very secure zCOSMOS-deep redshift (Confidence Class = 3 or 4 15 ) within the redshift range. For those with Class = 2, we use only those that are consistent to within ∣zphotzspec∣/(1 + zspec) ≤ 0.1 of the photometric redshift in the COSMOS2015 catalog. 16

The redshifts in these two categories are both estimated to be ≥99% reliable (Lilly et al., in preparation). We do not use any of the objects with less secure redshifts, nor any of those with broad emission lines (i.e., Class +10). Finally, we visually inspected individual sources and excluded 34 sources (2.5% of the remaining sample) for which the FUV continuum is barely detected (which are likely to be Lyα emitters with very faint continua), or that show obvious AGN signatures (e.g., strong Nv λ1240 emission lines), or that suffered from severe spectral contamination from companion sources that fall in the same slit.

The final sample consists of 1336 galaxies. Figure 2 shows the redshift distribution of the sample with the median redshift ${\left\langle z\right\rangle }_{\mathrm{med}}=2.22$. We note that the redshifts were all determined with strong absorption lines owing to mostly carbon and/or silicon ions in the ISM, but not with any kind of stellar iron lines. Here we ignore any possibility that our sample with secure redshifts may be biased toward those with strong iron features, although the strengths of the ISM absorption lines could be correlated at some level with the overall gas-phase metallicity and thus the ISM absorption strengths (Faisst et al. 2016).

Figure 2.

Figure 2. Spectroscopic redshift distribution for the entire sample of 1336 galaxies. Counts per Δz = 0.05 are shown. The vertical dashed line indicates the median redshift of $\left\langle z\right\rangle =2.22$.

Standard image High-resolution image

2.3. Stellar Mass Estimation

We derived stellar masses (M*) for individual galaxies through spectral energy distribution (SED) fitting based on the photometry from the COSMOS2015 catalog, together with the precise spectroscopic redshifts. It should be noted that the stellar mass here denotes the mass of living stars at the time of observation, rather than the integral of the star formation rate (SFR). The SED fitting outlined in this section is done to the broadband photometry and is meant to derive stellar masses. We will independently perform another fitting to the composite FUV spectra using high-resolution model spectra to estimate the stellar metallicities (see Section 3.5).

Our SED fitting procedure uses the photometric fluxes measured within 32 broad-, intermediate-, and narrowband filters from GALEX near-UV to Spitzer/IRAC ch4, as listed in Table 3 of Laigle et al. (2016). For CFHT, Subaru, and UltraVISTA photometry, we used the fluxes measured in a 3''-diameter aperture and applied offsets provided in the catalog to convert them to the total fluxes. All the photometric bands whose rest-frame central wavelengths are within 1960 Å≤ λcen/(1 + z) ≤ 2440 Å were excluded in order to ensure that the SED fitting is not affected by the possible 2175 Å bump feature in the SED of the galaxies (see Kashino et al. 2021).

The stellar masses and SFRs are estimated using a Python code CIGALE (Burgarella et al. 2005; Noll et al. 2009; Boquien et al. 2019) for the SED fitting and adopted the stellar population synthesis models of Bruzual & Charlot (2003) with a Chabrier (2003) IMF. We considered delayed SFHs with an additional recent burst in order to model the long-term star formation that has formed the bulk of the stellar mass and the latest episode of star formation (e.g., Ciesla et al. 2016, 2017; Pearson et al. 2017),

Equation (1)

where ${\mathrm{SFR}}_{\mathrm{delayed}}(t)\propto {{te}}^{-t/{\tau }_{0}}$ and ${\mathrm{SFR}}_{\mathrm{burst}}(t)\propto {e}^{-(t-{t}_{1})/{\tau }_{1}}$ if t > t1 and SFRburst(t) = 0 otherwise. The parameter t denotes the elapsed time since the onset of star formation, t1 the galaxy age when the late episode of star formation onsets, and τ0 and τ1 the e-folding times of the main stellar population and the late starburst population.

In CIGALE, we also accounted for the effect on the photometry of nebular emission lines assuming a common ionization parameter $\mathrm{log}U=-2.8$ and dust emission based on the templates from Dale et al. (2014). The dust attenuation is accounted for using the prescription of Charlot & Fall (2000). The full list of the input parameters is presented in Table 1.

Table 1. Input Parameters of the SED Fitting with CIGALE

ParameterValues
Delayed+Burst SFH
Age (main population) (Myr)1000–4500 in steps of 250
e-folding time of the delayed SFH, τ0 (Myr)1000, 2000, 3000
Age (starburst population) (Myr)50, 100, 150, 200
e-folding time of the late starburst, τ1 (Myr)10,000
Mass fraction of the late burst population0.001, 0.003, 0.010, 0.020, 0.040, 0.100, 0.200, 0.400
Stellar Population: Bruzual & Charlot (2003)
Initial mass functionChabrier (2003)
Metallicity, Z 0.008
Separation age between young and old populations (Myr)10
Dust Attenuation: Charlot & Fall (2000)
AV in the diffuse ISM0.0, 0.05, 0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8,
 0.9, 1.0, 1.2, 1.4, 1.6, 1.8, 2.0, 2.5, 3.0, 3.5
δISM (power-law slope for the ISM)−0.7
δBC (power-law slope for birth clouds)−1.3
${A}_{V}^{\mathrm{ISM}}/({A}_{V}^{\mathrm{BC}}+{A}_{V}^{\mathrm{ISM}})$ 0.44

Download table as:  ASCIITypeset image

It is known that SED fitting often leads to unrealistically young ages (a few × 100 Myr), and thus spuriously low stellar masses, for high-z star-forming galaxies when there is no limitation on the age. This is because the SED of such galaxies is dominated by the youngest stellar populations (e.g., Maraston et al. 2010). Moreover, it is not plausible that ∼ 1010 M galaxies at z ∼ 2–3 have started to form stars just before we happen to observe them, given the fact that we can find star-forming galaxies continuously at higher redshifts. It has been shown that limiting the minimum age to a Gyr or more better recovers the stellar masses of high-z star-forming galaxies (Maraston et al. 2010; Pforr et al. 2012). To avoid artificially inferring unrealistically young ages, we thus limit the age of the main stellar population to be ≥ 1 Gyr. This corresponds to a formation redshift z ≳ 5 for galaxies at z ∼ 3 and should therefore not be far from reality.

The estimated stellar masses span the range $9.23\leqslant \mathrm{log}{M}_{* }/{M}_{\odot }\leqslant 10.85$ (the 2.5th–97.5th percentiles) with a median value of ${\left\langle \mathrm{log}({M}_{* }/{M}_{\odot })\right\rangle }_{\mathrm{med}}=10.0$. The reduced χ2 values range mostly between 0.4 and 3.3 with the median value of 0.95. Figure 3 shows the distribution of the inferred M* and instantaneous SFR. Note that there is a (spurious) sharp upper boundary of the SFR distribution in Figure 3. This is produced by the limited range of SFR probed by the adopted SFHs. This should be of no consequence, as we do not use the SFR values of the individual galaxies in the following analyses.

Figure 3.

Figure 3. Results of the SED fitting. Left panel: distribution of estimated stellar masses. Right panel: stellar mass vs. instantaneous SFR for the entire sample of 1336 galaxies. For reference, the main-sequence relations are shown, taken from Whitaker et al. (2014, z = 2.0–2.5; red dashed line) and Renzini & Peng (2015, z ≈ 0.1; solid line).

Standard image High-resolution image

For reference, the main-sequence M*–SFR relations at similar redshifts are taken from Whitaker et al. (2014, z = 2.0–2.5). We also plot the local main sequence adapted from Renzini & Peng (2015). Overall, our sample is in good agreement with the main sequence at the earlier epoch, indicating that it should be a representative sample at these redshifts. We note that the magnitude limit, BAB < 25.25, for the parent sample emerges as a cut in SFR at SFR ∼ 5 M yr−1 in Figure 3 (right panel). At $\mathrm{log}{M}_{* }/{M}_{\odot }\lesssim 9.6$, we are missing galaxies with lower SFRs owing to their low observed flux, and thus the results may be biased at the low-mass end.

3. Stellar Metallicity Measurement

In this section, we describe the method to measure the stellar metallicities. We start with the spectrophotometric calibration of each single spectrum and then improve the accuracy of the spectroscopic redshifts of the sources by fitting a common spectral template to each single spectrum.

3.1. Flux Calibration of the VIMOS Spectra

We adopt a spectrophotometric correction to every single spectrum of our sample following the method described in Kashino et al. (2021). The spectra that are produced through the standard zCOSMOS-deep reduction pipeline were flux-calibrated based on standard-star observations. The nominal flux calibration, however, cannot perfectly correct for the effects of finite slit width, imperfect slit centering, and the effects of atmospheric dispersion.

We correct each spectrum with a smooth function of wavelength (see Equation (8) of Kashino et al. 2021) that is constructed using the differences between the actual photometric fluxes in four broadband and eight intermediate-band filters (shown in Figure 4) and the fluxes computed from the pipeline-processed spectrum in these same 12 filter bands. In doing so, we excluded any photometric bands that sampled the rest-frame wavelength of the Lyα emission line. This is because the flux of the strong Lyα line may be differently affected than the continuum flux owing to the possible extended shape of the emission and/or the overestimate of the sky level at these particular wavelengths.

Figure 4.

Figure 4. Left panel: flux ratios, ${F}_{i}^{\mathrm{spec}}/{F}_{i}^{\mathrm{phot}}$, after correction for the sample of 1336 galaxies. Symbols for each single spectrum are connected by a line. Different spectra are colored differently for display purposes. The red solid, dashed, and dotted lines indicate, respectively, the median values, the 16th–84th percentiles, and the 5th–95th percentiles. The horizontal dotted line indicates ${F}_{i}^{\mathrm{spec}}/{F}_{i}^{\mathrm{phot}}=1$. The transmission curves of the relevant photometric bands are shown at the bottom. Right panel: the same flux ratios, but as a function of rest-frame wavelength. The red solid and dashed lines indicate running medians and 16th–84th percentiles with a window size of 100 Å. The vertical dotted lines denote the lower and upper limits of the wavelength range used for the stellar metallicity measurement.

Standard image High-resolution image

Examples of the flux-calibrated spectra are shown in Figure 1, demonstrating that the spectra are in agreement with the photometric fluxes. The left panel of Figure 4 shows how this spectrophotometric calibration works in the observed frame by comparing the spectral broad- and intermediate-band fluxes (${F}_{i}^{\mathrm{spec}}$, where i denotes a filter) recomputed in the flux-corrected spectra with the photometric fluxes (${F}_{i}^{\mathrm{phot}}$) from the COSMOS2015 catalog. The median values of the corrected ${F}_{i}^{\mathrm{spec}}$-to-${F}_{i}^{\mathrm{phot}}$ ratios in each band are all within ±0.015 dex, and the scatter is ∼ 0.02–0.05 dex, depending on the filters. This overall scatter (seen in the rest frame) is quite smaller than before correction ( ∼ 0.1–0.15 dex). Note that there is still scatter for a given filter because the correction used a smooth function of wavelength, so the effects of photometric noise in the different filters are still seen.

The right panel of Figure 4 shows the same data but now shifted to the rest-frame wavelengths. There is no systematic trend in the corrected flux ratios across the entire wavelength range for the stellar metallicity measurement (1274–2053 Å). The running medians with a window size of 100 Å are all within 0.005 dex over this wavelength range of interest. On the other hand, a significant systematic undercorrection is seen around 1216 Å most likely because of sky-subtraction issues associated with strong Lyα lines mentioned above.

This flux calibration is important to obtain composite spectra that correctly reflect the average shape of the galaxies' SEDs. We note, however, that the precision of this calibration is unlikely to be critical for our conclusions, because the stellar metallicities are measured based on the detailed shape of the spectra that results from the blending of numerous narrow stellar absorption lines, whereas the overall shape of the smooth continuum is fit with an arbitrary multiplicative λ-dependent function (see Section 3.5). In the remainder of the paper, the term "observed VIMOS spectra" will always refer to these accurately spectrophotometrically recalibrated spectra.

3.2. Fine Adjustment of Spectroscopic Redshifts

Precise determination of the spectroscopic redshifts is important to reduce the loss of the potential spectral resolution when spectra are stacked. The spectroscopic redshifts in the zCOSMOS-deep catalog have been determined by visually inspecting prominent emission and absorption features in each spectrum, but no systematic spectral fitting has been performed. As a consequence, the spectral redshifts may be more uncertain than the best estimates that can potentially be achieved from the existing spectra; thus, some improvements are possible.

We therefore made small adjustments to the spectroscopic redshifts by fitting a common template spectrum to each of the individual spectra. We constructed this template by stacking the observed spectra of the entire sample of 1336 galaxies used in this work by using the original spectroscopic redshifts from the catalog. In this template fitting, we applied an arbitrary normalization and a wavelength-dependent multiplicative factor that is intended to mimic the Calzetti et al. (2000) dust attenuation.

The differences between the revised and original spectroscopic redshifts, (zorigznew)/(1 + znew), have a Gaussian-like distribution with a standard deviation of 8.4 × 10−4 (252 km s−1) and median of 6.4 × 10−6. We found that, by using the revised spectroscopic redshifts, the resulting stacked spectra are noticeably improved, showing sharper spectral features in both emission and absorption than those seen in the stacks based on the original redshifts. However, the adjustment of the spectroscopic redshifts has little effect on the metallicity measurements, and our conclusions do not change if the stacking is done using the original redshift values.

3.3. Stacking Procedure

To infer the stellar metallicity as a function of stellar mass, we rely on stacked spectra from subsamples of galaxies defined by their stellar mass. The observed spectra are co-added as follows. We first transform all the individual spectra to the rest-frame wavelength based on their adjusted redshift (see Section 3.2) and rebin them to a common wavelength grid with a spacing of 1 Å. This is slightly oversampled relative to the pixel resolution of the original spectra (5.35 Å pixel–1 in the observed frame) and equivalent to the wavelength grid spacing of the model spectra used for spectral fitting (see Section 3.4).

Each spectrum is then normalized by dividing by a fitted continuum of the form ${\lambda }^{\beta }\times {10}^{-0.4{A}_{\lambda }}$, where Aλ is the dust attenuation of the Charlot & Fall (2000) prescription. Here the overall dust attenuation is a free parameter for continuum fitting, independent of the result from the SED fitting in Section 2.3. By doing so, the effects of the variable overall shapes of the spectra are mitigated. We then take the mean value of the individual continuum-divided spectra at each wavelength grid while ignoring any spectral regions that are missing and/or contaminated, for example, by zeroth-order contamination or strong sky lines. Finally, to recover the global shape of the average SED, the stacked spectrum is multiplied by the mean of the fitted continua. We note that normalizing the individual spectra to their continuum level at λrest ∼ 1550 Å produces very similar results. Stacking without any normalization, however, yields more noisy composite spectra and more residuals with respect to the fitted models.

Figure 5 shows the composite spectrum (middle panel) of the entire sample. 17 The number of spectra used at each wavelength is indicated in the top panel. Note that, given the redshift range of our sample, the rest-frame wavelength range of 1400–1700 Å is covered by nearly all the input spectra, while shorter and longer wavelengths are less well sampled. Some prominent spectral features are clearly identified as marked by vertical lines.

Figure 5.

Figure 5. Composite VIMOS spectrum of the entire sample of 1336 galaxies at 1.6 ≤ z ≤ 3.0. Top panel: the number of spectra that have been stacked at each wavelength grid. The horizontal dotted line indicates the number of galaxies in the stack. Middle panel: the stacked spectrum (black line), in which some prominent absorption and emission features are identified as marked by color-coded labels (interstellar absorption features–orange; nebular emission lines–green). The blue line indicates the best-fit BPASS model obtained from the MCMC analysis, and the red line indicates the best-fit model before smoothing. The gray regions indicate the wavelength ranges that are masked out in the fitting (see Table 2). The purple dotted–dashed lines mark the wavelength region that is used for the Rix et al. (2004) 1978 Å index. Bottom panel: residuals from the best fit. The vertical error bars correspond to the associated 1σ errors. The data points that are not included in the fit are shown in gray.

Standard image High-resolution image

3.4. Stellar Population Synthesis Models for Metallicity Estimation

To derive stellar metallicities for the galaxies, we compare our observed composite spectra with model spectra, following the approaches described in Steidel et al. (2016) and Cullen et al. (2019).

We utilize the latest public data release of the population synthesis code "Binary Population and Spectral Synthesis" (BPASSv2.2.1; Eldridge et al. 2017; Stanway & Eldridge 2018). The package provides sets of single stellar population synthesis model spectra at a pixel resolution 1 Å pixel−1 as a function of stellar ages for different IMFs and for discrete (overall) stellar metallicities (Z* = 10−5, 10−4, 0.001, 0.002, 0.003, 0.004, 0.006, 0.008, 0.010, 0.014, 0.020, 0.030, and 0.040). It is important to note that although the BPASS models adopt a fixed abundance ratio based on the solar abundances, the model fit to the rest-frame FUV spectrum is mostly sensitive to the iron abundance of the young stellar component of the galaxies. This is because stellar photospheric absorption features in the wavelength range of interest are primarily due to transitions of highly ionized iron (Dean & Bruhweiler 1985; Brandt et al. 1998). We can therefore translate the inferred metallicity Z* into [Fe/H].

We adopted the BPASSv2.2.1 model that used a Chabrier (2003) IMF with a high-mass cutoff of 100 M and included binary star evolution, and we considered a continuous SFH with a duration of 100 Myr to construct a set of rest-frame UV template spectra. Note that the FUV spectrum is almost independent of the past SFH at ≳ 100 Myr previously, as it is dominated by massive young stars. Indeed, we checked that the results hardly change for different duration times between 10 and 1000 Myr. Figure 6 shows examples of the model spectra at different metallicities, demonstrating that the detailed shapes of the galaxy spectra across the rest FUV window are sensitive to the stellar metallicity. A feature of the BPASS binary models is that they model the broad He λ1640 emission line that originates in the winds from very massive stars (see Section 3.5 for a relevant description).

Figure 6.

Figure 6. The BPASSv2.2.1 stellar population models. All models assume a constant SFR over 100 Myr. Each panel shows the model spectrum in the rest FUV window at $\mathrm{log}({Z}_{* }/{Z}_{\odot })=-2.15$ to −0.25 from the top to the bottom. The gray lines show the full resolution model spectra, which are here normalized by the pseudocontinuum constructed from a spline fit to the selected points (green diamonds; see Table 3 of Rix et al. 2004). The blue lines indicate the models smoothed to the VIMOS resolution (σ = 3.3 Å, or 2500 km s−1 in FWHM; see Section 4). It is clearly seen, even at the VIMOS resolution, that the absorption features are stronger with increasing stellar metallicity. The vertical dashed lines indicate the range of the traditional 1978 Å index (Rix et al. 2004) for reference.

Standard image High-resolution image

We added nebular continuum and line emission to the templates, although it has only a minor contribution of ∼5% in the total continuum flux. The nebular continuum was computed using cloudy v17.00 (Ferland et al. 2017) adopting the BPASS spectrum itself as the incident spectrum. We assumed the electron density to be ne = 300 cm−3 and the ionization parameter to be $\mathrm{log}U=-2.8$. These values are consistent with the recent estimates in z ∼ 2–3 star-forming galaxies (Sanders et al. 2016; Kashino et al. 2017; Strom et al. 2017, 2018) and local star-forming galaxies with specific SFR (sSFR) as high as our sample (∼5 Gyr−1; Kashino & Inoue 2019). We change the gas-phase metallicity to calculate nebular emission according to the stellar metallicity, with an offset of $\mathrm{log}({Z}_{\mathrm{gas}}/{Z}_{* })=0.42$ as reported for z ∼ 2–3 galaxies by Strom et al. (2018). In other words, we considered an enhanced gas-phase metallicity, $12+\mathrm{log}({\rm{O}}/{\rm{H}})=8.69+\mathrm{log}({Z}_{* }/{Z}_{\odot })+0.42$, relative to each Z* of the BPASS template. We will indeed measure consistent $\mathrm{log}({Z}_{\mathrm{gas}}/{Z}_{* })$ or [O/Fe] values in Section 4.3 and discuss the evolution of [O/Fe] in Section 5.

3.5. Fitting Synthesis Model Spectra

Our goal is to estimate average stellar metallicities for binned subsamples of galaxies by using their composite spectra. To do so, we limited the wavelength range between 1274 Å ≤ λrest ≤ 2053 Å and excluded some narrow wavelength regions where the stellar continuum is impacted by interstellar absorption lines or nebular emission lines. The wavelength regions excluded from the fitting are summarized in Table 2. We note that the region of the He λ1640 line is not excluded because the BPASS models include the broad He λ1640 stellar emission line that originates from very hot stars. In fact, as shown in Section 4, the observed He λ1640 feature in the composite spectra could be attributed almost entirely to the contribution from stars that is predicted in the BPASS models.

Table 2. Rest-frame Wavelength Ranges Excluded from Fitting a

${\lambda }_{\min }$ ${\lambda }_{\max }$ Interstellar b
(Å)(Å)Spectral Features
12941318O i λ1302.17, Si ii λ1304.37, Si ii * λ1309.28
13261343C ii λ1334.53
13531361O i λ1355.60, O i λ1358.51
13851411Si iv λλ1393.76, 1402.77
15181535Si ii λ1526.71
15401559C iv λλ1548.19, 1550.77
16001617Fe ii λ1608.45
16581679O iii] λλ1660.81, 1666.15, Al ii λ1670.79
17071712Ni ii λ1709.60
17391744Ni ii λ1741.55
17491754Ni ii λ1751.91
17581763C ii λ1760.5 (blended)
18061811Si ii λ1808.01
18141820Si ii λλ1816.93, 1817.45
18461871Al iii λ1854.72, 1862.79
18801885Si iii] λ1882.71
18901895Si iii] λ1892.03
18981917C iii] λ1906.68, 1908.73

Notes.

a The wavelength range used for the fit is limited to λrest = 1274–2053 Å. b Wavelengths are all given in vacuum.

Download table as:  ASCIITypeset image

An added advantage of the full spectral fitting over using the traditional indices (e.g., the 1978 Å index) is that the former does not need to identify the pseudosmooth continuum. Contrarily, the latter requires it for measuring equivalent widths (EWs) of particular stellar absorption lines. The measurements of the EWs of faint features are quite sensitive to the assumed continuum level, while the determination of the pseudocontinuum has to be based on very limited wavelength regions free from narrow features, thus being easily affected by noise. The full spectral fitting is free from these difficulties.

We fit the constructed BPASS models to our composite spectra based on the maximum likelihood estimation. We assume that the associated errors are Gaussian and independent, so the logarithm of the likelihood ${ \mathcal L }$ is given by

Equation (2)

where ${f}_{i}^{\mathrm{obs}}$ is the observed composite spectrum at the ith wavelength grid, ${f}_{i}^{\mathrm{model}}$ is the model spectrum for a given set of parameters, and σi is the error on the observed flux. The summation is over all wavelength grids that are included in the fit.

For our models, we adopted eight free parameters: the logarithm of the overall stellar metallicity ($\mathrm{log}{Z}_{* }$), the Gaussian smoothing kernel σsmooth applied to the model spectra, and six parameters for a fifth-order polynomial function that is multiplied by the template spectrum:

Equation (3)

where ${f}_{i}^{\mathrm{BPASS}}({Z}_{* })$ is the template spectrum at given Z* in arbitrary units, in which the nebular emission is accounted for. The polynomial term is ideally intended to reflect the change of the overall shape of the spectra due to dust attenuation. 18

To sample the posterior probability distribution of the model parameters, we employed a Markov Chain Monte Carlo (MCMC) technique using the emcee package for Python (Foreman-Mackey et al. 2013). In our fitting, we adopted a uniform prior probability function for each parameter. For stellar metallicity, a flat prior is adopted in log space (i.e., $\mathrm{log}({Z}_{* })$) between $-5\leqslant \mathrm{log}({Z}_{* })\leqslant \mathrm{log}(0.040)$ (which corresponds to $-3.15\leqslant \mathrm{log}({Z}_{* }/{Z}_{\odot })\leqslant 0.44$) according to the possible choices in the BPASS models. Since the models are only provided at 13 discrete metallicity values (see Section 3.4), we interpolated the flux values in $\mathrm{log}{Z}_{* }$$\mathrm{log}(\mathrm{flux})$ space between the models. This enables us to generate a model at any metallicity value within the available range.

In the fitting, we need to match the spectral resolution of the model templates to that of the observed composite spectra. However, the spectral resolution of the composite spectra is not precisely known. It may vary from galaxy to galaxy within a subsample because of different velocity dispersions of the galaxies and different slit illumination profiles. We thus leave the smoothing scale σsmooth, in units of Å, 19 as a free parameter rather than using a predetermined value. In the fitting, the model spectra are smoothed by a Gaussian kernel with σsmooth, and the smoothing scale that gives the best fit to the data is constrained along with other parameters.

3.6. Accuracy of the Metallicity Measurements

Before applying our procedure to the data, we evaluate the potential systematic uncertainties in the metallicity measurements that may come from the noise and the limited spectral resolution of the spectra.

As we will show below, the width of the smoothing kernel, σsmooth, is estimated from the fits to be ∼ 3.3 Å. This is large enough to wash out individual narrow stellar absorption features in the model spectra. Still, the model spectra with different metallicities will retain their different characteristic shapes that result from blending of the narrow features after smoothing. Thus, the metallicity measurement is indeed possible. The possible systematic uncertainties, however, should be evaluated.

For this purpose, we constructed a set of artificial spectra using the BPASS models at different metallicities. We smoothed them with σsmooth = 3.3 Å and modified the overall shape using the representative polynomial function as in Equation (3). We then added Gaussian noise with the moderate wavelength dependence that replicates the 1σ error spectra. We considered noise with four different levels: the best had the noise seen in the composite spectrum of the entire sample, i.e., the S/N at λrest ∼ 1450 Å, S/N1450 ≈ 190 per 1 Å. The other three are degraded to have twice (S/N1450 ≈ 95), five times (S/N1450 ≈ 38), or 10 times (S/N1450 ≈ 19) higher noise than the first one. The stacked spectra of the subsamples selected by stellar mass have noise levels between these values, depending on the number of galaxies in the bin and their average brightness.

We then attempted to measure the stellar metallicity in the same way as from the data in order to evaluate how accurately the input metallicities can be recovered. We repeated each setup 10 times with different random noise realizations. Here only a single fiducial duration of star formation (100 Myr) is considered.

Figure 7 shows the results of this exercise, separately, for four different noise levels. The difference between the inferred and input metallicities is shown as a function of the input value. The error bars indicate the 16th–84th percentiles of the inferred posterior for individual measurements. The results from the 10 trials with different random noises are shown separately with slight offsets in the x-axis. We also compile the posterior probability distributions of $\mathrm{log}{Z}_{* }$ from all 10 trials and derive the median and the 16th–84th percentiles that are shown by blue solid and dashed lines.

Figure 7.

Figure 7. The results of testing how accurately the input metallicities can be recovered with realistic noise and the spectral resolution of the actual data. The difference between the derived metallicity and the input metallicity is shown as a function of the input value. The four panels show the results for different levels of random noise, from top to bottom, the same level as the stack of the entire sample (S/N ≈ 190 Å−1), and twice, five times, and 10 times higher than the top one (S/N ≈ 95, 38, and 19 Å−1, respectively). Red symbols indicate individual results of the MCMC analysis at different input metallicities, each for 1 of 10 realizations of the random noise. The results are shown with small offsets along the x-axis for display purposes. Blue solid and dashed lines indicate, respectively, the median and the 16th–84th percentiles of the posterior probability distribution compiling all 10 trials.

Standard image High-resolution image

Generally, the uncertainties reduce with increasing metallicity because the stellar absorption features in the spectrum become more prominent and sensitive to the metallicity value at higher metallicity. It can be seen that, with all these noise levels, the systematic biases in the metallicity measurements are negligible with no trend across the metallicity range of interest.

4. Results

Figure 8 shows the posterior distributions of the parameters for the composite stack of the entire sample of 1336 galaxies. 20 The stellar metallicity, Z*, and smoothing scale, σsmooth, are constrained with a single preferred solution ($\mathrm{log}({Z}_{* }/{Z}_{\odot })=-0.812\pm 0.008$ and σsmooth = 3.30 ± 0.07 Å). Here the uncertainties denote the statistical 16th–84th confidence intervals in the posterior probability distribution functions inferred from MCMC fitting and do not therefore include any systematic uncertainties ascribed to the methodology, e.g., the choice of the stellar population synthesis model used in the fits. The constraints on the coefficients pk in Equation (3) also show single peaks in the posterior distributions. This also holds when the sample is divided into stellar mass bins.

Figure 8.

Figure 8. Posterior probability distribution functions for $\mathrm{log}({Z}_{* }/{Z}_{\odot })$ and smoothing scale σsmooth. The contours on the 2D plots correspond to enclosing 68% and 95% of the posterior probability. The vertical lines in the 1D plots indicate the 16th, 50th (median), and 84th percentiles. Here we omit the posteriors of the coefficients pk in Equation (3) that were simultaneously fitted.

Standard image High-resolution image

Figure 5 shows the best-fit model for the stack of the entire sample. The detailed shape of the FUV continuum is well reproduced after smoothing the original BPASS model spectrum with σsmooth = 3.3 Å. The recovered smoothing width σsmooth is marginally less than the nominal resolution of the VIMOS spectrograph with the LR blue grism and 1''.0 arcsec slits of ≈ 3.4 Å, possibly due to nonuniform illumination of the slit. 21

For comparison to a standardized metallicity indicator, we also measured the "1978 Å index" of Rix et al. (2004), i.e., the EW across 1935–2020 Å, to be 2.62 ± 0.08 Å. The calibration of Rix et al. (2004, see Equation (8)) converts this into $\mathrm{log}({Z}_{* }/{Z}_{\odot })=-0.93\pm 0.05$, which is consistent within ∼0.1 dex with the fiducial result. 22 Note that the statistical uncertainty here is about six times larger than that of the fiducial result because the spectral regions used for the 1978 Å index are limited.

In the following, we present the measurement of the stellar MZR based on the stacks in stellar mass bins. We here recall that the metallicity estimated from the photospheric absorption in the FUV spectra reflects the iron abundance (e.g., Steidel et al. 2016) in short-lived, recently formed stars and thus can be regarded as being almost equivalent to the gas-phase value.

4.1. Stellar Mass versus Stellar Metallicity

To investigate the relation between stellar mass and stellar metallicity, we divided the sample of 1336 star-forming galaxies into bins of stellar mass in two different binning schemes: first we equally split the sample into six mass bins, and then we divided the sample into bins with a constant width of ${\rm{\Delta }}\mathrm{log}{M}_{* }=0.2\,\mathrm{dex}$. After stacking, we performed the MCMC analysis to fit the BPASS template to the stacked spectra as described in Section 3.5.

Figure 9 shows the composite spectra and the corresponding best-fit models in the six equally populated subsamples. It is noticeable that the overall slope of the spectra becomes shallower for higher M* owing to the increasing average dust attenuation. The overall shape and detailed features are both well reproduced across the entire wavelength range of interest by the model spectra.

Figure 9.

Figure 9. Composite VIMOS spectra and the best-fit BPASS models are shown, separately, for the six subsamples equally separated by stellar mass. The median masses are indicated in each panel. Each panel is in the same format as Figure 5.

Standard image High-resolution image

The results in different bins are summarized in Table 3. The stellar metallicities are measured to range over $-1.3\,\lesssim \mathrm{log}({Z}_{* }/{Z}_{\odot })\lesssim -0.4$, increasing with M*. Note that the BPASS models cover this observed range of metallicity with a sufficiently large margin at either end, and thus our MCMC analysis should not be affected by the artificial limit of the explored metallicity range. We obtained consistent results from the fixed-width binning scheme.

Table 3. Stack Statistics and Metallicity Estimates a

$\mathrm{log}\left({M}_{* }/{M}_{\odot }\right)$ N S/N1450 b $\mathrm{log}\left({Z}_{* }/{Z}_{\odot }\right)$ c
MedianMin–Max  ([Fe/H])
9.978.38–11.481336189 $-{0.812}_{-0.008}^{+0.008}$
Binned Equally into Six Subsamples in M*
9.458.38–9.6222271 $-{1.045}_{-0.029}^{+0.029}$
9.749.63–9.8322272 $-{0.901}_{-0.030}^{+0.029}$
9.899.83–9.9722381 $-{0.848}_{-0.018}^{+0.017}$
10.069.97–10.1422386 $-{0.785}_{-0.018}^{+0.017}$
10.2410.14–10.3422382 $-{0.717}_{-0.017}^{+0.017}$
10.5210.34–11.4822373 $-{0.662}_{-0.012}^{+0.016}$
Binned into the 0.2 dex Width Intervals in M*
9.068.91–9.101313 $-{1.281}_{-0.230}^{+0.159}$
9.249.10–9.303528 $-{1.273}_{-0.180}^{+0.143}$
9.409.30–9.508246 $-{1.087}_{-0.412}^{+0.095}$
9.629.50–9.7016967 $-{0.970}_{-0.031}^{+0.031}$
9.829.70–9.9025783 $-{0.837}_{-0.017}^{+0.017}$
10.009.90–10.1028798 $-{0.792}_{-0.016}^{+0.016}$
10.2010.10–10.3022186 $-{0.711}_{-0.017}^{+0.017}$
10.3710.30–10.5014762 $-{0.684}_{-0.017}^{+0.011}$
10.5910.50–10.706442 $-{0.672}_{-0.021}^{+0.025}$
10.7810.70–10.902927 $-{0.611}_{-0.037}^{+0.037}$
11.0010.91–11.101720 $-{0.393}_{-0.078}^{+0.072}$

Notes.

a The top row is for the stack of the entire sample. The middle set of rows is for six equally populated bins of M*, and the bottom set of rows is for stacks in 0.2 dex fixed-width M* bins. b The S/N per unit pixel (1 Å) of the stacked spectra around 1450 Å, represented by the median S/N within λrest = 1430–1470 Å. c Metallicity estimates, represented by the median value of the posterior probability distribution function obtained from the MCMC analysis. The associated errors correspond to the 16th–84th percentiles. Because the fit is sensitive to the iron abundance, this can be translated into [Fe/H].

Download table as:  ASCIITypeset image

Figure 10 shows the relationship between stellar mass and stellar metallicity. The x-axis values of the points correspond to the median stellar masses within the bin, and the horizontal error bars indicate the minimum and maximum M* values in each bin. The y-axis values and the error bars indicate the median and the 16th–84th percentiles of the posterior distribution of $\mathrm{log}({Z}_{* }/{Z}_{\odot })$. There is a tight correlation between these quantities, though the $\mathrm{log}({Z}_{* }/{Z}_{\odot })$ values in the lowest mass bins are relatively insecure. A linear fit to the results in log−log space, obtained from binning the sample into 0.2 dex width bins, yields

Equation (4)

where the errors in the parentheses denote the nominal 1σ errors. This is shown as the blue line in Figure 10.

Figure 10.

Figure 10. Stellar metallicity as a function of stellar mass, i.e., the stellar MZR, for our sample of star-forming galaxies at 1.6 ≤ z ≤ 3.0 (median $\left\langle z\right\rangle =2.22$). Here the stellar metallicity mostly reflects the iron abundance ([Fe/H]). The red squares and blue circles show the results based on two different binning schemes (see text). The blue solid line indicates the best linear fit to our data (Equation (4)) with the 1σ (dark-blue region) and 2σ (light-blue region) confidence limits. For comparison, the relation for a galaxy sample at 2.5 ≤ z ≤ 5.0 ($\left\langle z\right\rangle =3.5;$ green diamonds) is taken from Cullen et al. (2019).

Standard image High-resolution image

This result is generally in good agreement with that of Cullen et al. (2019), who used stacks of in total 681 star-forming galaxies at z = 2.5–5 ($\left\langle z\right\rangle \approx 3.5$). There is a difference at the low-mass end, where the Cullen et al. (2019) data show a tentative sign of flattening at $8.8\lesssim \mathrm{log}{M}_{* }/{M}_{\odot }\lesssim 9.5$, which, however, is not seen in our data (it should be noted that their lowest mass point is only an upper limit). Overall, our metallicities are slightly lower at given M*, which appears to be counter to the expected redshift evolution since the Cullen et al. (2019) sample is generally at somewhat higher redshifts. This apparent discrepancy may be accounted for by a systematic effect from the methodology that we now explain.

The method in Cullen et al. (2019) is very similar to ours and was based on stacks of rest-frame FUV spectra and model templates with various metallicities. The authors, however, used the Starburst99 high-resolution WM-basic stellar population models for their fiducial results. They compared their fiducial estimates to the ones derived using the BPASSv2.1 models and found that the use of the BPASS models leads to systematically lower metallicities by ∼ 0.1 dex. Accounting for this systematic offset will then bring the two results into better agreement. Note that the lower-metallicity limit of the Starburst99 model is $\mathrm{log}({Z}_{* }/{Z}_{\odot })=-1.15$, which is not sufficiently low for unbiased fitting for our sample, and thus we adopted the BPASS model in this work.

Our results also appear to be lower by ∼ 0.4 dex than the average measurement of $\mathrm{log}({Z}_{* }/{Z}_{\odot })=-0.425\pm 0.159$ 23 at M* ∼ 1010 M obtained by Halliday et al. (2008) using the Rix et al. (2004) 1978 Å index. This offset suggests that there possibly remain substantial systematic uncertainties in the stellar metallicity measurements for different samples and methodologies.

It should also be noted that the stellar metallicities may be underestimated because the integrated stellar emission would be biased toward stars formed in the less obscured and thus lower-metallicity environments. Such systematic biases in the stellar metallicity measurements will be explored in future papers.

4.2. Comparison with the Local Stellar MZR

We now compare our result with other work at low redshifts to see how the stellar MZR evolves through cosmic time. Zahid et al. (2017) established the stellar MZR in the range 108.5M*/M ≲ 1011 using ∼2 × 105 star-forming galaxies at z < 0.25 (median $\left\langle z\right\rangle =0.08$) from SDSS. The authors employed a full spectral fitting approach using the observed composite spectra in the rest-frame optical. Their result is in agreement with the measurements obtained from spectroscopy of individual stars in nearby galaxies compiled by Kudritzki et al. (2016). Furthermore, comparing to the stellar metallicities of dwarf galaxies, measured by Kirby et al. (2013), revealed that the M*Z* correlation holds smoothly down to M* ∼ 104 M.

We note that the metallicities from Kirby et al. (2013) are intended to purely reflect the iron abundance (Fe/H). We assume that other local metallicity measurements also reflect the iron abundance. Particularly, this approximation can be justified for the Zahid et al. (2017) measurements that are obtained from comparison between the observed rest optical spectra and the models assuming solar α/Fe. This is because the inferred total metallicity (Z*) is known to be related to [Fe/H] via $[\mathrm{Fe}/{\rm{H}}]=\mathrm{log}({Z}_{* }/{Z}_{\odot })-A[\alpha /\mathrm{Fe}]$ with A ≈ 0.8–0.9 (Trager et al. 2000). Indeed, we will find [α/Fe] ≈ [O/Fe] ∼ 0 for those galaxies.

Figure 11 compares these low-redshift measurements with our results at high redshift. Our high-redshift stellar MZR is clearly offset below the local relation (at our sampled masses) with ${\rm{\Delta }}\mathrm{log}Z\approx 0.8\,\mathrm{dex}$ at a given mass. We should note that the local MZR is not yet corrected for the underestimation that arises from the measurements based on the rest-frame optical integrated light of the galaxies. We will take account of this correction, which is about +0.1 dex (see Appendix), in the subsequent sections, where we will compare high- and low-redshift MZRs more precisely.

Figure 11.

Figure 11. The stellar MZR for our sample (blue circles) at $\left\langle z\right\rangle =2.22$ in comparison with those for z ∼ 0 galaxies: SDSS galaxies (gray circles; Zahid et al. 2017), individual stars in nearby galaxies (red open squares; Kudritzki et al. 2016), and dwarf galaxies (orange open diamonds; Kirby et al. 2013). These stellar MZRs refer to iron abundances. The blue solid line indicates the best linear fit to our data (Equation (4)) with the 1σ (dark-blue region) and 2σ (light-blue region) confidence limits. The black solid line indicates the best-fit relation of Equation (5) to the local sample. The stellar metallicities of z ∼ 2.2 galaxies are lower by ∼ 0.8 dex than the local galaxies at a given stellar mass. Note that the local stellar metallicities are not corrected for the reduction caused by the measurements based on the rest-frame optical spectra (see Section 4.3).

Standard image High-resolution image

We employed the empirical parameterization introduced by Curti et al. (2020, Equation (2)) to express the local stellar MZR:

Equation (5)

where Z0 is the asymptotic metallicity at the massive end, M0 is a characteristic mass where the relation begins to flatten, γ is the power-law slope of the relation at M*M0, and β determines the width of the transition region between the two extremes. Fitting the measurements from Zahid et al. (2017) yields ${({{ \mathcal Z }}_{0},\,{M}_{0},\,\gamma ,\,\beta )}_{z\sim 0}$ = (0.049, 10.24, 0.40, 6.36). In Figure 11, this best-fit relation clearly represents the local measurements.

Comparing with the local relation, our result at z = 1.6–3.0 is found to have a slightly shallower slope, 0.32 ± 0.03 (see above Equation (4)). As noted above, the high-redshift relation is offset to lower metallicities and, perhaps for this reason, does not show evidence of a saturation at the high-mass end.

4.3. The [O/Fe]–Metallicity Relations in Low- and High-redshift Galaxies

As already noted, the gas-phase and stellar "metallicities" can be more or less translated, at both high and low redshifts, into oxygen and iron abundances, respectively. Comparison of the two may therefore give some insight into the dependence and evolution of the relative abundance ratio, O/Fe, often used as a proxy of the α-enhancement.

The gas-phase metallicity reflects the instantaneous oxygen abundance at the time of observation. Measuring the gas-phase iron abundance is, however, quite challenging because of the faintness of iron emission lines, as well as the uncertainties in dust depletion factors; we therefore need to rely on the stellar metallicities. The stellar metallicities of our high-z galaxies estimated from the rest-frame FUV spectra, which are dominated by short-lived, recently formed stars, offer a more "instantaneous" measurement, similar to the gas-phase O/H. As we discuss in Appendix, we find that the FUV-weighted [Fe/H] is consistent within ∼0.02 dex with the gas-phase values. We can thus adopt the ratio of the gas-phase oxygen abundance to the stellar iron abundance of z ∼ 2.2 galaxies, [O/Fe], as a proxy of the instantaneous α-enhancement of the gas in the galaxies for the remainder of the paper.

In contrast, the iron abundance based on the rest-frame optical spectra could be lower than the instantaneous value because of the substantial contribution to the integrated light from older stars. This is the case of the stellar MZR at z ∼ 0 from Zahid et al. (2017). Therefore, the comparison between the two [Fe/H] measurements is not straightforward. As shown in Appendix, the offset with respect to the instantaneous [Fe/H] is probably around ≈ − 0.1 dex for low-redshift galaxies. In the subsequent sections, we thus shift the local stellar MZR by ${\rm{\Delta }}\mathrm{log}({Z}_{* })=+0.1\,\mathrm{dex}$ so that it better reflects the instantaneous iron metallicity. This correction, however, does not have any significant impact on our conclusions.

We take the gas-phase [O/H] MZRs at z ∼ 0 and z ∼ 2.2, respectively, from Curti et al. (2020, Equation (2)) and Sanders et al. (2020). The former uses the latest accurate calibration between the optical strong-line ratios and the metallicity determined from the direct method. The latter is purely based on the direct method O/H measure in individual 18 galaxies at z = 1.7–3.6 (median $\left\langle z\right\rangle =2.17$). An important caveat is that the sample of Sanders et al. (2020) is not representative of our zCOSMOS-deep sample, but rather a compilation from the literature of different surveys. The following analysis is thus based on an assumption that the result of Sanders et al. (2020) and ours both independently represent the same, typical star-forming galaxy population at these epochs.

Sanders et al. (2020) corrected their O/H measurements for the residuals around the epoch's main sequence, ${{\rm{\Delta }}}_{\mathrm{MS}}\mathrm{log}(\mathrm{SFR})$, of their sample galaxies to obtain more representative metallicities at given redshift. The corrections achieve ∼0.2 dex on average. These corrections are, however, questionable at some level owing to systematic uncertainties in determining the shape of the main sequence and the ${{\rm{\Delta }}}_{\mathrm{MS}}\mathrm{log}(\mathrm{SFR})$ dependence of the metallicity. In particular, the authors adopted a locally derived relation, ${\rm{\Delta }}({\rm{O}}/{\rm{H}})=-0.29\times {{\rm{\Delta }}}_{\mathrm{MS}}\mathrm{log}(\mathrm{SFR})$. However, this would be a function of redshift, and indeed a weaker dependence, or none at all, has been claimed for z ∼ 2–3 galaxies in other papers (Onodera et al. 2016; Sanders et al. 2021). We therefore adopt here their direct measurements of O/H (Equation (7) of Sanders et al. 2020) with a moderate constant correction of ${\rm{\Delta }}\mathrm{log}({\rm{O}}/{\rm{H}})=+0.1$. Although this choice is more or less arbitrary, it is indeed within the statistical error of the direct measurement.

A similar bias might also be expected in our own sample. However, we do not find a significant offset from the main sequence at this epoch given by Whitaker et al. (2014) in the M*–SFR diagram of Figure 3. We therefore assume that the observed galaxies and their stellar metallicities are reasonably representative of galaxies of the given M* at these epochs.

In the following analysis, we ignore the possible depletion of oxygen onto dust grains in the ISM, which would cause some underestimate of the gas-phase O/H. The depletion of O is estimated to be ∼ − 0.09 ± 0.05 dex in z ∼ 2 star-forming galaxies (Steidel et al. 2016). This is within the observed uncertainties.

In Figure 12, we compare the empirical fits to the four MZRs (stellar and gas phase; z ∼ 0 and z ∼ 2.2). All the metallicities are here normalized to the solar values. It can be seen that the evolution in [Fe/H] between z ∼ 0 and z ∼ 2.2 is larger than that seen in [O/H]. At M* ∼ 1010 M, the [Fe/H] metallicities at z ∼ 2.2 are about 0.9 dex lower than locally (including the +0.1 dex shift in the local stellar MZR; see above), whereas the [O/H] metallicities are only about 0.4 dex lower than that at z ∼ 0. In other words, the offset between the instantaneous [O/H] and [Fe/H] metallicities evidently increases with redshift; put another way, at M* = 1010 M, [O/Fe] ≈ 0 is indicated locally, but [O/Fe] = 0.47 ± 0.12 at z ∼ 2.2. Given the similar slopes of the Fe and O MZRs at z ∼ 2.2, the average [O/Fe] is consistent with being independent of M*. The implied value of [O/Fe] at z ∼ 2.2 is close to the average [O/Fe] ≈ 0.4 reported for star-forming galaxies at z ∼ 2.3 (Strom et al. 2018; Cullen et al. 2021). It also approaches the predicted maximum value (∼0.6) that is predicted for metal enrichment from CCSNe (Nomoto et al. 2006; Kobayashi et al. 2020) alone, suggesting that the production of iron by SNe Ia had not progressed far at these redshifts.

Figure 12.

Figure 12. Comparison between the stellar ([Fe/H]) and gas-phase ([O/H]) MZRs at low and high redshifts: stellar MZR at z ∼ 0 (gray solid line; Zahid et al. 2017 fitted with Equation (5)), stellar MZR at z ∼ 2.2 (thick solid blue line; our data, Equation (5)), gas MZR at z ∼ 0 (gray dashed line; Curti et al. 2020), gas MZR at z ∼ 2.2 (red dashed line; Sanders et al. 2020). Here the local stellar MZR is shifted by $\mathrm{log}{Z}_{* }=+0.1$ dex so that it reflects better the instantaneous iron abundance in the gas phase. No correction is applied for the z ∼ 2.2 stellar MZR, as it is based on the rest-frame FUV spectra. The light-blue and red shaded regions indicate, respectively, the 68% confidence limits of the fits.

Standard image High-resolution image

We can in principle eliminate M* from the gas and stellar MZRs (taken from Figure 12) to yield the relations between [O/Fe] and either [O/H] or [Fe/H]. This is done in Figure 13, which shows the [O/Fe] as a function of [Fe/H] (left panel) and [O/H] (right panel). The mass ranges of the z ∼ 0 and z ∼ 2.2 relations are limited to $\mathrm{log}({M}_{* }/{M}_{\odot })=8.5\mbox{--}11.0$ and 8.8–10.7, respectively.

Figure 13.

Figure 13. The implied [O/Fe] as a function of the iron abundance [Fe/H] (left panel) and the oxygen abundance [O/H] (right panel) obtained by eliminating M* from the M*–[Fe/H] and M*–[O/H] relations shown in Figure 12. The blue solid and black dashed lines indicate the inferred relations at z ∼ 2.2 and z ∼ 0, respectively. The mass ranges are limited to $\mathrm{log}({M}_{* }/{M}_{\odot })=8.5\mbox{--}11.0$ (z ∼ 0) and 8.8–10.7 (z ∼ 2.2), respectively. For the z ∼ 2.2 relations, the shaded regions indicate the 68% confidence limit of each relation. The squares indicate the values at M* = 1010 M at each redshift. The error bar corresponds to the 1σ error. Note that the error in [O/Fe] is correlated with that in the x-axis values; thus, the error bar is nearly vertical in the left panel while being tilted in the right panel because it is dominated by the error in [O/H], while the error in [Fe/H] is much smaller than the symbol size.

Standard image High-resolution image

Of course, this procedure of eliminating M* from the MZR fits may not produce the same [O/Fe] versus metallicity relations as would be obtained by considering individual galaxies, because it does not consider the scatter in the observed quantities at a given M*. In other words, this procedure is tantamount to assuming that galactic mass is the primary driver of the variations in the [Fe/H] and [O/H] within the observed sample. In the current study, we have no alternative to this procedure, because we relied on stacked spectra (in stellar mass bins) for our metallicity measurements.

The results show that these relations have negative slopes in both panels and at both redshifts. Interestingly, the low-redshift [O/Fe]–[Fe/H] (and [O/Fe]–[O/H]) relations in Figure 13 appear to broadly line up with the overall evolutionary vectors from z ∼ 2.2 to z ∼ 0. The high-redshift [O/Fe]–[Fe/H] (and [O/Fe]–[O/H]) relation is also consistent with having the same slope, but this slope is quite uncertain.

Uncertainties in the slopes of the input MZR can propagate to have a large effect on the resultant slopes of the inferred [O/Fe] versus metallicity relations. We constructed confidence intervals on these slopes by considering the range of possible linear relations that are obtained using random combinations of fits (within the uncertainties) to the input M*–[O/H] and M*–[Fe/H] relations. These are shown in Figure 13. Flat or even positive correlations between [O/Fe] and overall metallicities are evidently allowed by our available high-redshift data and analysis methods. We do not try to evaluate the range of possible slopes at low redshift because, while the statistical errors in the fits are very small, it is difficult for us to assess any systematic uncertainties in those taken from the independent studies.

Given the large uncertainties in the slopes of these relations, we focus instead on the [O/Fe] and [Fe/H] (or [O/H]) values at a single fiducial mass of M* = 1010 M (close to the middle of our high-z mass range) as being representative of each redshift. These representative values for M* = 1010 M at high and low redshifts are shown by the two blue and black squares in each of the panels of Figure 13.

5. Modeling of Iron and Oxygen Abundances

In this section, we explore the observed evolution in both [O/Fe] and [Fe/H] (and [O/H]) in the context of "flow-through" gas-regulated models of galaxies. The goal is to demonstrate that the simple chemical evolution model explains well all the observed changes in these quantities from z ∼ 2 to z ∼ 0 while assuming that the galaxies have followed the evolving main sequence through cosmic time. In particular, we will derive the evolutionary tracks in the [O/Fe]–[Fe/H] (and [O/H]) planes to show that all galaxies must follow limited evolutionary paths in these diagrams and the locations of the galaxies are determined almost entirely by sSFR alone.

All the metallicities, [Fe/H] and [O/H] (and thus [O/Fe]), refer to the instantaneous (gas-phase) values if not specified throughout the section.

5.1. Model Framework

In what follows, we adopt the gas-regulator model of Lilly et al. (2013) in which the SFR is instantaneously regulated by the mass of gas (Mgas) present in some reservoir, via the star formation efficiency (SFE = SFR/Mgas), and with a wind-driven mass loss that scales with the SFR via a "mass-loading" factor η. Mass conservation then straightforwardly gives (see Equation (9) of Lilly et al. 2013)

Equation (6)

where Φ is the mass inflow rate and r is the recycling factor (or called the return fraction), i.e., the fraction of mass that is formed into stars and then at later times returned to the ISM.

Obviously, once the two parameters, SFE and mass loading η, are specified (possibly as a function of mass and/or redshift), the gas accretion history (and gas content history Mgas(t)) of a given system follows completely from its SFH, since ${\dot{M}}_{\mathrm{gas}}$ will also be given by the change in SFR(t). Note that it is the changing gas reservoir that distinguishes this "gas-regulator" model from the "bathtub" models of Bouché et al. (2010) and Davé et al. (2012), in which ${\dot{M}}_{\mathrm{gas}}$ is set to be zero.

This means that, within the context of the gas-regulator model, the instantaneous metallicity of the gas reservoir will also be completely determined by the SFH once the (possibly mass- and/or epoch-dependent) SFE and η are specified, along with an assumption of the metallicity of the inflowing gas; we will here assume for simplicity that this is zero.

In the following, we consider the instantaneous gas-phase metallicity as the ratio of metal mass in the gas phase and gas mass at given time. We do not consider metal depletion onto dust grains, and thus the gas-phase metal mass represents all metals except those locked in surviving stars and those metals gone in the wind.

We can assume that the oxygen is produced only by CCSNe that occur "promptly" (with zero time delay) after the birth of the progenitor stars. The change of oxygen mass MO in the gas phase is therefore expressed as

Equation (7)

where ${y}_{{\rm{O}}}^{\mathrm{CC}}$ is the IMF-weighted oxygen yield defined as the oxygen mass synthesized and then returned into the ISM per unit mass formed, 24 ZO( = MO/Mgas) is the oxygen abundance, and ${Z}_{{\rm{O}}}^{\inf }$ is the oxygen abundance of the infalling gas.

We use Equation (7) to numerically track the chemical evolution. However, it is also useful to formalize the metallicity too. The change in ZO is then obtained by eliminating Φ using Equation (6) as

Equation (8)

The metallicity in an equilibrium condition is thus derived by setting dZO/dt to zero, i.e.,

Equation (9)

Lilly et al. (2013) showed that the timescale for driving ZO toward ${Z}_{{\rm{O}}}^{\mathrm{eq}}$ is shorter than the timescale on which the equilibrium conditions are varying. Assuming equilibrium is therefore a good approximation.

For iron, we must consider the substantial amount of iron that is produced by SNe Ia, which occur with some considerable delay after the birth of their stellar progenitors. In this analysis, we adopt the expression for the distribution of the SN Ia delay time, fIa(t), that was formalized by Greggio (2005) for a single stellar population (see also Greggio et al. 2008; Greggio 2010). We consider contributions from the single-degenerate and double-degenerate channels. Note that fIa(t) is normalized so that the time integration equals 1. Figure 14 shows the adopted delay time distribution (DTD), compared with a simple exponential parameterization used in Andrews et al. (2017) and Weinberg et al. (2017). The adopted one is more sensitive to SFR at earlier times in the past.

Figure 14.

Figure 14. The adopted SN Ia DTD adapted from Greggio (2005) is compared to the exponential DTD adopted in Andrews et al. (2017). Both are normalized so that the total number of events equals 1. The bottom panel shows the cumulative DTDs.

Standard image High-resolution image

Using fIa, the change of the gas-phase iron mass MFe is written as

Equation (10)

where ${y}_{\mathrm{Fe}}^{\mathrm{CC}}$ is the CCSN iron yield and ${Z}_{\mathrm{Fe}}^{\inf }$ is the iron abundance of the infalling gas. The last term denotes the contribution from SNe Ia, where ${y}_{\mathrm{Fe}}^{\mathrm{Ia}}$ is the time-integrated SN Ia yield of iron for unit mass formed. 25

The steady-state iron metallicity is then written as

Equation (11)

Assuming ${Z}_{{\rm{O}}}^{\inf }={Z}_{\mathrm{Fe}}^{\inf }=0$, Equations (9) and (11) yield

Equation (12)

Equation (13)

where the approximation holds when the SNe Ia dominate the iron production. This indicates that the [O/Fe] is approximately proportional to the number ratio of the CCSNe and SNe Ia at any time. In other words, as the denominator is some kind of average SFR in the past, the [O/Fe] would be tightly correlated with the sSFR. We will see this in Section 5.3.

The parameters in these equations are not very well constrained from observations. We therefore basically follow the "fiducial" choice of Andrews et al. (2017) and Weinberg et al. (2017): r = 0.4 and ${y}_{{\rm{O}}}^{\mathrm{CC}}=0.017$ (see also Vincenzo et al. 2016 for the IMF-weighted yield and return mass fraction). As discussed below (Section 5.2.2), we adjusted the values of the CCSN and SN Ia iron yields, taking ${y}_{\mathrm{Fe}}^{\mathrm{CC}}=0.00081$ (instead of 0.0012) and ${y}_{\mathrm{Fe}}^{\mathrm{Ia}}=0.0022$ (instead of 0.0017), together with the parameters determining the SFE and η. As noted above, for simplicity we set both ${Z}_{{\rm{O}}}^{\inf }$ and ${Z}_{\mathrm{Fe}}^{\inf }$ to zero.

5.2. Calculating Chemical Evolutionary Tracks

5.2.1. Choice of Star Formation Histories

Again for simplicity, we construct a representative set of SFHs by integrating the evolving main sequence of star-forming galaxies across cosmic time. We adopted the local M*–SFR relation (z ≈ 0.08) derived by Renzini & Peng (2015) and the redshift evolution at fixed M* as follows:

Equation (14)

for z ≤ 2.4 and

Equation (15)

for z > 2.4 (see, e.g., Lehnert et al. 2015; Tasca et al. 2015).

The inferred evolutionary tracks in the M*–SFR plane and SFHs are shown in Figure 15. In the former, we mark the values of our representative galaxies at z = 0.08 and z = 2.22. These simulated galaxies at z = 2.22 are in broad agreement with our sample galaxies, as shown by gray dots.

Figure 15.

Figure 15. Left panel: the evolving main sequence of star-forming galaxies shown by gray lines at z = 1, 2, 3, 4 and 5 from bottom to top. The curves show the evolutionary tracks of galaxies at given stellar masses in steps of 0.2 dex at z = 0, color-coded by redshift. The black diamonds and red circles mark the time steps of z = 2.22 and z = 0.08, respectively, for all shown tracks. Gray dots indicate our sample galaxies at 1.6 ≤ zspec ≤ 3.0. Right panel: the corresponding SFHs. Each line is color-coded by stellar mass at z = 0. The gray dots indicate SFR versus zspec for our sample galaxies.

Standard image High-resolution image

5.2.2. Parameters of the Regulator Systems and Iron Yields

In order to derive the time-dependent gas content of each model galaxy from their individual SFHs and thereby compute the corresponding chemical evolution, we need to define the two parameters of the gas-regulator system, the mass-loading factor η and SFE. We assume that these values scale with the instantaneous M* of the system, but that they are not redshift dependent, i.e.,

Equation (16)

Equation (17)

Here we consider the minimum value for η at high masses (assuming a negative a) for better representation of the saturation feature of the local MZR at the high-mass end. Although this is an arbitrary treatment, the bending in the average η toward high masses is seen owing to effects of AGNs in simulations (Nelson et al. 2019).

Now Equations (7) and (10) can be used to compute the evolution with time of [O/H] and [Fe/H] for any arbitrary SFR(t), i.e., for each of the representative SFH identified above. However, using the fiducial iron yields given by Andrews et al. (2017) and Weinberg et al. (2017) gives an [Fe/H] MZR that is slightly higher at z ∼ 2.2 but lower locally than those observed at each redshift shown in Figure 12. We also notice that the value of ${y}_{\mathrm{Fe}}^{\mathrm{CC}}=0.0012$ adopted in the above papers gives [O/Fe] (=0.5 with ${y}_{{\rm{O}}}^{\mathrm{CC}}=0.017$) from pure CCSNe that is smaller than the values seen in the literature (∼0.6; e.g., Nomoto et al. 2006). We therefore allowed the CCSN and SN Ia iron yields to vary in order to better reproduce the observed MZRs.

We used an MCMC algorithm to determine values of the six parameters to be $({\eta }_{10},a,{\eta }_{\min },{\mathrm{SFE}}_{10}({\mathrm{Gyr}}^{-1}),b,{y}_{\mathrm{Fe}}^{\mathrm{CC}},{y}_{\mathrm{Fe}}^{\mathrm{Ia}})$ =(2.58, −0.267, 2.34, 0.310, 0.043, 0.00083, 0.00215) that, using our representative SFHs, well reproduce the M*–[O/H] and M*–[Fe/H] relations at z ∼ 0 and z ∼ 2.2 shown in Figure 11.

We note that these values are not far from those suggested from observations and/or simulations. The mass-loading factor and its scaling are similar to what is found in Muratov et al. (2015, see their Equation (8)). The typical (total gas) depletion timescales (tdep = 1/SFE) of several Gyr have been observed (Bigiel et al. 2008) and reproduced in simulations (Semenov et al. 2017). From Equations (14) and (17), we obtain ${M}_{\mathrm{gas}}\propto {M}_{* }^{0.72}$, which is in broad agreement with the scaling relation between the molecular gas mass and stellar mass ($\sim {M}_{* }^{0.59};$ Tacconi et al. 2020). The corrections of the iron yields are also small ( ∼ 30%) with respect to the fiducial values.

5.2.3. Calculated Chemical Evolutionary Tracks

In Figure 16, we show the chemical evolution tracks for the input SFHs in the iron and oxygen MZR diagrams. The individual lines correspond to each of the SFHs that were shown in Figure 15. As there, the positions of the galaxies at z = 2.22 and at z = 0.08 are marked (again, black diamonds and red circles, respectively). At z = 2.22 we limit those to having M* higher than ≈ 108.8 M, which is the lower mass limit of our high-z sample.

Figure 16.

Figure 16. Left panel: the calculated evolutionary tracks in the M*–[O/H] diagram in comparison with the observations; Curti et al. (2020) for z ∼ 0 (black dashed line) and Sanders et al. (2020) for z ∼ 2.2 (blue solid line; shifted by +0.1 dex as mentioned in Section 4.3). The color-coding is according to the redshift. Black diamonds mark those with M* ≳ 108.8 M at z = 2.22, and red circles mark those at z = 0.08. Right panel: same as the left panel, but in the M*–[Fe/H] diagram. The observed relations come from Zahid et al. (2017) at z ∼ 0 (shifted by −0.1 dex as mentioned in Section 4.3) and our result (Equation (4)).

Standard image High-resolution image

This figure illustrates how the models well reproduce the observed MZRs in both oxygen and iron, including their slopes, at both z ∼ 0 and z ∼ 2.2. This was, of course, to be expected since both the SFE(M*) and η(M*) functions of the regulator model and the iron yields have been adjusted so as to match these overall MZRs at both z ∼ 2 and z ∼ 0, although we stress that the adopted parameters are, in our view, completely reasonable.

Rather, our interest in constructing these models is in examining and understanding the expected relations between the overall metallicity and the α-enhancement at these two redshifts, especially in comparison with the evolutionary offset between them. We address this in the following sections of the paper.

5.3. The Expected [O/Fe]–Metallicity Relation

In Figure 17, we show the α-enhancement versus overall metallicity of these same models, i.e., [O/Fe] as a function of either [Fe/H] (left panel) or [O/H] (right panel). As in Figure 16, the model galaxies with M* ≳ 109 M are marked at z = 2.2 and at z = 0 in order to compare with the observations (as in Figure 13). Both the evolutionary tracks in this diagram and the variations within the population at fixed epoch (locus of black and red points) may be read from this diagram.

Figure 17.

Figure 17. The calculated evolutionary tracks in the [Fe/H]–[O/Fe] (left) and [O/H]–[O/Fe] (right) diagrams in comparison with the observations (same as Figure 13). Visualization of the evolutionary tracks is the same as in Figure 16. The horizontal dotted line marks [O/Fe] = 0 (i.e., the solar value) in both panels.

Standard image High-resolution image

It can be seen that the modeled evolutionary tracks are in good agreement with the overall change in both metallicity and [O/Fe] between z ∼ 2.2 and the present epoch. This is not surprising because the model parameters were tuned (within reasonable ranges) to fit the oxygen and iron MZRs, and thus implicitly the [O/Fe], at both redshifts.

The models present variations across mass that show a negative slope between these quantities. The observed slopes are similar to what are observed at these redshifts. However, as mentioned in Section 4.3, large uncertainties in the slopes do not enable us to make any robust statement. We thus focus on the representative [O/Fe] and [Fe/H] (or [O/H]) values at our single fiducial mass of M* = 1010 M at each redshift (squares in Figure 17).

It is noticeable that, at least for the range of representative SFHs considered, the model tracks all follow a relatively narrow path in the [O/Fe]–[Fe/H] plane. The tightness of this path is enhanced because, at a given epoch, the loci of (model) galaxies of different masses are close to being parallel to the individual evolutionary tracks.

We now turn to investigating whether the observed offset between z ∼ 0 and z ∼ 2.2 (i.e., the "evolution vector") in this diagram can be explained by the change with redshift in some other parameter, and whether that parameter, if present, might also be responsible for producing the variation within the population at a single epoch. While the [O/Fe] (or generally α/Fe) is often loosely considered as being indicative of the "age" of a stellar system, because of the time delay in producing much of the iron via SNe Ia, it should really reflect, especially in a continuous "flow-through" scenario, the sSFR. The sSFR reflects the ratio of the current SFR to some average SFR in the past. This ratio will thus determine the relative number of CCSNe and SNe Ia at any point in time, which will then determine the "instantaneous" α-enhancement of the gas (see Equation (13)).

To explore this, we show in Figure 18 the same evolutionary tracks (as in Figure 17) but now color-coded by the instantaneous sSFR (top panels) and M* (bottom panels) of the model galaxy. Now the different dependences of the evolutionary tracks on these quantities are quite obvious: it is clear that the [O/Fe] is very tightly correlated with the sSFR across the whole range of these evolutionary tracks and is largely unaffected by either the metallicity or the mass of the system. This is indicated by the striking horizontal banding of the sSFR-coded colors in the top panels. In contrast, the inclined M*-coded color banding in the bottom panels shows that the metallicity (whether [O/H] or [Fe/H]) depends on both the stellar mass and [O/Fe]. In other words, using the tight relation between [O/Fe] and sSFR, the metallicity depends on the mass and sSFR together.

Figure 18.

Figure 18. Same as Figure 17, but the evolutionary tracks are here color-coded by the instantaneous sSFR (top panels) and stellar mass (bottom panels). We omit all the symbols except the representative measurements at two redshifts.

Standard image High-resolution image

Figure 19 shows the same data in a different way, demonstrating that, at least in the gas-regulated models and representative SFHs considered in this paper, there is remarkably little scatter between [O/Fe] and the instantaneous sSFR of the model galaxies. The [O/Fe] scales as ≈ sSFR0.34 at $\mathrm{log}(\mathrm{sSFR}\,[{\mathrm{Gyr}}^{-1}])\lt 0.5$.

Figure 19.

Figure 19. [O/Fe] as a function of sSFR for all the model tracks. The squares indicate the representative measurements at M* = 1010 M at the two redshifts. Visualization of the evolutionary tracks is the same as in Figure 16.

Standard image High-resolution image

These results support the idea that the tight relation between [O/Fe] and sSFR is quite fundamental, being almost independent of the epoch, the shape of the SFHs, and thus, in our modeling, the present-day mass and thus the values of SFE and η.

Lastly, we compare these results to the concept of the so-called "fundamental metallicity relation" (FMR; e.g., Mannucci et al. 2010; Lara-López et al. 2010; see also Andrews & Martini 2013; Lara-López et al. 2013). The FMR established that the SFR appears to be a second parameter in the gas-phase [O/H] MZR at low redshift, in the sense that higher-SFR galaxies at a given mass have lower oxygen metallicities. Further, it was then shown that the observed evolutionary change in [O/H] to high redshift (at a given mass) was the same as that obtained by simply extrapolating the trend with SFR established at z ∼ 0 to the much higher SFR seen at high redshift.

An important insight into the FMR came from the introduction of the gas-regulated model of galaxies (Lilly et al. 2013). Assuming a quasi-equilibrium state, the (gas-phase) metallicity of the galaxy is set "instantaneously" by two considerations. The first is the specific rate at which the system is being fed by gas (dMgas/dt/Mgas), which itself can be inferred from the sSFR. The second is the values of the two regulator parameters: the SFE and mass loading η. The latter parameters determine the gas content of the system that is necessary to achieve the required sSFR (see Lilly et al. 2013 for details and discussion). The evolving sSFR of main-sequence galaxies can be viewed in this framework as the primary driver of the observed redshift evolution of the gas metallicity, but the positive MZR at a given epoch is the consequence of the change in sSFR along the main sequence, together with the mass dependences of the SFE and mass loading η.

It is important to clarify that the dependence of [O/H] on the mass-dependent SFE and η (see Lilly et al. 2013) contrasts with the fact that [O/Fe] is determined almost entirely by the sSFR alone, acting as a proxy of the number ratio of CCSNe and SNe Ia at a given time. The variations in SFE and η (with mass) cause the variations in O/H at fixed sSFR (thus at fixed O/Fe) and thus the range in the evolutionary tracks in the [O/Fe]–metallicity diagrams (Figure 17). Note that, in the context of the gas-regulator model, if SFE and η are constant, the gas-phase metallicity is also determined by sSFR alone. The [O/Fe]–metallicity tracks in Figure 17 will thus be completely independent of SFHs, having no scatter, while, at a single epoch, the values of [O/Fe] and metallicities vary along this single path if the sSFR changes with mass along the main sequence, or a population has a scatter in SFR at fixed mass as expected in the real universe.

6. Comparison with Galactic Stars

An important and revealing comparison may be made between our high-redshift galaxies and those of individual Galactic stars. Are we seeing at high redshift the formation of stars of the same type as seen today in the Galaxy? This enables a direct confrontation between "Galactic archeology" and observations of galaxies at high redshifts.

6.1. Thick-disk Stars and High-z Galaxies

Figure 20 shows data for solar neighbor FGK stars in the Milky Way (MW) adapted from Ramírez et al. (2013) and Amarsi et al. (2019). These stars are separated into the thin-disk, thick-disk, and halo populations. It is known that stars in the so-called "thick disk" are typically older, have lower metallicity, and are α-enhanced (i.e., have higher [O/Fe]) compared to stars in the "thin disk." It is indeed clear that thin- and thick-disk stars occupy on average different regions in the [O/Fe]–metallicity plane. Halo stars present large scatters in both metallicity and α-enhancement but mostly have lower Z and higher α than the thin-disk stars.

Figure 20.

Figure 20. The observed data for z ∼ 2 and z ∼ 0 galaxies (same as in Figure 17) are compared to the data for the individual Galactic stars (Ramírez et al. 2013; Amarsi et al. 2019) in the [O/Fe]–metallicity diagrams. The stars are separated into thin-disk stars (dark-blue diamonds), thick-disk stars (red circles), and halo stars (green triangles). The gray curves show the model evolutionary tracks that are shown in Figure 17.

Standard image High-resolution image

Galactic stars, especially the thick-disk stars, clearly lie along the locus extending between the two M* ∼ 1010 M squares in Figure 20. In particular, it can be seen that our best-estimate locations of M* = 1010 M galaxies at z ∼ 2 in the two [O/Fe]–metallicity diagrams lie at the upper tail of the sequence of thick-disk stars in our own Galaxy.

Figure 20 also shows that the simple models constructed in Section 5 successfully reproduce the locus of these Galactic stars. This gives added confidence that they are reasonable.

Martig et al. (2016) found the median ages of the thick-disk population of stars to be ≈9–5 Gyr, decreasing from the inner to the outer disk. Kilic et al. (2017) also estimated the average age of the thick-disk stars to be ≈ 9 Gyr. It is thus quite plausible that those Galactic thick-disk stars located around our high-z measurement (i.e., the relatively metal-poor α-enhanced stars) formed around z ∼ 2–3, i.e., some 10 Gyr ago and some 3.5 Gyr after the big bang. We have already discussed how, in terms of the simple flow-through scenario of chemical evolution discussed in Section 5, it is natural to get these high α-enhancements ∼10 Gyr ago. Note that, in our modeling, present-day galaxies with the same mass as the Galaxy (6.4 × 1010 M; McMillan 2011) had a mass around this fiducial value of M* = 1010 M at z ∼ 2.

The evident agreement of [O/Fe] and [Fe/H] (or of [O/Fe] and [O/H]) between the old stars in the Galactic thick disk and the high-redshift galaxies with masses comparable to those expected for MW progenitors provides, in our opinion, a beautifully direct link between the results of Galactic archeology and observations of galaxies in the high-z universe that has long been assumed but rarely established directly (see Cullen et al. 2021 for a similar conclusion).

6.2. The α-dichotomy and Old, Low-α Stars in the MW Bulge

In the subsection above, we found a general agreement between the chemical abundances seen in high-z (z ∼ 2) galaxies and the MW thick-disk population, plus the estimated ages of the latter, and it supports the idea that, when observing z ∼ 2 galaxies, we are witnessing the formation of stars that will constitute the thick-disk population in MW-like galaxies. However, as we will see below, some caveats and questions are posed when a detailed comparison is made with bulge stars in the Galaxy.

Recently, Queiroz et al. (2020) have presented a detailed study of chemical abundances from APOGEE data (Abolfathi et al. 2018) combined with stellar distances from Gaia (Gaia Collaboration et al. 2018). They constructed detailed distributions in [α/Fe] versus [Fe/H] plots at several radial distances from the center of the Galaxy and at three vertical distances from the Galactic plane (see their Figure 6). Note that they measure [α/Fe] using different multiple α-elements, and the comparison to [O/Fe] probably needs some offset (order 0.1 dex; McWilliam et al. 2008; Jönsson et al. 2020). They showed that there are basically two disjoint populations in the [α/Fe]–[Fe/H] plot producing, at each spatial location, a bimodal distribution of stars (the so-called α-dichotomy). Broadly speaking, these two distinct populations comprise a metal-rich (solar or more), low-α (close to solar) component and a lower-metallicity, high-α component. Globally, ∼50% of bulge stars belong to the metal-rich component (Zoccali et al. 2017).

In the bulge region (R < 2 kpc), the supersolar metal-rich component with [α/Fe] spread around zero dominates. Moving radially outward in the plane of the disk, the metal-rich component shifts slightly to lower (i.e., subsolar) metallicities without significant change in [α/Fe], while the metal-poor, high-α component progressively vanishes. Instead, moving vertically away from the plane, it is the metal-rich component that tends to vanish, leaving the metal-poor component dominant, without much change in its abundances. This variation in the spatial manifestation of the bimodality may therefore be related to the "thin" and "thick" disk components, at least at intermediate radii. However, the dominant metal-rich component in the central bulge region challenges this simple picture and poses questions for the beautiful concordance between z ∼ 2 galaxies and the thick-disk stars shown in Section 6.1.

It is clear that we find no evidence in our zCOSMOS-deep sample at z ∼ 2.2 for star formation with abundances comparable to the metal-rich low-α component. Some studies, however, have indicated that this metal-rich component is dominated by stars of age ∼10 Gyr or older, as demonstrated by multiband Hubble Space Telescope photometry extending from the near-UV to the near-IR (Brown et al. 2009; Renzini et al. 2018). 26 Based on these indications, this component seems to have formed at z ∼ 2. The obvious question is, why are such abundances not seen in our z ∼ 2 zCOSMOS-deep galaxies? If we suppose that MW progenitor(s) should indeed have been found in our sample of main-sequence star-forming galaxies at z ∼ 2.2, then there are some possible explanations for this observational discrepancy.

The first possibility is that the central metal-rich component in the MW had not in fact formed by z ∼ 2, i.e., that the ages of these stars have been overestimated. This possibility would be consistent with our observational results and also with our models, since these suggest that galaxies with SFHs that follow the evolution of the main sequence cannot have [α/Fe] ∼ 0 at z ∼ 2. The main problem with this first possibility is the observational evidence for old ages for these metal-rich low-α stars. One way out could come from assuming that the central regions of the MW are unusual and not representative of the general galaxy population seen in the high-redshift universe.

A second possibility is that metal-rich stars are indeed forming at z ∼ 2, as indicated by their ages, but that these metal-rich stars will not have contributed to the rest-frame FUV spectra on account of being heavily obscured by dust, which may be common in a supersolar-metallicity environment.

A second possibility is that metal-rich stars are indeed forming at z ∼ 2, as indicated by their ages, but that these metal-rich stars have contributed to the rest-frame FUV spectra on account of being heavily obscured by dust, which may be common in a supersolar-metallicity environment. The global dust obscuration in the FUV estimated from SED fitting is on average AFUV ≈ 1.4 at M* = 1010 M, increasing with M* from ∼0 ( ∼ 109 M) to ∼3 ( ∼ 1011 M). This means that on average 70%–80% of the light from SFR in the galaxies at M* = 1010 M has been absorbed. If there is a large variation in dust obscuration across the galaxy, increasing toward regions that are more metal-rich and solar-α, the light from this SFR may have been preferentially, and possibly completely, removed from the observed FUV spectra.

We can, however, test the hypothesis of substantial heavily obscured star formation making almost no contribution to the FUV light by examining the far-infrared emission of our sample using the public super-deblended catalog (Jin et al. 2018). Only 54 sources out of the entire sample of 1336 galaxies have FIR luminosity detected at > 3σ. Even for these FIR-selected galaxies the nominal FIR-based SFRs (median ∼ 140 M yr−1; estimated in the same way as in Kashino et al. 2021) are only 50% higher than the SED-based SFR estimates ( ∼ 100 M yr−1). There is therefore no compelling evidence with the available data of substantial "hidden" SFR across the entire sample.

The third possibility is that the formation of the metal-rich component was completed well before z ∼ 2, and that such stars are present in the zCOSMOS-deep galaxies but already old enough so as not to contribute to the ultraviolet light. This may be partly supported by observations that have revealed metal-rich (≳solar) stars in quiescent galaxies at z ∼ 2 (Estrada-Carpenter et al. 2019; Morishita et al. 2019). However, such a scenario seems implausible to us for several reasons. Pushing star formation to earlier epochs will clearly tend to increase the α-enhancement, not decrease it, exacerbating the problems identified in the previous paragraph. Not least, the stars would have had to form in a still shorter time at even higher sSFR. Indeed, Onodera et al. (2015) found an enhanced average [α/Fe] ( ≈ 0.3), along with supersolar [Z/H] ( ≈ 0.24), for quenched galaxies at z ∼ 1.6.

Lastly, we should also consider a bias in the selection of our sample, namely, that galaxies containing metal-rich star formation may be not represented in our blue-selected (BAB ≲ 25) zCOSMOS-deep sample, which excludes objects with low FUV continuum. It is naturally expected that galaxies either dominated by metal-rich star formation or having experienced starbursts are dust rich, and thus their rest-frame UV emission may well disappear below the selection limit. A highly complete study, including UV faint sources, is desired to gain a more robust conclusion.

We return to emphasize the basic difficulties (required in both the second and third possibilities discussed above) of achieving low (solar) α-enhancement at z ∼ 2, at least using our fiducial SN Ia DTD that provides the satisfactory match presented in Section 5. We showed in Section 5.3 that the SFHs that are obtained by integrating the main sequence simply do not yield gas abundances with low α-enhancements at these redshifts. This is ultimately because of the high sSFR of the main sequence at z ∼ 2. Except in extremely contrived scenarios, low α-enhancement stars must be formed at low sSFR. If a significant mass of stars is to be formed, then this star formation must be maintained over an extended period of time, i.e., over timescales of order sSFR−1. This is naturally achieved at late epochs in the universe, but it is hard to see how this can happen at much earlier times. Quite independent of our own observational results, this conundrum represents a basic puzzle about the origin of the low α-enhancement metal-rich central population identified in the central kiloparsec of our Galaxy by Queiroz et al. (2020) if these stars are truly over 10 Gyr old rather than ≲ 4 Gyr old (i.e., formed after z ∼ 0.5) as indicated by our model (Figure 17).

It is clear that our models, which employ representative smooth SFHs, produce one continuous sequence of stars in the α–metallicity plane, rather than the two distinct components as observed (Queiroz et al. 2020). This suggests that the construction of the MW was a more complex phenomenon. Recently, an SFH with an extended gap ( ∼ 6 Gyr) between the early formation of the metal-poor high-α and the metal-rich low-α components has been proposed to reproduce the α-dichotomy of the MW stars (Lian et al. 2020; Queiroz et al. 2021). As star formation of the metal-poor component quenched, the ISM was enriched only in Fe by ongoing production of SNe Ia on a ∼ 1 Gyr timescale, and the metal-rich, low-α populations were produced from the low-α gas when star formation resumed after the gap. In this scenario, the MW would have been in the gap at z ∼ 2–3. But this is manifestly not the case for the majority of our sample galaxies, as they actually have high sSFRs. This would suggest that the α-dichotomy may be a characteristic of the MW rather than a common feature, though some cases of "rejuvenation" after a hiatus in star formation have been proposed (Mancini et al. 2019). We should also mention another work by Clarke et al. (2019), who proposed clumpy star formation led by disc fragmentation as the origin of high-α populations, and claimed that the α-dichotomy is common in massive galaxies as high-z galaxies commonly show clumpy morphologies.

Lastly, there is another possibility to be mentioned, namely, that the SN Ia DTD may shift to shorter delays with increasing metallicity, in particular in the supersolar regime. For example, in the double-degenerate scenario for SN Ia progenitors, the event results from the merging of two white dwarfs (WD) as they spiral in owing to gravitational wave radiation. Thus, the DTD is controlled by the distribution of the binary WD separations as they emerge from their last common envelope event. As the delay time scales as the fourth power of such separation (e.g., Greggio 2005), a ∼30% reduction in the distribution of WD separations would result in a shift by a factor of ∼5 in the DTD, e.g., moving the median delay time from, say, ∼1 Gyr to ∼200 Myr. The amount of orbital shrinkage during common envelope events is notoriously hard to predict; nevertheless, we note that stellar sizes are larger at high metallicity, and hence stars fill their Roche lobe at an earlier evolutionary phase compared to the case at lower metallicity. An effect of metallicity on the WD orbital separations and on the resulting DTD is therefore to be expected, though hard to quantitatively predict, and the few attempts in this direction remain inconclusive (Meng et al. 2011; Meng & Yang 2012; Kistler et al. 2013). The old ages, high metallicity, and low [α/Fe] of bulge stars could then be more easily reconciled if the DTD were to move to substantially shorter delays in the supersolar regime.

It is clear to us that integrated light spectroscopy of high-redshift galaxies, such as presented in this paper, can highlight the problem, but it is unlikely to enable us to understand the complexities revealed by individual star abundances across the body of the MW. It is possible that high-resolution imaging and integrated field spectroscopy with the forthcoming JWST instruments will finally give clues to the solution of this puzzle.

7. Summary

We have measured the stellar metallicities of galaxies for 1336 star-forming galaxies at 1.6 ≤ z ≤ 3.0 using high-S/N stacked low-resolution (R ∼ 200) rest-frame FUV spectra from the zCOSMOS-deep survey. The metallicities were estimated using fits of high-resolution model spectra constructed from stellar population synthesis models across a range of the stellar metallicity. These metallicity estimates enabled us to construct the relationship between stellar mass and stellar metallicity, i.e., the stellar MZR at z ∼ 2.2.

The measured stellar metallicities, which mostly reflect the iron abundance, range between $-1.3\lesssim \mathrm{log}{Z}_{* }/{Z}_{\odot }\lesssim -0.4$ across the stellar mass range 109M*/M ≲ 1011. Because they are based on the spectra of short-lived massive stars, we argue that this iron abundance should be representative of the gas phase in these galaxies. Our measurements are consistent with the one previous work on similarly high-z galaxies (Cullen et al. 2019).

A clear positive correlation between stellar mass and stellar metallicity is established, in which the metallicity scales as $\mathrm{log}({Z}_{* }/{Z}_{\odot })=-0.81+0.32\mathrm{log}({M}_{* }/{10}^{10}{M}_{\odot })$ (Equation (4)). Comparing with iron metallicity data at z ∼ 0 from the literature, we find that the z ∼ 2.2 stellar MZR is offset by ∼0.8 dex below the local relation.

Adding published [O/H] measurements at both high and low redshifts, we implied [O/Fe] ratios (i.e., α-enhancement) to be ≈ 0.47 ± 0.12 at z ∼ 2 for M* ∼ 1010 M galaxies. This is considerably enhanced against the local value of [O/Fe] ∼ 0 and, in fact, approaches the [O/Fe] ∼ 0.6 limit imposed by the yields of CCSNe. This indicates that SNe Ia have not yet contributed very much to the iron supply at these epochs.

These results are then compared with the expectations of "flow-through" gas-regulator models, especially in the context of evolutionary tracks in the [O/Fe]–metallicity plane. In constructing these, it is assumed that the SFHs follow those implied by the evolving main sequence of star-forming galaxies. Adjusting the regulator parameters and iron yields within very reasonable ranges, it is found that the models can reproduce the evolution of [O/Fe] and [Fe/H] (or [O/H]) from z ∼ 2 to z ∼ 0. The models predict that galaxies at M* ∼ 109–1011 locally should lie on a relatively narrow locus in the [O/Fe]–[Fe/H] (or [O/H]) plane if they have been continuously forming stars and following the main sequence in the past.

An important insight obtained from this modeling is that the instantaneous (gas-phase) α-enhancement is determined almost entirely by the instantaneous sSFR of the galaxy. This is because the sSFR is a good proxy of the instantaneous number ratio of CCSNe to SNe Ia, which is what effectively determines the gas-phase α-enhancement in flow-through models. The variations in [O/Fe] among a galaxy population at a single epoch arise as a result of the mass dependence of the main-sequence sSFR, while the variations in the evolutionary tracks in the [O/Fe]–metallicity planes arise as a result of the dependence of the regulator parameters (SFE and mass loading η) on the stellar mass.

We found that z ∼ 2 galaxies at a representative mass of 1010 M have similar α-enhancement and metallicity to the low-metallicity thick-disk stars in our own Galaxy. These Galactic stars were probably formed around 10 Gyr ago, when the MW presumably had a stellar mass of around this same value, 1010 M. This observation therefore provides an unusually direct concordance between the results of Galactic archaeology and observations of presumed MW progenitors at high redshift.

There remains, however, an open question about the formation of the population of old metal-rich stars seen in the MW bulge with low (roughly solar) α-enhancement. Our rest-frame FUV data at z ∼ 2 show no evidence of such high metallicities and low-α enhancements. Three possible explanations of this observational discrepancy have been considered: (i) These central Galactic stars may not be as old as so far estimated with a variety of methods, and formed well after z ∼ 2. Besides being in contrast with current age estimates, this option would predict the existence of supersolar star-forming galaxies at lower redshifts, which have not been observed so far. (ii) They may indeed be forming at z ∼ 2 but doing so in highly obscured environments as one may expect in a supersolar-metallicity regime. (iii) They may have already formed well before z ∼ 2. We discussed how both the second and third possibilities are problematic in terms of achieving low α-enhancements by z ∼ 2 and, as a possible solution, the possibility that the SN Ia delay time distribution shifts to substantially shorter delays in the supersolar-metallicity regime.

Spatially resolved imaging and spectroscopy, which will be enabled by JWST, may be able to constrain the formation scenarios of the possible metal-rich bulges and the detailed process of chemical evolution through cosmic history. Furthermore, the forthcoming multiobject spectrographs, i.e., Subaru/PFS and VLT/MOONS, will enable less biased studies with highly complete samples, including UV faint sources that were excluded in this study.

This research is based on observations undertaken at the European Southern Observatory (ESO) Very Large Telescope (VLT) under the Large Program 175.A-0839 and has been supported by the Swiss National Science Foundation (SNF) and JSPS KAKENHI grant No. JP21K13956. This work made use of v2.2.1 of the Binary Population and Spectral Synthesis (BPASS) models as described in Eldridge et al. (2017) and Stanway & Eldridge (2018). Y.P. acknowledges National Science Foundation of China (NSFC) grant Nos. 12125301, 11773001, and 11991052.

Appendix A: Offset between Gas-phase and Stellar Fe/H

It is straightforward to calculate mass-weighted stellar metallicities, ${Z}_{\mathrm{Fe},* }^{M}$, once SFR(t) and (gas-phase) ZFe(t) are specified:

Equation (A1)

To calculate luminosity-weighted stellar metallicities, we additionally need a library of the spectra for a single stellar population as a function of age. We adopted the same BPASSv2.2.1 template spectra as in the main analysis and calculated the stellar metallicity weighted by the average luminosity around 1500 Å (FUV) and 5500 Å (optical):

Equation (A2)

where ${L}_{\lambda }^{\mathrm{SSP}}(t)$ (λ = 1500 or 5500 Å) is the luminosity density at a particular wavelength of a single stellar population of age t. Note that, in this equation, we ignore the effects of possible differential dust attenuation between the younger and longer-lived stellar components.

We calculated Equations (A1)–(A2) for the same evolutionary tracks obtained in Section 5. Figure 21 shows the ratios of the stellar [Fe/H], weighted by either mass, L1500 (FUV), or L5500 (optical), to the instantaneous gas-phase [Fe/H] as a function of cosmic time. The different evolutionary tracks correspond to the different SFHs and are color-coded by the present-day stellar mass. The two redshifts z = 0.08 and 2.2 are marked.

Figure 21.

Figure 21. The stellar-to-gas iron metallicity ratio as a function of cosmic time for representative SFHs adopted in Section 5. The stellar Z*,Fe values are inferred weighting by stellar mass, FUV luminosity, or optical luminosity, as labeled. The evolutionary tracks for the individual SFHs are color-coded by the present-day stellar mass. Two redshifts, z = 2.2 of our high-z sample and z = 0.08 of the local sample, are marked by vertical dotted lines. The horizontal dotted line marks Z*,Fe = Zgas,Fe.

Standard image High-resolution image

As expected, the L1500-weighted [Fe/H] is similar to the gas-phase value within ∼0.02 dex at z = 2.2, and almost equivalent at z = 0 when the metallicity change is slow. In contrast, the mass-weighted and L5500-weighted [Fe/H] values show substantial offsets and some scatters for different SFHs. At z = 0.08, the offsets in the L5500-weighted values are in a range of − (0.08–1.5) dex, being larger (in negative) for larger masses, i.e., lower sSFR. We adopted −0.1 dex, the value for M*(z ∼ 0) = 1010 M, as a representative value for correcting the local stellar [Fe/H] from Zahid et al. (2017), so that they reflect better instantaneous values for comparison with our FUV-based [Fe/H] and the model values in Section 4.3 and later.

Footnotes

  • 15  

    The definition of the quality flags follows Lilly et al. (2007, see Section 4.3). The evaluation of the reliability will be detailed in S. J. Lilly et al. (in preparation).

  • 16  

    This threshold corresponds to ≈ 3 × the standard deviation of (zphotzspec)/(1 + zspec) and thus is meant to include all sources whose photometric redshift is broadly consistent with the spectroscopic redshift while excluding catastrophic failures.

  • 17  

    The composite spectrum and those in mass bins are available in electronic form from https://github.com/kashinod/zCOSMOS-deep/tree/main/Kashino_et_al_2021b.

  • 18  

    We tried an alternative, more physically motivated dust model assuming an attenuation curve and obtained constraints on Z* and σsmooth that are equivalent to the fiducial results. However, the degeneracy between the parameters, e.g., overall scaling, overall dust attenuation, and the slope of an attenuation curve, causes poorer convergence in the MCMC chains, and therefore we adopted the fifth-order polynomial for our fiducial analysis.

  • 19  

    Note that the wavelength sampling is 1 Å pixel−1.

  • 20  

    This figure was created using the Python module corner.py (Foreman-Mackey 2016).

  • 21  

    The resolution R = 200 in FWHM at the center of the spectral window (∼5150 Å) corresponds to σ ≈ 3.4 Å in the rest frame for the median redshift of 2.2.

  • 22  

    Here we adopted Z = 0.0142, while Rix et al. (2004) assumed Z = 0.020.

  • 23  

    This value is converted to Z = 0.0142.

  • 24  

    This definition of the yield is different from that in some other literature where the yield is denoted in units of mass that is locked up into long-lived stars and remnants. The difference between these definitions is thus a factor of (1 − r).

  • 25  

    The SN Ia yield ${y}_{\mathrm{Fe}}^{\mathrm{Ia}}$ is usually expressed as the product ${K}_{\mathrm{Fe}}^{\mathrm{Ia}}{R}_{0}$, where R0 ($\sim {10}^{-3}\,{M}_{\odot }^{-1};$ Greggio et al. 2008; Maoz & Mannucci 2012) is the time-integrated number of SNe Ia per unit stellar mass formed and ${K}_{\mathrm{Fe}}^{\mathrm{Ia}}$ is the average mass of iron from an individual SN Ia.

  • 26  

    In this respect we note that Bensby et al. (2017) derive a broad age distribution for the supersolar low-α stars in the bulge, but still a fraction of them are given ages older than 8–10 Gyr.

Please wait… references are loading.
10.3847/1538-4357/ac399e