Introduction

Soil respiration or soil CO2 efflux has been studied for over 80 years (Lundegardh 1927). However, our inability to understand the mechanisms driving this flux limits our capacity to predict future atmospheric CO2 concentrations under different climate and global change scenarios. Complications arise because of multiple controlling mechanisms that interact over several temporal scales (hours to millennia), and depend on complex biological (i.e., plant and microbial) and physical (e.g., diffusion, mass transport, photodegradation) factors (Davidson and Janssens 2006; Heimann and Reichstein 2008; Trumbore 2006). In principle, soil respiration is defined as the combination of two biological sources, autotrophic respiration by plant roots and associated microorganisms (i.e., rhizosphere respiration), and heterotrophic respiration via microbial decomposition of soil organic matter (Hanson et al. 2000; Högberg and Read 2006; Ryan and Law 2005). In this manuscript, we will concentrate only on these biological components, however, in some ecosystems, carbonate weathering (Morner and Etiope 2002) or photodegradation (Austin and Vivanco 2006) can contribute significantly to the soil CO2 efflux.

Analyzing and understanding the dynamics of soil respiration demand different levels of complexity depending on the spatial and temporal scale of interest. One level of analysis is at the spatial scale of a few meters and at the temporal scale of the growing season where few measurements in space and time may be enough to describe the seasonal pattern in soil respiration. Another level of analysis involves the links between the soil–plant-atmosphere continuum at short (e.g. daily) and medium (e.g. seasons-years) temporal scales where a combination of techniques and multiple temporal and spatial observations are required. A third, and more general level of analysis is when soil respiration is studied at large spatial scales (e.g. regional-continental) where emerging properties on the soil–plant-atmosphere continuum are influenced by long-term processes (e.g. years-millennia) of climate patterns, vegetation type and carbon (C) stored in soils. This level is important for long-term sources and sinks of soil C and is sensitive to climate change (Davidson and Janssens 2006). For our purposes, we will focus on the soil–plant-atmosphere continuum at short and medium temporal scales because most of the CO2 efflux comes from faster cycling C sources (Högberg and Read 2006). Thus, we do not directly address issues of long-term dynamics of soil C as they pertain to the third level of complexity. However, we recognize that it is important to understand all levels and a combination of multiple methods/techniques are needed to properly address short and long-term terrestrial ecosystem C dynamics.

Difficulties in observing and quantifying belowground processes have been a major challenge towards understanding soil respiration, and several reviews have addressed soil respiration methodological approaches within the last decade (Hanson et al. 2000; Kuzyakov 2006; Raich and Schlesinger 1992; Raich and Tufekcioglu 2000; Ryan and Law 2005; Subke et al. 2006). However, recent technological advances in soil respiration measurements (e.g. automated measurements, and isotope measurements) are providing unprecedented information at multiple temporal scales (Carbone and Vargas 2008).With these new resources, the scientific community could improve the understanding of the processes that regulate soil respiration to move beyond empirical models that are driven primarily by simple temperature and soil moisture relationships (Davidson et al. 2006; Kuzyakov and Gavrichkova 2010).

The scientific community faces several key unanswered questions for developing process-based models of soil respiration at short and medium temporal scales. These include: (1) What is the role of plants and associated microorganisms in regulating soil respiration? (2) What are the time lags between C fixation and its return to the atmosphere by autotrophic and heterotrophic respiration? (3) What are the key processes/mechanisms that regulate these return times?

The primary objective of this paper is to highlight innovative directions in soil respiration research. First, we highlight a few new experimental and technical advances. Second, we discuss challenges and potential analyses with continuous measurements of soil respiration. Third, we present new perspectives on modeling development and model-data integration. Finally, we conclude with a recommendation for stronger interactions between experimental and modeling scientists as the most promising way to improve our understanding of terrestrial C cycling.

New measurement techniques

Continuous measurements of soil respiration

Our understanding of the mechanisms that regulate soil respiration is mostly based on sporadic manual measurements (Bond-Lamberty and Thomson 2010) that do not accurately represent daily and seasonal patterns because they miss many days of the year, nighttime fluxes, phenology, and episodic events such as precipitation pulses. The more recent availability of automated soil respiration measurements now provides sub-hourly information to observe short-term variation in soil respiration. Over longer timescales (e.g., seasonal-interannual), these automated measurements allow for better estimates of the magnitude of soil respiration and can complement tower-based (i.e., eddy covariance) ecosystem (Lavigne et al. 1997) and understory (Baldocchi and Meyers 1991) measurements for site-level C balance approximations. Thus, with automated soil respiration measurements we can observe temporal variability in soil respiration (i.e., hourly-interannual), and obtain high time resolution input parameters for models.

Two primary methods of automated measurements of soil respiration have been developed. The first method includes both open (through flow) and closed chambers (circular flow) measurements (Goulden and Crill 1997; Irvine et al. 2008; Pumpanen et al. 2004; Savage and Davidson 2003). Automated chambers measure the CO2 flux at the soil surface, and therefore integrate all biophysical processes that contribute to soil respiration. Previous studies have discussed in detail technical issues related with chamber-based measurements of soil respiration and are not discussed here (Hutchinson and Rochette 2003; Pumpanen et al. 2004; Rochette et al. 1992; Savage and Davidson 2003).

The second method to measure soil respiration involves automated soil CO2 concentration profiles using gas wells (Hirsch et al. 2004; Risk et al. 2002a) or solid state CO2 sensors (Hirano et al. 2003). With soil CO2 concentrations profiles, the location of production of CO2 within the soil profile can be determined (Hashimoto and Komatsu 2006; Risk et al. 2002b), but the calculation of soil respiration is dependent on the proper estimate of multiple physical factors such as: (1) variation in soil water content that change CO2 diffusivity in the soil (Šimůnek and Suarez 1993); and (2) the proper estimation of tortuosity, soil texture, bulk density and porosity which are difficult to determine in rocky soils or water saturated soils (Jassal et al. 2005; Pumpanen et al. 2008; Turcu et al. 2005). Despite limitations of both methods, when used in combination, multiple studies have shown good agreement between the two (Jassal et al. 2005; Tang et al. 2005b; Turcu et al. 2005; Vargas and Allen 2008b).

One valuable product of automated measurements of soil respiration is the ability to observe diel patterns. There is evidence that diel cycles can be large and change abruptly day to day. For example during the early growing season in a semi-arid shrub ecosystem, soil respiration is tightly coupled to soil temperature until day of the year (DOY) 126 (Fig. 1a). Immediately after DOY 126, soil respiration is decoupled from soil temperature during the mid-day, and progressively more decoupled with any measured temperature (air, soil at multiple depths) throughout the growing season. Carbone et al. (2008) postulated that these patterns are regulated by differences in the timing and contribution of heterotrophic and autotrophic respiration in this ecosystem. Similar patterns have been observed across different vegetation types (Bahn et al. 2008; Baldocchi et al. 2006; Tang et al. 2005a; Vargas and Allen 2008a, b). However, in other ecosystems the apparent decoupling between soil respiration and soil temperature may be a result of physical processes such as changes in heat and CO2 transport in the soil as shown by modeling approaches (Jassal et al. 2004; Pumpanen et al. 2003).

Fig. 1
figure 1

Examples of diel patterns of soil respiration in a semi arid ecosystem modified from Carbone et al. (2008) showing soil temperature (grey and dashed line) and the sudden shift in the diel pattern of soil respiration (black) and b continuous measurements of soil CO2 production in an oak woodland from Vargas et al. (2010a) during dry season when trees are active but grasses are dead, and wet season when grasses are active but deciduous trees have no leaves

A second application of automated measurements is the potential to measure the location of soil CO2 production deeper in the soil using CO2 concentrations profiles (Risk et al. 2002b), and then related the production with biophysical factors (Hasselquist et al. 2010; Vargas et al. 2010a). For example, in an oak woodland savannah a vertical array of soil CO2 sensors have been installed to test differences in soil CO2 production between trees and grasses (Baldocchi et al. 2006; Vargas et al. 2010a). During the dry season the annual grasses are dead and the upper soil layers are dry, but the trees with deeper roots are active and higher soil CO2 production is observed at a deeper layer (up to 60 μmol CO2 m−3 s−1, Fig. 1b). In contrast, during the winter rains when the annual grasses are active but the deciduous trees are inactive, soil CO2 production is higher at shallow depths but with substantially lower magnitudes (up to 8 μmol CO2 m−3 s−1, Fig. 2b) than during the dry season associated with higher tree photosynthesis rates. Mechanisms that influence soil CO2 production at the top layers of the soil may be different than those that regulate production at deeper soil layers because of differential spatial distributions of heterotrophic and autotrophic respiration sources, and changes in soil CO2 diffusion across seasons.

Fig. 2
figure 2

Continuous measurements of soil respiration after a rain pulse event in the understory (black) and open grassland (grey) of a Mediterranean oak woodland. In this ecosystem photosynthesis is a function of soil moisture and light, but the soil respiration response is a function of number of antecedent rain events and antecedent sunlight (Baldocchi et al. 2006)

A third application of automated soil respiration measurements is the ability to measure the response of soils to stochastic precipitation pulse events (Fig. 2, Daly et al. 2008). The rapid rewetting of soils may yield a pulse in soil CO2 production that may persist for several days (Fierer and Schimel 2003) and could contribute to a large percentage of ecosystem respiration (Xu and Baldocchi 2004). Multiple efforts have been done to identify processes that regulate soil CO2 production after water pulses such as the ‘Birch Effect’ (Birch 1959; Jarvis et al. 2007), or the relationship between photosynthesis and number of antecedent rain events (Xu and Baldocchi 2004). Now with automated soil respiration measurements one can measure the patterns of soil CO2 production after a rain pulse event and estimate the contribution of soil CO2 pulses for the annual soil respiration budget.

Isotope approaches

Isotopic approaches are promising experimental techniques to obtain direct, quantitative links to plant and microbial mechanisms driving variation in soil respiration (Bowling et al. 2008; Hanson et al. 2000; Paterson et al. 2009; Trumbore 2006, 2009). These links include quantifying (1) the transport of C within plants, (2) the transfer of C from roots to microbes, SOM, and/or respiration, and (3) the contributions to soil respiration from autotrophic and heterotrophic sources.

Experimentally separating soil respiration into autotrophic and heterotrophic sources is an important challenge facing current soil respiration research (Hanson et al. 2000; Kuzyakov and Gavrichkova 2010). Isotopic partitioning methods are advantageous over other techniques, such as trenching or girdling, because they impart much less disturbance to the plant-soil system. However, isotopic partitioning methods require that there be discernible differences between the isotopic signatures of autotrophic and heterotrophic sources (end members). For this reason, partitioning soil respiration with natural abundance 13C has only been possible in ecosystems where there has been a shift in vegetation types using different photosynthetic (e.g. C3 and C4) pathways (Hanson et al. 2000; Kuzyakov 2006; Paterson et al. 2009; Subke et al. 2006).

Background abundance 14C measured by accelerator mass spectrometry (AMS) is proving to be an effective approach for soil respiration partitioning. This is because in many ecosystems there are large differences in the Δ14C signatures of autotrophic and heterotrophic respiration due to differential incorporation of ‘bomb’ C (Gaudinski et al. 2000; Trumbore 2000). ‘Bomb’ C is the result of thermonuclear weapons testing in the 1950–60s in which the atmospheric concentration of 14CO2 was increased almost two-fold. Since testing ceased in 1963, dilution from ocean and land CO2 exchange, and the burning of fossil fuels has caused the 14C of the atmosphere to gradually decline (Levin and Hesshaimer 2000). Therefore, because the Δ14C signature of C in vegetation reflects that of the atmosphere in which it was photosynthesized, the Δ14C signatures of new photosynthetic products (the primary source for autotrophic respiration) is close to that of the current atmosphere. In contrast, the Δ14C signature of heterotrophic respiration is elevated with respect to the current atmosphere because it integrates decomposition from both new and accumulated SOM. On average, heterotrophic respiration reflects Δ14C signature of C photosynthesized years to decades earlier (when there was more ‘bomb’ C in the atmosphere; Trumbore 2000). Thus, soil respiration can be separated in the field, using a simple isotope mass balance approach, where autotrophic and heterotrophic respiration ‘end member’ Δ14C signatures are determined by separate incubations of roots and SOM respectively (Gaudinski et al. 2000; Schuur and Trumbore 2006).

To date, this approach has been used to look at autotrophic and heterotrophic contributions over seasonal to interannual timescales in temperate (Cisneros-Dozal et al. 2006; Gaudinski et al. 2000), boreal (Czimczik et al. 2006; Hahn et al. 2006; Schuur and Trumbore 2006), and semi-arid (Carbone et al. 2008) ecosystems; and with manipulations of rain (Borken et al. 2006) and snow (Muhr et al. 2009). In addition, 14C partitioning may also be an effective approach to test more specific process-level hypotheses on shorter timescales, particularly when combined with automated soil respiration measurements (Carbone et al. 2008). Because 14C is an indicator of age, these measurements can also be used to investigate the effects of moisture pulses on microbial decomposition of SOM (Fig. 2), and test the hypothesized mechanisms to explain the ‘Birch Effect’ (Birch 1959; Jarvis et al. 2007). For example, is the primary response of heterotrophic respiration after re-wetting due to (1) the mineralization of microbial cellular materials released with the rapid increase in water potential stress (younger C) or (2) the physical breakdown of soil aggregates exposing older, yet still readily decomposable C (Muhr and Borken 2009).

This technique is not without limitations, which have been thoroughly discussed (Bowling et al. 2008; Hanson et al. 2000; Paterson et al. 2009; Trumbore 2006, 2009). Still it is important to mention that it assumes that the isotopic signatures of the ‘end members’ are representative. Capturing the heterotrophic respiration end member can be challenging, due to lateral spatial heterogeneity associated with new C inputs, but also due to vertical changes in the location of microbial decomposition with soil temperature and moisture variations. The autotrophic respiration end member can also vary (albeit less so) within ecosystems, across vegetation types, and seasonally depending on stored carbohydrate (older C) contributions to root respiration (Czimczik et al. 2006; Schuur and Trumbore 2006). Finally, 14C measurements are expensive so studies are limited in the number of replicates in time and space, and interpretation of the results must take into account sample size and uncertainty within the measurements.

The addition of tracers to ecosystems to follow the fate of C belowground is another valuable isotopic approach. Tracer studies include both pulse- and continuous labeling applications with isotopically distinct (enriched or depleted, 13C or 14C) substrates (e.g. CO2, litter, carbohydrates) introduced to the atmosphere or soil (Hanson et al. 2000; Kuzyakov 2006; Pataki et al. 2003; Pendall et al. 2004). Using ecosystem-level (i.e. field-based) CO2 pulse-labeling one can address issues about: (1) allocation on new C belowground (2) time lags between assimilation and soil respiration, (3) the role of photosynthesis and substrate supply in modulating soil respiration on the diel timescale, and (4) the transfer of C to microbial pathways.

Enhanced detection of 14C measured by AMS now allows for low-level 14C pulse-label applications (i.e. amounts well-below regulated health standards) in natural ecosystems. This approach is particularly valuable to trace the contribution of new C into small and long-lived pools because the label signal strength can be easily 6 orders of magnitude greater than possible with 13C (Carbone et al. 2007; Carbone and Trumbore 2007). In addition, AMS can now easily measure samples as small as 10 μg C (vs. 1 mg, Santos et al. 2007a, b), thus there is potential to capture labeled-C in specific compounds, like phospholipid fatty acids (PLFAs), and identify incorporation in different microbial C pathways (Kramer and Gleixner 2006; Kramer et al. 2010).

Novel ecosystem studies are also being conducted with 13C pulse-labels. Högberg et al. (2008) traced the fate of new C through various tree components, and into microbial pools. Most recently, Bahn et al. (2009) reported an innovative study in which 13C of soil respiration was measured by tunable diode laser absorption spectrometry (TDLAS). This combination of high-temporal resolution 13C measurements of soil respiration (i.e. real-time isoflux) plus manipulations of photosynthesis (shading) enabled new insights into substrate supply and aboveground links with soil respiration on the diel to weekly timescale. This approach is promising, particularly with the development and maturity of more field-worthy laser technologies, such as cavity-ring down spectroscopy (Wahl et al. 2006).

Exploring continuous measurements of soil respiration

The importance of quality assurance (Q a), quality control (Q c) and gap filling

Increases in dataset length, due to new automated measurements of soil respiration, will require specific protocols for Q a/Q c and gap filling. These protocols are important to produce clean time series that are useful for subsequent analyses and data-model fusion. Examples have been proposed for data quality and calculation of uncertainties with automated soil respiration chambers (Savage et al. 2008). Furthermore, the scientific community using the eddy covariance method, which measures the molar density of CO2 and vertical wind perturbation to calculate fluxes of CO2 between ecosystems and the atmosphere (Baldocchi 2003; Goulden et al. 1996; Valentini et al. 1996), has detailed standardized methods for data Q a/Q c (Papale et al. 2006). Future gap-filling approaches of automated measurements of soil respiration could incorporate non-linear regression, marginal distribution sampling, or artificial neural network based techniques (see Moffat et al. 2007). These techniques have been widely tested for gap filling of eddy covariance data and could be easily adapted for soil respiration research. Common harmonization procedures for soil and ecosystem CO2 fluxes will benefit future synthesis activities to better understand terrestrial carbon fluxes.

Time series analyses on soil respiration research

Once continuous measurements of soil respiration have been revised for Q a/Q c one can start analyzing the time series. It is evident that biophysical factors (e.g. temperature, soil moisture, photosynthesis) that regulate soil respiration operate on a wide range of temporal scales. Thus, we propose that complex spectral methods are valuable to understand the temporal correlation between soil respiration and those factors. New approaches with time series analyses could focus on the study of the frequency domain of soil respiration signals (both fluxes and isotopes) because they contain strong periodicities from diurnal to seasonal scales (Fig. 3). Identifying important periodicity on these signals can help to identify when different biophysical mechanisms are relevant for driving soil respiration fluxes. Time series analyses such as cross-correlation may not be appropriate because biometeorological signals, including time series of soil respiration, are non-stationary as they are subject to phenomena such as rain pulse events.

Fig. 3
figure 3

Global wavelet power spectrum of over 2 years of measurements of soil respiration in a mixed temperate forest in southern California (Vargas et al. 2010b). Legends in figure indicate factors that potentially influence soil respiration at specific time-periods (i.e. days, weeks, months and years). The y axis represents how much “energy” or power is represented by the signal at specific time-periods

An alternative method is the use of wavelet analysis which has been widely used for climate research, and previous studies have discussed in detail this technique (Torrence and Compo 1998). Wavelet analysis performs the estimation of the spectral characteristics of a time series as a function of time. A close analysis of any continuous measurements of soil respiration will reveal that the signal changes with time over a range of time-periods from days to years (Fig. 3). Intermediate time-periods (i.e., weeks to months) need to be separately investigated because episodic and transient events (i.e., heat waves, rain pulses) substantially influence soil CO2 processes at multiple temporal scales (Vargas et al. 2010b).

A subsequent technique is wavelet coherence analysis, which is used to analyze the temporal correlation between two time series. This technique has also been explained and discussed in previous studies (Grinsted et al. 2004; Torrence and Compo 1998). For example, using wavelet coherence analysis one can identify the temporal correlation between soil respiration and soil temperature or soil respiration and soil moisture. Using this technique Vargas and colleagues (2010b) reported that the temporal correlations of the time series between soil respiration and soil temperature are not consistent with time. This means that the temporal correlation between these two time series may not be significant every day of the year because other biophysical factors (e.g. photosynthesis, changes in soil CO2 diffusion rates) may have a stronger control over the signal of soil respiration. If these temporal correlations are not consistent in time, then it may be important that future model development and model parameterization consider when variations in soil respiration are significantly influenced by soil temperature at multiple temporal scales.

This brings us to a final application of time series analysis, which is the possibility to test the performance of soil respiration models in the frequency domain. In other words, one could identify when a model fails (i.e. has large residuals). To date, model performance is mainly done in the time domain where the goal is to match the measurements with the model outputs (Fig. 4a). However, it is difficult to identify under which circumstances the model (empirical or process based) fails, and therefore testing the model output in the frequency domain may become more common (Vargas et al. 2010b). For example, one can estimate the residuals of a model (Fig. 4b) and then explore them in the frequency domain to identify at which time-periods (e.g., weeks, months, and years) the residuals are persistent. In our example, the empirical model has large residuals at time-periods of 70 and 200 days (Fig. 4c). Once these time-periods have been identified, one could revisit the model and modify it based on the information gathered in the frequency domain. An alternative approach is the application of Singular System Analysis (SSA) to extract specific oscillatory patterns as a way of decomposition of the time series to explore the temporal patterns of model performance at different time scales (Mahecha et al. 2010). If model development is not the goal one can use time series analysis to propose hypotheses for further research based on information about when the model fails.

Fig. 4
figure 4

a Comparison of model results and measurements of daily average of soil respiration in a mixed temperate forest in southern California (Vargas and Allen 2008b). Model results were calculated using the empirical relationship: soil respiration = \( {B_{0}}{\text{exp}}^{({{{B}_{1}}T})};\) where B0 and B1 are constant model parameters and T is soil temperature. b Residuals of the empirical model as a function of time. c Wavelet power spectrum of model residuals showing substantial model error at time-periods of 70 and 200 days

Perspectives on modeling development and model-data integration

The scientific community is producing unprecedented datasets, and with this new data, new model developments and model-data integration exercises are expected. However, the modeling community is experiencing its own challenges and opportunities aided by the wealth of data collected in recent years. In this section we discuss our perspective about emergent ideas for modeling development and model-data integration for soil respiration research.

Model improvement is often hampered by inappropriate interpretation. One example is when process level parameters (e.g., enzyme activities) are mixed with whole-system level parameters (e.g., Q 10 of soil respiration) and both of these parameters are directly transferred into process models without accounting for confounding effects (Reichstein and Beer 2008). Therefore, one alternative treatment of the transition between scales (i.e., process level vs. whole-system level parameters) could be data-assimilation (Williams et al. 2009).

In the topic of soil respiration and C dynamics there are large uncertainties in model structure. Hence we caution the community that ‘blind’ data assimilation is not appropriate. We propose that data assimilation for soil respiration research should be replaced by a more general term ‘model-data synthesis’ or ‘model-data integration’, which consists of several steps (Fig. 5). At each of these steps, there is interplay between data, model structure, and the model developer. The process detail depends somewhat on whether the problem is focused on state estimation of the system, or on parameter estimation of the model. If state estimation is the goal, then model states are adjusted to generate closer agreement with the observations. If model parameter estimation is the goal, then model parameters are adjusted so that the model state(s) come into closer agreement with the observations. Following the optimization of model parameters (defined here loosely to potentially include both parameters sensu stricto and state variables), it is critical that further analyses be conducted by the modeler to: (1) quantify uncertainties in optimized parameters; (2) evaluate the plausibility and temporal stability of optimized parameter values; (3) understand when and why the model is failing; and (4) identify opportunities for model improvement (e.g., re-formulation of structure and process representation). When treated in this manner, we propose that model data synthesis has relevance to both basic and applied scientific questions.

Fig. 5
figure 5

The multi-step process for model-data fusion: a conceptual diagram showing the main steps (and the iterative nature of these steps) involved in a comprehensive data-model synthesis (after Williams et al. 2009)

We briefly bring to attention to three interconnected problems, which occur with model-data fusion and complex models: equifinality, overparameterisation, and effective parameterization. Equifinality describes the fact, that a certain dataset can be described equally well by different models or model parameterizations (Luo et al. 2009; Williams et al. 2009). As an illustrative example, consider the CO2 flux from a soil compartment, which is often described as Flux = pool * rate constant, where the rate constant is the parameter and the pool a state variable. In this case many combinations of pool and rate constant can describe soil CO2 fluxes, if they are inversely related (i.e., the rate constant is not well-constrained by the fluxes alone). Thus, additional measurements of pool sizes or combinations of pool estimates and fluxes may yield better constraints. Finally, one can make model experiments and suggest new empirical experimental approaches to test further hypotheses developed from this approach (Wutzler and Reichstein 2008).

It is known that process models try to generate system behavior (e.g., soil CO2 fluxes) by describing the sub-systems (i.e., autotrophic and heterotrophic respiration) and their interactions if possible from first principles. This often leads to models with many parameters, which are hard to parameterize (overparameterized models), and the above-mentioned equifinality problem could emerge. As shown by Reichert (1997) overparameterization often leads to highly correlated parameters. If this is the case, one can apply a Bayesian approach where prior information on the parameter distribution can be used for better model parameterization (Van Oijen et al. 2005).

Finally, we believe that the most critical problem is ‘confounded parameterization’. It occurs when the model structure is too dissimilar from the real world where certain processes are missing, or when there are systematic data-biases. Under those conditions, parameters are found in model-data fusion approaches to best ‘describe’ the observations. However, they do not represent ‘real’ process parameters because they are statistically confounded parameters. Hence, the process model loses its predictive capacity, which should originate from a sound process representation because the confounded parameter does not describe the system behavior correctly under modified conditions. Statistical models, such as the one used to generate the “modeled values” in Fig. 4, also experience this problem.

The phenomenon of ‘confounded parameterization’ is critical because it is not easily detectable (in contrast to overparameterization which can be tested for instance by looking at the covariance matrix of the parameter estimates). Ignoring this phenomenon could degrade model-data fusion into a bad fitting exercise. There are several studies where effective parameterization has been identified to be important for terrestrial C cycling research. For example, the case of estimation of respiration temperature sensitivity can be overestimated by confounding seasonal cycles when one is analyzing the bulk (or total) time-series data with a simple Q 10 model (Carrara et al. 2004; Reichstein et al. 2005). Furthermore, Carvalhais et al. (2008) showed that the steady state assumption in terrestrial C cycle models leads to effective and erroneous parameter estimates when the considered ecosystem is not in steady state. This is an area of active research where several techniques should be explored to avoid confounding effects with model parameterization. One example is the above-mentioned separation of model performance in time-scales (e.g. if by definition a model does not represent a certain time-scale it should not be evaluated against that). We propose that similar exercises should be done to avoid ‘confounded parameterization’ to study the influence of rain pulse events on soil respiration (Figs. 2, 3). In this case if a model does not describe the process behind them, the rain pulses could be interpreted as high (direct) sensitivity of soil respiration to moisture. This could lead the confounded (and erroneous) respective parameters, which for example would wrongly describe the response of the system to a mean change in soil moisture.

An ongoing challenge is to solve how the modeling community will cope with these problems for model-data integration and model developing. Our intention is not to provide solutions for soil respiration modeling, but to address these emergent ideas to stimulate constructive scientific discussion. We are moving to an era of regional and global research networks that will continue collecting unprecedented biogeochemical information (e.g. FLUXNET, NEON, ICOS). With increasing data resources we need to move beyond empirical models and integrate the large amount of multiple observations (e.g. soil and ecosystems CO2 fluxes, isotope data) into process-based models that should represent the “breathing of the biosphere” at multiple temporal and spatial scales. This will require a stronger interaction between modelers and experimentalists to improve model structures and generate further hypothesis for future generations of ecosystem experiments.

Future considerations

Current technological advancements are helping to improve our understanding of biophysical processes that regulate soil respiration and terrestrial C cycling. Yet, major challenges remain ahead of us: (1) quantifying autotrophic and heterotrophic sources of soil respiration across different ecosystems in time and space, (2) understanding the control of vegetation on soil respiration including lags and transport of C from photosynthesis to soil respiration; (3) incorporating disturbances into process-based models; and (4) measuring in high resolution (spatially and temporally) and upscaling predictions of soil respiration and C storage to larger spatial scales (e.g. landscape-global) and temporal scales (e.g. interannual to millennia). Combining studies with multiple techniques including isotopes, automated measurements of soil respiration and soil CO2 production with ecosystem fluxes (i.e. eddy covariance) will yield complementary information about C cycle processes and may influence on how we parameterize and construct models. Traditionally, new experiments have been based on observations, however, researchers can learn from model results, and design new experiments to test further hypotheses developed by models. This approach can help improve both models and experiments. Thus, greater cooperation between modelers and experimentalists is a potentially powerful approach to increase our understanding of soil respiration and the underlying processes that drive this important flux.