1 Introduction

Soil persists as a temporal reservoir for phosphorus (P) in plant production, but only a relatively small fraction of total soil P is available for direct plant or microbial uptake. Inorganic and organic P forms in soils are a result of multifaceted P turnover processes which are affected by plants, microorganisms, and abiotic factors (Annaheim et al. 2013). As in organic farming crops are to be nourished primarily through the soil ecosystem (Council Regulation (EC) No 834/2007 on Organic Production), the farmers should contribute to maintaining and enhancing soil fertility. In organic farming, the recycling of wastes and by-products of plant and animal production (e.g., livestock manures, compost) and low soluble mineral fertilizers (e.g., rock phosphates) are permitted for P import. Furthermore, organic farmers try to foster biological processes in the soil for P mobilization by diverse crop rotations and organic fertilization. Nevertheless, many of them tolerate P budget deficits not knowing when soil reserves will be depleted (Gosling and Shepherd 2005). In organic agricultural soils in Europe, P is often enriched due to former application of inorganic and organic fertilizers (Tóth et al. 2014). This offers reserves for organic farms with low external feed and fertilizer import. But, continuous P budget deficits may lead to low plant-available P contents in soil.

The availability of P in soil depends on the P input and is also regulated by biological and biochemical processes. The biological oxidation of organic P in soil is largely mediated by microbial activity and by phosphatases of either plant or microbial origin, whereas the mobilization of less-available inorganic P sources is supported by, e.g., root-induced acidification in the rhizosphere and excretion of organic acids (Hinsinger 2001; Richardson et al. 2011). The application of organic matter can improve soil chemical and physical properties as well as the nutrient availability in soil (Manlay et al. 2007). On the other hand, the amount and quality of the organic matter supplied, mainly its C:N:P ratio, also affect the soil microbial activity (Güsewell and Gessner 2009). Increasing contents of inorganic P in soil can reduce or mask P mobilization processes and may also reduce the excretion of phosphatases and, as a consequence, the hydrolyzation of organic P compounds (Dick et al. 2011).

Worldwide, the recommended extraction procedures for the determination of plant-available P in soil differ but fertilizer recommendations were usually given with special attention to soil inorganic P pools (Jordan-Meille et al. 2012). However, organic P constitutes an important proportion of total P in soil of up to 94% (Achat et al. 2009). The soil organic P fraction includes different chemical forms which differ in their potential availability for plants (Annaheim et al. 2013). Especially in grazed pastures, organic P is seen as an important source to sustain the P nutrition of plants, but quantitative data on the release of orthophosphate are widely missing (Nash et al. 2014).

In general, legume species were often described to mobilize poorly available P forms, mainly due to the exudation of organic acids, ions, and phosphatases (Li et al. 2007). The cultivation of legumes in crop rotations is therefore, besides being an important source of N, seen as a good opportunity to utilize soil P reserves.

Against this background, we expected that the soil P availability and enzymatic activity would behave differently in dependence of the farming systems over the study period. We hypothesized that under conditions of long-term negative P budget in organic farming, the decline in readily available soil P pools would be less pronounced in a dairy system (arable land and grassland) than in a stockless system (arable land). We believe this was due to the manure backflow to fields, the higher percentages of forage legumes, and longer soil cover during the year in the dairy systems.

2 Material and methods

2.1 Study site

The study on arable and grassland fields was undertaken at the organic experimental station of the Thünen-Institute of Organic Farming in Trenthorst/Wulmenau in North Germany (53° 46′ N, 10°31′ E, average precipitation 706 mm year−1, mean temperature 8.8 °C) which was converted to organic farming in 2001. No mineral or imported organic P fertilizers were applied since then. The soils were characterized as Cambisols and Luvisols from boulder clay with sandy loamy texture (means: arable land 46% sand, 34% silt, 18% clay; grassland 38% sand, 41% silt, 16% clay).

Former conventional crop rotations were focused on winter cereals and rapeseed production (N fertilization level 200–250 kg ha−1 year−1, dry matter grain yields 8–10 t ha−1 and 3–4.5 t ha−1, respectively). In 2001/2002, soil fertility was strengthened by introducing clover varieties for biological N fixation before the experimental farm was divided into different organic farming systems with fixed field size and positions in 2002/2003. In our study, adjacent fields of a stockless and a mixed dairy system were analyzed. These arable systems had a 6-year crop rotation on six fields, but differed in input of livestock manure, crops, and percentage of forage legumes and grain legumes (Table 1). Permanent grassland was additionally used as feed resource in the dairy system (Fig. 1). Here, mineral fertilization was omitted and organic fertilization was reduced with conversion to organic farming. A 1-ha long-term monitoring plot (LTM plot) with four georeferenced sampling points (LTM points) was located within each field. The LTM points had a distance of >30–60 m and were used for regular soil and plant sampling. In the following, results of long-term development of plant-available P(CAL) in soil were based on analysis of data from samples of all LTM points in all LTM plots of all years. For detailed analyses of P fractions, organic P, and enzymatic activity in soils, we focused on LTM points of four LTM plots (Fig. 2, focus areas). Average values of other chemical topsoil properties (C, N, K, Mg, pH) in February 2003 and in February 2013 (winter season) are given in Table 1 to show the general trends in development over time.

Table 1 Crops grown in the focus areas within the arable crop rotations in Trenthorst (harvest years 2001-2013), main sward components in the focus areas of grassland and average values for main soil chemical parameters in 2003 and 2013 (per year n=24 for arable crop rotations, 0-30 cm sampling depth, and n=8 for grassland, 0-10 cm sampling depth); ± values indicate standard errors
Fig. 1
figure 1

View of fields included in the study on long-term organic farming systems in Trenthorst/Wulmenau (North Germany). a Permanent grassland grazed by cows and harvested since year <1950 (54 ha). b Arable land divided into a stockless system without external P input (30 ha) and a dairy system with internal P backflow via manure (62 ha) since year 2001

Fig. 2
figure 2

Map showing the fields of arable land (dairy and stockless) and grassland at the organic experimental station Trenthorst/Wulmenau, with long-term monitoring (LTM) plots for soil sampling and analysis. Four LTM plots were chosen as focus areas (asterisk) for detailed studies of soil P characteristics

In the stockless system, grains were exported and clover mulch and straw always remained on the fields. No organic or mineral fertilizers were applied to this system during 2001–2013. In the dairy system, clover grass and straw were used as forage and litter in the stable, and solid and liquid manures were applied to arable land and grassland according to the management scheme of the farm. In the fields we focused on, manure was applied in 2004, 2005, 2007, and 2010 on arable land and in 2005, 2007, 2009, and 2012 on grassland. The chemical composition of the manure was not recorded continuously, but in the period 2010 to 2012, 5.3 kg P ha−1 year−1 and 1.4 kg P ha−1 year−1 have been applied on arable land and grassland of the dairy farm, respectively (Ohm et al. 2015). Grassland was grazed and harvested for silage and hay. Feedstuff imports did not balance out the field and farm P budgets of the mixed dairy system. In a close view, arable and grassland fields of the dairy system had P deficits of −10.9 and −7.9 kg P ha−1 year−1 in that time, respectively (Ohm et al. 2015). In the dairy arable system, average grain yields of legumes and cereals were between 2.5 and 5 t ha−1 dry matter. Average dry matter yields of maize and grass-clover (2–3 cuts) for silage preparation were between 7.7 and 11 t ha−1. Grassland yields with grazing and harvesting were determined to be between 4 and 6 t ha−1 year−1 in that time. Also, the stockless system was in a continuous P export situation since 2001. Here, an average dry matter grain yield level of 2 t ha−1 year−1 (dry matter grain yields of rapeseed, legumes, or cereals between 1 and 5 t ha−1) with mean P concentrations in grains of 3.5 g P kg−1 (typical P contents 2.8 and 6.8 g kg−1 in cereal and legume grains, respectively) in 10 of 13 harvest years on six fields meant an average field budget deficit of −5.4 kg P ha−1 year−1.

2.2 Soil sampling

Soil samples to determine enzyme activities were collected from three drillings of 4 cm diameter with a depth of 30 cm around each LTM point in the focus areas in May 2001 and again in May 2013 (spring season). These samples were stored frozen at −70 and −20 °C, respectively. Also, various inorganic P forms, organic P and total P were analyzed, this additionally in soil sampled on arable land in February 2009 and stored frozen at −20 °C. Changes over time in plant-available P(CAL) were described by annual sampling during 2001–2013 in arable land and from 2003 to 2013 in grassland. These samples were collected in February or March (winter season) of each year on all 14 LTM plots of the long-term monitoring scheme (Fig. 2). The sampling depth for these samples was 0–30 cm in arable land and 0–10 cm in grassland.

2.3 Soil and plant analyses

In soils, the concentrations of biochemical indicators for P mobilization (acid and alkaline phosphatases), of microbial activity (dehydrogenases), of the P availability to plants (P(CAL)), of mineral P fractions, of organic P, and of total P were evaluated.

For the enzymatic analyses, the frozen soil samples were thawed and kept at room temperature for 24 h. Roots were removed by tweezers before chemical extraction. The acid and alkaline phosphatase activities were determined after incubating 1 g soil with p-nitrophenolphosphate for 1 h at 37 °C at a pH of either 6.5 (acid phosphatase) or 11.0 (alkaline phosphatase) by photometrical measurement of the p-nitrophenol (p-NP), which was extracted with 1 M CaCl2 and 0.5 M NaOH, respectively (Tabatabai and Bremner 1969). The dehydrogenase activity was analyzed according to Thalmann (1968) by suspending 1 g soil in 0.8% triphenyltetrazolium chloride (TTC) solution followed by incubation at 37 °C for 24 h. TTC will be reduced to triphenylformazan (TPF) by most microorganisms. The released TPF was extracted with acetone and measured photometrically at 546 nm. The other parameters were determined in air-dried soil samples. The plant-available soil P concentration was extracted with calcium acetate lactate as P(CAL) (Schüller 1969), which is a standard soil P test in Germany. To characterize the inorganic soil P forms in more detail, a modified sequential P fractionation method was applied in accordance with Hedley et al. (1982). The most labile P fractions were removed with mild extractants, followed by increasingly stronger extractants to identify less biologically available fractions. The following inorganic P fractions with decreasing solubility were derived stepwise H2O-P > NaHCO3-P > NaOH-P > HCl-P > Residual P. Residual P after HCl extraction and total P in the original soil sample were extracted from the soil by 18 M H2SO4 and the following addition of H2O2 (30%) (Tiessen et al. 1993). To estimate the organic P concentration in soil (Porg), 10 g of dry soil were treated for 16 h with 0.25 M NaOH–0.05 M Na2-EDTA in a soil-to-solution ratio of 1:10 (w:v). For inorganic P, the extracts were diluted 10 times before colorimetric measurement with malachite green at 623 nm (Ohno and Zibilske 1991). For total P in the NaOH-EDTA extract, a persulfate digestion was performed with 1 mL of NaOH-EDTA extracts and 10 mL of digestion mix (6 g NH4-persulfate in 100 mL 0.9 M H2SO4). In the neutralized digests, total P was determined by colorimetric measurement. Organic P was calculated as the difference between total P and inorganic P. Total P in plant material was determined photometrically after nitric acid digestion in a microwave oven (Huang et al. 2004).

2.4 Statistical analysis

Statistical analyses were conducted using R 3.2.2. Results on P fractions, total P, Porg, and activity of phosphatases and dehydrogenases from the focus areas of 2001, 2009, and 2013 were compared by pairwise t tests or, if data were not normally distributed, with the paired Wilcoxon signed-ranks test. The development of soil P(CAL) concentrations between 2001 and 2013 was analyzed using linear mixed-effects models. As the data were not normally distributed, they were log-transformed. For the data from arable land (dairy and stockless), the model was set up with year, farming system, and the interaction of these two as fixed effects and LTM point nested within LTM plot (see Fig. 2) and crop (Table 1) as random intercept effects. The model for the grassland data used year as fixed effect and LTM point nested within LTM plot as random intercept effect. The R package lme4 (R package version 1.1-8) was used for the modeling of mixed effects. Marginal and conditional R 2 were calculated for the mixed models by using the function r.squaredGLMM from the R package MuMIn (R package 1.15.1). While marginal R 2 deals with the variance explained by the fixed factors, conditional R 2 deals with the variance explained by the entire model (fixed + random factors). Least-square means (LS-means) were extracted after modeling using R package LS-means. A linear regression was performed based on LS-means for analysis of the development of P(CAL) over years in the farming systems. Possible relations between P in plants and P(CAL) in soils were analyzed by the Spearman rank correlation.

3 Results and discussion

To test the hypothesis, we compared the development of the soil characteristics in the three farming systems: two arable systems (dairy and stockless) and a grassland system from 2001 to 2013. We expected stronger reductions of soil P concentrations and lower microbial activities in the stockless arable system than in the other tested systems.

3.1 P(CAL) contents in soil

The mixed models for arable land and grassland had a marginal R 2 of 0.21 and 0.52 and conditional R 2 of 0.69 and 0.60, respectively. There are relatively high explanation rates of the variance with the entire models for arable land and grassland, whereas the different marginal R 2 indicate a higher explanation rate of the fixed effect (year) in grassland compared to the fixed effects (year, farming system) used in the model approach for arable land. Maybe this effect was due to higher variability of crops and to soil cultivation in arable systems compared to the continuous plant cover of soils in grassland.

Results from other long-term experiments with arable land (Tunney et al. 2003) and grassland (Messiga et al. 2015) have shown that plant-available soil test P is closely related to P budgets and that a single linear regression can describe changes in soil test P for both negative and positive P budgets. Also in all our systems we found a significant reduction of the P(CAL) values (Fig. 3) by linear regression. The average yearly reduction of the P(CAL) values was 1.76 mg kg−1 in the stockless arable plots, 1.41 mg kg−1 in the dairy arable plots, and 3.17 mg kg−1 in the grassland plots (Fig. 3).

Fig. 3
figure 3

Development of plant-available P(CAL) values (LS-means) in soils over the years within the stockless and dairy farming system (n = 24 per year and crop rotation, sampling depth: 0–30 cm, 2001, 2003–2013) and on grassland (n = 8 per year, sampling depth: 0–10 cm, 2003–2013) and acceptable lower threshold according to Kuchenbuch and Buczko (2011). A declining trend is visible in all three systems. Error bars indicate the 95% confidence interval of the LS-means. Regression equations: Grassland: y = −3.177× + 172.48, adj. r 2 = 0.30, p = 0.046, Stockless arable: y = −1.763× + 91.5, adj. r 2 = 0.66, p = 0.001, Dairy arable: y = −1.415× + 80.54, adj. r 2 = 0.50, p = 0.006

Grassland had higher starting values in P(CAL) and higher data variability than arable land (Fig. 3). Higher variability could be an effect of excrements of grazing cattle in the soil top layer and also of missing soil tillage which would have eased distribution of P from organic matter from plant residues and organic fertilization in the arable systems. In another study soils that were high in P exhibited a faster change in soil test P because of the lower capacity to bind P (Zhan et al. 2015). Besides the initial P(CAL) values the relation between P budget and changes in soil test P depends also on the location, soil type and pH value (Messiga et al. 2015). As we compared fields from one location we can assume that differences in soil characteristics like pH and soil texture will be less relevant in arable land, whereas lower pH in grassland and permanent plant coverage might have led to different dynamics. Weather effects as reasons for different development could be almost excluded as the P(CAL) content in the grassland and the arable system did not show the same pattern over the years.

Looking at possible effects of different crops, relatively high values of the labile inorganic P fractions were found after cultivation of legumes in arable land (Table 2) (see also Chapter 3.2). But neither the contents of the labile P fractions nor of P(CAL) contents in the focus areas of both rotations supported clear differences in development in both arable systems. With regard to the determination of labile P fractions, since only 3 years and a small number of samples were analyzed, possible effects of soil heterogeneity might have increased the variability of data. A more consistent trend could be derived from the higher number of data used for regression analysis of P(CAL) contents over all 12 years. Thus, to avoid overinterpretation of data from single events, we focused on general trends and a meaningful placement and discussion of the data within the system differences in the following.

Table 2 Development of soil P characteristics (P fractions, and enzyme activity). Measurements are done at the same LTM-points in 2001, 2009, and 2013 at arable land and grassland in Trenthorst (arable land stockless: n=4, arable land dairy: n=4, grassland: n=8, 0-30 cm).

With the continuous decrease in plant-available P(CAL) in soil, minimum thresholds given for plant-available P(CAL) necessary for sufficient plant growth have not yet been reached. Kuchenbuch and Buczko (2011) derived a lower threshold of 44 mg P(CAL) kg−1 in topsoil of conventionally managed fields. P fertilizer should be applied according to the plant removal to secure yields then. For the lower yield level of organic farms in Germany, even lower P(CAL) contents may be sufficient. So, it can be assumed that soil P does not limit the plant P supply and yields on the studied site yet. This is supported by our findings that even at sample points with plant-available P(CAL), concentrations in soils around that threshold, relatively high P concentrations in plant tissue and cereal grain yields (between 3 and 5 t ha−1 dry matter) and grassland yields (3.5–5 t dry matter) adequate for organic farming were reached. Also soil P(CAL) and plant P concentrations at the LTM points had only low correlations in wheat and triticale (n = 60, samples after tillering, taken from 2004 to 2011 and in 2013: Spearman’s rank correlation, ρ = 0.372, p = 0.003**) and were not correlated in grassland at all. The average P concentration of wheat and triticale plants grown in the arable systems at the beginning of stem elongation (BBCH 30/31), sampled during 2004–2010 plus in 2013 (n = 60), was 0.23% P in dry matter (range 0.15 to 0.34%). The mean P concentration in grassland sward was 0.28% P in dry matter with a range between 0.13 and 0.45% P.

The development of P(CAL) concentrations towards the minimum threshold in soil can contribute to an efficient utilization of P sources and the reduction of P losses in agroecosystems (Cordell et al. 2009). Similarly, Simpson et al. (2011) stated that managing soil fertility near the critical P level maximizes the P efficiency. So, mining P soil reserves as what happened in the systems of this study can replace the external P supply with fertilizers for a certain time span. However, in a long term, this practice is not sustainable and the P applications within a crop rotation should correspond to the P removal by crop harvests. For the grassland, however, the decrease of the relatively high initial soil P(CAL) values could have been useful to reach the range for the optimal P concentration and to reduce the risk of P losses.

3.2 Hedley P fractions and organic P in soil

The soil P fractions were analyzed in 2001, 2009, and 2013 and in the focus areas only. The values of the labile inorganic soil P fractions (H2O-P, NaHCO3-P) of the arable systems decreased during the study period, while significant reductions occurred mainly from 2009 to 2013 (Table 2). Concentrations of labile P pools in soils depend on many influencing factors (see above) and can vary widely in dependence of the field conditions (e.g., Messiga et al. 2015; Requejo and Eichler-Löbermann 2014), which underlines the importance of long-term studies with a dense sampling scheme. In this context, the P(CAL) values did not differ between each single year, although in the long term, a significant decrease of the P(CAL) values could be derived by regression (see above).

The labile P fractions in the arable soils after growing clover grass were relatively high in 2009. Forage legumes have a relatively high P demand which is associated with N2 fixation, and as stated in the introduction, legume crops are known to mobilize less available P resources in soil. Therefore, the high labile P amounts in both arable systems in 2009 (Table 2) after growing red clover, might be explained by the active mobilization of the legumes and by a high P release during decomposition of plant residues (Talgre et al. 2014). The fact that P mobilizing species in crop rotations can enhance growth and P uptake in following crops is well described (e.g., Simpson et al. 2011).

Lower contents of the labile P fractions for grassland were found in 2013 in comparison to 2001 (no sampling in 2009) (Table 2). As mentioned above, the grassland soil had much higher initial soil P concentrations which make a significant decline under P budget deficits more probable.

The development of P supply in Trenthorst topsoil generally suggests that labile inorganic P fractions in soils will decrease over time with P budget deficits. However, the more stable inorganic P fractions (residual P, HCl-P, and also total P) remained almost unaffected in arable land over 12 years (Table 2). When appropriately addressed, they might be seen as potential buffer for future P release. Generally, mechanisms are known to address the utilization of this buffer by different crop species, microorganisms, and agricultural measures (Simpson et al. 2011). Nevertheless, the use of these mechanisms at the borderline of acceptable minimal contents of plant-available P in soils has not been developed for practical agricultural management so far and they will depend very much on environmental factors.

The initial Porg contents in the grassland soils (675 mg kg−1) were about six times as high as in the arable soils (about 105 kg mg kg−1), and accounts for about 50% of the total P (in the arable fields only about 20%) (Table 2). Although in tendency, the Porg contents in soil decreased in all systems during the study period, significantly lower contents were only found for the dairy arable system in 2013 when compared with 2009.

3.3 Activities of enzymes in soil

The microbial activities (activity of dehydrogenase) were about three to five times higher in the grassland soil than in the soil of the arable systems (Table 2), which is most probably related to the higher soil organic C contents (Table 1). It is well documented that an increase in soil organic matter enhances microbial biomass and the microbial activity of soils (Hupfauf et al. 2016). A higher microbial activity indicates a greater potential for mineralizing nutrients from organic compounds, which can replenish the labile inorganic soil P pools. Due to higher values, especially in grassland, we expected higher mobilization, e.g., from Porg. However, in our study, this replenishment could neither be found in grassland nor in arable fields. Quite the reverse, labile P forms were decreased in all systems after 12 years. In grassland, also the activity of dehydrogenase in soils decreased significantly which is in conjunction with the reduction of the soil organic C contents from 35 to 23 g kg−1 (Table 1). This view is supported by findings, that the structure and nature of C compounds can also have an effect of microbial processes in soil (Abraham and Chudek 2008). Plant litter is a primary source of food and energy for soil biota and its composition will therefore also have an effect on the microbial activity. Additionally, soil moisture and temperature are known to influence the microbial activity (Chen et al. 2003). However, as the activity of the dehydrogenase was relatively stable in the arable farming systems we do not believe that weather and soil conditions at sampling time were the major reasons for the decreased values in the grassland. But, quality changes in soil were visible in grassland by increased C:N ratios and decreased total N concentrations compared to the beginning (Table 1). So, the decreasing trend in dehydrogenase activity in grassland might be related to decreasing organic matter input and of its changing quality due to lower fertilization level and lower plant growth with the introduction of organic farming in 2001. However, at the end of the study, the organic C contents in the grassland were still double that of the arable fields.

Looking at the phosphatases, the alkaline phosphatase is mainly produced by soil microorganisms, and the extent of its excretion should be coupled to the microbial activity (Albiach et al. 2000). Indeed, the alkaline phosphatase activity was found to be higher in the grassland than in the arable systems, although the differences were not as pronounced as for the microbial activities. Similarly, the activities of the acid phosphatase, which can be excreted by both crops and microorganisms, were also higher in the grassland soil. Relevant changes of the phosphatase activities from 2001 to 2013 were found only in the stockless arable system for the acid phosphatase (from 133 to 160 μg P-NP g soil−1). Here, one should consider that the stockless arable system had the lowest initial activity of acid phosphatase. The values were increased compared to the values at the beginning of the study, probably caused by a broader range of crop species, especially the introduction of forage legumes (Table 1) with the conversion to organic farming. As the activities of the phosphatases tended to increase in all systems, this could also be related to the decrease of P(CAL) values and the other labile P forms which might have stimulated enzyme excretion. Whereas a negative correlation between soil-available P and phosphatase activity was described before, e.g., by Balota et al. (2011) a significant correlation between P(CAL) and phosphatase activity could not be derived from the data analyzed in this study.

4 Conclusions

The 12-year negative P budget reduced the level of plant-available P(CAL) and labile inorganic P forms (Hedley fractionation) in soil of all organic farming systems. This showed that higher percentages of forage legumes and manure recycling in mixed arable crop rotations and grassland do not necessarily mitigate decreases of plant-available P contents in soil as compared to a stockless crop system. Nevertheless, P(CAL) contents remained above the threshold value for a sufficient P supply to plants. The question of whether P mobilization from soil reserves with further decreasing available P contents, e.g., by enhanced soil microbiological and rhizosphere effects, at this site will be more pronounced in the future, is still open. However, independent of the farming system, it will be necessary to find a site-specific fertilizer application rate to balance the P export with the harvested crops for the future.