Next Article in Journal
Recovery of Vegetation Cover and Soil after the Removal of Sheep in Socorro Island, Mexico
Previous Article in Journal
Deriving Merchantable Volume in Poplar through a Localized Tapering Function from Non-Destructive Terrestrial Laser Scanning
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Developing Biomass Equations for Western Hemlock and Red Alder Trees in Western Oregon Forests

by
Krishna P. Poudel
and
Hailemariam Temesgen
*
Department of Forest Engineering, Resources, and Management, Oregon State University, Corvallis, OR 97331, USA
*
Author to whom correspondence should be addressed.
Forests 2016, 7(4), 88; https://doi.org/10.3390/f7040088
Submission received: 29 February 2016 / Revised: 15 April 2016 / Accepted: 19 April 2016 / Published: 21 April 2016

Abstract

:
Biomass estimates are required for reporting carbon, assessing feedstock availability, and assessing forest fire threat. We developed diameter- and height-based biomass equations for Western hemlock (Tsuga heterophylla (Raf.) Sarg.) and red alder (Alnus rubra Bong.) trees in Western Oregon. A system of component biomass equations was fitted simultaneously with a constrained seemingly unrelated regression. Additionally, a linear model that predicts total aboveground biomass as a function of DBH and height was also fitted. The predicted total biomass was then apportioned to different components according to the predicted proportions from beta, Dirichlet, and multinomial log-linear regressions. Accuracy of these methods differed between species with higher root mean squared error (RMSE) being produced in red alder trees. Within species, the accuracy of the equation for bole biomass was better than the equations for other components. None of these methods stood out as a clear winner, but the multinomial log-linear regression produced marginally better results compared to other methods in terms of RMSE, except for Western hemlock bark biomass and red alder bole and branch biomass. The equations based on a seemingly unrelated regression provided lower RMSEs for those species-component combinations.

1. Introduction

The U.S. forest carbon inventories until 2009 were based on tree biomass estimates obtained from Jenkins et al. [1] biomass equations (the Jenkins equations, hereafter) along with sample tree measurements and forest area estimates of the U.S. Forest Service’s Forest Inventory and Analysis (FIA) [2]. Current U.S. official carbon inventories are based on tree biomass estimates obtained from the component ratio method (CRM) described by [3] and understory vegetation biomass estimates obtained from the Jenkins equations [4].
The Jenkins equations were developed using the modified meta-analysis of the compiled diameter-based equations for total aboveground biomass (AGB). Merchantable stem wood, stem bark, and foliage biomass in their method is estimated as the proportion of AGB through a nonlinear exponential function of diameter at breast height (DBH), whereas biomass in stump and branches is calculated by subtraction. The Jenkins equation for AGB is a single entry equation that uses DBH as the only predictor of AGB. Therefore, with their equation form, biomass continues to increase as diameter increases [2], does not account for the variation in stem form [5], and produces the same AGB for trees with the same DBH but different tree height.
The importance of quantifying forest biomass has increased with its consistent recognition as a potential alternative to fossil fuel. Therefore, the need for an accurate and consistent method of estimating total and component biomass is undoubted. In 2009, the FIA updated its biomass estimation protocol by switching to the CRM to estimate biomass of medium and large trees. The CRM was proposed for a consistent national biomass estimation based on FIA volume estimates [5]. However, because different FIA units use different volume equations, the biomass of trees with the same diameter and of the same species differ by regions [2]. The CRM calculates biomass in a merchantable bole by converting sound wood volume to bole biomass using wood-specific gravity factors compiled by [6]. Similarly, biomass in bole bark is based on the percentage of bark and bark-specific gravities given in [6]. The biomass of tops and limbs is calculated as a proportion of the bole biomass based on component proportions from the Jenkins equations. The biomass in stump wood and bark is based on volume equations in [7] and the compiled set of wood and bark-specific gravities. Total aboveground biomass is obtained by summing these component masses.
The Pacific Northwest unit of the FIA (FIA-PNW) uses its specific set of equations to calculate tree biomass. Stem wood biomass, regardless of whether it is the merchantable bole or the total stem, is calculated from the cubic volume estimates and wood density factors. Each tree species is associated with a set of local volume and component biomass equations [5]. The regional models, however, may not be unbiased at the local scale if there is spatial variation in the tree form due to one or more unknown predictors.
The estimation of AGB is necessary for forest carbon reporting, whereas the estimation of component biomass is necessary for assessing feedstock availability for bioenergy plants and fire threat assessment. There are two approaches that one can take when estimating total and component biomass: 1) estimate total AGB and distribute it to different components (e.g., the Jenkins method) and 2) estimate component biomasses and sum them to obtain the total (e.g., the FIA-PNW method). However, the accuracy of estimation varies significantly between total and component models. When component models are fitted, the strength of the relationship exhibited by bark, branch, and foliage is nowhere near that for the stem wood [8] or the total AGB. Moreover, the component biomass estimation in hardwoods requires a more innovative approach due to its decurrent form [9].
Allometric equations link total tree or component biomass with easily measurable dendrometric information such as DBH (and height) through regression. Development of such equations should be based on sound statistical formulations and rational biological considerations [10]. Complex allometric equations that include other explanatory variables in addition to DBH provide a better prediction of AGB than the simple DBH-based power model [11]. When component biomass is obtained as a proportion, it is essential that the regression for the modeling proportion assume non-normal error, since proportions are bounded between zero and one and do not cover the entire real line. However, there is not enough consideration given in this regard (e.g., [1]). Recently, [12] used beta, Dirichlet, and multinomial log-linear (MLL) regressions to predict the proportion of AGB in different components for Douglas fir and lodgepole pine trees. These regressions assume that the dependent variable(s) follow(s) beta, Dirichlet, and multinomial distributions, respectively and outperformed the commonly used, seemingly unrelated regression in their study.
Western hemlock and red alder are two important tree species of the PNW. Western hemlock accounts for approximately 14% of gross live volume and approximately 8% of total AGB in the PNW. In terms of volume and biomass in this region, it is second only to Douglas fir, which accounts for approximately 30% of gross volume and 51% of live tree biomass [5]. Similarly, of the ten native species of Alnus, red alder is the only one that reaches commercial size and abundance and is also the most common and important hardwoods in the PNW, accounting for 60% of total hardwood volume in the northwest [13]. Both Western hemlock and red alder trees serve multiple purposes, including timber and pulpwood production. Additionally, the red alder has high biodiversity values and plays a critical role in riparian zone conservation.
Using the biomass data collected from the Willamette National Forest, Oregon, we examined two approaches to estimate total and component biomass in Western hemlock and red alder trees. In the first approach, we simultaneously fit a system of biomass equations using a seemingly unrelated regression. The equation for total AGB is constrained such that the prediction from the total AGB equation is compatible with the sum of the predicted values of the component biomass equations. In the second approach, we first fit a linear model that predicts total AGB as a function of DBH and total tree height. The predicted total AGB is then apportioned to different components according to the predicted proportions from beta, Dirichlet, and MLL regressions. Performance of these approaches was evaluated based on the bias and root mean square error (RMSE) they produced.

2. Materials and Methods

2.1. Study Area

This study was carried out in the Willamette National Forest, an approximately 678,013-ha area that stretches for approximately 177 km along the western slopes of the Cascade Range (44°3′ N, 122°5′ W) in the western part of the U.S. state of Oregon. The elevation of the forest ranges from about 460 to 3200 m. About one-fifth of the forest, 154,106 ha, is designated for wilderness. Douglas fir is the dominant species with at least 15 other conifer species including hemlock, cedar, pine, and several species of fir common on the forest. The forest receives approximately 2000 to 3800 mm of annual rainfall, and the average annual temperature ranges from 5 to 19 °C. The forest has diverse soils, but their parent material is of volcanic origin. The locations of stands from which the sample trees were selected are shown in Figure 1.

2.2. Data

Detailed biomass data was collected by destructively sampling 14 red alder (Alnus rubra Bong.) and 12 Western hemlock (Tsuga heterophylla (Raf.) Sarg.) trees. The fieldwork was carried out between the first week of July and the third week of September in 2013. Tree level attributes such as DBH, total height, crown base height (height to the base of the first live branch), crown width, and main stem diameter at 0.15 m (stump height), 0.76 m, 1.37 m (breast height), and 2.4 m above ground, and every 1.22 m afterwards, were recorded. The average DBH was 27.1 cm (range 16.3–51.8 cm) and 44.7 cm (range 18.0–69.9 cm), and average height was 23.7 m (range 12.8–31.6 m) and 31.3 m (range 14.4 m–39.8 m), for red alder and Western hemlock trees, respectively.
A total of nine (four, three, and two from bottom, middle, and top equal-length stratums of sample trees) first-order branches, i.e., the branches that are directly attached to the main stem, per tree were randomly selected to obtain separate oven dry weights of branch wood and foliage. Individual branch wood and foliage biomass was obtained by fitting a species-specific log linear model as a function of branch diameter.
Stem wood and bark biomass was based on 3–5-cm-thick disks taken from the top of the stump and every 5.18 m. Stem wood biomass was calculated by multiplying the volume of each 5.18-m section by the average density of the disks taken from two ends, whereas stem bark biomass was based on the bark volume and bark density of the bark sample taken from the disks. Total stem wood and stem bark biomass was obtained by summing respective section masses. For details of the component and total biomass calculation process, refer to [12]. A summary of the data used in this study is presented in Table 1.

2.3. Methods

We used two different approaches to estimate total and component biomass in Western hemlock and red alder trees. In the first approach, we fit a simultaneous system of component biomass equations with a seemingly unrelated regression (SUR). In the second approach, total AGB was first predicted as a function of DBH and total tree height using a linear model. The proportion of biomass in different aboveground components was predicted using beta, Dirichlet, and MLL regressions. The actual amount of component biomass was then obtained by distributing the predicted value of total AGB using predicted proportions in different regressions.

2.3.1. Seemingly Unrelated Regression (SUR)

The SUR model was first introduced by [14] and by [15] in biomass estimations to ensure the additivity of nonlinear biomass equations. Usually, the component models are fitted as a system of equations, and the equation for total AGB is constrained such that the prediction from the total AGB equation is compatible with the sum of the predicted values of the component biomass equations. Since the error terms in different component biomass equations are correlated, a SUR allows more efficient parameter estimation than an ordinary least squares regression [15]. We fitted a nonlinear system of equations using a constrained SUR. The equations for component and total AGB were in the following form:
B O L = e x p ( a 11 + a 12 X 1 + a 13 X 2 ) + e 1
B R K = e x p ( a 21 + a 22 X 1 + a 23 X 2 ) + e 2
B C H = e x p ( a 31 + a 32 X 1 + a 33 X 2 ) + e 3
F O L = e x p ( a 41 + a 42 X 1 + a 43 X 2 ) + e 4
T O T = i , j e x p ( a i j + a i j X 1 + a i j X 2 ) + e 5
where, BOL, BRK, BCH, FOL, and TOT are bole, bark, branch, foliage, and total AGB, respectively, aij, (i = 1, 2, 3, 4, and j = 1, 2, 3) are parameters to be estimated from the data, and X1 and X2 are natural logarithms of DBH and total tree height, respectively. The SAS procedure PROC MODEL [16] was used to fit SUR models.

2.3.2. Beta Regression

A normal error regression assumes that the dependent variable lies in the real line and therefore may not be appropriate when the dependent variable is restricted to (0, 1) intervals such as percentage, proportion, and fraction or rate. If the dependent variable is a univariate continuous variable bounded between 0 and 1; the obvious choice of error distribution is the beta distribution because it has the same bounds. Beta regression in forestry has been used by [17] to model the percentage of canopy cover, by [18] to estimate the riparian percentage of shrub cover, and by [19] in modeling the percentage of tree canopy cover. Recently, [12] used the beta regression to estimate the proportion of biomass in different aboveground components in Douglas fir and lodgepole pine trees.
Beta density for modeling purposes has been described in the form of Equation (6) by [20].
f ( y ;   μ ,   ϕ ) = Γ ( ϕ ) Γ ( μ ϕ ) Γ ( ( 1 μ ) ϕ ) y μ ϕ 1 ( 1 y ) ( 1 μ ) ϕ 1 ,   0 < y < 1
where, 0 < μ < 1 and φ > 0, with E(y) = μ and V a r ( y ) = μ ( 1 μ ) 1 + ϕ .
Beta regression is an extension of generalized linear models and ensures the logical prediction of percentages [17]. The beta regression model can be written as:
g ( μ i ) = x i T β = η i
where, g(·) is strictly the increasing double differentiable link function that maps (0, 1) into the real line ℝ, x i = ( x i 1 , , x i k ) T is a vector of k explanatory variables, β = ( β 1 , , β k ) T is a vector of regression coefficients, and η i is a linear predictor (i.e., η i = β 1 x i 1 + + β k x i k , usually x i 1 = 1 for all i, so that the model has an intercept) [21]. The logit link function g ( μ ) = log ( μ 1 μ ) was used; thus, the predicted values are obtained as μ i = e x p ( η i ) 1 + exp ( η i ) . Parameter estimation is based on the method of maximum likelihood and was performed in R 3.2.3 [22] with function betareg in library betareg [21].

2.3.3. Dirichlet Regression

The Dirichlet regression is useful when the dependent variable is a vector of proportions that represent the components as a percentage of the total so that the component proportions sum to 1. It is assumed that the dependent variable follows a Dirichlet distribution, a multivariate generalization of the beta distribution having the general form:
f ( y ; α ) = 1 B ( α ) c = 1 C y c ( α c 1 )
where αc > 0, ∀ c are the shape parameters for each components, y c ( 0 , 1 ) and c = 1 C y c = 1 . B ( α c ) = c = 1 C Γ ( α c ) Γ ( c = 1 C α c ) is the multinomial beta function, and Γ(·) is the gamma function. The Dirichlet distribution reduces to the beta distribution when there are only two components. [23] used similar parameterization of [20] to represent the Dirichlet distribution as follows:
f ( y ; μ ,   ϕ ) = 1 B ( μ ϕ ) c = 1 C y c ( μ c ϕ 1 )
where 0 < μc < 1 and φ > 0 ( μ c = α c ϕ and ϕ = α 0 = c = 1 C α c are mean and precision parameters, respectively). The regression model for mean is then formulated as follows:
g μ ( μ c ) = η μ c = X β c
where g μ ( · ) is the link-function; with the logit link function, the predicted values are calculated as μ r = 1 1 + a = 1 C exp ( X β a ) for the reference component and μ c = exp ( X β a ) 1 + a = 1 C exp ( X β a ) for other components. For the details of parameterization of Dirichlet distribution and the method of parameter estimation in the Dirichlet regression, refer to [23]. The Dirichlet regression was performed in R 3.2.3 [22] with function DirichReg in library DirichletReg [23], using component bole as the reference group (choice of a reference group is arbitrary).

2.3.4. Multinomial Log-linear Regression

The data formatting and model formulation in the MLL regression followed the same approach as described in [12]. Four components of aboveground biomass (bole, bole bark, branch, and foliage) were first set to four nominal values. Then, the models to predict proportions of total tree biomass found in bole, bole bark, branch, and foliage were fit simultaneously using a multinomial logit model. The predicted probabilities of multinomial logit fit can be considered as the proportion of biomass in each component [8]. The resulting prediction equations for component proportions are:
p B O L = 1 1 + e ( a 1 + a 2 X 1 + a 3 X 2 ) + e ( b 1 + b 2 X 1 + b 3 X 2 ) + e ( c 1 + c 2 X 1 + c 3 X 2 )
p B R K = e ( a 1 + a 2 X 1 + a 3 X 2 ) 1 + e ( a 1 + a 2 X 1 + a 3 X 2 ) + e ( b 1 + b 2 X 1 + b 3 X 2 ) + e ( c 1 + c 2 X 1 + c 3 X 2 )
p B C H = e ( b 1 + b 2 X 1 + b 3 X 2 ) 1 + e ( a 1 + a 2 X 1 + a 3 X 2 ) + e ( b 1 + b 2 X 1 + b 3 X 2 ) + e ( c 1 + c 2 X 1 + c 3 X 2 )
p F O L = e ( c 1 + c 2 X 1 + c 3 X 2 ) 1 + e ( a 1 + a 2 X 1 + a 3 X 2 ) + e ( b 1 + b 2 X 1 + b 3 X 2 ) + e ( c 1 + c 2 X 1 + c 3 X 2 )
where pBOL, pBRK, pBCH, and pFOL are proportions of total aboveground biomass in bole, bark, foliage, and branch, respectively; X1 = DBH; X2 = total tree height; and ai, bi, ci (i = 1, 2, 3) are regression coefficients to be estimated from the data. The MLL regression was performed in R 3.2.3 [22] with function multinom in library nnet [24], using component bole as the reference group. The amount of biomass present in each component was used as the frequency weight.

2.4. Evaluation

Performance of these methods were compared using bias, bias percent, RMSE, RMSE percent, and R2 values calculated as follows:
B i a s = i = 1 n ( y i y i ^ ) n
B i a s   P e r c e n t = 100 × B i a s y ¯
R M S E = i = 1 n ( y i y i ^ ) 2 n
R M S E   P e r c e n t = 100 × R M S E y ¯
R 2 = 1 i = 1 n ( y i y ^ i ) 2 i = 1 n ( y i y ¯ ) 2
where yi and y i ^ are the observed and predicted biomass, respectively, in the ith tree, and y ¯ is the average biomass of n trees.

3. Results and Discussion

The majority of the aboveground biomass was contained in the bole component for both species. It ranged from 63.9% to 83.1% (74.7% on average) of AGB in Western hemlock trees and from 66.2% to 86.6% (77.4% on average) in red alder trees. Bark accounted for the smallest proportion of AGB in Western hemlock, ranging from 3.5% to 5.3%, whereas, for red alder, foliage was the component with the smallest proportion, accounting for only 0.2% to 2.6% of AGB. Branch and foliage biomass in Western hemlock ranged from 8.4% to 23.6% and 3.1% to 9.4%, respectively. In red alder trees, branch and bark biomass ranged from 4.7% to 27.5% and 3.8% to 8.4% of AGB, respectively. All regression methods to predict biomass were applied to both species.

3.1. Seemingly Unrelated Regression (SUR)

The estimated correlation matrix of residuals obtained from the SUR model fit for Western hemlock (corrWH) and red alder (corrRA) biomass equations were:
c o r r W H ( e i ) = [ 1 0.17 0.84 0.75 0.71 1 0.10 0.30 0.40 1 0.97 0.21 1 0.10 1 ]
and
c o r r R A ( e i ) = [ 1 0.18 0.60 0.31 0.95 1 0.50 0.69 0.003 1 0.22 0.81 1 0.21 1 ]
Both positive and negative correlations among model residuals were observed. Cross-equation error correlation in Western hemlock models ranged from −0.84 (between branch and bole) to 0.97 (between branch and foliage), whereas, in red alder models, it ranged from −0.50 (between branch and total) to 0.95 (between bole and total). Parameter estimates and their approximate standard errors for SUR models are given in Table 2. The foliage model for red alder performed very poorly (AdjustedR2 = 0.30) compared to the models for other components. We do not have a good explanation for the smaller variation indicated by the foliage model, but it is possible that the smaller values of foliage biomass compared to the biomass in other components impacted the estimation. There are two options one could achieve foliage biomass estimation: (1) fit the foliage model outside of the system or (2) combine the foliage and branch wood as one component, which is probably less desirable. When the foliage model for red alder was fitted outside of the system, the AdjustedR2 increased to 0.77 without impacting the fit of other components in the system of equations. The evaluation statistics produced by SUR models in estimating component and total aboveground biomass are presented in Table 3. Our evaluation statistics are based on the SUR system that includes the foliage biomass equation. It was evident that the prediction of biomass in Western hemlock was more accurate in terms of bias and RMSE compared to the prediction in red alder. More complex allometric equations are likely needed to improve the accuracy of biomass equations for hardwoods. Power models are commonly used in biomass estimation. However, the power models did not perform as well as the models we have used (Equations (1)–(5)). Due to the smaller sample size, we did not try very complex models. Even though biomass models are commonly faced with heterogeneous error variance, residual analysis in our study did not show any pattern of heteroscedasticity. The problem of heteroscedasticity, if present, can be resolved by using a weighted regression. Discussion on approaches to find an appropriate weight function for nonlinear biomass models can be found in [25]. We would like to note that some of the parameter estimates for red alder component models obtained from SUR are “unusual”, specifically coefficient a31. We tried several starting values to make sure that the minimization attained the global minimum. However, the parameter estimates were stable. One reason for “unusual” parameter estimates could be due to the smaller sample size available for model fitting.

3.2. Regressions to Predict Proportion

The estimation of proportion itself does not provide any value without the estimate for total aboveground biomass. Observed total aboveground biomass could have been used for the sample data, but that does not provide any significance for future applications. Therefore, a linear model relating total aboveground biomass with DBH and height (Equation (22)) was first fitted. Parameter estimates and the index of fit (AdjustedR2) of this model for both species are given in Table 4.
ln ( A G B ) = a + b ln ( D B H ) + c ln ( H T ) + e
AGB in original scale, using this model, can be calculated as A G B = a 0 D B H b H T c , where a 0 = C F × e x p ( a ) . AdjustedR2 of the original models were 0.9985 and 0.9653 for Western hemlock and red alder, respectively.
The proportion of biomass in different aboveground components were related to DBH and height through beta, Dirichlet, and MLL regressions. The independent variables were used in their original form in proportion models because they provided better fit compared with the models with log-transformed independent variables. In beta regression, different component proportions are fitted independently of each other. The PseudoR2 (an R2-like measure calculated based on estimated likelihood) values of beta regression ranged from 0.3562 (bark model) to 0.8305 (bole model) for Western hemlock and 0.6093 (bark model) to 0.7164 (bole model) for red alder trees (Table 5). The evaluation statistics produced by the beta regression models are given in Table 6. The root mean squared errors of the beta regression models for Western hemlock were similar to those obtained from the SUR models. However, for the red alder trees, the results were variable. Foliage and bark models in beta regression reduced RMSE by 25% and 6%, but, for branch and bole biomass, the beta regression produced RMSEs that were respectively 22% and 3% higher than that produced by the SUR models.
In Dirichlet and MLL regressions, the component proportions are fitted simultaneously, which ensures that the predicted proportions sum to 1. Parameter estimates obtained from the Dirichlet regressions were different from those obtained from beta regression (Table 5 vs. Table 7). Similar to beta regression, the amount of variation explained by the Western hemlock bark model in the Dirichlet regression was very low (0.2788) compared to other models (0.8323 and 0.8665 for foliage and branch models, respectively). The evaluation statistics produced by the Dirichlet regression are presented in Table 8. RMSEs for the Dirichlet regression models for Western hemlock were within 15% for all components. The RMSEs for bole and bark biomass estimation was 12.5% and 11.8%, respectively, for red alder trees, whereas the RMSEs for foliage and branch biomass estimation was 52.3% and 54.4%, respectively.
The parameter estimates obtained from the MLL regression are presented in Table 9, and evaluation statistics are given in Table 10. Even though the form of the prediction equations for Dirichlet and MLL regressions are the same, parameter estimates obtained from these two methods were different (Table 7 vs. Table 9). In comparison to the Dirichlet regression, the MLL models reduced RMSEs by 1.7%, 2.9%, 1.0% and 0.4% for foliage, bark, branch, and the bole biomass estimation of Western hemlock. For the red alder trees, the MLL regression models reduced RMSEs by 6.2% and 15.1% for foliage and branch biomass estimation but it produced 1.1% and 2.6% higher RMSE for bark and bole biomass estimation, respectively. Figure 2 shows the complete comparison of percentage root mean squared error obtained by using different methods to estimate components of AGB.

4. Conclusions

We fitted DBH- and height-based biomass equations for Western hemlock and red alder trees using different regression approaches. Even though a method that is consistently better than others is desired, none of the methods we used was a clear winner when all methods (SUR, beta, Dirichlet, and MLL) were compared to one another. Accuracy of these methods differed between species, with higher root mean square error being produced in red alder trees, for which more complex models than the models for conifers might be required. Within species, the accuracy of the equation for bole biomass was better than the equations for other components. Total AGB can be estimated with high accuracy using a simple linear model that related the logarithm of AGB to the logarithms of DBH and height (AdjustedR2 0.99 and 0.97, for the untransformed model, and RMSE 2.47 and 18.82% for Western hemlock and red alder, respectively) and distributed to different components using the proportions predicted through beta, Dirichlet, and MLL regressions.
The beta regression can be used to fit independent proportion models, and Dirichlet and MLL regressions can be used to simultaneously fit proportion models. Predicted values of proportions in beta regression may not sum to 1. In that case, one can fit C − 1 equations, where C is the number of components, and calculate the proportion of remaining components by subtraction or, alternatively, by dividing the predicted proportion in each component by the sum of all predicted proportions. Methods to predict proportions are promising because they satisfy the assumption of error distribution. The measures of fit obtained from these methods (R2 and PseudoR2) were very high compared to the R2 values of the Jenkins equations for component ratios. The Dirichlet and MLL regressions have an additional attractive property of the unit sum of predicted proportions. We further prefer the Dirichlet regression to MLL because MLL requires a non-standard data processing and actually produces the probabilities of a nominal variable occurring [8]. On the other hand, given the fact that more than 70% of biomass, on average, is contained in the main stem, the choice of biomass equations should be governed by the objective of management. For our relatively small dataset, the methods to predict proportion were unbiased and should be tested with a larger sample size.

Acknowledgments

This project received financial support from the Forest Inventory and Analysis Unit of the US Forest Service for the collection and analyses of data and during the writing of the manuscript. The findings and conclusions in the paper are those of the authors and do not necessarily represent the views of the Forest Inventory and Analysis Unit of the USDA. Any use of trade names is for description purposes only and does not imply endorsement by the authors. The funders had no role in the decision to publish, or in the preparation of the manuscript. We thank Andy Gray and James Westfall for their support and encouragement.

Author Contributions

The authors contributed equally to this work. Krishna P. Poudel and Hailemariam Temesgen conceived and designed the model forms and comparisons; Krishna P. Poudel analyzed the data and performed the experiments; Krishna P. Poudel and Hailemariam Temesgen wrote the paper.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Jenkins, J.C.; Chojnacky, D.C.; Heath, L.S.; Birdsey, R.A. National-scale biomass estimators for United States tree species. For. Sci. 2003, 49, 12–35. [Google Scholar]
  2. Heath, L.S.; Hansen, M.; Smith, J.E.; Miles, P.D.; Smith, B.W. Investigation into calculating tree biomass and carbon in the FIADB using a biomass expansion factor approach. In Proceedings of the Forest Inventory and Analysis (FIA) Symposium 2008, Park City, UT, USA, 21–23 October 2008; McWilliams, W.H., Moisen, G.G., Czaplewski, R.L., Eds.; USDA Forest Service, Rocky Mountain Research Station: Fort Collins, CO, USA, 2008. [Google Scholar]
  3. Woodall, C.; Heath, L.S.; Domke, G.M.; Nichols, M.C. Methods and Equations for Estimating Aboveground Volume, Biomass, and Carbon for Trees in the US Forest Inventory; Gen. Tech. Rep. NRS-GTR-88; U.S. Department of Agriculture, Forest Service: Newtown Square, PA, USA, 2010; p. 30.
  4. EPA. Inventory of U.S. greenhouse gas emissions and sinks: 1990–2013; United States Environmental Protection Agency: Washington, DC, USA, 2015.
  5. Zhou, X.; Hemstrom, M.A. Estimating Aboveground Tree Biomass on Forest Land in the Pacific Northwest: A Comparison of Approaches; USDA Forest Service, Pacific Southwest Research Station: Portland, OR, USA, 2009.
  6. Miles, P.D.; Smith, W.B. Specific Gravity and Other Properties of Wood and Bark for 156 Tree Species Found in North America; USDA Forest Service: Newton Square, PA, USA, 2009.
  7. Raile, G.K. Estimating Stump Volume; USDA Forest Service: St Paul, MN, USA, 1982.
  8. Boudewyn, P.; Song, X.; Magnussen, S.; Gillis, M. Model-based, Volume-to-biomass Conversion for Forested and Vegetated Land in Canada; Natural Resources Canada, Canadian Forest Service, Pacific Forestry Center: Victoria, British Columbia, 2007. [Google Scholar]
  9. Westfall, J.A.; MacFarlane, D.W.; Weiskittel, A.R. Biomass measurement and modeling challenges for hardwood species in the northern region. In Proceedings of the Moving from Status to Trends: Forest Inventory and Analysis (FIA) Symposium 2012, Baltimore, MD, USA, 4–6 December 2012.
  10. Temesgen, H.; Affleck, D.; Poudel, K.P.; Gray, A.; Sessions, J. A review of the challenges and opportunities in estimating above ground forest biomass using tree-level models. Scand. J. Forest For. Res. 2015, 30, 326–335. [Google Scholar] [CrossRef]
  11. Picard, N.; Rutishauser, E.; Ploton, P.; Ngomanda, A.; Henry, M. Should tree biomass allometry be restricted to power models? For. Ecol. Manag. 2015, 353, 156–163. [Google Scholar] [CrossRef]
  12. Poudel, K.P.; Temesgen, H. Methods for estimating aboveground biomass and its components for Douglas fir and lodgepole pine trees. Can. J. For. Res. 2016, 46, 77–87. [Google Scholar] [CrossRef]
  13. Niemiec, S.S.; Ahrens, G.R.; Willits, S.; Hibbs, D.E. Hardwoods of the Pacific Northwest. West. Hardwoods 1995, 32, 115. [Google Scholar]
  14. Zellner, A. An efficient estimation of seemingly unrelated regression equations and tests of aggregation bias. J. Amer. Stat. Assoc. 1962, 57, 500–509. [Google Scholar] [CrossRef]
  15. Parresol, B.R. Additivity of nonlinear biomass equations. Can. J. For. Res. 2001, 31, 865–878. [Google Scholar] [CrossRef]
  16. SAS. Base SAS 9.4 Procedures Guide: Statistical Procedures, 2nd ed.; SAS Institute Inc: Cary, NC, USA, 2013. [Google Scholar]
  17. Korhonen, L.; Korhonen, K.T.; Stenberg, P.; Maltamo, M.; Rautiainen, M. Local models for forest canopy cover with beta regression. Silva Fenn. 2007, 41, 671. [Google Scholar] [CrossRef]
  18. Eskelson, B.N.; Madsen, L.; Hagar, J.C.; Temesgen, H. Estimating Riparian understory vegetation cover with Beta regression and copula models. For. Sci. 2011, 57, 212–221. [Google Scholar]
  19. Coulston, J.W.; Moisen, G.G.; Wilson, B.T.; Finco, M.V.; Cohen, W.B.; Brewer, C.K. Modeling percent tree canopy cover: A pilot study. Photogramm. Eng. Remote Sens. 2012, 78, 715–727. [Google Scholar] [CrossRef]
  20. Ferrari, S.; Cribari-Neto, F. Beta regression for modelling rates and proportions. J. App. Stat. 2004, 31, 799–815. [Google Scholar] [CrossRef]
  21. Cribari-Neto, F.; Zeileis, A. Beta Regression in R. J. Stat. Soft. 2010, 34, 1–24. [Google Scholar] [CrossRef]
  22. R Core Team. R: A language and environment for statistical computing. Available online: http://www.R-project.org/ (accessed on 9 March 2015).
  23. Maier, M.J. DirichletReg: Dirichlet Regression for Compositional Data in R. Available online: http://epub.wu.ac.at/4077/ (accessed on 29 January 2014).
  24. Venables, W.N.; Ripley, B.D. Modern Applied Statistics with S, 4th ed.; Springer: New York, NY, USA, 2002. [Google Scholar]
  25. Zeng, W.S.; Tang, S.Z. Bias correction in logarithmic regression and comparison with weighted regression for nonlinear models. For. Res. 2011, 23, 137–143. [Google Scholar] [CrossRef] [Green Version]
Figure 1. Map of the study area. Location of stands from which sample trees were selected are indicated by stars.
Figure 1. Map of the study area. Location of stands from which sample trees were selected are indicated by stars.
Forests 07 00088 g001
Figure 2. Comparison of root mean square error percent obtained by using beta, Dirichlet, multinomial log-linear, and seemingly unrelated regressions to estimate component of aboveground biomass in (A) Western hemlock and (B) red alder trees.
Figure 2. Comparison of root mean square error percent obtained by using beta, Dirichlet, multinomial log-linear, and seemingly unrelated regressions to estimate component of aboveground biomass in (A) Western hemlock and (B) red alder trees.
Forests 07 00088 g002
Table 1. Summary statistics of tree data used in this study. CV is the coefficient of variation.
Table 1. Summary statistics of tree data used in this study. CV is the coefficient of variation.
VariableMinimumMeanMaximumCV
Western hemlock
Foliage (kg)10.077.6184.00.82
Branch (kg)20.0236.7609.00.97
Bark (kg)4.062.1147.00.76
Bole (kg)72.01122.72512.00.76
Total (kg)106.01499.03446.00.78
DBH (cm)18.044.769.90.40
Total Height (m)14.431.339.80.29
Red alder
Foliage (kg)1.03.515.01.06
Branch (kg)15.069.4445.01.64
Bark (kg)3.024.885.00.91
Bole (kg)53.0303.91129.00.95
Total (kg)80.0401.61674.01.05
DBH (cm)16.327.151.80.37
Total Height (m)12.823.731.60.23
Table 2. Parameter estimates and their approximate standard errors (SE) for the component and total aboveground biomass models for red alder and Western hemlock trees. Model forms for component and total aboveground biomass (AGB) are given in Equations (1)–(5).
Table 2. Parameter estimates and their approximate standard errors (SE) for the component and total aboveground biomass models for red alder and Western hemlock trees. Model forms for component and total aboveground biomass (AGB) are given in Equations (1)–(5).
ParameterWestern HemlockRed Alder
EstimateApprox. SEEstimateApprox. SE
a11−3.494380.45210−5.518550.91800
a121.528030.067201.654520.14530
a131.313160.141601.737200.39850
a21−6.032970.61420−5.083271.24330
a221.870540.100901.473810.27690
a230.825380.179001.066420.57770
a31−7.651330.96880−29.016007.24370
a323.853020.211802.328640.89350
a33−0.613110.206207.507813.00380
a41−7.294481.145500.930473.18410
a422.904390.217402.037551.79700
a430.055290.29450−1.896642.16760
Table 3. Evaluation statistics (bias, bias percent, root mean square error (RMSE), and RMSE percent) in component biomass estimation in Western hemlock and red alder trees using seemingly unrelated regression (SUR).
Table 3. Evaluation statistics (bias, bias percent, root mean square error (RMSE), and RMSE percent) in component biomass estimation in Western hemlock and red alder trees using seemingly unrelated regression (SUR).
ComponentBias (kg)Bias PercentageRMSE (kg)RMSE Percentage
Western Hemlock
Bole−20.26−1.8063.455.65
Bark−0.03−0.064.336.98
Branch14.566.1533.9014.32
Foliage5.076.5412.5916.23
Total−0.66−0.0434.972.33
Red Alder
Bole5.001.6426.828.83
Bark−2.16−8.733.9215.82
Branch10.9315.7521.9731.64
Foliage−1.88−53.742.9383.69
Total11.882.9643.7810.90
Table 4. Parameter estimates and their standard errors for the log–log model (Equation (22)) to estimate total aboveground biomass in Western hemlock and red alder trees. CF is the log-bias correction factor calculated as exp ( MSE 2 ) .
Table 4. Parameter estimates and their standard errors for the log–log model (Equation (22)) to estimate total aboveground biomass in Western hemlock and red alder trees. CF is the log-bias correction factor calculated as exp ( MSE 2 ) .
SpeciesParameter Estimates (S.E.)Adjusted-R2CF
abc
Western Hemlock−3.41345
(0.12227)
1.93633
(0.05889)
0.90878
(0.07164)
0.99861.0009
Red Alder−3.62175
(0.40260)
1.90612
(0.15135)
0.97674
(0.19756)
0.98081.0070
Table 5. Parameter estimates and pseudo-R2 values for the component biomass models for Western hemlock and red alder trees using beta regression.
Table 5. Parameter estimates and pseudo-R2 values for the component biomass models for Western hemlock and red alder trees using beta regression.
ModelParameter EstimatesPseudo-R2
InterceptDBHHT
Western Hemlock
Bole0.30314−0.020930.055540.8305
Bark−3.33911−0.004360.012430.3562
Branch−0.953800.02793−0.065250.8281
Foliage−1.855140.00911−0.044180.7244
Red Alder
Bole0.28216−0.043990.091470.7164
Bark−3.67702−0.020440.062540.6093
Branch−0.352530.06547−0.135260.6986
Foliage−2.154220.02285−0.129460.6472
Table 6. Evaluation statistics (Bias, bias percent, RMSE, and RMSE percent) in component biomass estimation in Western hemlock and red alder trees using beta regression.
Table 6. Evaluation statistics (Bias, bias percent, RMSE, and RMSE percent) in component biomass estimation in Western hemlock and red alder trees using beta regression.
ComponentBiasBias PercentageRMSERMSE Percentage
Western Hemlock
Bole−1.08−0.1062.415.56
Bark−0.18−0.296.079.78
Branch1.670.7029.3612.41
Foliage0.901.1512.5616.19
Red Alder
Bole0.340.1136.9712.17
Bark−0.41−1.672.429.75
Branch5.718.2337.2153.60
Foliage0.277.602.0558.72
Table 7. Parameter estimates for the component biomass models for Western hemlock and red alder trees from using the Dirichlet regression. Component bole was used as reference group for both species.
Table 7. Parameter estimates for the component biomass models for Western hemlock and red alder trees from using the Dirichlet regression. Component bole was used as reference group for both species.
ModelParameter Estimates
InterceptDBHHTR2
Western Hemlock
Bark−2.848350.00112−0.002580.2788
Branch−0.740790.02901−0.069690.8665
Foliage−1.468960.01408−0.056030.8323
Red Alder
Bark−3.16541−0.008860.037010.5689
Branch−0.253350.06685−0.138130.7397
Foliage−1.768090.03266−0.139950.7374
Table 8. Evaluation statistics (Bias, bias percent, RMSE, and RMSE percent) in component biomass estimation in Western hemlock and red alder trees using the Dirichlet regression.
Table 8. Evaluation statistics (Bias, bias percent, RMSE, and RMSE percent) in component biomass estimation in Western hemlock and red alder trees using the Dirichlet regression.
ComponentBiasBias PercentageRMSERMSE Percentage
Western Hemlock
Bole−0.89−0.0863.515.66
Bark−1.07−1.736.7510.87
Branch2.491.0529.5912.50
Foliage0.600.7812.0015.46
Red Alder
Bole0.220.0738.0112.51
Bark−1.06−4.302.9311.83
Branch6.799.7837.8654.54
Foliage−0.35−10.091.8352.31
Table 9. Parameter estimates for the component biomass models for Western hemlock and red alder trees using multinomial log-linear regression model. Component bole was used as reference group for both species.
Table 9. Parameter estimates for the component biomass models for Western hemlock and red alder trees using multinomial log-linear regression model. Component bole was used as reference group for both species.
ModelParameter Estimates
InterceptDBHHTR2
Western Hemlock
Bark−2.653120.00333−0.011910.2959
Branch−0.994680.02917−0.062610.8667
Foliage−1.824780.01372−0.045270.8326
Red Alder
Bark−3.07429−0.012580.036960.5495
Branch−0.407010.07676−0.143420.7327
Foliage−1.643220.05711−0.180880.6863
Table 10. Evaluation statistics (Bias, bias percent, RMSE, and RMSE percent) in component biomass estimation in Western hemlock and red alder trees using multinomial log-linear regression.
Table 10. Evaluation statistics (Bias, bias percent, RMSE, and RMSE percent) in component biomass estimation in Western hemlock and red alder trees using multinomial log-linear regression.
ComponentBiasBias PercentageRMSERMSE Percentage
Western Hemlock
Bole0.640.0659.085.26
Bark0.040.074.937.94
Branch0.370.1627.3211.54
Foliage0.080.1010.7013.79
Red Alder
Bole3.000.9945.8915.10
Bark0.210.833.1912.88
Branch2.343.3727.4039.47
Foliage0.051.401.6146.12

Share and Cite

MDPI and ACS Style

Poudel, K.P.; Temesgen, H. Developing Biomass Equations for Western Hemlock and Red Alder Trees in Western Oregon Forests. Forests 2016, 7, 88. https://doi.org/10.3390/f7040088

AMA Style

Poudel KP, Temesgen H. Developing Biomass Equations for Western Hemlock and Red Alder Trees in Western Oregon Forests. Forests. 2016; 7(4):88. https://doi.org/10.3390/f7040088

Chicago/Turabian Style

Poudel, Krishna P., and Hailemariam Temesgen. 2016. "Developing Biomass Equations for Western Hemlock and Red Alder Trees in Western Oregon Forests" Forests 7, no. 4: 88. https://doi.org/10.3390/f7040088

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop