|
|
||||||||
a Ghent University, Dep. Soil Management and Soil Care, Coupure links 653, B-9000 Gent, Belgium
b Belgian Nuclear Research Centre, Boeretang 200, B-2400 Mol, Belgium
Corresponding author (wim.cornelis{at}rug.ac.be)
| ABSTRACT |
|---|
|
|
|---|
Abbreviations: MAMD, mean of the absolute value of mean differences MD, mean difference Mr, mean of the Pearson correlation coefficient MRC, moisture retention curve MRMSD, mean of the root of mean squared differences PTF, pedotransfer function RMSD, root of mean squared difference SDRMSD, standard deviation of the root of mean squared differences
| INTRODUCTION |
|---|
|
|
|---|
Although the MRC is of great importance in present-day agricultural, ecological, and environmental soil research, it is not a readily available soil property. The main reason is that its measurement is expensive, time consuming, and labor intensive. Therefore, models have been developed to predict the MRC from more easily measurable and more readily available soil properties, like particle-size distribution, organic matter content, and dry bulk density. These models are referred to as pedotransfer functions (PTFs) (Bouma, 1989).
Since every PTF is developed on the basis of a database of a limited number of soil samples, it is not always clear to what extend these functions can be used in the case of soil conditions other than those under which they were developed. The aim of our study was, therefore, to evaluate (i) the general applicability and (ii) the prediction accuracy of some of the most commonly cited and some recently developed PTFs that use soil properties such as particle-size distribution (sand, silt, and clay content), organic matter or organic C content, and dry bulk density to predict the MRC. Pedotransfer functions that did not perform very well in previous studies or that needed more detailed information were not considered for evaluation.
Three main approaches to estimate the MRC are generally considered, which we combined in three groups. The Group 1 PTFs estimate the water content of the soil at certain matric potentials using multiple linear regression (Gupta and Larson, 1979; Rawls and Brakensiek, 1982) or artificial neural networks (Pachepsky et al., 1996). The Group 2 PTFs predict the parameters of a closed-form analytical equation such as the model of Brooks and Corey (1964) (Rawls and Brakensiek, 1985) or the van Genuchten equation (1980). This is done through multiple linear regression (Vereecken et al., 1989; Scheinost et al., 1997; Minasny et al., 1999; Wösten et al., 1999) or artificial neural networks (Pachepsky et al., 1996; Schaap and Leij, 1998; Minasny et al., 1999; Schaap et al., 1998, 1999). The Group 3 PTFs are based on a physical-conceptual approach of the water retention phenomenon (Arya and Paris, 1981; Haverkamp and Parlange, 1986) and use fractal mathematics and scaled similarities (Tyler and Wheatcraft, 1989; Comegna et al., 1998).
| MATERIALS AND METHODS |
|---|
|
|
|---|
Rawls and Brakensiek (1982) estimated moisture content within the same matric potential range using the same soil properties. Their data originate from 2543 horizons from across the USA. To increase the accuracy of the regression equations, the moisture content at -1500 kPa or at both -33 and -1500 kPa was introduced and added to the regression equations. However, the latter equations were not retained in our study, since these extra data are mostly not available.
Group 2 Pedotransfer Functions
Pedotransfer functions that estimate retention function parameters have a greater ability to be used in flux models than PTFs that predict water content at certain matric potentials because of the continuous result (Tietje and Hennings, 1993; van den Berg et al., 1997). Rawls and Brakensiek (1985) developed one of the few PTFs that use the Brooks and Corey model (1964)
![]() | (1) |
is the moisture content (m3 m-3),
s is the moisture content at saturation (m3 m-3),
r is the residual moisture content (m3 m-3),
is the matric potential (kPa),
b is the bubbling pressure (kPa), and
is a pore-size distribution index (dimensionless). Their regression equations were formulated for natural soils and use porosity, clay content, and sand content as input variables.
Alternatively, the van Genuchten equation (1980),
![]() | (2) |
(kPa-1), and n and m (dimensionless) are regression coefficients, has an inflection point that allows better performance than the Brooks and Corey model, particularly near saturation (van Genuchten and Nielsen, 1985). Therefore, the latter is the most frequently used model for the MRC. Vereecken et al. (1989) used multiple linear regression with sand and clay content, organic C content, and bulk density data from undisturbed samples of 182 horizons of 40 Belgian soil series to solve for the parameters of the van Genuchten equation.
Wösten et al. (1994) developed a class PTF, referred to as the Staring series. Although these series are not a PTF in a strict sensethe van Genuchten parameters are not predicted from a function as such, but their calculated values are given in tabular format for different soil unitsit was retained here as it is often referred to. The Staring series is based on 620 soil moisture retention curves from 36 different Dutch soil units. The required information is soil unit, which can be obtained from clay, silt, sand, and organic matter content or directly from soil maps, and from location within the profile (i.e., topsoil or subsoil).
More recently, Scheinost et al. (1997) developed a PTF that is particularly designed for a highly variable landscape. Their functions are based on 87 undisturbed soil samples collected in northern Germany and needs particle-size distribution, organic C content, and porosity. The van Genuchten parameters
s,
r, and m are predicted by the multiple linear regression equations found by Vereecken et al. (1989). The parameters
and n, however, are related to the geometric mean particle diameter and its standard deviation, in an attempt to include some physical meaning to the PTF. These two parameters were predicted using nonlinear regression to all data, and this is referred to by Minasny et al. (1999) as extended nonlinear regression.
Schaap et al. (1999) predicted the parameters of the van Genuchten equation by using an artificial neural network. Their PTF was based on 1209 soil samples, originating from 30 sources in the USA. One-half of them were used as a calibration set, the others as a validation set. The input variables of the Schaap et al. (1999) PTF retained in this study are clay, silt, and sand content, and dry bulk density. The other PTFs proposed by Schaap et al. (1999), which enable the prediction of the MRC with less or more input variables, were not evaluated here.
Finally within Group 2, the PTFs of Wösten et al. (1999) were evaluated. These authors used multiple linear regression to predict the parameters of the van Genuchten equation with data from 4030 horizons from all over Europe. A class as well as a continuous PTF was developed. The class PTF, referred to as the HYPRES series, was obtained by subdividing the database into 11 soil textural classes, and it gives the van Genuchten parameters in tabular format, whereas the continuous PTF does not consider any grouping. Input data needed are sand, silt, and clay content; bulk density; organic matter content; and a qualitative variable, indicating whether topsoil or subsoil is considered. Note that the terms class and continuous PTF used here have different definitions from those used by Minasny et al. (1999).
Group 3 Pedotransfer Functions
The physical-conceptual PTFs were not retained in this study because of the following reasons. First, the PTFs in this group require a particle-size distribution as detailed as possible. The use of only three classes reduces their performance considerably (Tietje and Tapkenhinrichs, 1993). Second, the model of Arya and Paris (1981) has been developed on a limited database, and hence the accuracy decreases significantly when extrapolating to other soils (Tietje and Tapkenhinrichs, 1993). Third, Bird et al. (1996) and Bird and Dexter (1997) noted that the errors present in PTFs based on the fractal dimension, as is the case with, for example, the PTF of Tyler and Wheatcraft (1989), can be quite significant. Finally, this group of PTFs performed poorly in the evaluation of Tietje and Tapkenhinrichs (1993), due to the reasons mentioned above.
Table 1 gives an overview of the input data needed for the different PTFs retained in this study. The data ranges of the PTF's calibration data sets are shown in Table 2.
|
|
(
) data set was obtained, the replicates were mixed, and organic matter content and particle-size distribution in three fractions were determined on the mixed soil samples. Particle-size distribution was determined with the pipette method (Gee and Bauder, 1986). Since the German classification, as used by Scheinost et al. (1997), considers a boundary of 63 µm as the separation between silt and sand, our silt and sand content values had to be converted. This was done by loglinear interpolation (Scheinost et al., 1997). Organic matter was determined by means of the Walkley and Black (1934) method.
|
Comparison of Measured and Predicted Moisture Retention Curves
To compare the measured and predicted retention curves, two approaches can be considered. In a first approach, the chosen validation indices are numerically calculated based on predicted and measured water contents at certain matric potentials. In a second approach, measured and predicted functions are integrated between top and bottom boundaries. However, this requires a continuous function to be fitted to the measured
(
) data pairs. In this study, only the second approach was retained for reasons as outlined below. First, it smooths the sometimes irregular data points, which can occur even if the MRC is based on two to six replications. Second, the matric potentials applied in our measurements were different from those at which the moisture content was predicted in the Group 1 PTFs. Third, integration is more objective than simple summation, because the latter depends on the choice of the matric potentials that were applied to determine the corresponding moisture content experimentally. A PTF can perform very well in only one part of the MRC, and when the chosen matric potentials fall mainly in that part (which is not known a priori), this will result in a better validation. Finally, for modeling purposes a continuous function is required anyhow. The van Genuchten equation was selected, for reasons mentioned earlier. As it is a smooth continuous function, it is useful for numerical modeling purposes and can be combined analytically with pore-size distribution models of hydraulic conductivity (e.g., Mualem, 1976). It is, therefore, used in many soil water and solute transport computer-simulation models like HYDRUS (Vogel et al., 1996), WAVE (Vanclooster et al., 1996), and SWAP (van Dam et al., 1997).
The prediction accuracy of the PTFs for a given soil was determined by three complementary indices: the mean difference MD (m3 m-3) and the root of the mean squared difference RMSD (m3 m-3) between the measured and predicted MRC, and the Pearson correlation coefficient r. Let 
mi be the measured moisture retention function for soil i [i.e., a continuous van Genuchten curve fitted to the discrete set of measured
(
) values], and 
Pi the predicted moisture content function for soil i (i.e., a continuous Brooks and Corey or van Genuchten curve as predicted by the PTF or obtained by curve fitting in case of the Group 1 PTFs), where i = 1, 2, ... n, with n the total number of soils in the evaluation data set. Then the MD (m3 m-3) for soil i was calculated as
![]() | (3) |
Use of the MD allows evaluating the bias of the MRC and its absolute value should be as small as possible. However, underestimation or overestimation of the MRC may cancel out resulting in a low absolute value, even if the fit is poor.
The RMSD (m3 m-3) for soil i was calculated as
![]() | (4) |
The RMSD is always positive and the model's performance increases the more it approaches zero. It is an indication for the overall error of the evaluated function.
The Pearson correlation coefficient r (dimensionless) for soil i was calculated as
![]() | (5) |
mi = 
ba
mid
is the mean moisture content of the measured MRC for soil i, and
pi = 
ba
Pid
is the mean moisture content of the predicted MRC for soil i. This index is a measure for the linearity between measurements and predictions. An r value that approaches 1 means that the measured and predicted data pairs are linearly located around the line of perfect agreement (or 1:1 line). Hence, the predicted curve is of comparable shape as the measured curve.
The integration boundaries a and b are set here to log(0.25 kPa) and log(1500 kPa), respectively, which is within the range of the measured MRC. The use of log|
| was preferred to avoid assigning too much weight to more negative matric potentials (Tietje and Hennings, 1993). The van Genuchten parameters in 
mi were obtained by means of the MS Windows version of the RETC-code (van Genuchten et al., 1991), which uses the nonlinear least-squares analysis algorithm of Marquardt (1963) minimizing the sum of squared residuals. The five van Genuchten parameters were calculated without imposing any restrictions on the unknown parameters, as this gives superior fits (van Genuchten et al., 1991). In order to determine 
Pi in the case of the two Group 1 PTFs,
s was kept constant and it was given a value equal to the total pore volume calculated from bulk density (which is an input parameter of both PTFs). When total pore volume was lower than the moisture content at -4 kPa matric potential,
s was set equal to a value slightly higher than
-4kPa. The computation of the van Genuchten function by means of the artificial neural network of Schaap et al. (1999) was executed with Rosetta 1.0, a MS Windows program developed by M.G. Schaap.
To illustrate the complementary character of these indices, the MRCs as calculated from the PTFs of Rawls and Brakensiek (1982), Vereecken et al. (1989), and Scheinost et al. (1997), and the MRC obtained by curve fitting Eq. [2] to the measured data are given for a sandy loam soil (Fig. 2)
. When comparing the PTFs of Vereecken et al. (1989) and Scheinost et al. (1997), the absolute value of MD calculated for the Vereecken PTF was somewhat higher than the Scheinost PTF's absolute value of MD (0.0189 and 0.0154 m3 m-3, respectively). Nevertheless, the RMSD of the Vereecken PTF was more than twice as low as the RMSD of the Scheinost PTF (0.0272 and 0.0547 m3 m-3, respectively). The r values were of the same order of magnitude (0.9948 and 0.9899, respectively). On the other hand, the Rawls and Brakensiek (1982) and the Vereecken et al. (1989) PTF have comparable RMSD values (0.0283 and 0.0272 m3 m-3, respectively), but differ substantially in r value (0.9764 and 0.9948, respectively). Thus, although the global error is more or less the same, the MRC predicted by the Vereecken et al. (1989) PTF follows the shape of the measured MRC better than does the Rawls and Brakensiek (1982) PTF. This is important for many purposes, for example, to calculate the differential water capacity C(d
/dh), a parameter that is often used to solve the well-known Richards equation for water transport numerically. The MD of the Rawls and Brakensiek (1982) PTF was 0.0118 m3 m-3, which is lower than the MD of the Vereecken et al. (1989) PTF. It may be clear from this example that the three validation indices are complementary and that use of only one index can lead to erroneous conclusions as regards the behavior of a PTF.
|
The validation indices described above enabled us to rank the PTFs. A final ranking was based on the mean of the rankings given to the PTFs for each validation index separately. Each validation index was given an equal weight; however, the mean of MD was not considered. A ranking was introduced since the evaluation of PTFs should not be based on one validation index only, but should take into account the different indices simultaneously (Whitmore, 1991; Boucneau et al., 1998). A low mean of RMSDs, for example, only indicates a good correspondence between measured and predicted MRCs, but does not say anything about underestimation or overestimation within the MRC.
Finally, the accuracy of the different PTFs can be depicted graphically by plotting measured vs. predicted water contents at certain matric potentials. In analogy with the different validation indices, the measured values correspond with water contents calculated from a fitted van Genuchten equation at the given matric potentials. Plots were drawn at
values of -0.3, -3, -10, -31, -98, and -1500 kPa. Note that -0.3 kPa corresponds with near saturation conditions. The
values -10 and -31 kPa are often considered as close to field capacity conditions of many soils, whereas -1500 kPa is close to the permanent wilting point of many crops (Cassel and Nielsen, 1986). The critical matric potentials at which many crops undergo water stress are in the order of magnitude of -98 kPa for too dry conditions and close to -3 kPa for too wet conditions (Tayler and Ashcroft, 1972; Wesseling, 1991).
| RESULTS AND DISCUSSION |
|---|
|
|
|---|
Evaluation of the Pedotransfer Functions
As was mentioned above, the measured MRCs correspond with a continuous van Genuchten curve fitted to the discrete set of measured
(
) data pairs. The R2 of all fitted curves was between 0.976 and 0.999 with a mean of 0.994 and a standard deviation of 0.005. These measured curves therefore represent the actual MRCs very well.
Although most of the PTFs were not applicable to all the soil samples of the evaluation data set, the PTFs were evaluated on both the range-dependent and the complete data set. The values calculated for the different validation indices are presented in Tables 3 and 4. Note that narrowing the evaluation data set did not alter the PTFs performance considerably when considering all validation indices, except for the PTF of Scheinost et al. (1997). This model showed some improvement in prediction accuracy if only those soil samples that fall within the original data set were used, which is 70% of our soil samples. Hence it moves one rank. The PTF of Rawls and Brakensiek (1985) and Schaap et al. (1999) on the other hand showed even higher values of the mean of the absolute values of MD and the mean of RMSD values, despite the canceling out of respectively 42 and 13% (see Table 2) of the soil samples in case of the range-dependent evaluation. Note, however, that the increase of the error could also be due to a fewer number of samples.
|
|
values. The same is true for the overestimation of the Scheinost et al. (1997) PTF. The Wösten et al. (1994) PTF underestimates mainly at moisture contents below -31 kPa. This was also observed for the Vereecken et al. (1989) PTF, which is in contradiction with findings of Kern (1995). He reported a slight tendency to overestimate water content at -1500 kPa matric potential in case of the Vereecken et al. (1989) PTF. The overestimation of the PTFs of Gupta and Larson (1979) and Rawls and Brakensiek (1982) is only pronounced near saturation, that is, at a matric potential of -0.3 kPa. However, the Gupta and Larson (1979) PTF also overestimates considerably at the dry end of the MRC, that is, at a matric potential of -1500 kPa. When comparing the mean of the absolute values of MD with the absolute value of the mean of MDs, and from Fig. 3 and 4, it is clear that the PTFs of Rawls and Brakensiek (1985), Scheinost et al. (1997), and Schaap et al. (1999) almost systematically underestimate or overestimate for all soils. The relative differences between the mean of the absolute values of MD and the absolute value of the mean of MDs are low. The PTF of Vereecken et al. (1989) and the continuous PTF of Wösten et al. (1999) show slight bias between the soils, whereas the bias observed for the three other PTFs is considerable.
|
|
As regards the mean of RMSDs, again the Vereecken et al. (1989) PTF shows the lowest values, meaning that the predicted MRC follows the measured MRC relatively well. By far the highest values result from the Rawls and Brakensiek (1985) and Schaap et al. (1999) PTFs. The other PTFs have intermediate values. Note that the Scheinost et al. (1997) PTF shows a considerable decline in the mean of RMSDs when restricting the ranges of the evaluation data set to those of the calibration data set.
The mean of r values reveals a somewhat different pattern in terms of the model's ranking. The correspondence between measured and predicted MRCs is still highest for the Vereecken et al. (1989) function and lowest for the Rawls and Brakensiek (1985) PTF. However, poor correspondence is observed for the PTF of Scheinost et al. (1997). Narrowing the evaluation data set has the highest impact on the Scheinost et al. (1997) function, although it only moves one rank. The other PTFs appear to show an intermediate correspondence.
As concerns the standard deviations of RMSDs, again the same trend can be perceived: the PTF of Vereecken et al. (1989) performs best, followed by the three Wösten et al. (1994)(1999) PTFs. This means that their performance is the least dependent on the soil type. Also the Schaap et al. (1999) PTF performs quite well as regards the standard deviation, despite high RMSD values. When restricting the Scheinost et al. (1997) PTF to the ranges of its calibration data set, only a slight improvement is observed, although its RMSD is relatively low. Hence this PTF has a low regularity.
From Fig. 3 and 4 it can be deduced that most of the models predict best near saturation (
= -0.3 kPa). Only the two Group 1 PTFs do not perform very well at the very wet end of the MRCs. This must be attributed to the relatively low matric potential (-4 kPa) at which these PTFs start to predict moisture content. Relatively good predictions can also be observed at the dry end of the MRCs (
= -1500 kPa). Compared with the other PTFs, the prediction error of the PTF of Schaap et al. (1999) is quite high at this dry end. All PTFs show, however, the highest errors at -10 and -31 kPa matric potential, which correspond with moisture conditions near field capacity. The exception here is the Rawls and Brakensiek (1982) PTF that performs worst near saturation. The low prediction accuracy that was observed for the Rawls and Brakensiek (1985) PTF when considering the different validation indices is not specifically due to the lower performance of the Brooks and Corey (1964) function. If this had been the case, prediction errors would only be large in the near-saturation range of the MRC. Figure 4 reveals that the discrepancies between measured and predicted moisture contents are also quite considerable in the drier range of the MRC.
Effect of Soil Properties on the Function's Performance
In order to determine the effect of organic matter content and bulk density on the performance of the PTFs, the soil samples were sorted according to the USDA soil classes. As Fig. 5
shows, a distinct relation between organic matter content and RMSD and between bulk density and RMSD could not be observed for the Vereecken et al. (1989) model. When applying a linear regression to the data, the R2 values were 0.002 for organic matter and 0.019 for bulk density. Comparable conclusions could be drawn with the other PTFs and the other validation indices.
|
|
| CONCLUSIONS |
|---|
|
|
|---|
It is worthwhile noting here that at least those PTFs that were developed and calibrated on soil samples collected in the Low Lands (Belgium and the Netherlands) perform the best. An explanation could be that the soil samples from our evaluation data set came from the same Low-Land population. The soils from the Low Lands that were considered in this study were formed under more or less the same conditions. One should, therefore, be careful when applying a PTF in a geographical area different from the one where the samples of the calibration data set were taken. Nevertheless, the Vereecken et al. (1989) PTF, for example, has proven to perform very well in studies where large data sets were used that were derived from different geographical areas (Tietje and Tapkenhinrichs, 1993; Kern, 1995; Schaap et al., 1998).
Finally, it can be concluded from this study that the simple multiple regression functions developed more than a decade ago by Vereecken and coworkers (1989) are still the best to predict the soil moisture retention curve from easily available soil properties. None of the evaluated pedotransfer functions that have been developed in recent years performs better.
| ACKNOWLEDGMENTS |
|---|
| NOTES |
|---|
|
|
|---|
Received for publication June 5, 2000.
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
Q. Shu, Z. Liu, and B. Si Characterizing Scale- and Location-Dependent Correlation of Water Retention Parameters with Soil Physical Properties Using Wavelet Techniques J. Environ. Qual., October 23, 2008; 37(6): 2284 - 2292. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Lilly, A. Nemes, W. J. Rawls, and Ya. A. Pachepsky Probabilistic Approach to the Identification of Input Variables to Estimate Hydraulic Conductivity Soil Sci. Soc. Am. J., January 11, 2008; 72(1): 16 - 24. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. M. Arya, D. C. Bowman, B. B. Thapa, and D. K. Cassel Scaling Soil Water Characteristics of Golf Course and Athletic Field Sands from Particle-Size Distribution Soil Sci. Soc. Am. J., January 11, 2008; 72(1): 25 - 32. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Nemes, W. J. Rawls, and Y. A. Pachepsky Use of the Nonparametric Nearest Neighbor Approach to Estimate Soil Hydraulic Properties Soil Sci. Soc. Am. J., February 2, 2006; 70(2): 327 - 336. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. M. Cornelis, M. Khlosi, R. Hartmann, M. Van Meirvenne, and B. De Vos Comparison of Unimodal Analytical Expressions for the Soil-Water Retention Curve Soil Sci. Soc. Am. J., October 27, 2005; 69(6): 1902 - 1911. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Vanderlinden, J. V. Giraldez, and M. Van Meirvenne Soil Water-Holding Capacity Assessment in Terms of the Average Annual Water Balance in Southern Spain Vadose Zone J., May 12, 2005; 4(2): 317 - 328. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. P. Chan and R. S. Govindaraju Estimating Soil Water Retention Curve from Particle-Size Distribution Data Based on Polydisperse Sphere Systems Vadose Zone J., November 1, 2004; 3(4): 1443 - 1454. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Gerard, M. Tinsley, and K. U. Mayer Preferential Flow Revealed by Hydrologic Modeling Based on Predicted Hydraulic Properties Soil Sci. Soc. Am. J., September 1, 2004; 68(5): 1526 - 1538. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. M. Cornelis, D. Gabriels, and R. Hartmann A Conceptual Model to Predict the Deflation Threshold Shear Velocity as Affected by Near-Surface Soil Water: I. Theory Soil Sci. Soc. Am. J., July 1, 2004; 68(4): 1154 - 1161. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. M. Cornelis, D. Gabriels, and R. Hartmann A Conceptual Model to Predict the Deflation Threshold Shear Velocity as Affected by Near-Surface Soil Water: II. Calibration and Verification Soil Sci. Soc. Am. J., July 1, 2004; 68(4): 1162 - 1168. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Nemes, M. G. Schaap, and J. H. M. Wosten Functional Evaluation of Pedotransfer Functions Derived from Different Scales of Data Collection Soil Sci. Soc. Am. J., July 1, 2003; 67(4): 1093 - 1102. [Abstract |