SSSAJ Grow Your Career with SSSA
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


This Article
Right arrow Abstract Freely available
Right arrow Figures Only
Right arrow Full Text (PDF) Free
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via ISI Web of Science (3)
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Mueller, T. G.
Right arrow Articles by Shearer, S. A.
Right arrow Search for Related Content
PubMed
Right arrow Articles by Mueller, T. G.
Right arrow Articles by Shearer, S. A.
Agricola
Right arrow Articles by Mueller, T. G.
Right arrow Articles by Shearer, S. A.
Related Collections
Right arrow Soil Fertility and Productivity
Right arrow Spatial Distribution
Right arrow Geostatistics
Right arrow Spatial Variability
Right arrow Site-Specific Analysis
Published in Soil Sci. Soc. Am. J. 68:2042-2047 (2004).
© 2004 Soil Science Society of America
677 S. Segoe Rd., Madison, WI 53711 USA

Division S-8—Nutrient Management & Soil & Plant Analysis

Map Quality for Ordinary Kriging and Inverse Distance Weighted Interpolation

T. G. Muellera,*, N. B. Pusuluria, K. K. Mathiasa, P. L. Corneliusa, R. I. Barnhisela and S. A. Shearerb

a University of Kentucky, Dep. of Agronomy, N-122 Ag. Science North, Lexington, KY 40502
b Dep. of Biosyst. and Agric. Eng., University of Kentucky, 218 C.E. Barnhart, Lexington, KY 40546-0276

* Corresponding author (mueller{at}uky.edu)


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIAL AND METHODS
 RESULTS AND DISCUSSION
 CONCLUSIONS
 REFERENCES
 
The selection of a spatial interpolation methods will impact the quality of site-specific soil fertility maps. The objective of this study was to describe and predict the relative performance of inverse distance weighted (IDW) and ordinary kriging. Soil samples were collected on 30.5-m grids for fields in five Kentucky counties and analyzed for pH, buffer pH, P, K, Ca, and Mg. From these data sets, 61-m grid subsets were extracted. Data were interpolated with IDW and kriging procedures. Prediction efficiency (PE) was determined using an independent dataset (PEvalidation) and with cross-validation (PEcross-validation). Multiple stepwise regression was used to develop models that described the relative performance of ordinary kriging and IDW with statistical properties of the data. At the 30.5-m grid scale, the performance of ordinary kriging relative to IDW improved as the range of spatial correlation increased and fit of the semivariogram model improved. However, at the 61.0-m grid scale, the performance of ordinary kriging relative to IDW diminished as the degree of spatial structure increased and the fit of the semivariogram model improved. Alone, PEcross-validation poorly describes the performance of PEvalidation across locations, soil properties, and sampling intervals (r2 = 0.18). However, in combination with the range of spatial correlation, substantial variability at the 30.5-m grid scale was described for variables with sample semivariograms that reached plateaus (R2 = 0.61). In some situations, better decisions will be made regarding the use of these methods by considering the range of spatial correlation and cross-validation statistics.

Abbreviations: FULL, datasets created by combining the validation and 30.5-m grid datasets • IDW, inverse distance weighted • IDW1.2, IDW interpolation with a distance exponent of 1.2 • MSE, mean squared error • PE, prediction efficiency PEvalidation, PE determined using an independent data set • PEcross-validation, PE determined with cross-validation • {Delta}PEvalidation, difference in PE between ordinary kriging and IDW1.2 as determined using an independent validation dataset • {Delta}PEcross-validation, difference in PE between ordinary kriging and IDW1.2 as determined with cross-validation • r2semivariogram, F, the coefficient of determination for the relationship between semivariogram pairs and the semivariogram model values • RangeF, range of spatial correlation as determined with the FULL datasets • RangeS, range of spatial correlation as determined with the sample dataset RSV relative structural variability • RSVF, RSV as determined with the FULL datasets • RSVS, RSV as determined with the sample datasets • {Delta}RSV, RSVF – RSVS • SSFM, Site-specific fertility management


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIAL AND METHODS
 RESULTS AND DISCUSSION
 CONCLUSIONS
 REFERENCES
 
THE ECONOMICS OF site-specific fertility management (SSFM) depend on the adequacy of methods used for sampling, laboratory analysis, determining lime and fertilizer recommendations, mapping, and application of lime and fertilizer. It may be possible to predict which procedures will perform best in specific situations based on statistical properties of the variables being mapped. If possible, this will require a fundamental understanding of the factors that influence the relative effectiveness of these procedures.

Two of the most commonly used interpolation methods for SSFM are IDW and kriging (Kravchenko and Bullock, 1999). The IDW procedure has been used primarily because it is simple and quick. Kriging has been used because it provides best linear unbiased estimates; however, it is more complex and time-consuming. For mathematical descriptions of these methods, see Weber and Englund (1992), Hosseini et al. (1994), Gotway et al. (1996), Nalder and Wein (1998), Kollias et al. (1999), Kravchenko and Bullock (1999), Schloeder et al. (2001), and Kravchenko (2003).

Many studies have compared IDW and kriging. In some cases, the performance of kriging was generally better than IDW (Tabios and Salas, 1985; Hosseini et al., 1994; Dalthorp et al., 1999; Kravchenko and Bullock, 1999). In other studies, IDW generally out performed kriging (Weisz et al., 1995; Nalder and Wein, 1998; Weber and Englund, 1992). Often, however, the results have been mixed (Gotway et al., 1996; Kollias et al., 1999; Schloeder et al., 2001; Mueller et al., 2001; Lapen and Hayhoe, 2003).

The relevance of these studies depends on the methods used for analyses. Some have assessed predicted and measured values using independent validation data sets to compare IDW and ordinary kriging (Tabios and Salas, 1985; Laslett et al., 1987; Wartenberg et al., 1991; Weber and Englund, 1992; Weisz et al.; 1995; Gotway et al., 1996; Mueller et al., 2001; Kravchenko, 2003). The exact number of validation points needed to produce accurate results is not known. However, five validation points in one study may have been insufficient (Tabios and Salas, 1985). Cross-validation (Isaaks and Srivastava, 1989) has been used in many cases to compare differences between kriging and IDW (Tabios and Salas, 1985; Hosseini et al., 1994; Nalder and Wein, 1998; Kravchenko and Bullock, 1999; Schloeder et al., 2001; Lapen and Hayhoe, 2003). However, some have cautioned against this procedure for selecting interpolation procedures for mapping (Isaaks and Srivastava, 1989; Cressie, 1993; Goovaerts, 1997). In one instance, it was demonstrated that the use of cross-validation could lead to erroneous conclusions regarding the relative performance of two interpolators (Isaaks and Srivastava, 1989). Unfortunately, this issue has not been extensively studied.

Few have attempted to explain the underlying factors that impact the relative performance of ordinary kriging and IDW. One study investigated the impact of scale of sampling on the relative performance of ordinary kriging and IDW (Mueller et al., 2004). The performance of kriging improved relative to IDW interpolation as sampling intensity increased. The impact of scale is important because sampling intensities may vary widely. Situations have been identified where map quality has been poor at intensities ≤0.09-ha (Mueller et al., 2001). Some farmers, however, are sampling with grids as large as 4 ha. Several studies have examined the relative performance of kriging and IDW within this range (i.e., 0.09 ≤ grid size ≤ 4.0 ha) (Gotway et al., 1996; Kravchenko and Bullock, 1999; Mueller et al., 2001; Kravchenko, 2003). However, many studies have been conducted at lower (i.e., >4-ha grids; Tabios and Salas, 1985; Laslett et al., 1987; Weber and Englund, 1992; Hosseini et al., 1994; Nalder and Wein, 1998; Lapen and Hayhoe, 2003) or higher intensities (i.e., <0.09-ha grids; Dalthorp et al., 1999). Unfortunately, it may be difficult to use these studies to aid with site-specific, soil fertility management decision-making.

The degree to which spatial structure is known impacts the relative performance of IDW and kriging. One study found that by better resolving the spatial structure with additional closely spaced samples, ordinary kriging generally outperformed IDW (Mueller et al., 2001). In a simulation study (Kravchenko, 2003), IDW was compared with kriging with known (i.e., semivariogram models were determined from an exhaustive dataset) and unknown (i.e., semivariogram models were determined from the sample dataset) spatial structure. As might be expected, the performance of kriging improved relative to IDW when spatial structure was known. Given the importance of spatial structure, it may be possible to use geostatistical indices to predict the relative performance of ordinary kriging and IDW.

Despite the work done in this area, little is known about the factors that impact the relative performance of IDW and kriging at sampling intensities used for site-specific management. Therefore, we considered two objectives. The first was to describe the factors that may determine differences between the performance of IDW and ordinary kriging. The second was to explore potential ways in which these differences might be predicted from statistical properties of the data. These analyses were conducted using soil fertility data from another published study (Mueller et al., 2004). Ordinary kriging and IDW interpolation were conducted. Multiple regression analyses were conducted to relate the statistical properties of the data with the relative performance of these procedures.


    MATERIAL AND METHODS
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIAL AND METHODS
 RESULTS AND DISCUSSION
 CONCLUSIONS
 REFERENCES
 
This study was conducted for five fields in Calloway (36°32'37''N long., 88°27'7''W lat.), Hardin (37°31'7''N long., 85°56'13''W lat.), Hopkins (37°26'16''N long., 87°26'5''W lat.), Nelson (37°39'45''N long., 85°30'11''W lat.), and Shelby (38°1'32''N long., 85°27'40''W lat.) Counties, Kentucky. Soil sampling, laboratory analysis, extraction of additional datasets, soil series, and management of these fields have been described by Mueller et al. (2004).

Semivariance values were calculated with SAS (SAS Institute, Cary, NC) procedure VARIOGRAM for each of the datasets. Semivariogram models were fit based on visual assessment. Parameters included the nugget, partial sill, and range of spatial correlation. The total sill is the sum of the nugget and partial sill. The range of spatial correlation refers to the practical range. For exponential models, the practical range is the lag distance at which the semivariance is equal to 95% of the total sill. The relative structural variability is the ratio of the partial to total sill. For more detailed descriptions of these parameters, see Isaaks and Srivastava (1989) and Schabenberger and Pierce (2002). A summary of the statistical parameters for the FULL dataset is presented in Mueller et al. (2004).

For each of the FULL datasets, semivariogram values for individual pairs were calculated with Variowin (Pannatier, 1996). The r2 values for the relationship between the individual pairs and the model semivariograms (r2semivariogram, F) were calculated for the FULL dataset for lags between 0 and 26 m. This index described the adequacy with which the semivariogram model described the spatial variability in the field.

Interpolated grids (1 by 1 m) were calculated with ordinary kriging and IDW (Surfer Version 8, Golden Software, Golden, CO) for the 30.5- and 61.0-m grids datasets. Validation with an independent dataset and cross-validation were used to obtain predicted and measured values. Bilinear interpolation was used to obtain the predicted values for the independent dataset. Errors associated with bilinear interpolation were very small as determined by Mueller et al. (2001)(2004). As described by Mueller et al. (2004), the predicted and measured values were used to calculate various measures of map quality including mean squared error (MSE), bias, precision, and PE.

Preliminary analyses were conducted to determine appropriate interpolation parameters for subsequent analyses. For kriging and IDW, 15 sample points were used to calculate weighted spatial averages. This value was chosen because beyond 15 points, there was little change in prediction efficiency values. For all IDW interpolation, a distance exponent value of 1.2 was used. This value maximized average prediction efficiency across locations, sampling intensities, and soil properties. For all IDW interpolations, a search radius of 125 m was used because there was little change or improvement in prediction efficiency values with larger distances. For ordinary kriging, the search radius was set to be the lag distance beyond which the semivariogram model no longer represented the spatial structure of the data.

The SAS (Cary, NC) PROC REG procedure (SELECTION = Stepwise; SLENTRY = 0.3; SLSTAY = 0.2) was used to model prediction efficiency calculated with an independent validation dataset (PEvalidation). Diagnostics for assessing multicolinearity (i.e., intercept adjusted variance-inflation factors and condition indices statistics) were calculated by using the VIF and COLLINOINT model options. The INFLUENCE option of PROC REG was used to calculate the RStudent statistic for identifying potential outliers. Many of the pH and Bph points in the Kentucky and Michigan datasets were indicated as potential outliers. Careful examination of plots of the difference between modeled and actual semivariogram values for individual pairs versus lag distance revealed spatial anomalies for pH and Bph. The behavior occurred because the electrodes did not have time to adequately equilibrate. Therefore, each pH measurement was correlated with a previous analysis. Since all but the Shelby location samples were analyses in an order that corresponded to the spatial distribution of points across the fields, the correlations in the analyses were manifested as spatial trends in the data. For these reasons, Mueller et al. (2004) removed these points before the final analyses. These outliers have also been removed in this study.

The 30.5- and 61.0-m grid data were grouped into two categories based on semivariogram analysis of the FULL dataset. The first group included observations for variables with semivariograms that reached plateaus as determined by the FULL dataset analysis. The second group included observations of variables with semivariograms that did not reach plateaus as determined by the FULL dataset analysis. For both groups, separate multiple regression analyses were performed using the (i) 30.5- and 61.0-m grids, (ii) the 30.5-m grids, and (iii) the 61.0-m data.

For each of the datasets, models were developed using two sets of candidate regressors. The first set included sample intensity and the following variables from the FULL dataset:

The second set included sample intensity and the following variables from the sample datasets. These included

For independent variables in the first and second sets, the natural logarithms, squares, cubes, and fourth powers these variables were also used as candidate variables when possible. Exceptions included sampling intensity and PEcross-validation, S. To avoid multicolinearity, not all variables were allowed in the model. The final model included no more than one untransformed variable (e.g., rangeS) or one of its transformations (e.g., natural logarithm of rangeS, the square, cube, or fourth power of rangeS) for each variable that was considered for entry in the model.


    RESULTS AND DISCUSSION
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIAL AND METHODS
 RESULTS AND DISCUSSION
 CONCLUSIONS
 REFERENCES
 
The {Delta}PEValidation statistic was the difference in prediction efficiency between ordinary kriging and IDW1.2 as determined using an independent validation dataset. Positive values indicated that ordinary kriging outperformed IDW1.2. This statistic was negative for 18 of 22 sample data sets that had been modeled with only a nugget structure. While all soil properties in this study were spatially structured according to the full dataset analysis (Mueller et al., 2004), spatial structure was not apparent for two variables at the 30.5-m grid scale and 20 observations at the 61.0-m grid scale. For these datasets, prediction efficiency values were 15% lower on average with ordinary kriging compared with IDW1.2. Clearly, of the two methods in question, IDW1.2 was a better choice. Because the variables that were not spatially structured behaved differently than all others, these observations were excluded from the multiple regression analyses.

Fundamental Factors Impacting {Delta}PEValidation
The {Delta}PEValidation statistic generally became more positive as {Delta}RSV decreased (Table 1). Recall that the {Delta}RSV was the difference between the RSV determined with the analyses of the FULL datasets (RSVF) and the RSV determined with the analyses of the sample datasets (RSVS). Because RSVS values were generally underestimates of the degree of true spatially structured, {Delta}RSV values were generally positive. The t-values from these models indicated that the {Delta}RSV values were in some cases the most significant of the factors examined that influenced the relative performance of IDW1.2 and ordinary kriging.


View this table:
[in this window]
[in a new window]
 
Table 1. Multiple stepwise regression model for {Delta}PEValidation. Candidate regressors included statistics determined from the FULL and sample datasets. The observations included datasets with semivariograms that did (Models 1, 2, and 3) and did not (Models 4, 5, and 6) reach plateaus according to the FULL dataset analyses.

 
For variables with semivariograms that reached plateaus according to the FULL dataset analysis, the relationship between {Delta}PEvalidation and spatial structure depended on sampling intensity (Table 1). Specifically, {Delta}PEValidation increased as the dispersion of the semivariogram pairs decreased (i.e., as R2semivariogram, F increased) and the rangeF of spatial autocorrelation increased at the 30.5-m sampling intensity (Model 2 in Table 1). At the 61.5-m grid scale, {Delta}PEValidation increased as R2semivariogram, F and the RSVF decreased (Model 3 in Table 1). The seemingly contradictory impact of spatial structure was associated with the resolution of sample semivariograms at shorter lags. At the 30.5-m grid sample intensity, spatial structure generally was adequately represented with the sample semivariograms. This was evident by the improvement in the performance of ordinary kriging with improved spatial structure of the data. However, short-range spatial variability was generally not adequately resolved with the 61.0-m grid sample datasets. So as spatial structure improved, the model parameters as determined from the sample datasets (i.e., RSVS) became increasingly inadequate. Therefore, the performance of ordinary kriging diminished as the structure of the underlying spatial process improved at the 61.5-m grid scale.

For variables with semivariograms that did not reach a plateau according to the FULL dataset analysis, little variation in {Delta}PEValidation could be explained (i.e., Models 4, 5, and 6 in Table 1). The spatial structure of the data did likely impact the relative performance of ordinary kriging and IDW1.2 for these variables. Unfortunately, the exponential model semivariogram parameters were not simply a function of the spatial structure of the data. They also depended on the modeling choices made by the authors. Specifically, there were limitless combinations of model parameters that could have adequately fitted the experimental semivariograms. Models with either small range and sill values or large range and sill values could have been adequately fitted to semivariograms exhibiting intrinsic behavior. Therefore, no clear relationship with spatial structure parameters could be defined.

These results substantiate those of Mueller et al. (2001) and Kravchenko (2003) who found that the performance of ordinary kriging was better when spatial structure was more accurately defined either with closely spaced samples (Mueller et al., 2001) or with an exhaustive dataset (Kravchenko, 2003). Clearly, in this study, {Delta}RSVF, the degree and range of spatial structure and the dispersion of observations about the semivariogram model (i.e., r2semivariogram, F) were significant factors in the regression models because the relative performance of ordinary kriging and IDW was governed by the degree to which the true spatial structure was known.

{Delta}PEValidation as a Function of Statistics Calculated from the Sample Datasets
The {Delta}PEcross-validationS statistic was a significant factor in several of the models (Table 2). Recall that {Delta}PEcross-validationS was the difference in prediction efficiency between ordinary kriging and IDW1.2 as determined with cross-validation. Cross-validation clearly was useful here for comparing interpolation procedures when used in conjunction with the rangeS or RSVS statistics. Unfortunately however, without other factors in the model, the explanatory values of the models diminished markedly across locations, soil properties, and sampling intensities (i.e., r2 = 0.18). Therefore, caution should still be used when interpreting studies that have compared IDW and kriging solely with cross validation (e.g., Tabios and Salas, 1985; Hosseini et al., 1994; Nalder and Wein, 1998; Kravchenko and Bullock, 1999; Schloeder et al., 2001; Lapen and Hayhoe, 2003).


View this table:
[in this window]
[in a new window]
 
Table 2. Multiple stepwise regression model for {Delta}PEValidation. Candidate regressors included statistics determined from the sample datasets. The observations included datasets with semivariograms that did (Models 7, 8, and 9) and did not (Models 10, 11, and 12) reach plateaus according to the FULL dataset analyses.

 
For variables that reached plateaus (Table 1), the performance of ordinary kriging improved relative to IDW1.2 as the rangeS increased at the 30.5-m grid scale (Model 8) and as rangeS decreased across both scales (Model 7). Clearly, the influence of the rangeS was similar to the influence of the rangeF described in Models 1 and 2 (Table 1). For those variables that did not reach plateaus, ordinary kriging improved relative to IDW1.2 as RSVS increased. This was consistent with the impact of {Delta}RSV in Model 4 (Table 1), which had a negative coefficient as described in Table 1.

Separately, cross-validation and sample spatial structure parameters explained insufficient variability for predicting the relative performance of ordinary kriging and IDW. Together, they explained 43 and 61% of the variability of {Delta}PEValidation (Table 2). Plots of estimated and measured prediction efficiency values (Fig. 1) demonstrated the performance of the regression models with significant factors developed with sample dataset statistics as candidate regressors.



View larger version (12K):
[in this window]
[in a new window]
 
Fig. 1. Estimated versus measured {Delta}PEValidation for multiple regression Models 7, 8, and 10 (Table 2). The data included only those observations that were used to generate the models.

 
Interpretations of these models (Table 2) and plots of predicted versus measured (Fig. 1) should be considered in the context of management. The question that cartographers must ask is whether ordinary kriging will outperform IDW1.2 interpolation. Because this is an either-or decision, the quantities required to evaluate the models are the number of points classified correctly (i.e., observations in the upper right or lower left quadrants) and incorrectly (upper left and lower right quadrants) classified. Observations were classified correctly in 75% of the cases for Model 7 and 8 and 52% of the cases for Model 9.


    CONCLUSIONS
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIAL AND METHODS
 RESULTS AND DISCUSSION
 CONCLUSIONS
 REFERENCES
 
For sample datasets with semivariograms that did not indicate the presence of spatial structure, IDW1.2 was a better choice than ordinary kriging with a nugget model. Cross-validation should not be used as the sole criteria for deciding whether to use one interpolation procedure over another. However, when used in combination with the sample range of spatial correlation values for semivariogram models that reach a plateau, cross-validation may be a useful factor for predicting the relative performance of ordinary kriging and IDW.


    ACKNOWLEDGMENTS
 
The authors acknowledge the USDA (Special Grant Contract # 99-34408-7561). In addition, we are grateful to Mike Ellis, the Luck brothers, Rick Murdock, the Peterson brothers, and Charlie Stuecker for providing access to their farms to conduct this research. We would especially like to express our appreciation to Danna Reid, Frank Sikora, and the staff of the University of Kentucky soil-testing lab for conducting the physical and chemical analyses for this study.

Received for publication September 2, 2003.


    REFERENCES
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIAL AND METHODS
 RESULTS AND DISCUSSION
 CONCLUSIONS
 REFERENCES
 




This article has been cited by other articles:


Home page
Soil Sci.Home page
T.-L. Liu, K.-W. Juang, and D.-Y. Lee
Interpolating Soil Properties Using Kriging Combined with Categorical Information of Soil Maps
Soil Sci. Soc. Am. J., May 23, 2006; 70(4): 1200 - 1209.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Figures Only
Right arrow Full Text (PDF) Free
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via ISI Web of Science (3)
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Mueller, T. G.
Right arrow Articles by Shearer, S. A.
Right arrow Search for Related Content
PubMed
Right arrow Articles by Mueller, T. G.
Right arrow Articles by Shearer, S. A.
Agricola
Right arrow Articles by Mueller, T. G.
Right arrow Articles by Shearer, S. A.
Related Collections
Right arrow Soil Fertility and Productivity
Right arrow Spatial Distribution
Right arrow Geostatistics
Right arrow Spatial Variability
Right arrow Site-Specific Analysis


HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
The SCI Journals Agronomy Journal Crop Science
Journal of Natural Resources
and Life Sciences Education
Vadose Zone Journal
Journal of Plant Registrations Journal of
Environmental Quality
The Plant Genome