|
|
||||||||
a USDA-NRCS, Soil Quality Institute, 2150 Pammel Dr., Ames, IA 50011-4420
b USDA-ARS, National Soil Tilth Lab., 2150 Pammel Dr., Ames, IA 50011-4420
* Corresponding author (andrews{at}nstl.gov)
| ABSTRACT |
|---|
|
|
|---|
Abbreviations: AGG, water-stable aggregates AWC, plant-available water-holding capacity Db, bulk density EC, electrical conductivity MBC, microbial biomass C MDS, minimum data set NRI, Natural Resources Inventory PMN, potentially mineralizable N R2, coefficient of determination SMAF, Soil Management Assessment Framework SOM, soil organic matter SQ, soil quality T, soil loss tolerance SAR, sodium adsorption ratio TOC, total organic C WS, watershed
| INTRODUCTION |
|---|
|
|
|---|
Although some misconceptions exist, the recent emphasis on soil function (and dynamic soil quality) is not intended to detract from the importance of soil taxonomy, where inherent soil properties, resulting from the five soil forming factors (Jenny, 1941), and land use suitability are emphasized. Soil quality uses taxonomy as a foundation (Karlen et al., 2003). The specific definition of soil quality for a particular soil is dependent on its inherent capabilities, the intended land use, and the management goals. For instance, optimum levels of organic matter (and other soil properties) will differ depending on the condition under which the soils formed, leading to variation in potential functioning. The use-dependence of the SQ concept can be illustrated simply: the functions, properties, and processes necessary to hold up a physical structure are not the same as those needed to grow a crop. More subtly, the soil qualities (functions or properties) critical for environmentally benign land application of animal waste are not identical to those for maximized productioneven within the same field or under the same crop.
As with defining SQ, assessing SQ also requires consideration of taxonomy, land use and management goals. Appropriate SQ assessment measures a soil's changes in function in response to management, within the context of what the soil is being asked to do, its inherent properties, and environmental influences, such a temperature and precipitation. The target or optimum soil quality is not one standard for the USA or the world; instead, it is a series of thresholds defined by limiting factors and user needs.
Indicators of SQ can be defined loosely as those soil properties and processes that have greatest sensitivity to changes in soil function. Doran and Parkin (1996) emphasized that SQ indicators should correlate well with ecosystem processes, integrate soil properties and processes, be accessible to many users, sensitive to management and climate, and, whenever possible, be components of existing databases. Indicator groups or MDSs, used to indirectly measure soil function, must be sufficiently diverse to represent the chemical, biological, and physical properties and processes of complex systems (Gregorich et al., 1994; Doran and Parkin, 1996; Snakin et al., 1996; Karlen et al., 2003). While the concept of indirect measures (indicators) has been widely used to monitor water quality (Karr, 1981), the idea was first applied to soil function through pedotransfer functions (Larson and Pierce, 1991). Because there are so many competing uses and inherent limitations for the world's soils, the components of a MDS are not universal: the appropriate indicators for indirect assessment of soil function are determined by which functions are critical to meet management goals (Harris et al., 1996; Andrews et al., 2002a).
Larson and Pierce (1991) argued that the measure of SQ in agriculture should no longer be limited to productivity goals, inferring that emphasizing productivity may have contributed to soil degradation in the past. The design of any generalized assessment tool for SQ must be flexible enough to capture multiple soil functions in various combinations. This must be accomplished with respect to the broader goals of sustaining plant and animal productivity, maintaining or enhancing water and air quality, and supporting human health and habitation (Karlen et al., 1997). In addition, a SQ assessment tool needs to interpret the indicators of those functions in terms of the inherent abilities of the soil and climate in which the assessment takes place. Such a tool would address most of the misgivings and misconceptions among those who have reservations regarding the SQ concept (e.g., Sojka et al., 2003), by using quantitative laboratory analyses, providing site-specific interpretations, and evaluating and understanding management effects on a specific soil resource with respect to multiple endpoints (which are outcomes driven by management or societal goals, e.g., productivity and environmental quality).
The objectives for this current work were (i) to design a tool to assess the relative effects of management on SQ based on indicator measurement, and (ii) to test the framework for transferability across soils, climate, and management practices.
This paper outlines a three-step framework (without describing details of the computer code), called the Soil Management Assessment Framework (SMAF), and imparts the results of its application to four case studies that vary in climate, management practice, spatial extent, and soil type. The case study demonstration shows how the framework interprets soil indicator data and computes relative SQ indices to compare management practices or monitor change over time. We call on other researchers to continue to test, critique, and refine the SMAF as a tool for sustainable soil management.
| MATERIALS AND METHODS |
|---|
|
|
|---|
|
To generate a list of suggested indicators, a user of the tool replies to a number of questions, one of which pertains to the user's primary management goal for the site. A table in the database identifies the critical functions associated with each management goal: maximize productivity, waste recycling, or environmental protection (Table 1) For example, if the user chooses waste recycling as the primary management goal, the program identifies the functions nutrient cycling, water relations, filtering and buffering, and resistance and resilience as important to that goal. There are currently three management goals and six functions identified in the program's database but more can easily be added if additional land uses are targeted.
|
|
Indicator Interpretation
After selecting (Step 1) and measuring the appropriate indicators for the MDS, indicator interpretation (Step 2) involves transformation of each observed MDS indicator value using nonlinear scoring curves (e.g., Karlen and Stott, 1994; Andrews et al., 2002a, 2002b). It is assumed that indicator measures are performed according to standard methods for the near surface (015 cm) and that sampling design is appropriate for the area to be assessed (see the case study section for examples). Measured values are transformed into unitless values so that scores may be combined to form a single value in Step 3. The use of scoring curves for data analysis and synthesis allows interpretations to reflect both ecosystem function and farmer and societal values regarding crop production and environmental protection (Schiller et al., 2001). For example, society currently places a value on the protection of surface water, therefore, the measurements for soil P that are above what is necessary for crop production receive lower scores, particularly on sloping land, to reflect the increased risk of surface water contamination (Fig. 2)
. Scoring curves are used in a similar manner in a variety of disciplines such as measurement of utility in economics (e.g., Norgaard, 1994), evaluation of decision outcomes in multi-objective decision science (e.g., Yakowitz et al., 1993), and assessment and modeling in systems engineering (e.g., Wymore, 1993).
|
|
The nonlinear scoring algorithms were originally constructed using a curve-fitting program, CurveExpert v. 1.3 shareware (available online at http://curveexpert.webhop.biz/ [verified 22 June 2004]). The curve shapes were determined by literature review and consensus of collaborating researchers. Total organic C (TOC) and water stable aggregation (AGG) are ascending logistic or more-is-better functions based on their roles in soil fertility, water partitioning, and structural stability (Tiessen et al., 1994; Herrick and Wander, 1998). Plant available water holding capacity (AWC) was assigned a more-is-better curve, based on the role of water availability for crop productivity and other biological activity (e.g., Gregory et al., 2000). The more-is-better curve was also used for potentially mineralizable N (PMN) based on nutrient availability and a theorized relationship between microbial activity and plant productivity (e.g., Hendrix et al., 1990; Sparling, 1997). The more-is-better curve was also used for microbial biomass C (MBC) based on its role as a pool of readily available C and N and an association with improved soil structural functioning (Elliott and Coleman, 1988; Hendrix et al., 1990). A lower asymptotic or less-is-better function was used for bulk density (Db) because of the inhibitory effect that high Db often has on root growth and soil porosity (Grossman et al., 2001b). Variations of mid-point optimum or Gaussian functions were used for soil pH (Whittaker et al., 1959; Smith and Doran, 1996) and electrical conductivity (EC) (Tanji, 1990) based on crop sensitivity and effects on nutrient availability. Scores for sodium adsorption ratio (SAR) were dependent on potential for soil dispersion, environmental (water quality) risk, and associated EC levels (Oster and Schroer, 1979; Hansen and Grattan, 1992). The mid-point optimum curve for P is based on crop response and environmental risk (Pierzynski et al., 1994; Maynard, 1997).
We assume that the expected range for each indicator will vary according to site-specific controlling factors, such as climate or inherent soil properties. For instance, in a southeastern U.S. Ultisol, a TOC of 2% would be considered a high value; this soil would receive a high TOC score. In a Midwestern Mollisol, however, a TOC of 2% would be considered a low value, consistent with a degraded soil. It would receive a correspondingly low score. The factors controlling these differences in expected range for TOC include average annual precipitation, average annual temperature, soil texture, and soil taxonomic suborder (as a surrogate for inherent soil organic matter). To model these associations between indicators, function, and controlling factors, one must have knowledge of (or make assumptions about) not only the appropriate curve shape (based on the indicator's relationship to ecosystem function) but also the expected direction of change in curve inflections as major controlling factors change. For instance, as temperature and precipitation increase, expected TOC decreases due to increased decomposition rates under these conditions. This results in a shift to the left in the algorithm's inflection points.
We used CurveExpert v. 1.3 shareware to identify which parameters in the scoring curve algorithms needed to change to best represent the relationships between each indicator and soil function(s) in various systems, for example, climate, soil, and crop combinations. We also, by default, identified those parameters in the algorithms that do not change, which are termed fixed parameters. We then determined, via literature review, the most important controlling factors for each MDS indicator. We linked these controllers to the site-specific parameters, using database tables in Java and look-up tables in Excel. For example, the controlling factors for the Db scoring-algorithm are texture and mineralogy (Table 3). Using a logic statement, the program chooses between two sets of parameters for one algorithm based on soil texture. As texture becomes coarser, Parameters b, c, and d change to reflect a greater tolerance of higher Db's before root restriction or aeration become a problem. In clayey soils, mineralogy comes into play, such that glassy and smectitic soils have a lower tolerance of high Db's compared with other mineral classes (Grossman et al., 2001a). Table 3 shows the algorithms, fixed parameters, site-specific parameters, and controlling factors for 10 scoring curves.
For some of the site-specific parameters, there were too many controlling factors to identify exact values for every possible combination. To circumvent this problem, we grouped some factors into classes that behave similarly. For instance, soil texture is grouped into five classes based on the work of Quisenberry et al. (1993). Conversely, a few parameters are modeled as continuous (rather than step) functions. For example, we used observed TOC as a site-specific factor in the test P scoring algorithm. By identifying expected trends in function due to controlling factors, this approach yields site-specific soil indicator interpretations without the need to construct formal thresholds for every possible combination of soil, climate, and crop.
Using these algorithms and their fixed and site-specific parameters, we created scoring curves that shift to provide a site-specific interpretation for each indicator. Figure 2 illustrates a hypothetical example of this phenomenon for the indicator soil test P (test P). The left side of the P scoring curve (ascending to an upper-asymptote) is based primarily on crop requirements (or crop class) and well supported in the literature (e.g., Maynard, 1997). The right side (lower-asymptote) reflects environmental risk (P runoff to surface water) and is based primarily on slope. (Slope is grouped into five classes: 02, 24, 48, 816, and >16%.) The right-side portion of the curve is currently less well defined in the literature. Both sides of the curve are influenced by observed TOC (as a continuous function), soil texture class (as a step function), and method of soil P detection (acting as classes to form a step function) as well (Fig. 2 and Table 3).
Integration into an Index
Step 3 of the SMAF, index integration, is optional but offers the potential to integrate all of the indicator scores from the previous interpretation step into a single, additive index value. This value is considered to be an overall assessment of SQ, reflecting management practice effects on soil function. Andrews et al. (2002a) found few differences among various integration techniques including additive (e.g., Andrews and Carroll, 2001), weighted (Harris et al., 1996); and maxmin objective functions (e.g., Yakowitz et al., 1993) when used to combine nonlinearly scored indicator values. Therefore, we chose the simplest alternative, the additive index, for the integration step. This step is accomplished by summing the scores for each indicator, dividing by the total number of indicators, and then multiplying by 10 (Eq. [1]):
![]() | [1] |
Using the number of indicators in the MDS as a divisor corrects for any missing data in the data set. The index value was multiplied by 10 to provide index values in a range (1 to 10 rather than 0 to 1) found to be more amenable for producers and other potential users (Andrews et al., 2003).
The Case Studies
Site Descriptions and Experimental Designs
We applied the SMAF to four large, existing data sets from studies conducted at different scales and regions within the USA. Site descriptions for each of the studies are presented below; metadata is summarized in Table 4.
|
For the Iowa case study, we used data collected in 1994 and 1995 from two field-scale watersheds (WS) with a 25+ yr tillage system comparison at the Deep Loess Research Station near Treynor, IA. One of the watershed treatments (WS2) was cropped to continuous corn (Zea mays L.) on the contour from 1964 to 1995. The other watershed treatment (WS3) was used for cattle grazing from 1964 to 1972, and then converted to continuous corn production using ridge tillage in 1972 (Cambardella et al., 2004). Soils at summit positions are Monona silt loams (fine-silty, mixed, superactive, mesic Typic Hapludolls). Ida or Dow silt loam soils (fine-silty, mixed, calcareous, mesic Typic Udorthents) are found in backslope positions. Footslope soils are generally Napier or Kennebec silt loams (fine-silty, mixed, superactive, mesic, Cumulic Hapludolls) (Karlen et al., 1999). Twelve sampling locations were distributed within each WS based on soil series, slope, and erosion class. Locations were consistent each year.
A third study, the Sustainable Agriculture Farming Systems project, involved 1.2-ha plots managed using different vegetable production systems near Davis, CA (Clark et al., 1998, 1999a, 1999b) in a randomized split plot design. The four management system treatments differed by crop rotation and use of external inputs: conventional 2 yr (Conv-2), conventional 4 yr (Conv-4), low input, and organic. Both conventional treatments received applications of synthetic pesticides and fertilizers at rates recommended for the region by University of California Cooperative Extension Service. The Conv-2 rotation consists of processing tomato (Lycopersicon esculentum Mill.) and wheat. The Conv-4 rotation was tomato, corn, safflower (Carthamus tinctorius L.), and wheat and dry beans (Phaseolus vulgaris L.) (double cropped). The organic treatment used composted and aged animal manures, rotations of winter cover crops, and some organic supplements for fertility and pest management. The low treatment combined both synthetic and organic techniques: synthetic fertilizer was applied at about one half the recommended rate and pesticide use was reduced by cultivation and hand hoeing. The organic and low treatments had identical cash crop rotations of tomato, safflower, corn, oats (Avena sativa L.) + vetch (Vicia spp.), and dry beans (double cropped) (Clark et al., 1998, 1999a, 1999b; Andrews et al., 2002a). All possible entry points for the rotations were represented each year. The soils were classified as Reiff loams (coarse-loamy, mixed, nonacid, thermic Mollic Xerofluvents) and Yolo silt loams (fine-silty, mixed, nonacid, thermic Typic Xerorthents). We used the 1996 data set.
The smallest scale case study was a plot-scale experiment in Georgia to compare the residual effects of fresh versus composted broiler litter on SQ in tall fescue (Festuca arundinacea) pasture, 3 yr after application (Andrews, 1998; Andrews and Carroll, 2001) in May 1995. Four experimental treatments (split application totals) consisted of surface-applied poultry (broiler) litter applied at approximately 1845 kg N ha1; surface-applied composted broiler litter at approximately 1845 kg N ha1 (representing the high end of litter application rates in the region); surface-applied ammonium nitrate treatment providing 100 kg N ha1, 13 kg P ha1, and 33 kg K ha1; and a no amendment control, applied in a randomized complete block design (Tyson, 1994). The high litter amendment rates were representative of (the high end of) surface applications for waste disposal in the region. The study was conducted at two locations, with four blocks at each site: near Calhoun, GA, on a Conasauga silt loam (fine, mixed, semiactive, thermic, Oxyaquic Hapludalfs) in the Southern Appalachian Ridges and Valleys region; and near Farmington, GA, on a Cecil sandy loam (fine, kaolinitic, thermic, Typic Kanhapludults) in the Piedmont region.
Laboratory Analyses
The SQ indicators were measured for bulked core samples taken from 0 to 10 cm (for NRI), 0 to 15 cm (at IA and CA), and 0 to 5 cm (for GA) of soil at each case study location, using standard methods. The NRI dataset included approximately 20 chemical, biological, and physical indicators (Brejda et al., 2000a, 2000b). The Iowa study included 21 chemical, biological, and physical indicators SQ indicators (Cambardella et al., 2004). The California dataset was comprised of 19, mostly soil chemical, SQ indicators (Clark et al. (1998).) The Georgia study included 38 chemical, biological, and physical SQ indicators (Andrews and Carroll, 2001). The indicators with methods described below are included in at least one case study MDS.
Aggregate stability was assessed according to methods described by Cambardella and Elliott (1993) and expressed as the percentage of the total soil that was water-stable macroaggregates >250 µm in diameter. Available water capacity was estimated by difference in water retention between soils held at 1.5 and at 0.01 MPa (15 and 0.1 bar) (Klute, 1986). Bulk density was estimated by a modified core method (Blake and Hartge, 1986), in which soil moisture content, determined by drying a subsample of the cored soil at 105°C, was used to convert the total mass of the field-moist soil core to an oven-dry weight. Electrical conductivity (Rhoades, 1982) of saturated pastes (U.S. Salinity Laboratory Staff, 1954) was measured using a conductivity meter. Microbial biomass C was measured by fumigationextraction method (Sparling and Ross, 1993) for the NRI, Iowa, and Georgia case studies, and by the chloroform-incubation method (Horwath et al., 1996) for the California study. Soil pH was determined in 1:1 soil/water for NRI and Georgia, 2:1 soil/water in Iowa, and saturated paste in California (Thomas, 1996). Potentially mineralizable N was measured using a 28-d aerobic incubation methods described by Drinkwater et al. (1996) (for IA and NRI) or Bundy and Meisinger (1994) (for CA). Sodium Adsorption Ratio was calculated using results from saturated paste extracts of Na+, Ca2+, and Mg2+ in milliequivalents per liter (U.S. Salinity Laboratory Staff, 1954). Soil test P was measured via a different method for the case studies: for NRI and IA studies, the method followed the Mehlich-III extraction procedure (Mehlich, 1984); at the California site it was determined by extracting samples with a 0.5 M sodium bicarbonate solution (Olsen et al., 1954); and the GA study used Mehlich-I double acid extraction (Kuo, 1996). Extractions were followed by inductively coupled plasma emission (NRI) or colorimetric detection via molybdate reaction. Total organic C (after removal of carbonates with 1 M H2SO4) was determined via dry combustion of dried ground samples using a gas analyzer (Pella, 1990).
Endpoint Measures
One reason the case studies were selected to demonstrate the framework was that their large, existing data sets included either direct endpoint measures or other indirect endpoint surrogates, which reflected the management or societal goals at each site. The available endpoint data varied with each case study. Some data were available for each sample point and other data were only available as treatment means (noted below). When only means were available, comparisons were made among treatment means rather than individual sample points. These endpoint measures and surrogates served as proxies for the identified management goals and were used to validate the efficacy of the MDS and indicator scoring.
The NRI case study included multiple land uses. Therefore, we assumed the management goal to be environmental protection. The endpoints were a percentage of C change and a nematode maturity index, which were calculated from the existing data. We defined the percentage of change in TOC to be the percentage of difference in observed TOC and mean soil survey TOC value for each observed soil series, as an indirect measure of C sequestration. We calculated the nematode maturity index based on the method of Bongers (1990). It is a measure of system disturbance based on the relative abundance of nematode functional groups or feeding guilds. The maturity index met the selection criteria in the selection rules database (in Step 1) (Table 2) and thus was a potential MDS indicator for this case study. We chose to use it as a endpoint measure instead, in part because there were few endpoints available for this dataset but also because: (i) increased biodiversity is a legitimate societal goal and (ii) the expertise and time involved in performing this measure makes it an unattractive indicator for future use.
Productivity was assumed to be the management goal for the Iowa case study. The endpoint measures were yield (Mg ha1), collected from plots at various landscape positions (e.g., hilltop or shoulder/summit, sideslope or backslope, and toeslope) that were adjacent to the soil sampling sites, and sedimentation (Mg ha1), measured as soil material leaving each WS in stream water collected at a weir. The endpoint surrogates, available as WS means, included pesticide application rates for atrazine and metalochlor (liters applied), as representatives of potential soil and water contamination.
The management goal for the California case study was assumed to be productivity. The endpoint measures used net revenues for each system (treatment means) and net revenues and yield for tomatoes (the main cash crop) (Clark et al., 1999b). The endpoint surrogates used included: water use efficiency (mm water x crop yield1); weed cover (%); and the number of tillage operations per year (treatment means only) (Andrews et al., 2002a), included as an indirect measure of soil disturbance.
The Georgia study's assumed goal was waste recycling. The available endpoint data, from Years 1 and 2, were amount of litter applied (kg dry litter applied ha1), using a conversion factor adapted from Safley and Safley (1991) to represent compost as an equivalent volume of fresh litter, and fescue yield (kg dried biomass ha1). Both litter applied and yield were available as treatment means only. From Year 3, when soils were sampled, the endpoint surrogates used were soil extractable As and Cu, determined via Mehlich-I double acid extraction (Amacher, 1996). These metals are common poultry feed fungicidal components and serve as proxies for all metal contaminants.
Statistics
We used JMP v. 3 software for Windows (SAS Institute, Cary, NC) for all statistical analyses. We performed analysis of variance (ANOVA) on the observed and the scored MDS indicators, to compare the statistical differences between treatments in each case study with and without scoring (Step 2). We also examined the overall index values in Step 3 by plotting and analyzing treatment means and standard deviations using ANOVA and Student's t. Finally, we performed stepwise regressions of all available indicators (independent variables) and endpoints (as iterative dependent variables) for each case study. The p values for acceptance and rejection in the stepwise models were 0.25 and 0.1, respectively. Examining the incidence of MDS indicators that were not added to the regression models acted as a check of the indicator selection step. We also examined the efficacy of indicator scoring by comparing the R2 for regressions using scored versus observed data. The indicators available for the regressions were limited to those having scoring curves (see Table 3), to allow the direct comparison of observed and scored regression results. To make the R2 values comparable among models with different numbers of parameters, we report the adjusted coefficients of determination (R2) value, which uses the degrees of freedom in its calculation (SAS Institute, 1995).
| RESULTS AND DISCUSSION |
|---|
|
|
|---|
Currently, SMAF's focus is on agricultural land use but that, too, is flexible. This tool is concerned with the evaluation of change in soil function as a result of management, using SQ indicators (that are dynamic or use-dependent soil properties) interpreted with respect to taxonomy, climate, land use, and goals. The tool, therefore, does not address the question of whether the user's goals are actually appropriate for a particular site. That would be a land capability question that relates to inherent soil properties that do not readily change due to management.
The results from the four case studies, presented below, are organized according to the three steps of the framework.
Indicator Selection
Using the decision rules developed for Step 1 of the SMAF, a set of indicators (MDS) was chosen for each case study. Table 2 illustrates the subset of rules applied to generate the case study MDSs. The NRI MDS was composed of AGG, MBC, pH, PMN, test P, and TOC, following rules for the environmental protection goal in arid systems. The indicators selected for the Iowa MDS were: AGG, BD, MBC, pH, PMN and TOC, applying rules for a productivity goal, tillage comparisons, and assessments over time. The California MDS indicators were: EC, pH, SAR, test P, and TOC, using criteria such as productivity goal, arid region, and organic amendment comparison. For the Georgia study, the selected MDS indicators were: AWC, BD, MBC, pH, test P, and TOC, applying the criteria for manure management. The selected MDSs consisted of biological, chemical, and physical SQ indicators suggested by several authors including Larson and Pierce (1991), Doran and Parkin (1994)(1996), and Seybold et al. (1998). All of these published SQ minimum data sets include TOC and pH, which were selected as indicators for all four studies, because of their importance and influence on so many critical soil functions. However, it is the decision rules process of Step 1 that we are trying to convey, not the specific indicators selected. Therefore, these results should not be taken as a recommendation for the indicators named.
To check the efficacy of SMAF Steps 1 and 2, we performed stepwise regressions using the observed and scored indicators as independent variables and endpoints (or endpoint surrogates) as iterative dependent variables for each case study (Table 5) (Results germane to Step 2 are discussed in the next section.). Most of the selected MDS indicators were added to the stepwise regressions models, with the following exceptions: scored AGG was not added to the NRI regressions, PMN was not part of any of the Iowa stepwise regressions; and scored Db was not selected for the Georgia regressions. Although these indicators were not part of the MDSs, each was suggested for possible MDS inclusion according to the rules table. The authors chose not to include every possible indicator in an effort to keep the MDSs' size small.
|
In a study comparing indexing methods, Andrews et al. (2002a) found that both expert opinion- and multivariate statistic-selected MDS indicators performed equally well in describing the variation in endpoint measures or their surrogates, such as yield, water use efficiency, and weed biomass. In the multivariate selection method, indicator covariance was checked and redundant indicators eliminated from the MDS. Since this added step did not affect the MDS performance, covariance is not considered in the current selection step. (The covariance of indicators acts as a built-in weighting factor for the related indicators in Step 3, when indicator scores are combined.) The decision rules of Step 1 were based on expert opinion of those familiar with each indicator and literature review, similar to the Andrews et al. (2002a) study but formalized into an expert system format.
In practice, Step 1 would occur before data collection. Because the case data was pre-existing, indicator selection was constrained by the indicators available in each data set. Nevertheless, using the assumed management goals and site-specific data for each study resulted in lists of suggested indicators that included a subset of the available data that could be used for MDSs. We envision this step being useful by itself for choosing indicators for adaptive management, monitoring, or assessment. Conversely, it could be skipped entirely when a minimum data set already exists.
Indicator Interpretation
After selecting an appropriate MDS for each study, we evaluated the framework's sensitivity to site-specific differences by examining shifts in scoring algorithms (or expected ranges) for each indicator. Plotting the observed and scored values for each indicator illustrated how the algorithms responded to changes in controlling factors, such as soil type, crop, and climate, within and among the case studies (Fig. 3)
. The shifting inflection points and expected ranges were based on the best available information but can be refined and altered as the knowledge base improves. Again, it is the mathematical framework allowing for site-specific interpretation that we wish to emphasize, not the specific parameters.
|
When examining the results of the site-specific interpretation step, four main patterns in treatment differences emerged when treatment means for the scored and observed indicators were compared using ANOVAs. These four patterns demonstrated the benefits associated with using scoring based on performance of function to help interpret indicator data for multiple objectives, including productivity and environmental protection.
The first and most commonly occurring pattern was characterized by similar results between case study treatments for an observed and scored indicator (Pattern 1) (Fig. 4a) . Pattern 1 was seen for AGG, MBC, pH, PMN, and TOC in the NRI study; MBC at the IA study; EC, test P, and TOC at the California study; and AWC and pH in the Georgia case studies. This seemed to occur whenever the majority of observations fell on the ascending portion of the scoring algorithm, giving a more-is-better result. The second pattern was when the observed and scored indicator values had opposite results (e.g., the highest observed treatment was the lowest after scoringa less is better effect) (Pattern 2) (Fig. 4b). This pattern was seen for test P in the NRI study; Db at Iowa; pH at California; and test P in the Georgia case study. This opposite scoring pattern was seen when the majority of observations occurred on the descending portion of a curve. Pattern 2 for NRI was driven by the test P values in woodland, where very high observed test P values resulted in significantly lower scores for test P in this land use (data not shown). This could be due to P applications in managed forests or simply due to inherently high P in relatively undisturbed soils. If the latter, then the low score is unwarranted and would call for an adjustment to the algorithm under this land use. Unfortunately, the authors did not have enough information about the NRI management practices to make a determination. Because the curves were developed primarily for agricultural land uses, we also looked at treatment differences in a more homogeneous subset of the NRI: cropped Xerolls. In those two-way ANOVAs, by cropping practice (continuous small grains vs. wheat-fallow) and tillage (till v. no-till), all indicators fit under Pattern 1, similar results for observed and scored. (Fig. 4a shows the Pattern 1 seen for PMN in the NRI subset).
|
The fourth, least common, and unexpected pattern (Pattern 4) occurred when the observed results for an MDS indicator showed no significant differences among treatments but scored results were significantly different (Fig. 4d). This was only seen for two indicators, pH and AGG, at the Iowa study. In a two-way ANOVA, pH at Iowa was only significant for the scored indicator among watersheds (Fig. 4d). The AGG scores were significantly different among years but observed AGG data were not (data not shown). This unexpected pattern occurred when the observed data had scores in the ascending and descending portions of a Gaussian curve with few, if any, points in the optimum range (e.g., top of the curve), such as for soil pH in WS2 in 1994 of the Iowa case study (Fig. 5) . These WS2 pH data resulted in a lower score compared with the WS3 data, which fell largely in the optimum range of the curve, even though the means for observed pH were not significantly different. This suggests that important information can be captured by scoring that might otherwise go undetected when examining observed means alone.
|
While using existing data and surrogate endpoint measures is a good beginning, an ideal test of SMAF efficacy has yet to be performed. The ideal would be a study explicitly designed for this purpose that included endpoint measures specific to each management goal and soil function of interest.
Index Integration
Step 3, integration of indicator scores into an index, is probably the most controversial. Concerns most often expressed are about the use of an index as a regulatory tool. Yet, in focus groups with farmers, an integrative index was very attractive as a monitoring tool, as long as they could also access individual indicator information from which to make specific management decisions (Andrews et al., 2003). If desired, however, Steps 1 and 2 could stand alone: one could use the SMAF only to select the most appropriate indicators to measure or only to interpret existing data. Step 3, on the other hand, relies on the scored indicators from Step 2, because the indicator measurements must be transformed into unitless values before they can be meaningfully combined. While the first two steps are the most critical, Step 3 allows one to see the overall health of the soil, without the distraction of (potentially) conflicting individual indicator results.
Comparisons of index outcomes using the NRI data showed that SQ was significantly greater in forage, range, and woodland soils than for soils under CRP, continuous grain or wheat-fallow, with the latter being the lowest (Fig. 6a) for all land uses. Since the CRP data were grouped together without respect to length of time in CRP, this land use probably included some of the most degraded soils along with soils that had a recovery period. This supposition also explains the higher variance observed for CRP compared with all other land uses. When we examined the homogeneous subset of NRI data, cropped Xerolls, differences between cropping systems were only significant under no-tillage (Fig. 6b). A similar result was also observed using individual SQ indicator data for a long-term cropping systems study in the Northern Great Plains (B. Wienhold, personal communication, 2003.)
|
The Framework showed that soils used for organic vegetable production in California had significantly higher SQ index ratings than for the two conventional rotations (Fig. 6d). This result using the SMAF is comparable with that of Andrews et al. (2002a), where scoring curves were created specifically for the California data set (as opposed to using the site-specific factors to shift parameters in the scoring algorithms). This suggests that the site-specific scoring of the current framework is working as well as individually tailored SQ assessments.
Using data from the Georgia case study, the SMAF showed significant differences in SQ index results only for the Ultisol site (Fig. 6e). None of the MDS indicator scores were significantly different among the management treatments at the Conasauga site. At the Ultisol site, individual scores for pH, AWC, and test P differed among treatments. The score for pH was significantly greater for compost compared with the chemical and control. Scored AWC was higher in compost and litter compared with chemical. However, it was the soil P score, which was significantly lower for compost compared with the other treatments (the converse of observed Pa Pattern 2) that drove the index outcome. For this reason, the compost treatment had a significantly lower index outcome than the other treatments at the Ultisol site.
The ability to assess relative system function, whether as an integrative index or individual indicator scores, may have strong implications for both natural resources management and policy. We anticipate that this framework, when fully developed, could have such diverse uses as: evaluating the effects of bioenergy crop production (or residue harvest) on soil resources at the field scale; assessing management intensities at the farm scale for Conservation Security Program Enhancement Payments; and interpreting watershed scale data (both measured and modeled) as part of the Conservation Efforts Assessment Program. The framework's flexibility, leading to site-specific interpretations, is its biggest strength.
| CONCLUSIONS |
|---|
|
|
|---|
Currently the framework is being further evaluated and improved as part of the USDA-ARS Soil Resource Management National Program but a prototype is also available to other interested users. We fully acknowledge and anticipate that the current version will undergo significant improvement as the capacity for additional SQ indicators is added and as knowledge improves regarding soil function in various ecosystems. We will continue to work with experts that are familiar with each of the biological, chemical and physical indicators under a variety of management practices and ecosystems to improve selection rules and interpretation algorithms relative to management goals and site-specific factors. In addition, we fully expect that the integration step will be improved in the future by the inclusion of rules for weighting indicators' scores in some situations. Due to its flexible, database-driven design, many aspects of the SMAF can be changed and updated simply. In addition, efforts to improve the user interface have begun. When fully operational, we expect an advanced version to be even more flexible, user-friendly, and useful for relating the effects of soil management on various soil functions in a broad range of settings. We conclude that SMAF is a useful assessment tool that, with further standardization and validation, may help move soil conservation and resource management beyond assessments of soil erosion and changes in productivity toward stewardship (Wander and Drinkwater, 2000), with such diverse applications as model output interpretation, conservation program assessment, soil test interpretation, green payments and conservation credits.
| ACKNOWLEDGMENTS |
|---|
| NOTES |
|---|
|
|
|---|
Received for publication November 20, 2003.
| REFERENCES |
|---|
|
|
|---|