Phase 2 Tutorial: Remote-sensing Supported Digital Soil Mapping in
Transcription
Phase 2 Tutorial: Remote-sensing Supported Digital Soil Mapping in
Cooperative Ecosystem Studies Unit (CESU) Phase 2 Tutorial: Remote-sensing Supported Digital Soil Mapping in South Florida PI: Dr. Sabine Grunwald Co-PI: Dr. Todd Z. Osborne Collaborators: Rick Robbins, Howard Yamataki, and Tom Weber, Natural Resources Conservation Service (NRCS) Graduate student: Jongsung Kim (Ph.D. student) Project period: 9/2008 to 9/2010 Contract Number: 68-3A75-4-73 Mod 3 Contract No. 00069677 This report summarizes the research activities of the project “Remote-sensing Supported Digital Soil Mapping in South Florida” funded by Natural Resources Conservation Service (NRCS). Goals of the Project To develop procedures for remote-sensing supported digital soil mapping of various soil properties in Water Conservation Area-2A. Specific Objective Develop models to predict various types of soil properties: (i) soil taxonomic classes; (ii) soil morphological properties; and (iii) physico-chemical soil properties. Assess the usefulness to incorporate remote sensing imagery into soil prediction models. Develop a tutorial for remote-sensing based digital soil mapping that enables transfer of the methods to other soil survey regions. Overview This tutorial serves as a comprehensive documentation for the modeling process to predict soil properties especially soil series and soil properties. Study Area: Water Conservation Area-2A, Greater Everglades, Florida Field Data: 108 points of field data (soil series and properties) including X and Y coordinate Software: 1) ERDAS Imagine 2011 (Earth Resource Data Analysis System Inc., Atlanta, GA) for remote sensing imagery analysis 2) ArcGIS 10 (Environmental Systems Research Institute, Redlands, CA) for data extraction 3) CART (Salford Systems, San Diego, CA) for tree-based soil prediction models development 4) ISATIS (Geovariances Inc., France) for the total phosphorus prediction models development 5) Microsoft Excel (Microsoft Corporation, Redmond, WA) and SAS (SAS Institute Inc., Cary, NC) for data preparation and data analysis Tutorial for Remote Sensing Based Digital Soil Mapping The development of computers and information technology lead soil scientists to incorporate remote sensing technologies into soil prediction models to characterize various soil forming factors. Remote sensing has been widely used to not only identify land cover/land use changes but also predict soil properties. Because it supports rapid and inexpensive data collection over large areas, comprehensive research has been focused on predicting soil properties using geostatistical/hybrid methods at various scales and regions with a variety of remote sensing imagery. Remote sensing is ideally suited to derive environmental data for wetlands areas that are complex ecosystems and are difficult to access. It is an important tool to determine land resources, to observe land surface covers at various spatial scales, and to determine changes that result from natural and anthropogenic processes (Schmid et al., 2008). It has many advantages especially in wetland and soil mapping. For instance, remote sensing based monitoring using temporal sequences of satellite images can capture seasonal patterns in wetlands. Also remote sensing data provide information on surrounding land uses of wetlands and changes over time (Ozesmi et al., 2002). (1) Remote sensing data sources Remote sensing is a measurement of the electromagnetic energy reflected or emitted from objects on the Earth’s surface and the amount of radiation, for any given material, that is reflected at varying wavelength (Jensen, 2005). It means different materials have different reflectance characteristics (Compare Figure 1-1). For instance, vegetation reflects wavelength energy in the range of 0.52 to 0.6 μm (green channel) and 0.76 to 0.90 μm (near-infrared channel) but absorbs wavelength energy in the range of 0.63 to 0.69 μm (red channel). Water reflects wavelength energy in the range in 0.45 to 0.52 μm (blue channel) and absorbs other energy. Therefore, remote sensing can play a role in the identification, inventory, and mapping of vegetation and soils that are on the surface of the Earth, whereby the impact of soil grain size, organic matter, and water content on soil spectral reflectance are identified (Jensen, 2000). The data for remote sensing imagery is collected by two types of sensors: passive, and active. Passive sensors simply use incoming electromagnetic energy from the Sun and collect/record the electromagnetic radiation energy that is reflected or emitted from the earth (Figure 1-2). In contrast, active sensors use machine-made electromagnetic energy. The artificial energy washes the target area (terrain) and then collect/record the amount of radiant flux scattered back toward the sensor (Jensen, 2005). Figure 1-1 Reflectance spectrum of five types of land-cover (figure from center for remote imaging, sensing & processing: http://www.crisp.nus.edu.sg/~research/tutorial/optical.htm) Figure 2-2 Schematic diagram of remote sensing There are varieties of satellite platforms which have different remote sensors: Advanced spaceborne thermal emission and reflection radiometer (ASTER), Landsat multispectral scanner (MSS), Landsat thematic mapper (TM), Landsat enhanced thematic mapper plus (ETM+), Moderate resolution imaging spectroradiometer (MODIS) terra and aqua, Satellite pour l’observation de la terre (SPOT) 4 and 5 high resolution visible infrared sensor, etc. Because of its differential spectral, spatial, temporal, and radiometric resolution, each remote sensing data derived from different remote sensors (satellite platforms) has different reflectance information. The differences of resolutions among some of the widely used remote sensors are shown in Figure 1-3 and 1-4. 1) Brief definitions of each resolution a. Spectral resolution: the number and dimension of spectral regions (e.g. blue, green, red, near-infrared bands) b. Spatial resolution: the size of a grid cell or pixel (e.g. 10 x 10 m, 30 x 30 m, 250 x 250 m) c. Temporal resolution: how often the sensor acquires data (e.g. every 2 days, 16 days) d. Radiometric resolution: the sensitivity of detectors (e.g. 8-bit, 9-bit, 10-bit) 2) Visit the website for detail information for each satellite platform and sensor: a. ASTER: http://asterweb.jpl.nasa.gov/ b. Landsat: http://landsat.usgs.gov/index.php c. MODIS: https://lpdaac.usgs.gov/lpdaac/products/modis_overview d. SPOT: http://www.spot.com/web/SICORP/403-sicorp-spot-images.php Figure 1-3 Remote sensors spatial resolutions Figure 1-4 Spatial and spectral resolution of the Landsat Multispectral Scanner, Landsat Thematic Mapper, and SPOT Sensor System (Figure from Jenson, Introductory Digital Image Processing: A Remote Sensing Perspective, 2005). (2) How to download remote sensing images United States Geological Survey (USGS) Earth Resources Observation and Science (EROS) Center offers free-downloadable aerial photography, satellite imagery, elevation, and land cover data through several viewer tools such as EarthExplorer, Glovis. In this tutorial, ‘Global visualization viewer (Glovis)’ was chosen to show the brief steps how to search available remote sensing images and how to download them. Among various remote sensing images Landsat and MODIS are the most heavily used freedownloadable remote sensing images and both of them are available in Glovis. 1) Visit one of the following websites (search and viewer tools). a. Glovis: http://glovis.usgs.gov/ b. EarthExplorer: http://edcsns17.cr.usgs.gov/NewEarthExplorer/ c. WIST: https://wist.echo.nasa.gov/ d. There are lots of websites where you can access the remote sensing images. 2) Select Satellite platform, sensor, and study area. a. Satellite platform (A in Figure 2-1): Landsat, MODIS, ASTER etc. ‘Landsat’ was chosen for the tutorial. b. Sensor (B in Figure 2-1): Landsat Enhanced Thematic Mapper Plus (ETM+), Thematic Mapper (TM), Multi-Spectral Scanner (MSS) etc. ‘Landsat 7 ETM+’ was chosen for the tutorial. c. Select study area (C in Figure 2-1). ‘Florida’ was chosen for the tutorial. Figure 2-1 U.S. Geological survey global visualization viewer (Glovis) 3) Search available images and download the selected images. a. Select path/row or latitude/longitude of the study area (A in Figure 2-2). To search Landsat images users can select path and row. Landsat data uses Worldwide Reference System (WRS) which is a global notation system. For instance, path 15 and row 42 contains the Greater Everglades area in Florida. b. Select maximum cloud cover (B in Figure 2-2). c. Explore available images. Select a date (month and year). d. Once you get an image which is suitable for your study, click ‘Add’ then ‘Send to Cart’ (C in Figure 2-2). You can add as many images as you want. e. Give brief information (e-mail address) to USGS and then they will send an email to you with a link to download the images. f. Download the images through the link. Figure 2-2 USGS global visualization viewer (Glovis) – the Greater Everlgades, FL (3) How to import/export the downloaded remote sensing images Once you get the images you need to convert the file format to use for developing soil prediction models. For instance, Landsat image files which were created by USGS contain data in Geographic Tagged Image-File Format (GeoTIFF) and MODIS image data are in Hierarchical Data Format-Earth Observing System (HDF-EOS). Because different satellite images have different data format, the images need to be imported into ERDAS Imagine to convert file format into an image (IMG) to proceed with the image analysis. 1) How to read the remote sensing image file name: Different images have different naming convention. e.g. Landsat image file: LE70150422010047EDC00 LE7 Platform and sensor: Landsat 7 ETM+ 015042 Path/Row (Worldwide Reference System) 2010047 Date of acquisition: Year and accumulated day (YYYYDDD) (e.g. 2010047 is 47th day of 2010 = February 16th, 2010) Landsat image file: LT50400362010286EDC00 Platform and sensor: Landsat 5 TM LT5 040036 Path/Row (Worldwide Reference System) 2010286 Date of acquisition: Year and accumulated day (YYYYDDD) (e.g. 2010286 is 286th day of 2010 = October 13th, 2010) MODIS image file: MOD13Q1.A2010049.h10v06.005.2010067103334 MOD13Q1 .A2010049 .h10v06 .005 .2010067103334 Product short name: MOD13Q1 = 16 day Vegetation index (250m) Date of acquisition: Year and accumulated day (e.g.2010049 = 49th day of 2010 = February 18th) Tile identifier (Sinusoidal tiling system): horizontal 10, vertical 06 Collection version Date of production: Year, accumulated day, hour, minute, and second (YYYYDDDHHMMSS) Visit websites of each remote sensing image to know the naming rule in detail. 2) Open the ERDAS Imagine 2011. a. Go to ‘Manage data’ tab and click ‘Import’ button in conversion group (A in Figure 3-1) b. Select file format (B in Figure 3-1). Click and browse to the remote sensing image that you want to import. ERDAS Imagine 2011 supports a variety of remote sensing image conversions. For instance, you can choose ‘Landsat 7 from USGS’ as your import file format (C in Figure 3-1). c. Select input file (‘*.tif’ in case of Landsat, ‘*.hdf’ in case of MODIS) and name the output file (*.img) (Figure 3-2). e.g. Input file: LE70150422010047EDC00.tar.gz Output file:Landsat7_0216 (you name it).img d. Follow same protocols when you export remote sensing files to other formats. Figure 3-1 Import/Export data function in ERDAS Imagine 2011 Figure 3-2 ERDAS Imagine: data import/export ♦ Import function works differently between ERDAS Image 2011 and previous version of ERDAS Imagine. a. If you are using ERDAS Imagine 2011, simply follow steps described above. ERDAS Imagine 2011 supports a variety of remote sensing image conversions. However, if you are using ERDAS 2010 or previous version, you need additional steps. For example, Landsat data files provided by USGS are in zipped file format (*.tar.gz) so that you need to extract (unzip) the files before you import a Landsat image. b. In case of Landsat data file, each band comes as a separate file (e.g. LE70150422010047EDC00_B01.tif (Blue band); LE70150422010047EDC00_B02.tif (Green band); LE70150422010047EDC00_B03.tif (Red band)). It exits as individual layer after file conversion (import) so that you need to stack all layers for further image analysis. To stack layers in ERDAS Imagine 2010 and previous version: tool bar Interpreter Spectral Layer Stack Browse and add all layers that you want to stack (e.g. blue, green, red, near-infrared, mid-infrared) c. ERDAS Imagine 2011 supports automatic layer stack for Landsat data file. It means once you import downloaded file, you can use the *.img file for the image analysis. Layer Stack function in ERDAS Imagine 2011: Raster tab Resolution group: Spectral Layer Stack 3) Display image. a. Go to ‘File’ and click ‘Open (A in Figure 3-3)’. b. Browse to your image file (*.img) and select (B in Figure 3-3). c. Click ‘Raster options’ tab (C in Figure 3-3) and select layers (bands) to color (red, green, and blue) (Figure 3-4) and select OK then the image will display. ♦ Color composition (Compare Figure 3-5) Each pixel of remote sensing image composed with Red, Green, and Blue (RGB) component except pseudo color and gray scale display when you display remote sensing images. You can choose which band goes to red, green, or blue. For instance, if you select band 3 (red wavelength energy) of Landsat data goes to the red component, band 2 (green wavelength energy) goes to the green component, and band 1 (blue wavelength energy) goes to the blue component, we call it 3-2-1 RGB composite image or ‘true-color composite image’ display. If you select band 4 (nearinfrared wavelength energy) goes to the red component, band 3 (red wavelength energy) goes to the green component, and band 2 (green wavelength energy) goes to the blue component, we call it is 4-3-2 RGB composite image or ‘false-color infrared image’ display. The color compositions are only for image display. Figure 3-3 ERDAS Imagine: image display Figure 3-4 ERDAS Imagine: image display – raster option Figure 3-5 Color composition (Landsat 7 image): true-color composite (above) and false-color infrared (below) image (4) How to find layer information and how to project/reproject the images Once you downloaded a remote sensing image reviewing metadata (text format), the description of the data, are important. The metadata provide general information about the satellite platform and image. Some of them provide ground control points if the image is rectified and information for each layer (band). However, the metadata of imagery can be seen in table or histogram format in ERDAS Imagine or ArcGIS. 1) How to find the detailed information of each band a. Click ‘Metadata’ (exclamation mark) in information group at Home. b. It provides information about your image. The layer pull down (A in Figure 4-1) dictates which layer’s information you are looking at. It provides each layer’s information such as data type, range of pixel values for each layer, units, projection, datum, and pixel size. Histogram of the image pixel values is also shown if you click the histogram icon (B in Figure 4-1). Figure 4-1 Image metadata in ERDAS Imagine 2) How to subset (clip) the study area a. To subset your study area, go to ‘Home’ tab and click ‘Inquire’ in the information group (A in Figure 4-2). Click ‘Inquire box’ then square box and information box will be appear on you window. b. You can put X and Y values (upper left X; upper left Y; lower right X; lower left Y) manually into the inquire information window (B in Figure 4-2) or click ‘Select’ (A in Figure 4-2) and then move/drag the box where you wish to select as your subset area. c. After you select the subset area, go to ‘Raster’ tab and click ‘Subset & Chip’ in geometry group. Select ‘Create Subset Image’. d. Browse where the subset image will be stored and name the output file (A in Figure 4-3). Click ‘From Inquire Box’ (B in Figure 4-3) and select the data type of the output file (C in Figure 4-3). Finally select the layers you want to subset (D in Figure 4-3) and hit ‘OK’. e. Open the subset file and confirm that your study area is correctly stored or not (Figure 4-4). Figure 5-2 Inquire box in ERDAS Imagine Figure 6-3 Subset window in ERDAS Imagine Figure 7-4 Subset image for Water Conservation Area 2A, Greater Everglades, FL 3) How to project/reproject the image a. All remote sensing imagery and geographical information system (GIS) data layers should have one common map projection to proceed with any spatial analysis. For instance, if you want to compare three different images which are Landsat ETM+, SPOT, and MODIS, and they have different projections. Landsat ETM+ and SPOT images have Universal Transverse Mercator (UTM) projection but MODIS image has Sinusoidal projection. In this case, first, you need to decide which projection is suitable as a ‘common map projection’. If you choose UTM projection as a common projection, then you need to reproject only MODIS image from Sinusoidal to UTM projection. Projection or reprojection can be done in ERDAS Imagine or ArcGIS. b. In ERDAS Imagine, go to ‘Raster’ tab and click ‘Spatial’ in resolution group. Select ‘Reproject’ and select output projection for the image. c. In ArcGIS, go to ArcToolbox. Under ‘Data Management Tools’ click ‘Projection and Transformations’ and then select ‘Project Raster’ under ‘Raster’ category. ♦A map projection is used to portray all or part of the round Earth on a flat surface. This cannot be done without some distortion. Every projection has its own set of advantages and disadvantages. There is no ‘best’ projection. –National mapping division, USGS- Figure 8-5 Reprojection of MODIS image: Sinusoidal (left) and UTM projection (right) (5) Image registration/rectification Raw data of remotely sensed image do not contain data that were already in their proper geometric x, y locations. Usually remote sensing images could contain a different level of geometric distortion due to internal and/or external reasons. For instance, the rotation of the Earth or remote sensing system itself (e.g. scanning variation) could cause internal geometric errors and variation of spacecraft’s orbit or sun angle changes could cause external geometric errors (Jensen, 2005). It is necessary to preprocess and remove the geometric distortion so that features in remote sensing image are put into ‘real-world’ coordinate system. Usually image to map rectification and image to image registration method is used to correct geometric distortion. Image to map rectification: obtain reference points (ground control points, GCP) directly from a map, digital orthophoto quarter-quad (DOQQ) images, or through GPS. Image to image registration: obtain reference points from an already geocorrected image. 1) How to rectify the image a. Open the image which you wish to rectify and a map which you wish to obtain and ground control points. b. Click ‘Control Points’ in transform & orthocorrect group (A in Figure 5-1) then a window should appear that allows you to select the Geometric Model. Select a geometric model that you want to use (B in Figure 5-1). In this tutorial, Polynomial geometric model was chosen. Figure 5-1 Image to map rectification in ERDAS Imagine: Landsat ETM+ (left) and Digital orthophoto quarter-quad (DOQQ) image (right) c. Once you select the geometric model, the ground control points tool reference setup window will appear. Browse and select the map that you wish to obtain, e.g. road, lake, buildings, ground control points (reference map). Merged DOQQ image was used for the tutorial. d. Confirm the projection and datum when the reference map information box appears then click OK and wait for all windows to position themselves (Figure 52). e. It is ready to collect GCPs. Click crosshair button on tool bar (A in Figure 5-2). Use ‘Zoom-in’, ‘Zoom-out’, and ‘Pan’ button to explore the entire image and select good GCPs. Once you decide good GCP, click crosshair button and select a point on your image (e.g. Landsat image). Click again the crosshair button and then select a point on your reference map (e.g. DOQQ). Then each of the x and y coordinates of your image and map will be added automatically under your GCPs area (A in Figure 5-3). Collect as many GCPs as you can. Be sure that you need to collect GCPs spread well throughout the entire area. Figure 5-2 Image to map rectification in ERDAS Imagine: Geometric correction window f. After collecting GCPs, click the sigma button on the tool bar for geometric correction error assessment (rule of thumb: root mean square should be less than half resolution of the image; B in Figure 5-3). g. According to your GCPs, the image will be resampled. Figure 5-3 Image to map rectification in ERDAS Imagine (6) Band ratioing/Vegetation index It is essential to understand the meaning of the features or pixel values of the image. What do high pixel values in red band mean? What do low pixel values in near-infrared band mean? If an object observes green wavelength energy and reflect red wavelength energy, the pixel values of the object can be low in the green band and high in the red band. However, understanding the relationship between pixel values of images and features or phenomena in nature environment is essential. Each band of imagery has meaning but sometimes a combination of two or more bands contains even more information. Band ratioing or ratio transformation is a process by which reflectance values of pixels in one band are divided by the values of their corresponding pixels in another band (Jenson, 2005). It allows creating a new output image which is informative. These ratios may enhance or subdue certain attributes found in the image, depending on the spectral characteristics in each of the two bands chosen. For example, the ratio of red and near-infrared bands provides vegetation information and blue and near-infrared ratio provides water information. Similarly vegetation indices also use the band ratioing concept. Vegetation indices are widely used to assess vegetation coverage, changes, and condition of plant vigor since the 1960s. Some important vegetation indices can be derived from MODIS, Landsat ETM+, and SPOT images. However, some of the indices are sensor specific. For instance, NDVIgreen cannot be derived from MODIS image because MODIS does not have a green band. Some commonly used vegetation indices are shown in Table 1. Table 1. Selected remote sensing vegetation indices. Formula* Indices Normalized difference vegetation index Normalized difference vegetation green index Transformed vegetation index Enhanced vegetation index** References Rouse et al., 1974 Gitelson et al., 1996 Nellis and Briggs, 1992 Huete et al., 1997 Normalized Difference Water Index Gao, 1996 Reduced Simple Ratio Chen et al., 2002 Mid-infrared index Musick and Pelletier, 1988 * NDVI = Normalized difference vegetation index, NIR = Near-infrared, SWIR = Shortwaveinfrared; ** Empirical parameters for enhanced vegetation indix (EVI) of MODIS: C1=6.0; C2=7.5; G=2.5; L=1.0. 1) How to do band ratioing a. Go to the ‘Raster’ tab and click ‘Functions’ in scientific group. Select ‘two image functions’. b. Browse the input files. Different bands in the same input file (e.g. band 1 and band 2) or different input files (e.g. Landsat and SPOT) can be used (A in Figure 6-1). c. Browse to a folder in which the output file is stored. d. Click drop-down button and select an operator that you want to use (e.g. +, -, *, /…) and select data type then click OK. 2) How to derive vegetation indices a. Go to the ‘Raster’ tab and click ‘Unsupervised’ in classification group. Select ‘NDVI’ or ‘Indices’. b. Browse input file and output file (you name it). c. Select the vegetation index which you want to calculate (e.g. NDVI, IR/R, VI…) and data type (A in Figure 6-2). d. The equation will appear when you select a vegetation index (B in Figure 6-2). Once you confirmed the equation, click OK then a newly derived image which contains the vegetation index in each pixel will be added. You may confirm the ranges of the images using a histogram or metadata (e.g. NDVI values should be between ‘-1’ and ‘+1’). Figure 6-1 Band ratioing: two input operators Figure 6-2 Vegetation indices Figure 6-3 Vegetation indices: Normalized difference vegetation index (7) Extract reflectance or ratio/vegetation values for soil sampling locations It is impossible to sample all soils exhaustively, i.e. at all locations, to analyze their taxonomic or biogeochemical properties because it is costly, labor and time intensive. Thus, soil sampling can only be done at sparse locations and models are used to estimate properties at unsampled locations. Even though remote sensing images can support continuous dense information throughout the study area, the images should be connected with site-specific soil data and this can be done through x and y coordinates. The images can be merged with site-specific soil properties through x and y coordinates in ERDAS Imagine or ArcGIS. Reflectance values and derived spectral indices from images need to be extracted for each (soil) sampling point after the merge process for further prediction model development. The extraction process can be easily done using ArcGIS. ERDAS Imagine also has a way (e.g. making process models to extract the pixel values using ‘Model Maker’ under ‘Toolbox’) to assign raster pixel values to shape points. However, the steps are complicated and slow because spatial modeling processes in ERDAS Imagine rasterize the points internally during the process. Hence, ArcGIS is used for the pixel values extraction in the tutorial. 1) How to merge the image with site-specific soil properties in ERDAS Imagine a. Open the remote sensing image and click ‘Add’ to add site-specific soil properties (*.shp). b. Browse the soil property layer and add. Please make sure that your ‘file type’ is in shapefile (*.shp) when you browse the soil layer. c. Once you have the same map-projection, the soil layer will be automatically displayed above the remote sensing image. d. You may edit the soil layer (e.g. attribute table) under ‘Vector’ tab. 2) How to merge the image with site-specific soil properties and extract pixel values in ArcGIS a. Open a ArcMap and add all images (e.g. Landsat ETM+, SPOT, MODIS, NDVI derive from Landsat, NDVI derive from SPOT) and GIS layers (e.g. elevation, geophysical data) and sampling points layer (e.g. from a soil survey) (Figure 7-1). b. Go to ‘Spatial Analyst Tools’ in ArcToolbox and select ‘Extraction’. There are two options that you can use to extract raster pixel values to shape points. The first one is a ‘Extract by Points’ function. It is a function that extracts pixel values of one raster layer to shape points. For instance, NDVI image can be chosen as input raster and only the NDVI values which coincide with soil sampling locations will be extracted. The second option is a ‘Sample’ function. Multiple rasters can be added as input rasters in ‘Sample’ method. For instance, NDVI, NDVIgreen, SPOT, MODIS, etc. can be added as input rasters and be extracted at one time. Note that when you use ‘Sample’ you need to be sure about the order of the input raster files. An output table will be written in the order of the input rasters. c. Add the input rasters (A in Figure 7-2) and point layer (B in Figure 7-2). Name the output table. Click OK. d. The output table is in data base format (*.dbf) and it can be opened in Excel or other spreadsheets (Figure 7-3). e. Finally confirm what kind of predictor variables are collected and extracted (e.g. variables shown in Table 2 are used for the prediction models development in this tutorial). Everything is ready now! Figure 7-1 A merged image (NDVI is shown as top layer) with site-specific soil properties Figure 7-2 Pixel value extraction: Sample in ArcMap Figure 7-3 Output table which contains soil properties and pixel values of spectral indices Table 2. Example of environmental predictor variables used for development of soil prediction models. Property Land cover data Topographic data Attributes* Vegetation Elevation (m) Landform MODIS: NDVI, TVI, EVI Landsat ETM+: blue, green, red, NIR, MIR, PAN, NDVI, NDVIgreen, TVI, MidIR, Principal Component Analysis (PCA) 1 to 6, Tasseled Cap Transformed data (TC) 1 to 6, NDVI 3x3 window, NDVI 7x7 window SPOT: green, red, NIR, SWIR, PAN, NDVI, NDVIgreen, TVI, NDWI, RSR, PCA 1 to 4, NDVI 3x3 window, NDVI 9x9 window Gamma-ray (potassium) Bouguer gravity Isostatic residual gravity x coordinates (m) y coordinates (m) xy coordinates Distance to water control structures (m) Remote sensing data Geophysical data Geographic data * Values were extracted from each local pixel. (8) Conceptual soil-landscape modeling McBratney et al. (2003) formalized the SCORPAN soil prediction model rooted in Jenny’s (1941) factorial model which describes soil as a function of five soil forming factors. For empirical quantitative descriptions of relationships between soil and other spatially referenced factors, the SCORPAN model includes spatial position and soil itself as a soil forming factor; Sc or Sa [x,y,~t] = f (S[x,y,~t], C[x,y,~t], O[x,y,~t], R[x,y,~t], P[x,y,~t], A[x,y], N) where Sc = soil class Sa = soil attributes S = soil, other properties of the soil at a point Eq. (1) C = climate O = organisms including humans R = relief P = parent material A = age (time factor) N = space x, y = x and y coordinates t=time Figure 8-1 Conceptual soil-landscape modeling: SCORPAN Based on the SCORPAN model, in this tutorial, prediction models for soil series and biogeochemical soil properties are developed using remote sensing imagery which can provide information about the O factor. (9) Soil series prediction models development Classification Trees can be used to build predictive soil models. Classification Trees are a type of regression which is capable to predict classes (e.g. soil series) from various predictor variables (e.g. GIS and remote sensing data which characterize the study area). Interpretations and predictions of environmental data and especially soil data are challenging because of their complexities and interrelationships. Tree-based methods are suited for dealing with highly complex data that are non-Gaussian (De’ath and Fabricius, 2000). Tree structures are generated through recursive partitioning the data into a number of groups (McBratney et al., 2003). Tree-based models can be implemented in single tree mode or as committee trees using hundreds of trees as described in Grunwald et al. (2009). The CART software package (Salford Systems, San Diego, CA) is used for tree-based soil series prediction models development in this tutorial. Please visit the website (http://salfordsystems.com/cart.php) for detail information about CART. The spectral data and derived indices from different remote sensing images that were collected through step (1) to (7) can be included. Vegetation, topographic, geophysical, and geographic data along with the site-specific soil series data are also used to build classification trees. In this tutorial, soil series that were collected from 108 sites during field sampling by Natural Resources Conservation Service (NRCS) and UF staff in WCA-2A are used. There are seven soil series: Gator, Gator with limestone bedrock, Lauderhill, Okeelanta, Pahokee, Terra Ceia, and Terra Ceia with limestone bedrock. However, there is a limitation to predict bedrock because there is no available information about parent material and bedrock in WCA-2A, so that Gator with limestone bedrock is considered as Gator series, and Terra Ceia with limestone bedrock is considered as Terra Ceia series to develop prediction models for soil series. There are 7 observations for Gators, 5 for Lauderhill, 14 for Okeelanta, 25 for Pahokee, and 57 for Terra Ceia soil series. 1) How to build a data file for CART a. CART reads data in various format including delimited ASCII text files, SAS, SPSS, and Excel files. b. CART is case insensitive. Variable names have to start with a letter and should have only letters, numbers, or underscores. c. CART supports both character and numeric variables. d. Variable names must not exceed 32 characters. (Reference: Steinberg and Colla, CART-Classification and Regression Trees, 1997) 2) How to build soil series prediction models a. Prepare your dataset including GIS, spectral and vegetation indices. b. Open CART (Figure 9-1) and click on File. Go to ‘Open’ and then ‘Data files’ to bring in your dataset. Or click open folder button. c. Browse and select the input data file. The input data file window should appear (Figure 9-2). You can see the brief information for the input data file and variables such as how many records does the input file have, how many variables does it have, how many character or numeric variables does it have (A in Figure 9-2). You may sort the variables by file order or alphabetical order. d. You may exclude some of the variables from your input dataset by clicking ‘Stats…’, review all individual data by clicking ‘View Data…’, and start to build your models by clicking ‘Model…’ in the Activity menu (B in Figure 9-2). e. CART automatically distinguishes character and numeric variables from the input data file. If the variable name ends with ‘$’ symbol, it means that the variable is assumed as a character variable (Compare variable name A and B in Figure 9-3) and others are treated as numeric variables. Also character variables are automatically treated as categorical variables (C in Figure 9-3). Categorical valuables also can be selected manually by putting a checkmark inside the checkbox. f. Select ‘Target (A in Figure 9-3)’ and ‘Predictor (B in Figure 9-3)’ variables. If no predictor is selected, CART treats all remaining variables as predictor variables. In this case, the target variable = soil series; and predictor variables are SCORPAN variables extracted from remote sensing images and GIS data layers. g. Select Tree Type (D in Figure 9-3): ‘Classification’, ’Regression’, or ‘Unsupervised’. Classification tree: Use when your target variable is a ‘Categorical’ variable (e.g. soil series). Regression tree: Use when your target variable is a ‘Continuous’ variable (e.g. soil phosphorus). h. Select ‘Single (CART) tree’ or ‘Committee (Combine) tree’ (E in Figure 9-3). Figure 9-1 Opening CART Figure 9-2 Input data file window Figure 9-3 Model setup window - Model i. Select the validation method in ‘Testing’ tab (e.g. V-fold cross-validation, variable separates test when you have a separate dataset for validation) (Figure 94). j. Set ‘Standard error rule’ for selection of the best tree (e.g. minimum cost tree regardless of size, within one standard error of minimum) in ‘Best Tree’ tab (Figure 9-4). k. Select ‘Splitting method (e.g. Gini, Entropy, Twoing)’ and ‘Favor even splits level’ in ‘Method’ tab (Figure 9-5). Figure 9-4 Model setup window – Testing (left) and Best Tree (right) Classification Trees Gini: Gini looks for the largest class in your dataset and costs are incorporated by adjusting prior probabilities. For instance, Gini splitting method tries to isolate ‘Terra Ceia’ series which have largest number of observation from all other classes in soil series dataset. Symmetric Gini: Costs are made symmetric and directly incorporated into the impurity function at the tree-growing stage. Entropy: Entropy looks for splits where as many levels as possible are divided perfectly or near perfectly. Class Probability: Class probability method builds probability tress instead of classification trees. Twoing: Twoing segments the classes of target variable into two groups. It attempts to make groups that together add up to 50 % of the input dataset. Ordered Twoing: Ordered towing method can handle ordered target variable. (Reference: Steinberg and Colla, CART-Classification and Regression Trees, 1997) Figure 9-5 Model setup window – Method (A: Classification trees, B: Regression trees) l. Set ‘Minimum node sizes’ for parent node and terminal node in ‘Advanced’ tab. m. Once the model parameters are set, click ‘Start’. Result navigator should appear (Figure 9-6). n. The navigator shows trees with its cost. The trees can be grown or pruned by the model developer (A in Figure 9-6). The best performing prediction model is the one with the lowest relative cost (error). Brief model statistics such as number of predictors, nodes, and minimum node cases are shown right of the navigator. o. CART supports several options for displays and reports (B in Figure 9-6). In essence, classification trees document the splitting rules of environmental predictor (SCORPAN) variables used in a tree to predict a target variable (here: soil series) in form of terminal nodes. p. Only splitters are displayed when you select a ‘Splitters…’ (Left in Figure 9-7) and ‘Tree details…’ shows specific splitting criteria, number of cases in each class and so on (Right in Figure 9-7). ‘Summary reports…’ shows ‘Gains chart’ for each class (Left in Figure 9-8), terminal nodes information, and variable importance (Right in Figure 9-8). Figure 9-6 Classification tree model Figure 9-7 Classification tree model displays and reports: Splitters (left) and tree details (right) Figure 9-8 Summary reports: Gains chart (left) and variable importance indicating which of the predictor variables had the highest importance to predict a variable of interest (right) 3) Error assessment The confusion error matrix is shown under the ‘Prediction Success’ tab (Figure 9-9). It can be utilized to assess the uncertainty of predictions. Confusion error matrices show the amount of average accuracy and overall accuracy. ‘Average accuracy’ is the average of each class’s accuracy and ‘accuracy in each class’ is determined by dividing the total number of correctly classified cases in a class (e.g. Gator soil) by the total number of observed cases in that class (e.g. Gator soil). The ‘overall accuracy’ is determined by dividing the total number of correctly classified cases (sum of diagonal of confusion matrices) by the total number of observed cases. Examples of the confusion error matrix are shown in Table 3-1 and 3-2. These are results of model development for soil series using remote sensing spectral data in WCA-2A. Figure 9-9 Error assessment for classification tree model Table 3-1. Confusion error matrix for single tree soil series prediction model with Landsat ETM+ image; Twoing classification method; average accuracy = 59.00 %, overall accuracy = 57.41 %. Predicted (Classified) soil series Class Observed case Gator Gator N=12 Lauderhill N=22 Okeelanta N=13 Pahokee N=0 Terra Ceia N=61 7 5 0 2 0 0 Lauderhill 5 0 5 0 0 0 Okeelanta 14 0 2 6 0 6 Pahokee 25 3 12 1 0 9 Terra Ceia 57 4 3 4 0 46 Table 3-2. Confusion error matrix for single tree soil series prediction model with SPOT image; Entropy classification method; average accuracy = 64.55 %, overall accuracy = 59.26 %. Predicted (Classified) soil series Class Observed case Gator Gator N=12 Lauderhill N=11 Okeelanta N=25 Pahokee N=11 Terra Ceia N=49 7 5 0 2 0 0 Lauderhill 5 0 5 0 0 0 Okeelanta 14 0 0 8 2 4 Pahokee 25 3 6 5 6 5 Terra Ceia 57 4 0 10 3 40 You could produce a map with the classification model that you developed for soil series using ArcGIS. The ‘raster calculator’ can be used to produce a map as shown in Figure 9-10 and 9-11. Figure 9-10 Single tree model (left) and prediction map of soil series (right) using Landsat ETM+ image in WCA-2A Figure 9-11 Single tree model (left) and prediction map of soil series (right) using SPOT image in WCA-2A (10) Soil property prediction model development Comprehensive research has been focused on predicting soil properties using geostatistical/hybrid methods at various scales and regions with a variety of remote sensing imagery. Various statistical and geostatistical prediction methods with field observation data and remote sensing images can be used to predict the spatial distribution of soil properties. In this tutorial, total phosphorus (TP) data that were collected and analyzed from 108 sites soil cores during field sampling are used. TP in WCA-2A ranged from 165.4 to 1694.0 mg kg-1 with an average of 601.4 mg kg-1 at layer 1 (0 to 10 cm), 27.6 to 2288.0 mg kg-1 with an average of 399.1 mg kg-1 at layer 2 (10 to 20 cm). The ISATIS software package (Geovariances Inc., France) is used to develop the TP prediction models. Topographic, geophysical, geographic, spectral data and indices from remote sensing images can be used to assess the trend models for regression kriging. Figure 10-1 provides an overview of soil property prediction model development. Figure 10-1 Overview of soil biogeochemical property prediction models development. 1) What is a semivariogram? Semivariograms measure the spatial dependence of soil properties using the semivariance (Webster and Oliver, 2007). Goovaerts (1997) mentioned that the semivariogram provides information on the range of spatial correlation as well as the magnitude of spatial variability. Nugget variance, range and sill variance of each semivariogram are parameters provided for interpretation of spatial dependence (McBratney and Pringle, 1999). The nugget effect represents random variation caused by measurement error or short-range variation (e.g. due to microbial activities) that could not be detected. Usually, the semivariance increases with sampling distances and then approaches a constant value called the sill. The distance at which the sill is achieved is called the range of spatial dependence (Figure 10-2). Typically, pairs of samples closer together show smaller semivariance values, whereas samples further away from each other show larger semivariances. It means samples separated by distances closer than the range are spatially related, while samples separated by greater distances are not spatially related. The semivariance is described by the formula: (h) 1 m {s( xi ) s( xi h)}2 2m i 1 Eq. (2) where γ(h) is the average semivariance of the soil property, m is the number of pairs of sampling points, s is the value of the property S (e.g. soil TP), x is the coordinate of the point i, and h is the lag or separation distance (Webster and Oliver, 2007). Figure 10-2 Semivariogram, showing the range, nugget, and sill. 2) What is Kriging? Kriging is a statistically-based estimator of spatial variables such as soil properties (Bolstad, 2008) and the most commonly used interpolation method in soil science. It provides a solution to the problem of estimation based on a continuous model of stochastic spatial variation (Webster and Oliver, 2007). Ordinary Kriging is the most common type of kriging and it assumes that we do not know the mean a priori (Meul and Van Meirvenne, 2003; Webster and Oliver, 2007). Information from the semivariograms is used in ordinary kriging to derive estimates across a defined region. It uses a search neighborhood function including only neighboring data values within the search window (certain range of window) to predict values at an un-sampled location. Typically near points from the target carry more weight than more distant ones and clustered points carry less weight individually than isolated ones at the same distance (Webster and Oliver, 2007). Regression Kriging is a combination of traditional statistics’ regression analysis and geostatistical analysis. The two components of the prediction (trend and kriging) are computed separately in regression kriging (Odeh et al., 1995). The deterministic component is modeled using multiple linear regressions and the stochastic component is modeled using ordinary kriging or simple kriging of the regression residuals (Odeh et al., 1995; Stacey et al., 2006; Rivero, 2006). The advantage of regression kriging is that kriging after regression may improve prediction in comparison to when doing regression or kriging only (Odeh et al., 1995). 3) How to import an input file and build a variogram model in ISATIS Kriging method is used to develop TP prediction model for the tutorial. ArcGIS supports kirging through the Geostatistical analyst extension (Figure 10-3). However, ISATIS is used to do semivariogram fitting and kriging in this tutorial because it affords the user more control over the geostatistical modeling process than in ArcGIS. ISATIS uses ‘variogram’ instead of ‘semivariogram’. Figure 10-3 Geostatistical wizard for Kriging in ArcMAP a. Open ISATIS (Figure 10-4) and create workspace for the model development. Select ‘Data File Manager’ under ‘File’ menu and then click ‘Create’. ‘Create study’ window should appear (Figure 10-4). Name the workspace and select location. Click ‘Create’ button then your workspace is set. Figure 10-4 ISATIS open window Figure 10-5 Create study window Figure 10-6 Import input file in ISATIS b. To import a prepared input file which contains soil properties, environmental ancillary data, and remote sensing spectral data and indices, click ‘File’ and ‘Import’. ISATIS supports several kind of data formats such as ASCII, ArcView, CAD, Excel, etc. (Figure 10-6). Protocols are the same for all data formats. Excel file is imported as an example in this tutorial. Select ‘Excel…’ and then ‘Excel import’ should appear. c. Import the input file (*.xls or *.xlsx; A in Figure 10-7) and click ‘First available row contains field manes’ if your input file does (B in Figure 10-7). d. Set directory and file for ISATIS points file (C in Figure 10-7). e. Click ‘Automatic’ in data files and then variables name should show up (D in Figure 10-7). Click each variable and select ‘Field type’ (e.g. easting, northing, numeric) and unit (E in Figure 10-7). Click ‘Preview’ to confirm the data and then click ‘Import’ (F in Figure 10-7). f. For the experimental semivariogram set and data analysis, go to ‘Statistics’ and click ‘Exploratory data analysis’. Click ‘Data file’ (A in Figure 10-8) and select variable that you want to analyze (soil TP in layer 1 in this tutorial; B in Figure 10-8). ISATIS supports various types of exploratory data analysis such as histogram and Q-Q plot (C in Figure 10-8). Explore your data and decide what further steps you need (e.g. transformation, outliers). Figure 10-7 Import Excel file into ISATIS (left), points file selector window (upper right), and preview window (lower right) g. For the experimental semivariogram set, go to ‘Application’ in the variograms window (D in Figure 10-8) and then the ‘Variogram calculation parameters’ window should appear (Left in Figure 10-9). Set the variogram parameters (A in Figure 10-9) and click ‘OK’. Go back to variogram window and click ‘Application’ again. Select ‘Save in parameter file’ under application and name the experimental variogram (e.g. variogram_TP_tutorial). Click ‘Add’ and ‘OK’. Figure 10-8 Exploratory data analysis window (left) and various graphs (right) Figure 10-9 Variogram calculation parameters window (left) and variogram with number of data pairs (right) h. Go to ‘Statistics’ and click ‘Variogram fitting’. Once the variogram fitting window appears, bring in the experimental variogram (A in Figure 10-10) which was saved in step (g). Click ‘Variogram model’ and name the output file (fitted variogram model; B in Figure 10-10). To see the fitted variogram, check ‘Fitting window’ in Graphic windows (D in Figure 10-10). i. Click ‘Edit’ button (C in Figure 10-10) and then the edit window should appear. In ‘Edit’ window, develop variogram models. Play with several parameters. Try to find the “best” variogram model for the variable. You may want to add a ‘Nugget effect’ into your variogram. Select ‘Nugget effect’ and click ‘Add’. Set ‘Sill’. Figure 10-10 Variogram fitting window Figure 10-11 Variogram fitting edit window (left) and fitted variogram (right) j. Click ‘Add’ again and select model for variogram structure (e.g. spherical model, exponential, Gaussian…) and set the range and sill (Figure 10-11). k. Click ‘Test’ to see your variogram fitting and if you are satisfied, click ‘OK’ and ‘Run (Save)’. 4) How to interpolate soil properties using the variogram model (ordinary and block kriging) a. Click ‘Estimation’ under the ‘Interpolate’ menu and then select ‘Kriging’ method that you wish to use (Figure 10-12). For example, if the data do not follow a normal distribution or are severely skewed, you can choose the lognormal kriging method. Lognormal kriging automatically transforms values using natural logarithm and back-transforms after interpolation. b. Click ‘(Co-)Kriging’ and select calculation option (e.g. punctual, block; A in Figure 10-13) and number of variables. c. Select an input file and output file (e.g. variable name such as ‘TP_prediction’), and fitted variogram model that was developed in previous stage (B in Figure 1013). You can also edit the variogram in this step. If you want to perform ‘Simple kriging’ or ‘Factorial kriging’, you can select the method in ‘Special model options (C in Figure 10-13)’. d. Click ‘Neighborhood’ and add a neighborhood file for the kriging. Name the new file (e.g.neighborhood_TP_tutorial) and then click ‘Add’ and ‘OK’. e. Once the neighborhood file is added, you can set neighborhood parameters such as ‘search ellipsoid (distance), minimum and optimum number of samples, and number of sectors. Click ‘Edit (D in Figure 10-13)’ and set neighborhood parameters. Figure 10-12 Kriging methods in ISATIS f. Click ‘Run’. g. For the error assessment, the cross-validation method is used. Go to ‘Statistics’ and then click ‘Modeling’, select ‘Cross-validation’. The cross-validation window should appear (Right in Figure 10-13). Set the number of variables and select a ‘Data file’ and variables which will be cross-validated. Bring the variogram model and neighborhood and then click ‘Test’. The results window should appear (e.g. Figure 10-14). h. Try to build several variogram models and neighborhoods. Run your models and do the test. Try to find a best model for your variable. i. Display your kriging result under the ‘Display’ menu in ISATIS or export the output file as ArcView or ASCII format. Go to ‘File’ and click ‘Export’. Select output format and then bring the output file into ArcMap to display the result (e.g. Figure 10-15). Figure 10-13 Kriging (left) and cross-validation (right) in ISATIS Figure 10-14 Cross-validation results Figure 10-15 Block kriging result map for TP in WCA-2A 5) Regression kriging Regression kriging is a combination of traditional statistics’ regression analysis for the deterministic component (multiple linear regressions for trend) and geostatistical analysis for the stochastic component (kriging for residuals) (Odeh et al., 1995). a. Correlation analysis is performed to look at the relationship among predictor (SCORPAN) variables and then multiple linear regression method is employed for trend analysis in SAS (Figure 10-15). b. Once you finish trend analysis, produce a trend map with the formula (e.g. lnTP = 5.28 + (0.0073×Near-infrared reflectance value) + (8.34×NDVI) + …). Simple way to produce a trend map is to use ‘Raster calculator’ in ArcMap. Figure 10-15 Correlations and multiple linear regression (above) and correlation result (below) c. Derive the residuals by comparing the observed values and values derived from the regression equation for all sampling locations: observation values – trend value; or soil lnTP observations – soil lnTP predictions from the regression model d. Then follow the same steps as explained in (4) How to interpolate soil properties using the variogram model (ordinary and block kriging). e. Combine the trend map and residual map (e.g. Figure 10-16). For this step the ‘Raster calculator’ can be used in ArcMap. Our research results show that remote sensing derived properties explained about 50% of the variability of soil TP in the topsoil. Spectral indices infer on the chlorophyll content, stress, and composition of vegetation, which are correlated with soil TP content. Also landscape features such as tree islands and hammocks were captured by the TP model using remote sensing spectral data as a predictor variable. The same steps as outlined in this tutorial can be utilized to develop prediction models for other soil classes (e.g. soil order, soil suborder) or soil properties (e.g. soil organic carbon, soil total nitrogen). Figure 10-16 Regression kriging result map for TP in WCA-2A References Bolstad, P. 2008. GIS Fundamentals: a First Text on Geographic Information Systems. Eider Press, Minnesota. Chen, J.M., G. Pavlic, L. Brown, J. Cihlar, S.G. Leblanc, H.P. White, R.J. Hall, D.R. Peddle, D.J. King, J.A. Trofymow, E. Swift, J. Van der Sanden, and P.K.E. Pellikka. 2002. Derivation and validation of Canada-wide coarse-resolution leaf area index maps using high-resolution satellite imagery and ground measurements. Remote Sensing of Environment 80: 165-184. De’ath, G., and K.E. Fabricius. 2000. Classification and regression trees: a powerful yet simple technique for ecological data analysis. Ecology 81:3178-3192. Gao, B.-C. 1996. NDWI-A normalized difference water index for remote sensing of vegetation liquid water from space. Remote sensing of Environment 58:257-266. Gitelson, A.A., Y.J. Kaufman, and M.N. Merzlyak. 1996. Use of a green channel in remote sensing of global vegetation from EOS-MODIS. Remote Sensing of Environment 58: 289298. Goovaerts, P. 1997. Geostatistics for Natural Resources Evaluation. Oxford University Press, New York. Grunwald S., S.H. Daroub, T.A. Lang and O.A. Diaz. 2009. Tree-based modeling of complex interactions of phosphorus loadings and environmental factors. Sci. of the Total Environ. 407: 3772-3783. Huete, A.R., H.Q. Liu, K. Batchily, and W. van Leeuwen. 1997. A comparison of vegetation indices over a global set of TM images for EOS-MODIS. Remote Sensing of Environment 59: 440-451. Jensen, J.R. 2000. Remote Sensing of the Environment: an Earth resource perspective. Prentice Hall, New Jersey. Jensen, J.R. 2005. Introductory Digital Image Processing: a Remote sensing perspective. Prentice Hall, New Jersey. McBratney, A.B., and M.J. Princgle. 1999. Estimating Average and proportional variograms of soil properties and their potential use in precision agriculture. Precision Agriculture 1: 125152. McBratney, A.B., M.L Mendonca Santos, and B. Minasny. 2003. On digital soil mapping. Goederma 117: 3-52. Meul, M., and M.Van Meirvenne. 2003. Kriging Soil Texture under Different Types of Nonstationarity. Geoderma 112: 217-233. Musick, H.B., and R.E. Pelletier. 1988. Response to soil moisture of spectral indexes derived from bidirectional reflectance in thematic mapper wavebands. Remote Sensing of Environment 25: 167-184. Nellis, M.D., and J.M. Briggs. 1992. Transformed vegetation index for measuring spatial variation in drought impacted biomass on Konza Prairie, Kansas. Transactions of the Kansas Academy of Science 95:93-99. Odeh, I.O.A., A.B.McBratney, and D.J.Chittleborough. 1995. Further results on prediction of soil properties from terrain attributes: heterotopic cokriging and regression-kriging. Geoderma 67: 215-226. Ozesmi, S.L., and M.E. Bauer. 2002. Satellite remote sensing of wetlands. Wetlands Ecology and Management 10: 381-402. Rivero, R.G. 2006. Development of predictive models of soil phosphorus in water conservation area-2A (Everglades), integrating remote sensing, GIS and geostatistics. Doctoral dissertation, University of Florida. Rouse, J.W., R.H. Haas, J.A. Schell, and D.W. Deering. 1974. Monitoring vegetation systems in the Great Plains with ERTS. NASA. Goddard space flight center 3rd ERTS-1 symposium 1:309-317 Schmid, T., M. Koch, M. DiBlasi, and M. Hagos. 2008. Spatial and spectral analysis of soil surface properties for an archaeological area in Aksum, Ethiopia, applying high and medium resolution data. Catena 75:93-101. Stacey, K.F., R.M.Lark, A.P.Whitmore, and A.E.Milne. 2006. Using a process model and regression kriging to improve predictions of nitrous oxide emissions from soil. Geoderma 135: 107-117. Steinberg, D., and P. Colla. 1997. CART-Classification and Regression Trees. San Diego, CA: Salford Systems. Webster, R. and M.A. Oliver. 2007. Geostatistics for Environmental Scientist (Second ed.). John Wiley & Sons Ltd. England.