Identification_Information: Citation: Citation_Information: Originator: USGS, Forest and Rangeland Ecosystem Science Center, Snake River Field Station Publication_Date: 20050908 Title: Current Distribution of Sagebrush and Associated Vegetation in the Columbia Basin and Southwestern Regions Edition: version 1.0 Geospatial_Data_Presentation_Form: raster digital data Online_Linkage: http://sagemap.wr.usgs.gov Larger_Work_Citation: Citation_Information: Description: Abstract: A new regional dataset was produced using decision tree classifier and other techniques to model landcover. Multi-season satellite imagery (Landsat ETM+, 1999-2003) and digital elevation model (DEM) derived datasets (e.g. elevation, landform, aspect, etc.) were utilized to derive rule sets for the various landcover classes. Eleven mapping areas, each characterized by similar ecological and spectral characteristics, were modeled independently of one another. An internal validation for modeled classes was performed on a withheld 20% of the sample data to assess model performance. Results of the validation will be presented in the final report and are not available at this time. Mapping area models were mosaicked to create the Columbia Basin Regional Dataset (Idaho, Oregon and Washington), which was subsequently combined with the Southwest Regional Gap Landcover Dataset to create the final seamless 8 state regional landcover map. The final map contains 126 Landcover classes (103 NatureServe Ecological Systems, 7 NLCD and 16 non-native vegetation classes) and has a minimum mapping unit (MMU) of approximately 1 acre. Purpose: The digital landcover dataset may be used for various purposes with user's discretion. Specifically, this dataset was created for assessments of shrubland and associated landcover types. These data are not intended to be used at scales larger than 1:100,000. Time_Period_of_Content: Time_Period_Information: Range_of_Dates/Times: Beginning_Date: 1999 Ending_Date: 2003 Currentness_Reference: ground condition Status: Progress: Complete Maintenance_and_Update_Frequency: None planned Spatial_Domain: Bounding_Coordinates: West_Bounding_Coordinate: -124.705960 East_Bounding_Coordinate: -101.334773 North_Bounding_Coordinate: 50.921525 South_Bounding_Coordinate: 29.755954 Keywords: Theme: Theme_Keyword_Thesaurus: none Theme_Keyword: landcover Theme_Keyword: vegetation cover Theme_Keyword: sagebrush Theme_Keyword: shrubland Theme_Keyword: ecological system Place: Place_Keyword_Thesaurus: none Place_Keyword: Columbia Basin Place_Keyword: Southwest U.S. Place_Keyword: Idaho Place_Keyword: Oregon Place_Keyword: Washington Place_Keyword: Utah Place_Keyword: Colorado Place_Keyword: Nevada Place_Keyword: Arizona Place_Keyword: New Mexico Place_Keyword: Intermountain West Access_Constraints: none Use_Constraints: Appropriate scale for these data is 1: 100,000 smaller. The user assumes responsibility when using this dataset. Point_of_Contact: Contact_Information: Contact_Organization_Primary: Contact_Organization: USGS Snake River Field Station Contact_Person: Steve Hanser Contact_Address: Address_Type: mailing address Address: 970 Lusk St City: Boise State_or_Province: Idaho Postal_Code: 83706 Country: USA Contact_Voice_Telephone: 208-426-2892 Contact_Facsimile_Telephone: 208-426-5210 Contact_Electronic_Mail_Address: shanser@usgs.gov Data_Set_Credit: Steve Hanser USGS, Forest and Rangeland Ecosystem Science Center, Snake River Field Station, 970 Lusk Street, Boise, ID, 83706 Steve Knick USGS, Forest and Rangeland Ecosystem Science Center, Snake River Field Station, 970 Lusk Street, Boise, ID, 83706 Jon Hak Oregon Natural Heritage Program, 1322 SE Morrison Street, Portland, OR 97214-2423 Jimmy Kagan Oregon Natural Heritage Program, 1322 SE Morrison Street, Portland, OR 97214-2423 Southwest Regional GAP Project RS/GIS Laboratory, College of Natural Resources, UMC 5275, Utah State University, Logan, UT 84322-5275 NatureServe: NatureServe, 2400 Spruce St., Suite 201, Bolder, CO 80302 USGS/EROS Data Center: EROS Data Center, USGS, Sioux Falls, SD 57198 Native_Data_Set_Environment: Microsoft Windows 2000 Version 5.0 (Build 2195) Service Pack 4; ESRI ArcCatalog 9.0.0.535 Data_Quality_Information: Attribute_Accuracy: Attribute_Accuracy_Report: Model validation for this dataset was performed by testing model accuracy using a 20% withheld portion of the sample data. Results of validation will be available online at http://sagemap.wr.usgs.gov and are not included in this file. Logical_Consistency_Report: Not applicable for raster data Completeness_Report: All cells within the Columbia Basin (ID, WA, and OR) and Southwest (AZ, CO, NV, NM and UT) regional boundaries have an Value attribute. See Process_Description for more details. Lineage: Source_Information: Source_Citation: Citation_Information: Originator: United States Geological Survey, EROS Data Center, National Elevation Dataset Publication_Date: 19990101 Title: 30 Meter Digital Elevation Model Geospatial_Data_Presentation_Form: raster digital data Online_Linkage: http://ned.usgs.gov/ Type_of_Source_Media: digital tape media Source_Time_Period_of_Content: Time_Period_Information: Single_Date/Time: Calendar_Date: 19990101 Source_Currentness_Reference: publication date Source_Citation_Abbreviation: USGS Source_Contribution: A digital elevation model (DEM) obtained from the National Elevation Dataset (NED) in 1999 was used to generate the landform GIS dataset. Source_Information: Source_Citation: Citation_Information: Originator: United States Geological Survey, EROS Data Center, Multi-Resolution Land Characteristics Consortium Publication_Date: 1999-2001 Title: Landsat 7 , ETM+ Imagery Geospatial_Data_Presentation_Form: remote sensing image Online_Linkage: http://www.mrlc.gov/index.asp Type_of_Source_Media: digital tape media Source_Time_Period_of_Content: Time_Period_Information: Range_of_Dates/Times: Beginning_Date: 1999 Ending_Date: 2003 Source_Currentness_Reference: publication date Source_Citation_Abbreviation: USGS Source_Contribution: Landsat 7 ETM+ Imagery provided for Spring, Summer and Fall dates between 1999 and 2003 Source_Information: Source_Citation: Citation_Information: Originator: RS/GIS Laboratory, College of Natural Resources, Utah State University Publication_Date: 20040915 Title: 'PROVISIONAL' Digital Landcover Dataset for the Southwestern United States Edition: version 1.0 Geospatial_Data_Presentation_Form: raster digital data Online_Linkage: http://earth.gis.usu.edu/swgap/ Larger_Work_Citation: Citation_Information: Originator: USGS GAP Analysis Program Publication_Date: Unknown Title: Tentative title "Southwest Regional GAP Analysis Project Final Report." Type_of_Source_Media: digital tape media Source_Time_Period_of_Content: Time_Period_Information: Single_Date/Time: Calendar_Date: 20040915 Source_Currentness_Reference: publication date Source_Citation_Abbreviation: Southwest GAP Landcover Source_Contribution: Landcover for the Southwestern States (CO, UT, NV, AZ, and NM) Process_Step: Process_Description: Introduction: This project was a coordinated multi-institution endeavor. The USGS, Snake River Field Station coordinated activities to assure as much standardization as possible. Detailed documentation on process steps will be posted online at http://sagemap.wr.usgs.gov and included with individual mapping area datasets as they become available. The following provides a brief outline of the process steps. 1) Mapping area delineation: The Columbia Basin Region (Idaho, Oregon, and Washington) was divided into 11 ecologically and spectrally similar mapping areas. Bailey's (1995) and Omernik's (1987) ecoregions, Landsat TM imagery and existing landcover maps were used as a backdrop for digitizing boundary lines. Columbia Basin mapping area boundaries were matched with the SWGAP mapping areas across the Oregon/Idaho and Utah/Nevada borders and extended 50km into the SWGAP regional boundary in order to utilize SWGAP training data and facilitate edge matching. The responsibility for mapping was split with the USGS-Snake River field Station mapping the Idaho mapping areas with exception of the Owyhee mapping area and the Oregon Natural Heritage Program mapping Oregon and Washington. 2) Predictor layer preparation: Landsat 7 ETM+ images were selected from 1999-2003 for three seasons: spring, summer and fall. Scenes were selected for optimal representation of seasonal phenology, and minimal cloud cover. Landsat scenes were standardized using the MRLC 2001 Preprocessing Procedure and mosaicked for each mapping area. Thirty-meter digital elevation data, provided by the National Elevation Dataset (1999) were mosaicked for the region and subset for each mapping area. The digital elevation data was used to derive attributes such as aspect and landform for each mapping area. Each mapping area had a 2 km overlap with the adjacent mapping area, providing an overall 4 km overlap region between mapping areas. 3) Training sample collection: Approximately 128,000 samples were used for the 3-state region. Field surveys were conducted during the 2002 and 2003 field seasons and involved ocular estimates of biotic and abiotic characteristics, which were recorded on a field form, and subsequently entered into a database. Percent cover of dominant species for Trees, Shrubs, Grasses and Forbs were recorded, as were physical data such as elevation, slope and aspect. A GPS coordinate pair and a polygon was digitized using a laptop computer with TM imagery as a backdrop to record the location of each sample site. A cost analysis was performed by NatureServe to help maximize the sampling effort. A combination of distance to road and landcover composition from previous land cover maps was used to identify regions that were most efficient for data collection. Crews traveled to the regions identified by the cost model and then traversed all navigable roads in the area and opportunistically selected samples based on appropriate size and composition (representative) of stands. Samples were also obtained from other projects, from imagery, DOQ or aerial photo interpretation. In many of the mapping areas the additional data was primary source of data. Each sample location was assigned an appropriate landcover label. Natural and semi-natural vegetation classes were assigned a label based on the Ecological System concept developed by NatureServe. 4) Landcover modeling: The majority of natural and semi-natural landcover classes were modeled using a decision tree classifier. This was done using a custom interface for ERDAS Imagine (developed under contract by Earthsat, Corp. for USGS Eros Data Center) that facilitated the integration of the spatial modeling capabilities of Imagine with the decision tree/data mining capabilities of the See5 software (www.rulequest.com). Approximately 20 sub-samples were randomly selected from each sample site polygon, and were used as separate replicates within the decision tree classifier. These sub-samples were 'drilled' through the predictor layers to obtain training information for the decision tree classifier. The decision tree classifier was run using the See5 software with subsequent generation of decision tree 'ruleset'. The rules were then spatially applied to create a GIS dataset in *.img format. Choice of optimal predictor layers for each model was determined iteratively, through examination of the spatial output of the models and results of the model validation error matrices. In Oregon and Washington, an additional modeling step (Jennings et al. 2004) was used by the Oregon Natural Heritage Program to model shrub cover for each map zone in order to separate shrubland and steppe communities. Some landcover types were not mapped using the decision tree classifier (e.g. burn scars, water bodies, developed and agricultural areas, etc.). These classes were mapped using other techniques such as localized unsupervised classification or screen digitizing. 5) Model validation: Decision tree models were validated by generating initial models using 80% of available samples, while withholding 20% of samples. Withheld samples were randomly selected and stratified by cover class (i.e. proportion of withheld samples per cover class was the same for both the training set and the validation set). Withheld sample polygons were intersected through the spatially applied decision rules (i.e. landcover map) to create an error matrix, presenting users, producers and overall accuracies. The kappa statistic was also calculated for the error matrix. This validation approach does not explicitly present an accuracy of the map. This approach only provides a measure of the ability of the decision tree model to 'predict' landcover in geographic regions where samples were not used, and. Also of importance, for some classes that were modeled with the decision tree classifier, the number of withheld samples was small. Additionally, a small number of classes were not mapped using the decision tree classifier due to the relative rarity of occurrence. 6) Map refinement (by mapping area): The objective of the project was to produce the best map possible. Therefore, the next step was to generate a final decision tree model using 100% of the available sample data. This resulted in a GIS dataset (*.img format) containing all the modeled landcover classes. This dataset was generalized to the minimum mapping unit (MMU) of 1 acre using Imagine's CLUMP utility (4 connected neighboring pixels) and then Imagine's ELIMINATE utility with a minimum clump of pixels set to approximately 1 acre (5 pixels). The non-modeled landcover classes (e.g. developed, agriculture, water, etc.) were then superimposed over the generalized modeled landcover classes using a conditional statement with Imagine's graphical modeler. 7) Regional mosaic: Using the 4 km overlap region between mapping areas a cutline was used to edge-match adjacent mapping areas where landcover discontinuities resulted from the modeling process although the majority of the overlapping areas were consistent. The Columbia Basin region (ID, OR, and WA) and the Southwest Regional Gap Landcover Dataset were then mosaicked -utilizing a cutline and the 50km overlap between the regions. 8) Data formatting for distribution: The landcover modeling resulted in a final unsigned 16 bit *.img file. To make the data more practical for distribution, the 16 bit image was converted to ArcInfo grid format. Process_Date: 200509 Spatial_Data_Organization_Information: Direct_Spatial_Reference_Method: Raster Raster_Object_Information: Raster_Object_Type: Grid Cell Row_Count: 70865 Column_Count: 54012 Vertical_Count: 1 Spatial_Reference_Information: Horizontal_Coordinate_System_Definition: Planar: Map_Projection: Map_Projection_Name: Albers Conical Equal Area Albers_Conical_Equal_Area: Standard_Parallel: 29.500000 Standard_Parallel: 45.500000 Longitude_of_Central_Meridian: -96.000000 Latitude_of_Projection_Origin: 23.000000 False_Easting: 0.000000 False_Northing: 0.000000 Planar_Coordinate_Information: Planar_Coordinate_Encoding_Method: row and column Coordinate_Representation: Abscissa_Resolution: 30.000000 Ordinate_Resolution: 30.000000 Planar_Distance_Units: meters Geodetic_Model: Horizontal_Datum_Name: North American Datum of 1983 Ellipsoid_Name: Geodetic Reference System 80 Semi-major_Axis: 6378137.000000 Denominator_of_Flattening_Ratio: 298.257222 Entity_and_Attribute_Information: Detailed_Description: Entity_Type: Entity_Type_Label: shrubmap Attribute: Attribute_Label: ObjectID Attribute_Definition: Internal feature number. Attribute_Definition_Source: ESRI Attribute_Domain_Values: Unrepresentable_Domain: Sequential unique whole numbers that are automatically generated. Attribute: Attribute_Label: Value Attribute_Definition: Unique identifier for the landcover class. Attribute_Definition_Source: None Attribute_Domain_Values: Unrepresentable_Domain: Ecological System, NLCD, and Distrubed Class Values, See Land Cover Description Document Attribute_Value_Accuracy_Information: Attribute_Value_Accuracy: See Documentation Attribute: Attribute_Label: Count Attribute_Definition: Number of cells/pixels for each class. Attribute_Definition_Source: ESRI Attribute_Domain_Values: Unrepresentable_Domain: Numbers that are automatically generated. Attribute: Attribute_Label: Code Attribute_Definition: Ecological System Code Attribute_Definition_Source: NatureServe Attribute_Domain_Values: Codeset_Domain: Codeset_Name: Ecological Systems Codeset_Source: NatureServe Attribute: Attribute_Label: Label Attribute_Definition: Ecological System Name Attribute_Definition_Source: NatureServe Attribute_Domain_Values: Unrepresentable_Domain: Ecological System Name Overview_Description: Entity_and_Attribute_Overview: The following fields are present in the dataset: VALUE: Unique identifier for the landcover class. COUNT/HISTOGRAM: Number of cells/pixels for each class. CODE: Ecological System Code NAME: Ecological System Name Complete descriptions of each landcover class are available in the document titled: "Land Cover Legend: Current Distribution of Sagebrush and Associated Vegetation in the Columbia Basin and Southwestern Regions" Entity_and_Attribute_Detail_Citation: NatureServe. 2005. International Ecological Classification Standard: Terrestrial Ecological Classifications. NatureServe Central Databases. Arlington, VA. U.S.A. Data current as of 20 September 2005. See documentation for complete listing of attribution definitions: Land cover legend: Current distribution of sagebrush and associated vegetation in the Columbia Basin and southwestern states. Distribution_Information: Distributor: Contact_Information: Contact_Organization_Primary: Contact_Organization: USGS, Snake River Field Station Contact_Person: Steve Hanser Contact_Position: Wildlife Biologist Contact_Address: Address_Type: mailing address Address: 970 Lusk St City: Boise State_or_Province: Idaho Postal_Code: 83706 Country: U.S.A. Contact_Voice_Telephone: 208-426-2892 Contact_Facsimile_Telephone: 208-426-5210 Contact_Electronic_Mail_Address: shanser@usgs.gov Resource_Description: Landcover dataset Distribution_Liability: We make every effort to provide and maintain accurate, complete, usable, and timely information on our web sites. However, some USGS data and information accessed through these pages may, of necessity, be preliminary in nature and presented prior to final review and approval by the Director of the USGS. These data and information are provided with the understanding that they are not guaranteed to be correct or complete. Users are cautioned to consider carefully the provisional nature of these data and information before using them for decisions that concern personal or public safety or the conduct of business that involves substantial monetary or operational consequences. Conclusions drawn from, or actions undertaken on the basis of, such data and information are the sole responsibility of the user. Standard_Order_Process: Digital_Form: Digital_Transfer_Information: Format_Name: ArcInfo GRID format (ESRI) Format_Version_Number: Workstation ArcInfo 8.0.2 File_Decompression_Technique: Compression type *.zip. For windows use WinZip. Transfer_Size: 588.009 Digital_Transfer_Option: Online_Option: Computer_Contact_Information: Network_Address: Network_Resource_Name: http://sagemap.wr.usgs.gov Fees: none Metadata_Reference_Information: Metadata_Date: 20050926 Metadata_Contact: Contact_Information: Contact_Organization_Primary: Contact_Organization: USGS, Snake River Field Station Contact_Person: Steve Hanser Contact_Position: Wildlife Biologist Contact_Address: Address_Type: mailing address Address: 970 Lusk St. City: Boise State_or_Province: Idaho Postal_Code: 83706 Country: U.S.A. Contact_Voice_Telephone: 208-426-2892 Contact_Facsimile_Telephone: 208-426-5210 Contact_Electronic_Mail_Address: shanser@usgs.gov Metadata_Standard_Name: FGDC Content Standards for Digital Geospatial Metadata Metadata_Standard_Version: FGDC-STD-001-1998 Metadata_Time_Convention: local time Metadata_Access_Constraints: none Metadata_Use_Constraints: none Metadata_Security_Information: Metadata_Security_Classification_System: none Metadata_Security_Classification: unclassified Metadata_Security_Handling_Description: none Metadata_Extensions: Online_Linkage: http://www.esri.com/metadata/esriprof80.html Profile_Name: ESRI Metadata Profile Metadata_Extensions: Online_Linkage: http://www.esri.com/metadata/esriprof80.html Profile_Name: ESRI Metadata Profile Metadata_Language: English