The State of the Data tools are a set of python based geoprocessing tools that evaluate the quality of topographic feature data in terms of its completeness, logical consistency, positional accuracy, source lineage, temporal accuracy, and thematic accuracy.
The National Geospatial-Intelligence Agency (NGA) desires to maintain a full understanding of the adequacy of current data and product holdings. Areas where data and products meet or exceed requirements are distinguished from those where mission-relevant shortfalls exist. Analytic methodologies will be used to delineate those areas where mission-relevant data enrichment and production is required, whether due to pre-existing deficiencies, recent changes in world events, or changes in mission needs or priorities.
- Provide online, on-demand access to GEOINT
- Deliver data, products and services through customer Self-Service
- Operations in the Data-Rich Environment
- Content Producers → Content Managers
- Provide insight into data quality, mission / portrayal suitability, and fit-for-purpose
- Content Suitability Indicators
- Systematic approach using statistics and science to objectively evaluate data on a global scale
- Facilitates transparency of GEOINT through discoverability, accessibility and usability
This toolbox contains all current State of the Data analysis scripts as ArcGIS geoprocessing tools.
This toolbox is a subset of the tools in the State of the Data toolbox that have been designed to be run with feature services or enterprise geodatabase features and will output existing content suitability indicators.
- Identifies attribution deficiencies at the feature level for user defined attributes
- Identifies attribution deficiencies at the feature level based on mission or data requirements
- Identifies areas which may have missing features
- Line: total line length; Point: feature count; Surface: area
- Creates a statistical summary of the attribution assessment over user defined areas of interest
- Creates a statistical summary of the attribution assessment of TDS 6.1 data over user defined areas of interest
- Statistically summarizes the positional accuracy of a feature class based on feature level metadata
- Compares two corresponding line feature classes to:
- Match similar features
- Calculate their offset from each other
- Summarize those results over a given area of interest
- Compares two corresponding line feature classes (one of which is in the TDS 6.1 schema) to:
- Match similar features
- Calculate their offset from each other
- Summarize those results over a given area of interest
- Statistically summarizes the source of features in a feature class given a selected source attribute field
- Source field could be editor name, organization, product, etc.
- Statistically summarizes the source of features in the TDS 6.1 schema source given the values in the TDS source field
- Identifies features that may require updating due to currency and detected change
- Statistically summarizes the source date of features in a feature class given a selected source date attribute field
- Source date could be the last edit date, the feature creation date, the source data of the imagery from which the feature was derived from, etc.
- Statistically summarizes the source date of features in the TDS 6.1 schema given the values in the TDS source date field
- Comapres two outputs from Temporal Currency and ranks cells in terms
- Identifies areas where feature collection requirements properly correlate to population
- Performs a statistical summary of a feature subtype or selected attribute
The State of the Data tools use Python and Esri's arcpy library. In order for the tools to run, ArcGIS Desktop verision 10.4+ or ArcGIS Pro 1.2+ must be installed. If that condition is met, you should be able to clone this repo and run the tools as ArcGIS geoprocessing tools through ArcMap or ArcGIS Pro or as stand alone scripts.
- Derek Silva (Derek.A.Silva@nga.mil)
In aphabetical orther by last name:
- Gregory Brunner (gbrunner@esri.com)
- Andrew Chapkowski (achapkowski@esri.com)
- Tim Hollingshead (thollingshead@esri.com)
- Todd Wever (twever@esri.com)
This tool was developed at the National Geospatial-Intelligence Agency (NGA) in collaboration with ESRI. The government has "unlimited rights" and is releasing this software to increase the impact of government investments by providing developers with the opportunity to take things in new directions. The software use, modification, and distribution rights are stipulated within the MIT license.
All pull request contributions to this project will be released under the MIT or compatible license. Software source code previously released under an open source license and then modified by NGA staff is considered a "joint work" (see 17 USC § 101); it is partially copyrighted, partially public domain, and as a whole is protected by the copyrights of the non-government authors and must be released according to the terms of the original open source license.