-
Notifications
You must be signed in to change notification settings - Fork 15
GeMS Tools Documentation
GeMS Tools is an ArcGIS toolbox to facilitate working with the GeMS geologic map database schema.
To obtain and install GeMS Tools, see the README file at https://github.com/usgs/gems-tools-pro.
If you see a need to correct or improve this documentation, please feel free to edit this wiki.
These scripts are far from perfect. When things fail, here are a couple of things to look at.
- Check for unexpected file and directory locks. Have you Stopped Editing? Is there another ArcMap process lurking somewhere with a lock on your database? Quit everything Arc, open Windows Task Manager, check for orphan ArcMap and ArcCatalog process, and restart ArcMap.
- Some of the scripts appear to run into name-space issues that I (RH) don't understand. Quit ArcMap and try running the script from ArcCatalog.
Scripts .docx to DMU, Deplanarize CAF, DMU to .docx, and FGDC CSDGM2 Metadata are particularly problematic. MapOutline doesn't handle non-NAD27/NAD83 datums well.
Here is the current tool set. Click on a tool name to jump to its documentation.
Tool | Create and edit database | Cartography | Finalize database | Validate database |
---|---|---|---|---|
(re)Set ID values | X | |||
DOCX to DMU | X | |||
Attribute by Key Values | X | |||
Build Metadata | X | |||
Compact and Backup | X | X | ||
Create New Database | X | |||
Deplanarize CAF | X | |||
DMU to DOCX | X | X | ||
Geologic Names Check | X | X | ||
Inclination Numbers | X | |||
Make Polygons | X | |||
Make Topology | X | |||
MapOutline | X | |||
Project Map Data to Cross Section | X | |||
Relationship Classes | X | |||
Set PlotAtScale Values | X | X | ||
Set Symbol Values | X | X | ||
Symbol to RGB | X | |||
Topology Check | X | X | ||
Translate to Shapefiles | X | |||
Validate Database | X | X |
GeMS-style databases use _ID values as primary keys; these values are repeated as ID values in other tables where they serve as foreign keys to tie tables together. (re)Set ID values generates _ID values while preserving any links established by existing _ID and ID values. As an option, GUIDs may be substituted for plain-text _ID and ID values.
This script modifies the input geodatabase. Make a backup copy (with Compact and Backup) before you run it!
Parameter | Explanation | Data Type |
---|---|---|
Input_GeMS-style_geodatabase | The geodatabase for which _ID values are to be created or recreated. Must exist. May be file geodatabase (.gdb) or personal geodatabase (.mdb). | Workspace |
Use_GUIDs (Optional) | Default is unchecked (false),which creates _ID values as several characters which denote the table (e.g., MUP for MapUnitPolys) followed by consecutive zero-padded integers: MUP0001, MUP0002, MUP0003, etc. If checked, creates GUIDs (Globally-Unique IDs which are many-byte nonsense strings) for _ID values. | Boolean |
Do_not_reset_DataSource_IDs | If unchecked, resets values of DataSources_ID and all DataSourcesID, LocationSourceID, AnalysisSourceID and similar that refer to DataSources_ID. Default is checked, which leaves these values unchanged. | Boolean |
Reads contents of a Description Of Map Units Microsoft Word document formatted according to USGS Pubs template MapManuscript_v3-1_06-22.dotx, calculates HierarchyKey, and partially fills in a GeMS-style DescriptionOfMapUnits table. If the workspace does not have a DescriptionOfMapUnits table, one will be created. HierarchyKey values are calculated based on paragraph styling and indentation. If a table already exists, the tool will attempt to match updates with existing rows.
DMU paragraphs in the manuscript must be formatted with 'DMU' styles, for example:
-
DMU-HeadingN
(number, N, may up to 5) DMU Headnote - 1 line
DMU Headnote - More Than 1 line
DMU Headnote Paragraph
DMU Unit 1 (1st after heading)
-
DMU Unit N
(number, N, may be up to 5)
The paragraph content will be parsed into the table in the following ways:
- Heading text will be saved in Name.
- Headnote text will be saved in Description, with or without formatting tags as desired. See below and checkbox help.
- The unit label will be saved in MapUnit and, optionally, in Label with formatting tags. See below and checkbox help.
- The unit name will be saved in Name.
- The unit age will be saved in Age.
- The unit description will be saved in Description, with or without formatting tags as desired. See below and checkbox help.
Note that not all styles in 'DMU_template.docx' in the Resources folder of this toolbox are recognized by this tool, in particular, paragraph styles such as DMU NoIndent
, DMU Quotation
, and DMU - List Bullet
are ignored.
Regarding text formatting tags, ArcGIS Pro recognizes two flavors of tags depending on where the text is intended to be displayed. For labelling and annotation, ArcGIS text formatting tags are recognized while HTML tags are used in pop-ups. The tool assumes that all text in Description would only be viewed in pop-ups whereas the text in Label will likely be used for labeling but might also appear in pop-ups. For Description there is only the choice to have character styling converted to HTML tags while both options exist for Label text. Only a small set of either ArcGIS Text Formatting Tags or HTML tags are written by the tool:
- ArcGIS only:
<fnt>
,<bol>
,<ita>
,'size'
,'italic'
,'style'
and'wght'
attributes in<fnt>
- HTML only:
<span>
,<strong>
,<em>
,'font-weight'
and'font-style'
attributes in<span>
- Both ArcGIS and HTML:
<sup>
,<sub>
Parameter | Explanation | Data Type |
---|---|---|
DMU manuscript file | Path to the Word document. | File |
Geologic map geodatabase | GeMS-style database. DescriptionOfMapUnitst table DMU table may be empty, partly complete or not exist at run time. | Workspace |
Length of HierarchyKey segments | All segments less than this number will be padded to the left with zeros. It is best practice to always pad to at least 3 spaces (the default) to try to avoid the value ever being interpreted as a date datatype. | Integer |
Add ArcGIS tags to Label (for annotation) | Convert character styling of the unit label in the Word doc into ArcGIS text formatting tags and saved in Label. | Boolean |
Add HTML tags to Label (for pop-ups) | Convert character styling of the unit label in the Word doc into HTML formatting tags and saved in Label. | Boolean |
Add HTML tags to Description (for pop-ups) | Convert character styling of the unit description in the Word doc into HTML formatting tags and saved in Description. | Boolean |
- docx (https://python-docx.readthedocs.io/en/latest/) included with toolbox in folder
Scripts\docx
Attribute By Key Values steps through an identified subset of feature classes in the GeologicMap feature dataset and, for specified values of an independent field, calculates values of multiple dependent fields. It is useful for translating single-attribute datasets into GeMS format, and for using GeMS to digitize in single-attribute mode.
Many geologic map database schemas have characterized features--especially lines--with a single attribute such as"contact--approximate". GeMS characterizes lines by multiple attributes, such that
LTYPE="contact--approximate"
might, depending on map scale and the author’s intentions, translate to
Type="contact”
IsConcealed="N"
LocationConfidenceMeters=150
ExistenceConfidence="certain"
IdentifyConfidence="certain"
Symbol="01.01.03"
This tool simplifies the translation from such schemas into GeMS.
Parameter | Explanation | Data Type |
---|---|---|
Input_geodatabase | An existing GeMS-style geodatabase with a GeologicMap feature dataset. | Workspace |
Key_Value_file | A pipe ( | ) -delimited text file that describes mapping from unique values of an independent attribute to values of multiple dependent attributes. See file Dig24K_KeyValues.txt (should be located in folder GeMS_Toolbox/Resources) for an example with format instructions. | Text File |
Attribute By Key Values requires an accessory keyvalue file. You must create a plain-text file (use gedit, Notepad or Wordpad, save as .txt) that looks like:
ContactsAndFaults
LTYPE|Type|LocationConfidenceMeters| ExistenceConfidence| IdentityConfidence| Symbol
contact|contact| 20| certain| certain| 01.01.01
contact--approximate|contact| 150| certain| certain| 01.01.03
…
OrientationPoints
STYPE|Type|LocationConfidenceMeters|OrientationConfidenceDegrees|IdentityConfidence|Symbol
bedding|bedding| 20| 5| certain| 06.01.01
…
An example keyvalue file is provided with the toolbox, in the folder GeMS_Tools\Resources. Important aspects of the keyvalue file are:
- It contains one or more sets of header lines, such as lines 1, 2, and 6, 7 above. Header lines come in pairs: the first line identifies a feature class within the GeologicMap feature dataset. The second line names the independent attribute within that feature class and then the dependent attributes whose values will be calculated based upon the independent attribute.
- All lines after a pair of header lines are definitions for the respective values specified in that header line, until the next pair of header lines is encountered.
- Values within lines are separated by the pipe symbol (|). This permits the use of commas within values. Trailing (or leading) spaces are acceptable, but not required.
This file can also be created and edited with a spreadsheet program (e.g., LibreOffice Calc, Microsoft Excel), saving as a .csv file and setting the delimiter to “|”.
Attribute By Key Values will:
-
Read the keyvalue file for instructions
-
Open the GeMS-format database and calculate unassigned feature attributes (values of , zero-length string, or zero) based on key values (LTYPE, STYPE, or other). Note that if any attributes have already been assigned they will NOT be changed WITH THE EXCEPTION OF NUMERIC FIELDS WITH ZERO (0) VALUE. This allows you to somewhat easily override the default attributes
-
Write short messages to the script output window if unknown key values are encountered
Tool Attribute By Key Values can be run multiple times during the course of building a geodatabase (recommended) or just once at the end.
Build Metadata helps elaborate FGDC CSDGM metadata for a GeMS-style geodatabase. The database can be an ArcGIS file geodatabase or a geopackage.
A fully GeMS-compliant database contains the information required for many elements in CSDGM metadata, but which must nonetheless be transcribed into a separate file. Likewise, definitions of GeMS-required tables and fields can be found in the GeMS schema publication but are also required in the metadata. This tool tries to automate the creation of as many of those elements as possible and also adds boilerplate language appropriate to the schema.
Note that this ArcGIS Pro tool does not use the same workflow as the three ArcMap metadata tools and the output is different. The result of running all of the ArcMap tools is a separate metadata record for each table in the database and the metadata are imported back into the embedded ArcGIS metadata. Instead, this tool writes out one database-level record wherein all feature classes and tables are enumerated and defined. This is more in-line with how metadata describing USGS data releases that consist of several related tables are being published. Though this makes for longer metadata records, there is only one to have to look through and keep associated with the database. Furthermore, because the GeMS submittal process does not require embedded metadata (and is moot if submitting a geopackage), this tool forgoes that step. But let us know if you think a tool like that would be useful or would like to add one of your own creation to the toolbox.
Dependencies
Because of a bug in ArcGIS Pro, this tool relies on the open-source GDAL library to collect some of the metadata required by the CSDGM standard for geospatial data. This library is installed with ArcGIS Pro so no extra installation is necessary but you will likely need to configure one environment variable. If you get an error when running the tool that advises you to set PROJ_LIB
to the location of proj.db, follow the steps below:
- Locate proj.db and copy the path to the folder it is in. It will probably be within the ArcGIS Pro installation folder at
ArcGIS\Pro\Resources\pedata\gdaldata
but could be somewhere else. - In the Windows Search box on the taskbar, type 'environment variable' and click on the result that reads 'Edit environment variables for your account' (you can 'Edit the system environment variables' if you like but you will probably need admin privileges).
- Click New and use
PROJ_LIB
for the Variable name and the path you copied in step 1. for the Variable value. - Close all open dialogs and re-start ArcGIS Pro.
Use
To use this tool, first decide if ArcGIS embedded metadata is to be used as the starting point for building the rest of the metadata or if metadata are to be created from scratch. This only applies to file geodatabases as ArcGIS metadata cannot be stored inside geopackages. Note that feature class and table-specific metadata will not be exported; only the metadata embedded in the top-level file-geodatabase container.
If you have added non-GeMS tables or fields to the database, you must provide Entity, Attribute, and Domain definitions and definition sources in the metadata for those items. You can do this in a metadata editor after running the tool or you may provide a path at runtime to a file in which the definitions are stored. The definitions must be formatted in python dictionaries. Examples are provided in my_definitions.py
in the Resources folder of the toolbox.
To either the embedded or built-from-scratch metadata, the tool will add:
- sources from DataSources to Source Information elements (depending on some choices outlined below)
- Entity, Attribute, and Domain Definitions and Definition Sources from built-in GeMS defintions and, if specified, a custom definitions file
- a Bounding Coordinates element built from the maximum Bounding Coordinates of
MapUnitPolys
andContactsAndFaults
- a Spatial Data Organization Information element (which ArcGIS Pro does not export)
- a Spatial Reference element derived from
MapUnitPolys
. There can be only one spatial reference section in CSDGM2 metadata so other spatial references, of basedata or cross sections for example, will be ignored. - GeMS-related text to Supplemental Information, Attribute Accuracy, and Horizontal Positional Accuracy Report elements
If there are missing Entity, Attribute, or Domain definitions, you may choose to leave them blank or replaced with a flag, "MISSING", so that you can find them in a text editor. But if you are using a validating metadata editor, such as Metadata Wizard, those occurrences of "MISSING", though meaningless for the metadata, will not be considered invalid.. It may be best to leave missing definitions blank so that they can be flagged as errors. Metadata Wizard, at least, will color those empty text entry boxes red so you can easily see what is still required.
You can export the metadata at this point and fill in other required sections in a metadata editor or you can specify a path to a template metadata file to which the metadata generated so far should be added.
When working in ArcGIS, every geoprocessing task run on a file geodatabase is recorded as a Process Step in Lineage. This section of the metadata is arguably better used for recording less granular steps in the worklow, so if you find this level of detail distracting, you can choose to have the steps removed completely or replaced by process steps recorded in the template metadata.
For validation of the exported metadata, the tool sends the file to the USGS Geospatial Metadata Validation Service API. The API re-orders any out-of-order elements, re-writing the XML file in the process, and outputs an error log and, if chosen, a more human-readable version of the metadata.
At this point, open the output xml file, which will be in the same folder as the source database, in your favorite metadata editor and fill in the blanks, validating either in the application or with the USGS metadata service until you get compliant metadata. It is NOT recommended to finalize the metadata in ArcGIS. On import, ArcGIS will convert the FGDC-CSDGM metadata into ESRI metadata and though you can edit individual CSDGM elements, upon export, you will likely not get the results you expect.
Finally, have the record reviewed by a skilled metadata reviewer.
Parameter Label | Parameter Name | Explanation | Data Type |
---|---|---|---|
GeMS database | dataset | GeMS-compliant file geodatabase or geopackage | path to dataset |
Start with embedded metadata? | embedded_metadata | Should metadata embedded in a file geodatabase be exported as the starting document to which GeMS metadata will be added? Boolean. False by default. Optional. | boolean |
Custom Definition file | my_defs_file | Path to a python (.py ) file storing dictionaries of definitions of non-GeMS tables and fields. See my_definitions.py in the resources sub-folder of the GeMS Tools folder as an example. Optional. | path to file |
template | Template metadata | Path to an XML file with reusable boilerplate language that will be added to the output metadata. This file will not be overwritten. Optional. | path to file |
Sources | sources | Which set(s) of sources should be saved in the output metadata? Sources can come from the DataSources table, the Data Source elements in the embedded metadata, and/or the Source_Information elements in the template metadata. Choices, depending on previous parameter choices, are:
|
string |
History (process steps) | history | Which record of processing history should be saved in the output metadata? History can come from the Geoprocessing History elements in the embedded metadata or Process_Step elements in template metadata. Choices are:
|
string |
Missing definitions and sources | missing | How should the Definition and Definition_Source elements for Entities (tables) and Attributes (fields) be filled out if no definition is found? Choices are:
|
string |
Export .txt version of metadata? | export_text | Should a text (more human readable) version of the output metadata by exported? Boolean. False by default. Optional. | boolean |
Compact and Backup compacts a database and copies it to an archive version. The archive version is named geodatabasename_current date. Multiple backups in a single day will have suffixes a, b, c, etc.
Parameter | Explanation | Data Type |
---|---|---|
Input_geodatabase | The geodatabase which will be compacted and then backed up | Workspace |
Message_for_log_file (Optional) | Optional message to be written, with timestamp and username, to file 00log.txt inside the geodatabase directory. If this is a personal geodatabase, the write should fail gracefully and this message will not be recorded. | String |
Create New Database creates a new GeMS-style geodatabase.
Note that with default settings this tool creates only the minimum required feature dataset, feature classes, and tables. Check the appropriate boxes to add OrientationPoints, GeologicLines, etc. See the GeMS documentation for the purposes of these optional elements. Change the number of cross sections to 1 (or more) to create feature dataset(s) for cross sections.
This tool may take several minutes to run.
Parameter | Explanation | Data Type |
---|---|---|
Output_Workspace | Name of a directory. Must exist and be writable. | Folder |
Name_of_new_geodatabase | Name of a file or directory to be created; must not exist in output workspace. Use .gdb extension to create a file geodatabase, .mdb extension to create a personal geodatabase. If no extension is given, will default to .gdb. | String |
Spatial_reference_system | May select an ESRI projection file, import a spatial reference from an existing dataset, or define a new spatial reference system from scratch. | Coordinate System |
Optional_feature_classes_tables_ and_feature_datasets (Optional) | Select items from this list as needed. Note that if you later discover you need an additional feature class, table, or feature dataset, you may 1) define this element from scratch in ArcCatalog, or 2) run this tool again, creating a new geodatabase with the same spatial reference system, and creating the needed feature class(es), table(s), or feature dataset(s). Then copy and paste the additional elements into your existing geodatabase. | Multiple Value |
Number_of_cross_sections | An integer in the range 0 to 26. | Long |
Enable_edit_tracking | If checked, enables edit tracking on all feature classes. Adds fields created_user, created_date, last_edited_user, and last_edited_date. Dates are recorded in database (local) time. Default is checked. This parameter is ignored if the installed version of ArcGIS is less than 10.1. | Boolean |
Add_fields_for_cartographic_ representations | Default is unchecked. If checked, adds a representation--fields RuleID and Override--and representation rules to feature classes ContactsAndFaults, GeologicLines, and OrientationPoints, and equivalent feature classes in any CrossSection feature datasets that are created. Also adds coded-value domains that tie RuleID values (consecutive integers) to FGDC symbol identifiers (e.g., 1.1.7 for a dotted contact). The representation rules (aka symbols) are from the Arizona Geological Survey and are a subset of the FGDC symbology with a few additional symbols. Rules and coded-value domains are copied from a geodatabase and .lyr files in the Resources\CartoRepsAZGS directory within the GeMS toolbox directory. | Boolean |
Add_LTYPE_and_PTTYPE | If checked, adds LTYPE field to ContactsAndFaults and GeologicLines and adds PTTYPE field to OrientationPoints. Useful for digitizing purposes, or for ingesting ALACARTE-style data sets. Default is unchecked. With the use of Feature Templates for digitizing, values of LTYPE may be helpful proxies for clusters of Type - IsConcealed - LocationConfidenceMeters - ExistenceConfidence - IdentityConfidence - Symbol values. | Boolean |
Add_standard_confidence_values | Default is checked. If checked: 1) Attaches standard values of "certain" and "questionable" as a coded-value domain to all ExistenceConfidence, IdentityConfidence, and ScientificConfidence fields. 2) Adds definitions and definition source for "certain" and "questionable" to the Glossary table. 3) Adds the definition source (FGDC-STD-013-2006) to the DataSources table. The use of these particular values, or of only 2 values for confidence, is not required. You may use other values, but they must be defined in the Glossary table. | Boolean |
Deplanarize CAF removes excess nodes from arcs in the ContactsAndFaults feature class of the GeologicMaps feature dataset of a GeMS-style geodatabase. Note: This script has not been extensively tested. PLEASE back up your geodatabase before running it. Examine the results for correctness.
Arcs are IDENTITYed with MapUnitPolys to ascertain their bounding MapUnits. Arc-end points are created, labeled with XY, and sorted by XY to identify nodes. If 2 arcs meet at a node and have identical values for attributes Type, IsConcealed, ExistenceConfidence, IdentityConfidence, LocationConfidenceMeters, DataSourceID, Label, and Notes, they are merged. If 3 arcs meet at a node, the HierarchyKey values of the bounding map units are used to determine which pair of arcs bound the youngest polygon and thus should be continuous across that node if their values of attributes Type, IsConcealed, ExistenceConfidence, IdentityConfidence, LocationConfidenceMeters, DataSourceID, Label, and Notes are identical. If 4 arcs meet at a node, one of the arcs should be IsConcealed = 'Y'. The remaining 3 arcs are then treated as a 3-arc node.
Input geodatabase is assumed to have a GeologicMap feature dataset that contains feature classes ContactAndFaults and MapUnitPolys and a DescriptionOfMapUnits table.
This script will fail if the geodatabase has a Topology class that involves ContactsAndFaults.
Feature classContactsAndFaults is assumed to have attributes Type, IsConcealed, ExistenceConfidence,IdentityConfidence, LocationConfidenceMeters, DataSourceID, Label, Notes, ContactsAndFaults_ID, and Symbol. Feature class MapUnitPolys is assumed to have attribute MapUnit. Table DescriptionOfMapUnits is assumed to have attributes MapUnit and HierarchyKey. HierarchyKey must be populated and populated correctly.
Nodes are named by their XY coordinates recorded to within 0.01 map units. We assume no nodes are so close that they have the same name.
Parameter | Explanation | Data Type |
---|---|---|
Input_geodatabase | Should be a GeMS-style geodatabase. | Workspace |
Reads a GeMS-style DescriptionOfMapUnits table and creates a Microsoft Word .docx file formatted according to USGS Pubs template MapManuscript_v3-1_06-22.dotx. The table need not be in a geodatabase. It can be in any format that ArcGIS Pro can load as a table in a map such as a CSV, Geopackage table, or Excel spreadsheet.
You may specify the Word document styles to use in ParagraphStyle or let the tool try to determine the appropriate styles based on the table attributes.
When specifying ParagraphStyles in the table, use the following:
-
DMU-HeadingN
(number, N, may up to 5) DMU Headnote - 1 line
DMU Headnote - More Than 1 line
DMU Headnote Paragraph
DMU Unit 1 (1st after heading)
-
DMU Unit N
(number, N, may be up to 5)
See 'DMU_template.docx' in the Resources folder of this toolbox for examples of when to use these styles and a few other styles that can be used in special cases.
When letting the tool determine the style, the following conditions will be evaluated:
- Any row with a Name value but no MapUnit value will be considered a heading
- Any row with a Description value but no MapUnit or Name value will be considered a headnote
- Any row with a value in MapUnit will be considered a unit
- Make sure your HierarchyKey values sort appropriately and encode the parent-child relationships you intend to get the results you expect in the Word document.
You may choose to have the tool try to convert text formatting tags in Label, Name, and Description but only a small set of either ArcGIS Text Formatting Tags or HTML tags are recognized by the tool:
- ArcGIS only:
<fnt>
,<bol>
,<ita>
,'size'
,'italic'
,'style'
and'wght'
attributes in<fnt>
- HTML only:
<span>
,<strong>
,<em>
,'font-weight'
and'font-style'
attributes in<span>
- Both ArcGIS and HTML:
<sup>
,<sub>
Parameter | Explanation | Data Type |
---|---|---|
DMU table | APath to the GeMS-style DescriptionOfMapUnits table or the name of the table view of a DMU table in the map. Standalone files can be any format that ArcGIS Pro can load as a table such as CSV, Geopackage, or Excel spreadsheet. | table view or standalone file |
Output folder | Path to the output directory. | Folder |
Output filename | Name of output file with or without .docx extension |
String |
Calculate paragraph styles | Attempt to determine the paragraph style based on attributes of the DMU. | Boolean |
Use Label field for unit abbreviations | Try to convert HTML formatting tags in Description to Word character styles | Boolean |
Attempt to preserve formatting tags in Description field | If checked, creates a List of Map Units which omits map unit descriptions. | Boolean |
Open document when done | Open the resulting document in Word in editing mode when finished. | Boolean |
- docx (https://python-docx.readthedocs.io/en/latest/) included with toolbox in folder
Scripts\docx
- BeautifulSoup - installed in default ArcGIS Pro miniconda environment
Geologic Names Check automates some of the steps in a geologic names review as required by USGS publication policy. It searches within the DescriptionOfMapUnits table for names and usages found in the U.S. Geologic Names Lexicon (Geolex) and provides a report template in spreadsheet form for the author and reviewer to use during the review process. The tool reports the Geolex names found within the map unit name, the usages associated with those names, and whether or not the author's choice of geographic extent matches that found in Geolex. Comparisons of age and status (formal vs informal) are not at this time considered.
Parameter | Explanation | Data Type |
---|---|---|
DMU_Table | a GeMS-compliant DescriptionOfMapUnits table. Must contain the fields HierarchyKey, MapUnit, Fullname, and Age, but they may be in any order, need not be CamelCase, and there are no special constraints on how the values are formatted. Values in Name are searched for Geolex names. If no Geolex name is found, Fullname is searched as well. | Table |
States_extent | One or more (comma separated) state or territory abbreviations. Examples are "WA", or "ID,OR,WA" | String |
open_report_when_completed? (optional) | Should the Excel report file be opened when the script has finished running? | Boolean |
More info at Filling out the Geologic Names Report
Creates annotation feature class OrientationPointLabels with dip and plunge numbers for appropriate features within OrientationPoints. Adds a layer representing the new annotation feature class to your map composition.
Note that this script invokes a definition query using PlotAtScale values. These must exist! If the output layer appears to be empty, open layer properties and delete the definition query. Do you see inclination numbers? Good. Run Set PlotAtScales and then rerun Inclination Numbers
If this script fails because of locking issues, try (a) Stop Editing, or (b) save any edits and save the map composition, exit ArcMap, and restart ArcMap. Maybe the script will then run satisfactorily.
Parameter | Explanation | Data Type |
---|---|---|
Feature_dataset | The feature dataset with class OrientationPoints for which inclination annotation is to be created. | Feature Dataset |
Map_scale_denominator | Denominator of the map scale fraction. For a 1:24,000-scale map, enter 24000. | Double |
Inclination Numbers calls function isPlanar, defined in GeMS_utilityFunctions.py:
returns True if orientationType is a planar (not linear) feature
def isPlanar(orientationType):
planarTypes = ['joint','bedding','cleavage','foliation','parting']
isPlanarType = False
for pT in planarTypes:
if pT in orientationType.lower():
isPlanarType = True
return isPlanarType
Make Polygons:
- Creates (or recreates) feature classMapUnitPolys from lines in ContactsAndFaults, excluding lines for which IsConcealed='Y'
- Attributes polygons using temporary label points created from any polygons in the pre-existing MapUnitPolys and any label points in feature class MapUnitPoints
- Flags (a) polygons with multiple, conflicting, label points, (b) multiple, conflicting label points within a single polygon, (c) polygons with blank or null MapUnit values, and (d) contacts that separate polygons of the same MapUnit. These errors are written to feature classes errors_multilabelPolys errors_multilabels, errors_unlabeledPolys, and errors_excessContacts
- IDENTITYs the new MapUnitPolys with a temporary copy of the old MapUnitPolys. Any polygons--or fragments of polygons--that have changed their MapUnit are saved in feature class edit_ChangedPolys
- Optionally, saves old feature class MapUnitPolys to MapUnitPolysNNN, where NNN is a successively higher integer
- When used within ArcMap, selecting a MapUnitPolys layer will cause the data source of the layer to be changed to the newly created polygon feature class with no other change in layer properties (e.g., symbolization)
Note that if you remove a contact or fault that separates two existing polygons with different MapUnit values and run Make Polygons, the resulting multi-(temporary) label polygon is correctly flagged as an error. Re-running Make Polygons will (incorrectly) cause this error to disappear! Moral: Run Make Polygons once and FIX THE PROBLEMS.
While running, Make Polygons writes (overwrites) and deletes temporary feature classesxxxlabel, xxxpolys, and xxxtlabels, all within the GeologicMap feature dataset.
Parameter | Explanation | Data Type |
---|---|---|
input_geodatabase | An existing GeMS-style geodatabase. | Workspace |
Save_old_MapUnitPolys (Optional) | If checked, saves old MapUnitPolys feature class to feature class MapUnitPolysNNN, where NNN is a successively higher zero-padded integer. Default is checked (true). | Boolean |
Saved-layer_directory (Optional) | Directory in which .lyr files are saved for any map layers with sources MapUnitPolys, errors_excessContacts, errors_multilabelPolys, errors_multilabels, and errors_unlabeledPolys. These .lyr files are deleted when script completes. Must have write permission. Default is the directory that hosts the input geodatabase. | Folder |
Label_points_feature_class (Optional) | An optional point feature class with attribute MapUnit (and perhaps other attributes), which may used to label polygons. Familiar to those who used workstation ArcInfo, in which such features were necessary. ArcGIS does not require label points; polygons can be created and attributed without them. | Feature Class |
Creates and validates a topology feature class within a GeMS-style feature dataset. The new topology class is named GeologicMap_Topology (for the GeologicMap feature class) or xxx_Topology (for all other feature classes, where xxx is the prefix for the ContactsAndFaults-equivalent feature class within that feature dataset). Any existing topology with this name will be deleted. The input feature dataset should contain feature classes xxxContactsAndFaults and xxxMapUnitPolys (where xxx may be null).
Esri topology rules applied are:
- Must Not Overlap (Line)
- Must Not Self-Overlap (Line)
- Must Not Self-Intersect (Line)
- Must Be Single Part (Line)
- Must Not Have Dangles (Line)
- Must Not Overlap (Area)
- Must Not Have Gaps (Area)
- Boundary Must Be Covered By (Area-Line)
Parameter | Explanation | Data Type |
---|---|---|
Input_feature_dataset | Dataset | |
use_MUP_rules | Default = checked. If checked, adds rules that involve MapUnitPolys feature class (no gaps, no overlaps, boundaries must be covered by ContactsAndFaults). In some cases it is useful to build a topology that does not incorporate MapUnitPolys, most commonly so that this topology need not be deleted before (re)making polygons. | Boolean |
MapOutline calculates a map boundary and tics for rectangular (in latitude-longitude space) areas. Locations of boundary and tics are projected to the specified output coordinate system.
Locations of boundary and tics may be specified in either NAD27 or NAD83, independently of the output datum and projection. This may be useful if, for example, it is desirable that a map boundary coincides with that of a published USGS quadrangle map (most of which are located at even latitude-longitude values in NAD27) but because the mapper is working with GPS coordinates or dense lidar data, both of which are commonly in NAD83-based projections, the map database should be in a NAD83-based projection.
Output feature classes MapOutline and Tics are written to the top level of the output geodatabase. Any existing feature classes with these names will be overwritten.
Parameter | Explanation | Data Type |
---|---|---|
SE_longitude | Longitude of SE corner of map rectangle. May be in decimal degrees (-120.625), degrees - decimal minutes (-120 37.5) or degrees - minutes - seconds (-120 37 30). West longitudes (all of US) are negative. | String |
SE_latitude | Latitude of SE corner of map rectangle. May be in decimal degrees (42.375), degrees - decimal minutes (42 22.5) or degrees - minutes - seconds (48 22 30). South latitudes (none in US) are negative. | String |
width_(longitudinal_extent) | E-W extent of map rectangle. Values <= 5 are assumed to be in degrees. Values >5 are assumed to be in minutes. Enter 600 to obtain a 10 degree extent. | Double |
height__(latitudinal_extent) | N-S extent of map rectangle. Values <= 5 are assumed to be in degrees. Values >5 are assumed to be in minutes. Enter 600 to obtain a 10 degree extent. | Double |
tic_spacing | Value is in decimal minutes | Double |
Is_NAD27 | Check to locate map boundary and tics at NAD27 lat-long positions. Uncheck to locate at NAD83 positions. | Boolean |
output_geodatabase | Must be an existing, writable geodatabase. Please do not select a feature dataset or feature class (though ArcGIS will let you do so). | Workspace or Feature Dataset |
output_coordinate_system | Browse to select coordinate system from ArcGIS-provided coordinate system definitions, to import a coordinate system from an existing data set (RECOMMENDED) or define a coordinate system from scratch. | Coordinate System |
scratch_workspace | Directory that is writable. Files xxxbox.csv, xxxtics.csv, xxx1.dbf, xxx1.dbf.xml will be written to this directory and then deleted. Existing files with these names will be lost! | Folder |
GeMS_ProjectCrossSectionData.py
Project Map Data toCross Section generates backdrop feature classes useful in constructing a geologic cross section. Inputs include the GeologicMap feature dataset of a GeMS-style geodatabase, a cross-section line (feature class, feature layer, or selection), and a DEM. The cross-section line need not be straight.
Final output is written to feature dataset CrossSectionxx,where xx is Output_name_token. This feature dataset will be created if it does not exist.
By default, all feature classes in the GeologicMap feature dataset are projected into the cross section feature dataset, with the exception of feature classes whose names begin with errors_ and ed_. Alternately, you may choose to project specified feature classes.
-
Polygons are projected to line segments that follow the topographic profile of the section line. Projected lines for MapUnitPolys (or any other polygon feature class that spans the entire section line) constitute the topographic profile.
-
Lines are projected to short vertical line segments located at the point where lines cross the section line. Line segments for ContactsAndFaults are below the topographic profile. All other line feature classes are projected to line segments above the topographic profile.
-
Points are projected to points at the appropriate elevation in the vertical plane of the section line.
For point feature classes the distance from the section line and the local azimuth of the section line at the projection point are recorded in fields DistanceFromSection and LocalCsAzimuth. If a point feature class has fields Azimuth and Inclination (e.g., OrientationPoints), new fields ApparentDip, Obliquity (angle between Azimuth and LocalCsAzimuth), and PlotAzimuth (rotation field for symbolization) are calculated.
Output feature classes are named ed_CsxxInputFeatureClass, where xx is Output_name_token. Existing feature classes with these names will be overwritten.
This script also creates empty GeMS feature classes CSxxContactsAndFaults, CSxxMapUnitPolys, and CSxxOrientationPoints if they are not already present in the output feature dataset. You may find it useful to load data from the appropriate ed_CSxxxx feature class into these classes.
Parameter | Explanation | Data Type |
---|---|---|
GeMS-style_geodatabase | An existing GeMS-style geodatabase. May be .gdb or .mdb. Must have a GeologicMap feature data set. | Workspace |
Project_all_features_in_GeologicMap | Default = true (checked). If checked, all point, line, and polygon feature classes within GeologicMap feature dataset (except for certain edit classes) are projected into cross section. If checked, the Feature_classes_to_Project parameter is ignored. If unchecked, you should specify which feature classes should be projected with the Feature_classes_to_Project parameter | Boolean |
Feature_classes_to_Project (Optional) | Used only if Project_all_features_in_GeologicMap is False (unchecked). Specify which point, line, and polygon feature classes to project. Note that the tool will happily project feature classes that are not within the GeologicMap feature dataset. This may not be meaningful! | Multiple Value |
DEM | Digital elevation model that encompasses section line and buffered selection polygon. Z units should be equal to XY units of GeologicMap feature dataset. | Raster Dataset |
Section_line | Feature class, feature layer, or selection that contains ONLY ONE element. Section line need not be straight. Zigs are OK and you can make a section along a stream course. Points to be projected onto section line are selected by buffering the section line with FLAT line ends. To project points that are beyond the ends of the section line, extend the section line. Points are projected to nearest point on section line using LocateFeaturesAlongRoutes. | Feature Layer |
Start_quadrant | Where does section start? (This may not control the section orientation. If you don't get the results you expect, try flipping the section line.) | String |
Output_name_token | Short text token used to name output feature dataset and feature classes. The output feature dataset will be named CrossSection Output_name_token. If this feature dataset does not exist it will be created. Output feature classes will be named ed_CSOutput_name_tokenInput_featureclass_name Suggested values are A, B, C, ... If the cross section has vertical exaggeration that is not 1, set the token to B5x, or C10x, or ... | String |
Vertical_exaggeration | Default is 1. If you change to another value, suggest you incorporate this value in the output name token. | Double |
Selection_distance | Distance, in GeologicMap XY units, within which point features are projected onto the cross section plane. | Double |
Add_LTYPE_and_PTTYPE | Feature classes CSxxMapUnitPolys, CSxxContactsAndFaults, and CSxxOrientationPoints will be created in the output feature dataset if they are not already present. Check this box to add LTYPE field to CSxxContactsAndFaults and PTTYPE field to CSxxOrientationPoints. | Boolean |
Force_exit | If checked, use sys.exit() to force an exit with error. Allows re-run of tool without re-entry of all parameters. Default is false (unchecked). | Boolean |
Scratch_workspace (Optional) | If blank, output feature dataset will be used as scratch workspace. Temporary feature classes have names that begin with 'xxx'. Existing feature classes with these names will be overwritten. | Workspace or Feature Dataset |
Save_intermediate_data | Default = NO (unchecked). This script creates temporary tables in the input geodatabase and temporary feature classes in the scratch workspace (default is the output feature dataset). If this box is unchecked, these tables and feature classes will not be deleted when the script finished. Check this box if you need these temporary data for troubleshooting. Note that using the default scratch workspace, saving intermediate data and then running the tool to create another feature dataset will not work. You must first delete the temporary data files within the first feature dataset. | Boolean |
A GeMS geodatabase has numerous implicit relationships. For example,
- table DescriptionOfMapUnits references table DataSources via the DefinitionSourceID
- table DescriptionOfMapUnits references table GeoMaterialDict via GeoMaterial
- feature class MapUnitPolys references table DescriptionOfMapUnits via MapUnit
- feature class MapUnitPolys references table DataSources via DataSourceID
- feature class MapUnitPolys references table Glossary via IdentifyConfidence
These relationships may be made explicit as joins or relates (which live in an .mxd document) or as relationship classes which live in the geodatabase. Relationship Classes creates a number (though in many cases not all useful or all possible) of relationship classes within a GeMS-style database.
Note that:
- Relationship classes are named by the referencing table and field, e.g., DMU_DefinitionSource, DMU_GeoMaterial, MUP_MapUnit, MUP_DataSource, MUP_IdentityConfidence
- Any existing relationship class with the same name is overwritten
- This script creates a RelationshipClasses feature dataset. If this feature dataset already exists it is overwritten. Newly-created relationship classes are NOT placed in this feature dataset, though you may do so afterwards
Parameter | Explanation | Data Type |
---|---|---|
GeMS_geodatabase | Name of geodatabase to which relationship classes will be added. Note that any existing relationship classes with the same names will be deleted. | Workspace |
Sets values of item PlotAtScale so that a definition query
[PlotAtScale] >=MapScale
limits displayed features to those that can be shown without crowding at the specified map scale. Note that MapScale is the DENOMINATOR of the scale ratio; that is, PlotAtScale of 24000 means feature can be plotted without crowding at map scales of 1:24,000 and larger.
Input feature class must have PlotAtScale field. If input feature class is named "OrientationPoints", point to be plotted of a too-close pair is biased towards (a) upright or overturned bedding (not bedding without facing direction), (b) bedding (not joint, foliation, lineation, ...), and (c) lower value of OrientationConfidenceDegrees. Large feature classes (>1,000) and large Maximum_value_of_PlotAtScale (>100,000) lead to long calculation times. A more efficient algorithm might be in order.
Parameter | Explanation | Data Type |
---|---|---|
Feature_class | Must have PlotAtScale field. | Feature Class |
Minimum_separation _(mm) | Set this on basis of symbol diameter, in mm on the page. For FGDC structure symbols, 8 works OK. | Double |
Maximum_value_of_PlotAtScale | Large values (e.g., >100,000) with large feature classes (e.g., > 1,000 features) can take a LONG time to calculate. | Double |
Set Symbol Values sets the Symbol attribute for some features in a GeMS-style geodatabase to match symbol IDs in the GSC implementation of the FGDC Digital Cartographic Standard forGeologic Map Symbolization (FGDC-STD-013-2006).
Values for line symbols are calculated on the basis of map scale and the GeMS attributes Type, IsConcealed, LocationConfidenceMeters, ExistenceConfidence,and IdentityConfidence.
Values for orientation point symbols are calculated from Type and OrientationConfidenceDegrees, for example,
for Type = bedding:
if OrientationConfidenceDegrees <= threshold:
symbol = 06.02
if OrientationConfidenceDegrees > threshold:
symbol = 06.33 #bedding symbol with open center
This script calculates Symbol for feature classes ContactsAndFaults, GeologicLines, and OrientationPoints.
Symbols are only calculated for recognized Type values. Recognized Type values are given in file Type-FgdcSymbol.txt, which is located in the Resources folder of GeMS Tools. This file may be edited to reflect your choices for Type values, add additional Type values, or change the correlation between *Type *and symbol ID.
Parameter | Explanation | Data Type |
---|---|---|
Feature_dataset | The feature dataset with classes xxxContactsAndFaults, xxxGeologicLines, and xxxOrientationPoints for which Symbol values are to be calculated. xxx may be null (the GeologicMap feature dataset), "CSA" for CrossSectionA, or similar. | Feature Dataset |
Map_scale_denominator | Denominator of the map scale fraction. For a 1:24,000-scale map, enter 24000. | Double |
Certain_to_approximate_threshold_mm_on_map | The value of LocationConfidenceMeters, converted to mm on the map, at which lines change from "certain" (continuous) to "approximate" (dashed). If map scale denominator is 24,000 and the certain to approximate threshold is 1.0, lines with LocationConfidenceMeters <= 24 will have continous-line symbols. If map scale denominator is 100,000 and the certain to approximate threshold is 2.5, lines with LocationConfidenceMeters <= 250 will have continuous-line symbols. | Double |
Use_inferred_short_dash_line_symbols | FGDC-STD-013-2006 defines "approximate" (long dash) and "inferred" (short dash) symbols and distinguishes inferred on the basis of how-located, not how-well-located. Yet by this how-located standard (not directly observed), most geologic lines are inferred, and this is not common usage. Check this box to use "inferred" symbols for lines with LocationConfidenceMeters greater than a threshold value. | Boolean |
Approximate_to_inferred_threshold_mm_on_map (Optional) | The value of LocationConfidenceMeters, converted to mm on the map, at which lines change from "approximate" (long dash) to "inferred" (short dash). If threshold is 4 and map scale denominator is 24,000, lines with LocationConfidenceMeters > 98 will be drawn with short dashes. | Double |
Use_approximate_strike-and-dip_symbols | Default is checked. Uncheck to avoid use of open-center symbols for approximately-oriented bedding. | Boolean |
OrientationConfidenceDegrees_threshold | FGDC-STD-013-2006 describes special symbols for some common orientation-data types to denote "when the measurement of strike and (or) dip value is approximate but the location of observation is accurate." For these types, if OrientationConfidenceDegrees > threshold value, the approximate symbol is assigned. | Double |
Set_polygon_symbols_and_labels | If checked, Symbol and Label values will be calculated for MapUnitPolys, using values from table DescriptionOfMapUnits. Default is checked. | Boolean |
If you check Set_polygon_symbols_and_labels there must be a DescriptionOfMapUnits table with rows for each map unit AND with values, for each map unit, of MapUnit, IdentityConfidence, Symbol, and Label.
Calculates values of AreaFillRGB in table DescriptionOfMapUnits of a GeMS-style geodatabase. Symbol values must be present and are assumed to reference the WPGCYMK color set, which is included in style file USGS Symbols2.style.
Parameter | Explanation | Data Type |
---|---|---|
Input_geodatabase | An existing geodatabase. May be a file (.gdb) or personal (.mdb) geodatabase. | Workspace |
- Calls module colortrans.py, which calls module wpgdict.py. Both are included in the *GeMS_Toolbox/Scripts *directory
Topology Check evaluates topological aspects of feature datasets in a GeMS-style geologic map geodatabase. It does not substantially alter the input geodatabase. Output is a new geodatabase *inGeodatabase_*errors.gdb and an HTML file inGeodatabase_topologyReport.html. For each feature dataset evaluated the output geodatabase will contain a feature dataset of the same name that contains feature classes which identify possible errors of various kinds. Default is that only the GeologicMap feature dataset is evaluated.
Note that not all possible topological errors will be identified by this script. Some “errors” flagged by this script may be acceptable. Use your judgment!
The Nodes and MapUnit adjacency modules use functions isFault and isContact. These functions, listed below, may need to be modified for some geodatabases.
def isFault(lType):
if lType.upper().find('FAULT') > -1:
return True
else:
return False
def isContact(lType):
lType = lType.upper()
if lType.find('CONTACT') > -1:
return True
elif lType.find('FAULT'):
return False
elif lType.find('SHORE') > -1 or lType.find('WATER') >-1:
return True
elif lType.find('MAP') > -1: # is map boundary?
return False
elif lType.find('GLACIER') > -1 or lType.find('SNOW') >-1 or lType.find('ICE') > -1:
return True
else:
return False
Parameter | Explanation | Data Type |
---|---|---|
Geodatabase | Input File Geodatabase. | Workspace |
Validate_topology_of_all_feature_datasets (Optional) | Default is unchecked (false). Check to evaluate topology of all feature datasets within input geodatabase. Evaluation of a feature dataset that lacks an identifiable ContactsAndFaults feature class will trigger an error. | Boolean |
Validate_topology_of_these_feature_datasets (Optional) | Select specific feature datasets to evaluate. Note that if no feature datasets are selected and Validate_topology_of_all_feature_datasets is unchecked the GeologicMap feature dataset will be evaluated by default. Evaluation of a feature dataset that lacks an identifiable ContactsAndFaults feature class will trigger an error. Checking "Validate_topology_of_all_data_sets" overrides any selection made here. | Multiple Value |
Line_and_polygon_topology | Default is checked (true). Check to create and validate an ArcGIS topology class for each feature dataset. See discussion below. Any identified errors are written to feature classes errors_xxxTopology_line, errors_xxxTopology_point, and errors_xxxTopology_poly. | Boolean |
Nodes | Geologic map-logic imposes another set of topological constraints on nodes (line junctions) in the ContactsAndFaults feature class. See discussion below. | Boolean |
Fault_directions | Fault lines that are contiguous generally should have the same direction. This, for example, allows thrust teeth to consistently be on same side. Fault lines are identified with the SQL query "TYPE" LIKE '%fault%' Nodes where fault direction changes are written to feature class errors_xxxFaultDirNodes. See discussion below. |
Boolean |
MapUnit_adjacency | Creates tables, in the output HTML file, of adjacent map units for three classes of lines. See discussion below. | Boolean |
Identify_duplicate_point_features | Scans each point feature class and identifies possible duplicate features. Any duplicates are listed in tables dups_XXXXXX. See discussion below. | Boolean |
Identify_short_lines_small_polys_and_sliver_polys | Scans ContactsAndFaults and MapUnitPolys feature classes to identify lines that are shorter than a threshold value, polygons that are smaller than a threshold value, and polygons that have area-circumference ratios (polygon width) that are smaller than a threshold value. Map scale denominator (below) must be set correctly! Identified features are written to feature classes errors_xxxShortArcs, errors_xxxSkinnyPolys, and errors_xxxSmallPolys. | Boolean |
Map_scale_denominator | Value used to identify too-short arcs, too-small polys, and too-skinny polys. Integer. | Long |
minimum_line_length__mm | Threshold value, in millimeters at map scale, used to identify too-short arcs. | Double |
minimum_poly_area__sq_mm | Threshold value, in square millimeters at map scale, used to identify too-small polygons. | Double |
minimum_poly_width__mm | Threshold value, in millimeters at map scale, used to identify sliver polygons. | Double |
force_exit_with_error | Default is unchecked (false). When checked, forces an error upon normal completion of script. Useful when debugging, as it returns focus to the script window while preserving all input values. | Boolean |
- Must Not Overlap (Line) xxxContactsAndFaults
- Must Not Self-Overlap (Line) xxxContactsAndFaults, xxxGeologicLines
- Must Not Self-Intersect (Line) xxxContactsAndFaults, xxxGeologicLines
- Must Be Single Part (Line*) xxxContactsAndFaults, xxxGeologicLInes*
- Must Not Overlap (Area) xxxMapUnitPolys
- Must Not Have Gaps (Area) xxxMapUnitPolys
- Boundary Must Be Covered By (Area-Line) xxxMapUnitPolys / xxxContactsAndFaults
Note the lack of a rule Must Not Have Dangles (Line). This is because dangling fault lines (and dangling concealed fault or contact lines) are common in geologic maps and are not errors. Dangling contacts (which are errors) are identified by the Nodes option.
- More than 4 lines never join at a node.
- Four lines join at a node only if two opposite lines are contacts (not concealed) and the other lines are both of the same Type and one or both is concealed. This explicitly disallows the possibility of faults that do not offset contacts.
- Where three lines join, either none are concealed or all are concealed. An exception is where two of the lines are the map boundary.
- There should be no pseudonodes (junctions between only two lines where both lines have same values for Type, LocationConfidenceMeters, ExistenceConfidence, IdentityConfidence, and DataSourceID). Pseudonodes where the lines are identical—i.e., the line joins itself (forming a closed loop) are OK.
- Nodes with one line are dangles. Dangles are permissible where the line is a fault or the line is concealed. Note that some fault dangles are errors. To evaluate conformance with these rules, the input ContactsAndFaults feature class is first planarized, that is, all lines are broken at intersections. Nodes that do not meet these constraints are written to output feature class errors_xxxBadNodes.
Note that this routine may not identify some direction errors where three fault arcs meet. Also, some changes of fault-arc direction are intentional, for example on a scissors fault where the DOWN side changes along fault strike.
ContactsAndFaults can be IDENTITYed with MapUnitPolys to ascertain the map units on either side. Results are displayed in tables arranged by LEFT MAP UNIT (Y axis) and RIGHT MAP UNIT (X axis). Values on each axis are sorted by DescriptionOfMapUnits.HierarchyKey. Arcs in ContactsAndFaults are divided into 3 groups and a table is written for each:
- Concealed contacts and faults (IsConcealed = 'Y')
- Contacts (not concealed) (isContact and IsConcealed = 'N')
- Faults (not concealed) (isFault and IsConcealed = 'N')
These tables can be useful for identifying mis-tagged polygons and lines. In the table for Concealed contacts and faults, all arcs should list along the diagonal (same unit on either side). Table Contacts (not concealed) should have no diagonal values (no internal contacts) and significant unconformities will correspond to rows & columns with large numbers of populated cells. Table Faults (not concealed) makes it easy to identify faults that cut young map units; in many cases these are likely to be digitizing errors.
Identifying information for concealed lines that separate polygons with different MapUnit values and non-concealed contacts that separate polygons with identical MapUnit values is written to HTML tables Bad concealed contacts and faults and Internal contacts.
Flags those features that:
- Have the same location, and
- Have the same Type values, and
- If attributes Azimuth and Inclination are present, have the same Azimuth and Inclination values
Note that entries in a geochem, geochron, or sample table that represent multiple samples taken at a single locale will (incorrectly) generate errors.
Uses the values for minimum line length, minimum polygon area, and minimum polygon width to identify arcs and polygons that are, perhaps, too small. Note that none of these are in any sense errors in the content of the geologic map.
These inventories may be useful to find digitizing errors and to help enforce cartographic standards regarding minimum map-unit polygon size.
Translate to Shapefiles converts a GeMS-style ArcGIS geodatabase to two shapefile packages:
- OPEN--Consists of shapefiles, additional .dbf files, and pipe-delimited text files. Field renaming is documented in output file logfile.txt. This package will be a complete transcription of the geodatabase without loss of any information.
- SIMPLE--Consists of shapefiles alone. Tables Glossary, DataSources, and DescriptionOfMapUnits are joined to selected feature classes within feature dataset GeologicMap, long fields are truncated, and these feature classes are written to shapefiles. Field renaming is documented in output file logfile.txt. This package is a partial (incomplete) transcription of the geodatabase, but will be easier to use than the OPEN package.
Output is written to directories DBName-simple and DBName-open, where DBName is the name of the input geodatabase, without gdb or mdb suffix. If these directories already exist, any files within them will be deleted.
Parameter | Explanation | Data Type |
---|---|---|
Input_geodatabase | An existing geodatabase. May be a file (.gdb) or personal (.mdb) geodatabase. | Workspace |
Output_workspace | Must be an existing folder. Output folders DBName-open and DBName-simple will be written here, as well as temporary geodatabase xxDBName. | Folder |
Validate Database audits a geodatabase for conformance with the GeMS schema and reports compliance as “may be LEVEL 1 COMPLIANT”, “is LEVEL 2 COMPLIANT”, or “is LEVEL 3 COMPLIANT”. It also optionally runs the USGS Geospatial Metadata Validation Service to check for formal errors in either the embedded geodatabase-level FGDC metadata or a stand-alone file. The tool goes through the compliance rules below one at a time logging each violation as an error. Beyond the basic requirements of the GeMS schema, it also makes other checks for data integrity and logs violations as warnings. Warnings do not necessarily need to be addressed but doing so enhances consistency and usability. The tool prepares a report in the form of two linked html files which show:
- Input parameters at run time
- Level of compliance
- Number of errors found for each rule with a link to the details of those errors listing one or more of the following:
- the table in which the error is found
- the field in which the error is found
- the ID value of the row in which the error is found
- the offending value
- Warnings
- Table and field extensions to GeMS
- A table showing in which feature datasets each of the map units in the DescriptionOfMapUnits table is found
- Contents of all non-spatial tables
- An inventory of the contents of the database with a count of the rows in each table
The names of the output html files consist of the name of the database (including the extension to differentiate gdbs from gpkgs) followed by -Validation.html
and -ValidationErrors.html
. The former is the principal summarizing report and contains links to the latter which lists error and warning details. You can rename the Validation
file, but do not move or rename the ValidationErrors
file or links inside the Validation
file will be broken.
If the metadata are validated, the auto-generated validation report listing errors and warnings is copied to the output folder. Again, to maintain a link in the Validation
file, do not move or rename this file.
If topology is to be checked, the tool first looks for an embedded topology class to validate. If that is absent, a new geodatabase named Topology.gdb
is created in the output directory and a feature dataset and topology class is created for each pair of MapUnit
and ContactsAndFaults
featureclasses and then validate. Topology pairs are identified by either being in the same feature dataset or by identical prefixes and/or suffixes added to the base GeMS name.
Compliance criteria
Level 1:
- No overlaps or internal gaps in map-unit polygon layer
- Contacts and faults in single feature class
- Map-unit polygon boundaries are covered by contacts and faults lines
Databases with a variety of schema may meet these criteria. Validate Database cannot confirm LEVEL 1 compliance.
Level 2:
- 2.1 Has required elements: nonspatial tables DataSources, DescriptionOfMapUnits, GeoMaterialDict; feature dataset GeologicMap with feature classes ContactsAndFaults and MapUnitPolys
- 2.2 Required fields within required elements are present and correctly defined
- 2.3 GeologicMap topology: no internal gaps or overlaps in MapUnitPolys, boundaries of MapUnitPolys are covered by ContactsAndFaults
- 2.4 All map units in MapUnitPolys have entries in DescriptionOfMapUnits table
- 2.5 No duplicate MapUnit values in DescriptionOfMapUnit table
- 2.6 Certain field values within required elements have entries in Glossary table
- 2.7 No duplicate terms in Glossary table
- 2.8 All SourceID values in required elements have entries in DataSources table
- 2.9 No duplicate DataSources_ID values in DataSources table
Level 3
- 3.1 Table and field definitions conform to GeMS schema
- 3.2 All map-like feature datasets obey topology rules. No MapUnitPolys gaps or overlaps. No ContactsAndFaults overlaps, self-overlaps, or self-intersections. MapUnitPoly boundaries covered by ContactsAndFaults
- 3.3 No missing required values
- 3.4 No missing terms in Glossary
- 3.5 No unnecessary terms in Glossary
- 3.6 No missing sources in DataSources
- 3.7 No unnecessary sources in DataSources
- 3.8 No map units without entries in DescriptionOfMapUnits
- 3.9 No unnecessary map units in DescriptionOfMapUnits
- 3.10 HierarchyKey values in DescriptionOfMapUnits are unique and well formed
- 3.11 All values of GeoMaterial are defined in GeoMaterialDict. GeoMaterialDict is as specified in the GeMS standard
- 3.12 No duplicate _ID values
- 3.13 No zero-length or whitespace-only strings
Note that using this script with a geodatabase with a schema that differs significantly from GeMS may not yield a useful report.
Parameter | Explanation | Data Type |
---|---|---|
Input geodatabase or geopackage (required) | Path to the database | Workspace |
Output_workspace (optional) | If not supplied, a folder called 'validate' will be created in the parent folder of the database | Folder |
Metadata file (optional) | Path to metadata file to validate | File |
Validate embedded metadata? (optional) | True or false whether the embedded metadata of the file geodatabase as a whole (feature datasets, feature classes, tables, etc. will be ignored) are to be validated. Use this option only if the metadata for the geodatabase as a whole contains complete metadata for all elements and an Extent element (bounding box coordinates). If the metadata pass validation, submit the exported XML with the geodatabase. If you edit the exported file to correct errors, import it back into the geodatabase to keep the stand-alone and the embedded metadata in sync. If left blank and metadata file is blank, no metadata will be validated. Applies only to geodatabases and is not visible on the parameter form when validating geopackages. | Boolean |
Report errors with _ID (optional) | True or false whether errors in the report should be identified by values in the TableName_ID field, not the OBJECTID field. Useful in the case of validating a copy of an enterprise database that has been exported (which overwrites OBJECTIDs) to a file geodatabase but errors will be corrected in the enterprise database. For those tables that do not have a TableName_ID field or where the value is null, the OBJECTID will be reported. | Boolean |
Skip topology? (optional) | True or false whether checking topology should be skipped. Saves time when the state of the topology is known and the tool is being run frequently to check other aspects of the database, but the database will FAIL compliance at levels 2 and 3. Note that if the geodatabase has an embedded topology that can be checked, selecting this option does not save much time. Leave unchecked for the final generation of validation report. | Boolean |
Refresh GeoMaterialDict (optional) | Databases built with earlier versions of the GeMS toolbox will generate numerous errors associated with GeoMaterialDict and GeoMaterial values. Check this box to replace the GeoMaterialDict table in the database with the current version. The GeoMaterials domain (available as a picklist while editing the DescriptionOfMapUnits table) is also replaced. This option permanently modifies the geodatabase. Be sure you have saved a backup | Boolean |
Delete unused Glossary and DataSources rows (optional) | True or false whether to automatically delete any rows in Glossary and DataSources that describe Terms and Sources that are unused elsewhere in the database. Useful in the case when project-wide tables of sources and definitions are copied into a new database and not all entries in those tables are relevant for a particular map area. Note that deletion of a Glossary row may render a DataSource row unneeded. Rerun the Validate Database script to discover such errors. This option permanently modifies the geodatabase. Be sure you have saved a backup. | Boolean |
Compact GDB? (optional) | True or false whether to compact the file geodatabase. Compacting a frequently edited geodatabase cleans up storage and unused space while also reducing it in size. Not applicable to geopackages. | Boolean |
Open report when done? (optional) | True or false whether to open the Validation report HTML file in a browser immediately upon finishing. | Boolean |