Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Functional updates to R package - improved user options & record nhdplusTools retrieval metadata #27

Closed
wants to merge 141 commits into from
Closed
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
141 commits
Select commit Hold shift + click to select a range
1c420db
feat: developing algorithm training and evaluation module
glitt13 Aug 22, 2024
7f3921e
fix: minor bug fixes with paths and np array values retrieval
glitt13 Aug 22, 2024
d9c8dfe
feat: create initial fs_algo package
glitt13 Aug 22, 2024
1ebcc23
feat: contain all training/eval into a single class
glitt13 Aug 23, 2024
1a74ca6
feat: simplify evaluation file write and module import
glitt13 Aug 23, 2024
a9019bd
feat: add basic unit testing for AlgoTrainEval class
glitt13 Aug 23, 2024
3a7025e
feat: convert save dir structure creation and fsds dataset reader int…
glitt13 Aug 23, 2024
5e922c4
Update README.md
glitt13 Aug 23, 2024
b6dffaf
feat: simplify aspects of attribute organization and combining with m…
glitt13 Aug 23, 2024
84d9de6
feat: beginning to convert attribute wrangling into a class
glitt13 Aug 23, 2024
f78cb2c
feat: add algorithm configuration file
glitt13 Aug 23, 2024
f1d15a8
feat: established class for attribute configuration file, scripts fun…
glitt13 Aug 23, 2024
b75e3d5
feat: add verbose option
glitt13 Aug 23, 2024
02325af
fix: update to warnings.warn()
glitt13 Aug 23, 2024
2c480d6
feat: building out additional unit tests for AttrConfigAndVars class
glitt13 Aug 23, 2024
2c2ae67
chore: remove spaces
glitt13 Aug 27, 2024
dd8f796
chore: remove spaces
glitt13 Aug 27, 2024
f96ca62
feat: add unit test for fs_read_attr_comid
glitt13 Aug 28, 2024
96debe1
feat: add UserWarnings and associated unit test
glitt13 Aug 28, 2024
c7f8e61
feat: add unit tests for _find_feat_srce_id, fs_retr_nhdp_comids and …
glitt13 Aug 29, 2024
54a3a35
feat: add unit test for fs_save_algo_dir_struct
glitt13 Aug 29, 2024
619e1a5
feat: a basic unit test for _open_response_data_fsds
glitt13 Aug 29, 2024
f28d90d
chore: simplify algo script based on functionality moved into fs_algo…
glitt13 Aug 29, 2024
48a1e6a
doc: add sphinx documentation to _read_attr_config and fs_read_attr_c…
glitt13 Aug 29, 2024
b9b034e
doc: add sphinx-formatted documentation to the functions in the fs_a…
glitt13 Aug 29, 2024
75cfb58
fix: changes vars to attrs in AlgoTrainEval arg
glitt13 Aug 29, 2024
901ff10
fix: added the new parameters that were hard-coded (test_size & seed)
glitt13 Aug 29, 2024
f4780dc
fix: swapped the train/test fractions to appropriate printout order
glitt13 Aug 29, 2024
fa703c4
feat: make sphinx documentation
glitt13 Aug 30, 2024
43d88c0
fix: reinstall sphinx docs for fsds_proc
glitt13 Aug 30, 2024
342838d
fix: remove unused path_camels
glitt13 Aug 30, 2024
283da30
fix: remove unused references to path_camels
glitt13 Aug 30, 2024
10cea4f
fix: update standard fsds_proc config files to create netcdf rather t…
glitt13 Aug 30, 2024
591cbed
doc: update config file documentation on preferred save_type
glitt13 Aug 30, 2024
6a4d645
doc: update description of yaml file's dataset
glitt13 Aug 30, 2024
8af7720
fix: update config files with featureID and featureSource entries
glitt13 Aug 30, 2024
d4c842b
fix: change vars to attrs based on package's object name change
glitt13 Aug 30, 2024
f179124
fix: change logic to ensure config file read if dataset attribute rea…
glitt13 Aug 30, 2024
7275e8a
feat: add a raw data input checker/corrector for cases when nwissite …
glitt13 Aug 31, 2024
1f0a8be
fix: changed path_data to represent the raw input files containing co…
glitt13 Aug 31, 2024
36414d5
fix: added appropriate fillna for nwissite gage ids not needed to be …
glitt13 Aug 31, 2024
728f1b4
fix: adjust path check for attributes instead of algo
glitt13 Aug 31, 2024
aef2f01
doc: add descriptive notes on algo pre-processing and suggest future …
glitt13 Aug 31, 2024
3b19a2a
doc: simplify attr_config, change dir_attrs to dir_db_attrs
glitt13 Aug 31, 2024
1dc0c6d
chore: add some additional hydroatlas and USGS NHD variables for cons…
glitt13 Aug 31, 2024
f6ccc91
chore: add updated attribute variables to config files, based on top …
glitt13 Aug 31, 2024
5137b99
fix: add error handling when hydrofabric could not be downloaded for …
glitt13 Sep 2, 2024
c91e25f
fix: avoid index error generated from attr_ddf_sub.shape[0].compute()…
glitt13 Sep 2, 2024
7d8b0f7
fix: change fs_read_attr_comid to return pd.DataFrame instead of dask…
glitt13 Sep 2, 2024
4743a47
feat: add NA drop prior to train/test split
glitt13 Sep 2, 2024
15be337
feat: create a separate function that standarizes the algorithm file …
glitt13 Sep 2, 2024
b4484aa
doc: add documentation to the std_algo_path func
glitt13 Sep 2, 2024
49b878e
Merge pull request #1 from glitt13/train_algo
glitt13 Sep 3, 2024
22835f5
feat: create script to generate algo prediction data for testing
glitt13 Sep 3, 2024
346f668
feat: generating predictions from trained algos under dev
glitt13 Sep 3, 2024
b6723fd
feat: add processing of xssa locations, randomly selecting a subset t…
glitt13 Sep 3, 2024
66f51dc
feat: develop algo prediction's config ingest, and determine paths to…
glitt13 Sep 4, 2024
becfc45
feat: add config file path builder
glitt13 Sep 4, 2024
10d2422
fix: resolve merge conflict
glitt13 Sep 4, 2024
8fe0a7c
feat: create metric prediction and write results to file
glitt13 Sep 4, 2024
0fffc45
feat: build unit test for build_cfig_path()
glitt13 Sep 5, 2024
62ff9aa
feat: build unit test for build_cfig_path()
glitt13 Sep 5, 2024
42edc50
feat: add unit testsfor std_pred_path and _read_pred_comid; test cove…
glitt13 Sep 5, 2024
f07b9c9
feat: add oob = True as default for RandomForestRegressor
glitt13 Sep 5, 2024
bae5175
feat: add hyperparameterization capability using grid search and asso…
glitt13 Sep 6, 2024
90c1443
feat: add unit testing for train_eval()
glitt13 Sep 6, 2024
5adb43b
chore: change algo config for testing out hyperparameterization
glitt13 Sep 6, 2024
b0c3ef2
chore: add UserWarning category specification to warnings.warn
glitt13 Sep 6, 2024
3abfb08
fix: algo config assignment accidentally only looked at first line of…
glitt13 Sep 6, 2024
c7de9ae
fix: make sure that hyperparameter key:value pairings contained insid…
glitt13 Sep 6, 2024
3e60519
fix: adjust unit test's algo_config formats to represent the issue of…
glitt13 Sep 6, 2024
b4034e8
fix: _check_attributes_exist now appropriately reports missing attrib…
glitt13 Sep 6, 2024
7e982d2
fix: ensure algo and pipeline keys contain algo and pipeline object t…
glitt13 Sep 6, 2024
5d368c0
Update pkg/fs_algo/fs_algo/fs_algo_train_eval.py
glitt13 Sep 16, 2024
e34640a
Update pkg/fs_algo/fs_algo/fs_algo_train_eval.py
glitt13 Sep 16, 2024
f70f4d3
feat: merge the algo prediction features from the pred_alg branch
glitt13 Sep 16, 2024
bd42892
chore: Update README.md
glitt13 Sep 16, 2024
c7dddde
merge upstream/main
glitt13 Sep 30, 2024
0080556
fix: remove network hardcoding for lyrs in proc_attr_wrap call
glitt13 Oct 2, 2024
1d5b4e2
fix: rename ext to fileext since ext is a pre-defined object
glitt13 Oct 4, 2024
88cd39a
fix: change unit test use of ext to fileext
glitt13 Oct 4, 2024
292e949
feat: experimenting with attribute grabbing
glitt13 Oct 4, 2024
3717fbc
doc: revise function documentation for clarity
glitt13 Oct 13, 2024
b607501
chore: rename fsds to fs in all python-related files and config files
glitt13 Oct 13, 2024
4df6fed
chore: rename fsds_proc directory to fs_proc
glitt13 Oct 13, 2024
202e338
chore: rename additional fsds to fs
glitt13 Oct 13, 2024
f3c9e9c
chore: rename remaining fsds to fs
glitt13 Oct 13, 2024
3410c21
doc: minor change to install instructions of fs_proc
glitt13 Oct 13, 2024
41717b9
feat: add requirements for fs_algo package
glitt13 Oct 13, 2024
8bc5bf1
feat: add requirements.yml for conda environment of fs_algo/fs_proc p…
glitt13 Oct 13, 2024
05b23f6
Merge pull request #2 from glitt13/fs_id
glitt13 Oct 13, 2024
7b29342
doc: add details on func for creating col_schema_df
glitt13 Oct 14, 2024
20b5a6c
feat: add nwissite gage id leading zero checker as automated step
glitt13 Oct 14, 2024
4dccd31
fix: new line continuation in f-string messages related to nwis checker
glitt13 Oct 14, 2024
e80175a
fix: update local config path and example in script
glitt13 Oct 14, 2024
2e64ae2
doc: change install description for this package
glitt13 Oct 14, 2024
9b048e6
fix: modify logical test on elif featureSource == nwissite
glitt13 Oct 14, 2024
3dbb9b9
feat: update and add new unit testing that accommodates the check_fix…
glitt13 Oct 14, 2024
a1e6acd
Merge pull request #3 from glitt13/fs_id
glitt13 Oct 14, 2024
dfd661b
fix: update temp directory assignment to work with non-Unix systems
glitt13 Oct 14, 2024
70792db
doc: minor adjustment for instructional example on running unit tests
glitt13 Oct 14, 2024
6cdfa4d
Merge pull request #4 from glitt13/fs_id
glitt13 Oct 14, 2024
e757599
Make the change match the exact repo name
bolotinl Oct 16, 2024
397fbcc
Make changes match exact repo name
bolotinl Oct 16, 2024
c46ff89
doc: minor changes that will be removed: comid loc lookup
glitt13 Oct 16, 2024
1f9718c
merge: upstream/main merge
glitt13 Oct 16, 2024
71c0b14
fix: rename fsds to fs in files corresponding to proc.attr.hydfab R p…
glitt13 Oct 16, 2024
d916a03
feat: update R package with name change of fsds to fs
glitt13 Oct 16, 2024
8943877
chore: update fsds to fs in config files and R unit tests
glitt13 Oct 16, 2024
3522370
doc: update README from fsds to fs in non-url instances
glitt13 Oct 16, 2024
6427c1a
doc: Update README.md
glitt13 Oct 16, 2024
3a07b0b
Update README.md
glitt13 Oct 16, 2024
702654e
merge upstream/main
glitt13 Oct 16, 2024
9afb17a
merge: upstream/main
glitt13 Oct 16, 2024
217b53e
merge origin/main
glitt13 Oct 16, 2024
ecd1a7e
merge: fsds to fs changes
glitt13 Oct 16, 2024
61368a2
chore: rename fsds_attrs_grab.R to fs_attrs_grab.R and add updated Rd…
glitt13 Oct 16, 2024
0eb76ee
merge: sync up with fsds to fs changes
glitt13 Oct 16, 2024
97902d6
doc: update arg name change of ext to fileext
glitt13 Oct 17, 2024
07d00ca
doc: remove commented out code and create delineations on code sections
glitt13 Oct 17, 2024
823e15b
doc: make proc_attr_hydatl local_path error more explicit on what fea…
glitt13 Oct 17, 2024
6680621
refactor: make usgs nhdplus s3 query more efficient by decreasing dat…
glitt13 Oct 17, 2024
c4ab9e6
refactor: make usgs nhdplus s3 query more efficient by decreasing dat…
glitt13 Oct 17, 2024
d259277
merge upstream/main
glitt13 Oct 17, 2024
6f32ac2
refactor: change sublist items to appear as items rather than values …
glitt13 Oct 22, 2024
4073904
feat: add handling of optional arguments for hydrofabric using hfab_c…
glitt13 Oct 22, 2024
3dc2a2f
fix: suppress warnings generated by arrow::open_dataset when grabbing…
glitt13 Oct 22, 2024
1f73232
feat: update config file, and unit testing config files/unit test wit…
glitt13 Oct 22, 2024
850c975
fix: address indexing error with querying NULL values identified by h…
glitt13 Oct 22, 2024
852f5b2
feat: add standard file write of the nldi features retrieved by nhdpl…
glitt13 Oct 22, 2024
27cfa94
fix: resolve merge conflicts
glitt13 Oct 22, 2024
7fa16b7
chore: remove erroneous HEAD from a merge conflict
glitt13 Oct 22, 2024
0fa4d55
Merge branch 'main' into hfab
bolotinl Oct 23, 2024
0be6ab2
doc: correct mis-spellings
glitt13 Oct 24, 2024
5c643c7
fix: resolve merge conflicts from hfab branch
glitt13 Oct 24, 2024
5c373c3
fix merge conflicts
glitt13 Oct 24, 2024
36685d6
feat: create function for writing nldi metadata, e.g. ids, coords to …
glitt13 Oct 25, 2024
3bc9791
feat: add gage_id column to standard ouptput from proc_attr_gageids
glitt13 Oct 25, 2024
ac284e4
fix: namespace dplyr::select
glitt13 Oct 25, 2024
d91bed6
fix: modify save location of nldi file; add namespace to dplyr::selec…
glitt13 Oct 25, 2024
afe2a7d
refactor: change how attribute metadata is written by having a user d…
glitt13 Oct 30, 2024
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
2 changes: 1 addition & 1 deletion pkg/proc.attr.hydfab/DESCRIPTION
Original file line number Diff line number Diff line change
@@ -1,6 +1,6 @@
Package: proc.attr.hydfab
Title: Grab and process catchment attributes using the hydrofabric
Version: 0.0.1.0010
Version: 0.0.1.0013
Authors@R:
c(person("Guy", "Litt", , "[email protected]", role = c("aut", "cre"),
comment = c(ORCID = "https://orcid.org/0000-0003-1996-7468")),
Expand Down
2 changes: 2 additions & 0 deletions pkg/proc.attr.hydfab/NAMESPACE
Original file line number Diff line number Diff line change
Expand Up @@ -2,6 +2,7 @@

export(check_attr_selection)
export(grab_attrs_datasets_fs_wrap)
export(hfab_config_opt)
export(proc_attr_exst_wrap)
export(proc_attr_gageids)
export(proc_attr_hf)
Expand All @@ -11,3 +12,4 @@ export(proc_attr_usgs_nhd)
export(proc_attr_wrap)
export(read_loc_data)
export(retrieve_attr_exst)
export(write_meta_nldi_feat)
236 changes: 208 additions & 28 deletions pkg/proc.attr.hydfab/R/proc_attr_grabber.R

Large diffs are not rendered by default.

2 changes: 1 addition & 1 deletion pkg/proc.attr.hydfab/flow/flow.install.proc.attr.hydfab.R
Original file line number Diff line number Diff line change
Expand Up @@ -24,7 +24,7 @@ if ('bolotin' %in% Sys.getenv("HOME")) {
fs_dir <- file.path(Sys.getenv("HOME"),"git","formulation-selector")
}
# Run unit tests?
RunTest <- FALSE#TRUE Default FALSE prevents s3 data downloading in unit testing (FALSE=fast)
RunTest <- FALSE #TRUE Default FALSE prevents s3 data downloading in unit testing (FALSE=fast)
ShowTestCovr <- FALSE # Only possible if RunTest==TRUE. Even slower though.
# ---------------------------------------------------------------------------- #
# Enter in all R packages here
Expand Down
15 changes: 12 additions & 3 deletions pkg/proc.attr.hydfab/flow/fs_attrs_grab.R
Original file line number Diff line number Diff line change
Expand Up @@ -47,6 +47,13 @@ dir_base <- glue::glue(base::unlist(raw_config$file_io)[['dir_base']])#file.path
dir_std_base <- glue::glue(base::unlist(raw_config$file_io)[['dir_std_base']]) #file.path(dir_base,"input","user_data_std") # The location of standardized data generated by fs_proc python package
dir_db_hydfab <- glue::glue(base::unlist(raw_config$file_io)[['dir_db_hydfab']]) # file.path(dir_base,'input','hydrofabric') # The local dir where hydrofabric data are stored to limit s3 connections
dir_db_attrs <- glue::glue(base::unlist(raw_config$file_io)[['dir_db_attrs']]) # file.path(dir_base,'input','attributes') # The parent dir where each comid's attribute parquet file is stored in the subdirectory 'comid/', and each dataset's aggregated parquet attributes are stored in the subdirectory '/{dataset_name}
ds_type <- try(base::unlist(raw_config$file_io)[['ds_type']])
if('try-error' %in% base::class(ds_type)){
ds_type <- ''
}
write_type <- glue::glue(base::unlist(raw_config$file_io)[['write_type']])# file format for writing writing NLDI feature metadata. Default 'parquet'. May also select 'csv'.
path_meta <- base::unlist(raw_config$file_io)[['path_meta']] # Full file path for writing NLDI feature metadata of training data formatted for glue::glue(). Default: "{dir_std_base}/{ds}/nldi_feat_{ds}_{ds_type}.{write_type}"


# Read s3 connection details
s3_base <- base::unlist(raw_config$hydfab_config)[['s3_base']]#s3://lynker-spatial/tabular-resources" # s3 path containing hydrofabric-formatted attribute datasets
Expand Down Expand Up @@ -92,12 +99,14 @@ Retr_Params <- base::list(paths = base::list(
dir_db_hydfab=dir_db_hydfab,
dir_db_attrs=dir_db_attrs,
s3_path_hydatl = s3_path_hydatl,
dir_std_base = dir_std_base),
dir_std_base = dir_std_base,
path_meta = path_meta),
vars = sub_attr_sel,
datasets = datasets
datasets = datasets,
ds_type = ds_type,
write_type = write_type
)
# PROCESS ATTRIBUTES

ls_comids <- proc.attr.hydfab:::grab_attrs_datasets_fs_wrap(Retr_Params,overwrite = TRUE)

# --------------------------- Compile attributes --------------------------- #
Expand Down
11 changes: 11 additions & 0 deletions pkg/proc.attr.hydfab/inst/extdata/attr_source_types.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,11 @@
# The name is the expected format for variables specified in proc.attr.hydfab
hydroatlas_attributes:
- 'name': 'ha_vars'
- 'internal_dataset_name' : 'hydroatlas__v1'
camels_attributes:
- 'name': 'camels_vars'
usgs_attributes:
- 'name': 'usgs_vars'
- 'internal_dataset_name' : 'usgs_nhdplus__v2'
hydrofabric_attributes:
- 'name': 'hfab_vars'
Binary file not shown.
Binary file not shown.
Binary file not shown.
Original file line number Diff line number Diff line change
Expand Up @@ -19,16 +19,25 @@ file_io: # May define {home_dir} for python's '{home_dir}/string_path'.format(ho
- 'dir_std_base' : '{dir_base}/user_data_std' # Required. The location of standardized data generated by fs_proc python package
- 'dir_db_hydfab' : '{dir_base}/hydrofabric' # Required. The local dir where hydrofabric data are stored (limits the total s3 connections)
- 'dir_db_attrs' : '{dir_base}/attributes' # Required. The parent dir where each comid's attribute parquet file is stored in the subdirectory 'comid/', and each dataset's aggregated parquet attributes are stored in the subdirectory '/{dataset_name}
- 'ds_type': 'training' # Required string. Recommended to select 'training' or 'prediction', but any string will work. This string will be used in the filename of the output metadata describing each data point's identifer, COMID, lat/lon, reach name of the location. This string should differ from the string used in the prediction config yaml file. Filename: `"nldi_feat_{dataset}_{ds_type}.csv"` inside `dir_std_base / dataset / `
formulation_metadata:
- 'datasets': # Required. Must match directory name inside dir_std_base. May be a list of items, or simply sublist 'all' to select everything inside dir_std_base for attribute grabbing.
- 'juliemai-xSSA' # Required. In this example case, it's a sublist of just one thing.
- 'formulation_base': 'Raven_blended' # Informational. Unique name of formulation.
hydfab_config: # Required section describing hydrofabric connection details and objects of interest
- 's3_base' : "s3://lynker-spatial/tabular-resources" # Required. s3 path containing hydrofabric-formatted attribute datasets
- 's3_bucket' : 'lynker-spatial' # Required. s3 bucket containing hydrofabric data
- 'ext' : 'gpkg' # Required. file extension of the hydrofrabric data. Default 'gpkg'.
- 'hf_cat_sel': "total" # Required. Options include 'total' or 'all'; total: interested in the single location's aggregated catchment data; all: all subcatchments of interest
attr_select: # The names of variable sublistings are standardized, e.g. ha_vars, usgs_vars, sc_vars
- s3_base: "s3://lynker-spatial/tabular-resources" # Required. s3 path containing hydrofabric-formatted attribute datasets
- s3_bucket: 'lynker-spatial' # Required. s3 bucket containing hydrofabric data
- hf_cat_sel: "total" # Required. Options include 'total' or 'all'; total: interested in the single location's aggregated catchment data; all: all subcatchments of interest
- gpkg: # Optional. A local gpkg file. Default 'NULL'. See hfsubsetR::get_subset()
- hfab_retr: FALSE # Optional, Boolean. Defaults to the hfab_retr argument default in the proc_attr_wrap() function (TRUE). Should the hydrofabric data be downloaded? Hydrofabric data download may not be necessary. Processing is faster if set to FALSE
- hf_version: "2.1.1" # Optional, character string. Defaults to the hf_version argument default in hfsubsetR::get_subset() function. The hydrofabric version.
- domain: "conus" # Optional, character string. Defaults to the hf_version argument default in hfsubsetR::get_subset() function. The hydrofabric domain.
- type: "nextgen" # Optional, character string. Defaults to the hf_version argument default in hfsubsetR::get_subset() function. The hydrofabric type.
- lyrs: # Optional, sublist of character strings. Defaults to the hf_version argument default in hfsubsetR::get_subset() function. Hydrofabric layers to extract.
- 'divides'
- 'network'
- source: "s3://lynker-spatial/hydrofabric"
attr_select: # The names of variable sublistings are standardized with _vars, e.g. ha_vars, usgs_vars, sc_vars
- 's3_path_hydatl': '{s3_base}/hydroATLAS/hydroatlas_vars.parquet' # path to hydroatlas data formatted for hydrofabric. Required only if hydroatlas variables desired.
- 'ha_vars': # hydroatlas variables. Must specify s3_path_hydatl if desired.
- 'pet_mm_s01'
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -19,17 +19,26 @@ file_io: # May define {home_dir} for python's '{home_dir}/string_path'.format(ho
- 'dir_std_base' : '{dir_base}/user_data_std' # Required. The location of standardized data generated by fs_proc python package
- 'dir_db_hydfab' : '{dir_base}/hydrofabric' # Required. The local dir where hydrofabric data are stored (limits the total s3 connections)
- 'dir_db_attrs' : '{dir_base}/attributes' # Required. The parent dir where each comid's attribute parquet file is stored in the subdirectory 'comid/', and each dataset's aggregated parquet attributes are stored in the subdirectory '/{dataset_name}
- 'ds_type': 'training' # Required string. Recommended to select 'training' or 'prediction', but any string will work. This string will be used in the filename of the output metadata describing each data point's identifer, COMID, lat/lon, reach name of the location. This string should differ from the string used in the prediction config yaml file. Filename: `"nldi_feat_{dataset}_{ds_type}.csv"` inside `dir_std_base / dataset / `
formulation_metadata:
- 'datasets': # Required. Must match directory name inside dir_std_base. May be a list of items, or simply sublist 'all' to select everything inside dir_std_base for attribute grabbing.
- 'juliemai-xSSA' # Required. In this example case, it's a sublist of just one thing.
- 'formulation_base': 'Raven_blended' # Informational. Unique name of formulation.
hydfab_config: # Required section describing hydrofabric connection details and objects of interest
- 's3_base' : "s3://lynker-spatial/tabular-resources" # Required. s3 path containing hydrofabric-formatted attribute datasets
- 's3_bucket' : 'lynker-spatial' # Required. s3 bucket containing hydrofabric data
- 'ext' : 'gpkg' # Required. file extension of the hydrofrabric data. Default 'gpkg'.
- 'hf_cat_sel': "total" # Required. Options include 'total' or 'all'; total: interested in the single location's aggregated catchment data; all: all subcatchments of interest
attr_select: # The names of variable sublistings are standardized, e.g. ha_vars, usgs_vars, sc_vars
- 's3_path_hydatl' : '{s3_base}/hydroATLAS/hydroatlas_vars.parquet' # path to hydroatlas data formatted for hydrofabric. Required only if hydroatlas variables desired.
- s3_base: "s3://lynker-spatial/tabular-resources" # Required. s3 path containing hydrofabric-formatted attribute datasets
- s3_bucket: 'lynker-spatial' # Required. s3 bucket containing hydrofabric data
- hf_cat_sel: "total" # Required. Options include 'total' or 'all'; total: interested in the single location's aggregated catchment data; all: all subcatchments of interest
- gpkg: # Optional. A local gpkg file. Default 'NULL'. See hfsubsetR::get_subset()
- hfab_retr: FALSE # Optional, Boolean. Defaults to the hfab_retr argument default in the proc_attr_wrap() function (TRUE). Should the hydrofabric data be downloaded? Hydrofabric data download may not be necessary. Processing is faster if set to FALSE
- hf_version: "2.1.1" # Optional, character string. Defaults to the hf_version argument default in hfsubsetR::get_subset() function. The hydrofabric version.
- domain: "conus" # Optional, character string. Defaults to the hf_version argument default in hfsubsetR::get_subset() function. The hydrofabric domain.
- type: "nextgen" # Optional, character string. Defaults to the hf_version argument default in hfsubsetR::get_subset() function. The hydrofabric type.
- lyrs: # Optional, sublist of character strings. Defaults to the hf_version argument default in hfsubsetR::get_subset() function. Hydrofabric layers to extract.
- 'divides'
- 'network'
- source: "s3://lynker-spatial/hydrofabric"
attr_select: # The names of variable sublistings are standardized with _vars, e.g. ha_vars, usgs_vars, sc_vars
- 's3_path_hydatl': '{s3_base}/hydroATLAS/hydroatlas_vars.parquet' # path to hydroatlas data formatted for hydrofabric. Required only if hydroatlas variables desired.
- 'ha_vars': # hydroatlas variables. Must specify s3_path_hydatl if desired.
- 'pet_mm_s01'
- 'cly_pc_sav'
Expand Down
3 changes: 3 additions & 0 deletions pkg/proc.attr.hydfab/man/grab_attrs_datasets_fs_wrap.Rd

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

26 changes: 26 additions & 0 deletions pkg/proc.attr.hydfab/man/hfab_config_opt.Rd

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

3 changes: 2 additions & 1 deletion pkg/proc.attr.hydfab/man/proc_attr_gageids.Rd

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

13 changes: 11 additions & 2 deletions pkg/proc.attr.hydfab/man/proc_attr_hf.Rd

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

3 changes: 2 additions & 1 deletion pkg/proc.attr.hydfab/man/proc_attr_usgs_nhd.Rd

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

10 changes: 9 additions & 1 deletion pkg/proc.attr.hydfab/man/proc_attr_wrap.Rd

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

21 changes: 21 additions & 0 deletions pkg/proc.attr.hydfab/man/write_meta_nldi_feat.Rd

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

Loading