Forcing Processor

Forcingprocessor converts National Water Model (NWM) forcing data into Next Generation National Water Model (NextGen) forcing data. This tool provides the forcing pre-processing for the NextGen Research DataStream.

The motivation for this tool is NWM data is gridded and stored within netCDFs for each forecast hour. Ngen inputs this same forcing data, but in the format of catchment averaged data time series data.

Install

From root

pip install -e .

Run the forcingprocessor

python src/forcingprocessor/processor.py ./configs/conf.json

Prior to executing the processor, the user will need to obtain a geopackage file to define the spatial domain. The user will define the time domain by generating the forcing filenames for processor.py via nwm_filenames_generator.py, which is explained here. Note that forcingprocessor will calcuate weights if not found within the geopackage file.

Example `conf.json`

{
    "forcing"  : {
        "nwm_file"     : "",
        "gpkg_file"    : ""
    },

    "storage":{
        "output_path"      : "",
        "output_file_type" : []
    },    

    "run" : {
        "verbose"       : true,
        "collect_stats" : true,
        "nprocs"        : 2
    },

    "plot":{
        "nts"        : 24,
        "ngen_vars"  : [
            "DLWRF_surface",
            "APCP_surface",
            "precip_rate",
            "TMP_2maboveground"
        ] 
    }
}

`conf.json` Options

1. Forcing

Field	Description	Required
nwm_file	Path to a text file containing nwm file names. One filename per line. Tool to create this file	✅
gpkg_file	Geopackage file to define spatial domain. Use hfsubset to generate a geopackage with a `forcing-weights` layer. Accepts local absolute path, s3 URI or URL. Also acceptable is a weights parquet generated with weights_hf2ds.py, though the plotting option will no longer be available.	✅

2. Storage

Field	Description	Required
storage_type	Type of storage (local or s3 URI)	✅
output_path	Path to write data to. Accepts local path or s3 URI	✅
output_file_type	List of output file types, e.g. ["tar","parquet","csv","netcdf"]	✅

3. Run

Field	Description	Required
verbose	Get print statements, defaults to false	✅
collect_stats	Collect forcing metadata, defaults to true	✅
nprocs	Number of data processing processes, defaults to 50% available cores

4. Plot

Use this field to create a side-by-side gif of the nwm and ngen forcings

Field	Description	Required
nts	Number of timesteps to include in the gif, default is 10
ngen_vars	Which ngen forcings variables to create gifs of, default is all of them

ngen_variables = [
    "UGRD_10maboveground",
    "VGRD_10maboveground",
    "DLWRF_surface",
    "APCP_surface",
    "precip_rate", 
    "TMP_2maboveground",        
    "SPFH_2maboveground",
    "PRES_surface",
    "DSWRF_surface",
]

nwm_file

A text file given to forcingprocessor that contains each nwm forcing file name. These can be URLs or local paths. This file can be generated with the nwmurl tool and a generator script has been provided within this repo. The config argument accepts an s3 URL.

python nwm_filenames_generator.py conf_nwm_files.json

An example configuration file:

{
   "forcing_type" : "operational_archive",
   "start_date"   : "202310300000",
   "end_date"     : "202310300000",
   "runinput"     : 1,
   "varinput"     : 5,
   "geoinput"     : 1,
   "meminput"     : 0,
   "urlbaseinput" : 7,
   "fcst_cycle"   : [0],
   "lead_time"    : [1]
}

Weights

To calculate NextGen forcings, "weights" must be calculated to extract polygon averaged data from gridded data. The weights are made up of two parts, the cell_id and coverage. These are calculated via exactextract within weights_hf2ds.py, which is optionally called from forcingprocessor.

If a geopackage is supplied to forcingprocessor, it will be searched for the layer forcings-weights. If this layer is found, these weights are used during processing. If not, forcingprocessor will call weights_hf2ds.py to calculate the weights (cell_id and coverage) for every divide-id in the geopackage. This can take time, so forcingprocessor will write a parquet of weights out in the metadata, that can be reused in future forcingprocessor executions.

Example of direct call

python3 forcingprocessor/src/forcingprocessor/weights_hf2ds.py \
--outname ./weights.parquet \
--input_file ./nextgen_VPU_03W.gpkg

Name		Name	Last commit message	Last commit date
Latest commit History 411 Commits
.github		.github
configs		configs
docker		docker
docs		docs
src/forcingprocessor		src/forcingprocessor
tests		tests
CREDITS.md		CREDITS.md
LICENSE		LICENSE
LICENSE.md		LICENSE.md
ODbl.md		ODbl.md
README.md		README.md
STATUS.md		STATUS.md
pyproject.toml		pyproject.toml
setup.cfg		setup.cfg
versions.yml		versions.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Licenses found

Uh oh!

Repository files navigation

Forcing Processor

Install

Run the forcingprocessor

Example `conf.json`

`conf.json` Options

1. Forcing

2. Storage

3. Run

4. Plot

nwm_file

Weights

About

Licenses found

Uh oh!

Releases 3

Packages

Uh oh!

Contributors 6

Uh oh!

Languages

License

Licenses found

CIROH-UA/forcingprocessor

Folders and files

Latest commit

History

Repository files navigation

Forcing Processor

Install

Run the forcingprocessor

Example conf.json

conf.json Options

1. Forcing

2. Storage

3. Run

4. Plot

nwm_file

Weights

About

Topics

Resources

License

Licenses found

Uh oh!

Stars

Watchers

Forks

Releases 3

Packages 0

Uh oh!

Contributors 6

Uh oh!

Languages

Example `conf.json`

`conf.json` Options

Packages