Add precipitation histogram to prognostic run report #1271
Conversation
@@ -229,16 +227,6 @@ def _assign_diagnostic_time_attrs(
    return diagnostics_ds


def dump_nc(ds: xr.Dataset, f):
We switched to using the `vcm.dump_nc` version of this a while ago, so this func is unused.
logger.info("Computing histograms for physics diagnostics")
counts = xr.Dataset()
for varname in prognostic.data_vars:
    count, bins = np.histogram(
I found using `np.histogram` much faster (and simpler code-wise) than doing an xarray groupby.
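The reviewer's point can be sketched as follows. This is a minimal illustration, not the PR's actual code: the helper name, the plain-dict container, and the bin count are assumptions (the real code operates on an `xr.Dataset`), but the core idea of one `np.histogram` call per variable is the same.

```python
# Minimal sketch (assumed names): binned counts per variable via
# np.histogram, instead of an xarray groupby. The real PR uses an
# xr.Dataset; plain dicts are used here to keep the example small.
import numpy as np


def compute_histograms(variables: dict, bins: int = 50) -> dict:
    """Return {varname: (counts, bin_edges)} for each variable."""
    histograms = {}
    for varname, values in variables.items():
        # Flatten so multi-dimensional fields are binned over all points.
        count, bin_edges = np.histogram(np.asarray(values).ravel(), bins=bins)
        histograms[varname] = (count, bin_edges)
    return histograms
```

For a field with N points, the counts always sum to N, which makes the result easy to sanity-check.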
-def plot_1d(
-    run_diags: RunDiagnostics, varfilter: str, run_attr_name: str = "run",
-) -> HVPlot:
+def plot_1d(run_diags: RunDiagnostics, varfilter: str) -> HVPlot:
Deleted unused argument `run_attr_name`.
Looks great. Apart from some minor comments below, I had some thoughts:

- I was pretty confused about why the metric functions returned `xr.Dataset` rather than floats, before realizing that the `to_dict` function looks at the units and values. It would be clearer if the metrics functions returned dataclasses with `.value` and `.units` attrs, so that we are passing the minimal amount of data around.
- It would be great to enhance the visibility of the scalar metrics, so that they are amongst the first things we see, e.g. by showing a table instead of the histograms. The histograms are kind of useless and ugly IMO, especially for a single-run report.
Resolved review threads:
- workflows/prognostic_run_diags/fv3net/diagnostics/prognostic_run/compute.py (resolved)
- workflows/prognostic_run_diags/fv3net/diagnostics/prognostic_run/metrics.py (outdated, resolved)
- workflows/prognostic_run_diags/fv3net/diagnostics/prognostic_run/metrics.py (resolved)
yeah I agree the
ouch, poor histogram ;) It would be helpful to have the verification data on there, I agree. The example is particularly noisy since it's a short run. But yeah, a table showing the percentiles for each run would be quite useful.
sorry, I didn't mean your histograms, I meant the bar charts with the metrics on the main page. The histograms are great!
haha sounds good. I agree the bar charts are hard to parse. I'll work on adding a metrics table with the things we actually care about.
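A minimal sketch of the metrics-table idea discussed above, assuming a plain dict of per-run metrics. The function name, input shape, and HTML layout are all illustrative assumptions, not the report's actual implementation:

```python
# Hedged sketch: render scalar metrics for each run as a plain HTML
# table, so the key numbers are readable at a glance instead of being
# encoded in bar charts.
def metrics_table(metrics: dict) -> str:
    """metrics maps run name -> {metric name: (value, units)}."""
    header = "<tr><th>run</th><th>metric</th><th>value</th><th>units</th></tr>"
    rows = [
        f"<tr><td>{run}</td><td>{name}</td><td>{value}</td><td>{units}</td></tr>"
        for run, per_run in metrics.items()
        for name, (value, units) in per_run.items()
    ]
    return "<table>" + header + "".join(rows) + "</table>"
```

A table like this scales naturally to multi-run reports: one row per (run, metric) pair.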
Useful to show a histogram of precipitation on prognostic run reports. Example report HERE.

Significant internal changes:
- Added a process_diagnostics.html page to the prog run report. Moved the diurnal cycle there, and added a plot of the precipitation histogram.
- Sped up the report test by only creating a report for a single run (since report generation no longer requires >1 run).