Vapor Saturation (QSat) table computation Fortran vs C f32 issues #88

FlorianDeconinck · 2024-09-12T15:37:12Z

QSat, saturation of vapor, is based on the calculation of Ice and Liquid tables estimated at an incremental temperature values. The table is filled using an exact formulation for both water species described in our code in pyMoist.saturation.qsat_liquid and pyMoist.saturation.qsat_ice.

Code has been heavily debugged to be correct but numeric doesn't validate, which bubbles to qsat itself being wrong and this code is at the base of a lot of moist computation. We side-stepped the validation issue by loading the table from netcdf.

Debugging further we figured out that the issues boils down to actual C (python) vs Fortran numerical differences. Here's two calculation that don't agree with all compilation flags deactivated

C:

#include <stdio.h>

double S16 = 0.516000335e-11 * 100.0;
double S15 = 0.276961083e-8 * 100.0;
double S14 = 0.623439266e-6 * 100.0;
double S13 = 0.754129933e-4 * 100.0;
double S12 = 0.517609116e-2 * 100.0;
double S11 = 0.191372282e0 * 100.0;
double S10 = 0.298152339e1 * 100.0;
double S26 = 0.314296723e-10 * 100.0;
double S25 = 0.132243858e-7 * 100.0;
double S24 = 0.236279781e-5 * 100.0;
double S23 = 0.230325039e-3 * 100.0;
double S22 = 0.129690326e-1 * 100.0;
double S21 = 0.401390832e0 * 100.0;
double S20 = 0.535098336e1 * 100.0;

const float TI = 178.16f;
const float TT = TI - 273.16f;

int main(int argc, void *argv)
{
    float EX = TT * (TT * (TT * (TT * (TT * (TT * S16 + S15) + S14) + S13) + S12) + S11) + S10;
    printf("EX: %f\n", EX);
    return 0;
}

prints:

EX: 0.003803

F90:

program qsat
    real*8 :: S16= 0.516000335E-11*100.0
    real*8 :: S15= 0.276961083E-8 *100.0
    real*8 :: S14= 0.623439266E-6 *100.0
    real*8 :: S13= 0.754129933E-4 *100.0
    real*8 :: S12= 0.517609116E-2 *100.0
    real*8 :: S11= 0.191372282E+0 *100.0
    real*8 :: S10= 0.298152339E+1 *100.0
    real*8 :: S26= 0.314296723E-10*100.0
    real*8 :: S25= 0.132243858E-7 *100.0
    real*8 :: S24= 0.236279781E-5 *100.0
    real*8 :: S23= 0.230325039E-3 *100.0
    real*8 :: S22= 0.129690326E-1 *100.0
    real*8 :: S21= 0.401390832E+0 *100.0
    real*8 :: S20= 0.535098336E+1 *100.0
    
    real*4, parameter :: TI = 178.16
    real*4 :: TT = TI - 273.16
    real*4 :: EX = 0

    EX = TT * (TT * (TT * (TT * (TT * (TT * S16 + S15) + S14) + S13) + S12) + S11) + S10

    print *, EX

end program qsat

prints

3.77704715E-03

Both of those examples are gcc and gfortran with no options.

We need to figure out how we can make those agree, if at all.

Parent: #57

Match C and Fortran
Deploy "numerical fix" on the DSL stack, even it means a debug mode running slowly
OR Document findings

PS:

The actual errors

For an expected value of

The text was updated successfully, but these errors were encountered:

FlorianDeconinck · 2024-09-12T16:52:40Z

Digging in the assembly, we discovered that the value of the constants are different, e.g.

Language	S16
C	5.16000335000000020638850831546182856834903418530302587897e-10
F90	5.160003535564783305744640529155731201171875e-10

Considering the operations comprised of mult and add, this brings the difference. If we hardcode the Fortran assembly-level value into C, we validated

… differences between Fortran and C interpretation of constants (see GEOS-ESM/SMT-Nebulae#88 for more information). Table values are instead read in from netCDF file as a temporary solution.

FlorianDeconinck · 2024-09-12T18:51:05Z

Same result under icx and ifx, with clang frontend

FlorianDeconinck · 2024-09-12T19:11:54Z

It seems Fortran is putting a 32--bit representation into a 64-bit registry on literals. E.g. doing the following on C gives the same result

const double S16 = 0.516000335e-11f * 100.f;

FlorianDeconinck · 2024-09-12T19:17:56Z

Best match

Lang	Ex
C	0.003777
F90	3.77704715E-03
Py	0.0037770470880218454

With code as follows

#include <stdio.h>

const double S16 = 0.516000335e-11f * 100.f;
const double S15 = 0.276961083e-8f * 100.f;
const double S14 = 0.623439266e-6f * 100.f;
const double S13 = 0.754129933e-4f * 100.f;
const double S12 = 0.517609116e-2f * 100.f;
const double S11 = 0.191372282e0f * 100.f;
const double S10 = 0.298152339e1f * 100.f;

const float TI = 178.16f;
const float TT = TI - 273.16f;

int main(int argc, char **argv)
{
    float EX = TT * (TT * (TT * (TT * (TT * (TT * S16 + S15) + S14) + S13) + S12) + S11) + S10;
    printf("EX: %f\n", EX);
    return 0;
}

program hello
    real*8 :: S16= 0.516000335E-11*100.0
    real*8 :: S15= 0.276961083E-8 *100.0
    real*8 :: S14= 0.623439266E-6 *100.0
    real*8 :: S13= 0.754129933E-4 *100.0
    real*8 :: S12= 0.517609116E-2 *100.0
    real*8 :: S11= 0.191372282E+0 *100.0
    real*8 :: S10= 0.298152339E+1 *100.0
    
    real*4, parameter :: TI = 178.16
    real*4, parameter :: TT = TI - 273.16
    real*4 :: EX = 0

    EX = TT * (TT * (TT * (TT * (TT * (TT * S16 + S15) + S14) + S13) + S12) + S11) + S10

    print *, 'EX', EX

end program hello

import numpy as np

S16 = np.float64(np.float32(0.516000335e-11) * np.float32(100.0))
S15 = np.float64(np.float32(0.276961083e-8) * np.float32(100.0))
S14 = np.float64(np.float32(0.623439266e-6) * np.float32(100.0))
S13 = np.float64(np.float32(0.754129933e-4) * np.float32(100.0))
S12 = np.float64(np.float32(0.517609116e-2) * np.float32(100.0))
S11 = np.float64(np.float32(0.191372282e0) * np.float32(100.0))
S10 = np.float64(np.float32(0.298152339e1) * np.float32(100.0))
S26 = np.float64(np.float32(0.314296723e-10) * np.float32(100.0))
S25 = np.float64(np.float32(0.132243858e-7) * np.float32(100.0))
S24 = np.float64(np.float32(0.236279781e-5) * np.float32(100.0))
S23 = np.float64(np.float32(0.230325039e-3) * np.float32(100.0))
S22 = np.float64(np.float32(0.129690326e-1) * np.float32(100.0))
S21 = np.float64(np.float32(0.401390832e0) * np.float32(100.0))
S20 = np.float64(np.float32(0.535098336e1) * np.float32(100.0))

TI = np.float32(178.16)  # 273.16 - 95.0
TT = TI - np.float32(273.16)

EX = TT * (TT * (TT * (TT * (TT * (TT * S16 + S15) + S14) + S13) + S12) + S11) + S10

print(S16)
print(EX)

* SaturationVaporPressureTable code first pass QSatIce missing before test can be done * QSat_Ice_scalar_exact * Fix calculation. Table generates OK. * Fix Murphy and CAM * Rename estimate_table -> table Verbose the README * Functional QSat module. Can handle optional RAMP/PASCALS/DQSAT inputs. Fails to verify at majority of points using original metric, has not been tested with new metrics. Error patterns of the ES tables suggest arise during math within qsat_liquid and qsat_ice. Errors are small (10^-4). Further testing using find_klcl is in process to see if these errors propogate further or are locked within QSat. * More clean up, first attempt at QSat function * Working QSat function. QSat class can still be called outside of stencil to calculate QSat for entire field, or function can be called from within the stencil to calculate QSat at individual points. To use function, the QSat class must be initialized before stencil call and the tables must be passed down to the function as an input. * Minor change for improved DQSAT functionality * Minor change for improved DQSAT functionality * Two new functions: QSat_Float_Liquid and QSat_Float_Ice. These two functions are a port of QSATLQU0/QSATICE0 when UTBL is false. Notable difference from QSat_Float: QSat_Float features a smooth transition from ice to liquid water (based on the TMIX parameter) while these two functions have a hard transition at 0C (should transition from calling one to the other). UNTESTED FOR NOW, NO EASILY ACCESSABLE TEST CASE * Tiny little bugfix fro QSat_Float_Liquid and QSat_Float_Ice * Fixed indexing error in QSat_Float_Liquid/Ice, and inserted a workaround for bad interaction between dace backend and cast to int * Code cleanup, ready to be pulled back into main branch * Few more cleanups, modification to translate test to test sat tables * QSat verifies if tables are read in from netcdf (this is the current state of the code). Generated tables do not yet verify. * Fix run tests * Functional QSat module. Table calculations are still incorrect due to differences between Fortran and C interpretation of constants (see GEOS-ESM/SMT-Nebulae#88 for more information). Table values are instead read in from netCDF file as a temporary solution. * Forgot to apply fix to other QSat functions. All functions now work properly * Linting * Refactor table_method to an enum Rename formulation to types * Clean up translate test Revert run_tests.sh to common one * Revert run_tests.sh to common one * Clean up debug fields for table --------- Co-authored-by: Florian Deconinck <[email protected]> Co-authored-by: Florian Deconinck <[email protected]>

FlorianDeconinck · 2024-09-27T13:34:14Z

This was fixed in GEOS-ESM/GEOSgcm_GridComp#1007

The issues was a mix of 32 representation pushed into a 64-bit float and the fact that in python an i32 * f32 will result in a f64 which then percolate in the calculation and shift the errors into 64-bit land compared to the Fortran.

The actual line for this work was:

-            t = i * DELTA_T + TMINTBL
+            t = Float(i * DELTA_T) + TMINTBL

* First bit of the evap_subl_pdf port. Initial calculation verifies, meltfrz does not. * Updates to the evap_subl_pdf loop. meltfrz now verifies (excluding minor random errors on the order of 10^-8 which have yet to be investigated). evap and subl are believed to verify pending a fix to the cloud_effective_radius functions. Error in cloud_effective_radius functions appears to be a reappearance of the 32-bit precision errors previous noted when using these functions (errors are on the order of 10^-12). First attempt at hystpdf port, but still missing many pieces and does not execute. * Additional minor changes to hystpdf, various other minor QOL changes * SaturationVaporPressureTable code first pass QSatIce missing before test can be done * QSat_Ice_scalar_exact * Fix calculation. Table generates OK. * Fix Murphy and CAM * Rename estimate_table -> table Verbose the README * Creation of QSat stencil, which still needs a lot of work. Need to push this code back to dsl/qsat branch as well at some point to have a working qsat function to pull on for all needs, but for now it is in evap_subl_pdf for development so that I can test incorporating it into the evap loop * Functional QSat module, fails at majority of points but error pattern suggests numerics are as close as I can get them while remaining true to Fortran * Updates to QSat and a few other minor QOL changes * One more change that wasn't staged for some reason * Functional QSat module. Can handle optional RAMP/PASCALS/DQSAT inputs. Fails to verify at majority of points using original metric, has not been tested with new metrics. Error patterns of the ES tables suggest arise during math within qsat_liquid and qsat_ice. Errors are small (10^-4). Further testing using find_klcl is in process to see if these errors propogate further or are locked within QSat. * More clean up, first attempt at QSat function * Working QSat function. QSat class can still be called outside of stencil to calculate QSat for entire field, or function can be called from within the stencil to calculate QSat at individual points. To use function, the QSat class must be initialized before stencil call and the tables must be passed down to the function as an input. * Functional find_klcl port. Verifies at all but 2 points. Two failed cells are believed to be due to errors in QSat. * Minor change for improved DQSAT functionality * Minor change for improved DQSAT functionality * Reorganization of functions/stencils from evap_subl_pdf loop to new design that (should) work for the full GFDL_1M module. New stencil: hystpdf (in process) New functions: pdffrac, pdfcondensate, bergeron_partition (in process) Bergeron_partition calls GEOS_QsatLQU/ICE, which reaches into the non-table side of the QSAT world (not currently implemented, going to tackle that next). All new additions will be tested as one once the entire hystpdf is ported. * Two new functions: QSat_Float_Liquid and QSat_Float_Ice. These two functions are a port of QSATLQU0/QSATICE0 when UTBL is false. Notable difference from QSat_Float: QSat_Float features a smooth transition from ice to liquid water (based on the TMIX parameter) while these two functions have a hard transition at 0C (should transition from calling one to the other). UNTESTED FOR NOW, NO EASILY ACCESSABLE TEST CASE * Completed (but untested) hystpdf. Lots of problems compiling the stencil. * Tiny little bugfix fro QSat_Float_Liquid and QSat_Float_Ice * hystpdf compiles, few fixes to deal with various gt4py/ndsl related errors/workarounds * Fixed indexing error in QSat_Float_Liquid/Ice, and inserted a workaround for bad interaction between dace backend and cast to int * hystpdf nearly verifies. lots of small errors, potentially rounding/32bit problems * Code cleanup, ready to be pulled back into main branch * Few more cleanups, modification to translate test to test sat tables * QSat verifies if tables are read in from netcdf (this is the current state of the code). Generated tables do not yet verify. * Fix run tests * Functional QSat module. Table calculations are still incorrect due to differences between Fortran and C interpretation of constants (see GEOS-ESM/SMT-Nebulae#88 for more information). Table values are instead read in from netCDF file as a temporary solution. * Merged new QSat, manually fixed the qsat bugs that didn't make it pre-merge * Debugging hystpdf, commiting now to pull new dsl/develop pr with QSat changes * Functional hystpdf, verifies... when constants behave * Functional, verified evap_subl_pdf loop. Moved find_klcl into new misc_unused directory Pre-commit run, fails in one spot b/c one comment is "too long" * Update GEOSagcm_GridComp/GEOSphysics_GridComp/GEOSmoist_GridComp/pyMoist/misc_unused/find_klcl.py Co-authored-by: Florian Deconinck <[email protected]> * Bunch of changes per Florian's comments on the PR * Linting (and one file rename) * Update GEOSagcm_GridComp/GEOSphysics_GridComp/GEOSmoist_GridComp/pyMoist/pyMoist/GFDL_1M/GFDL_1M.py Co-authored-by: Florian Deconinck <[email protected]> * More changes for the PR * One last change * Lint * Changes per Tobias' comments less a proper variable naming scheme and implementation of new k-level feature. Will rename after k-level feature is implemented, want to remain true to fortran until development is finished for easier tracking. * Couple more changes based on Florian & Tobias comments. find_klcl removed b/c it is unfinished and unused. will be recreated later on when it is needed * Delete GEOSagcm_GridComp/GEOSphysics_GridComp/GEOSmoist_GridComp/pyMoist/tests/scripts/run.out * Lint & revert run_tests.sh * ToDo for renaming hystpdf --------- Co-authored-by: Florian Deconinck <[email protected]>

FlorianDeconinck mentioned this issue Sep 12, 2024

{P} Moist physics Grid Component #57

Open

11 tasks

FlorianDeconinck self-assigned this Sep 25, 2024

FlorianDeconinck closed this as completed Sep 27, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Vapor Saturation (QSat) table computation Fortran vs C f32 issues #88

Vapor Saturation (QSat) table computation Fortran vs C f32 issues #88

FlorianDeconinck commented Sep 12, 2024 •

edited

Loading

FlorianDeconinck commented Sep 12, 2024

FlorianDeconinck commented Sep 12, 2024

FlorianDeconinck commented Sep 12, 2024

FlorianDeconinck commented Sep 12, 2024

FlorianDeconinck commented Sep 27, 2024

Vapor Saturation (QSat) table computation Fortran vs C f32 issues #88

Vapor Saturation (QSat) table computation Fortran vs C f32 issues #88

Comments

FlorianDeconinck commented Sep 12, 2024 • edited Loading

FlorianDeconinck commented Sep 12, 2024

FlorianDeconinck commented Sep 12, 2024

FlorianDeconinck commented Sep 12, 2024

FlorianDeconinck commented Sep 12, 2024

FlorianDeconinck commented Sep 27, 2024

FlorianDeconinck commented Sep 12, 2024 •

edited

Loading