gmtb-gfsphysics optimization: speedup of ccpp init #48

climbfuji · 2018-03-14T17:33:43Z

This PR improves the performance of the CCPP implementation in FV3 v0 by:

* making use of threading (i.e. OpenMP parallel regions) where possible
* allocating Interstitial(nt) on thread nt instead of thread 1 (first-touch principle)
* using new functionality of the CCPP infrastructure that allows copying an existing CCPP suite data structure from one cdata structure to another
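The last point can be illustrated with a minimal, language-neutral sketch. It is written in Python rather than Fortran and is not the CCPP API: `parse_suite`, `init_all_blocks`, and the dictionary layout are hypothetical names chosen for illustration. The idea is only that the expensive step (parsing the suite definition) runs once, and every per-block cdata structure receives a copy of the already parsed suite.

```python
import copy

# Counts how often the (hypothetical) suite definition is parsed.
parse_calls = 0


def parse_suite(sdf_text):
    """Stand-in for the expensive step: parsing a suite definition."""
    global parse_calls
    parse_calls += 1
    return {"schemes": sdf_text.split()}


def init_all_blocks(sdf_text, nblocks):
    """Parse once, then copy the parsed suite into one cdata per block."""
    cdata = {"suite": parse_suite(sdf_text)}
    # Each block gets its own independent copy instead of re-parsing.
    cdata_block = [
        {"suite": copy.deepcopy(cdata["suite"])} for _ in range(nblocks)
    ]
    return cdata, cdata_block


cdata, cdata_block = init_all_blocks("scheme_a scheme_b", nblocks=4)
```

With this structure the number of parse calls stays at one regardless of the number of blocks, and the per-block copies can then be initialized independently, for example from separate threads.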

This PR also:

* adds the missing calls to ccpp_finalize in IPD_CCPP_driver.F90 (IPD step 5)
* performs some cleanup work and initializes intent(out) variables in GFS_MP_generic_pre.f90 and GFS_calpreciptype.F90

The results are bit-for-bit identical (tested on Theia/Intel and MacBook/GNU with OpenMP enabled).

llpcarson · 2018-03-14T21:37:39Z

Dom - My tests today, and yesterday, are not giving bit-for-bit results, but I'm not sure that they should. The branch I used yesterday (gmtb-fv3-cleanup-and-error-handling) may or may not have had all of the same fixes as this one. Is this a concern worth tracking down? If your tests show bit-for-bit before/after, that's sufficient for me.

climbfuji · 2018-03-14T21:40:24Z

@llpcarson Did you create a new baseline or use my baseline? With the changes to the optimization flags in gmtb-gfsphysics PR47, the results have changed (see discussion there) and I did not update the "official" baseline yet. The following one works for me:

/scratch4/BMC/gmtb/Dom.Heinzeller/gmtb-fv3/reference/intel/C96fv3gfs2016092900/

llpcarson · 2018-03-14T21:44:01Z

I was just comparing my run from yesterday and mine from today, so I believe both of them have PR47 included, but I'll re-confirm. That was it: the branch I tested yesterday did not have that PR included, so this one is ready to commit!

llpcarson

Approved

climbfuji · 2018-03-14T21:47:58Z

My reference is from straight after pr47 went in, i.e. March 12/13, before the merge this morning and the PRs afterwards. I do get bfb identical results for the C96 test case. But I definitely want to understand why you don't!

llpcarson · 2018-03-14T21:52:30Z

My runs today match that baseline, too. So, the branch I grabbed earlier was pre-PR47 (which was OK for timing, but not for b4b)

climbfuji · 2018-03-14T21:53:09Z

Great, thank you!

* fv3atm issue NCAR#37: fix the real(8) lat/lon in netcdf file
* fv3atm NCAR#35: Reducing background vertical diffusivities in the inversion layers
* fv3atm NCAR#24: bug in gfsphysics/physics/moninedmf_hafs.f
* fv3atm NCAR#18: Optimize netcdf write component and bugfix for post and samfdeepcnv.f
* set (0-1) bounds for ficein_cpl
* remove cache_size due to lower netcdf version 4.5.1 on mars
* Change ice falling to 0.9 in gfsphysics/physics/gfdl_cloud_microphys.F90

DomHeinzeller added 5 commits March 13, 2018 14:49

* Cleanup work: initialization of intent(out) variables in physics/GFS_MP_generic_pre.f90 (5e2b41a)
* Cleanup work: initialization of intent(out) variables in physics/GFS_calpreciptype.f90 (7b5daf6)
* Performance tuning: let each OpenMP thread create its Interstitial(nt) derived type so that it gets allocated in the right place in memory (first touch principle): GFS_layer/GFS_driver.F90 (8610c83)
* Performance tuning: run ccpp_init in parallel, use cdata%suite for cdata_block to avoid reading SDF multiple times, cleanup and formatting changes: IPD_layer/IPD_CCPP_driver.F90 (2b6e810)
* Merge branch 'features/ccpp' of https://github.com/NCAR/gmtb-gfsphysics into gmtb-gfsphysics-optimization (b0d32c4)

climbfuji requested review from llpcarson, ligiabernardet and grantfirl March 14, 2018 17:36

llpcarson approved these changes Mar 14, 2018

climbfuji merged commit ed0b2e5 into NCAR:features/ccpp Mar 15, 2018

climbfuji deleted the gmtb-gfsphysics-optimization-speedup-of-ccpp-init-rebased branch March 15, 2018 17:41

climbfuji pushed a commit to climbfuji/ccpp-physics that referenced this pull request Aug 10, 2020

Merge pull request NCAR#48 from climbfuji/update_gsd_develop_from_ncar_master (2735570)

Update gsd/develop from NCAR master
