
Optimization options #825

Answered by DavidHuber-NOAA
hertneky asked this question in Q&A
Jun 6, 2023 · 1 comment · 3 replies

@hertneky It's interesting that the runtime was so much slower when running 6:48. When run this way, OMP_NUM_THREADS_RUN_FCST needs to be set to 1 (it's 2 by default) so that each PE runs on one core instead of two. Did you try this? In my experience, running with more instances and fewer threads tends to be faster. You might try setting that in your config file and rerunning setup.py. Note that this will use ~2x the memory on the node, so you might want to check whether the 6:24 case was already bumping up against that limit. If you use Slurm as your job manager, you can check this with sacct -j <job number> -o "JobName,MaxRSS", where MaxRSS is the maximum memory used by a single PE. Multipl…
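
As a rough illustration of the memory check described above, here is a minimal Python sketch that converts a MaxRSS value copied from sacct output into bytes and tests whether roughly doubling the node's usage would still fit. The function names, the 1024-based unit suffixes, the PEs-per-node count, and the node memory size are all assumptions made for this example; none of them come from the workflow itself. Scaling the per-PE figure by the number of PEs on the node is likewise an assumption of this sketch, and it gives only an upper-bound style estimate.

```python
import re

# Multipliers for unit suffixes on MaxRSS values (assumed to be 1024-based).
_UNITS = {"": 1, "K": 1024, "M": 1024**2, "G": 1024**3, "T": 1024**4}


def parse_max_rss(value: str) -> int:
    """Convert a MaxRSS string such as '2048K' or '1.5G' to bytes."""
    match = re.fullmatch(r"\s*([0-9]*\.?[0-9]+)\s*([KMGT]?)\s*", value.upper())
    if match is None:
        raise ValueError(f"unrecognized MaxRSS value: {value!r}")
    number, unit = match.groups()
    return int(float(number) * _UNITS[unit])


def fits_after_doubling(max_rss: str, pes_per_node: int, node_mem_gib: float) -> bool:
    """Rough check: does ~2x the estimated per-node usage fit in node memory?

    Assumption: current per-node usage is approximated as MaxRSS times the
    number of PEs on the node, which treats every PE as using the maximum.
    """
    per_node_now = parse_max_rss(max_rss) * pes_per_node
    return 2 * per_node_now <= node_mem_gib * 1024**3


if __name__ == "__main__":
    # Hypothetical numbers: 24 PEs per node on a node with 192 GiB of memory.
    print(fits_after_doubling("2G", pes_per_node=24, node_mem_gib=192))
```

If the check returns False for your own numbers, the single-thread layout is likely to run out of memory on the node, and fewer PEs per node may be needed.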

@hertneky

hertneky Jul 7, 2023
Maintainer Author

@DavidHuber-NOAA

@hertneky

hertneky Jul 18, 2023
Maintainer Author

Answer selected by hertneky
Category
Q&A
2 participants