Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Model blowup on ecmwf-atos for TCO319-DART configuration, mzapponi_refactoring4awicm3.1 branch #611

Open
tsemmler05 opened this issue Jul 2, 2024 · 1 comment

Comments

@tsemmler05
Copy link
Collaborator

When running above model configuration with git commit bcbb117, the coupled model blows up after a number of time steps. The number of time steps depends on if it is a cold start or a restart run. If it is a restart run, it also depends on which restart files are used. eta_z is from one time step to the next out of the range -10 / +10; from one to the next time step the temperature drops from a reasonable value (for example 8 or 15 C) to a large negative value (around -25 C).

When I change to the git commit e2f45f0, the one that is running without any problems on DKRZ levante, I get a seg fault straight away:

FESOM start iteration before the barrier...
FESOM start iteration after the barrier...

^[[32m____________________________________________________________^[[0m
^[[7;32m --> FESOM STARTS TIME LOOP ^[[0m
file : namelist.io open ok
io_listsize will be changed from 100 to 39 !
adding I/O stream 2D for sst
[ac1-3055:2759318:0:2759318] Caught signal 11 (Segmentation fault: address not mapped to object at address 0x10)

@JanStreffing
Copy link
Collaborator

Hey Tido, could you compile with debug flags to change the error into something more interpretable? To do that you need to change a bit the src/CMakeLists.txt . Which compile are you using?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants