Updating the before_notebook `20_start-postgresql.sh` and `40_prepare_aiida.sh` #476

mikibonacci · 2024-07-03T21:15:22Z

These fixes address issues I encountered when restarting the container on arm64.

1 - Delete the old ${PSQL_LOGFILE}.1.gz file before trying to compress the log again in 20_start-postgresql.sh. Otherwise, the compression can fail and the container exits with a non-zero status.
2 - Cleanly stop the daemon before "verdi storage migrate --force" in 40_prepare_aiida.sh. Sometimes restarting the Docker container fails during the migration because the daemon is reported as still running. I have not been able to reproduce the error reliably, however.
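
A minimal sketch of the first fix, assuming the startup script rotates the log by moving it aside and compressing it with gzip. The function name `rotate_logfile` and the demo paths are hypothetical and are not taken from 20_start-postgresql.sh.

```shell
#!/usr/bin/env bash
set -euo pipefail

# Rotate a log file: move it aside and compress it, removing any archive
# left over from a previous container run first.
# The function name is hypothetical.
rotate_logfile() {
    local logfile="$1"
    [ -f "${logfile}" ] || return 0
    # Without this, gzip finds the old archive and, in a non-interactive
    # shell, refuses to overwrite it and returns a non-zero status.
    rm -f "${logfile}.1.gz"
    mv "${logfile}" "${logfile}.1"
    gzip "${logfile}.1"
}

# Demo in a throwaway directory; PSQL_LOGFILE is only a stand-in path here.
workdir="$(mktemp -d)"
trap 'rm -rf "${workdir}"' EXIT
PSQL_LOGFILE="${workdir}/logfile"

echo "first run" > "${PSQL_LOGFILE}"
rotate_logfile "${PSQL_LOGFILE}"

# Simulated restart: a new log exists and the old archive is still present.
echo "second run" > "${PSQL_LOGFILE}"
rotate_logfile "${PSQL_LOGFILE}"
```

Passing `-f` to gzip would also overwrite the existing archive; removing the file explicitly keeps the intent visible in the script.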


for more information, see https://pre-commit.ci

stack/base-with-services/before-notebook.d/20_start-postgresql.sh

danielhollas · 2024-07-03T21:42:24Z

stack/base/before-notebook.d/40_prepare-aiida.sh

@@ -74,6 +74,8 @@ load_computer('${computer_name}').set_minimum_job_poll_interval(${job_poll_inter
 else

  # Migration will run for the default profile.
+  ## We need to stop the daemon before.
+  verdi daemon stop


I think we need to understand why this is needed, since the daemon cannot be running at the container startup.

@sphuber does verdi daemon stop do some kind of cleanup that might explain why it is needed here? It is true that we currently don't gracefully stop the daemon when we stop the container so there might perhaps be some stale pid files or something?

@mikibonacci what is the exact error that you saw coming from the verdi storage migrate command?

Depends on the version. The behavior has changed a bit in the 2.x minor releases. But currently, yes, verdi daemon stop should clean up stale PID files. It is also possible that some other commands check whether the daemon "is running" based on that PID file. I would have to look into it in more detail once I know exactly which version you are targeting.

Thanks! We're currently on 2.5, and hopefully will switch to 2.6 soon.

I was trying to get rid of verdi daemon stop because it adds 2s to container startup. Perhaps we can just remove the stale pid files manually?

Hi @danielhollas and @sphuber, the exact error printed in the log is:

2024-07-03 15:41:58 Critical: Migration aborted, the daemon for the profile is still running.

The target AiiDA version is aiida-core==2.5.1 (build=pyhca7485f_0, channel=conda-forge) indeed.

> I was trying to get rid of verdi daemon stop because it adds 2s to container startup. Perhaps we can just remove the stale pid files manually?

Well if you are absolutely certain that the daemon shouldn't be running, you could just manually remove the pid file .aiida/daemon/circus-{profile-name}.pid which should be very fast. The verdi daemon stop command will actually try to reach the daemon, which has a 2 second timeout by default.

The verdi storage migrate command has access to the daemon client, so another possible solution is for it to clean up stale PID files before migrating when the --force flag is used.

I am not sure that it would be a good idea to have --force ignore a running daemon. The --force flag is there to not have to type out the confirmation message. This has been the case for a long time and users may have grown used to that. If we now also have it start ignoring a running daemon, that might be dangerous.

Okay, then manually deleting it as Miki did here is fine.
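
A rough sketch of the manual cleanup discussed in this thread, under stated assumptions. The PID file location follows the `.aiida/daemon/circus-{profile-name}.pid` pattern mentioned above; the function name, the profile name `default`, and the `kill -0` liveness check are illustrative choices, not what the PR itself does (the commit history shows it checking with pgrep).

```shell
#!/usr/bin/env bash
set -euo pipefail

# Remove a PID file only if the process it records is no longer alive.
# Hypothetical helper; not part of 40_prepare-aiida.sh.
remove_stale_pid_file() {
    local pid_file="$1"
    [ -f "${pid_file}" ] || return 0
    local pid
    pid="$(cat "${pid_file}")"
    # kill -0 sends no signal; it only reports whether the process exists
    # and can be signalled by the current user.
    if [ -n "${pid}" ] && kill -0 "${pid}" 2> /dev/null; then
        echo "process ${pid} is alive; keeping ${pid_file}" >&2
        return 1
    fi
    rm -f "${pid_file}"
}

# Demo in a throwaway directory standing in for ~/.aiida.
aiida_dir="$(mktemp -d)"
trap 'rm -rf "${aiida_dir}"' EXIT
mkdir -p "${aiida_dir}/daemon"
pid_file="${aiida_dir}/daemon/circus-default.pid"

# Record the PID of a process that has already exited: the file is stale.
sleep 0 &
dead_pid=$!
wait "${dead_pid}"
echo "${dead_pid}" > "${pid_file}"
remove_stale_pid_file "${pid_file}"

# Record the PID of this shell: the file is not stale and is kept.
echo "$$" > "${pid_file}"
remove_stale_pid_file "${pid_file}" || true
```

This avoids the 2 second timeout of `verdi daemon stop` because it never tries to contact the daemon. After a container restart a recorded PID could in principle belong to an unrelated process, so a check like this can only err on the side of keeping the file.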

danielhollas · 2024-07-03T21:44:14Z

Thanks for catching these issues.

> I encountered issues in restarting the container.

How exactly are you restarting the container?

mikibonacci · 2024-07-03T22:05:45Z

For the restart, I used aiidalab-launch and also the manual startup (Docker Desktop and terminal). These issues occur only when I use an arm64 machine (my Mac). That's strange.

danielhollas · 2024-07-04T13:07:07Z

@mikibonacci can you test the latest version of the image (ghcr.io/aiidalab/full-stack:pr-476)?

Once you confirm that it works reliably for you I'll merge and release a new version.

Thanks for catching this and the fixes!

mikibonacci · 2024-07-04T15:54:58Z

Hi @danielhollas , I tested the ghcr.io/aiidalab/full-stack:pr-476 and it works well for me! Thanks a lot for the review!

danielhollas · 2024-07-04T16:11:51Z

Thanks for testing @mikibonacci. I'll release a new version soon with your fix.

I've opened #477 as a follow up to this to prevent these issues in the future.

danielhollas · 2024-07-04T17:17:24Z

@mikibonacci new version is released. 🚀

mikibonacci requested review from danielhollas and unkcpz July 3, 2024 21:15

[pre-commit.ci] auto fixes from pre-commit.com hooks

9bb9234

for more information, see https://pre-commit.ci

danielhollas requested changes Jul 3, 2024

mikibonacci and others added 3 commits July 4, 2024 00:06

Update stack/base-with-services/before-notebook.d/20_start-postgresql.sh

604d0e6

Co-authored-by: Daniel Hollas <[email protected]>

[pre-commit.ci] auto fixes from pre-commit.com hooks

23c6120

for more information, see https://pre-commit.ci

Clean up stale PID-files before verdi migrate --force in `40_prepar…

8be6bf6

…e-aiida.sh` This is faster than `verdi daemon stop`. We suppose that the daemon is not running when we start/restart the container.

mikibonacci commented Jul 4, 2024

Update stack/base/before-notebook.d/40_prepare-aiida.sh

75e6811

Co-authored-by: Daniel Hollas <[email protected]>

danielhollas reviewed Jul 4, 2024

pgrep verdi daemon

201ba78

danielhollas reviewed Jul 4, 2024

Fix rm command

25e937b

danielhollas self-requested a review July 4, 2024 16:06

danielhollas approved these changes Jul 4, 2024

danielhollas merged commit c4dbc02 into main Jul 4, 2024
15 checks passed

danielhollas deleted the fix/arm64/daemon_clean_stop branch July 4, 2024 16:09

danielhollas mentioned this pull request Jul 4, 2024

Add a test for container restart #477

Open


mikibonacci commented Jul 3, 2024

danielhollas Jul 3, 2024

sphuber Jul 3, 2024

danielhollas Jul 3, 2024

mikibonacci Jul 3, 2024

sphuber Jul 4, 2024

unkcpz Jul 4, 2024 • edited

sphuber Jul 4, 2024

unkcpz Jul 4, 2024
