rt.sh crashes with 'Permission denied' error while creating run directory on Gaea #2198

Closed
DusanJovic-NOAA opened this issue Mar 19, 2024 · 4 comments · Fixed by #2195
Labels: bug (Something isn't working)

Comments

@DusanJovic-NOAA
Collaborator

Description

When I run the regression tests on Gaea, rt.sh crashes with a 'Permission denied' error while creating the run directory. The script tries to create a directory under /gpfs/f5/epic/scratch, and I do not have permission to write there.

To Reproduce:

  1. Run rt.sh on Gaea from a user account that is not a member of the epic project.

Additional context

Output

+ PARTITION=c5
+ STMP=/gpfs/f5/epic/scratch
+ PTMP=/gpfs/f5/epic/scratch
+ SCHEDULER=slurm
+ mkdir -p /gpfs/f5/epic/scratch/Dusan.Jovic
mkdir: cannot create directory ‘/gpfs/f5/epic/scratch/Dusan.Jovic’: Permission denied
++ echo 'rt.sh error on line 897'
rt.sh error on line 897
++ cleanup

@DusanJovic-NOAA DusanJovic-NOAA added the bug Something isn't working label Mar 19, 2024
@FernandoAndrade-NOAA
Collaborator

Hi @DusanJovic-NOAA, users outside of the epic group on Gaea will need to update the paths referring to /gpfs/f5/epic/ in order to run WM RTs. A permissions update to the directories unfortunately likely wouldn't work, as I've noticed that something on Gaea's back end has regularly been resetting directories to their more restricted defaults after the C5/F5 transition.

@jkbk2004
Collaborator

We might need to add one more global variable that sets the project-specific name in the F5 scratch path, with epic as the default.
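
A minimal sketch of what such a variable could look like. The name `F5_PROJECT` is hypothetical and does not exist in rt.sh; the only thing taken from this thread is the existing path /gpfs/f5/epic/scratch, which the sketch reproduces when the variable is not set.

```shell
#!/bin/bash
# F5_PROJECT is a made-up name for illustration; it is not an rt.sh variable.
# If the caller has not set it (or set it to an empty string), fall back to epic.
F5_PROJECT=${F5_PROJECT:-epic}

# Build the scratch paths from the project name instead of hard-coding epic.
STMP=/gpfs/f5/${F5_PROJECT}/scratch
PTMP=/gpfs/f5/${F5_PROJECT}/scratch

echo "STMP=${STMP}"
echo "PTMP=${PTMP}"
```

With this shape, a user in another project would set `F5_PROJECT` before running the script, and users in the epic project would see no change in behavior.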

@DusanJovic-NOAA
Collaborator Author

Maybe something like this:

diff --git a/tests/rt.sh b/tests/rt.sh
index fd6464c6..823bca5a 100755
--- a/tests/rt.sh
+++ b/tests/rt.sh
@@ -674,8 +674,9 @@ elif [[ $MACHINE_ID = gaea ]]; then
   QUEUE=normal
   COMPILE_QUEUE=normal
   PARTITION=c5
-  STMP=/gpfs/f5/epic/scratch
-  PTMP=/gpfs/f5/epic/scratch
+  dprefix=${dprefix:-/gpfs/f5/$ACCNR/scratch/$USER}
+  STMP=${STMP:-$dprefix/RT_BASELINE}
+  PTMP=${PTMP:-$dprefix/RT_RUNDIRS}
 
   SCHEDULER=slurm

Similar to how it's done on Jet.
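
To illustrate how the `${VAR:-default}` fallbacks in the patch above would resolve, here is a small standalone sketch. It copies the three assignments from the diff into a function and evaluates them in subshells under different presets. The account name `myproj`, the user name `Some.User`, and the override paths are invented for the example, and the function name `show_dirs` is not part of rt.sh.

```shell
#!/bin/bash
# show_dirs is a hypothetical helper that holds the same three assignments
# as the proposed patch, so the fallback behavior can be tried in isolation.
show_dirs() {
  dprefix=${dprefix:-/gpfs/f5/${ACCNR}/scratch/${USER}}
  STMP=${STMP:-${dprefix}/RT_BASELINE}
  PTMP=${PTMP:-${dprefix}/RT_RUNDIRS}
  echo "STMP=${STMP} PTMP=${PTMP}"
}

# Each case runs in a subshell so that the presets do not leak between cases.
# ACCNR=myproj and USER=Some.User are made-up values.

# Case 1: nothing preset, so both paths derive from ACCNR and USER.
default_out=$(unset dprefix STMP PTMP; ACCNR=myproj USER=Some.User; show_dirs)

# Case 2: dprefix preset, so both paths move under the custom prefix.
prefix_out=$(unset STMP PTMP; dprefix=/tmp/rt ACCNR=myproj USER=Some.User; show_dirs)

# Case 3: only STMP preset, so PTMP still falls back to the derived default.
stmp_out=$(unset dprefix PTMP; STMP=/custom/baseline ACCNR=myproj USER=Some.User; show_dirs)

echo "${default_out}"
echo "${prefix_out}"
echo "${stmp_out}"
```

Because `:-` treats an empty value the same as an unset one, exporting `STMP=""` would also fall back to the default rather than produce an empty path.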

@JustinPerket

I do something very similar, but I define STMP and PTMP in my environment. I would be happy to see a fix like this so that I won't have to keep editing rt.sh as well.
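
A rough sketch of the environment-based approach, with a pre-flight write check added so that a permission problem shows up before the tests start. The `scratch_root` variable and its default under the temporary directory are assumptions for the example; on Gaea it would point at a scratch directory the user can write to. The exports only take effect if rt.sh reads STMP and PTMP with the `${STMP:-...}` form from the patch proposed earlier in this thread; the current script assigns them unconditionally.

```shell
#!/bin/bash
# scratch_root is a hypothetical variable; the default below is a placeholder
# under the temporary directory so the sketch can run anywhere.
scratch_root=${scratch_root:-${TMPDIR:-/tmp}/rt_scratch_demo}

# Export the two directories so that a child script can inherit them.
export STMP=${scratch_root}/RT_BASELINE
export PTMP=${scratch_root}/RT_RUNDIRS

# Pre-flight check: create each directory and confirm it is writable,
# instead of discovering 'Permission denied' partway through a run.
for dir in "${STMP}" "${PTMP}"; do
  if mkdir -p "${dir}" && [[ -w "${dir}" ]]; then
    echo "ok: ${dir}"
  else
    echo "cannot write to ${dir}" >&2
  fi
done
```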
