
Segfault fix #30

Merged · 65 commits · Mar 13, 2023

Conversation

alexandrebouchard (Member) commented Mar 6, 2023

Post-resolution analysis/summary.

The bugs encountered and addressed here fall into two categories:

B1. A newly released version of Turing broke tests (potentially only on Julia nightly?).
B2. Various subtle and non-deterministic MPI bugs, e.g. related to the interaction of Julia's GC and MPI.

By coincidence, CI started failing with both B1 and B2 around the same time: in the first case because of the new Turing release, in the second by chance, since B2 occurs non-deterministically. For B1, we simply removed Turing from the test dependencies; it turns out that, after some changes to the Turing test code, all Turing integration correctness checks can be done with DynamicPPL alone.

After solving a first category-B2 bug, we added more MPI implementations to our test suite, revealing more B2-type bugs. The full set discovered here is:

B2a. All request objects returned by MPI.Isend, MPI.isend, etc. should be captured and either MPI.free()'ed or Wait'ed on. Otherwise, we rely on Julia's GC to tell the MPI C code to free the request objects, of which MPICH only has 2^16. Eventually, we hit situations where GC does not occur early enough, especially in non-allocating or toy examples. Since MPICH does not perform bounds checks, it does not provide a descriptive error and instead segfaults. Moreover, since MPI.jl's built-in test mpiexecs are compiled without debug info, it is difficult to diagnose such issues. See pmodels/mpich#6432 (comment) for more information.
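
For illustration, here is a minimal sketch of the pattern just described, assuming the keyword-argument API of a recent MPI.jl (v0.20-style); the buffer size, tag, and ranks are made up for the example:

using MPI

# run under e.g.: mpiexec -n 2 julia this_script.jl
MPI.Init()
comm = MPI.COMM_WORLD

if MPI.Comm_rank(comm) == 0
    buf = fill(1.0, 8)
    # capture the request returned by the non-blocking send ...
    req = MPI.Isend(buf, comm; dest = 1, tag = 0)
    # ... and explicitly complete it, instead of relying on Julia's GC to
    # eventually release the underlying MPI request object
    MPI.Wait(req)
elseif MPI.Comm_rank(comm) == 1
    recv = zeros(8)
    req = MPI.Irecv!(recv, comm; source = 0, tag = 0)
    MPI.Wait(req)
end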

B2b. A distinct MPI + Julia GC-related issue popped up: according to JuliaParallel/MPI.jl#337, Julia "hijacks" the SIGSEGV signal (segmentation fault) to synchronize threads pausing for Julia's stop-the-world GC. The problem is that some MPI implementations' mpiexec parent process intercepts that SIGSEGV signal and triggers a global crash whenever Julia GC occurs in a multithreaded context. Moreover, the reported error, "segmentation fault", has nothing to do with the underlying problem (for us, the fact that B2b popped up just after solving a genuine segfault, B2a---literally minutes after---made this especially confusing!). For some MPI implementations, this undesirable interference between Julia GC and mpiexec can be worked around (JuliaParallel/MPI.jl#337), but for others no workaround is currently known (e.g. at least some Intel MPI versions, which we reported to MPI.jl in JuliaParallel/MPI.jl#725, but also at least one version of OpenMPI: OpenMPI 4.0, while OpenMPI 4.1 is ok; such unresolved issues have been reported before by others, e.g. https://discourse.julialang.org/t/julia-crashes-inside-threads-with-mpi/52400/5). In summary, B2b is due to a Julia GC design decision, so the best we can do in the future is print a nicer error message; the current one is very cryptic. The good news is that for the vast majority of mainstream MPI implementations, this can be worked around by telling MPI to ignore SIGSEGV. If one is stuck with a closed-source MPI without a workaround, one can always either run single-threaded or stop GC. See #32 for a more detailed proposed enhancement regarding the error messages.
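
A hedged sketch of the workarounds mentioned above; the UCX environment variable applies only to UCX-based MPI stacks, and the exact signal list is an assumption based on MPI.jl's known-issues documentation:

# Option 1: for UCX-based MPI stacks, keep UCX from intercepting SIGSEGV,
# which Julia's multithreaded GC uses for thread synchronization
# (must be set before MPI/UCX initializes)
ENV["UCX_ERROR_SIGNALS"] = "SIGILL,SIGBUS,SIGFPE"  # note: SIGSEGV omitted

# Option 2: run single-threaded (julia -t 1), so GC never needs to signal
# other threads in the first place

# Option 3: as a last resort, disable GC around the communication-heavy code
GC.enable(false)
# ... MPI communication ...
GC.enable(true)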

B2c. To test some OpenMPI implementations, the "--oversubscribe" flag should be added to mpiexec. Moreover, the error message explaining this is in some circumstances "eaten up", leaving only a nondescript Julia pipeline_error stack trace. We now automatically add that flag in our test cases.
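
For reference, this is what adding the flag to a manual launch looks like (the script name and process count are placeholders for the example):

# OpenMPI refuses to start more ranks than available slots unless
# --oversubscribe is passed, which matters on small CI machines
run(`mpiexec --oversubscribe -n 4 julia --project test_script.jl`)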

B2d. A subtle problem due to MPI.Init() silently changing ENV variables. In specific circumstances this causes a crash: 1. the parent process is using a system library; 2. the parent process' tests first call local pigeons; 3. that in turn called mpi_active(), which internally used MPI.Init() to see if Comm_size > 1; 4. that had the side effect of changing ENV; 5. then, when calling the pigeons MPI tests, these ENV variables were passed to ChildProcess, preventing mpiexec from starting and resulting in a cryptic pipeline_error message without any details on the chain of events 1-5. The new approach calls MPI.Init() only when running in the context of MPI.
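
A hypothetical sketch of the "only initialize MPI when actually launched under MPI" idea (this is not the actual Pigeons.jl code; the environment variables checked are ones commonly set by Open MPI and MPICH/Hydra launchers):

using MPI

# detect whether this process was started by an MPI launcher
launched_by_mpi() =
    haskey(ENV, "OMPI_COMM_WORLD_SIZE") ||  # Open MPI
    haskey(ENV, "PMI_SIZE")                 # MPICH / Hydra

# initialize MPI only in that case, so a plain local run never touches ENV
function maybe_init_mpi()
    if launched_by_mpi() && !MPI.Initialized()
        MPI.Init()
    end
end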

alexandrebouchard (Member, Author) commented Mar 6, 2023

But one of the crashes "reproduces" the segfault, so there seem to be at least two issues (not suggesting they are related):

  1. segfault with current main suspects: MPI.jl, libraries MPI.jl calls, or Julia
  2. some non-determinism in the build process, main suspect: Turing.jl

alexandrebouchard (Member, Author) commented:

  • does not seem to be due to the :funneled threadlevel argument
  • does not seem to be due to a faulty tag upper bound (MPI_TAG_UB)

alexandrebouchard (Member, Author) commented:

One hypothesis: pmodels/mpich#6432 (comment)

@@ -105,7 +105,7 @@ mpi_active() =
     Comm_size(COMM_WORLD) > 1
 end

-init_mpi() = Init() #threadlevel = :funneled)
+init_mpi() = Init(threadlevel = :funneled)
A reviewer commented:
As troubleshooting, you should try MPI_THREAD_MULTIPLE. The default is thread-single, which is nearly the same as thread-funneled. But if you have a race condition in MPI, then you need thread-multiple.
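
A hedged sketch of what this suggestion amounts to in MPI.jl (assuming the threadlevel keyword and MPI.Query_thread are available, as in recent MPI.jl versions):

using MPI
MPI.Init(threadlevel = :multiple)  # request MPI_THREAD_MULTIPLE
# check which thread level the implementation actually granted
@show MPI.Query_thread()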

alexandrebouchard (Member, Author) replied:
Thank you for the suggestion! Unfortunately, the segfault still arises with threadlevel = :multiple. On the positive side, I am now able to reproduce the problem locally.

alexandrebouchard (Member, Author) commented:

Can reproduce the crash locally now with e.g.:

using Pigeons
pigeons(target = Pigeons.TestSwapper(0.5), n_rounds = 12, n_chains = 200, on = ChildProcess(n_local_mpi_processes = 4))

miguelbiron (Collaborator) commented:

I also get segfault locally with the example above.

julia> versioninfo()
Julia Version 1.8.5
Commit 17cfb8e (2023-01-08 06:45 UTC)
Platform Info:
  OS: Linux (x86_64-linux-gnu)
  CPU: 8 × Intel(R) Core(TM) i7-6820HQ CPU @ 2.70GHz
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-13.0.1 (ORCJIT, skylake)
  Threads: 1 on 8 virtual cores

alexandrebouchard (Member, Author) commented:

Created branch https://github.com/Julia-Tempering/Pigeons.jl/tree/investigate-segfault-openmpi-hack to run local tests on OpenMPI. Instructions: check out that branch, then, in a Julia session:

using MPIPreferences; MPIPreferences.use_jll_binary("OpenMPI_jll")
exit()
julia
...

alexandrebouchard (Member, Author) commented:

The problem does not arise with Open MPI, confirming this is likely MPICH-related. (With Open MPI, an unrelated warning message pops up but does not cause a crash; it is documented in open-mpi/ompi#7393 and the fix discussed there removes the warning message.)

alexandrebouchard (Member, Author) commented Mar 7, 2023

The next step on this will be to build a local MPICH with debug symbols enabled (see JuliaParallel/MPI.jl#720, which also provides a nice template for testing several MPI systems in CI).

We can follow the Yggdrasil script to help with that and also to check whether the problem also arises with ch4: https://github.com/JuliaPackaging/Yggdrasil/blob/master/M/MPICH/build_tarballs.jl#L74

@alexandrebouchard alexandrebouchard changed the title Keeping only the suspected faulty test (by commenting out rest (!)) Segfault fix Mar 8, 2023
@alexandrebouchard alexandrebouchard marked this pull request as ready for review March 9, 2023 00:39
Files with review comments:
  • .github/workflows/CI.yml
  • src/Pigeons.jl
  • src/submission/ChildProcess.jl
  • src/submission/MPI.jl
  • src/mpi_utils/Entangler.jl
  • test/runtests.jl
  • test/turing.jl
miguelbiron and others added 6 commits March 11, 2023 10:54
Before this, we had crashes such as in
18ef655

Here is what was happening in these crashes:
1. the parent process is using a system library
2. the parent process' tests first call local pigeons
3. that in turn called mpi_active(), which internally used MPI.Init()
   to see if Comm_size > 1
4. that had the side effect of changing ENV
5. then, when calling the pigeons MPI tests, these ENV variables were
   passed to ChildProcess, causing problems

The new approach avoids calling MPI.Init() frivolously.