WIP, doesn't even compile #4201

durban · 2024-12-15T13:22:22Z

This is on top of @djspiewak's wip/multithreaded-wstp branch. I did nothing so far, except default to SleepSystem in 489fbb9. This makes it possible to run IOSpec on scala-native; it (sometimes) passes on my machine ;-). It's probably too early to have this as a PR, but I'm doing it anyway to avoid duplicating work, and maybe to have a discussion about EpollSystem (see below). (@djspiewak feel free to close this PR if you have other plans with your branch.)

durban · 2024-12-15T13:32:00Z

So, I'm running this on Linux, and without 489fbb9 it tries to use EpollSystem. But starting the tests (e.g., testsNative/testOnly cats.effect.IOSpec) only leads to a hanging process. It seems to me that the WSTP threads are waiting in epoll_wait, and a (the?) GC thread seems to be waiting for the mutator threads to reach a safepoint(?). (Details: the GC is in thread_yield, called by Synchronizer_acquire.) I'm speculating here, but maybe Thread.sleep() does "something" before sleeping which EpollSystem doesn't do before calling epoll_wait?

EDIT: what I wrote above is probably completely wrong... 🤷‍♂️

djspiewak · 2024-12-15T15:45:20Z

This is great! Thank you for pushing this forward to the next obstacle.

One of the things that occurred to me as I poked at my branch originally is SN's tooling for introspecting thread state is really really limited so far as I understand it. Maybe this is just my ignorance and LLVM has some magic we could turn on, but I strongly suspect we're going to need better introspection to run down some of these problems.

What I'm thinking is we're probably going to end up building that, or at least leaning in heavily to do so, and that's probably a large part of what we'll need to do to get this off the ground. We should chat with the SN folks.

armanbilge · 2024-12-15T21:15:14Z

The reason it's hanging is because we haven't implemented interruption yet for the Native I/O-polling systems. This wasn't necessary when it was single-threaded, but now it's critical :)

cats-effect/core/native/src/main/scala/cats/effect/unsafe/EpollSystem.scala

Line 67 in 489fbb9

def interrupt(targetThread: Thread, targetPoller: Poller): Unit = ()

Compare with:

cats-effect/core/jvm/src/main/scala/cats/effect/unsafe/SelectorSystem.scala

Lines 97 to 100 in 9ce05f2

    
    def interrupt(targetThread: Thread, targetPoller: Poller): Unit = {
      targetPoller.selector.wakeup()
      ()
    }
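To make the JVM side concrete, here is a minimal standalone sketch (not cats-effect code; `WakeupDemo` and `blockThenWake` are hypothetical names) of what `Selector.wakeup()` does for a thread parked in `select()`. It assumes a plain JVM and uses only `java.nio.channels.Selector`:

```scala
import java.nio.channels.Selector
import java.util.concurrent.{CountDownLatch, TimeUnit}

object WakeupDemo {
  // Returns true if the polling thread left select() after wakeup() was called.
  def blockThenWake(): Boolean = {
    val selector = Selector.open()
    val returned = new CountDownLatch(1)

    // The "worker": no channels are registered, so select() can only
    // return because of wakeup() (or a thread interrupt).
    val task: Runnable = () => {
      selector.select()
      returned.countDown()
    }
    val poller = new Thread(task)
    poller.setDaemon(true)
    poller.start()

    // The "interrupt": if no select() is in progress yet, the next one
    // returns immediately, so there is no lost-wakeup race here.
    selector.wakeup()

    val woke = returned.await(5, TimeUnit.SECONDS)
    selector.close()
    woke
  }
}
```

An empty `interrupt`, as in the EpollSystem line quoted above, has no equivalent of that `wakeup()` call, so a worker parked in `epoll_wait` has nothing to wake it.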

armanbilge · 2024-12-15T21:19:00Z

Oh, the other reason it may be hanging is indeed related to GC. On Scala Native, blocking native calls need to be annotated explicitly with the @blocking annotation, so that the runtime does the necessary bookkeeping to allow GC while a thread is stuck in that blocking call.

https://github.com/scala-native/scala-native/blob/c7b54a18e3ff11d8b2792f16fbb6e97780314014/nativelib/src/main/scala/scala/scalanative/unsafe/package.scala#L103-L106

For now it's fine to just mark it @blocking, but because this comes at a performance cost, we should actually make two separate epoll_wait bindings. One will use @blocking for when the timeout is > 0, and the other will not, for when the timeout == 0.
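A rough sketch of what the two bindings could look like on Scala Native 0.5. This is an assumption-laden illustration, not the code in this PR: `EpollBindings`, `EpollPoll`, and the method names are hypothetical, and `events` is typed as `Ptr[Byte]` only to keep the sketch independent of the `epoll_event` definition. It needs the Scala Native toolchain on Linux to build.

```scala
import scala.scalanative.unsafe._

// Hypothetical binding object: the same C symbol is bound twice.
@extern
object EpollBindings {
  // @blocking asks Scala Native to do the GC bookkeeping around the call,
  // so other threads can collect while this one is parked in the kernel.
  @blocking
  @name("epoll_wait")
  def epollWaitBlocking(epfd: CInt, events: Ptr[Byte], maxEvents: CInt, timeout: CInt): CInt =
    extern

  // No annotation: cheaper, but only appropriate when the call cannot block.
  @name("epoll_wait")
  def epollWaitImmediate(epfd: CInt, events: Ptr[Byte], maxEvents: CInt, timeout: CInt): CInt =
    extern
}

object EpollPoll {
  // Choose the binding from the timeout. A timeout of -1 (wait forever)
  // also blocks, so only an exact 0 takes the un-annotated path.
  def poll(epfd: CInt, events: Ptr[Byte], maxEvents: CInt, timeoutMillis: CInt): CInt =
    if (timeoutMillis == 0) EpollBindings.epollWaitImmediate(epfd, events, maxEvents, 0)
    else EpollBindings.epollWaitBlocking(epfd, events, maxEvents, timeoutMillis)
}
```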

djspiewak · 2024-12-16T15:20:53Z

> The reason it's hanging is because we haven't implemented interruption yet for the Native I/O-polling systems. This wasn't necessary when it was single-threaded, but now it's critical :)

You know, I didn't even think about this. Makes loads of sense though. Pipes time!
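For readers unfamiliar with the pipe approach: the idea is to register the read end of a pipe with the poller and write a byte to the write end whenever a blocked poller has to be woken. The sketch below shows the pattern on the JVM with `java.nio`, purely as an illustration; `SelfPipeDemo` and `interruptViaPipe` are hypothetical names, and the Native implementation would use epoll/kqueue and raw file descriptors instead.

```scala
import java.nio.ByteBuffer
import java.nio.channels.{Pipe, SelectionKey, Selector}

object SelfPipeDemo {
  // Returns the number of ready keys seen by the poller when it woke up.
  def interruptViaPipe(): Int = {
    val selector = Selector.open()
    val pipe = Pipe.open()
    try {
      // The read end is registered with the poller like any other descriptor.
      pipe.source().configureBlocking(false)
      pipe.source().register(selector, SelectionKey.OP_READ)

      // The "interrupt": writing one byte makes the read end readable.
      val interrupter: Runnable = () => {
        pipe.sink().write(ByteBuffer.wrap(Array[Byte](1)))
        ()
      }
      val t = new Thread(interrupter)
      t.start()

      // Blocks until the byte arrives (5s safety timeout for the demo).
      val ready = selector.select(5000L)

      // Drain the pipe so the next poll does not wake up spuriously.
      val buf = ByteBuffer.allocate(8)
      while (pipe.source().read(buf) > 0) buf.clear()

      t.join()
      ready
    } finally {
      pipe.sink().close()
      pipe.source().close()
      selector.close()
    }
  }
}
```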

durban · 2024-12-20T14:40:40Z

@armanbilge Thanks, I've tried to do the 2 things you mentioned. In 62b8141 I turned on the EpollSystem again, and tried implementing interrupt, and added the scala-native blocking annotation. This way testsNative/testOnly cats.effect.IOSpec passes on my machine. It obviously needs more work (e.g., I think interrupt is not threadsafe), but at least it's a step in the right direction.

armanbilge · 2024-12-20T15:52:15Z

core/native/src/main/scala/cats/effect/unsafe/EpollSystem.scala

+      // TODO: this is not threadsafe, we're reading `interruptFd` without synchronization:
+      if (unistd.write(this.interruptFd, buf, 8.toCSize) == -1) {


It doesn't need to be, it will be synchronized by workerThreadPublisher at its callsites.

cats-effect/core/jvm/src/main/scala/cats/effect/unsafe/WorkStealingThreadPool.scala

Lines 324 to 326 in 8d92651

    workerThreadPublisher.get()
    val worker = workerThreads(index)
    system.interrupt(worker, pollers(index))
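The general pattern being relied on is safe publication through a volatile variable: a plain field written before a volatile write is visible to any thread that subsequently observes that write. A minimal JVM sketch, assuming nothing about the actual pool internals (`PublishedFd`, `init`, and `awaitFd` are hypothetical names, and the spin loop exists only to make the demo deterministic):

```scala
import java.util.concurrent.atomic.AtomicBoolean

final class PublishedFd {
  // Plain, non-volatile field, standing in for something like `interruptFd`.
  private var fd: Int = -1
  private val publisher = new AtomicBoolean(false)

  // Writer side: set the plain field first, then do the volatile write.
  def init(value: Int): Unit = {
    fd = value
    publisher.set(true)
  }

  // Reader side: once the volatile read observes `true`, the earlier
  // plain write to `fd` is guaranteed to be visible as well.
  def awaitFd(): Int = {
    while (!publisher.get()) ()
    fd
  }
}
```

Usage: one thread calls `init(42)`; any thread that then returns from `awaitFd()` sees `42`, even though `fd` itself is never accessed with synchronization.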

djspiewak · 2024-12-26T22:35:39Z

Interesting. So I merged your branch with series/3.x and now I'm getting the following:

[error] Unknown DWARF abbrev code: 26
[error] 
[error] STACKTRACE
[error] 
[error] java.lang.RuntimeException: Unknown DWARF abbrev code: 26
[error] 
[error] 
[error] This looks like a specs2 exception...
[error] Please report it with the preceding stacktrace at http://github.com/etorreborre/specs2/issues
[error]  
[error] Error: Total 1, Failed 0, Errors 1, Passed 0
[error] Error during tests:
[error] 	cats.effect.IOSpec

Edit: Appears to be a macOS only thing. Compiles and runs on Linux. Lovely.

djspiewak · 2024-12-27T15:33:37Z

Okay got around the issue with Lorenzo's help. It's fixed in SN main, so I updated to a local snapshot (lol) on my branch and made progress. I'll dig into interruption for kqueue

djspiewak · 2024-12-27T22:17:03Z

Update:

We no longer need the snapshot. I've pushed the magic compiler settings incantation on my branch
Also pushed is an initial stab at using EVFILT_USER to handle kqueue interrupts. I implemented it by basically putting that event permanently into the front of the changes array and then triggering it on the kqfd on interrupt(). In principle, this makes sense, but it doesn't actually work. The threads wake up but things spin forever. I'm probably being dumb. Have fun.

Will get back to this later.

djspiewak · 2024-12-28T03:17:29Z

I pushed more. Kqueue is pretty close to working I think.

djspiewak · 2024-12-28T19:19:12Z

Update: I've got testsNative/test almost entirely green on macOS.

durban · 2024-12-28T19:28:42Z

@djspiewak Yeah, I don't know anything about macOS; I can't even help with testing it (I have no access to such a system). Anyway, I've merged your branch into this PR, so that your progress is visible here too.

djspiewak · 2024-12-28T19:29:40Z

I think this PR will become the PR, more than likely

djspiewak · 2024-12-28T19:30:42Z

Oh, if you want to just focus on Linux + epoll, that works perfectly. There's also more stuff to go through and pull out of jvm-specific areas and into jvm-native areas (e.g. the IOPlatformSpecs, which are all intermixed and weird, or interruptibleImpl, etc). Lots of stuff strewn around.

durban · 2024-12-28T19:50:52Z

Okay, I'll try to look at one of those when I have some time. (Btw, I think you can push directly to this PR; at least I've checked that GitHub checkbox.)

djspiewak · 2024-12-28T19:58:37Z

Alrighty, I'm going to deprecate my fork's branch then and we can centralize on yours.

djspiewak · 2024-12-28T19:58:59Z

Oh btw as a planning process note… Once we can get something which is mergeable, we should do that, close out this PR, and use normal PR process from that point forward

djspiewak · 2024-12-28T21:01:24Z

Btw I can't actually push to your branch. Pushing to mine for now.

djspiewak · 2024-12-28T22:21:55Z

Status report:

I think I've got all (but one) of the tests working and all the functionality is shifted over aside from interruptible-related things and IOAppSpec (this is worth checking though just in case I missed something). The latter is dependent on the sbt yak shave (the BuildInfo stuff). The interruptible stuff just needs to be stubbed around basically since we don't have the right exception types in SN; fairly trivial. The one failing test is "handle lots of simultaneous events" in FileDescriptorPollerSpec, which is annoying since it probably actually means something. Everything else is passing.

I took the time to factor out kevent64 into blocking and non-blocking variants. The pattern works pretty well so we should probably do something similar for epoll.

This still doesn't compile on Scala 3 because of the epoll_event tag shenanigans. That's going to take some more compiler sweet talking.

One thing that has come out as part of this is our conditional source matrix is getting pretty hairy. It might be worth taking a pass over things and updating our encodings to be what we want them to be (e.g. we have Platform traits, but some of them are cross-platform, and we also have the isJVM/isNative/etc checks, and sometimes we mix them all together for maximal confusion).

Anyway, apart from the FD weirdness, I think kqueue is pretty much ready to go. It could be optimized more but I'm not worried about performance yet. We're very close on this branch as a whole, tbh.

Edit: I've confirmed that Linux and macOS are now basically in the same state, which is reassuring. I actually wonder if something might be odd about that particular test rather than the underlying polling system or WSTP?

durban · 2024-12-29T10:49:40Z

> Btw I can't actually push to your branch

Huh, that's strange. I double checked, and the "allow edits by maintainers" is selected. I don't know what's going on. Anyway, I'm sure we'll be able to solve it with some combination of merge and push...

armanbilge · 2024-12-29T18:00:50Z

> I've confirmed that Linux and macOS are now basically in the same state,

This is not my observation.

On Linux, the entire FileDescriptorPollerSpec is broken. There seems to be a segfault when registering the pipes with the epoll.
On macOS, only the "handle lots of simultaneous events" is failing. It can segfault, or for smaller numbers simply hang and timeout.

djspiewak · 2024-12-29T23:11:30Z

> There seems to be a segfault when registering the pipes with the epoll

Do segfaults manifest as fatal errors in sbt? If so then that's exactly what I'm experiencing, not a failing test.

djspiewak · 2024-12-29T23:15:07Z

Oh I see what you mean. Hmm. I must have just gotten a segfault in both and assumed they were the same issue. Unfortunate.

durban · 2024-12-31T10:29:50Z

I think I've fixed the segfault in FileDescriptorPollerSpec for EpollSystem (see f4bc66a). I think the stackallocs were somehow wrong: type epoll_event <: AnyRef makes scala-native think it's 16 bytes. While there is a Tag[epoll_event] with the correct size, stackalloc doesn't seem to use it(?). If I use the correct size manually, the segfault disappears. (I'm sure there is a nicer solution...)

durban · 2024-12-31T11:00:38Z

Okay, so it seems that in SN 0.4 stackalloc takes an implicit Tag; probably that's why this used to work. In 0.5 it doesn't; but then how does it know the size? 🤷‍♂️

djspiewak and others added 5 commits December 8, 2024 09:56

Got the ball rolling on SN 0.5

caaac9b

Fixed more weird 0.4 -> 0.5 things

cc600ac

Got WSTP-related stuffs compiling on native

d7306c7

Reimplemented parts of AtomicIntegerFieldUpdater; now it's all linking

426776b

Use SleepSystem by default on native

489fbb9

djspiewak mentioned this pull request Dec 18, 2024

Release for Scala Native 0.5.x #4076

Open

durban added 2 commits December 20, 2024 15:36

Reënable EpollSystem, implement interrupt for it, mark epoll_wait as SN blocking

62b8141

More helpful TestTimeoutException

7f0a559

armanbilge reviewed Dec 20, 2024

Merge branch 'series/3.x' into wip/multithreaded-wstp

8ebf321

djspiewak added 2 commits December 27, 2024 11:46

Adjusted DWARF version to avoid issues with 0.5.6 on macOS

afba12f

Started noodling with kqueue interrupts

3a88218

Rewrote most of the kqueue stuff to be simpler

e01f25b

djspiewak added 7 commits December 28, 2024 10:19

Forgot to bump the base version

06b0a65

Enabled concurrent Ref on native

ca7c89d

Make kqueue compatible with parallel GC on SN

081bc56

Enabled higher iterations from ContSpec on native

b0caa11

Made Deferred parallelism specs common across JVM and native

6e75e13

Shifted JVM IO functionality to share with native

65fd3e2

Made Dispatcher functionality common across JVM and native

e818c50

djspiewak added 3 commits December 28, 2024 12:43

Enabled higher parallelism on native queue specs

dfc9a17

Skip Dispatcher interruption spec for the time being

eab5150

Generalized high precision native nowMicros

22110b6

djspiewak added 3 commits December 28, 2024 14:04

Restored syscall-reducing optimization in kqueue implementation

f86ed07

Swapped out LongMap for TrieMap for callbacks in kqueue

4ec0a9a

prePR

47befdd

djspiewak added 4 commits December 28, 2024 15:04

Shifted JVM-specific MapRef support to share with native

5804694

A bit of yak shaving for scala 3 and unused warnings

72c7098

Factored non-JVM-specific highly concurrent IO specs out to be shared with native

bb84b42

Factored kevent64 out into blocking and non-blocking variants (also scalafmt)

010a820

Merge branch 'series/3.x' into wip/multithreaded-wstp

0b545af

EpollSystem: fix stackallocs; use TrieMap

f4bc66a
