perf(query-core): Improve mutationCache implementation performance #8451

gnoff · 2024-12-17T23:04:21Z

This work is stacked on #8450

In the prior change I modeled unscoped mutations as a separate data store in a way that would speed up methods operating on the internal state of the cache when using unscoped mutations. In this change I further refatored the internal implementation to make removals faster (O(1) generally) and to make the aggregate APIs like getAll and find faster by avoiding multiple concatenations and flattening.

Implementation Details

Scopes are more like scheduling metadata than an intrinsic structure to the set of mutations. In this change I updated the storage format of mutations so that they can be cleared more efficiently. Using a Set allows us to have O(1) removals. We can also construct the Array for getAll() more efficiently by not having to flatten the inner scope arrays. Additionally the scope array is now mutated both on add and remove to avoid allocating extra arrays and doing additional copies.

I rewrote clear to not use this.remove which avoids many intermediate operations against the underlyign data structures.

I considered using an Array again for the primary storage format and this may actually be better if the expected number of mutations isn't excessively large but since I don't know enough about mutation usage I stuck with a Set for the better deletion performance at larger mutation size.

The mutationCache implementation models all mutations as scoped even when the scope is implicit. With implicit scopes there can only ever be one mutation per scope so the overhead of tracking the scope is unecessary. Additionally because scopes need a unique value the current implementation bootstraps this value using a random source (Date.now()). While this is practically likely to be fine it is not impossible that there is a scope collision with mutation hydration. Modeling the internal state of the cache without deriving virtual scopes simplifies the implementation in memory and code and avoids the possibily of scope collision by design. This should provide a modest perf bump for mutations that do not use scopes and be neutral for mutations that do use scopes.

…rformance by modeling mutations in a single set Scopes are more like scheduling metadata than an intrinsic structure to the set of mutations. In this change I updated the storage format of mutations so that they can be cleared more efficiently. Using a Set allows us to have O(1) removals. We can also construct the Array for getAll() more efficiently by not having to flatten the inner scope arrays. Additionally the scope array is now mutated both on add and remove to avoid allocating extra arrays and doing additional copies. I rewrote clear to not use this.remove which avoids many intermediate operations against the underlyign data structures. I considered using an Array again for the primary storage format and this may actually be better if the expected number of mutations isn't excessively large but since I don't know enough about mutation usage I stuck with a Set for the better deletion performance at larger mutation size.

gnoff added 2 commits December 17, 2024 13:06

github-actions bot added the package: query-core label Dec 17, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

perf(query-core): Improve mutationCache implementation performance #8451

perf(query-core): Improve mutationCache implementation performance #8451

gnoff commented Dec 17, 2024

perf(query-core): Improve mutationCache implementation performance #8451

Are you sure you want to change the base?

perf(query-core): Improve mutationCache implementation performance #8451

Conversation

gnoff commented Dec 17, 2024

Implementation Details