Compressor Optimizer #367
Conversation
Codecov Report: All modified and coverable lines are covered by tests ✅

Additional details and impacted files:

@@            Coverage Diff             @@
##               main     #367      +/-  ##
============================================
+ Coverage     68.22%   70.40%    +2.18%
+ Complexity     1128     1063       -65
============================================
  Files           319      305       -14
  Lines         12795    12186      -609
  Branches       1277     1165      -112
============================================
- Hits           8729     8580      -149
+ Misses         3540     3121      -419
+ Partials        526      485       -41

This pull request uses carry forward flags.
So to sum up: this is a non-breaking change; we could compress better (by ~7%) if we re-compressed the full blob each time we attempt to append a block, but doing so is too slow. So you propose to keep the original method, which would give a "preliminary result", and change the result behind the scenes between calls? Wouldn't this have some side effects @jpnovais? (i.e. the compressor could say "we compressed block 1, it takes N bytes, the current blob is now at 100kB", then recompress it with more context and update the internal state to "current blob is 98kB" without notifying the coordinator.)

Re implementation: before introducing an async optimizer, I'd prefer to understand the perf constraints better, i.e. how long it takes now, how long it would take if we recompressed the whole blob at each append, and within what limit we need to operate. If we say the compressor could take as much as XXXms, then we may just want to have simpler, cleaner code and kill this async pattern.
My input on this optimization is: Context:
My take based on the above:
Yep, that would use less CPU if we only do it when the blob is full. But that last call to "CanWrite" may be 10x slower than the previous calls.
It's ok to have a call to CanWrite that takes 500ms at the end of the blob, as long as the preceding calls are not affected timewise.
I agree, doing it synchronously at the end is a good idea. In fact it's similar to the "no compress" logic we already have.
LGTM 👍 I would probably just add one or two tests to ensure this is correctly triggered and that the internal state is correctly reset afterwards.
Signed-off-by: Arya Tabaie <[email protected]>
What does this mean, "based on how much time the compressor has had to optimize", from the coordinator's PoV?
This was for the parallel optimizer, so it no longer applies. Removing it from the description.
@jpnovais Does it look good now?
@Tabaie sure. Sorry for the delay and for not getting back to you on the 1st comment.
Seems good to me. Before merging, can you confirm that by referencing the same buffer when recompressing we don't get internal overwrites of bytes.Buffer which could cause issues?
Also some comments about simplifying error handling.
That said - imo the current approach may be error-prone when wanting to make changes in the future. It heavily relies on reverting the compressor in case the block doesn't fit (or when we only run CanWrite), and the state logic is imo complex enough that it is difficult to follow and test.
What I would recommend is to add a method to clone the compressor using a pool and then perform operations on the cloned compressor. If the write succeeds then we update the reference in BlobMaker; otherwise we discard the cloned compressor (by resetting it and putting it back into the pool). Internally it would imo also require using a pool of bytes.Buffer.
Imo trying to optimize for memory here doesn't make a lot of sense, as the blocks/blobs are in general small (imo a blob is at most 2MB, so at most 10MB of uncompressed data assuming 5x compression) relative to the rest of the memory usage.
From my side this is not a blocker for merging the PR for now, but it would definitely simplify the implementation and guard against bugs in the future.
This PR implements issue #366.
The compressor now attempts to recompress everything from scratch upon encountering a full blob, before attempting to bypass compression altogether.