i#7113 decode cache: Add analyzer library for decode_cache_t #7114

abhinav92003 · 2024-12-09T21:54:14Z

Adds a new drmemtrace_decode_cache library to cache information about decoded instructions using decode_cache_t. This can be used by analysis tools that need to decode the instr encodings in the trace, to avoid overhead of redundant decodings which can get expensive.

The library allows the tools to specify what information they need to cache. Also, it uses instr_noalloc_t when possible to reduce heap usage and allocation/deallocation overhead.

If the trace does not include embedded encodings or if the user wants to get encodings from the app binaries using module_mapper_t instead, they can provide the module file path to the init API on the decode_cache_t object. decode_cache_t keeps a single initialized module_mapper_t at any time, which is shared between all decode_cache_t objects (even the ones of different template types); this is done by tracking the count of active objects using the module mapper.

decode_cache_t provides the clear_cache() API which can be used in parallel_shard_exit() to keep memory consumption in check by free-ing up cached decoding info that may not be needed for result computation in later print_results() which has to wait until all shards are done.

Refactors the invariant checker and opcode mix tools to use this library.

Modifies add_encodings_to_memrefs to support a mode where encodings are not set in the generated test memref but only the instr addr and size fields are set.

Makes the opcode cache in opcode_mix_t per-shard instead of per-worker. Decodings must not be cached per-worker as that may cause stale encodings for non-first shards processed by the worker. This means the worker init and worker exit APIs can be removed now from opcode_mix_t.

Adds decode_cache_test and opcode_mix_test unit tests that verify operation of the decode_cache_t.

Issue: #7113

Adds a new library to cache information about decoded instructions. This can be used by analysis tools that need to decode the instr encodings in the trace. The library allows the tools to specify what information they need to cache. Refactors the invariant checker tool to use this library. Issue: #7113

clients/drcachesim/tools/instr_decode_cache.cpp

clients/drcachesim/tests/instr_decode_cache_test.cpp

clients/drcachesim/tools/instr_decode_cache.h

clients/drcachesim/tools/invariant_checker.cpp

clients/drcachesim/tools/invariant_checker.h

clients/drcachesim/tools/instr_decode_cache.h

abhinav92003 · 2024-12-11T02:37:40Z

Decided to try out an alternate way to support module-mapper-decoding in instr_decode_cache_t that came out of offline discussion. Okay to hold off on the re-review until then (Cannot undo re-request review)

derekbruening

Blank review to reset the requested review state.

…upport to instr_decode_cache_t

abhinav92003 · 2024-12-17T05:15:21Z

Almost all concerns from the review and offline discussions have been addressed, so this is ready for a re-review. Added a TODO for a couple items: dr_set_isa_mode for regdeps, and the Windows test-only i#5960. PTAL.

derekbruening

This is a large diff. Did not make it through all the files yet but sending comments so far. May be delayed on finishing as have other work to get to.

clients/drcachesim/CMakeLists.txt

clients/drcachesim/tests/decode_cache_test.cpp

clients/drcachesim/tools/common/decode_cache.h

clients/drcachesim/tools/view.cpp

clients/drcachesim/tools/common/decode_cache.h

derekbruening · 2024-12-17T20:32:24Z

clients/drcachesim/tools/common/decode_cache.h

+ *  An \p decode_cache_t for testing which uses a \p test_module_mapper_t.
+ */
+template <class DecodeInfo>
+class test_decode_cache_t : public decode_cache_t<DecodeInfo> {


Does this need to be in this header? Shouldn't this be in the test file?

Seemed convenient to have the test class available at a common location. But sure, it'll be better to find a new test-only common location.

Actually test_module_mapper_t also lives in raw2trace_shared.h. Do we want to update all such test classes to be in their own separate header?

clients/drcachesim/tools/common/decode_cache.h

derekbruening

Rest of files. Maybe main points are that optimizations in opcode_mix (worker instead of shard data; further caching) are being thrown out: seems like we want to keep at least the worker data.

derekbruening · 2024-12-17T23:05:49Z

clients/drcachesim/tests/memref_gen.h

-            pair.memref.instr.encoding_is_new = true;
+            if (!set_only_instr_addr) {
+                memcpy(pair.memref.instr.encoding, &decode_buf[offset], instr_size);
+                pair.memref.instr.encoding_is_new = true;


Is your code assuming encoding_is_new is always initialized? E.g., "if (!use_module_mapper_ && memref_instr.encoding_is_new) {" in decode_cache.h?

In that condition, encoding_is_new will be read only if the user didn't ask to use the module mapper, in which case the init() time check will ensure the trace filetype supports embedded-encodings, in which case encoding_is_new will be set.

clients/drcachesim/tools/opcode_mix.cpp

clients/drcachesim/tools/common/decode_cache.h

clients/drcachesim/tools/opcode_mix.cpp

clients/drcachesim/tools/opcode_mix.h

clients/drcachesim/tools/opcode_mix.cpp

derekbruening · 2024-12-17T23:25:59Z

clients/drcachesim/tools/view.cpp

+    // XXX: We could potentially use instr_decode_cache_t here (i#7113) and avoid the
+    // repeated instr decoding logic. However, we want to preserve the legacy view
+    // tool output format which uses disassemble_to_buffer, and disassemble_to_buffer
+    // does the decoding on its own internally, while adding a bunch of non-trival


Seems fine to update the view tool separately (this PR is already large), so long as we know it won't require interface changes to this library. If it used instr_disassemle(), and the library sets ISA regdeps, will it all work out?

abhinav92003 · 2024-12-22T20:11:34Z

Location for test_decode_cache_t and providing raw instr encoding for view_t need more discussion; resolved other threads.

Ready for re-review. Though no rush because of the holidays.

abhinav92003 added 7 commits December 9, 2024 16:53

Docx improvement, and handle regdeps branch_target case.

abebffc

Use instr_noalloc_t where possible.

18f7028

Remove redundant test.

4487168

move impl to cpp

41595eb

Move impl to cpp

d2e94c7

Cleanup and aarch64 mov fix.

f0f8a74

abhinav92003 changed the title ~~i#7113: Add library to cache information about decoded instructions~~ i#7113: Add analyzer library to cache instr decode info Dec 10, 2024

Fix windows bug

a1b1d63

abhinav92003 requested a review from derekbruening December 10, 2024 03:05

derekbruening reviewed Dec 10, 2024

View reviewed changes

abhinav92003 added 2 commits December 10, 2024 16:43

Reviewer suggested changes

db8a3ad

Cleanup

1e810b5

abhinav92003 requested a review from derekbruening December 11, 2024 02:13

derekbruening reviewed Dec 11, 2024

View reviewed changes

abhinav92003 mentioned this pull request Dec 13, 2024

i#7113 decode cache: move module read into raw2trace_shared #7124

Merged

abhinav92003 added 12 commits December 14, 2024 00:15

Merge branch 'master' into i7113-decode-cache-lib

1fc4c04

Merge branch 'master' into i7113-decode-cache-lib

bf76f70

Add instr_decode_cache_t support to opcode_mix; add module_mapper_t s…

45e062f

…upport to instr_decode_cache_t

Drop instr_ from instr_decode_cache

5e28112

Handle missing use_module_mapper case

0a33a51

Fix clang-format

29d10a3

Make add_decode_info simpler and fix build error

0e2df67

Cleanup

716a0ea

Proactive destruction of module mapper

fefe38b

Remove stale file

2f0a708

Move impl to cpp

84a2039

Fix when we use module mapper in opcode mix

141e3c5

abhinav92003 changed the title ~~i#7113: Add analyzer library to cache instr decode info~~ i#7113 decode cache: Add analyzer library for decode_cache_t Dec 16, 2024

abhinav92003 added 14 commits December 16, 2024 11:22

Use filetype instead of encoding_is_new

b2ba91c

Cleanup

d70e227

Add tmate to windows test

d51d823

Remove test filter

0000c5b

Add missing standalone_init

3737ec5

Add tmate again

652bab0

Remove drmemtrace_static from test deps

1177304

Keep obj count tracking for tests

96efb50

Keep only one bool for use_module_mapper

31e1eab

Convert to doc comment

3702f29

Add tmate... again

d7a4d10

Disable module mapper tests on Windows due to i#5960

57f34ad

Remove tmate

3f9cc4e

Add TODO for some future items

44971f6

abhinav92003 requested a review from derekbruening December 17, 2024 05:15

More apt function visibility

3042d6b

derekbruening reviewed Dec 17, 2024

View reviewed changes

abhinav92003 added 9 commits December 19, 2024 10:42

Merge branch 'master' into i7113-decode-cache-lib

2b301b5

Reviewer suggested changes

4157f23

Add clear_cache API for parallel_shard_exit

0f50370

Add optimization to avoid repeated module map lookups

8dc950c

Remove common-case opt. Need add_decode_info for new encodings

160e052

Optimize lookups into the cache

74da310

Skip re-decoding on invalid cached decode info. It's redundant.

6bcc33b

Cleanup

b24d79a

Avoid DecodeInfo object construction when not needed.

e175cd9

abhinav92003 requested a review from derekbruening December 22, 2024 20:11

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

i#7113 decode cache: Add analyzer library for decode_cache_t #7114

i#7113 decode cache: Add analyzer library for decode_cache_t #7114

abhinav92003 commented Dec 9, 2024 •

edited

Loading

abhinav92003 commented Dec 11, 2024

derekbruening left a comment

abhinav92003 commented Dec 17, 2024

derekbruening left a comment

derekbruening Dec 17, 2024

abhinav92003 Dec 17, 2024

abhinav92003 Dec 19, 2024

derekbruening left a comment

derekbruening Dec 17, 2024

abhinav92003 Dec 17, 2024

derekbruening Dec 17, 2024

abhinav92003 commented Dec 22, 2024

i#7113 decode cache: Add analyzer library for decode_cache_t #7114

Are you sure you want to change the base?

i#7113 decode cache: Add analyzer library for decode_cache_t #7114

Conversation

abhinav92003 commented Dec 9, 2024 • edited Loading

abhinav92003 commented Dec 11, 2024

derekbruening left a comment

Choose a reason for hiding this comment

abhinav92003 commented Dec 17, 2024

derekbruening left a comment

Choose a reason for hiding this comment

derekbruening Dec 17, 2024

Choose a reason for hiding this comment

abhinav92003 Dec 17, 2024

Choose a reason for hiding this comment

abhinav92003 Dec 19, 2024

Choose a reason for hiding this comment

derekbruening left a comment

Choose a reason for hiding this comment

derekbruening Dec 17, 2024

Choose a reason for hiding this comment

abhinav92003 Dec 17, 2024

Choose a reason for hiding this comment

derekbruening Dec 17, 2024

Choose a reason for hiding this comment

abhinav92003 commented Dec 22, 2024

abhinav92003 commented Dec 9, 2024 •

edited

Loading