feat(cheatcode): `startDebugTraceRecording`, `stopDebugTraceRecording`, and `getDebugTraceByIndex` for ERC4337 testing #8571

boolafish · 2024-07-31T05:53:31Z

Motivation

ERC4337 (account abstraction) places several restrictions on what a UserOperation can do (see: ERC7562). Specifically, it restricts which opcodes can be used and limits access to certain storage.

These limitations are quite implicit and might not be easily identified during contract development. We want to provide an easier way for developers and security researchers to check/test if the rules are being followed. We aim to enable these checks by running forge test after writing specific tests for that purpose.

Solution

To support this, we need new cheatcodes that record the debug traces during EVM execution, so we can see which opcodes are executed and which storage slots are accessed. With these cheatcodes, we will be able to have a helper contract that checks whether the test executions comply with ERC4337 restrictions.
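
As a rough illustration of the kind of check such a helper could run over a recorded trace, here is a minimal sketch. It assumes the trace has already been reduced to a list of opcode bytes, and the `FORBIDDEN` list is only an illustrative subset of the opcodes ERC7562 restricts, not the full rule set. The function name `forbidden_opcodes` is hypothetical.

```rust
// Opcode values from the EVM instruction set.
const ORIGIN: u8 = 0x32;
const GASPRICE: u8 = 0x3a;
const TIMESTAMP: u8 = 0x42;
const NUMBER: u8 = 0x43;

/// Illustrative subset only; the real ERC7562 rules are more detailed.
const FORBIDDEN: [u8; 4] = [ORIGIN, GASPRICE, TIMESTAMP, NUMBER];

/// Hypothetical checker: returns `(step index, opcode)` for every
/// forbidden opcode found in the recorded trace.
fn forbidden_opcodes(trace: &[u8]) -> Vec<(usize, u8)> {
    trace
        .iter()
        .copied()
        .enumerate()
        .filter(|(_, op)| FORBIDDEN.contains(op))
        .collect()
}
```

An empty result means none of the listed opcodes appeared in the recorded range.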

In this solution, we added two cheatcodes:

startDebugTraceRecording: This starts the recording of the debug trace data.
stopAndReturnDebugTraceRecording: This stops and returns the recording of the debug trace data.

Test

An ERC4337 checker tool and its tests are in the repo: https://github.com/boolafish/erc4337-checker/blob/main/test/ERC4337Checker.t.sol

DaniPopes · 2024-07-31T07:42:14Z

crates/cheatcodes/src/inspector.rs

+                current_opcode, interpreter.stack(),
+            )
+        );
+        let mem_inputs = get_memory_input_for_opcode(


This is not the right path: we already have a tracer, and we should reuse that through CheatcodesExecutor

this can likely just be:

start: enables tracer at opcode tracing level if not already enabled, saves the current trace index

stop: takes the range of traces from saved start to current and abi encodes it
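
The start/stop idea above can be sketched as follows. This is a self-contained model, not Foundry's code: `Step`, `StepRecorder`, and the flat `Vec<Step>` standing in for the tracer's recorded steps are all hypothetical names introduced for illustration.

```rust
/// Hypothetical stand-in for one recorded opcode step.
#[derive(Debug, Clone, PartialEq)]
struct Step {
    pc: usize,
    opcode: u8,
}

/// Hypothetical recorder modelling "save the start index, slice on stop".
#[derive(Default)]
struct StepRecorder {
    /// Every step the (modelled) tracer has recorded so far.
    steps: Vec<Step>,
    /// Index into `steps` saved by `start`, if recording is active.
    start: Option<usize>,
}

impl StepRecorder {
    /// Called by the (modelled) tracer for every executed opcode.
    fn on_step(&mut self, step: Step) {
        self.steps.push(step);
    }

    /// start: remember where the trace currently ends.
    fn start(&mut self) {
        self.start = Some(self.steps.len());
    }

    /// stop: return everything recorded since `start`, or `None` if
    /// recording was never started.
    fn stop(&mut self) -> Option<Vec<Step>> {
        let start = self.start.take()?;
        Some(self.steps[start..].to_vec())
    }
}
```

Because the trace is only ever appended to, the saved index stays valid until `stop` is called.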

cc @klkvr

yep, I think this could be possible by extending CheatcodesExecutor with something like this:

fn start_steps_recording(&mut self, cheats: &mut Cheatcodes);
fn get_recorded_step(&mut self, cheats: &mut Cheatcodes) -> Vec<CallTraceStep>;

where CallTraceStep comes from revm-inspectors and is recorded by altering the config of InspectorStack.inner.tracer

oooh, so if I get this comment right, the idea here is to use self.tracer instead, since it already has all the tracing needed. And it seems like with self.tracer.traces().into_nodes() I can get the Vec<CallTraceNode>, which has the CallTrace that I need.

Am I getting this right?

Also, can I assume the nodes are append-only, so I can just rely on the idx field of the CallTraceNode to determine the range according to your description above?

yep, once you alter its config it would start recording steps, and you can just collect them by going through nodes

we'd also need some flattening logic here similar to (but likely simpler as we don't need to collect DebugNodes, just a list of steps)

foundry/crates/debugger/src/node.rs

Line 37 in 26a7559

pub fn flatten_call_trace(arena: CallTraceArena, out: &mut Vec<DebugNode>) {

as I believe we want to allow recording of trace steps for subcalls
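
A simplified sketch of such flattening, under stated assumptions: `Node`, `Member`, and `flatten_steps` are hypothetical, and the arena is modelled as a plain slice in which each node records the order of its own steps and subcalls. The real revm-inspectors types carry far more data. The point is only that subcall steps end up inline, in execution order.

```rust
/// Hypothetical ordering entry: a step of this node, or a subcall.
enum Member {
    /// Index into `Node::steps`.
    Step(usize),
    /// Index into `Node::children`.
    Call(usize),
}

/// Hypothetical, stripped-down call trace node.
struct Node {
    /// Opcodes executed directly by this call.
    steps: Vec<u8>,
    /// Arena indices of the subcalls made by this call.
    children: Vec<usize>,
    /// Order in which steps and subcalls occurred.
    ordering: Vec<Member>,
}

/// Appends the steps of `arena[idx]` and all of its subcalls to `out`,
/// in execution order.
fn flatten_steps(arena: &[Node], idx: usize, out: &mut Vec<u8>) {
    let node = &arena[idx];
    for member in &node.ordering {
        match member {
            Member::Step(i) => out.push(node.steps[*i]),
            Member::Call(i) => flatten_steps(arena, node.children[*i], out),
        }
    }
}
```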

I would like to consult a bit, as I hit one problem after trying this approach:

For some background on my local env: I have a repository with forge tests that use these new cheatcodes, and I build forge locally to use in the test repository. When I try the tracer approach, it panics with the error message: more traces were filled than started. I can, however, temporarily fix this by turning on tracing from the start in the following function in crates/evm/evm/src/inspectors/stack.rs:

pub fn tracing(&mut self, mode: TraceMode) {
    if let Some(config) = mode.into_config() {
        *self.tracer.get_or_insert_with(Default::default).config_mut() = config;
    } else {
        // self.tracer = None; <-- comment out this line
        // Change: ensures that the tracing will occur on start.
        self.tracer.get_or_insert_with(Default::default);
    }
}

A sample implementation that updates the config is in the same file:

impl CheatcodesExecutor for InspectorStackInner {
    fn get_inspector<'a, DB: DatabaseExt>(
        &'a mut self,
        cheats: &'a mut Cheatcodes,
    ) -> impl InspectorExt<DB> + 'a {
        InspectorStackRefMut { cheatcodes: Some(cheats), inner: self }
    }

    // Newly added function here!
    fn start_steps_recording(&mut self, cheats: &mut Cheatcodes) {
        // Ensure the tracer exists and configure it
        let tracer = self.tracer.get_or_insert_with(Default::default);
        tracer.update_config(|_config| TracingInspectorConfig::all());
    }

    // .....skip
}

Do you have any ideas or directions for solving the "more traces were filled than started" issue? The current workaround (turning on the tracer) does not really seem like a great idea...

Still having the same issue. Currently, to work around it, I set the tracer to use the "none()" configuration on start, making it a dummy tracer instead of leaving the tracer field as None.

crates/cheatcodes/spec/src/vm.rs

zerosnacks · 2024-07-31T09:57:59Z

This sounds related to #6704, tagging it here

Would like to make sure this PR covers the design goals of #6704 as the proposed cheatcode name slightly differs

boolafish · 2024-08-23T14:46:40Z

@DaniPopes @klkvr just tagging you, as I am not sure whether re-opening a PR from WIP triggers notifications. This should be ready and addresses the previous review comments. Sorry for taking so long, due to OOO and fixing some OOM bugs on my side.

boolafish · 2024-08-26T00:10:33Z

Sorry for the failed CI, have fixed those and tested in my own repo workflow: https://github.com/boolafish/foundry/actions/runs/10535493491?pr=1

klkvr

I think we should move logic for start_steps_recording and stop_and_get_recorded_step from CheatcodesExecutor to cheatcode implementations

We've just merged #8696, which added a getter for tracing_inspector to CheatcodesExecutor and an example of how we can track step ranges.

regarding "more traces were filled than started": I think this occurs because, when tracing is disabled and vm.startDebugTraceRecording enables the tracer, the tracer will receive a call_end invocation for that cheatcode call, which will not have a node to fill. we can try working around this by creating a fake trace node in this situation. Another approach could be to always enable the tracer by default (as it's done now), though this might be expensive in some cases. wdyt @DaniPopes ?
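
To make the hypothesis above concrete, here is a toy model of the mismatch and of the fake-node workaround. It is not how revm-inspectors is implemented: `MiniTracer` and its methods are hypothetical, and `call_end` returns an error where the real tracer panics.

```rust
/// Hypothetical tracer that only tracks which call nodes are still open.
#[derive(Default)]
struct MiniTracer {
    /// Indices of nodes whose call has started but not yet ended.
    open: Vec<usize>,
    /// Total number of nodes created so far.
    nodes: usize,
}

impl MiniTracer {
    /// A call begins: create a node and mark it as open.
    fn call_start(&mut self) {
        self.open.push(self.nodes);
        self.nodes += 1;
    }

    /// A call ends: fill the most recently opened node. Fails if no node
    /// was ever started for this call.
    fn call_end(&mut self) -> Result<usize, &'static str> {
        self.open.pop().ok_or("more traces were filled than started")
    }

    /// Workaround sketch: when the tracer is switched on from inside a call
    /// whose start it never saw, create a placeholder node so the upcoming
    /// `call_end` has something to fill.
    fn enable_inside_call(&mut self) {
        self.call_start();
    }
}
```

Without the placeholder, the first `call_end` after enabling fails; with it, the cheatcode call's `call_end` fills node 0.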

crates/cheatcodes/src/evm/opcode_utils.rs

crates/cheatcodes/src/inspector.rs

crates/evm/evm/src/inspectors/stack.rs

boolafish · 2024-08-27T05:21:21Z

@klkvr thanks for the review, should have fixed all comments aside from the one re: more traces were filled than started.

we can try working around this by creating a fake trace node in this situation

Might need a bit more elaboration on this. Not exactly sure how to do this on my end. But will also wait to see if that is the preferred approach I guess.

klkvr

overall lgtm, sorry for the delay here

left nit on trace mode and question on UX of fetching steps, I'd prefer us to return an array of them if possible

cc @DaniPopes do you have an idea on how we could support this without requiring users to increase verbosity for all tests?

boolafish · 2024-09-20T04:03:10Z

thanks for the review! Have fixed for the comments.

Personally, I think it would be great to have a way to turn this on with something like a --tracer flag, or to specify the tracer config without -vvv. With the current limitation, I imagine projects interested in using this will need to isolate these tests in a separate folder and run them separately in CI, as verbose mode prints too much output.

boolafish · 2024-09-26T00:46:01Z

bounce for review again 🙏

feat: capture stack inputs as part of the opcode

feat: record opcode -> record debug trace

fix: memory OOG, need to only use needed stack, mem input

fix: missing op code, instruction results

fix: accessing out-of-bound idx memory

When running on some projects, we noticed that it sometimes tries to access memory with an out-of-bounds index and panics. This commit fixes it by:

1. Enforcing a reset to None after stopDebugTraceRecording(), which ensures the `some(..) = ...` part will not be triggered.
2. Changing how opcode_utils.rs accesses memory: it returns an empty vector when trying to access out-of-bounds memory.
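
The second fix can be illustrated with a small bounds-checked read. This is a sketch of the technique only: `read_memory` is a hypothetical helper, not the function in opcode_utils.rs.

```rust
/// Hypothetical helper: returns `len` bytes of `memory` starting at `offset`,
/// or an empty vector if any part of the range is out of bounds.
fn read_memory(memory: &[u8], offset: usize, len: usize) -> Vec<u8> {
    // `checked_add` guards against overflow for very large offsets.
    match offset.checked_add(len) {
        Some(end) if end <= memory.len() => memory[offset..end].to_vec(),
        _ => Vec::new(),
    }
}
```

Returning an empty vector keeps the recorder from panicking, at the cost of not distinguishing "empty input" from "invalid range".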

This commit also cleans up the previous implementation in the inspector, and changes the cheatcode interface to three steps:

1. start recording the debug trace
2. stop recording
3. get the debug trace by index

The reason is to avoid the out-of-memory issue caused by returning the whole trace at once.

Since enabling a dummy tracer still comes with a performance impact, remove the automatic dummy tracer initialization. The cheatcode will return an explicit error and require the test to be run in -vvv mode so that the tracer is enabled by default.

There was an OOM concern, but the get-by-index style, although an improvement, does not solve the root cause. The main issue is that the tracer config was not turned off after the stop-recording cheatcode was called. It seems to be too much of a burden for the tracer to keep recording while the forge test handles the returned traces, since the test also passes the debug traces around, which inflates memory usage. This commit also turns on only the necessary tracer config instead of using all().
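
The save-and-restore behaviour described here might look roughly like the sketch below. All names are hypothetical: `TracerConfig` is a three-flag stand-in for the real tracer configuration, and which flags count as "necessary" is an assumption made for the example.

```rust
/// Hypothetical, reduced tracer configuration.
#[derive(Debug, Clone, Copy, PartialEq, Default)]
struct TracerConfig {
    record_steps: bool,
    record_stack: bool,
    record_memory: bool,
}

/// Hypothetical tracer holding only its configuration.
struct Tracer {
    config: TracerConfig,
}

/// Hypothetical recording state kept by the cheatcodes.
#[derive(Default)]
struct Recording {
    /// Config that was active before recording started.
    original: Option<TracerConfig>,
}

impl Recording {
    /// Enables only the flags the recording needs, remembering the old config.
    fn start(&mut self, tracer: &mut Tracer) {
        if self.original.is_some() {
            return; // already recording
        }
        self.original = Some(tracer.config);
        tracer.config.record_steps = true;
        tracer.config.record_stack = true;
        tracer.config.record_memory = true;
    }

    /// Restores the original config so the tracer stops collecting step data.
    /// Returns `false` if recording was not active.
    fn stop(&mut self, tracer: &mut Tracer) -> bool {
        match self.original.take() {
            Some(config) => {
                tracer.config = config;
                true
            }
            None => false,
        }
    }
}
```

Restoring the saved config, rather than resetting to a fixed default, preserves whatever tracing level the test run was started with.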

boolafish · 2024-09-30T00:41:46Z

bounce for a review 🙏 @DaniPopes @klkvr

boolafish requested review from DaniPopes, mattsse, klkvr and Evalir as code owners July 31, 2024 05:53

boolafish marked this pull request as draft July 31, 2024 05:53

boolafish force-pushed the erc4337-tool-main branch 2 times, most recently from 50d4c0f to d81495d Compare July 31, 2024 07:21

DaniPopes reviewed Jul 31, 2024

zerosnacks linked an issue Jul 31, 2024 that may be closed by this pull request

feat(cheatcodes): add vm.getStateDiffOpcodes to access opcodes inside of tests #6704

Open

zerosnacks added this to the v1.0.0 milestone Jul 31, 2024

zerosnacks added A-cheatcodes Area: cheatcodes T-feature Type: feature labels Jul 31, 2024

boolafish changed the title ~~feat(forge): new cheatcode startDebugTraceRecording and stopAndReturnDebugTraceRecording~~ feat(cheatcode): startDebugTraceRecording and stopAndReturnDebugTraceRecording for ERC4337 testing Aug 1, 2024

zerosnacks mentioned this pull request Aug 2, 2024

feat(cheatcodes): ability to capture and store state diffs #2846

Open

klkvr mentioned this pull request Aug 19, 2024

feat: vm.pauseTracing + vm.resumeTracing #8696

Merged

boolafish force-pushed the erc4337-tool-main branch from ed3ad33 to 72d6ca9 Compare August 23, 2024 01:41

boolafish marked this pull request as ready for review August 23, 2024 03:15

boolafish changed the title ~~feat(cheatcode): startDebugTraceRecording and stopAndReturnDebugTraceRecording for ERC4337 testing~~ feat(cheatcode): startDebugTraceRecording, stopDebugTraceRecording, and getDebugTraceByIndex for ERC4337 testing Aug 23, 2024

boolafish requested a review from DaniPopes August 23, 2024 14:46

boolafish force-pushed the erc4337-tool-main branch 2 times, most recently from 9ff4d0e to 03bacff Compare August 24, 2024 03:47

boolafish force-pushed the erc4337-tool-main branch from 03bacff to 0575405 Compare August 26, 2024 08:44

klkvr requested changes Aug 26, 2024

boolafish force-pushed the erc4337-tool-main branch 2 times, most recently from cfac4e3 to bde1de2 Compare August 27, 2024 03:03

klkvr requested changes Sep 19, 2024

boolafish force-pushed the erc4337-tool-main branch from b0f4fd0 to 141aea7 Compare September 20, 2024 03:07

boolafish requested a review from klkvr September 20, 2024 04:03

boolafish force-pushed the erc4337-tool-main branch 2 times, most recently from cacd9eb to 1dad1a1 Compare September 26, 2024 00:45

boolafish requested review from grandizzy, yash-atreya and zerosnacks as code owners September 26, 2024 00:45

boolafish force-pushed the erc4337-tool-main branch 2 times, most recently from ad998df to c1a6c5b Compare September 26, 2024 01:23

boolafish added 16 commits September 27, 2024 10:16

test: add DebugTrace.t.sol for the debug trace cheatcode

58582dc

fix: rebase errors

b46a242

fix: rebase duplication

37d3e0a

feat: replace instruction result with isOutOfGas

0dbcdcb

fix: CI issues

ca04434

fix: remove DebugTrace wrapper in inspector

9833033

fix: revert to original tracer config when stops

a10e88c

chore: reuse existing opcode functions

cdbdc80

chore: refactor, fmt, clippy run

f973e6f

chore: use ref instead of clone, returning Error when not able to access

4594dc4

chore: move buffer to evm_core from debugger

bad4a4a

chore: cleanup comments, typo

8c3ca8c

boolafish force-pushed the erc4337-tool-main branch from c1a6c5b to 8c3ca8c Compare September 27, 2024 01:20
