Sourcify Tests by mjoerussell · Pull Request #1305 · NomicFoundation/slang

mjoerussell · 2025-04-15T22:08:20Z

Run e2e tests on real-world contracts using the Sourcify dataset. This will eventually replace the Sanctuary tests.

Still to-do:

* Better diagnostic messages when parsing fails (restore messages that we get in Sanctuary tests)
* Test for more import resolution corner cases
* CI Job

The test runner reads the Sourcify manifest to get the list of shards for a particular chain, then downloads each shard sequentially, unpacks it, and tests every source file in it. The runner is still very basic: a lot of features are missing and there is plenty of room for improvement, but it is a good start.
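The flow described above can be sketched roughly as follows. This is only an illustration, not the code in this PR: `Summary`, `run_chain`, `fetch`, and `check` are hypothetical names, the download-and-unpack step is reduced to a closure that returns in-memory sources, and the parser is reduced to a closure returning a boolean.

```rust
/// Outcome counters for one run. All names here are illustrative.
#[derive(Debug, Default, PartialEq)]
pub struct Summary {
    pub shards: usize,
    pub files_passed: usize,
    pub files_failed: usize,
}

/// Walks every shard listed in the manifest, one after another.
///
/// `fetch` stands in for "download and unpack": it returns the
/// (file name, source text) pairs contained in a shard.
/// `check` stands in for the parser and returns `true` on success.
pub fn run_chain<F, C>(manifest: &[&str], mut fetch: F, check: C) -> Summary
where
    F: FnMut(&str) -> Vec<(String, String)>,
    C: Fn(&str) -> bool,
{
    let mut summary = Summary::default();
    for &shard in manifest {
        // Shards are handled strictly one at a time.
        let files = fetch(shard);
        summary.shards += 1;
        for (_name, source) in &files {
            if check(source.as_str()) {
                summary.files_passed += 1;
            } else {
                summary.files_failed += 1;
            }
        }
    }
    summary
}
```

Keeping the fetch and check steps behind closures makes the loop easy to exercise without network access.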

…e parallelized. Now they are using proper iterators, and `Contract::Item` is a struct which contains a file handle instead of the file contents. This means that we can still use a shared buffer to read all of the files, improving performance, without having to borrow the string through several layers of iterators. That borrowing problem was what prompted me to move away from "real" iterators previously. Like the Sanctuary tests, I'm using `rayon` to parallelize the runner. Here, contracts are processed in parallel, but the source files within a single contract are processed sequentially. Most contracts don't have a large number of source files, so parallelizing them as well would add unnecessary complexity.
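A minimal sketch of that shape, under several assumptions: it uses `std::thread::scope` from the standard library instead of `rayon` so that it has no dependencies, it spawns one thread per contract (a real runner would use a bounded pool such as rayon's), and `SourceItem`, `Contract`, `process_contract`, and `process_all` are hypothetical names rather than the types in this PR. The points it illustrates are that each item carries a reader instead of file contents, and that one buffer is reused for all files of a contract.

```rust
use std::io::{self, Read};
use std::thread;

/// A source file that has not been read yet. It owns a reader
/// (a file handle in practice) rather than the file contents.
pub struct SourceItem<R: Read> {
    pub name: String,
    pub reader: R,
}

/// Hypothetical stand-in for a contract: a list of unread sources.
pub struct Contract<R: Read> {
    pub sources: Vec<SourceItem<R>>,
}

/// Reads the files of one contract in order, reusing `buffer`, and
/// returns how many of them `check` accepted.
fn process_contract<R: Read>(
    contract: Contract<R>,
    buffer: &mut String,
    check: &(dyn Fn(&str) -> bool + Sync),
) -> io::Result<usize> {
    let mut passed = 0;
    for mut item in contract.sources {
        buffer.clear(); // keeps the allocation, drops the old contents
        item.reader.read_to_string(buffer)?;
        if check(buffer.as_str()) {
            passed += 1;
        }
    }
    Ok(passed)
}

/// Contracts run in parallel; files inside a contract stay sequential.
pub fn process_all<R: Read + Send>(
    contracts: Vec<Contract<R>>,
    check: &(dyn Fn(&str) -> bool + Sync),
) -> io::Result<usize> {
    thread::scope(|scope| {
        let handles: Vec<_> = contracts
            .into_iter()
            .map(|contract| {
                scope.spawn(move || {
                    // One buffer per worker, reused for every file it reads.
                    let mut buffer = String::new();
                    process_contract(contract, &mut buffer, check)
                })
            })
            .collect();

        let mut total = 0usize;
        for handle in handles {
            total += handle.join().expect("worker thread panicked")?;
        }
        Ok(total)
    })
}
```

Because the item only holds a reader, nothing borrowed from the buffer has to travel through the iterator chain; the buffer is borrowed only inside `process_contract`.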

…t them "in the background". This means less time waiting between shards, which matters more now that processing each shard is internally parallelized. There could still be some improvements here, specifically with making the fetches truly async. Right now we're using `reqwest`, which requires `tokio` as the async runtime. It's possible that we could find a different library that would allow us to make async calls using only the `futures` crate, which would be much better suited to our use case.

…ning up confusing code.

* Fetch archives in the main thread, and process them in a separate thread. This means that the thread doesn't have to take ownership of the `Repository` instance, and so that will get dropped at the correct time.
* Following that change, moved the fetching logic into a closure that we invoke immediately. This is all so that `tx` (Sender) can be dropped before calling `process_thread.join()`. Otherwise, the processing thread will get stuck waiting for a new message forever and will never be joined.
* Removing all of the custom iterators that I built for `ContractArchive` and `Contract`. The `Contract` iterators were no longer being used since I started building compilation units by traversing the import tree. The `ContractArchive` iterator was replaced by a function `ContractArchive::contracts`, which returns an `impl Iterator`. I thought that this would be a much simpler way to express this logic, especially since the other custom iterator was removed.
* Adding additional terminal output to inform the user about the progress of the test runner.
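The sender-drop detail in this commit message can be illustrated with a small standard-library sketch. It assumes a plain `std::sync::mpsc` channel; `Archive`, `run_pipeline`, and `fetch` are hypothetical stand-ins, and only `tx` and `process_thread` mirror names from the message.

```rust
use std::sync::mpsc;
use std::thread;

/// Hypothetical stand-in for a downloaded shard archive.
pub struct Archive {
    pub id: usize,
    pub contracts: Vec<String>,
}

/// Fetches on the calling thread and processes on a worker thread.
/// `fetch` stands in for the download step. Returns the number of
/// contracts the worker saw.
pub fn run_pipeline<F>(shard_count: usize, fetch: F) -> usize
where
    F: Fn(usize) -> Archive,
{
    let (tx, rx) = mpsc::channel::<Archive>();

    // The worker loops until the channel closes, that is, until every
    // `Sender` has been dropped.
    let process_thread = thread::spawn(move || {
        let mut processed = 0;
        for archive in rx {
            processed += archive.contracts.len();
        }
        processed
    });

    // Immediately invoked closure: it takes ownership of `tx`, so the
    // sender is dropped as soon as the closure has run, before `join`.
    (move || {
        for shard in 0..shard_count {
            if tx.send(fetch(shard)).is_err() {
                break; // the worker has already exited
            }
        }
    })();

    // Safe to join: the channel is closed, so the worker's loop ends.
    process_thread.join().expect("processing thread panicked")
}
```

If `tx` were still alive at the `join` call, the `for archive in rx` loop would block waiting for another message and the join would never return. An explicit `drop(tx)` before `join` would work as well.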

changeset-bot · 2025-04-15T22:08:24Z

⚠️ No Changeset found

Latest commit: 2d889f8

Merging this PR will not cause a version bump for any packages. If these changes should not result in a new version, you're good to go. If these changes should result in a version bump, you need to add a changeset.

This PR includes no changesets

When changesets are added to this PR, you'll see the packages that this PR includes changesets for and the associated semver types

mjoerussell · 2025-04-16T15:31:31Z

Just ran a parsing test locally on the entire Ethereum mainnet. Here are the high-level stats:

| Metric | Count |
| --- | --- |
| Source Files | 3,149,211 |
| Contracts Passed | 820,565 |
| Contracts Failed | 4 |
| Unresolved Imports | 0 |

2 of the parse failures were related to multi-line comments. Example:

Error: Expected ContractKeyword or ImportKeyword or InterfaceKeyword or LibraryKeyword or PragmaKeyword.
     ╭─[Presscoins.sol:335:1]
     │
 335 │ ╭─▶ /** PressOnDemand is the first decentralized ondemand platform with in-Dapp store,
     ┆ ┆
 341 │ ├─▶   /*
     │ │
     │ ╰───────── Error occurred here.
─────╯

2 of the parse failures were on event parameters which used the `indexed` keyword twice. Example:

Error: Expected CloseParen or Comma.
     ╭─[MEXPToken.sol:207:32]
     │
 207 │     event Burn(address indexed indexed from, uint256 value);
     │                                ─────────────┬─────────────
     │                                             ╰─────────────── Error occurred here.
─────╯

OmarTawfik

Thanks for working on this!

crates/solidity/testing/sourcify/src/import_resolver.rs

crates/solidity/testing/sourcify/src/main.rs

crates/solidity/testing/sourcify/src/command.rs

.github/workflows/sourcify_single_chain.yml

.github/workflows/sourcify_all_chains.yml

OmarTawfik

Left a couple of comments. But otherwise LGTM.
Thanks!

github-actions added 18 commits March 31, 2025 08:12

Starting to add tests for sourcify

b318f2f

* Build a CompilationUnit from source files in a contract, resolving …

b6e12d5

…imports
* Reuse a buffer when reading source files for efficiency
* Reorganize code

Use json feature of reqwest

f2fa3eb

Fetch archive data on the main thread instead of spawning an addition…

b30c5ac

…al thread for it

Starting to add bindings tests. As part of this, I'm reworking a lot …

b9ff97f

…of how imports/files are resolved when building a compilation unit, since the previous way wasn't working in practice.

Fix path resolution when the source file was imported with a URL

1ca5524

Add sharding options, which are not used yet. Also add the ability to…

9b2fa58

… specify a specific contract to test

* Use sharding options

ad3227a

* Allow the user to not include partial_match contracts. They are still included by default
* Categorize contracts between full_match and partial_match

Run infra lint

3e7673e

* Remove some unused code

67f316c

* Emit events report after testing is complete

Add ShowCombinedResultsCommand from Sanctuary tests

1d2208c

Add more chains

86296ce

Fix corner case bugs in import resolution

f357d74

Run infra lint

b4ba88b

Improve error reporting for parse errors

89eaa28

github-actions added 7 commits April 16, 2025 10:33

Remove unused function

a6051a5

Merge branch 'main' into feature/sourcify

dca9529

Fix clippy lint errors

e231037

Add the ability to skip contracts with known parser bugs, like in the…

2785daa

… Sanctuary tests. There are 4 known ones in the Ethereum mainnet, but right now I only have the address for 1

Fix a bug in ImportRemap, matches_context should always return true…

7805cea

… when there is no context

* Add --check-infer-version like in the Sanctuary tests

9768423

* Add GitHub workflow for sourcify tests

Run infra lint

594d9c2
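The `ImportRemap` fix in 7805cea follows from how Solidity remappings are written: `context:prefix=target`, where the `context:` part is optional and a remapping without one applies to every importing file. The following is a simplified sketch of that rule, not the implementation in this PR: only the names `ImportRemap` and `matches_context` come from the commit message, while the fields, `parse`, and `apply` are assumptions, and the parser assumes the prefix contains no `:`.

```rust
/// Illustrative model of a Solidity remapping, `context:prefix=target`.
#[derive(Debug, PartialEq)]
pub struct ImportRemap {
    pub context: Option<String>,
    pub prefix: String,
    pub target: String,
}

impl ImportRemap {
    /// Parses `context:prefix=target`; the `context:` part is optional.
    /// Simplification: assumes the prefix itself contains no ':'.
    pub fn parse(text: &str) -> Option<Self> {
        let (lhs, target) = text.split_once('=')?;
        let (context, prefix) = match lhs.split_once(':') {
            Some((context, prefix)) if !context.is_empty() => {
                (Some(context.to_string()), prefix)
            }
            Some((_, prefix)) => (None, prefix),
            None => (None, lhs),
        };
        Some(Self {
            context,
            prefix: prefix.to_string(),
            target: target.to_string(),
        })
    }

    /// A remapping without a context applies to every importing file.
    pub fn matches_context(&self, importing_file: &str) -> bool {
        match &self.context {
            Some(context) => importing_file.starts_with(context.as_str()),
            None => true,
        }
    }

    /// Rewrites `import_path` if this remapping applies to it.
    pub fn apply(&self, importing_file: &str, import_path: &str) -> Option<String> {
        if !self.matches_context(importing_file) {
            return None;
        }
        import_path
            .strip_prefix(self.prefix.as_str())
            .map(|rest| format!("{}{}", self.target, rest))
    }
}
```

Returning `false` for the `None` case would silently disable every context-free remapping, which is the most common kind.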

mjoerussell marked this pull request as ready for review April 16, 2025 22:16

mjoerussell requested a review from a team as a code owner April 16, 2025 22:16

Fix lint issues

5ef993d

OmarTawfik requested changes Apr 29, 2025

github-actions added 3 commits April 29, 2025 11:52

--shards-count -> --shard-count

a3e75d7

Let users select chains with their IDs instead of provided a predifin…

4e5045c

…ed list of chains

Add tests for ImportResolver

4d48e4d

mjoerussell requested a review from OmarTawfik April 30, 2025 15:34

Merge branch 'main' into feature/sourcify

db85e08

OmarTawfik reviewed May 5, 2025

.github/workflows/sourcify_single_chain.yml (outdated, resolved)

OmarTawfik reviewed May 5, 2025

.github/workflows/sourcify_all_chains.yml (resolved)

mjoerussell and others added 2 commits May 5, 2025 08:53

Update .github/workflows/sourcify_single_chain.yml

73a2d8e

Co-authored-by: Omar Tawfik <15987992+OmarTawfik@users.noreply.github.com>

TEMP: Add 'pull_request' trigger for sourcify_all_chains workflow t…

8270e05

…o try to test the workflow. This trigger should be removed before merging because we don't actually want it to run on every PR.

OmarTawfik approved these changes May 5, 2025

github-actions added 14 commits May 5, 2025 09:11

Update name of called workflow: 'sourcify' -> 'sourcify_single_chain'

ae50712

Remove runs-on

7a5aab6

Remove quotes around variables that do not resolve to string values

8e2878f

Update variable handlebars

53024ec

Update variable handlebars

91227ce

Add temp defaults for bindings and infer_version while testing with a…

60d7ef6

… pr trigger

Debug terminal::step crash

10409f6

Strip repo root for contract archive display_path

2cca411

Avoid underflow when calculating spacer_width in Terminal::banner

dabc1e3

* Expect results file to contain SOURCIFY instead of SANCTUARY

eb4af9c

* Test workflow inputs with quoted strings instead of "raw" strings

Restore default values

1840c24

Remove testing features from sourcify_all_shards workflow

3b1ec52

Remove check_parser option, we always want to check the parser

f013656

Merge branch 'main' into feature/sourcify

2d889f8
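The underflow fix in dabc1e3 ("Avoid underflow when calculating spacer_width in Terminal::banner") concerns a common `usize` pitfall. A hypothetical sketch of the idea, not the actual `Terminal::banner` code: the function below and its centring layout are assumptions, and only the name `spacer_width` comes from the commit title.

```rust
/// Hypothetical banner layout: pads `title` on both sides so that it
/// sits roughly centred in a line of `width` columns.
pub fn banner(title: &str, width: usize) -> String {
    let title_width = title.chars().count();
    // Plain `width - title_width` panics in debug builds (and wraps in
    // release builds) when the title is wider than the terminal.
    // `saturating_sub` clamps the result at zero instead.
    let spacer_width = width.saturating_sub(title_width) / 2;
    let spacer = " ".repeat(spacer_width);
    format!("{spacer}{title}{spacer}")
}
```

With a title wider than the available width, the spacer simply becomes empty and the title is returned unpadded.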

mjoerussell added this pull request to the merge queue May 6, 2025

Merged via the queue into NomicFoundation:main with commit 8d1cde7 May 6, 2025
2 checks passed

mjoerussell deleted the feature/sourcify branch May 6, 2025 13:56

mjoerussell merged 77 commits into NomicFoundation:main from manastech:feature/sourcify

3 participants
