Skip to content

Conversation

@aheejin
Copy link
Member

@aheejin aheejin commented Dec 18, 2024

When deterministic_paths is set, we are currently using -ffile-prefix-map to produce a deterministic prefix (/emsdk/emscripten) in data and debug info, which was recently done in #23222 and #23225. -ffile-prefix-map is an alias for both -fmacro-prefix-map, which affects __FILE__ macro, and -fdebug-prefix-map, which affects the debug info section.

This lets the contents of __FILE__ use the deterministic prefix /emsdk/emscripten unconditionally, i.e., even if deterministic_paths is not turned on. This is to produce reproducible builds and also make the code size tests' results consistent across platforms. The reason we still allow the real local path to be used in the debug info section when deterministic_paths is not set is to allow local developers to see their real paths.

This is basically implementing what's suggested in #23195 (comment)

This also turns deterministic_paths on for the Ninja path in embuilder for consistency with the non-Ninja path.

Fixes #23195.

When `deterministic_paths` is set, we are currently using
`-ffile-prefix-map` to produce the same path in data and debug info. In
the case of absolute paths, their emscripten path is replaced with a
fake path `/emsdk/emscripten`, and in the case of relative paths, all
path relative to the emscripten directory is removed, so
`../../system/lib/somefile.c` becomes `system/lib/somefiles.c`.
https://github.com/emscripten-core/emscripten/blob/f66b5d706e174d9e5cc6122c06ea29dcd2735cd0/tools/system_libs.py#L472-L477
https://github.com/emscripten-core/emscripten/blob/f66b5d706e174d9e5cc6122c06ea29dcd2735cd0/tools/system_libs.py#L495-L501

But this does not make relative paths and absolute paths the same, which
can be a problem when data generated by `__FILE__` macro is included in
one of code size tests. This problem is discussed in emscripten-core#23195.

This PR makes `__FILE__` macro produce the same data in all cases by
using the fake path `/emsdk/emscripten` as its base, so that it wouldn't
change any results for code size tests. This is done by
`-fmacro-prefix-map`. This differs from the current behavior because we
don't handle relative and absolute paths differently.

For the debug info, when `deterministic_paths` is set, this uses a fake
path `/emsdk/emscripten` as a base emscripten path. When
`deterministic_paths` is not set, this uses real local absolute paths in
the debug info. This allows local developers to see their real paths in
the debug info while continuing to use the same (fake) path
`/emsdk/emscripten` we have used so far for the release binaries. Users
can set their debug base path to whatever path they like, but given that
we have used `/emsdk/emscripten` in release binaries for a while, it is
possible that some users have set their configuration with this
directory, so it would be better not to break them by changing it. This
is done by `-ffile-prefix-map` as we have done so far, which is an alias
for both `-fdebug-prefix-map` and `-fmacro-prefix-map`.

This is basically implementing what's suggested in
emscripten-core#23195 (comment)
and
emscripten-core#23195 (comment)

This also turns `deterministic_paths` on for the Ninja path in
embuilder for consistency with the non-Ninja path.

Fixes emscripten-core#23915.
cflags += [f'-ffile-prefix-map={relative_source_dir}/=']
cflags += [f'-ffile-prefix-map={source_dir}=/emsdk/emscripten',
'-fdebug-compilation-dir=/emsdk/emscripten']
cflags += [f'-ffile-prefix-map={relative_source_dir}={FAKE_EMSCRIPTEN_PATH}']
Copy link
Collaborator

@sbc100 sbc100 Dec 18, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can you split this line out into its own PR? Something like "system_libs.py: Use the same deterministic path when building in batch mode". This seems like a separate and uncontroversial bugfix.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Done: #23222

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Great!

Once #23222 lands maybe the next step should just be to unconditionally enable determinisic_paths? i.e. just remove the non-deterministic mode. We can always add it back later if it turns out someone wants it... but I doubt they wil.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I thought what we were going to do was to unconditionally use -fmacro-prefix-map but use -ffile-prefix-map (which also enables -fdebug-prefix-map) only in deterministic_paths? This is basically what #23195 (comment) suggested but you also said maybe even that was not necessary... But anyway this PR does not remove the undeterministic mode. I can do either.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I rebased this PR on top of #23222 and #23225. This still uses deterministic_paths. Please let me know if you think it'd better go unconditionally set deterministic_paths and remove the variable, i.e., you don't need your real local paths in the debug info.

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think just removing deterministic_paths and always doing it this way makes sense. Sorry for the back on forth on this.

This is an automatic change generated by tools/maint/rebaseline_tests.py.

The following (4) test expectation files were updated by
running the tests with `--rebaseline`:

```
other/codesize/test_codesize_cxx_except.size: 170928 => 170950 [+22 bytes / +0.01%]
other/codesize/test_codesize_cxx_except_wasm.size: 142143 => 142154 [+11 bytes / +0.01%]
other/codesize/test_codesize_cxx_except_wasm_exnref.size: 144730 => 144741 [+11 bytes / +0.01%]
other/codesize/test_codesize_cxx_mangle.size: 232437 => 232468 [+31 bytes / +0.01%]

Average change: +0.01% (+0.01% - +0.01%)
```
Comment on lines 606 to 608
cflags += [f'-ffile-prefix-map={source_dir}={DETERMINISITIC_PREFIX}',
f'-ffile-prefix-map={relative_source_dir}={DETERMINISITIC_PREFIX}',
f'-fdebug-compilation-dir={DETERMINISITIC_PREFIX}']
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

if deterministic_paths part hasn't changed; I only merged two cflags += ... into one and changed the order of the flags.

@aheejin aheejin marked this pull request as ready for review December 21, 2024 08:39
aheejin added a commit to aheejin/emscripten that referenced this pull request Dec 27, 2024
This removes `deterministic_paths` option by turning it on all the time,
in order to produce reproducible builds, both for `__FILE__` macro and
debug info paths.

This is an alternative to emscripten-core#23212, which did not remove
`deterministic_paths` but always set only `-fmacro-prefix-map` on.

This PR is what was suggested in
emscripten-core#23195 (comment)
and
emscripten-core#23212 (comment).

Fixes emscripten-core#23195.
@aheejin
Copy link
Member Author

aheejin commented Dec 27, 2024

Will close in favor of #23256.

@aheejin aheejin closed this Dec 27, 2024
@aheejin aheejin deleted the macro_prefix branch December 27, 2024 09:07
aheejin added a commit that referenced this pull request Jan 2, 2025
This removes `deterministic_paths` option by turning it on all the time,
in order to produce reproducible builds, both for `__FILE__` macro and
debug info paths.

This is an alternative to #23212, which did not remove
`deterministic_paths` but always set only `-fmacro-prefix-map` on.

This PR is what was suggested in

#23195 (comment)
and

#23212 (comment).

Fixes #23195.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Handling __FILE__ in code size tests in CircleCI

2 participants