Fix includes normalize win32 by digit-google · Pull Request #2552 · ninja-build/ninja

digit-google · 2025-01-13T15:33:19Z

Remove the use of fixed-size buffers in src/includes_normalize-win32.cc in order to support long file paths.

This also requires using GetFullPathNameW() as, surprisingly, GetFullPathNameA() would fail with long paths even when long paths are enabled on the host system running the tests.

To achieve this, introduce ConvertWin32UnicodeToUTF8() and ConvertUTF8ToWin32Unicode() functions to perform conversions properly in src/util.cc.

jhasse · 2025-01-15T18:33:46Z

Have you created a bug report for Microsoft? I think we should try to avoid working around bugs in Windows as much as possible, so that the underlying issue can be fixed and everyone benefits.

digit-google · 2025-02-07T15:19:42Z

So I actually tried to file a bug, which requires registering a Microsoft account (don't have one, don't want one), and from what I can gather on various Internet comments, essentially goes into a /dev/null bucket most of the time. Even if they fixed this, existing Vista / Windows 10 / whatever installs would never get the fix, so I think we'll have to just work-around this here.

jhasse · 2025-02-07T16:25:28Z

I think someone who wants to keep this moving probably has an account and could do it for us. Until then let's merge the #else version so that we are doing the correct thing and would immediately benefit after a Windows update.

huber-mvtec · 2025-02-07T16:28:34Z

@jhasse After reading the Maximum File Path Limitation documentation again from my understanding this section states that only the *W functions don't have any MAX_PATH limitations, which is in line with @digit-google observation that GetFullPathNameA() fails for long paths even when the system option is set in the registry and during build.

Therefore I don't understand how working with GetFullPathNameA() can fix the problem.

Thanks to the both of you for looking into this.

jhasse · 2025-02-07T22:34:01Z

The fact that only the W versions don't have the MAX_PATH limitation is a bug on Microsoft's side which I don't want to work around.

huber-mvtec · 2025-02-10T10:26:50Z

@jhasse While I agree with you about this being a bug on Microsoft's side (I would say the whole existence of a MAX_PATH in 2025 is a bug), i strongly doubt that this is something they will likely fix. The implications of such a change could be quite big from Microsoft's perspective, because potentially people will depend on the current (documented) behavior.

Just to clarify:

You will not merge the PR as is but only the #else part with the *A version, and ninja wont support long path names under Windows for the time being, right? If this is the case, as a compromise would you accept a cmake option to switch the behavior?
Would you merge the *W part if there is a bug report at Microsoft?

Thank you.

digit-google · 2025-02-10T11:35:09Z

It's even crazier than that:

The W functions are the only documented APIs that support long paths, when the feature is enabled and the application has the right manifest attribute. See for yourself: https://learn.microsoft.com/en-us/windows/win32/fileio/maximum-file-path-limitation?tabs=registry
The fact that most A functions support them too when the mode is enabled is an implementation detail, that just happens to work most of the time. There is no guarantee that this won't break in the future (though Microsoft is very good at maintaining backwards compatibility). Maybe GetFullPathNameA() does not work because it returns an output path (while most other functions take an input path). Who knows?
Ninja relying on A functions for long paths support could be considered a bug in Ninja itself. The correct fix is to only use W functions for all file path-related operations.

I would not be surprised if a bug filed for this "issue" would be closed as "working as intended". We can try, but I don't have high hopes it will work.

jhasse · 2025-02-10T16:54:19Z

You will not merge the PR as is but only the #else part with the *A version, and ninja wont support long path names under Windows for the time being, right?

Well, not for the normalization of long paths at least. There is a subset of features where we support long paths already.

If this is the case, as a compromise would you accept a cmake option to switch the behavior?

No, if someone builds from source anyway, they can checkout one of the PRs that use the *W function.

Would you merge the *W part if there is a bug report at Microsoft?

That depends on Microsoft's answer.

There is no guarantee that this won't break in the future (though Microsoft is very good at maintaining backwards compatibility).

The best thing to make them not break it, is to rely on it ;)

Ninja relying on A functions for long paths support could be considered a bug in Ninja itself.

We're using A functions for UTF-8, not for long path support.

sztomi · 2025-03-31T12:07:59Z

As a data point, I managed to build a large Qt codebase that previously failed to compile because of moc-generated long relative include paths; it works with the ninja from this branch.

sztomi · 2025-03-31T13:16:46Z

@jhasse To also reflect on the discussion above: from its inception, long path support was documented to only be supported with the *W APIs. The fact that it's not ergonomic/surprising that the *A versions don't support it does not make it a bug. It's unlikely that Microsoft will act on (or respond to) this. Converting paths to wide chars is the correct thing to do on Windows and not a workaround.

jhasse · 2025-03-31T15:51:59Z

The fact that Microsoft documented their bug does not make it not a bug.

sztomi · 2025-04-01T15:28:43Z

@jhasse On the other hand (and I say this with no malice), your judgement does not make it a bug. It is not documented as a bug, there is simply a list of functions on MSDN that support long paths if the setting is enabled in the registry and the application manifest, and they happen to be the *W family of functions only.

Whether or not it is a bug, boils down to what the intention of Microsoft was while introducing long paths. It is a valid choice to only add a new feature to the *W APIs (even if we, not being aware of the entire implementation constraints feel like we would have chosen differently).

jhasse · 2025-04-01T20:32:30Z

Well, I disagree.

sztomi · 2025-04-03T09:41:01Z

I filed a bug a report: https://aka.ms/AAvdklt (unfortunately there is no way to view it in a browser, it will try to open the awful Feedback Hub app on desktop). I'll update this thread if I get any kind of feedback.

sztomi · 2025-05-06T15:46:14Z

No reaction after a month from Microsoft.

pkasting · 2025-09-22T19:51:58Z

This behavior is intentional on Microsoft's part, for backward compat reasons (the same reason why the A/W split was introduced alongside macros mapping to one or the other to begin with). Their assumption is that the only apps using the A versions are older apps, for whom the APIs must not change behavior, even for arguable "bugfix" reasons. So this will never be implemented in the A versions.

From Chromium's perspective, we no longer care, since we no longer build with ninja. However, as a neutral third party, this is something where ninja should change its behavior (to use the W versions, not because it wants UCS-2 or because it wants this bug fix per se, but because that is what "apps which support more modern conventions such as Unicode and long paths" are assumed to use).

src/includes_normalize-win32.cc

jhasse · 2025-09-22T21:02:04Z

Their assumption is that the only apps using the A versions are older apps, for whom the APIs must not change behavior, even for arguable "bugfix" reasons. So this will never be implemented in the A versions.

Even the A versions absolutely change behavior, that's what manifests are for.

UCS-2 was a mistake and that's why the A API is the future (with the introduction of UTF-8). It's true that there was a time where the programming community thought that wide charsets are more modern but that has definitely shifted in recent years.

pkasting · 2025-09-22T22:35:56Z

In theory, yes, Microsoft could have made these changes to the A APIs under the manifest gating. But in practice that's not how these specific APIs work. The A ones do not change at all, and the W ones change under manifest gates.

I consider this overly cautious, and I understand where you're coming from in principle. It is certainly how some of their APIs work, after all. But it is what it is. I could be missing something about how AppCompat and future planned registry tweaks affect this that makes Microsoft's behavior sane; either way, i think the future is not the A APIs (which are often not so much UTF-8 as ASCII or 1252, but it varies by function), but something entirely different. Until then, the change here seems like the best course to me.

pkasting · 2025-09-23T15:11:46Z

(See also https://learn.microsoft.com/en-us/windows/win32/intl/conventions-for-function-prototypes, especially the highlighted note: "New Windows applications should use Unicode...They should be written with generic functions, and should define UNICODE to compile the functions into Unicode functions." The "A" versions are for ANSI, i.e. the old Windows code pages, and may have arbitrary functional gaps; they most definitely do not guarantee UTF-8 input/output will always work.)

jhasse · 2025-09-23T16:37:19Z

Of course Microsoft wants to shift the blame on the developers instead of fixing their shit. As I have said here and in other issues/PRs: It's a bug on Microsoft's side which I don't want to work-around.

pkasting · 2025-09-23T19:00:10Z

Shrug; you can believe what you want. They have stated reasons, they don't consider it buggy but a principled compromise, they believe changing this is worse than not changing it. Regardless of the truth of their position, the only practical impact refusing to accept the PR here has at this point is that ninja is unusable for certain users and projects, punishing users for something they didn't cause and can't control.

As I said, I don't personally care. Chromium has ditched ninja and now Valve has too, so it doesn't affect me.

Cheers,
PK

mummynobbit · 2025-09-23T19:43:33Z

How come long paths issue is not fixed? I remember seeing a manifest embedded in the executable in one of the latest releases of ninja (1.12.1).
For which users doesn't it work exactly?

sztomi · 2025-09-24T11:03:35Z

@mummynobbit you are commenting on the exact issue that needs fixing. I personally encountered this when building a large-ish Qt project. moc generates deeply nested include paths, ninja tries (and fails) to normalize them on Windows.

@jhasse I would argue that the question of this being a bug on Microsoft's end or not is completely orthogonal to the issue. Currently this bug is present on this platform and there is a valid, documented fix for it. ninja is broken without the fix. I can see in the repo history that you are willing to merge workarounds for compatibility with other external software, so why not Windows? (even though I disagree with calling this a workaround, but whatever, let's call it that). It seems like you are letting whatever feelings you may have towards this platform or Microsoft cloud your judgement. And I'm saying this as someone who has strong feelings and opinions regarding Microsoft, Windows and their role in our society. That doesn't change the fact that they exist and widely used, so we need to live with them.

panther7 · 2025-09-24T14:23:48Z

@jhasse Is some other fix for Windows builder without this PR fix?

jhasse · 2025-09-24T18:30:35Z

How come long paths issue is not fixed? I remember seeing a manifest embedded in the executable in one of the latest releases of ninja (1.12.1).

The flag in the manifest only results in some of Microsoft's APIs to support long paths. This results in some cases being fixed, but other still not working.
Also there were (are?) some Ninja internal path limitations.

For which users doesn't it work exactly?

We don't know, the bug reports haven't been a lot more detailed than "works" or "doesn't work".
It seems to depend on the Windows Version, the compiler being used, the CRT being used, Registry flags and maybe even some other factors.

@jhasse Is some other fix for Windows builder without this PR fix?

Sorry, what do you mean?

digit-google · 2025-09-25T11:30:20Z

I agree with @pkasting and others here that.

Independently from what you may think about Microsoft and the decisions they've made, they have been consistent about documenting that only the W functions are compatible with long path support, and that applications should be modified to drop usage of the A functions for this.

The fact that some, but not all, A functions do work when the manifest includes the right attribute cannot even be relied upon. In theory, though unlikely, it could break in any future Windows update. Plus the A functions that do not work, as shown in this PR, and which already create issues.

In other words, Ninja should use the documented and supported way to support long paths.
Not doing it is just punishing Ninja users for no good technical reason.

Moreover, the current, incomplete, state of long path support leads to situations where failures are confusing to developers, and not easily reproducible between different setups.

@jhasse, can you reconsider your position for the benefit of Ninja users on Windows and, to be frank, the sanity of Ninja maintainers (you included) since this issue pops up frequently in the issue tracker?

jhasse · 2025-09-25T15:04:37Z

Yes. The discussion drifted a bit away from this PR on to the general issue, let's focus again.

Would it be possible to add a test for this? Also if you would create a PR with the #else version I could merge that right away as mentioned above.

(I also think that putting <> includes over "" includes is completely wrong, did clang-format do this?)

digit-google · 2025-09-25T15:22:29Z

Would it be possible to add a test for this?

Actually, the tests were changed to check the new behavior (but I failed to update the comments in them, so this wasn't obvious, I'll fix it).

Also if you would create a PR with the #else version I could merge that right away as mentioned above.

I assume you mean just removing the use of fixed-size buffers, right? Yes I can, but the function will still fail due to the GetFullPathNameA() failure. Please confirm if this is what you really want (I see little benefit in this).

(I also think that putting <> includes over "" includes is completely wrong, did clang-format do this?)

Indeed, this is what clang-format does with the current .clang-format settings. I try to use git clang-format as much as possible for my CLs so I don't have to deal with style issues myself. If you feel strongly about that style, I suggest we discuss this in a separate PR where we update .clang-format first.

Do not use fixed-size buffers in order to better support long file paths. This does *not* get rid of the failure described at [1], because GetFullPathNameA() will fail with long paths, even when the feature is enabled in the application's manifest. Insteead, using GetFullPathNameW() is required to completely fix this issue. [1] ninja-build#2442 (comment)

These will be used by functions that need to use Unicode Win32 APIS, like GetFullPathNameW() which supports long paths, instead of GetFullPathNameA() which does not, even when long paths are enabled on the system!

Use GetFullPathNameW() instead of GetFullPathNameA() to ensure that normalization works for all long paths on Windows. Adjust unit tests accordingly. Fixes ninja-build#2442

digit-google · 2025-09-26T09:43:42Z

I have refactored the PR into four commits for clarity:

The first two remove some bounded buffer usage and removes the using namespacec std, but only fix one partial issue related to long paths.
The last two introduce UTF-8 <> Win32 Unicode conversion functions, and fix the problem properly using GetFullPathNameW().

Both adjust the unit-tests accordingly.

panther7 · 2025-09-30T06:51:12Z

@digit-google hi, i tried your ninja (1.14.0.git) and my build always end with full rebuild "warning: premature end of file; recovering"

digit-google · 2025-09-30T08:54:03Z

@digit-google hi, i tried your ninja (1.14.0.git) and my build always end with full rebuild "warning: premature end of file; recovering"

That means your .ninja_deps is corrupted, and the truncation that is applied by Ninja in this case doesn't fix it.
I have seen similar issue crop up rarely in my project's build, and diagnosed this to an invalid checksum value (but don't have a good repro case or explanation for this).

Maybe what you are seeing is incidental, what happens if you remove the .ninja_deps file and rebuild in your case?

panther7 · 2025-09-30T22:30:37Z

@digit-google It's clear build, .ninja_deps is fresh file.

builds commands are:

# fresh repo/dir
# ...
ninja -C out/x64 electron
# ... full build
# ... done
ninja -C out/x64 electron:electron_dist_zip
# ninja: warning: premature end of file; recovering
# ... full build again

digit-google · 2025-10-01T09:38:31Z

That doesn't seem related to this PR at all, why are you asking this here?

Also what do you mean exactly by "your ninja" here?

If you are talking about the tip-of-tree of github.com/ninja-build/ninja, then please file a different Github issue with reproduction steps to carry the conversation there.

If you are talking about the Fuchsia Ninja program instead, report that through the Fuchsia public tracker instead. Please do not pollute upstream issues with unrelated content.

panther7 · 2025-10-01T09:49:00Z

I only say, that ninja v1.31.1 works properly, but "1.14.0.git" from fix-includes_normalize-win32 has bug with full rebuild.

digit-google · 2025-10-01T10:14:46Z

Thanks for the clarification. In that case what about 1.14.0.git tip-of-tree (i.e. without the fix-includes_normalize-win32 commits)?

panther7 · 2025-10-02T13:23:55Z

I tried 1.14.0 from master, and I think, that works properly.
Build 1.14.0 from fix-includes_normalize-win32 repeating build like this.
@digit-google

bebuch · 2025-11-03T10:42:01Z

I can confirm that a Ninja compiled from this branch was able to build our CMake code base on Windows 11, which consists of over 6,000 steps, including various third-party tools such as Qt, protobuf/gRPC, ActiveX, etc.

@jhasse Are more changes required? We would be very happy to see this in an offical release.

bebuch · 2025-11-13T22:00:34Z

@jhasse ping

jhasse · 2025-11-29T18:54:33Z

Do I understand it correctly that @panther7 found a regression?

digit-google · 2026-01-12T16:53:02Z

Sorry for the long answer.

Do I understand it correctly that @panther7 found a regression?

This is unclear, @panther7 , can you clarify exactly what you are seeing? As I wrote previously, it looks like you are trigerring another Ninja bug that may be unrelated to this PR. Exact repro steps would thus be very welcomed here.

lygstate · 2026-02-07T16:06:41Z

I am also facing this issue, don't know if its another Ninja bug

[0/245] Running ninja for QtWebEngineCore in D:/work/xtal/xtal-wasm/build-msvc/qtwebengine/src/core/Release/AMD64ninja: warning: premature end of file; recovering
ninja: Entering directory `D:/work/xtal/xtal-wasm/build-msvc/qtwebengine/src/core/Release/AMD64'
[215/18340] CXX obj/base/allocator/partition_allocator/src/partition_alloc/allocator_base/process_handle_win.obj
C:\Program Files (x86)\Windows Kits\10\\include\10.0.26100.0\\um\handleapi.h(27): warning C4005: 'INVALID_HANDLE_VALUE': macro redefinition
../../../../../../qt-everywhere-src-6.10.2/qtwebengine/src/3rdparty/chromium/base/allocator/partition_allocator/src\partition_alloc/partition_alloc_base/win/windows_types.h(69): note: see previous definition of 'INVALID_HANDLE_VALUE'
[295/18340] CC obj/third_party/nasm/nasm/outmacho.obj

lygstate · 2026-02-07T16:52:51Z

Sorry for the long answer.

Do I understand it correctly that @panther7 found a regression?

This is unclear, @panther7 , can you clarify exactly what you are seeing? As I wrote previously, it looks like you are trigerring another Ninja bug that may be unrelated to this PR. Exact repro steps would thus be very welcomed here.

      unsigned checksum = *reinterpret_cast<unsigned*>(buf + size - 4);
      int expected_id = ~checksum;
      int id = static_cast<int>(nodes_.size());
      if (id != expected_id || node->id() >= 0) {
        read_failed = true;
        break;
      }

anything wrong with this?
id == expected_id but node->id() >=0

I found ../../../../../../qt-everywhere-src-6.10.2/qtwebengine/src/3rdparty/chromium/third_party/protobuf/src/google/protobuf/arenaz_sampler.h appeared twice
in .ninja_deps
so what's the cause?

lygstate · 2026-02-07T20:26:56Z

Sorry for the long answer.

Do I understand it correctly that @panther7 found a regression?

This is unclear, @panther7 , can you clarify exactly what you are seeing? As I wrote previously, it looks like you are trigerring another Ninja bug that may be unrelated to this PR. Exact repro steps would thus be very welcomed here.

I am verfied the issue is triggered this MR, I redo this mr at #2723, and now it's fine

digit-google mentioned this pull request Jan 13, 2025

[Windows] Long paths issue with Ninja >=1.12 #2442

Closed

digit-google force-pushed the fix-includes_normalize-win32 branch from 0d08963 to 0e706ed Compare February 7, 2025 15:14

digit-google mentioned this pull request Feb 7, 2025

Properly support for Windows long file paths through Unicode Win32 API calls. #2410

Closed

cristitep-nxp mentioned this pull request Feb 13, 2025

Bulit dev_composite_cdc_vcom_cdc_vcom_freertos fails nxp-mcuxpresso/vscode-for-mcux#63

Open

lesteve mentioned this pull request Apr 2, 2025

BUG: Build from source can fail on Windows for scikit-learn v1.6.1 with Ninja mkdir error scikit-learn/scikit-learn#31123

Closed

codebytere mentioned this pull request Jul 22, 2025

Windows build failure: The filename or extension is too long electron/build-tools#675

Closed

pkasting reviewed Sep 22, 2025

View reviewed changes

src/includes_normalize-win32.cc Outdated Show resolved Hide resolved

pkasting mentioned this pull request Sep 22, 2025

Enable "longpath" support on Window #2359

Closed

digit-google force-pushed the fix-includes_normalize-win32 branch from 0e706ed to 864c812 Compare September 23, 2025 06:28

digit-google added 4 commits September 26, 2025 11:37

includes_normalize: Remove using namespace std + reformat.

7ad66b9

Add Win32 Unicode <-> UTF-8 conversion functions.

4c8cab9

These will be used by functions that need to use Unicode Win32 APIS, like GetFullPathNameW() which supports long paths, instead of GetFullPathNameA() which does not, even when long paths are enabled on the system!

includes_normalizes: Full fix for long path support.

842fca0

Use GetFullPathNameW() instead of GetFullPathNameA() to ensure that normalization works for all long paths on Windows. Adjust unit tests accordingly. Fixes ninja-build#2442

digit-google force-pushed the fix-includes_normalize-win32 branch from 864c812 to 842fca0 Compare September 26, 2025 09:41

lygstate mentioned this pull request Feb 7, 2026

Fix includes normalize win32 #2723

Open

Conversation

digit-google commented Jan 13, 2025

Uh oh!

jhasse commented Jan 15, 2025

Uh oh!

digit-google commented Feb 7, 2025

Uh oh!

jhasse commented Feb 7, 2025

Uh oh!

huber-mvtec commented Feb 7, 2025

Uh oh!

jhasse commented Feb 7, 2025

Uh oh!

huber-mvtec commented Feb 10, 2025

Uh oh!

digit-google commented Feb 10, 2025

Uh oh!

jhasse commented Feb 10, 2025

Uh oh!

sztomi commented Mar 31, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sztomi commented Mar 31, 2025

Uh oh!

jhasse commented Mar 31, 2025

Uh oh!

sztomi commented Apr 1, 2025

Uh oh!

jhasse commented Apr 1, 2025

Uh oh!

sztomi commented Apr 3, 2025

Uh oh!

sztomi commented May 6, 2025

Uh oh!

pkasting commented Sep 22, 2025

Uh oh!

Uh oh!

jhasse commented Sep 22, 2025

Uh oh!

pkasting commented Sep 22, 2025

Uh oh!

pkasting commented Sep 23, 2025

Uh oh!

jhasse commented Sep 23, 2025

Uh oh!

pkasting commented Sep 23, 2025

Uh oh!

mummynobbit commented Sep 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sztomi commented Sep 24, 2025

Uh oh!

panther7 commented Sep 24, 2025

Uh oh!

jhasse commented Sep 24, 2025

Uh oh!

digit-google commented Sep 25, 2025

Uh oh!

jhasse commented Sep 25, 2025

Uh oh!

digit-google commented Sep 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

digit-google commented Sep 26, 2025

Uh oh!

panther7 commented Sep 30, 2025

Uh oh!

digit-google commented Sep 30, 2025

Uh oh!

panther7 commented Sep 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

digit-google commented Oct 1, 2025

Uh oh!

panther7 commented Oct 1, 2025

Uh oh!

digit-google commented Oct 1, 2025

Uh oh!

panther7 commented Oct 2, 2025

Uh oh!

sztomi commented Mar 31, 2025 •

edited

Loading

mummynobbit commented Sep 23, 2025 •

edited

Loading

digit-google commented Sep 25, 2025 •

edited

Loading

panther7 commented Sep 30, 2025 •

edited

Loading

lygstate commented Feb 7, 2026 •

edited

Loading