[clang] Apply internal buffering to clang diagnostics printing #113440

Fznamznon · 2024-10-23T10:25:19Z

To stabilize output of clang when clang is run in multiple build threads, the whole diagnostic message is written first to internal buffer and then the whole message is put to the output stream which usually points to stderr which is unbuffered by default. This helps to avoid printing to stderr with small chunks and interleaving of multiple diagnostic messages.

This is also fixing a slight regression that happened somewhere in clang-17. I checked clang-14 and its output is slightly more stable.
In general the tool that is run in several threads can't control output interleaving so this change might be questionable. It is up to build's system/script/tool that is running clang to handle output interleaving. However, it seems for example Make doesn't prevent compiler's output interleaving in case of many threads and gcc's output is much more stable. I was using a makefile to build 6 files within 6 thread, each file contained a warning, clang's output was like (just an illustration, it is pretty random):

gcc's output using the same makefile:

So I decided to give it a try since it could greatly improve user experience for some cases. It turned out simple and gave a relatively stable result.

clang's output after the change:

I'm not sure how to test this properly, so the PR doesn't contain any test. But please let me know what you think, I'm open to any suggestions.

To stabilize output of clang when clang is run in multiple build threads the whole diagnostic message is written first to internal buffer and then the whole message is put to the output stream which usually points to stderr to avoid printing to stderr with small chunks and interleaving of multiple diagnostic messages.

llvmbot · 2024-10-23T10:25:54Z

@llvm/pr-subscribers-clang

Author: Mariya Podchishchaeva (Fznamznon)

Changes

To stabilize output of clang when clang is run in multiple build threads, the whole diagnostic message is written first to internal buffer and then the whole message is put to the output stream which usually points to stderr which is unbuffered by default. This helps to avoid printing to stderr with small chunks and interleaving of multiple diagnostic messages.

This is also fixing a slight regression that happened somewhere in clang-17. I checked clang-14 and its output is slightly more stable.
In general the tool that is run in several threads can't control output interleaving so this change might be questionable. It is up to build's system/script/tool that is running clang to handle output interleaving. However, it seems for example Make doesn't prevent compiler's output interleaving in case of many threads and gcc's output is much more stable. I was using a makefile to build 6 files within 6 thread, each file contained a warning, clang's output was like (just an illustration, it is pretty random):

gcc's output using the same makefile:

So I decided to give it a try since it could greatly improve user experience for some cases. It turned out simple and gave a relatively stable result.

clang's output after the change:

I'm not sure how to test this properly, so the PR doesn't contain any test. But please let me know what you think, I'm open to any suggestions.

Full diff: https://github.com/llvm/llvm-project/pull/113440.diff

3 Files Affected:

(modified) clang/include/clang/Frontend/TextDiagnostic.h (+11-1)
(modified) clang/lib/Frontend/TextDiagnostic.cpp (+13-3)
(modified) clang/lib/Frontend/TextDiagnosticPrinter.cpp (+7-3)

diff --git a/clang/include/clang/Frontend/TextDiagnostic.h b/clang/include/clang/Frontend/TextDiagnostic.h
index a2fe8ae995423b..e47a111c1dcdc2 100644
--- a/clang/include/clang/Frontend/TextDiagnostic.h
+++ b/clang/include/clang/Frontend/TextDiagnostic.h
@@ -16,6 +16,7 @@
 #define LLVM_CLANG_FRONTEND_TEXTDIAGNOSTIC_H
 
 #include "clang/Frontend/DiagnosticRenderer.h"
+#include "llvm/ADT/SmallString.h"
 #include "llvm/Support/raw_ostream.h"
 
 namespace clang {
@@ -33,8 +34,15 @@ namespace clang {
 /// DiagnosticClient is implemented through this class as is diagnostic
 /// printing coming out of libclang.
 class TextDiagnostic : public DiagnosticRenderer {
-  raw_ostream &OS;
+  raw_ostream &Out;
   const Preprocessor *PP;
+  // To stabilize output of clang when clang is run in multiple build threads
+  // the whole diagnostic message is written first to internal buffer and then
+  // the whole message is put to the output stream Out which usually points to
+  // stderr to avoid printing to stderr with small chunks and interleaving of
+  // multiple diagnostic messages.
+  SmallString<1024> InternalBuffer;
+  llvm::raw_svector_ostream OS;
 
 public:
   TextDiagnostic(raw_ostream &OS, const LangOptions &LangOpts,
@@ -104,6 +112,8 @@ class TextDiagnostic : public DiagnosticRenderer {
 
   void emitBuildingModuleLocation(FullSourceLoc Loc, PresumedLoc PLoc,
                                   StringRef ModuleName) override;
+  void endDiagnostic(DiagOrStoredDiag D,
+                     DiagnosticsEngine::Level Level) override;
 
 private:
   void emitFilename(StringRef Filename, const SourceManager &SM);
diff --git a/clang/lib/Frontend/TextDiagnostic.cpp b/clang/lib/Frontend/TextDiagnostic.cpp
index 4119ce6048d45d..fddb846745aa83 100644
--- a/clang/lib/Frontend/TextDiagnostic.cpp
+++ b/clang/lib/Frontend/TextDiagnostic.cpp
@@ -656,7 +656,11 @@ static bool printWordWrapped(raw_ostream &OS, StringRef Str, unsigned Columns,
 TextDiagnostic::TextDiagnostic(raw_ostream &OS, const LangOptions &LangOpts,
                                DiagnosticOptions *DiagOpts,
                                const Preprocessor *PP)
-    : DiagnosticRenderer(LangOpts, DiagOpts), OS(OS), PP(PP) {}
+    : DiagnosticRenderer(LangOpts, DiagOpts), Out(OS), PP(PP),
+      OS(InternalBuffer) {
+  this->OS.buffer().clear();
+  this->OS.enable_colors(true);
+}
 
 TextDiagnostic::~TextDiagnostic() {}
 
@@ -664,7 +668,7 @@ void TextDiagnostic::emitDiagnosticMessage(
     FullSourceLoc Loc, PresumedLoc PLoc, DiagnosticsEngine::Level Level,
     StringRef Message, ArrayRef<clang::CharSourceRange> Ranges,
     DiagOrStoredDiag D) {
-  uint64_t StartOfLocationInfo = OS.tell();
+  uint64_t StartOfLocationInfo = Out.tell();
 
   // Emit the location of this particular diagnostic.
   if (Loc.isValid())
@@ -677,7 +681,7 @@ void TextDiagnostic::emitDiagnosticMessage(
     printDiagnosticLevel(OS, Level, DiagOpts->ShowColors);
   printDiagnosticMessage(OS,
                          /*IsSupplemental*/ Level == DiagnosticsEngine::Note,
-                         Message, OS.tell() - StartOfLocationInfo,
+                         Message, Out.tell() - StartOfLocationInfo,
                          DiagOpts->MessageLength, DiagOpts->ShowColors);
 }
 
@@ -1545,3 +1549,9 @@ void TextDiagnostic::emitParseableFixits(ArrayRef<FixItHint> Hints,
     OS << "\"\n";
   }
 }
+
+void TextDiagnostic::endDiagnostic(DiagOrStoredDiag D,
+                                   DiagnosticsEngine::Level Level) {
+  Out << OS.str();
+  OS.buffer().clear();
+}
diff --git a/clang/lib/Frontend/TextDiagnosticPrinter.cpp b/clang/lib/Frontend/TextDiagnosticPrinter.cpp
index dac5c44fe92566..6e24a19a1ad501 100644
--- a/clang/lib/Frontend/TextDiagnosticPrinter.cpp
+++ b/clang/lib/Frontend/TextDiagnosticPrinter.cpp
@@ -133,12 +133,16 @@ void TextDiagnosticPrinter::HandleDiagnostic(DiagnosticsEngine::Level Level,
   // diagnostics in a context that lacks language options, a source manager, or
   // other infrastructure necessary when emitting more rich diagnostics.
   if (!Info.getLocation().isValid()) {
-    TextDiagnostic::printDiagnosticLevel(OS, Level, DiagOpts->ShowColors);
+    SmallString<1000> OutText;
+    llvm::raw_svector_ostream OutputBuffer(OutText);
+    OutputBuffer.enable_colors(true);
+    TextDiagnostic::printDiagnosticLevel(OutputBuffer, Level,
+                                         DiagOpts->ShowColors);
     TextDiagnostic::printDiagnosticMessage(
-        OS, /*IsSupplemental=*/Level == DiagnosticsEngine::Note,
+        OutputBuffer, /*IsSupplemental=*/Level == DiagnosticsEngine::Note,
         DiagMessageStream.str(), OS.tell() - StartOfLocationInfo,
         DiagOpts->MessageLength, DiagOpts->ShowColors);
-    OS.flush();
+    OS << OutputBuffer.str();
     return;
   }

tbaederr · 2024-10-23T14:33:18Z

What does "run in multiple build threads" mean? When does that happen?

Fznamznon · 2024-10-23T14:45:50Z

What does "run in multiple build threads" mean? When does that happen?

What I meant is a build of some project using clang as a compiler and make -j N where N > 1 (or a similar tool).

tbaederr · 2024-10-23T15:05:03Z

clang/lib/Frontend/TextDiagnosticPrinter.cpp

  // This is important as if the location is missing, we may be emitting
  // diagnostics in a context that lacks language options, a source manager, or
  // other infrastructure necessary when emitting more rich diagnostics.
  if (!Info.getLocation().isValid()) {


So the problem you're describing only exists for diagnostics without valid locations? 🤔

No, it exists for all of them. The handling is a bit different for diagnostics without valid locations, so I had to modify this place and TextDiagnostic class to handle the case with valid locations.

Fznamznon · 2024-10-23T15:19:45Z

BTW, the test failure IS caused by the patch. It somehow caused permanent swap of what clang-tidy prints in clang-tidy-run-with-database.cpp test. Not sure why...

cor3ntin · 2024-10-24T12:12:40Z

Are we calling SetUnbuffered somewhere we shouldn't?

Fznamznon · 2024-10-24T13:06:03Z

Are we calling SetUnbuffered somewhere we shouldn't?

Well, yeah, but I think we should?

llvm-project/llvm/lib/Support/raw_ostream.cpp

Line 910 in 37832d5

// Set standard error to be unbuffered.

The thing is though, I've tried setting SetBuffered somewhere in TextDiagnosticPrinter's constructor and it didn't work out. Seems calling SetBuffered on raw_ostream doesn't do much.

Fznamznon · 2024-12-03T10:47:54Z

Ping? Does that make sense? If so, I'll try to figure clangd output swap that caused lit fail (which seems harmless).
If not, I'll abandon this.

AaronBallman

This feels like we're hacking around a deeper issue with raw_ostream; I would have expected that SetBuffered() is honored. But it also worries me that we're manually enabling colors in all circumstances and no tests broke.

I'm also a bit worried we're updating TextDiagnostic but not SARIFDiagnostic; does emitting to SARIF also have interleaving issues?

AaronBallman · 2024-12-03T17:32:24Z

clang/lib/Frontend/TextDiagnostic.cpp

+    : DiagnosticRenderer(LangOpts, DiagOpts), Out(OS), PP(PP),
+      OS(InternalBuffer) {
+  this->OS.buffer().clear();
+  this->OS.enable_colors(true);


Shouldn't this be looking at DiagOpts->ShowColors?

Good point. I think we do have tests checking colors. We probably don't have the tests that check there is no colors.

AaronBallman · 2024-12-03T17:33:00Z

clang/lib/Frontend/TextDiagnosticPrinter.cpp

-    TextDiagnostic::printDiagnosticLevel(OS, Level, DiagOpts->ShowColors);
+    SmallString<1000> OutText;
+    llvm::raw_svector_ostream OutputBuffer(OutText);
+    OutputBuffer.enable_colors(true);


Same here as above; shouldn't this be looking at ShowColors first?

Fznamznon · 2024-12-04T11:16:32Z

I'm also a bit worried we're updating TextDiagnostic but not SARIFDiagnostic; does emitting to SARIF also have interleaving issues?

It seems it does. Is it a problem?

This feels like we're hacking around a deeper issue with raw_ostream; I would have expected that SetBuffered() is honored.

I'll try to see what happens in raw_ostream. I'm not entirely sure why SetBuffered doesn't work. My understanding was that we probably shouldn't set SetBuffered even if it worked on llvm::errs since it intentionally is created unbuffered.

Fznamznon · 2024-12-06T16:24:58Z

@AaronBallman
Tt seems SetBuffered is not honored for stderr intentionally :

Here we return 0

llvm-project/llvm/lib/Support/raw_ostream.cpp

Line 858 in e68a3e4

// the complexity.

To not set the buffer here

llvm-project/llvm/lib/Support/raw_ostream.cpp

Line 105 in e68a3e4

SetUnbuffered();

AaronBallman · 2024-12-06T17:28:01Z

@AaronBallman Tt seems SetBuffered is not honored for stderr intentionally :

Here we return 0

llvm-project/llvm/lib/Support/raw_ostream.cpp

Line 858 in e68a3e4

// the complexity.

To not set the buffer here

llvm-project/llvm/lib/Support/raw_ostream.cpp

Line 105 in e68a3e4

SetUnbuffered();

Good find! This was done in 1771022, and I think our needs have changed in the intervening 15 years so that line buffering is perhaps now worth the added complexity. Maybe we should try backing this out and fixing the underlying issue? CC @sunfishcode for opinions

tahonermann · 2024-12-11T17:39:05Z

Is it known which platforms are affected by this issue? I ask because I see some issues with how consoles/terminals are handled by raw_fd_ostream.

On Windows, raw_fd_ostream has a bool data member named IsWindowsConsole that is intended to indicate whether FD is a file descriptor for a Windows console. However, the code to detect a Windows console is incorrect.

llvm-project/llvm/lib/Support/raw_ostream.cpp

Lines 639 to 641 in 1946d32

    
           // Check if this is a console device. This is not equivalent to isatty. 
        
           IsWindowsConsole = 
        
               ::GetFileType((HANDLE)::_get_osfhandle(fd)) == FILE_TYPE_CHAR;

The above code only checks that FD is directed to a character device, not a console specifically. The correct way to check for a console is already implemented elsewhere.

llvm-project/llvm/lib/Support/Windows/Process.inc

Lines 289 to 292 in 1946d32

    
           bool Process::FileDescriptorIsDisplayed(int fd) { 
        
             DWORD Mode; // Unused 
        
             return (GetConsoleMode((HANDLE)_get_osfhandle(fd), &Mode) != 0); 
        
           }

Interestingly, the llvm::Process class is already used to detect a terminal for non-Windows systems via indirection through is_displayed().

llvm-project/llvm/lib/Support/raw_ostream.cpp

Lines 866 to 868 in 1946d32

    
           bool raw_fd_ostream::is_displayed() const { 
        
             return sys::Process::FileDescriptorIsDisplayed(FD); 
        
           }

is_displayed() is also used on Windows, so there is some potential for inconsistent and surprising behavior across character devices.

llvm-project/llvm/lib/Support/raw_ostream.cpp

Lines 506 to 520 in 1946d32

    
           bool raw_ostream::prepare_colors() { 
        
             // Colors were explicitly disabled. 
        
             if (!ColorEnabled) 
        
               return false; 
        
             // Colors require changing the terminal but this stream is not going to a 
        
             // terminal. 
        
             if (sys::Process::ColorNeedsFlush() && !is_displayed()) 
        
               return false; 
        
             if (sys::Process::ColorNeedsFlush()) 
        
               flush(); 
        
             return true; 
        
           }

The code for detecting a Windows console is specific to an actual Windows console device. It works for detecting a Windows console for processes running in a cmd.exe or Windows Terminal environment, but it won't detect the display used for a, e.g., Cygwin MinTTY terminal. That might be motivation for checking for a character device instead of a Windows console specifically, but comments should reflect that choice (and they currently suggest otherwise). Regardless, we should have one place where we determine whether output is directed to a console/terminal and the right place for that is Process::FileDescriptorIsDisplayed().

For UNIX platforms, isatty() is used to detect output directed to a terminal.

llvm-project/llvm/lib/Support/Unix/Process.inc

Lines 309 to 316 in 1946d32

    
           bool Process::FileDescriptorIsDisplayed(int fd) { 
        
           #if HAVE_ISATTY 
        
             return isatty(fd); 
        
           #else 
        
             // If we don't have isatty, just return false. 
        
             return false; 
        
           #endif 
        
           }

However, for reasons I don't understand, raw_fd_ostream::preferred_buffer_size() checks for both a character device and (indirectly) isatty() (this code used to call isatty() directly).

llvm-project/llvm/lib/Support/raw_ostream.cpp

Lines 851 to 862 in 1946d32

    
           assert(FD >= 0 && "File not yet open!"); 
        
           struct stat statbuf; 
        
           if (fstat(FD, &statbuf) != 0) 
        
             return 0; 
        
           // If this is a terminal, don't use buffering. Line buffering 
        
           // would be a more traditional thing to do, but it's not worth 
        
           // the complexity. 
        
           if (S_ISCHR(statbuf.st_mode) && is_displayed()) 
        
             return 0; 
        
           // Return the preferred block size. 
        
           return statbuf.st_blksize;

I think it is worth cleaning this up to see if that 1) fixes the reported problem, and 2) causes any regressions (which would then prompt improving the comments to better explain the intent).

Fznamznon · 2025-03-06T15:12:15Z

@tahonermann thank you for your investigation, and sorry for the delay.

Is it known which platforms are affected by this issue?

The issue is reported on Linux with a makefile as a reproducer. I'm not aware if that is the case on Windows.

I think it is worth cleaning this up to see if that 1) fixes the reported problem, and 2) causes any regressions (which would then prompt improving the comments to better explain the intent).

Well, simply commenting out the code preventing buffering for console

--- a/llvm/lib/Support/raw_ostream.cpp
+++ b/llvm/lib/Support/raw_ostream.cpp
@@ -854,8 +854,8 @@ size_t raw_fd_ostream::preferred_buffer_size() const {
   // If this is a terminal, don't use buffering. Line buffering
   // would be a more traditional thing to do, but it's not worth
   // the complexity.
-  if (S_ISCHR(statbuf.st_mode) && is_displayed())
-    return 0;
+  // if (S_ISCHR(statbuf.st_mode) && is_displayed())
+  //   return 0;
   // Return the preferred block size.
   return statbuf.st_blksize;
 #endif

and setting incoming OS in TextDiagnostic to be buffered

 TextDiagnostic::TextDiagnostic(raw_ostream &OS, const LangOptions &LangOpts,
                                DiagnosticOptions *DiagOpts,
                                const Preprocessor *PP)
-    : DiagnosticRenderer(LangOpts, DiagOpts), OS(OS), PP(PP) {}
+    : DiagnosticRenderer(LangOpts, DiagOpts), OS(OS), PP(PP) {
+    OS.SetBuffered();}

-TextDiagnostic::~TextDiagnostic() {}
+TextDiagnostic::~TextDiagnostic() {OS.SetUnbuffered();
+}

does help to make the output more stable, however I think due to llvm::errs being easily accessible by any parts of LLVM, I'm currently having troubles with sporadic memory corruption in clang tooling tests. I suppose this happens due to setting llvm::errs to buffered/unbuffered in several threads. So, either raw_ostream needs to become thread-safe somehow and/or setting llvm::errs to buffer does seem really unsafe. I feel I'm stuck. WDYT?

Fznamznon requested review from AaronBallman, cor3ntin, tahonermann and tbaederr October 23, 2024 10:25

llvmbot added the clang Clang issues not falling into any other category label Oct 23, 2024

tbaederr reviewed Oct 23, 2024

View reviewed changes

AaronBallman reviewed Dec 3, 2024

View reviewed changes

Merge branch 'main' into multi-thread-clang

aa812c9

Fznamznon closed this Mar 10, 2025

[clang] Apply internal buffering to clang diagnostics printing #113440

[clang] Apply internal buffering to clang diagnostics printing #113440

Uh oh!

Conversation

Fznamznon commented Oct 23, 2024

Uh oh!

llvmbot commented Oct 23, 2024

Uh oh!

tbaederr commented Oct 23, 2024

Uh oh!

Fznamznon commented Oct 23, 2024

Uh oh!

tbaederr Oct 23, 2024

Choose a reason for hiding this comment

Uh oh!

Fznamznon Oct 23, 2024

Choose a reason for hiding this comment

Uh oh!

Fznamznon commented Oct 23, 2024

Uh oh!

cor3ntin commented Oct 24, 2024

Uh oh!

Fznamznon commented Oct 24, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Fznamznon commented Dec 3, 2024

Uh oh!

AaronBallman left a comment

Choose a reason for hiding this comment

Uh oh!

AaronBallman Dec 3, 2024

Choose a reason for hiding this comment

Uh oh!

Fznamznon Dec 4, 2024

Choose a reason for hiding this comment

Uh oh!

AaronBallman Dec 3, 2024

Choose a reason for hiding this comment

Uh oh!

Fznamznon commented Dec 4, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Fznamznon commented Dec 6, 2024

Uh oh!

AaronBallman commented Dec 6, 2024

Uh oh!

tahonermann commented Dec 11, 2024

Uh oh!

Fznamznon commented Mar 6, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

Fznamznon commented Oct 24, 2024 •

edited

Loading

Fznamznon commented Dec 4, 2024 •

edited

Loading