-
Notifications
You must be signed in to change notification settings - Fork 13.8k
Compute quoted args for debuginfo at most once per session #146973
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Closed
Closed
Changes from all commits
Commits
Show all changes
2 commits
Select commit
Hold shift + click to select a range
File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file was deleted.
Oops, something went wrong.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
67 changes: 67 additions & 0 deletions
67
compiler/rustc_codegen_ssa/src/debuginfo/command_line_args/mod.rs
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,67 @@ | ||
use std::sync::Arc; | ||
|
||
use rustc_middle::middle::debuginfo::CommandLineArgsForDebuginfo; | ||
use rustc_middle::ty::TyCtxt; | ||
use rustc_middle::util::Providers; | ||
|
||
#[cfg(test)] | ||
mod tests; | ||
|
||
pub(crate) fn provide(providers: &mut Providers) { | ||
providers.hooks.args_for_debuginfo = args_for_debuginfo; | ||
} | ||
|
||
/// Hook implementation for [`TyCtxt::args_for_debuginfo`]. | ||
fn args_for_debuginfo<'tcx>(tcx: TyCtxt<'tcx>) -> &'tcx Arc<CommandLineArgsForDebuginfo> { | ||
tcx.args_for_debuginfo_cache.get_or_init(|| { | ||
// Command-line information to be included in the target machine. | ||
// This seems to only be used for embedding in PDB debuginfo files. | ||
// FIXME(Zalathar): Maybe skip this for non-PDB targets? | ||
let argv0 = std::env::current_exe() | ||
.unwrap_or_default() | ||
.into_os_string() | ||
.into_string() | ||
.unwrap_or_default(); | ||
let quoted_args = quote_command_line_args(&tcx.sess.expanded_args); | ||
|
||
// Self-profile counter for the number of bytes produced by command-line quoting. | ||
tcx.prof.artifact_size("quoted_command_line_args", "-", quoted_args.len() as u64); | ||
|
||
Arc::new(CommandLineArgsForDebuginfo { argv0, quoted_args }) | ||
}) | ||
} | ||
|
||
/// Joins command-line arguments into a single space-separated string, quoting | ||
/// and escaping individual arguments as necessary. | ||
/// | ||
/// The result is intended to be informational, for embedding in debug metadata, | ||
/// and might not be properly quoted/escaped for actual command-line use. | ||
fn quote_command_line_args(args: &[String]) -> String { | ||
// Start with a decent-sized buffer, since rustc invocations tend to be long. | ||
let mut buf = String::with_capacity(128); | ||
|
||
for arg in args { | ||
if !buf.is_empty() { | ||
buf.push(' '); | ||
} | ||
|
||
print_arg_quoted(&mut buf, arg); | ||
} | ||
|
||
buf | ||
} | ||
|
||
/// Equivalent to LLVM's `sys::printArg` with quoting always enabled | ||
/// (see llvm/lib/Support/Program.cpp). | ||
fn print_arg_quoted(buf: &mut String, arg: &str) { | ||
buf.reserve(arg.len() + 2); | ||
|
||
buf.push('"'); | ||
for ch in arg.chars() { | ||
if matches!(ch, '"' | '\\' | '$') { | ||
buf.push('\\'); | ||
} | ||
buf.push(ch); | ||
} | ||
buf.push('"'); | ||
} |
File renamed without changes.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,5 @@ | ||
#[derive(Debug)] | ||
pub struct CommandLineArgsForDebuginfo { | ||
pub argv0: String, | ||
pub quoted_args: String, | ||
} |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
If the commandline changed, then we should recompile all CGUs as otherwise the commandline arguments in the debuginfo would be wrong. If it doesn't change, then no CGUs would be recompiled even if it was a query. If unconditionally recompiling cgus when the cli changes is not acceptable, then maybe this entire feature should be put behind a cli flag and be disabled by default?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
As far as I can tell, bypassing the query system is already the current behaviour. So if that's a problem, maybe the whole “command-line in PDB” thing needs to be ripped out until it can be re-landed in an acceptable way.
(I don't have any attachment to the feature myself; I'm just trying to make the compiler do this quoting step once per process instead of literally 500+ times for no good reason.)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yeah, I don't think anyone considered this when landing the original version.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There's also the semi-related #128842, where the EXE path being embedded in PDB is reportedly troublesome.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yeah, I guess we should either rip out this feature or put it behind a flag.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I opened https://rust-lang.zulipchat.com/#narrow/channel/131828-t-compiler/topic/Some.20PDB.20info.20bypasses.20the.20query.20system.20and.20path.20remapping/with/541369247 to ask if anyone else has opinions.
My preference is to just rip out the whole thing and wait for someone to complain, since it's having outsized impact relative to its niche use-case, and we have no idea whether anyone is actually benefiting from it.