sql/*: add hint injection #158096

michae2 · 2025-11-20T00:10:18Z

sql: add parseHint step when loading hint into hints cache

When loading an external statement hint into the statement hints cache,
we might need to call some function to get the hint ready for use. (For
hint injections, this function is tree.NewHintInjectionDonor which
parses and walks the donor statement fingerprint.) This function could
fail, in which case we want to skip over the hint but not return an
error from GetStatementHintsFromDB. This function could succeed but
create some extra state which we need to save.

This commit adds a new parseHint step which calls any functions needed
to get the hint ready, and creates a new hints.Hint struct which holds
the object(s) created when parsing hints. (These are analogous to
parseStats and TableStatistic from the stats cache.)

Informs: #153633

Release note: None

sql/*: add hint injection

During ReloadHintsIfStale we now call Validate and InjectHints
using the donor to perform the AST rewrite. We save the rewritten AST
in the statement separately from the original AST.
We wrap prepareUsingOptimizer and makeOptimizerPlan with
functions that first try preparing / planning with injected hints,
and then try again without injected hints in case the injected hints
are invalid.

With these two pieces we can now actually perform hint injection.

Fixes: #153633

Release note (sql change): A new "hint injection" ability has been
added, which allows operators to dynamically inject inline hints into
statements, without modifying the text of those statements. Hints can be
injected using the builtin function crdb_internal.inject_hint with the
target statement fingerprint to rewrite. For example, to add an index
hint to the statement SELECT * FROM my_table WHERE col = 3, use:

SELECT crdb_internal.inject_hint(
  'SELECT * FROM my_table WHERE col = _',
  'SELECT * FROM my_table@my_table_col_idx WHERE col = _'
);

Whenever a statement is executed matching statement fingerprint
SELECT * FROM my_table WHERE col = _, it will first be rewritten
to include the injected index hint.

sql/*: invalidate cached memos after hint injection changes

If we build a memo with hint injection, and then later we realize that
memo won't work (maybe because we discover the hint is unsatisfiable
during execution of a prepared statement) we need to invalidate the
cached memo.

To do this, add a usingHintInjection field which tells the memo
staleness check whether we're trying with or without hint injection.

Also, in a related but separate change, this commit adds all matching
HintIDs to the optimizer metadata so that we don't invalidate cached
memos if the hintsGeneration changed due to some unrelated statement
hints changing.

Informs: #153633

Release note: None

When loading an external statement hint into the statement hints cache, we might need to call some function to get the hint ready for use. (For hint injections, this function is `tree.NewHintInjectionDonor` which parses and walks the donor statement fingerprint.) This function could fail, in which case we want to skip over the hint but not return an error from `GetStatementHintsFromDB`. This function could succeed but create some extra state which we need to save. This commit adds a new `parseHint` step which calls any functions needed to get the hint ready, and creates a new `hints.Hint` struct which holds the object(s) created when parsing hints. (These are analogous to `parseStats` and `TableStatistic` from the stats cache.) Informs: cockroachdb#153633 Release note: None

1. During `ReloadHintsIfStale` we now call `Validate` and `InjectHints` using the donor to perform the AST rewrite. We save the rewritten AST in the statement separately from the original AST. 2. We wrap `prepareUsingOptimizer` and `makeOptimizerPlan` with functions that first try preparing / planning with injected hints, and then try again without injected hints in case the injected hints are invalid. With these two pieces we can now actually perform hint injection. Fixes: cockroachdb#153633 Release note (sql change): A new "hint injection" ability has been added, which allows operators to dynamically inject inline hints into statements, without modifying the text of those statements. Hints can be injected using the builtin function `crdb_internal.inject_hint` with the target statement fingerprint to rewrite. For example, to add an index hint to the statement `SELECT * FROM my_table WHERE col = 3`, use: ``` SELECT crdb_internal.inject_hint( 'SELECT * FROM my_table WHERE col = _', 'SELECT * FROM my_table@my_table_col_idx WHERE col = _' ); ``` Whenever a statement is executed matching statement fingerprint `SELECT * FROM my_table WHERE col = _`, it will first be rewritten to include the injected index hint.

If we build a memo with hint injection, and then later we realize that memo won't work (maybe because we discover the hint is unsatisfiable during execution of a prepared statement) we need to invalidate the cached memo. To do this, add a usingHintInjection field which tells the memo staleness check whether we're trying with or without hint injection. Also, in a related but separate change, this commit adds all matching HintIDs to the optimizer metadata so that we don't invalidate cached memos if the hintsGeneration changed due to some unrelated statement hints changing. Informs: cockroachdb#153633 Release note: None

cockroach-teamcity · 2025-11-20T00:10:35Z

This change is

michae2

Reviewable status: complete! 0 of 0 LGTMs obtained (waiting on @DrewKimball)

pkg/sql/plan_opt.go line 563 at r3 (raw file):

		ast = opc.p.stmt.ASTWithInjectedHints
	}
	f.Metadata().SetHintIDs(opc.p.GetHintIDs())

I wasn't completely happy with this call to SetHintIDs. It could also be done inside optbuild somewhere. This seemed like an ok spot because we're mostly not accessing the planner inside optbuild.

pkg/sql/hints/hint_table.go line 168 at r1 (raw file):

func (hint *Hint) Size() int {
	// TODO(michae2): add size of HintInjectionDonor

Whoops, I forgot to do this. One sec.

michae2

Reviewable status: complete! 0 of 0 LGTMs obtained (waiting on @DrewKimball)

pkg/sql/logictest/testdata/logic_test/statement_hint_builtins line 216 at r2 (raw file):


query T
SELECT regexp_replace(message, E'\\d+', 'x') FROM [SHOW TRACE FOR SESSION] WHERE message LIKE '%injected hints%'

I think some of these tracing checks will need the same adjustment as in #158026.

yuzefovich

Nice to see this come together! I skimmed the PR and had a few questions, will defer to others for closer review.

@yuzefovich reviewed 6 of 6 files at r1, 7 of 7 files at r2, 7 of 7 files at r3, all commit messages.
Reviewable status: complete! 0 of 0 LGTMs obtained (waiting on @DrewKimball, @mgartner, and @michae2)

pkg/sql/logictest/testdata/logic_test/statement_hint_builtins line 216 at r2 (raw file):

Previously, michae2 (Michael Erickson) wrote…

I think some of these tracing checks will need the same adjustment as in #158026.

Yes, opc.log appends the statement to all messages under high vmodule config.

-- commits line 37 at r2:
nit: have we settled on the naming? It's probably the first time we're documenting the statement hints feature, so we should agree on terminology.

pkg/sql/logictest/testdata/logic_test/statement_hint_builtins line 251 at r2 (raw file):


statement ok
SELECT a, x FROM abc JOIN xy ON y = b WHERE a = 5

nit: might also be nice to EXPLAIN the statement before hint injection to show that lookup join is used.

pkg/sql/hints/hint_table.go line 93 at r1 (raw file):

		hintID, fingerprint, hint, err := parseHint(it.Cur(), fingerprintFlags)
		if err != nil {
			log.Dev.Warningf(

nit: I wonder whether we should add a limiter to this warning, like once a second, but maybe it'd only matter if we have the hint cache thrashing.

pkg/sql/hints/BUILD.bazel line 27 at r1 (raw file):

        "//pkg/sql/hintpb",
        "//pkg/sql/isql",
        "//pkg/sql/parser",

nit: I now have uneasy feeling when adding new dependencies on sql/parser since that package is often the build bottleneck. It might be nice to use parserutils.ParseOne to avoid that.

pkg/sql/statement.go line 185 at r2 (raw file):

	)

	for i, hint := range s.Hints {

Do we plan to document the scenario when multiple hints can be applied to a single stmt? In the current version, the first (in ASC creation order) successful injection wins, but perhaps a more intuitive behavior would be for the latest hint to win?

pkg/sql/logictest/testdata/logic_test/statement_hint_builtins line 450 at r3 (raw file):

SET tracing = off

# (This should be empty.)

nit: there is query empty directive for this (or statement count 0).

pkg/sql/logictest/testdata/logic_test/statement_hint_builtins line 532 at r3 (raw file):

injected hints from external statement hint x
trying preparing with injected hints
preparing with injected hints failed with: index "abc_foo" not found

Can we avoid trying to use the same invalid hint twice?

michae2 added 3 commits November 19, 2025 15:39

michae2 requested review from a team and DrewKimball November 20, 2025 00:10

michae2 commented Nov 20, 2025

View reviewed changes

michae2 requested a review from mgartner November 20, 2025 17:20

yuzefovich reviewed Nov 21, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

sql/*: add hint injection #158096

sql/*: add hint injection #158096

michae2 commented Nov 20, 2025

Uh oh!

cockroach-teamcity commented Nov 20, 2025

Uh oh!

michae2 left a comment

Uh oh!

michae2 left a comment

Uh oh!

yuzefovich left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

sql/*: add hint injection #158096

Are you sure you want to change the base?

sql/*: add hint injection #158096

Conversation

michae2 commented Nov 20, 2025

Uh oh!

cockroach-teamcity commented Nov 20, 2025

Uh oh!

michae2 left a comment

Choose a reason for hiding this comment

Uh oh!

michae2 left a comment

Choose a reason for hiding this comment

Uh oh!

yuzefovich left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants