
Conversation

sulami (Contributor) commented Aug 24, 2025

Does your PR solve an issue?

First of all, this is not mergeable as-is; it's a request for feedback/advice.

We're using sqlx with AWS Aurora, and have noticed an issue with Aurora's "zero-downtime" restarts, which wipe prepared statements but preserve TCP connections. In this scenario, all existing connections are effectively poisoned without notice, breaking queries until the connections eventually age out of the pool. This has forced us to disable query caching, at a significant performance cost.

I've had a look at how ActiveRecord, our other main stack, handles this: it wipes the client-side cache and retries when it gets this error back. While retrying right away would be nicer, since it rescues the in-flight query, getting the connection back into a functional state for future queries is the next best thing, and rather easy to implement.

The main problem is testing the change. I've included a second commit with a working but flawed integration-test setup. It's flawed because it needs to expose additional public interfaces so the test can delete the server-side prepared statements while leaving the client-side cache intact, something the library API should not allow, for consistency. I feel an integration test is the better way to test this scenario, but neither sqlx nor MySQL exposes a way to set up the required conditions. Specifically, statements prepared via the binary protocol are not named, and thus cannot be deallocated via DEALLOCATE PREPARED.

I would like to invite feedback & advice on:

  • the fix in the first place, I think it's a valid fix, albeit for a niche situation
  • a better way to test this
  • assuming the above are sorted, specifics of the fix

If this or a similar approach gets accepted, the same change might also be needed for Postgres, which I suspect exhibits the same issue.

Is this a breaking change?

Assuming all the issues get resolved, no, I don't think it would be breaking.

The worst-case outcome I can think of is a false-positive match on the error, which would needlessly discard the statement cache and force re-preparation. It might be nice to match more closely on the specific error, but given the state of this change, this is a functional PoC.

sulami added 2 commits on August 24, 2025 14:00:
Some cloud offerings like AWS Aurora allow for "zero-downtime" restarts
for patches, which preserves existing TCP connections but wipes out a
lot of server state, including the statement cache. In that scenario,
trying to execute a previously prepared statement causes an error
response with

```
HY000 Unknown prepared statement handler (<id>) given to mysql_stmt_precheck
```

which appears to be the only way to detect this scenario.
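This detection can be sketched as a small predicate over the error's SQLSTATE and message. The function name and matching strategy here are illustrative, not sqlx's actual code:

```rust
/// Heuristic check for a stale server-side prepared statement.
/// Illustrative only: the real fix matches inside sqlx's error
/// handling, and matching a dedicated error code would be
/// preferable to substring matching.
fn is_stale_statement_error(sqlstate: &str, message: &str) -> bool {
    // MySQL reports SQLSTATE HY000 (general error) with a message
    // naming the unknown statement handler.
    sqlstate == "HY000" && message.contains("Unknown prepared statement handler")
}

fn main() {
    let msg = "Unknown prepared statement handler (3) given to mysql_stmt_precheck";
    assert!(is_stale_statement_error("HY000", msg));
    // Unrelated errors, e.g. constraint violations, must not match.
    assert!(!is_stale_statement_error("23000", "Duplicate entry '1' for key 'PRIMARY'"));
}
```

If a numeric code is preferred over the message text, MySQL raises this error as ER_UNKNOWN_STMT_HANDLER (error 1243).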

To avoid subsequent errors on the same connection, we can clear the
client-side statement cache, causing all queries to be re-prepared.
This is basically what ActiveRecord does,[0] and we can confirm
ActiveRecord is not vulnerable to the same issue. An even better
solution would be to re-prepare and retry the query on the spot, as
ActiveRecord does.

[0]: https://github.com/rails/rails/blob/main/activerecord/lib/active_record/connection_adapters/mysql2/database_statements.rb#L66-L78
I think it's best to test this issue through integration tests, but
there seems to be no way to clear the server-side prepared statements
without access to the raw packet stream, as they're not named and thus
cannot be deleted via DEALLOCATE PREPARED.

To make the test work, additional interfaces have to be made available,
which should not be shipped as part of the library for regular use.
sulami force-pushed the sulami/push-pmrnxvsoyqty branch from 5f8f07a to f1fd80a on August 24, 2025 at 23:06
sulami changed the title from "fix(sqlx-mysql): RFC Handle missing server-side prepared statements" to "fix(mysql): RFC Handle missing server-side prepared statements" on Aug 24, 2025
abonander (Collaborator) commented:

I'm not hostile to this change, but I would consider the wiping of prepared statements to be a huge bug on Aurora's part. We've run into issues with "compatible" providers not handling prepared statements correctly before, e.g. PgBouncer, and in many cases they've actually agreed it was a bug/misfeature and fixed it.

We can merge a fix for this, but I'd like to see at least a little pressure to get this fixed upstream.

sulami (Contributor, Author) commented Oct 5, 2025

I agree, it's very surprising behaviour, and there seems to be no way of opting out whatsoever if you're using Aurora. I'm happy to open a support ticket with AWS on our organisation's behalf/complain to our account manager, but I'm afraid we're not significant enough to be in a position to actually exert any meaningful pressure on AWS.

Fwiw, in the meantime we're using a cheap query in before_acquire which detects the same issue by virtue of being prepared itself, so when the zero-downtime patch occurs that query fails and incrementally wipes the connection pool. The nice part about this is that it actually rescues the query without any need for a retry, but at the cost of an extra round-trip for every acquire.
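The pool-level workaround can be illustrated without sqlx (in sqlx itself this would hang off `PoolOptions::before_acquire`): the pool validates each connection with a cheap prepared query before handing it out, and discards poisoned ones. A minimal sketch with hypothetical types:

```rust
/// Hypothetical connection whose server-side prepared statements may
/// have been wiped by a zero-downtime restart.
struct Conn {
    poisoned: bool,
}

impl Conn {
    /// Stand-in for executing a cheap prepared query like `SELECT 1`.
    /// Fails if the server-side statement has vanished.
    fn ping_prepared(&self) -> Result<(), &'static str> {
        if self.poisoned {
            Err("Unknown prepared statement handler")
        } else {
            Ok(())
        }
    }
}

/// Acquire a connection, validating it first; connections that fail
/// validation are dropped, incrementally flushing the pool after a
/// restart, at the cost of one extra round-trip per acquire.
fn acquire(pool: &mut Vec<Conn>) -> Option<Conn> {
    while let Some(conn) = pool.pop() {
        if conn.ping_prepared().is_ok() {
            return Some(conn);
        }
        // Validation failed: drop the connection instead of handing it out.
    }
    None
}

fn main() {
    let mut pool = vec![Conn { poisoned: false }, Conn { poisoned: true }];
    // The poisoned connection is skipped and discarded.
    let conn = acquire(&mut pool).expect("one healthy connection");
    assert!(!conn.poisoned);
    assert!(pool.is_empty());
}
```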

abonander (Collaborator) commented:

> I'm happy to open a support ticket with AWS on our organisation's behalf/complain to our account manager, but I'm afraid we're not significant enough to be in a position to actually exert any meaningful pressure on AWS.

That's fine, it generally takes many people complaining over time. As long as they're made aware of it. It's just become my pet peeve that all these third-party implementations advertise themselves as "compatible" with a database's wire protocol but then break spectacularly when you try to use any feature that isn't part of the basic query flow.

```rust
        .intersects(Status::SERVER_STATUS_IN_TRANS)
}

pub async fn nuke_cached_statements(&mut self) -> Result<(), Error> {
```
Collaborator commented:
Of course, this shouldn't just be a public method on the connection. Users might confuse it for the method that they're supposed to use to clear the prepared statement cache.

The easiest route would be to just mark this #[doc(hidden)].
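For reference, hiding a public method from generated docs looks like this; the type and body are stand-ins, not the actual sqlx connection:

```rust
/// Stand-in for the real connection type.
struct MySqlConnection;

impl MySqlConnection {
    /// Test-only escape hatch: `#[doc(hidden)]` keeps the method out
    /// of the rustdoc output so users don't mistake it for the
    /// supported cache-clearing API. (Sketch; the real method is
    /// async and actually deallocates server-side statements.)
    #[doc(hidden)]
    pub fn nuke_cached_statements(&mut self) -> Result<(), String> {
        Ok(())
    }
}

fn main() {
    let mut conn = MySqlConnection;
    // The method remains callable; it is only hidden from the docs.
    assert!(conn.nuke_cached_statements().is_ok());
}
```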

```diff
 // query response is a meta-packet which may be one of:
 // Ok, Err, ResultSet, or (unhandled) LocalInfileRequest
-let mut packet = self.inner.stream.recv_packet().await?;
+let mut packet = self.inner.stream.recv_packet().await.inspect_err(|_| {
```
Collaborator commented:
Yeah, if you could find the exact error code that's returned, that would probably be better. This is a huge pessimization.

Keep in mind we also get errors here for "normal" errors like constraint violations.

This would leak any valid prepared statements on the server side.

```diff
 // Ok, Err, ResultSet, or (unhandled) LocalInfileRequest
-let mut packet = self.inner.stream.recv_packet().await?;
+let mut packet = self.inner.stream.recv_packet().await.inspect_err(|_| {
+    // if a prepared statement vanished on the server side, we get an error here
```
Collaborator commented:
It would be good to link to the specific issue for context, if you created one (I didn't look).

Collaborator commented:
This PR is also fine.
