storage,admission: investigate read-only batch latency during high-volume snapshot ingest

**Describe the problem**

Experiment discussed internally [here](https://cockroachlabs.slack.com/archives/C01SRKWGHG8/p1664367194276249). When trying to reproduce snapshot-induced-latency-hits, using the roachtest added in https://github.com/cockroachdb/cockroach/pull/89191, we noticed that p99.9 latencies for read traffic over data that's not currently receiving snapshots see an increase. When looking at outlier traces, the time is spent entirely below pebble. There's little trace info from within pebble to understand why; this issue tracks investigating just that.

**To Reproduce**

Using #89191-ish:

![image](https://user-images.githubusercontent.com/10536690/195210822-748f7f58-54db-4fbe-a250-dbc78eeca0b6.png)

First red annotation is leases for foreground load being transferred to the node that's going to start receiving snapshots. Second red annotation is when it starts receiving snapshots, and service latencies start going through the roof. A set of outlier traces can be found here: [trace-snapshot-latency.tar.gz](https://github.com/cockroachdb/cockroach/files/9760137/trace-snapshot-latency.tar.gz). They look roughly like the one below:

![image](https://user-images.githubusercontent.com/10536690/195211050-deadac5e-37d4-4663-80f3-80cd1e7431ff.png)

+cc @andrewbaptist, @sumeerbhola.


Jira issue: CRDB-20434

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

storage,admission: investigate read-only batch latency during high-volume snapshot ingest #89788

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

storage,admission: investigate read-only batch latency during high-volume snapshot ingest #89788

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions