We are looking to improve the performance of gh-ost so that we can safely operate with larger database sizes. Our use case differs from GitHub's in that we do not use read-replicas. Some of our DBs also have very cyclic usage (i.e. only busy 9-5 M-F) and may have windows of free capacity.

I have a few ideas I wanted to run by you, since I'm sure some have come up before:
Feature | Context | Status |
---|---|---|
Use 8.0 ALGORITHM=INSTANT when applicable | MySQL 8.0 does not require gh-ost in some cases (sketch below). | Merged, thank you kindly for the review! |
Dynamic Chunk Size | gh-ost can observe the execution time of each chunk and dynamically increase the chunk size while it stays below a threshold (sketch below). For our environment (because we have a lot of replica tolerance) we typically run larger batch sizes, but have varying DB instance sizes. Being able to have this auto-tune is a win for us. | See PR here (and issue comment) |
Multi-threaded applier | Parallel replication apply is much better in MySQL 8.0, and combined with the fact that we don't use read-replicas, we can probably push more changes through the binlog than gh-ost currently does (sketch below). We think we can tolerate a few minutes of replica lag. Our limit is that Aurora restricts the relay log to ~1000M; if we exceed that, we reduce our DR capabilities. (Note: there's an earlier issue on this. It lacks the 8.0 parallel context, and the issue @shlomi-noach probably hit when he said it is slower is possibly this one? In any case, I've verified I can bulk-parallel insert with improved performance.) | Not started |
Defer Binary Log Apply | Currently gh-ost prioritizes applying the binary log ahead of copying rows. I actually think it's possible to track in memory only the primary keys discovered in the binary log, plus whether the last modification was a delete (a bool). If this is kept in a map, it can be applied after the copy is done (sketch below). The benefit of this change is most evident in workloads that tend to update the same rows. Edit: this optimization requires mem-comparable primary keys, so it won't work on varchar primary keys with collations. | Not started |
Resume from failure | I know there is a stale PR for this. This doesn't improve performance, but it's semi-related since some of our long-running DDLs fail. We also like to use daily pod-cycling on our k8s clusters, so having single processes that run for two weeks complicates our infra. | See branch here. |
Better ETA estimates | The current ETA estimator is based on estimatedTime - elapsedTime from the start of the copy. This skews poorly for larger tables, which become slower to insert into. As dynamic chunk sizing and throttling are introduced, it also doesn't respond well to changes. Ideally the estimate compares how many rows are left to copy against how many rows were copied in the last few minutes (sketch below). | See PR here |
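For context on the first row, here is a minimal sketch of the try-INSTANT-then-fall-back shape (not gh-ost's actual implementation). It assumes a go-sql-driver/mysql connection; error 1845 (ER_ALTER_OPERATION_NOT_SUPPORTED) indicates the requested algorithm can't be used, in which case a full row-copy migration is still needed:

```go
package sketch

import (
	"database/sql"
	"errors"
	"fmt"

	"github.com/go-sql-driver/mysql"
)

// tryInstantDDL attempts the ALTER with ALGORITHM=INSTANT and reports whether
// it succeeded, so the caller can fall back to a row-copy migration.
func tryInstantDDL(db *sql.DB, table, alterClause string) (bool, error) {
	stmt := fmt.Sprintf("ALTER TABLE %s %s, ALGORITHM=INSTANT", table, alterClause)
	if _, err := db.Exec(stmt); err != nil {
		var myErr *mysql.MySQLError
		// 1845 = ER_ALTER_OPERATION_NOT_SUPPORTED: this ALTER cannot run
		// with the requested algorithm; signal the caller to fall back.
		if errors.As(err, &myErr) && myErr.Number == 1845 {
			return false, nil
		}
		return false, err
	}
	return true, nil
}
```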
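For the dynamic chunk size, a sketch of the feedback loop described above: grow the chunk size while chunks complete under a target latency, back off when they exceed it. The function name, growth factor, and bounds are illustrative, not what the PR uses:

```go
package sketch

import "time"

const (
	minChunkSize  = 100
	maxChunkSize  = 100000
	targetLatency = 500 * time.Millisecond
)

// adjustChunkSize returns the next chunk size given how long the last chunk took.
func adjustChunkSize(current int64, lastChunkTook time.Duration) int64 {
	var next int64
	if lastChunkTook < targetLatency {
		next = current * 3 / 2 // grow by 50% while under target
	} else {
		next = current / 2 // back off quickly when over target
	}
	if next < minChunkSize {
		next = minChunkSize
	}
	if next > maxChunkSize {
		next = maxChunkSize
	}
	return next
}
```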
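For the multi-threaded applier, the main correctness constraint is that events touching the same row must stay ordered. One hedged sketch, assuming events are routed to a fixed worker by a hash of the primary key (all type and function names here are stand-ins):

```go
package sketch

import (
	"hash/fnv"
	"sync"
)

// BinlogEvent is a stand-in for a parsed row event (insert/update/delete).
type BinlogEvent struct {
	PrimaryKey string
	Apply      func() error
}

// parallelApplier fans events out to N workers, routing each event by a hash
// of its primary key so that events for the same row are applied in order.
type parallelApplier struct {
	queues []chan BinlogEvent
	wg     sync.WaitGroup
}

func newParallelApplier(workers int) *parallelApplier {
	a := &parallelApplier{queues: make([]chan BinlogEvent, workers)}
	for i := range a.queues {
		a.queues[i] = make(chan BinlogEvent, 1024)
		a.wg.Add(1)
		go func(q chan BinlogEvent) {
			defer a.wg.Done()
			for ev := range q {
				_ = ev.Apply() // error handling elided in this sketch
			}
		}(a.queues[i])
	}
	return a
}

// Submit routes an event to the worker owning its primary key.
func (a *parallelApplier) Submit(ev BinlogEvent) {
	h := fnv.New32a()
	h.Write([]byte(ev.PrimaryKey))
	a.queues[int(h.Sum32())%len(a.queues)] <- ev
}

// Close drains all queues and waits for the workers to finish.
func (a *parallelApplier) Close() {
	for _, q := range a.queues {
		close(q)
	}
	a.wg.Wait()
}
```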
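For the deferred binary log apply, a sketch of the in-memory structure described above; deleteFn and upsertFn are hypothetical stand-ins for the statements against the ghost table:

```go
package sketch

// deferredKeys tracks, per primary key seen in the binlog, only whether the
// most recent modification was a DELETE. After the row copy finishes, each
// key is resolved exactly once: deleted keys are removed from the ghost
// table, the rest are re-read from the source and upserted. Requires
// mem-comparable primary keys, as noted above.
type deferredKeys struct {
	lastWasDelete map[string]bool // key: encoded primary key
}

func newDeferredKeys() *deferredKeys {
	return &deferredKeys{lastWasDelete: make(map[string]bool)}
}

// observe records a binlog row event; later events overwrite earlier ones,
// so repeated updates to a hot row collapse into a single entry.
func (d *deferredKeys) observe(pk string, isDelete bool) {
	d.lastWasDelete[pk] = isDelete
}

// drain is called once the copy completes.
func (d *deferredKeys) drain(deleteFn, upsertFn func(pk string) error) error {
	for pk, wasDelete := range d.lastWasDelete {
		var err error
		if wasDelete {
			err = deleteFn(pk)
		} else {
			err = upsertFn(pk)
		}
		if err != nil {
			return err
		}
	}
	return nil
}
```

This is also why the win is largest for workloads that update the same rows: a row updated a thousand times during the copy costs one map write per event but only one statement at the end.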
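And for the ETA estimate, a sketch of the windowed-rate calculation; the function and its parameters are illustrative:

```go
package sketch

import "time"

// etaFromRecentRate estimates time remaining from the copy rate over a recent
// window rather than from total elapsed time, so it adapts to throttling and
// chunk-size changes instead of skewing on large tables.
func etaFromRecentRate(rowsCopiedInWindow int64, window time.Duration, rowsRemaining int64) time.Duration {
	if rowsCopiedInWindow <= 0 {
		return 0 // rate unknown; caller can display "N/A"
	}
	rowsPerSecond := float64(rowsCopiedInWindow) / window.Seconds()
	return time.Duration(float64(rowsRemaining) / rowsPerSecond * float64(time.Second))
}
```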
That's the raw idea list - there is a good chance we will be able to provide patches for some of these too, but I wanted to check in first so we can discuss. Maybe you have a few ideas of your own too? :-)