proc_dff: bit-granularity optimizations and refactoring #4781

georgerennie · 2024-11-28T17:03:40Z

proc_dff converts processes (sets of sync rules) into flip-flops through what is essentially structural pattern matching. As part of this, it tries to optimize the sync rules so that it can produce simpler flip-flops (e.g. $adff and $dff instead of $aldff and $dffsr). These optimizations were previously applied in a fairly ad hoc manner, sprinkled throughout the inference, which made them hard to improve or to trust as correct, and limited how far they could be applied. They operated on full signals as found in the LHS of sync rules, and so could miss optimizations where different parts of a signal are best matched by different flip-flops.

This was motivated by a pattern I saw with sv2v, where a struct is lowered to one wire and may be partially reset even though the whole thing is assigned at once. The example below was being lowered to an $aldff, even though it is really just the combination of a $dffe and an $adff on different parts of the signal.

module top(input wire clk, input wire rst, output reg [7:0] q, input wire [7:0] d);
always @(posedge clk or posedge rst) begin
	if (rst) q[3:0] <= '0;
	else     q <= d;
end
endmodule

This PR refactors proc_dff into three steps that are iterated while there are still signals needing DFFs: extracting the relevant sync rules from the process, optimizing them, and then generating flip-flop cells. The optimizations narrow the width of the signal that the flop is currently being generated for to the largest range of bits, starting at the LSB, that can have all the same optimizations applied as the LSB. This means that range is as optimized as it can be. The bits removed by this narrowing are not deleted from the process, so they are considered as a target in the next iteration. It is probably easiest to see the optimizations and choices of flip-flops by looking at the code, which should be fairly well documented. For standard use cases this should give basically the same results as before this change; it just supports more corner cases.
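The LSB-first narrowing described above can be modelled in a few lines of Python. This is a toy sketch, not the code in the PR: the per-bit labels (`const_reset`, `held_in_reset`), the `FLOP_FOR_KIND` table and the function name are hypothetical, and the real pass works on RTLIL sync rules rather than on a list of strings.

```python
def split_by_lsb_prefix(bit_kinds):
    """Split a signal into chunks, always starting from the current LSB.

    bit_kinds[i] is a (hypothetical) label describing which optimizations
    apply to bit i. Each iteration takes the longest run of bits, starting
    at the current LSB, that shares the LSB's label; the remaining bits are
    left for the next iteration.
    """
    chunks = []
    lsb = 0
    while lsb < len(bit_kinds):
        kind = bit_kinds[lsb]
        width = 1
        while lsb + width < len(bit_kinds) and bit_kinds[lsb + width] == kind:
            width += 1
        chunks.append((lsb, width, kind))
        lsb += width
    return chunks


# Hypothetical labels for the example module `top`: q[3:0] is reset to a
# constant, while q[7:4] is not assigned in the reset branch.
FLOP_FOR_KIND = {"const_reset": "$adff", "held_in_reset": "$dffe"}

bit_kinds = ["const_reset"] * 4 + ["held_in_reset"] * 4
chunks = split_by_lsb_prefix(bit_kinds)
for lsb, width, kind in chunks:
    print(f"q[{lsb + width - 1}:{lsb}] -> {FLOP_FOR_KIND[kind]}")
```

For the example module this prints `q[3:0] -> $adff` followed by `q[7:4] -> $dffe`, matching the split described in the PR text.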

This PR also fixes an issue in opt_dff where sigmap wasn't being used, so it would fail to fold some muxes into enable signals. This caused test failures with the proc_dff changes. The PR also adds test cases for the new proc_dff optimizations.
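To illustrate why a missing sigmap step can hide a mux, here is a toy Python model. It is not the Yosys `SigMap` API: `AliasMap`, the bit names and the `mux_by_output` table are all made up for illustration. The point is only that a lookup keyed on canonical bit names fails when queried with an alias.

```python
class AliasMap:
    """Minimal union-find that maps connected bit names to one canonical name."""

    def __init__(self):
        self.parent = {}

    def find(self, bit):
        self.parent.setdefault(bit, bit)
        while self.parent[bit] != bit:
            # Path halving: point this bit at its grandparent as we walk up.
            self.parent[bit] = self.parent[self.parent[bit]]
            bit = self.parent[bit]
        return bit

    def connect(self, canonical, alias):
        # Record that `alias` is the same net as `canonical`.
        self.parent[self.find(alias)] = self.find(canonical)


amap = AliasMap()
amap.connect("mux_y", "d_wire")  # the flop's D input is an alias of the mux output

# Table of mux cells indexed by their canonical output bit.
mux_by_output = {amap.find("mux_y"): "mux0"}

raw_hit = mux_by_output.get("d_wire")                   # alias used directly
canonical_hit = mux_by_output.get(amap.find("d_wire"))  # canonicalized first
print(raw_hit, canonical_hit)
```

The raw lookup misses the mux, while the canonicalized lookup finds it. That is the kind of failure the opt_dff fix addresses, under the assumptions of this simplified model.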

To test this, it would be good to run reasonably sized Verilog designs (ideally with async resets) through proc and check that the inferred flops are not a regression from previous Yosys. I believe Amaranth doesn't use sync processes, so read_verilog ~~and yosys-slang~~ is probably the main interface affected by this.

georgerennie · 2024-11-28T22:36:19Z

As a side note, I think there are other bits of proc that could do with a bit of tidying up and being adapted to cover more general patterns. Maybe I'll have a look at proc_arst at some point...

jix · 2025-09-01T12:17:10Z

As far as I can tell, the only thing missing is some more testing with representative async reset Verilog designs to rule out regressions. @georgerennie have you been using this PR or done any more testing using it in the meantime?

georgerennie · 2025-09-05T14:11:16Z

I have been using a build of Yosys with this PR on a number of test designs (quite a few converted with sv2v, which is what this was meant to address). I haven't found any issues with it so far. I can add some more tests, and I'll also have a look over the code again; it's been long enough since I wrote it that I am almost approaching it with the eyes of a fresh reviewer!

* Instead of an ad hoc mix of optimizations and inferences, this tries to make it more principled by first extracting a set of asynchronous update rules from the process, then optimizing them before lowering them to a concrete flip-flop type, preferring simpler ones
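As a rough illustration of "lowering to a concrete flip-flop type, preferring simpler ones", the sketch below picks a cell type from a set of already-optimized async rules. It is a simplified decision table written for this thread, not the logic in the PR: `AsyncRule`, `pick_flop` and the exact conditions are assumptions, and enable variants are left out.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class AsyncRule:
    trigger: str          # name of the async trigger signal (hypothetical)
    value_is_const: bool  # True if the rule assigns a constant


def pick_flop(async_rules: List[AsyncRule]) -> str:
    """Pick the simplest cell type that can express the given async rules."""
    if not async_rules:
        return "$dff"    # no async behaviour left after optimization
    if len(async_rules) == 1:
        # A constant async value fits an async reset; otherwise an async load is needed.
        return "$adff" if async_rules[0].value_is_const else "$aldff"
    return "$dffsr"      # fall back to the most general set/reset cell


print(pick_flop([]))
print(pick_flop([AsyncRule("rst", True)]))
print(pick_flop([AsyncRule("rst", False)]))
```

The earlier the optimizations can remove a rule or prove its value constant, the further up this table the lowering can stop.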

georgerennie · 2025-09-05T16:33:24Z

There's a mild performance degradation (maybe a few percent for just this pass). I think perf can be improved by precalculating the set of disjoint lvalues rather than calculating them for each DFF, which requires a lot of sorting of the sync rule lvalues. I am working on a patch for that, but it's probably easier to put it in a different PR.

povik · 2025-09-05T16:43:48Z

> I believe Amaranth doesn't use sync processes so read_verilog and yosys-slang are probably the main interfaces affected by this.

Let me leave a note that yosys-slang has retired the use of sync processes.

jix · 2025-09-08T15:45:36Z

> I am working on a patch for that, but it's probably easier to put it in a different PR.

Does that mean that what you just pushed to this PR is ready to be merged? Or are you still in the process of reviewing it and adding tests? (It's fine if you need more time, I just want to avoid this PR sitting around for months again in case it actually is ready.)

georgerennie · 2025-09-08T16:13:06Z

> Does that mean that what you just pushed to this PR is ready to be merged? Or are you still in the process of reviewing it and adding tests? (It's fine if you need more time, I just want to avoid this PR sitting around for months again in case it actually is ready.)

In the process of trying to improve that, I've found what I think is a better way of approaching the optimizations as a whole, which happens to speed up proc_dff 4-7x on the designs I ran it on. I will therefore put it in this PR after all, so there is no point reviewing until I do. I will mark the PR as draft until I push that, to make this clear.

georgerennie force-pushed the george/proc_dff_improvements branch from c18fbd4 to 2780875 Compare November 28, 2024 18:04

georgerennie marked this pull request as ready for review November 28, 2024 22:56

georgerennie force-pushed the george/proc_dff_improvements branch 2 times, most recently from 673b024 to 8fc9e71 Compare November 28, 2024 23:02

georgerennie mentioned this pull request Dec 2, 2024

Add Optimization Barriers #4763

Open

georgerennie mentioned this pull request Feb 16, 2025

Optimize out self-assignment from DFF reset patterns povik/yosys-slang#87

Open

ShinyKate assigned jix Aug 25, 2025

georgerennie force-pushed the george/proc_dff_improvements branch from 390ff55 to b9b225b Compare September 5, 2025 16:30

georgerennie requested review from KrystalDelusion and mmicko as code owners September 5, 2025 16:30

georgerennie added 9 commits September 5, 2025 17:31

opt_dff: sigmap bits before looking up muxes

e4cf288

proc_dff: split constant and non-constant resets into different flops

a07ec22

proc_dff: optimize self-assignment at bit granularity

da8f1e4

proc_dff: optimize repeated values at bit granularity

a8b77ae

tests: add more complicated proc_dff tests

5c481a4

proc_dff: refactor shrinking logic in optimizers

1151ff0

proc_dff: use invoke_result_t instead of result_of

2cd393d

proc_dff: comments and NULL->nulptr

b9e4418

georgerennie force-pushed the george/proc_dff_improvements branch from b9b225b to b9e4418 Compare September 5, 2025 16:31

georgerennie marked this pull request as draft September 8, 2025 16:13
