synth: keep mocked memories by default

oharboe · oharboe · commit 916b9de6d9bd · 2025-11-09T10:27:27.000+01:00
Signed-off-by: Øyvind Harboe &lt;oyvind.harboe@zylin.com&gt;
diff --git a/docs/user/FlowVariables.md b/docs/user/FlowVariables.md
@@ -242,10 +242,11 @@ configuration file.
 | <a name="SYNTH_HDL_FRONTEND"></a>SYNTH_HDL_FRONTEND| Select an alternative language frontend to ingest the design. Available option is "slang". If the variable is empty, design is read with the Yosys read_verilog command.| |
 | <a name="SYNTH_HIERARCHICAL"></a>SYNTH_HIERARCHICAL| Enable to Synthesis hierarchically, otherwise considered flat synthesis.| 0|
 | <a name="SYNTH_HIER_SEPARATOR"></a>SYNTH_HIER_SEPARATOR| Separator used for the synthesis flatten stage.| .|
+| <a name="SYNTH_KEEP_MOCKED_MEMORIES"></a>SYNTH_KEEP_MOCKED_MEMORIES| Keeping the mocked memories(not flattening them), preserves some of the access logic complexity and avoids optimizations outside of the mocked memory.| 1|
 | <a name="SYNTH_KEEP_MODULES"></a>SYNTH_KEEP_MODULES| Mark modules to keep from getting removed in flattening.| |
 | <a name="SYNTH_MEMORY_MAX_BITS"></a>SYNTH_MEMORY_MAX_BITS| Maximum number of bits for memory synthesis. Ideally, real RAM or realistic fakeram should be used for RAMs much larger than 1024 bits. To temporarily ignore the RAM concerns and investigate other aspects of the design, consider setting `SYNTH_MOCK_LARGE_MEMORIES=1`, or adjusting `SYNTH_MEMORY_MAX_BITS`.| 4096|
 | <a name="SYNTH_MINIMUM_KEEP_SIZE"></a>SYNTH_MINIMUM_KEEP_SIZE| For hierarchical synthesis, we keep modules of larger area than given by this variable and flatten smaller modules. The area unit used is the size of a basic nand2 gate from the platform's standard cell library. The default value is platform specific.| 0|
-| <a name="SYNTH_MOCK_LARGE_MEMORIES"></a>SYNTH_MOCK_LARGE_MEMORIES| Reduce memories larger than SYNTH_MEMORY_MAX_BITS to 1 row. This is useful and convenient to separate the concern of instantiating and placing memories from investigating other issues with a design, though it comes at the expense of the increased accuracy that using realistic fakemem would provide. Memories with a single 1 row will of course have unrealistically good timing and area characteristics, but timing will still correctly terminate in a register. Large port memories, typically register files, will still have the retain a lot of the port logic that can be useful to investigate issues. Consider using SYNTH_KEEP_MODULES to keep the modules of the mocked memories so that code outside the mocked memories is not optimized as a consequence of mocking a memory, yielding better insight into issues running the rest of the design through the ORFS flow.| 0|
+| <a name="SYNTH_MOCK_LARGE_MEMORIES"></a>SYNTH_MOCK_LARGE_MEMORIES| Reduce Yosys inferred memories larger than SYNTH_MEMORY_MAX_BITS to 1 row. Yosys will generally infer memories from behavioral Verilog code, whether the memories are in standalone modules or instantiated within some larger module. fakeram and empty Verilog memories(blackboxes) of memories will not be inferred memories by Yosys and are therefore not affected by this variable. This is useful and convenient to separate the concern of instantiating and placing memories from investigating other issues with a design, though it comes at the expense of the increased accuracy that using realistic fakemem would provide. Memories with a single 1 row will of course have unrealistically good timing and area characteristics, but timing will still correctly terminate in a register. Large port memories, typically register files, will still have the retain a lot of the port logic that can be useful to investigate issues. This can be especially useful during development of designs where the behavioral model comes first and suitable memories are matched up when the design RTL is stable. A typical use case would be Chisel which will generate a behavioral model for a memories with the required clocks, ports, etc. in addition to a computer readable file with the specification of the memories that is used to [automatically](https://chipyard.readthedocs.io/en/stable/Tools/Barstools.html/) match up suitable memory macros later in the flow. During an architectural screening study, a large range of memory configurations can be investigated quickly with this option, without getting bogged down in the concern of how to realize the memories in silicon for emphemral RTL configurations that exist only long enough to run through the ORFS flow to create a table of some characteristics of a design configuration.| 0|
 | <a name="SYNTH_NETLIST_FILES"></a>SYNTH_NETLIST_FILES| Skips synthesis and uses the supplied netlist files. If the netlist files contains duplicate modules, which can happen when using hierarchical synthesis on indvidual netlist files and combining here, subsequent modules are silently ignored and only the first module is used.| |
 | <a name="SYNTH_OPT_HIER"></a>SYNTH_OPT_HIER| Optimize constants across hierarchical boundaries.| |
 | <a name="SYNTH_RETIME_MODULES"></a>SYNTH_RETIME_MODULES| *This is an experimental option and may cause adverse effects.* *No effort has been made to check if the retimed RTL is logically equivalent to the non-retimed RTL.* List of modules to apply automatic retiming to. These modules must not get dissolved and as such they should either be the top module or be included in SYNTH_KEEP_MODULES. The main use case is to quickly identify if performance can be improved by manually retiming the input RTL. Retiming will treat module ports like register endpoints/startpoints. The objective function of retiming isn't informed by SDC, even the clock period is ignored. As such, retiming will optimize for best delay at potentially high register number cost. Automatic retiming can produce suboptimal results as its timing model is crude and it doesn't find the optimal distribution of registers on long pipelines. See OR discussion #8080.| |
@@ -282,6 +283,7 @@ configuration file.
 - [SYNTH_GUT](#SYNTH_GUT)
 - [SYNTH_HDL_FRONTEND](#SYNTH_HDL_FRONTEND)
 - [SYNTH_HIERARCHICAL](#SYNTH_HIERARCHICAL)
+- [SYNTH_KEEP_MOCKED_MEMORIES](#SYNTH_KEEP_MOCKED_MEMORIES)
 - [SYNTH_KEEP_MODULES](#SYNTH_KEEP_MODULES)
 - [SYNTH_MEMORY_MAX_BITS](#SYNTH_MEMORY_MAX_BITS)
 - [SYNTH_MINIMUM_KEEP_SIZE](#SYNTH_MINIMUM_KEEP_SIZE)
diff --git a/flow/designs/sky130hd/microwatt/config.mk b/flow/designs/sky130hd/microwatt/config.mk
@@ -42,14 +42,6 @@ ifeq ($(SYNTH_MOCK_LARGE_MEMORIES),1)
     # These module names comes from the error report when setting SYNTH_MEMORY_MAX_BITS=2048
     # and SYNTH_MOCK_LARGE_MEMORIES=0
     #
-    # Keeping them avoids mocking them away, which would lead to further optimizations
-    # that would obscure what is going on in the rest of the design.
-    export SYNTH_KEEP_MODULES=decode1_0_bf8b4530d8d246dd74ac53a13471bba17941dff7 \
-      decode1_0_bf8b4530d8d246dd74ac53a13471bba17941dff7 \
-      fpu \
-      decode1_0_bf8b4530d8d246dd74ac53a13471bba17941dff7 \
-      decode1_0_bf8b4530d8d246dd74ac53a13471bba17941dff7 \
-      decode1_0_bf8b4530d8d246dd74ac53a13471bba17941dff7
     # The goal is to run through the flow quickly to learn what we can
     # about the design without getting bogged down in memory issues.
     export SYNTH_MEMORY_MAX_BITS ?= 1024
diff --git a/flow/scripts/synth.tcl b/flow/scripts/synth.tcl
@@ -59,6 +59,7 @@ if { !$::env(SYNTH_HIERARCHICAL) } {
 if { $::env(SYNTH_MOCK_LARGE_MEMORIES) } {
   memory_collect
   set select [tee -q -s result.string select -list t:\$mem_v2]
+  set report_file [open $::env(REPORTS_DIR)/synth_mocked_memories.txt "w"]
   foreach path [split [string trim $select] "\n"] {
     set index [string first "/" $path]
     set module [string range $path 0 [expr { $index - 1 }]]
@@ -71,8 +72,15 @@ if { $::env(SYNTH_MOCK_LARGE_MEMORIES) } {
     if { $nbits > $::env(SYNTH_MEMORY_MAX_BITS) } {
       rtlil::set_param -uint $module $instance SIZE 1
       puts "Shrunk memory $path from $size rows to 1"
+      puts -nonewline $report_file "$module:\n  width: $width\n  size: $size\n"
+      if { $::env(SYNTH_KEEP_MOCKED_MEMORIES) } {
+        select -module $module
+        setattr -mod -set keep_hierarchy 1
+        select -clear
+      }
     }
   }
+  close $report_file
 }
 
 json -o $::env(RESULTS_DIR)/mem.json
diff --git a/flow/scripts/variables.yaml b/flow/scripts/variables.yaml
@@ -186,7 +186,15 @@ SYNTH_MEMORY_MAX_BITS:
     - synth
 SYNTH_MOCK_LARGE_MEMORIES:
   description: >
-    Reduce memories larger than SYNTH_MEMORY_MAX_BITS to 1 row.
+    Reduce Yosys inferred memories larger than SYNTH_MEMORY_MAX_BITS to 1 row.
+
+    Yosys will generally infer memories from behavioral Verilog code, whether
+    the memories are in standalone modules or instantiated within some
+    larger module.
+
+    fakeram and empty Verilog memories(blackboxes) of memories will not
+    be inferred memories by Yosys and are therefore not affected by
+    this variable.
 
     This is useful and convenient to separate the concern of instantiating
     and placing memories from investigating other issues with a design,
@@ -200,13 +208,34 @@ SYNTH_MOCK_LARGE_MEMORIES:
     Large port memories, typically register files, will still have the
     retain a lot of the port logic that can be useful to investigate issues.
 
-    Consider using SYNTH_KEEP_MODULES to keep the modules of the mocked
-    memories so that code outside the mocked memories is not
-    optimized as a consequence of mocking a memory, yielding better insight
-    into issues running the rest of the design through the ORFS flow.
+    This can be especially useful during development of designs where the
+    behavioral model comes first and suitable memories are matched up
+    when the design RTL is stable.
+
+    A typical use case would be Chisel which will generate a behavioral model
+    for a memories with the required clocks, ports, etc. in addition to a
+    computer readable file with the specification of the memories that
+    is used to
+    [automatically](https://chipyard.readthedocs.io/en/stable/Tools/Barstools.html/)
+    match up suitable memory macros later in the flow.
+
+    During an architectural screening study, a large range of memory
+    configurations can be investigated quickly with this option,
+    without getting bogged down in the concern of how to realize the
+    memories in silicon for emphemral RTL configurations that exist
+    only long enough to run through the ORFS flow to create a table
+    of some characteristics of a design configuration.
   default: 0
   stages:
     - synth
+SYNTH_KEEP_MOCKED_MEMORIES:
+  description: >
+    Keeping the mocked memories(not flattening them), preserves some of
+    the access logic complexity and avoids optimizations outside
+    of the mocked memory.
+  default: 1
+  stages:
+    - synth
 SYNTH_HDL_FRONTEND:
   description: >
     Select an alternative language frontend to ingest the design. Available option