Is your feature request related to a problem? Please describe.
Across different runs, the DFA generated by the formula is not the same. That changes the observation space the agent learns on.
Describe the solution you'd like
A way to recover the previous DFA by implementing a proper serialization method.
Describe alternatives you've considered
Additional context