Recently, I discovered that one terabyte of my space is occupied by different AdvancedTreeSearchLmImageAndGlobalCacheJobs that differ only by transition distortion penalty (TDP) values. However, all TDPs are part of our dynamic search procedure meaning that they are not relevant for the static search space and the global cache.
I am opening this issue to discuss the most clean way to exclude this or other parameters from the hash calculation. After a short discussion with @vieting, we think there are two different scenarios one can explore:
- Pass two separate
crps to the AdvancedTreeSearchJob, where the additional crp, if not None, is used here. Otherwise the old behavior is performed.
- Change the hash calculation here, with two possible options, and by using a flag that keeps the default behavior
- Deleted all non-relavant parameters from the
RasrConfig if a default value within RASR is defined
- Set some constant value where no default value is available
Please let me know what are your thoughts, before I prepare a PR.