Training REINVENT Prior on Large Cyclic Peptides (≈1200 Da) #307
-
|
Hello REINVENT team, I’m currently trying to train the reinvent.prior on a dataset of a relatively large cyclic peptides (~1200 Da, 80-100 heavy atoms) essentially polymyxin-like macrocycles. "default" filter: CC(C)C[C@@h]1NC(=O)C@@HNC(=O)C@HNC(=O)C@@HCCNC(=O)C@HNC(=O)C@HNC(=O)C@HNC1=O is invalid I believe this happens because the built-in RDKitFilter has a hard-coded max_heavy_atoms limit, so my macrocyclic peptides fall slightly outside the training domain of the standard prior and get dropped before training starts. What I’d like to ask Is there a way to turn off the default molecule filter completely or adjust its thresholds and is reinvent a suitable prior for that or should i switch to a different prior ? Thanks for your time and for maintaining such a great generative framework |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment
-
|
Hi, You can switch off the standardization step with [parameters]
standardize_smiles = falseCheers, |
Beta Was this translation helpful? Give feedback.
Hi,
You can switch off the standardization step with
Cheers,
Hannes.