Skip to content

[BUG] Conformer generation struggles with organometallics like HEM #110

@ljarosch

Description

@ljarosch

Describe the bug
Inference queries that use organometallics such as HEM groups will run into conformer timeout problems. This is because RDKit by default cannot seem to properly handle conformer generation for such compounds.

To Reproduce

from openfold3.core.data.primitives.structure.query import structure_with_ref_mol_from_ccd_code

structure_with_ref_mol_from_ccd_code("HEM", chain_id="A")

> ConformerGenerationError: Failed to generate 3D coordinates

Expected behavior
A conformer should be generated without failure.

Additional context
Our training pipeline makes use of input sanitization by pdbeccdutils, which solves these conformer generation issues. Because of that, this issue is inference-specific.

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't workinginferenceRelating to the inference pipeline

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions