Skip to content

Commit defee41

Browse files
committed
Add EnzyMM preprint
1 parent 876fc0c commit defee41

File tree

1 file changed

+40
-0
lines changed

1 file changed

+40
-0
lines changed
Lines changed: 40 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,40 @@
1+
---
2+
layout: paper
3+
title: "Investigating Enzyme Function by Geometric Matching of Catalytic Motifs"
4+
nickname: 2026-02-12-hackett-investigating-enzyme-function
5+
authors: "Hackett RE, Riziotis IG, Larralde M, Ribeiro AJM, Zeller G, Thornton J"
6+
year: "2026"
7+
journal: "biorxiv"
8+
volume:
9+
issue:
10+
pages:
11+
is_published: false
12+
image: /assets/images/papers/biorxiv.png
13+
projects:
14+
tags: ["preprint"]
15+
16+
# Text
17+
fulltext:
18+
pdf:
19+
pdflink:
20+
pmcid:
21+
preprint: https://doi.org/10.64898/2026.02.10.705182
22+
supplement:
23+
24+
# Links
25+
doi: "10.64898/2026.02.10.705182"
26+
pmid:
27+
28+
# Data and code
29+
github: https://github.com/RayHackett/enzymm
30+
neurovault:
31+
openneuro:
32+
figshare:
33+
figshare_names:
34+
osf:
35+
---
36+
{% include JB/setup %}
37+
38+
# Abstract
39+
40+
The rapidly growing universe of predicted protein structures offers opportunities for data driven exploration but requires computationally scalable and interpretable tools(1-3). We developed a method to detect catalytic features in protein structures, providing insights into enzyme function and mechanism. A library of 6780 3D coordinate sets describing enzyme catalytic sites, referred to as templates, has been collected from manually curated examples of 762 enzyme catalytic mechanisms described in the Mechanism and Catalytic Site Atlas(4-6). For template searching we optimised the geometric-matching algorithm Jess(7). We implemented RMSD and residue orientation filters to differentiate catalytically informative matches from spurious ones. We validated this approach on a non-redundant set of high quality experimental (n=3751, <40% amino acid identity) enzyme structures with well annotated catalytic sites as well as predicted structures of the human proteome. We show matching catalytic templates solely on structure is more sensitive than sequence- and 3D-structure-based approaches in identifying homology between distantly related enzymes. Since geometric matching does not depend on conserved sequence motifs or even common evolutionary history, we are able to identify examples of structural active site similarity in highly divergent and possibly convergent enzymes(8). Such examples make interesting case studies into the evolution of enzyme function. Though not intended for characterizing substrate-specific binding pockets, the speed and knowledge-driven interpretability of our method make it well suited for expanding enzyme active-site annotation across large predicted proteomes. We provide the method and template library as a Python module, Enzyme Motif Miner, at https://github.com/rayhackett/enzymm.

0 commit comments

Comments
 (0)