Skip to content

Commit 514af71

Browse files
authored
Merge pull request #30 from haddocking/pdbe-length-filter
Add nr residue filter to prune pdbs command
2 parents 28956e6 + 8fd065e commit 514af71

File tree

14 files changed

+1901
-782
lines changed

14 files changed

+1901
-782
lines changed

README.md

Lines changed: 27 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -8,6 +8,25 @@
88

99
Python package to detect proteins in EM density maps.
1010

11+
It uses
12+
13+
- [Uniprot Sparql endpoint](https://sparql.uniprot.org/) to search for proteins and their measured or predicted 3D structures.
14+
- [powerfit](https://pypi.org/project/powerfit-em/) to fit protein structure in a Electron Microscopy (EM) density map.
15+
16+
An example workflow:
17+
18+
```mermaid
19+
graph LR;
20+
search{Search UniprotKB} --> |uniprot_accessions|fetchpdbe{Retrieve PDBe}
21+
search{Search UniprotKB} --> |uniprot_accessions|fetchad{Retrieve AlphaFold}
22+
fetchpdbe -->|mmcif_files| residuefilter{Filter on nr residues + write chain A}
23+
fetchad -->|pdb_files| densityfilter{Filter out low confidence}
24+
residuefilter -->|pdb_files| powerfit
25+
densityfilter -->|pdb_files| powerfit
26+
powerfit -->|*/solutions.out| solutions{Best scoring solutions}
27+
solutions -->|dataframe| fitmodels{Fit models}
28+
```
29+
1130
## Install
1231

1332
```shell
@@ -23,6 +42,7 @@ pip install git+https://github.com/haddocking/protein-detective.git
2342

2443
The main entry point is the `protein-detective` command line tool which has multiple subcommands to perform actions.
2544

45+
2646
To use programmaticly, see the [notebooks](docs/notebooks) and [API documentation](https://www.bonvinlab.org/protein-detective/autoapi/summary/).
2747

2848
### Search Uniprot for structures
@@ -68,7 +88,10 @@ protein-detective density-filter \
6888
Make PDBe files smaller by only keeping first chain of found uniprot entry and renaming to chain A.
6989

7090
```shell
71-
protein-detective prune-pdbs ./mysession
91+
protein-detective prune-pdbs \
92+
--min-residues 100 \
93+
--max-residues 1000 \
94+
./mysession
7295
```
7396

7497
### Powerfit
@@ -94,9 +117,9 @@ protein-detective powerfit report docs/session1
94117
Outputs something like:
95118

96119
```
97-
powerfit_run_id,structure,rank,cc,fishz,relz,translation,rotation,density_filter_id,pdb_id,pdb_file,uniprot_acc
98-
10,A8MT69_pdb4e45.ent_B2A,1,0.432,0.463,10.091,227.18:242.53:211.83,0.0:1.0:1.0:0.0:0.0:1.0:1.0:0.0:0.0,,4E45,docs/session1/single_chain/A8MT69_pdb4e45.ent_B2A.pdb,A8MT69
99-
10,A8MT69_pdb4ne5.ent_B2A,1,0.423,0.452,10.053,227.18:242.53:214.9,0.0:-0.0:-0.0:-0.604:0.797:0.0:0.797:0.604:0.0,,4NE5,docs/session1/single_chain/A8MT69_pdb4ne5.ent_B2A.pdb,A8MT69
120+
powerfit_run_id,structure,rank,cc,fishz,relz,translation,rotation,pdb_id,pdb_file,uniprot_acc
121+
10,A8MT69_pdb4e45.ent_B2A,1,0.432,0.463,10.091,227.18:242.53:211.83,0.0:1.0:1.0:0.0:0.0:1.0:1.0:0.0:0.0,4E45,docs/session1/single_chain/A8MT69_pdb4e45.ent_B2A.pdb,A8MT69
122+
10,A8MT69_pdb4ne5.ent_B2A,1,0.423,0.452,10.053,227.18:242.53:214.9,0.0:-0.0:-0.0:-0.604:0.797:0.0:0.797:0.604:0.0,4NE5,docs/session1/single_chain/A8MT69_pdb4ne5.ent_B2A.pdb,A8MT69
100123
...
101124
```
102125

0 commit comments

Comments
 (0)