Skip to content

Commit 962dcda

Browse files
committed
Add DeepDrug AI's DPEB dataset
1 parent c406f6a commit 962dcda

File tree

1 file changed

+43
-0
lines changed

1 file changed

+43
-0
lines changed

datasets/deepdrug-dpeb.yml

Lines changed: 43 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,43 @@
1+
Name: DeepDrug Protein Embeddings Bank (DPEB)
2+
Description: DPEB is a multimodal database of human protein embeddings integrating four biologically complementary representations—AlphaFold2, BioEmbeddings, ESM-2, and ProtVec—designed for enhanced protein-protein interaction prediction and functional classification.
3+
Documentation: https://github.com/deepdrugai/DPEB
4+
Contact: https://github.com/deepdrugai/DPEB/issues
5+
ManagedBy: "Louisiana State University"
6+
UpdateFrequency: Initial release; maintained for at least 2 years with updates planned based on new embedding models and protein coverage.
7+
Tags:
8+
- bioinformatics
9+
- protein
10+
- structural biology
11+
- machine learning
12+
- life sciences
13+
License: MIT
14+
Citation: "Sajol MSI et al. DeepDrug Protein Embeddings Bank (DPEB) was accessed on [DATE] at https://registry.opendata.aws/dpeb"
15+
Resources:
16+
- Description: Multimodal human protein embeddings (AlphaFold2, BioEmbeddings, ESM-2, ProtVec) with JSONL-formatted metadata containing FASTA, UniProt IDs, and embeddings.
17+
ARN: arn:aws:s3:::deepdrug-dpeb-human-protein-embeddings
18+
Region: us-cst-1
19+
Type: S3 Bucket
20+
Explore:
21+
- "https://github.com/deepdrugai/DPEB"
22+
DataAtWork:
23+
Tutorials:
24+
- Title: Aggregating and Clustering AlphaFold2 Embeddings from DPEB
25+
URL: https://github.com/deepdrugai/DPEB/tree/main
26+
NotebookURL: https://github.com/deepdrugai/DPEB/tree/main/tutorial
27+
AuthorName: Md. Saiful Islam Sajol
28+
AuthorURL: https://github.com/deepdrugai
29+
Services: EC2
30+
31+
Tools & Applications:
32+
- Title: DPEB Explorer Tool
33+
URL: https://github.com/deepdrugai/DPEB
34+
AuthorName: DeepDrug Lab
35+
AuthorURL: https://github.com/deepdrugai
36+
Publications:
37+
- Title: A Multimodal Human Protein Embeddings Database: DeepDrug Protein Embeddings Bank (DPEB)
38+
URL: https://doi.org/10.XXXX/nar.dpeb2025
39+
AuthorName: Sajol MSI, Rajasekaran M, Bess A, Alvin C, Mukhopadhyay S
40+
AuthorURL: https://github.com/deepdrugai/DPEB
41+
ADXCategories:
42+
- Life Sciences
43+
- Artificial Intelligence

0 commit comments

Comments
 (0)