Skip to content

Commit 8eb570e

Browse files
authored
ok: Merge pull request #2873 from bessx/main
Add DeepDrug AI's DPEB dataset
2 parents c406f6a + 1b20036 commit 8eb570e

File tree

1 file changed

+38
-0
lines changed

1 file changed

+38
-0
lines changed

datasets/deepdrug-dpeb.yaml

Lines changed: 38 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,38 @@
1+
Name: DeepDrug Protein Embeddings Bank (DPEB)
2+
Description: DPEB is a multimodal database of human protein embeddings integrating four biologically complementary representations—AlphaFold2, BioEmbeddings, ESM-2, and ProtVec—designed for enhanced protein-protein interaction prediction and functional classification.
3+
Documentation: https://github.com/deepdrugai/DPEB
4+
Contact: https://github.com/deepdrugai/DPEB/issues
5+
ManagedBy: "Louisiana State University"
6+
UpdateFrequency: Initial release; maintained for at least 2 years with updates planned based on new embedding models and protein coverage.
7+
Tags:
8+
- bioinformatics
9+
- protein
10+
- structural biology
11+
- machine learning
12+
- life sciences
13+
- aws-pds
14+
License: MIT
15+
Citation: "Sajol MSI et al. DeepDrug Protein Embeddings Bank (DPEB) was accessed on [DATE] at https://registry.opendata.aws/dpeb"
16+
Resources:
17+
- Description: Multimodal human protein embeddings (AlphaFold2, BioEmbeddings, ESM-2, ProtVec) with JSONL-formatted metadata containing FASTA, UniProt IDs, and embeddings.
18+
ARN: arn:aws:s3:::deepdrug-dpeb-human-protein-embeddings
19+
Region: us-east-1
20+
Type: S3 Bucket
21+
DataAtWork:
22+
Tutorials:
23+
- Title: Aggregating and Clustering AlphaFold2 Embeddings from DPEB
24+
URL: https://github.com/deepdrugai/DPEB/tree/main
25+
AuthorName: Md. Saiful Islam Sajol
26+
AuthorURL: https://github.com/deepdrugai
27+
Tools & Applications:
28+
- Title: DPEB Explorer Tool
29+
URL: https://github.com/deepdrugai/DPEB
30+
AuthorName: DeepDrug Lab
31+
AuthorURL: https://github.com/deepdrugai
32+
Publications:
33+
- Title: A Multimodal Human Protein Embeddings Database - DeepDrug Protein Embeddings Bank (DPEB)
34+
URL: https://doi.org/10.XXXX/nar.dpeb2025
35+
AuthorName: Sajol MSI, Rajasekaran M, Bess A, Alvin C, Mukhopadhyay S
36+
AuthorURL: https://github.com/deepdrugai/DPEB
37+
ADXCategories:
38+
- Healthcare & Life Sciences Data

0 commit comments

Comments
 (0)