Skip to content

Commit 54e40d6

Browse files
authored
Merge branch 'main' into patch-1
2 parents e275251 + 9ab8b8f commit 54e40d6

File tree

1 file changed

+36
-0
lines changed

1 file changed

+36
-0
lines changed

datasets/proteingym.yaml

Lines changed: 36 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,36 @@
1+
Name: ProteinGym
2+
Description: |
3+
ProteinGym is a benchmark suite for assessing the performance of protein fitness prediction and design models. It comprises a large curated collection of 200+ high-throughput experimental assays (~3M mutated sequences), as well as clinical annotations from experts about the pathogenicity of mutants in over 3k human genes.
4+
Documentation: https://github.com/OATML-Markslab/ProteinGym/blob/main/README.md
5+
6+
ManagedBy: "Harvard Medical School; University of Oxford"
7+
UpdateFrequency: Quarterly
8+
Tags:
9+
- aws-pds
10+
- protein
11+
- bioinformatics
12+
- biology
13+
- life sciences
14+
- deep learning
15+
- machine learning
16+
License: MIT License
17+
Resources:
18+
- Description: "ProteinGym dataset including all substitution/indel mutations from Deep Mutational Scanning (DMS) experiments (DMS_substitutions.parquet / DMS_indels.parquet), and all substitution/indel mutations from clinical variant databases (clinical_substitutions.parquet / clinical_indels.parquet)."
19+
ARN: arn:aws:s3:::proteingym
20+
Region: us-east-2
21+
Type: S3 Bucket
22+
DataAtWork:
23+
Tutorials:
24+
- Title: Scoring ProteinGym assays with TranceptEVE
25+
URL: https://github.com/OATML-Markslab/ProteinGym/blob/main/notebooks/TranceptEVE_example.ipynb
26+
AuthorName: Daniel Ritter
27+
AuthorURL: https://danieldritter.github.io/
28+
Tools & Applications:
29+
- Title: ProteinGym website
30+
URL: https://proteingym.org/
31+
AuthorName: Pascal Notin & Daniel Ritter
32+
Publications:
33+
- Title: "ProteinGym: Large-Scale Benchmarks for Protein Fitness Prediction and Design"
34+
URL: https://papers.nips.cc/paper_files/paper/2023/hash/cac723e5ff29f65e3fcbb0739ae91bee-Abstract-Datasets_and_Benchmarks.html
35+
AuthorName: "Pascal Notin, et al."
36+
AuthorURL: https://www.pascalnotin.com/

0 commit comments

Comments
 (0)