Commit c494e38

Merge branch 'main' into aph-add-radiant
2 parents 7a54d61 + 0d57273 commit c494e38

169 files changed

+5104
-224
lines changed

datasets/ai3.yaml

Lines changed: 37 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,37 @@
1+
Name: AI3 Protein-Ligand Binding Affinity Dataset
2+
Description: >
3+
The rapid advancement of computing technologies, particularly artificial intelligence (AI), has revolutionized various domains, including drug discovery. Curated datasets are crucial for developing reliable, generalizable, and accurate models for practical applications. Generating experimental data on a large scale is an expensive and arduous process. In domains such as medical diagnostics where real-life data is hard to obtain, synthetic data has been shown to be extremely valuable. We, teams from IIIT Hyderabad, Intel, AWS, and Insilico Medicine, have performed physics-based calculations (molecular dynamics simulations) on about 20,000 protein-ligand complexes. The dataset comprises molecular dynamics snapshots, binding affinities calculated using the MM-PBSA method, and individual energy components, including electrostatic and van der Waals interactions. DatasetFileFormats essentially incorporate i. 3D coordinates of the protein-ligand complexes (pdb) in tar.gz files, and ii. CSV files containing the energy data. DatasetUsages are on i. ML scoring function for predicting binding affinities of given protein-ligand complexes, ii. Classification models for predicting correct binding poses of ligands, iii. identification of cryptic binding pockets, and iv. optimization of binding features by exploiting the individual components of the energy (experimental data has only the total binding affinity). Further, the novelty of the dataset highlights the fact that existing AI/ML training datasets lack dynamic data and are inherently biased. Further, binding affinity data existing in the literature are obtained from different experimental protocols. Therefore, this dataset has been uniquely created (from the same computational protocols) followed by free energy calculations with molecular dynamics (MD) simulations. 
The dynamic data-enriched protein-ligand coordinates can be used to effectively train convolutional neural network-based regression models for more accurate binding affinity prediction.
4+
Documentation: https://github.com/devalab/AI3
5+
6+
ManagedBy: International Institute of Information Technology Hyderabad
7+
UpdateFrequency: Not updated
8+
Tags:
9+
- pharmaceutical
10+
- simulations
11+
- health
12+
- life sciences
13+
- machine learning
14+
- protein
15+
- molecular dynamics
16+
- aws-pds
17+
License: https://devalab.in/AI3.html
18+
Resources:
19+
- Description: ai3data bucket includes coordinates and the energetics of ~20,000 protein-ligand binding affinity datasets. The subfolders of ai3data bucket consist of Version 1, Version2 and Version 3. Version1 contains the total Size of 10.4 GiB (Initial structure of the protein-ligand complex and the average binding affinities along with average energy components). Version2 contains the total Size of 1.2 TiB (Five trajectories of protein-ligand complex (200 snapshots in all) and the closest two water molecules for each of the protein-ligand complex, and the time series of the binding affinities along with average energy components). Version3 contains the total Size of 10.7 TiB (Five trajectories of completely solvated protein-ligand complex (200 snapshots in all), and the time series of binding affinities along with average energy components).
20+
ARN: arn:aws:s3:::ai3data
21+
Region: us-east-1
22+
Type: S3 Bucket
23+
DataAtWork:
24+
Tutorials:
25+
- Title: "AI3: Protein-Ligand Binding Affinity Dataset"
26+
URL: https://github.com/devalab/AI3
27+
AuthorName: Deva Priyakumar Lab
28+
AuthorURL: https://github.com/devalab
29+
Publications:
30+
- Title: "PLAS-5k: Dataset of Protein-Ligand Affinities from Molecular Dynamics for Machine Learning Applications"
31+
URL: https://www.nature.com/articles/s41597-022-01631-9
32+
AuthorName: U. Deva Priyakumar
33+
AuthorURL: https://devalab.in/
34+
- Title: "PLAS-20k: Extended Dataset of Protein-Ligand Affinities from MD Simulations for Machine Learning Applications"
35+
URL: https://www.nature.com/articles/s41597-023-02872-y
36+
AuthorName: U. Deva Priyakumar
37+
AuthorURL: https://devalab.in
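
The Resources entry above identifies the bucket only by its ARN (`arn:aws:s3:::ai3data`). As a minimal sketch, assuming a reader wants the equivalent `s3://` URI for use with common tooling, the conversion could look like the following. The function name `s3_uri_from_arn` is hypothetical and is not part of the registry or of any AWS library.

```python
def s3_uri_from_arn(arn: str) -> str:
    """Convert an S3 ARN such as 'arn:aws:s3:::ai3data' into 's3://ai3data'.

    Hypothetical helper for illustration; it only does string handling
    and never contacts AWS.
    """
    # An ARN has the shape arn:partition:service:region:account-id:resource.
    # For S3 buckets the region and account-id fields are empty.
    parts = arn.split(":", 5)
    if len(parts) != 6 or parts[0] != "arn" or parts[2] != "s3":
        raise ValueError(f"not an S3 ARN: {arn!r}")
    resource = parts[5]
    if not resource:
        raise ValueError(f"ARN has no resource part: {arn!r}")
    return f"s3://{resource}"


if __name__ == "__main__":
    print(s3_uri_from_arn("arn:aws:s3:::ai3data"))
```

Whether the bucket allows unauthenticated listing is not stated in this diff, so the sketch stops at building the URI.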

datasets/aodn_animal_acoustic_tracking_delayed_qc.yaml

Lines changed: 6 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -28,9 +28,10 @@ Collabs:
2828
Tags:
2929
- biodiversity
3030
Tags:
31-
- oceans
32-
- marine mammals
33-
- biology
31+
- aws-pds
32+
- oceans
33+
- marine mammals
34+
- biology
3435
License: http://creativecommons.org/licenses/by/4.0/
3536
Resources:
3637
- Description: Cloud Optimised AODN dataset of IMOS - Animal Tracking Facility - Acoustic
@@ -42,12 +43,12 @@ DataAtWork:
4243
Tutorials:
4344
- Title: Accessing IMOS - Animal Tracking Facility - Acoustic Tracking - Quality
4445
Controlled Detections (2007 - ongoing)
45-
URL: https://nbviewer.org/github/aodn/aodn_cloud_optimised/blob/main/notebooks/animal_acoustic_tracking_delayed_qc.ipynb
46+
URL: https://github.com/aodn/aodn_cloud_optimised/blob/main/notebooks/animal_acoustic_tracking_delayed_qc.ipynb
4647
NotebookURL: https://githubtocolab.com/aodn/aodn_cloud_optimised/blob/main/notebooks/animal_acoustic_tracking_delayed_qc.ipynb
4748
AuthorName: Laurent Besnard
4849
AuthorURL: https://github.com/aodn/aodn_cloud_optimised
4950
- Title: Accessing and search for any AODN dataset
50-
URL: https://nbviewer.org/github/aodn/aodn_cloud_optimised/blob/main/notebooks/GetAodnData.ipynb
51+
URL: https://github.com/aodn/aodn_cloud_optimised/blob/main/notebooks/GetAodnData.ipynb
5152
NotebookURL: https://githubtocolab.com/aodn/aodn_cloud_optimised/blob/main/notebooks/GetAodnData.ipynb
5253
AuthorName: Laurent Besnard
5354
AuthorURL: https://github.com/aodn/aodn_cloud_optimised
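
The `URL` changes in this file, and in the other AODN files in this commit, replace the `https://nbviewer.org/github/` prefix with `https://github.com/` and leave the rest of the path and the `NotebookURL` values untouched. A minimal sketch of that rewrite is shown below. The helper name `nbviewer_to_github` is hypothetical, and nothing in the commit indicates the edits were made with a script.

```python
NBVIEWER_PREFIX = "https://nbviewer.org/github/"
GITHUB_PREFIX = "https://github.com/"


def nbviewer_to_github(url: str) -> str:
    """Rewrite an nbviewer notebook URL to the matching github.com URL.

    URLs that do not start with the nbviewer prefix are returned unchanged.
    Hypothetical helper for illustration only.
    """
    if url.startswith(NBVIEWER_PREFIX):
        return GITHUB_PREFIX + url[len(NBVIEWER_PREFIX):]
    return url


if __name__ == "__main__":
    old = (
        "https://nbviewer.org/github/aodn/aodn_cloud_optimised/"
        "blob/main/notebooks/GetAodnData.ipynb"
    )
    print(nbviewer_to_github(old))
```

Matching on the full prefix rather than on the host name alone keeps the `githubtocolab.com` links in `NotebookURL` unaffected.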

datasets/aodn_animal_ctd_satellite_relay_tagging_delayed_qc.yaml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -44,12 +44,12 @@ DataAtWork:
4444
Tutorials:
4545
- Title: Accessing Satellite Relay Tagging Program - Southern Ocean - MEOP Quality
4646
Controlled CTD Profiles
47-
URL: https://nbviewer.org/github/aodn/aodn_cloud_optimised/blob/main/notebooks/animal_ctd_satellite_relay_tagging_delayed_qc.ipynb
47+
URL: https://github.com/aodn/aodn_cloud_optimised/blob/main/notebooks/animal_ctd_satellite_relay_tagging_delayed_qc.ipynb
4848
NotebookURL: https://githubtocolab.com/aodn/aodn_cloud_optimised/blob/main/notebooks/animal_ctd_satellite_relay_tagging_delayed_qc.ipynb
4949
AuthorName: Laurent Besnard
5050
AuthorURL: https://github.com/aodn/aodn_cloud_optimised
5151
- Title: Accessing and search for any AODN dataset
52-
URL: https://nbviewer.org/github/aodn/aodn_cloud_optimised/blob/main/notebooks/GetAodnData.ipynb
52+
URL: https://github.com/aodn/aodn_cloud_optimised/blob/main/notebooks/GetAodnData.ipynb
5353
NotebookURL: https://githubtocolab.com/aodn/aodn_cloud_optimised/blob/main/notebooks/GetAodnData.ipynb
5454
AuthorName: Laurent Besnard
5555
AuthorURL: https://github.com/aodn/aodn_cloud_optimised

datasets/aodn_model_sea_level_anomaly_gridded_realtime.yaml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -7,12 +7,12 @@ DataAtWork:
77
AuthorURL: https://github.com/aodn/aodn_cloud_optimised
88
NotebookURL: https://githubtocolab.com/aodn/aodn_cloud_optimised/blob/main/notebooks/model_sea_level_anomaly_gridded_realtime.ipynb
99
Title: Accessing IMOS - OceanCurrent - Gridded sea level anomaly - Near real time
10-
URL: https://nbviewer.org/github/aodn/aodn_cloud_optimised/blob/main/notebooks/model_sea_level_anomaly_gridded_realtime.ipynb
10+
URL: https://github.com/aodn/aodn_cloud_optimised/blob/main/notebooks/model_sea_level_anomaly_gridded_realtime.ipynb
1111
- AuthorName: Laurent Besnard
1212
AuthorURL: https://github.com/aodn/aodn_cloud_optimised
1313
NotebookURL: https://githubtocolab.com/aodn/aodn_cloud_optimised/blob/main/notebooks/GetAodnData.ipynb
1414
Title: Accessing and search for any AODN dataset
15-
URL: https://nbviewer.org/github/aodn/aodn_cloud_optimised/blob/main/notebooks/GetAodnData.ipynb
15+
URL: https://github.com/aodn/aodn_cloud_optimised/blob/main/notebooks/GetAodnData.ipynb
1616
Description: "Gridded (adjusted) sea level anomaly (GSLA), gridded sea level (GSL)\
1717
\ and surface geostrophic velocity (UCUR,VCUR) for the Australasian region. GSLA\
1818
\ is mapped using optimal interpolation of detided, de-meaned, inverse-barometer-adjusted\

datasets/aodn_mooring_ctd_delayed_qc.yaml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -31,12 +31,12 @@ Resources:
3131
DataAtWork:
3232
Tutorials:
3333
- Title: Accessing IMOS - Australian National Mooring Network (ANMN) - CTD Profiles
34-
URL: https://nbviewer.org/github/aodn/aodn_cloud_optimised/blob/main/notebooks/mooring_ctd_delayed_qc.ipynb
34+
URL: https://github.com/aodn/aodn_cloud_optimised/blob/main/notebooks/mooring_ctd_delayed_qc.ipynb
3535
NotebookURL: https://githubtocolab.com/aodn/aodn_cloud_optimised/blob/main/notebooks/mooring_ctd_delayed_qc.ipynb
3636
AuthorName: Laurent Besnard
3737
AuthorURL: https://github.com/aodn/aodn_cloud_optimised
3838
- Title: Accessing and search for any AODN dataset
39-
URL: https://nbviewer.org/github/aodn/aodn_cloud_optimised/blob/main/notebooks/GetAodnData.ipynb
39+
URL: https://github.com/aodn/aodn_cloud_optimised/blob/main/notebooks/GetAodnData.ipynb
4040
NotebookURL: https://githubtocolab.com/aodn/aodn_cloud_optimised/blob/main/notebooks/GetAodnData.ipynb
4141
AuthorName: Laurent Besnard
4242
AuthorURL: https://github.com/aodn/aodn_cloud_optimised

datasets/aodn_mooring_hourly_timeseries_delayed_qc.yaml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -40,12 +40,12 @@ Resources:
4040
DataAtWork:
4141
Tutorials:
4242
- Title: Accessing IMOS - Moorings - Hourly time-series product
43-
URL: https://nbviewer.org/github/aodn/aodn_cloud_optimised/blob/main/notebooks/mooring_hourly_timeseries_delayed_qc.ipynb
43+
URL: https://github.com/aodn/aodn_cloud_optimised/blob/main/notebooks/mooring_hourly_timeseries_delayed_qc.ipynb
4444
NotebookURL: https://githubtocolab.com/aodn/aodn_cloud_optimised/blob/main/notebooks/mooring_hourly_timeseries_delayed_qc.ipynb
4545
AuthorName: Laurent Besnard
4646
AuthorURL: https://github.com/aodn/aodn_cloud_optimised
4747
- Title: Accessing and search for any AODN dataset
48-
URL: https://nbviewer.org/github/aodn/aodn_cloud_optimised/blob/main/notebooks/GetAodnData.ipynb
48+
URL: https://github.com/aodn/aodn_cloud_optimised/blob/main/notebooks/GetAodnData.ipynb
4949
NotebookURL: https://githubtocolab.com/aodn/aodn_cloud_optimised/blob/main/notebooks/GetAodnData.ipynb
5050
AuthorName: Laurent Besnard
5151
AuthorURL: https://github.com/aodn/aodn_cloud_optimised

datasets/aodn_mooring_satellite_altimetry_calibration_validation.yaml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -58,12 +58,12 @@ Resources:
5858
DataAtWork:
5959
Tutorials:
6060
- Title: Accessing IMOS - SRS Satellite Altimetry Calibration and Validation Sub-Facility
61-
URL: https://nbviewer.org/github/aodn/aodn_cloud_optimised/blob/main/notebooks/mooring_satellite_altimetry_calibration_validation.ipynb
61+
URL: https://github.com/aodn/aodn_cloud_optimised/blob/main/notebooks/mooring_satellite_altimetry_calibration_validation.ipynb
6262
NotebookURL: https://githubtocolab.com/aodn/aodn_cloud_optimised/blob/main/notebooks/mooring_satellite_altimetry_calibration_validation.ipynb
6363
AuthorName: Laurent Besnard
6464
AuthorURL: https://github.com/aodn/aodn_cloud_optimised
6565
- Title: Accessing and search for any AODN dataset
66-
URL: https://nbviewer.org/github/aodn/aodn_cloud_optimised/blob/main/notebooks/GetAodnData.ipynb
66+
URL: https://github.com/aodn/aodn_cloud_optimised/blob/main/notebooks/GetAodnData.ipynb
6767
NotebookURL: https://githubtocolab.com/aodn/aodn_cloud_optimised/blob/main/notebooks/GetAodnData.ipynb
6868
AuthorName: Laurent Besnard
6969
AuthorURL: https://github.com/aodn/aodn_cloud_optimised

datasets/aodn_radar_bonneycoast_velocity_hourly_averaged_delayed_qc.yaml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -37,12 +37,12 @@ DataAtWork:
3737
Tutorials:
3838
- Title: Accessing IMOS - ACORN - Bonney Coast HF ocean radar site (South Australia,
3939
Australia) - Delayed mode sea water velocity
40-
URL: https://nbviewer.org/github/aodn/aodn_cloud_optimised/blob/main/notebooks/radar_BonneyCoast_velocity_hourly_averaged_delayed_qc.ipynb
40+
URL: https://github.com/aodn/aodn_cloud_optimised/blob/main/notebooks/radar_BonneyCoast_velocity_hourly_averaged_delayed_qc.ipynb
4141
NotebookURL: https://githubtocolab.com/aodn/aodn_cloud_optimised/blob/main/notebooks/radar_BonneyCoast_velocity_hourly_averaged_delayed_qc.ipynb
4242
AuthorName: Laurent Besnard
4343
AuthorURL: https://github.com/aodn/aodn_cloud_optimised
4444
- Title: Accessing and search for any AODN dataset
45-
URL: https://nbviewer.org/github/aodn/aodn_cloud_optimised/blob/main/notebooks/GetAodnData.ipynb
45+
URL: https://github.com/aodn/aodn_cloud_optimised/blob/main/notebooks/GetAodnData.ipynb
4646
NotebookURL: https://githubtocolab.com/aodn/aodn_cloud_optimised/blob/main/notebooks/GetAodnData.ipynb
4747
AuthorName: Laurent Besnard
4848
AuthorURL: https://github.com/aodn/aodn_cloud_optimised

datasets/aodn_radar_capricornbunkergroup_velocity_hourly_averaged_delayed_qc.yaml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -49,12 +49,12 @@ DataAtWork:
4949
Tutorials:
5050
- Title: Accessing IMOS - ACORN - Capricorn Bunker Group HF ocean radar site (Great
5151
Barrier Reef, Queensland, Australia) - Delayed mode sea water velocity
52-
URL: https://nbviewer.org/github/aodn/aodn_cloud_optimised/blob/main/notebooks/radar_CapricornBunkerGroup_velocity_hourly_averaged_delayed_qc.ipynb
52+
URL: https://github.com/aodn/aodn_cloud_optimised/blob/main/notebooks/radar_CapricornBunkerGroup_velocity_hourly_averaged_delayed_qc.ipynb
5353
NotebookURL: https://githubtocolab.com/aodn/aodn_cloud_optimised/blob/main/notebooks/radar_CapricornBunkerGroup_velocity_hourly_averaged_delayed_qc.ipynb
5454
AuthorName: Laurent Besnard
5555
AuthorURL: https://github.com/aodn/aodn_cloud_optimised
5656
- Title: Accessing and search for any AODN dataset
57-
URL: https://nbviewer.org/github/aodn/aodn_cloud_optimised/blob/main/notebooks/GetAodnData.ipynb
57+
URL: https://github.com/aodn/aodn_cloud_optimised/blob/main/notebooks/GetAodnData.ipynb
5858
NotebookURL: https://githubtocolab.com/aodn/aodn_cloud_optimised/blob/main/notebooks/GetAodnData.ipynb
5959
AuthorName: Laurent Besnard
6060
AuthorURL: https://github.com/aodn/aodn_cloud_optimised

datasets/aodn_radar_capricornbunkergroup_wave_delayed_qc.yaml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -48,12 +48,12 @@ DataAtWork:
4848
Tutorials:
4949
- Title: Accessing IMOS - ACORN - Capricorn Bunker Group HF ocean radar site (Great
5050
Barrier Reef, Queensland, Australia) - Delayed mode wave
51-
URL: https://nbviewer.org/github/aodn/aodn_cloud_optimised/blob/main/notebooks/radar_CapricornBunkerGroup_wave_delayed_qc.ipynb
51+
URL: https://github.com/aodn/aodn_cloud_optimised/blob/main/notebooks/radar_CapricornBunkerGroup_wave_delayed_qc.ipynb
5252
NotebookURL: https://githubtocolab.com/aodn/aodn_cloud_optimised/blob/main/notebooks/radar_CapricornBunkerGroup_wave_delayed_qc.ipynb
5353
AuthorName: Laurent Besnard
5454
AuthorURL: https://github.com/aodn/aodn_cloud_optimised
5555
- Title: Accessing and search for any AODN dataset
56-
URL: https://nbviewer.org/github/aodn/aodn_cloud_optimised/blob/main/notebooks/GetAodnData.ipynb
56+
URL: https://github.com/aodn/aodn_cloud_optimised/blob/main/notebooks/GetAodnData.ipynb
5757
NotebookURL: https://githubtocolab.com/aodn/aodn_cloud_optimised/blob/main/notebooks/GetAodnData.ipynb
5858
AuthorName: Laurent Besnard
5959
AuthorURL: https://github.com/aodn/aodn_cloud_optimised
