|
| 1 | +# GPCRmd |
| 2 | + |
| 3 | +> GPCRmd is an online platform for visualizing, analyzing, and sharing molecular dynamics simulations of G-protein-coupled receptors (GPCRs), a key family of membrane proteins and common drug targets. |
| 4 | +
|
| 5 | +- web site: <https://www.gpcrmd.org/> |
| 6 | +- publication: [GPCRmd uncovers the dynamics of the 3D-GPCRome](https://www.nature.com/articles/s41592-020-0884-y), Nature Methods, 2020. |
| 7 | +- [documentation](https://gpcrmd-docs.readthedocs.io/en/latest/index.html) |
| 8 | + |
| 9 | +## API |
| 10 | + |
| 11 | +### Base URL |
| 12 | + |
| 13 | +<https://www.gpcrmd.org/api/> |
| 14 | + |
| 15 | +### Documentation |
| 16 | + |
| 17 | +<https://gpcrmd-docs.readthedocs.io/en/latest/api.html#main-gpcrmd-api> |
| 18 | + |
| 19 | +### Token |
| 20 | + |
| 21 | +No token is needed to GPCRmd API in read mode. |
| 22 | + |
| 23 | +### Metadata of datasets and files |
| 24 | + |
| 25 | +Although GPCRmd provides a public API to discover molecular dynamics datasets, **some metadata fields and all file-level information are not exposed via the API**. For this reason, web scraping of the dataset page is required to retrieve complete dataset descriptions and file metadata. |
| 26 | + |
| 27 | +### Datasets |
| 28 | + |
| 29 | +In GPCRmd, dataset (a simulation and its related files) is called a "dynamic". |
| 30 | + |
| 31 | +API entrypoint to search for all datasets at once: |
| 32 | + |
| 33 | +- Endpoint: `/search_all/info/` |
| 34 | +- HTTP method: GET |
| 35 | +- [documentation](https://gpcrmd-docs.readthedocs.io/en/latest/api.html#main-gpcrmd-api) |
| 36 | + |
| 37 | +#### Dataset metadata retrieved via the API |
| 38 | + |
| 39 | +| Field | Description | |
| 40 | +| -------------------- | ----------------------------------- | |
| 41 | +| `dyn_id` | Unique dynamic (dataset) identifier | |
| 42 | +| `modelname` | Name of the simulated system | |
| 43 | +| `timestep` | MD integration time step in fs | |
| 44 | +| `atom_num` | Number of atoms | |
| 45 | +| `mysoftware` | MD engine used | |
| 46 | +| `software_version` | Version of the MD engine | |
| 47 | +| `forcefield` | Force field and model name | |
| 48 | +| `forcefield_version` | Force field and model version | |
| 49 | +| `creation_timestamp` | Dataset creation date | |
| 50 | +| `dataset_url` | URL of the dataset web page | |
| 51 | + |
| 52 | +#### Dataset metadata retrieved via scraping of the dataset HTML page |
| 53 | + |
| 54 | +| Field | Description | |
| 55 | +| -------------------- | ------------------------------------------ | |
| 56 | +| `description` | Textual description of the simulation | |
| 57 | +| `authors` | Authors | |
| 58 | +| `simulation_time` | Total simulation length | |
| 59 | + |
| 60 | +### Files |
| 61 | + |
| 62 | +The GPCRmd API does not provide any endpoint to access file-level metadata. File metadata is extracted from the dataset web page. |
| 63 | + |
| 64 | +For example: |
| 65 | + |
| 66 | +- Files associated to the [dataset `7`](https://www.gpcrmd.org/dynadb/dynamics/id/7/) are: |
| 67 | + |
| 68 | + - <https://www.gpcrmd.org/dynadb/files/Dynamics/10166_trj_7.dcd> |
| 69 | + - <https://www.gpcrmd.org/dynadb/files/Dynamics/10167_dyn_7.psf> |
| 70 | + - <https://www.gpcrmd.org/dynadb/files/Dynamics/10168_dyn_7.pdb> |
| 71 | + |
| 72 | +- Files associated to the [dataset `12`](https://www.gpcrmd.org/dynadb/dynamics/id/12/) are: |
| 73 | + |
| 74 | + - <https://www.gpcrmd.org/dynadb/files/Dynamics/10193_trj_12.xtc> |
| 75 | + - <https://www.gpcrmd.org/dynadb/files/Dynamics/10196_dyn_12.psf> |
| 76 | + - <https://www.gpcrmd.org/dynadb/files/Dynamics/10197_dyn_12.pdb> |
| 77 | + - <https://www.gpcrmd.org/dynadb/files/Dynamics/10191_oth_12.tar.gz> |
| 78 | + - <https://www.gpcrmd.org/dynadb/files/Dynamics/10192_prm_12.tar.gz> |
| 79 | + |
| 80 | +#### File metadata retrieved via scraping of the dataset HTML page |
| 81 | + |
| 82 | +| Field | Description | |
| 83 | +| ----------- | ---------------------- | |
| 84 | +| `file_name` | *Name of the file* | |
| 85 | +| `file_type` | *File extension* | |
| 86 | +| `file_path` | *Public download URL* | |
| 87 | +| `file_size` | *File size in bytes* | |
| 88 | + |
| 89 | +> 💡 File size is obtained using an HTTP `HEAD` request on the file URL, **avoiding file download**. |
| 90 | +
|
| 91 | +## Example |
| 92 | + |
| 93 | +### Dataset ID 2316 |
| 94 | + |
| 95 | +- [Dataset on GPCRmd GUI](https://www.gpcrmd.org/dynadb/dynamics/id/2316/) |
| 96 | +- [Dataset on GPCRmd API](https://www.gpcrmd.org/api/search_dyn/info/2316) |
| 97 | + |
| 98 | +#### Dataset metadata (API + scraping) |
| 99 | + |
| 100 | +| Field | Description | |
| 101 | +| -------------------- | ------------------------------------------------- | |
| 102 | +| `dyn_id` | 2316 | |
| 103 | +| `modelname` | FFA2_TUG1375_Gi1-TUG1375 | |
| 104 | +| `timestep` | 2 | |
| 105 | +| `atom_num` | 4829 | |
| 106 | +| `mysoftware` | AMBER PMEMD.CUDA | |
| 107 | +| `software_version` | 2020 | |
| 108 | +| `forcefield` | ff19SB/lipid21/GAFF2 | |
| 109 | +| `forcefield_version` | ff19SB/lipid21 | |
| 110 | +| `creation_timestamp` | 2025-05-13 | |
| 111 | +| `dataset_url` | <https://www.gpcrmd.org/dynadb/dynamics/id/2316/> | |
| 112 | +| `description` | Simulation aims to observe structural features of FFA2 without an orthosteric agonist and G-protein, which will be compared to docking-based simulations of allosteric activators... | |
| 113 | +| `authors` | Abdul-Akim Guseinov, University of Glasgow | |
| 114 | +| `simulation_time` | 3.0 µs | |
| 115 | + |
| 116 | +#### Files metadata |
| 117 | + |
| 118 | +[files on GPCRmd GUI](https://www.gpcrmd.org/api/search_dyn/info/2316) (accessible via the *Technical Information* section) |
| 119 | + |
| 120 | +| Field | Description | |
| 121 | +| ----------- | ------------------------------------------------------------------------- | |
| 122 | +| `file_name` | tmp_dyn_0_2667.pdb | |
| 123 | +| `file_path` | <https://www.gpcrmd.org/dynadb/files/Dynamics/dyn2667/tmp_dyn_0_2667.pdb> | |
| 124 | +| `file_size` | 1024 | |
0 commit comments