|
| 1 | +# Curating tools with AgentMD |
| 2 | + |
| 3 | +## Step 0: Processing PubMed data |
| 4 | +A full list of PMIDs returned by the Boolean query is shown in `file1_full_classification_results.json`, and the corresponding PubMed abstracts can be downloaded at https://ftp.ncbi.nlm.nih.gov/pubmed/baseline/. |
| 5 | +One can use `file0_sample_candidate_articles.json` to run demonstration scripts in this directory. |
| 6 | + |
| 7 | +## Step 1: Calculator Screening |
| 8 | + |
| 9 | +The first step is screening potential calculator-describing abstracts by GPT-3.5-Turbo. Please run: |
| 10 | + |
| 11 | +```bash |
| 12 | +python step1_classify.py |
| 13 | +``` |
| 14 | + |
| 15 | +The demonstration code results will be saved as `file0_sample_candidate_articles.json`. We also provided the pre-computed full results in `file1_full_classification_results.jsonl`. |
| 16 | + |
| 17 | +## Step 2: Calculator Drafting |
| 18 | + |
| 19 | +The second step is to draft calculators in a structured format with GPT-4. Please run: |
| 20 | + |
| 21 | +```bash |
| 22 | +python step2_draft_calcs.py |
| 23 | +``` |
| 24 | + |
| 25 | +The results will be saved in `./dir1_pubmed_risk_calcs/` and indexed by their corresponding PMIDs. |
| 26 | + |
| 27 | +## Step 3: Calculator Verification |
| 28 | + |
| 29 | +The third step is the verification of drafted calculators. Please run: |
| 30 | + |
| 31 | +```bash |
| 32 | +python step3_verify_calcs.py |
| 33 | +``` |
| 34 | + |
| 35 | +The results will be saved in `./dir2_risk_calc_verifications/` and indexed by their corresponding PMIDs. |
| 36 | + |
| 37 | +## Acknowledgments |
| 38 | + |
| 39 | +This research was supported by the NIH Intramural Research Program, National Library of Medicine, and 1K99LM014024. |
| 40 | + |
| 41 | +## Disclaimer |
| 42 | + |
| 43 | +This tool shows the results of research conducted in the Computational Biology Branch, NLM. The information produced on this website is not intended for direct diagnostic use or medical decision-making without review and oversight by a clinical professional. Individuals should not change their health behavior solely on the basis of information produced on this website. NIH does not independently verify the validity or utility of the information produced by this tool. If you have questions about the information produced on this website, please see a health care professional. More information about NCBI's disclaimer policy is available. |
| 44 | + |
| 45 | +## Citation |
| 46 | + |
| 47 | +If you find this repo helpful, please cite AgentMD by: |
| 48 | +```bibtex |
| 49 | +@article{jin2024agentmd, |
| 50 | + title={AgentMD: Empowering Language Agents for Risk Prediction with Large-Scale Clinical Tool Learning}, |
| 51 | + author={Jin, Qiao and Wang, Zhizheng and Yang, Yifan and Zhu, Qingqing and Wright, Donald and Huang, Thomas and Wilbur, W John and He, Zhe and Taylor, Andrew and Chen, Qingyu and others}, |
| 52 | + journal={arXiv preprint arXiv:2402.13225}, |
| 53 | + year={2024} |
| 54 | +} |
| 55 | +``` |
0 commit comments