@jhnwu3 (Collaborator) commented Jan 5, 2026

This pull request adds benchmarking tools and updates documentation for the PyHealth project, focusing on drug recommendation tasks using the MIMIC-IV dataset. The main changes are two new benchmarking scripts for performance analysis and an update to the installation documentation that clarifies the Python version requirements and improves the installation instructions.

Benchmarking tools for drug recommendation:

  • Added benchmark_pandas_drug_rec.py, a standalone script that benchmarks drug recommendation preprocessing using pandas on MIMIC-IV data, including cumulative history construction and peak memory tracking.
  • Added benchmark_workers_n_drug_recommendation.py, a script to benchmark the PyHealth DrugRecommendationMIMIC4 task across different num_workers settings, measuring wall time, cache sizes, and peak memory usage, and writing results to CSV for further analysis.
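For readers unfamiliar with the two measurements named above, here is a minimal sketch of cumulative history construction in pandas with wall time and peak memory recorded around it. It is not the code from either script: the column names (`patient_id`, `visit_id`, `visit_time`, `drugs`), the helper names, and the toy data are all hypothetical, and `tracemalloc` is used as an assumed stand-in for whatever memory tracking the scripts actually use (it only sees allocations made through Python's allocator).

```python
import time
import tracemalloc

import pandas as pd


def build_cumulative_history(visits: pd.DataFrame) -> list[dict]:
    """Pair each visit with the drug lists of the same patient's earlier visits.

    Hypothetical schema: patient_id, visit_id, visit_time, drugs (list per row).
    """
    samples = []
    ordered = visits.sort_values(["patient_id", "visit_time"])
    for patient_id, group in ordered.groupby("patient_id", sort=False):
        history = []
        for row in group.itertuples(index=False):
            samples.append(
                {
                    "patient_id": patient_id,
                    "visit_id": row.visit_id,
                    "history": list(history),  # copy, so later appends do not leak in
                    "target": row.drugs,
                }
            )
            history.append(row.drugs)
    return samples


def measure(func, *args):
    """Run func(*args) and return (result, elapsed seconds, peak traced bytes)."""
    tracemalloc.start()
    start = time.perf_counter()
    result = func(*args)
    elapsed = time.perf_counter() - start
    _, peak_bytes = tracemalloc.get_traced_memory()
    tracemalloc.stop()
    return result, elapsed, peak_bytes


# Toy data standing in for MIMIC-IV prescriptions (illustrative only).
toy_visits = pd.DataFrame(
    {
        "patient_id": ["p1", "p1", "p2"],
        "visit_id": ["v1", "v2", "v3"],
        "visit_time": pd.to_datetime(["2020-01-01", "2020-03-01", "2020-02-01"]),
        "drugs": [["A"], ["B", "C"], ["D"]],
    }
)

samples, elapsed, peak_bytes = measure(build_cumulative_history, toy_visits)
```

The same `measure` wrapper could in principle be called once per `num_workers` setting and its results appended to a CSV, which is the shape of sweep the second script is described as performing.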

Documentation improvements:

  • Updated docs/install.rst to clarify that PyHealth 2.0 requires Python 3.12 or 3.13, updated the installation instructions for both the alpha and legacy versions, and improved the version-specific recommendations.
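As a quick illustration of the version range stated in the bullet above, the following sketch checks whether an interpreter falls within it. The bounds are taken from this description rather than from PyHealth's packaging metadata, and the function name is hypothetical.

```python
import sys

# Assumed bounds, taken from the PR description: 3.12 through 3.13 inclusive.
MIN_VERSION = (3, 12)
MAX_VERSION = (3, 13)


def is_supported(version=None):
    """Return True if the (major, minor) pair lies within the assumed range."""
    if version is None:
        version = sys.version_info
    major_minor = tuple(version[:2])
    return MIN_VERSION <= major_minor <= MAX_VERSION


if not is_supported():
    print(f"Python {sys.version_info.major}.{sys.version_info.minor} is outside the range described above")
```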

@jhnwu3 requested a review from Logiquo on January 5, 2026 18:46
@jhnwu3 merged commit d9ff382 into master on Jan 6, 2026
1 check passed
@jhnwu3 deleted the fix/python_ver branch on January 6, 2026 17:55