Skip to content

Pull requests: MedARC-AI/med-lm-envs

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Parsed Model Answer logging+ MCQ Answer analysis
#110 opened Jan 26, 2026 by mnishant2 Loading…
Add NLG metrics to aci_bench
#107 opened Jan 22, 2026 by jbdel Loading…
Add BioASQ
#105 opened Jan 22, 2026 by Ash-29 Loading…
Casereportbench environment
#99 opened Jan 20, 2026 by ss8319 Loading…
Add MedSafetyBench environment
#97 opened Jan 18, 2026 by anas-zafar Loading…
Added paper citations to environments
#86 opened Dec 24, 2025 by mkieffer1107 Loading…
Feat/medexqa judges
#67 opened Nov 3, 2025 by mnishant2 Draft
Add MedHALT evaluation environment
#65 opened Oct 30, 2025 by geetua Loading…
BioHopR
#62 opened Oct 26, 2025 by marii-moe Loading…
K-QA #45 added
#52 opened Oct 13, 2025 by Manishram-ai Loading…
implement AgentClinic benchmark
#38 opened Oct 9, 2025 by aymaneo Loading…
Mebullets changes
#26 opened Oct 3, 2025 by mkieffer1107 Loading…
ProTip! Exclude everything labeled bug with -label:bug.