Skip to content

Latest commit

 

History

History
33 lines (24 loc) · 1.07 KB

File metadata and controls

33 lines (24 loc) · 1.07 KB

MedText

Paper

Title: Towards Automatic Generation of Shareable Synthetic Clinical Notes Using Neural Language Models

Abstract: https://arxiv.org/abs/1905.07002

MedText is a medical diagnosis dataset containing over 1000 top notch textbook quality patient presentations and diagnosis/treatments. The 100 most common diseases and the 30 most common injuries people go to the hospital with, are, among others, fully captured in the dataset, with multiple datapoints for each ranging from mild to complicated to severe.

Tasks

  • medtext: Open-ended QA in english.
  • medtext_perplexity: Open-ended QA in english, evaluated with perplexity.

Citation

@misc{melamud2019automaticgenerationshareablesynthetic,
      title={Towards Automatic Generation of Shareable Synthetic Clinical Notes Using Neural Language Models},
      author={Oren Melamud and Chaitanya Shivade},
      year={2019},
      eprint={1905.07002},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/1905.07002},
}