Skip to content

CogStack/cogstack-nlp

Cogstack NLP

Build Status Documentation Status Latest release

CogStack Natural Language Processing offers tools to process and extract information from clinical text and documents in Electronic Health Records (EHRs).

The primary NLP focus is the Medical Concept Annotation Tool (MedCAT), a self-supervised machine learning algorithm for extracting concepts using any concept vocabulary including UMLS/SNOMED-CT. See the paper on arXiv.

Official Docs here

Discussion Forum discourse

Projects

NLP

  • Medical Concept Annotation Tool: MedCAT can be used to extract information from Electronic Health Records (EHRs) and link it to biomedical ontologies like SNOMED-CT, UMLS, or HPO (and potentially other ontologies).
  • Medical Concept Annotation Tool Trainer: MedCATTrainer is an interface for building, improving and customising a given Named Entity Recognition and Linking (NER+L) model (MedCAT) for biomedical domain text.
  • MedCAT Service: A REST API wrapper for MedCAT, allowing you to send text for processing and receive structured annotations in response.

Learning and Demos

  • Deidentify app: Demo for AnonCAT. It uses MedCAT, an advanced natural language processing tool, to identify and classify sensitive information, such as names, addresses, and medical terms.
  • MedCAT Demo App: A simple web application showcasing how to use MedCAT for clinical text annotation.
  • MedCAT Tutorials: The MedCAT Tutorials privde an interactive learning path for using MedCAT

About

CogStack NLP, including the Medical Concept Annotation Tool MedCAT

Resources

License

Code of conduct

Contributing

Security policy

Stars

Watchers

Forks

Packages

No packages published

Contributors 27