Skip to content

Latest commit

 

History

History
8 lines (6 loc) · 453 Bytes

File metadata and controls

8 lines (6 loc) · 453 Bytes

nlp_bpdcn

UCSF ohgami lab pdf nlp project corinn.small@ucsf.edu

Natural language processing (NLP) pipeline for the blastic plasmacytoid dendritic cell neoplasm (bpdcn) project. We want to streamline pdf analysis by using the aws sdk 'textract' and pull out relevant disease data including patient history, tissue/cell morphology, cell markers (disease cell origin/type), genetics, treatments and outcome. WE also want to eventually include images.