Collocation-Analysis

This repository contains a Python script that performs collocation analysis and stores the results in a pandas DataFrame, which can be exported to CSV. Example data is provided from the One Health dataset available on PubMed Central.

The script reads its dataset from a CSV file in which each row contains one document.
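The workflow described above could be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the repository's actual script: the column name `text`, the helper `collocation_table`, the inline sample rows, the small stopword set, the frequency threshold, and the choice of PMI as the scoring measure are all hypothetical.

```python
import io

import pandas as pd
from nltk.collocations import BigramAssocMeasures, BigramCollocationFinder
from nltk.tokenize import wordpunct_tokenize

# Hypothetical stand-in for the CSV dataset: one document per row.
# The column name "text" is an assumption, not taken from the script.
SAMPLE_CSV = io.StringIO(
    "text\n"
    "One Health links human health and animal health.\n"
    "Animal health affects human health worldwide.\n"
    "One Health research covers environmental health.\n"
)

# Small illustrative stopword set (assumption). The real script may use
# nltk.corpus.stopwords instead, which requires downloading NLTK data.
STOPWORDS = {"and", "the", "of", "a", "in"}


def collocation_table(docs, min_freq=2):
    """Return a DataFrame of bigram collocations found in `docs`."""
    # Tokenize each document separately so that no bigram spans two documents.
    tokenized = [wordpunct_tokenize(doc.lower()) for doc in docs]
    finder = BigramCollocationFinder.from_documents(tokenized)

    # Drop rare bigrams, then bigrams containing punctuation or stopwords.
    finder.apply_freq_filter(min_freq)
    finder.apply_word_filter(lambda w: not w.isalpha() or w in STOPWORDS)

    # Score the remaining bigrams by pointwise mutual information (PMI).
    rows = [
        {"bigram": " ".join(bigram), "frequency": finder.ngram_fd[bigram], "pmi": score}
        for bigram, score in finder.score_ngrams(BigramAssocMeasures.pmi)
    ]
    return pd.DataFrame(rows, columns=["bigram", "frequency", "pmi"])


documents = pd.read_csv(SAMPLE_CSV)["text"]
table = collocation_table(documents)

# With no path argument, to_csv returns the CSV as a string;
# pass a filename instead to write the results to disk.
csv_text = table.to_csv(index=False)
print(csv_text)
```

To run the sketch on a real file, replace `SAMPLE_CSV` with the path to the dataset and adjust the column name to match it.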

Required Packages

nltk (modules: nltk.tokenize, nltk.collocations, nltk.corpus)
pandas