(ran on local machine, not datahub)
Under the section Phrase Modeling with gensim, this line results in an error: bigram = Phrases(tokens, min_count=2, threshold=3, delimiter='_')
the error is the following:
TypeError: sequence item 0: expected str instance, bytes found