From 16942e9cbdcde72c3c1a43124b1439f57022c6d2 Mon Sep 17 00:00:00 2001
From: Christopher Alexander
Date: Sat, 26 Oct 2024 10:31:08 -0400
Subject: [PATCH] remove reference to library for tokenization in example code

---
 README.md | 6 +++---
 1 file changed, 3 insertions(+), 3 deletions(-)

diff --git a/README.md b/README.md
index 303ae65..e3d0f89 100644
--- a/README.md
+++ b/README.md
@@ -23,7 +23,7 @@ The TF-IDF Transformer accepts a variety of inputs for the raw documents that on
 Raw documents can simply be provided as tokenized documents.
 
 ```julia
-using MLJ, MLJText, TextAnalysis
+using MLJ, MLJText, WordTokenizers
 
 docs = ["Hi my name is Sam.", "How are you today?"]
 tfidf_transformer = TfidfTransformer()
@@ -61,7 +61,7 @@ BM25 is an approach similar to that of TF-IDF in terms of representing documents
 This transformer is used in much the same way as the `TfidfTransformer`.
 
 ```julia
-using MLJ, MLJText, TextAnalysis
+using MLJ, MLJText, WordTokenizers
 
 docs = ["Hi my name is Sam.", "How are you today?"]
 bm25_transformer = BM25Transformer()
@@ -94,7 +94,7 @@ The `MLJText` package also offers a way to represent documents using the simpler
 ### Usage
 
 ```julia
-using MLJ, MLJText, TextAnalysis
+using MLJ, MLJText, WordTokenizers
 
 docs = ["Hi my name is Sam.", "How are you today?"]
 count_transformer = CountTransformer()