fix a wrong str interpolation in adocstring and some whitespace

ablaom · ablaom · commit 5565098917e5 · 2024-01-09T09:46:09.000+13:00
diff --git a/src/bm25_transformer.jl b/src/bm25_transformer.jl
@@ -137,21 +137,21 @@ In MLJ or MLJBase, bind an instance `model` to data with
 
     mach = machine(model, X)
 
-$DOC_IDF
+$DOC_TRANSFORMER_INPUTS
 
 Train the machine using `fit!(mach, rows=...)`.
 
 # Hyper-parameters
 
-- `max_doc_freq=1.0`: Restricts the vocabulary that the transformer will consider.
-  Terms that occur in `> max_doc_freq` documents will not be considered by the
-  transformer. For example, if `max_doc_freq` is set to 0.9, terms that are in more than
-  90% of the documents will be removed.
+- `max_doc_freq=1.0`: Restricts the vocabulary that the transformer will consider. Terms
+  that occur in `> max_doc_freq` documents will not be considered by the transformer. For
+  example, if `max_doc_freq` is set to 0.9, terms that are in more than 90% of the
+  documents will be removed.
 
-- `min_doc_freq=0.0`: Restricts the vocabulary that the transformer will consider.
-  Terms that occur in `< max_doc_freq` documents will not be considered by the
-  transformer. A value of 0.01 means that only terms that are at least in 1% of the
-  documents will be included.
+- `min_doc_freq=0.0`: Restricts the vocabulary that the transformer will consider. Terms
+  that occur in `< max_doc_freq` documents will not be considered by the transformer. A
+  value of 0.01 means that only terms that are at least in 1% of the documents will be
+  included.
 
 - `κ=2`: The term frequency saturation characteristic. Higher values represent slower
   saturation. What we mean by saturation is the degree to which a term occurring extra
diff --git a/src/count_transformer.jl b/src/count_transformer.jl
@@ -94,15 +94,15 @@ Train the machine using `fit!(mach, rows=...)`.
 
 # Hyper-parameters
 
-- `max_doc_freq=1.0`: Restricts the vocabulary that the transformer will consider.
-  Terms that occur in `> max_doc_freq` documents will not be considered by the
-  transformer. For example, if `max_doc_freq` is set to 0.9, terms that are in more than
-  90% of the documents will be removed.
-
-- `min_doc_freq=0.0`: Restricts the vocabulary that the transformer will consider.
-  Terms that occur in `< max_doc_freq` documents will not be considered by the
-  transformer. A value of 0.01 means that only terms that are at least in 1% of the
-  documents will be included.
+- `max_doc_freq=1.0`: Restricts the vocabulary that the transformer will consider. Terms
+  that occur in `> max_doc_freq` documents will not be considered by the transformer. For
+  example, if `max_doc_freq` is set to 0.9, terms that are in more than 90% of the
+  documents will be removed.
+
+- `min_doc_freq=0.0`: Restricts the vocabulary that the transformer will consider. Terms
+  that occur in `< max_doc_freq` documents will not be considered by the transformer. A
+  value of 0.01 means that only terms that are at least in 1% of the documents will be
+  included.
 
 # Operations