/** Array of character scripts for which to disable bigrams. */
ignored_scripts?: CjkBigramIgnoredScript[]
/** If `true`, emit tokens in both bigram and unigram form. If `false`, a CJK character is output in unigram form when it has no adjacent characters. Defaults to `false`. */
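As a rough illustration of how `ignored_scripts` might be used, the sketch below declares a small settings object and checks whether bigrams stay enabled for a given script. The script names in the `CjkBigramIgnoredScript` union are an assumption (the real definition is not part of the lines shown), and `CjkBigramSettings`, `cjkSettings`, and `bigramsEnabledFor` are hypothetical names introduced only for this example.

```typescript
// Assumed script names; the actual CjkBigramIgnoredScript definition is not shown in this diff.
type CjkBigramIgnoredScript = 'han' | 'hangul' | 'hiragana' | 'katakana'

// Hypothetical stand-in that covers only the field documented above.
interface CjkBigramSettings {
  ignored_scripts?: CjkBigramIgnoredScript[]
}

// Example settings: disable bigrams for Hangul and Katakana only.
const cjkSettings: CjkBigramSettings = {
  ignored_scripts: ['hangul', 'katakana'],
}

// Returns true when bigrams are still produced for the given script,
// i.e. the script is not listed in ignored_scripts.
function bigramsEnabledFor(
  script: CjkBigramIgnoredScript,
  settings: CjkBigramSettings,
): boolean {
  const ignored = settings.ignored_scripts ?? []
  return ignored.indexOf(script) === -1
}
```

Leaving `ignored_scripts` undefined in this sketch keeps bigrams enabled for every script, which matches the field being optional.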
/** Number of buckets to which hashes are assigned. Defaults to `512`. */
bucket_count?: integer
/** Number of ways to hash each token in the stream. Defaults to `1`. */
hash_count?: integer
/** Number of hashes to keep from each bucket. Defaults to `1`.
 * Hashes are retained in ascending order, starting with the bucket’s smallest hash. */
hash_set_size?: integer
/** If `true`, the filter fills empty buckets with the value of the first non-empty bucket to its circular right if the `hash_set_size` is `1`. If the `bucket_count` argument is greater than 1, this parameter defaults to `true`. Otherwise, this parameter defaults to `false`. */
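The defaults and the rotation behaviour described in these comments can be sketched as follows. This is only an illustration of the wording above, not the filter's actual implementation: `MinHashSettings`, `withDefaults`, `rotationDefault`, and `fillEmptyBuckets` are hypothetical names, and each bucket is modelled as a single number or `null` (the `hash_set_size` of `1` case).

```typescript
// Hypothetical stand-in for the three fields documented above.
interface MinHashSettings {
  bucket_count?: number
  hash_count?: number
  hash_set_size?: number
}

// Apply the documented defaults: 512 buckets, 1 hash function, 1 hash kept per bucket.
function withDefaults(s: MinHashSettings): Required<MinHashSettings> {
  return {
    bucket_count: s.bucket_count ?? 512,
    hash_count: s.hash_count ?? 1,
    hash_set_size: s.hash_set_size ?? 1,
  }
}

// Documented default for the rotation flag: true when bucket_count is greater than 1.
function rotationDefault(bucketCount: number): boolean {
  return bucketCount > 1
}

// Fill each empty (null) bucket with the value of the first non-empty bucket
// to its circular right. Lookups use the original array, so filled values
// never cascade. If every bucket is empty, the result is unchanged.
function fillEmptyBuckets(buckets: (number | null)[]): (number | null)[] {
  const n = buckets.length
  return buckets.map((value, i) => {
    if (value !== null) return value
    for (let step = 1; step < n; step++) {
      const candidate = buckets[(i + step) % n] ?? null
      if (candidate !== null) return candidate
    }
    return null
  })
}
```

For example, under these assumptions `fillEmptyBuckets([null, 5, null, null, 9])` yields `[5, 5, 9, 9, 9]`, and `fillEmptyBuckets([3, null])` wraps around to yield `[3, 3]`.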