Skip to content

Commit 247004a

Browse files
committed
Hiding unfinished cluster command
1 parent b2f771d commit 247004a

File tree

6 files changed

+22
-26
lines changed

6 files changed

+22
-26
lines changed

CHANGELOG.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -4,6 +4,7 @@
44

55
*Features*
66

7+
* Adding `xan bisect`.
78
* Adding `xan flatten -N/--non-empty`.
89

910
*Fixes*

README.md

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -637,7 +637,6 @@ To access the expression language's [cheatsheet](./docs/moonblade/cheatsheet.md)
637637
638638
- [**tokenize**](./docs/cmd/tokenize.md): Tokenize a text column
639639
- [**vocab**](./docs/cmd/vocab.md): Build a vocabulary over tokenized documents
640-
- [**cluster**](./docs/cmd/cluster.md): Cluster CSV data to find near-duplicates
641640
642641
*Matrix & network-related commands*
643642

docs/cmd/bisect.md

Lines changed: 19 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -2,20 +2,37 @@
22
# xan bisect
33

44
```txt
5-
Search for rows where the value in <column> matches <value> using binary search.
6-
It is assumed that the INPUT IS SORTED according to the specified column.
5+
Search for rows where the value in <column> matches <value> using binary search,
6+
and flush all records after the target value.
7+
The default behavior is similar to a lower_bound bisection, but you can exclude
8+
records (equivalent to upper_bound) with the target value using the -E/--exclude
9+
flag. It is assumed that the INPUT IS SORTED according to the specified column.
710
The ordering of the rows is assumed to be sorted according ascending lexicographic
811
order per default, but you can specify numeric ordering using the -N or --numeric
912
flag. You can also reverse the order using the -R/--reverse flag.
13+
Use the -S/--search flag to only flush records matching the target value instead
14+
of all records after it.
1015
1116
Usage:
1217
xan bisect [options] [--] <column> <value> <input>
1318
xan bisect --help
1419
1520
bisect options:
21+
-E, --exclude When set, the records with the target value will be
22+
excluded from the output. By default, they are
23+
included. Cannot be used with -S/--search.
24+
TODO: not equivalent to upper_bound
1625
-N, --numeric Compare according to the numerical value of cells
1726
instead of the default lexicographic order.
1827
-R, --reverse Reverse sort order, i.e. descending order.
28+
-S, --search Perform a search on the target value instead of
29+
flushing all records after the value (included).
30+
Cannot be used with -E/--exclude nor -e/--end.
31+
-e, --end <end-value> When set, the records after the target value will be
32+
flushed until <end-value> is reached (included).
33+
By default, all records after the target value are
34+
flushed. Cannot be used with -S/--search.
35+
-v, --verbose
1936
2037
Common options:
2138
-h, --help Display this message

docs/cmd/cluster.md

Lines changed: 0 additions & 22 deletions
This file was deleted.

docs/cmd/flatten.md

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -50,6 +50,8 @@ flatten options:
5050
-H, --highlight <pat> Highlight in red parts of text cells matching given regex
5151
pattern. Will not work with -R/--rainbow.
5252
-i, --ignore-case If given, pattern given to -H/--highlight will be case-insensitive.
53+
-N, --non-empty For each row, only show non-empty values. This can be useful
54+
when the data is sparse.
5355
5456
Common options:
5557
-h, --help Display this message

src/main.rs

Lines changed: 0 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -120,7 +120,6 @@ macro_rules! command_list {
120120
## Lexicometry & fuzzy matching
121121
tokenize Tokenize a text column
122122
vocab Build a vocabulary over tokenized documents
123-
cluster Cluster CSV data to find near-duplicates
124123
125124
## Matrix & network-related commands
126125
matrix Convert CSV data to matrix data

0 commit comments

Comments
 (0)