@@ -167,10 +167,23 @@ French and English both use a lot of single-byte whitespace characters, but Fren
 | `stdlib::split(char::is_whitespace)` | 0.77 GiB/s | 1.87 GiB/s | 1.04 GiB/s | 0.72 GiB/s | 0.98 GiB/s |
 | `icu::WhiteSpace` | 0.11 GiB/s | 0.16 GiB/s | 0.15 GiB/s | 0.12 GiB/s | 0.15 GiB/s |
 | | | | | | |
-| Split around 9 newline combinations: | | | | | |
+| Split around 8 newline combinations: | | | | | |
 | `stringzilla::utf8_newline_splits` | 15.45 GiB/s | 16.65 GiB/s | 18.34 GiB/s | 14.52 GiB/s | 16.71 GiB/s |
 | `stdlib::split(char::is_unicode_newline)` | 1.90 GiB/s | 1.93 GiB/s | 1.82 GiB/s | 1.78 GiB/s | 1.81 GiB/s |
 
+On Apple M2 Pro:
+
+| Library | English | Chinese | Arabic | French | Korean |
+| ----------------------------------------- | ---------: | ---------: | ---------: | ---------: | ---------: |
+| Split around 25 whitespace characters: | | | | | |
+| `stringzilla::utf8_whitespace_splits` | 0.57 GiB/s | 2.45 GiB/s | 1.18 GiB/s | 0.61 GiB/s | 0.92 GiB/s |
+| `stdlib::split(char::is_whitespace)` | 0.59 GiB/s | 1.16 GiB/s | 0.99 GiB/s | 0.63 GiB/s | 0.89 GiB/s |
+| `icu::WhiteSpace` | 0.10 GiB/s | 0.16 GiB/s | 0.14 GiB/s | 0.11 GiB/s | 0.14 GiB/s |
+| | | | | | |
+| Split around 8 newline combinations: | | | | | |
+| `stringzilla::utf8_newline_splits` | 5.69 GiB/s | 6.24 GiB/s | 6.58 GiB/s | 6.70 GiB/s | 6.29 GiB/s |
+| `stdlib::split(char::is_unicode_newline)` | 1.12 GiB/s | 1.11 GiB/s | 1.11 GiB/s | 1.11 GiB/s | 1.13 GiB/s |
+
 ## Sequence Operations
 
 Rust has several Dataframe libraries, DBMS and Search engines that heavily rely on string sorting and intersections.