daac-tools

Pinned

  1. daachorse Public

    🐎 A fast implementation of the Aho-Corasick algorithm using the compact double-array data structure in Rust.

    Rust 243 21

  2. vaporetto Public

    🛥 Vaporetto: Very accelerated pointwise prediction based tokenizer

    Rust 250 10

  3. crawdad Public

    🦞 Rust library of natural language dictionaries using character-wise double-array tries.

    Rust 36 3

  4. vibrato Public

    🎤 vibrato: Viterbi-based accelerated tokenizer

    Rust 398 23

  5. rucrf Public

    Conditional Random Fields implemented in pure Rust

    Rust 12 4

  6. trie-match Public

    Fast match expression optimized for string comparison

    Rust 40

Repositories

Showing 10 of 13 repositories
  • vibrato Public

    🎤 vibrato: Viterbi-based accelerated tokenizer

    Rust 398 Apache-2.0 23 7 0 Updated Feb 7, 2026
  • vaporetto Public

    🛥 Vaporetto: Very accelerated pointwise prediction based tokenizer

    Rust 250 Apache-2.0 10 3 3 Updated Feb 7, 2026
  • daachorse Public

    🐎 A fast implementation of the Aho-Corasick algorithm using the compact double-array data structure in Rust.

    Rust 243 Apache-2.0 21 1 2 Updated Jan 26, 2026
  • python-vaporetto Public

    🛥 Vaporetto is a fast and lightweight pointwise prediction based tokenizer. This is a Python wrapper for Vaporetto.

    Rust 21 Apache-2.0 1 0 0 Updated Jun 1, 2025
  • rucrf Public

    Conditional Random Fields implemented in pure Rust

    Rust 12 Apache-2.0 4 0 0 Updated Mar 17, 2025
  • python-daachorse Public

    🐎 A fast implementation of the Aho-Corasick algorithm using the compact double-array data structure. (Python wrapper for daachorse)

    Rust 20 Apache-2.0 1 1 0 Updated Mar 15, 2025
  • find-simdoc Public

    Finding all pairs of similar documents time- and memory-efficiently

    Rust 62 Apache-2.0 3 1 0 Updated Mar 13, 2025
  • crawdad Public

    🦞 Rust library of natural language dictionaries using character-wise double-array tries.

    Rust 36 Apache-2.0 3 0 0 Updated Jan 13, 2025
  • python-vibrato Public

    Viterbi-based accelerated tokenizer (Python wrapper)

    Rust 43 Apache-2.0 1 0 0 Updated Sep 4, 2024
  • trie-match Public

    Fast match expression optimized for string comparison

    Rust 40 Apache-2.0 0 0 0 Updated Jan 29, 2024