    Repositories list

    • MKA

      Public
      [ACM CF'26 Oral] MKA: Memory-Keyed Attention for Efficient Long-Context Reasoning
      Python
      1600 Updated Mar 31, 2026
    • CSV-Decode: Certifiable Sub-Vocabulary Decoding for Efficient Large Language Model Inference
      Python
      01200 Updated Feb 26, 2026
    • [FPGA'26 Highlight] CXL-SpecKV: A Disaggregated FPGA Speculative KV-Cache for Datacenter LLM Serving
      C++
      22300 Updated Feb 23, 2026
    • [ACM MM 2025 Oral] TinyServe: Query-Aware Page Allocation Optimization
      Shell
      21000 Updated Jan 18, 2026
    • SPI_VecDB

      Public
      Distributed Parallel Multi-Resolution Vector Search
      Go
      Apache License 2.0
      01000 Updated Jan 16, 2026
    • HSGM

      Public
      [ICPADS 2025 Oral, *SEM 2025 Oral] HSGM: Hierarchical Segment-Graph Memory for Scalable Long-Text Semantics
      Python
      MIT License
      0800 Updated Nov 23, 2025
    • SemToken

      Public
      [IWCS 2025 Oral] SemToken: Semantic-Aware Tokenization for Efficient Long-Context Language Modeling
      Python
      0500 Updated Sep 21, 2025
    • QTM

      Public
      https://www.arxiv.org/abs/2508.13204
      Python
      3000 Updated Sep 21, 2025