    Repositories list

    • Gateway for refinery. Manages incoming requests and holds the workflow logic. To interact with the gateway, the UI or Python SDK can be used.
      Python
      Apache License 2.0
      3126 Updated Mar 3, 2026
    • Data model for refinery. Manages entities and their access for multiple services, e.g. the gateway.
      Python
      Apache License 2.0
      1306 Updated Mar 3, 2026
    • Embedder for refinery. Manages the creation of document- and token-level embeddings using the embedders library.
      Python
      Apache License 2.0
      1101 Updated Mar 2, 2026
    • Updater for refinery. Manages migration logic to new versions if required.
      Python
      Apache License 2.0
      1001 Updated Mar 2, 2026
    • TypeScript
      0001 Updated Feb 27, 2026
    • Evaluates whether a user has access to certain resources.
      Python
      Apache License 2.0
      2001 Updated Feb 25, 2026
    • TypeScript
      0000 Updated Feb 23, 2026
    • .github (Public)
      0100 Updated Feb 17, 2026
    • TypeScript
      0000 Updated Feb 12, 2026
    • Websocket module for refinery. Enables asynchronous notifications inside the application.
      Go
      Apache License 2.0
      1000 Updated Feb 11, 2026
    • TypeScript
      0100 Updated Feb 10, 2026
    • refinery-ml-exec-env (Public)
      Execution environment for the active learning module in refinery. Containerized function as a service to build active learning models using scikit-learn and seq…
      Python
      Apache License 2.0
      1010 Updated Feb 9, 2026
    • Execution environment for labeling functions in refinery. Containerized function as a service to execute user-defined Python scripts.
      Python
      Apache License 2.0
      1002 Updated Feb 9, 2026
    • Execution environment for attribute calculation in refinery. Containerized function as a service to build custom attributes derived from the original data.
      Python
      Apache License 2.0
      1013 Updated Feb 9, 2026
    • Neural search for refinery. Manages similarity search powered by Qdrant and outlier detection, both based on vector representations of the project records.
      Python
      Apache License 2.0
      1500 Updated Feb 9, 2026
    • Weak supervision for refinery. Manages the integration of heuristics such as labeling functions, active learners or zero-shot classifiers. Uses the weak-nlp lib…
      Python
      Apache License 2.0
      1000 Updated Feb 9, 2026
    • Tokenizer for refinery. Manages the creation and storage of spaCy tokens for text-based record attributes and supports multiple language models. It is used by t…
      Python
      Apache License 2.0
      1100 Updated Feb 9, 2026
    • S3-related AWS and MinIO logic.
      Python
      Apache License 2.0
      1000 Updated Feb 9, 2026
    • Go Template
      Apache License 2.0
      0001 Updated Feb 5, 2026
    • Defines parent image for the Docker images of the refinery services which provide an execution environment.
      Shell
      Apache License 2.0
      0000 Updated Feb 5, 2026
    • Defines parent image for the Docker images of the refinery services that require torch (cpu).
      Shell
      Apache License 2.0
      0000 Updated Feb 5, 2026
    • Defines parent image for the Docker images of the refinery services that require torch (gpu).
      Shell
      Apache License 2.0
      0000 Updated Feb 5, 2026
    • Defines parent image for the Docker images of the refinery services which require the integration of the model and the S3 submodule.
      Shell
      Apache License 2.0
      0000 Updated Feb 5, 2026
    • Defines parent image for the Docker images of the refinery services with the smallest set of requirements.
      Shell
      Apache License 2.0
      0000 Updated Feb 5, 2026
    • Submodule which contains the requirements of the different parent images of refinery.
      Python
      Apache License 2.0
      0100 Updated Feb 5, 2026
    • Scripts used for Kern AI CI/CD efforts.
      Shell
      0011 Updated Feb 4, 2026
    • JavaScript
      0000 Updated Feb 4, 2026
    • embedders (Public)
      With embedders, you can easily convert your texts into sentence- or token-level embeddings within a few lines of code. Use cases for this include similarity sea…
      Python
      Apache License 2.0
      22111 Updated Jul 14, 2025
    • Gateway proxy for refinery. Manages incoming requests and forwards them to the gateway. Used by the Python SDK.
      Python
      Apache License 2.0
      2000 Updated Jul 10, 2025
    • bricks (Public)
      Open-source natural language enrichments at your fingertips.
      Python
      Apache License 2.0
      244627310 Updated Jan 14, 2025