Skip to content
Change the repository type filter

All

    Repositories list

    • Data model for refinery. Manages entities and their access for multiple services, e.g. the gateway.
      Python
      Apache License 2.0
      1204Updated Dec 20, 2024Dec 20, 2024
    • Scripts used for Kern AI CI/CD efforts
      Shell
      0011Updated Dec 13, 2024Dec 13, 2024
    • Weak supervision for refinery. Manages the integration of heuristics such as labeling functions, active learners or zero-shot classifiers. Uses the weak-nlp library for the actual integration logic and algorithms.
      Python
      Apache License 2.0
      1001Updated Dec 13, 2024Dec 13, 2024
    • Neural search for refinery. Manages similarity search powered by Qdrant and outlier detection, both based on vector representations of the project records.
      Python
      Apache License 2.0
      1502Updated Dec 13, 2024Dec 13, 2024
    • Embedder for refinery. Manages the creation of document- and token-level embeddings using the embedders library.
      Python
      Apache License 2.0
      11015Updated Dec 13, 2024Dec 13, 2024
    • Updater for refinery. Manages migration logic to new versions if required.
      Python
      Apache License 2.0
      1001Updated Dec 13, 2024Dec 13, 2024
    • Tokenizer for refinery. Manages the creation and storage of spaCy tokens for text-based record attributes and supports multiple language models. It is used by the gateway.
      Python
      Apache License 2.0
      1101Updated Dec 13, 2024Dec 13, 2024
    • Gateway for refinery. Manages incoming requests and holds the workflow logic. To interact with the gateway, the UI or Python SDK can be used.
      Python
      Apache License 2.0
      3025Updated Dec 13, 2024Dec 13, 2024
    • Execution environment for the active learning module in refinery. Containerized function as a service to build active learning models using scikit-learn and sequence-learn.
      Python
      Apache License 2.0
      10112Updated Dec 13, 2024Dec 13, 2024
    • Submodule which contains the requirements of the different parent images of refinery.
      Python
      Apache License 2.0
      0107Updated Dec 13, 2024Dec 13, 2024
    • Defines parent image for the Docker images of the refinery services which require the integration of the model and the s3 submodule.
      Shell
      Apache License 2.0
      0001Updated Dec 13, 2024Dec 13, 2024
    • Dockerfile
      Apache License 2.0
      0001Updated Dec 11, 2024Dec 11, 2024
    • refinery

      Public
      The data scientist's open-source choice to scale, assess and maintain natural language data. Treat training data like a software artifact.
      Python
      Apache License 2.0
      691.4k700Updated Dec 9, 2024Dec 9, 2024
    • A PDF to Markdown converter
      JavaScript
      209000Updated Dec 9, 2024Dec 9, 2024
    • Websocket module for refinery. Enables asynchronous notifications inside the application.
      Go
      Apache License 2.0
      1000Updated Dec 9, 2024Dec 9, 2024
    • Gateway proxy for refinery. Manages incoming requests and forwards them to the gateway. Used by the Python SDK.
      Python
      Apache License 2.0
      2000Updated Dec 9, 2024Dec 9, 2024
    • TypeScript
      0002Updated Dec 9, 2024Dec 9, 2024
    • Evaluates whether a user has access to certain resources.
      Python
      Apache License 2.0
      2000Updated Dec 9, 2024Dec 9, 2024
    • TypeScript
      0100Updated Dec 9, 2024Dec 9, 2024
    • Execution environment for labeling functions in refinery. Containerized function as a service to execute user-defined Python scripts.
      Python
      Apache License 2.0
      1009Updated Dec 9, 2024Dec 9, 2024
    • Execution environment for attribute calculation in refinery. Containerized function as a service to build custom attributes derived from the original data.
      Python
      Apache License 2.0
      1017Updated Dec 9, 2024Dec 9, 2024
    • Defines parent image for the Docker images of the refinery services which provide an execution environment.
      Shell
      Apache License 2.0
      0001Updated Dec 9, 2024Dec 9, 2024
    • Defines parent image for the Docker images of the refinery services that require torch (gpu).
      Shell
      Apache License 2.0
      0001Updated Dec 6, 2024Dec 6, 2024
    • Defines parent image for the Docker images of the refinery services that require torch (cpu).
      Shell
      Apache License 2.0
      0001Updated Dec 6, 2024Dec 6, 2024
    • Defines parent image for the Docker images of the refinery services with the smallest set of requirements.
      Shell
      Apache License 2.0
      0001Updated Dec 6, 2024Dec 6, 2024
    • TypeScript
      0000Updated Dec 4, 2024Dec 4, 2024
    • Official Python SDK for Kern AI refinery.
      Python
      Apache License 2.0
      31801Updated Nov 14, 2024Nov 14, 2024
    • TypeScript
      0000Updated Nov 11, 2024Nov 11, 2024
    • refinery-config

      Public archive
      Configuration of refinery. Manages amongst others endpoints and available language models for spaCy.
      Python
      Apache License 2.0
      1110Updated Nov 7, 2024Nov 7, 2024
    • refinery-zero-shot

      Public archive
      Zero-shot module for refinery. Enables the integration of 🤗 Hugging Face zero-shot classifiers as an off-the-shelf no-code heuristic.
      Python
      Apache License 2.0
      1000Updated Oct 25, 2024Oct 25, 2024