    Repositories list

    • litellm

      Public
      Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sagemaker, HuggingFace, Replicate, Groq]
      Python
      4.1k000 Updated Sep 2, 2025
    • lorax

      Public
      Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
      Python
      2633.4k14829 Updated May 21, 2025
    • Python
      22020 Updated Sep 5, 2024
    • Jupyter Notebook
      1600 Updated Mar 3, 2024
    • Best practices for distilling large language models.
      Jupyter Notebook
      4857410 Updated Feb 1, 2024
    • The official Python client for the Huggingface Hub.
      Python
      794000 Updated Dec 18, 2023
    • volcano

      Public archive
      A Cloud Native Batch System (Project under CNCF)
      Go
      1.2k001 Updated Dec 4, 2023
    • punica

      Public
      Serving multiple LoRA finetuned LLM as one
      Cuda
      53200 Updated Nov 24, 2023
    • volcano-apis

      Public archive
      The API (CRD) of Volcano
      Go
      88000 Updated Nov 8, 2023
    • LlamaIndex (GPT Index) is a data framework for your LLM applications
      Python
      6.4k000 Updated Aug 1, 2023
    • langchain

      Public
      ⚡ Building applications with LLMs through composability ⚡
      Python
      19k000 Updated Jul 20, 2023
    • Kubernetes Image Puller is used for caching images on a cluster. It creates a DaemonSet downloading and running the relevant container images on each node.
      Go
      35000 Updated Apr 20, 2023
    • PyBump

      Public
      Bump version in Helm Chart.yaml and setup.py files
      Python
      8000 Updated Dec 22, 2022
    • server

      Public
      The Triton Inference Server provides an optimized cloud and edge inferencing solution.
      Python
      1.6k000 Updated Oct 22, 2022
    • An MLOps framework to package, deploy, monitor and manage thousands of production machine learning models
      HTML
      847000 Updated Aug 24, 2022
    • dask-sql

      Public
      Distributed SQL Engine in Python using Dask
      Python
      72100 Updated Apr 5, 2022
    • Python
      14100 Updated Feb 23, 2022
    • neuropod

      Public
      A uniform interface to run deep learning models from multiple frameworks
      C++
      76300 Updated Feb 23, 2022
    • GitHub action for identifying the last successful commit for a given workflow and branch.
      JavaScript
      52000 Updated Jan 5, 2021