ModelTC

Model Infra

Pinned

  1. MQBench Public

    Model Quantization Benchmark

    Python · 801 stars · 142 forks

  2. United-Perception Public

    United Perception

    Python · 432 stars · 67 forks

  3. Dipoorlet Public

    Offline Quantization Tools for Deploy.

    Python · 127 stars · 17 forks

  4. lightllm Public

    LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

    Python · 3.2k stars · 250 forks

  5. llmc Public

    [EMNLP 2024 Industry Track] This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit".

    Python · 464 stars · 53 forks

  6. OmniBal Public

    Python · 20 stars · 3 forks

Repositories

Showing 10 of 49 repositories
  • HarmoniCa Public

    [ICML 2025] This is the official PyTorch implementation of "HarmoniCa: Harmonizing Training and Inference for Better Feature Caching in Diffusion Transformer Acceleration".

    1 star · Apache-2.0 · 0 forks · 0 issues · 0 pull requests · Updated May 1, 2025
  • lightllm Public

    LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

    Python · 3,180 stars · Apache-2.0 · 250 forks · 75 issues · 8 pull requests · Updated Apr 30, 2025
  • lightx2v Public

    Light Video Generation Inference Framework

    Python · 15 stars · 6 forks · 0 issues · 2 pull requests · Updated Apr 30, 2025
  • llmc Public

    [EMNLP 2024 Industry Track] This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit".

    Python · 464 stars · Apache-2.0 · 53 forks · 29 issues · 0 pull requests · Updated Apr 29, 2025
  • lightx2v_comfyui_node

    0 stars · 0 forks · 0 issues · 0 pull requests · Updated Apr 28, 2025
  • flash-attn-3-build

    Dockerfile · 0 stars · 0 forks · 0 issues · 0 pull requests · Updated Apr 24, 2025
  • general-sam-py Public

    Python bindings for general-sam and some utilities

    Python · 3 stars · Apache-2.0 · 0 forks · 0 issues · 1 pull request · Updated Apr 22, 2025
  • MQBench Public

    Model Quantization Benchmark

    Python · 801 stars · Apache-2.0 · 142 forks · 7 issues · 5 pull requests · Updated Apr 20, 2025
  • flash-attention Public Forked from Dao-AILab/flash-attention

    Fast and memory-efficient exact attention

    Python · 0 stars · BSD-3-Clause · 1,666 forks · 0 issues · 0 pull requests · Updated Apr 17, 2025
  • greedy-tokenizer Public

    Greedily tokenize strings with the longest tokens iteratively.

    Python · 0 stars · Apache-2.0 · 0 forks · 0 issues · 1 pull request · Updated Mar 24, 2025

People

This organization has no public members. You must be a member to see who’s a part of this organization.
