Qubitium

Follow

Qubitium-ModelCloud Qubitium

Follow

Golang, Python, Kotlin, Swift. I prefer strongly typed languages and I do not worship PEP. @ModelCloudAi

42 followers · 54 following

ModelCloud.ai
Earth/Epoch 2.0
https://modelcloud.ai
@qubitium

Achievements

Achievements

Pinned Loading

ModelCloud/GPTQModel ModelCloud/GPTQModel Public

Production ready LLM model compression/quantization toolkit with accelerated inference support for both cpu/gpu via HF, vLLM, and SGLang.

Python 122 26
sgl-project/sglang sgl-project/sglang Public

SGLang is a fast serving framework for large language models and vision language models.

Python 6.1k 510
vllm-project/vllm vllm-project/vllm Public

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 30.4k 4.6k
AutoGPTQ/AutoGPTQ AutoGPTQ/AutoGPTQ Public

An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.

Python 4.5k 484
flashinfer-ai/flashinfer flashinfer-ai/flashinfer Public

FlashInfer: Kernel Library for LLM Serving

Cuda 1.4k 137
Dao-AILab/flash-attention Dao-AILab/flash-attention Public

Fast and memory-efficient exact attention

Python 14.3k 1.3k