low-precision

Here are 7 public repositories matching this topic...

intel / neural-compressor

SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime

sparsity pruning quantization knowledge-distillation auto-tuning int8 low-precision quantization-aware-training post-training-quantization awq int4 large-language-models gptq smoothquant sparsegpt fp4 mxformat

Updated Nov 15, 2024
Python

Tiiiger / QPyTorch

Star

Low Precision Arithmetic Simulation in PyTorch

learning low-precision

Updated May 20, 2024
Python

gudovskiy / ShiftCNN

Star

A script to convert floating-point CNN models into generalized low-precision ShiftCNN representation

cnn dnn low-precision

Updated Jul 14, 2017
Python

sefaburakokcu / quantized-yolov5

Star

Low Precision(quantized) Yolov5

fpga yolov1 finn low-precision quantized-neural-networks pynq-z2 brevitas yolov5

Updated Jan 28, 2024
Python

gudovskiy / fmap_compression

Star

Code for DNN feature map compression paper

compression caffe cnn dnn feature-map low-precision

Updated Nov 21, 2018
C++

graphcore-research / jax-scalify

Star

JAX Scalify: end-to-end scaled arithmetics

jax low-precision llm fp8

Updated Oct 30, 2024
Python

AmanPriyanshu / LinearCosine

Sponsor

Star

LinearCosine: Adding beats multiplying for lower-precision efficient cosine similarity

nlp benchmarking machine-learning computer-vision deep-learning algorithms cpp optimization linear-algebra artificial-intelligence computation matrix-multiplication neural-networks cosine-similarity floating-point quantization energy-efficiency performance-optimization low-precision

Updated Oct 21, 2024
C++

Improve this page

Add a description, image, and links to the low-precision topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the low-precision topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

low-precision

Here are 7 public repositories matching this topic...

intel / neural-compressor

Tiiiger / QPyTorch

gudovskiy / ShiftCNN

sefaburakokcu / quantized-yolov5

gudovskiy / fmap_compression

graphcore-research / jax-scalify

AmanPriyanshu / LinearCosine

Improve this page

Add this topic to your repo