A library for optimizing and quantizing Hugging Face Transformers models, with support for ONNX Runtime and Intel OpenVINO.
Nov 04, 2024 · 1 min read