A library for optimising and quantizing HF transformer models with support for ONNXRuntime and Intel OpenVino.
Search
Apr 25, 20251 min read
A library for optimising and quantizing HF transformer models with support for ONNXRuntime and Intel OpenVino.