A library for optimising and quantizing HF transformer models with support for ONNXRuntime and Intel OpenVino.
Search
Mar 07, 20241 min read
A library for optimising and quantizing HF transformer models with support for ONNXRuntime and Intel OpenVino.