Transformer Model Optimization Demo
Test quantization on DistilBERT for faster edge inference. Toggle quantization to see speed gains.
text
Use 8-bit Quantization
Clear
Submit
output
Share via Link