Tutorials and demos January 25, 2023 5 min read Improve BERT inference speed by combining the power of Optimum, OpenVINO™, ONNX Runtime, and Azure Make large models smaller and faster with OpenVino Execution Provider, NNCF and ONNX Runtime leveraging Azure Machine Learning.