January 25, 2023

Improve BERT inference speed by combining the power of Optimum, OpenVINO™, ONNX Runtime, and Azure

Make large models smaller and faster with the OpenVINO™ Execution Provider, NNCF, and ONNX Runtime, leveraging Azure Machine Learning.