ONNX Runtime Web unleashes generative AI in the browser using WebGPU
ONNX Runtime Web featuring WebGPU is now available in the ONNX Runtime 1.17 release—unlocking new possibilities.
Continuing the ONNX Runtime On-Device Training blog series, we are introducing ONNX Runtime Training for Web.
ONNX Runtime harnesses Intel® AMX to accelerate performance for the 4th Gen Intel® Xeon® CPUs.
Using ONNX Runtime to unlock the promise of developments in science for solving real-world problems.
Building upon the foundation we established earlier, this blog will present comprehensive information about the underlying details of…
Introducing Olive, an easy-to-use toolchain for optimizing models with hardware awareness. With Olive, you don't need to be…
Intel has collaborated with Microsoft to integrate Intel® Neural Compressor into Olive, enabling developers to easily take advantage…
ONNX Runtime is a high-performance cross-platform inference and training engine that can run a variety of machine learning…
In this blog post, we’ll share the challenges our team faced and how ONNX Runtime solves these as the…
The team at Pieces shares the problems and solutions evaluated for their on-device model serving stack and how…
Make large models smaller and faster with the OpenVINO Execution Provider, NNCF, and ONNX Runtime, leveraging Azure Machine Learning.
We’re excited to share the recent integration of ONNX Runtime in Apache OpenNLP! Apache OpenNLP is a Java…