ONNX Runtime Web unleashes generative AI in the browser using WebGPU
ONNX Runtime Web featuring WebGPU is now available in the ONNX Runtime 1.17 release—unlocking new possibilities.
Continuing the ONNX Runtime On-Device Training blog series, we are introducing ONNX Runtime Training for Web.
ONNX Runtime harnesses Intel® AMX to accelerate performance for the 4th Gen Intel® Xeon® CPUs.
Using ONNX Runtime to unlock the promise of scientific advances for solving real-world problems.
Building upon the foundation we established earlier, this blog presents the underlying details of training models directly on user devices with ORT. Equipped with these technical details, we encourage you to try On-Device Training with ONNX Runtime for your own scenario.
Introducing Olive, an easy-to-use toolchain for optimizing models with hardware awareness. With Olive, you don't need to be an expert to explore diverse hardware optimization toolchains.
Intel has collaborated with Microsoft to integrate Intel® Neural Compressor into Olive, enabling developers to easily take advantage of model compression techniques in their deployment platform, including Intel processors and accelerators.
ONNX Runtime is a high-performance cross-platform inference and training engine that can run a variety of machine learning models. ORT provides an easy-to-use experience for AI developers to run models on multiple hardware and software platforms.
In this blog post, we’ll share the challenges our team faced and how ONNX Runtime solves them as the backbone of success for high-performance inferencing.
The team at Pieces shares the problems and solutions evaluated for their on-device model serving stack and how ONNX Runtime enables their success.
Make large models smaller and faster with the OpenVINO Execution Provider, NNCF, and ONNX Runtime, leveraging Azure Machine Learning.
We’re excited to share the recent integration of ONNX Runtime in Apache OpenNLP! Apache OpenNLP is a Java machine learning library for natural language processing (NLP) tasks.
Choosing which machine learning model to use, sharing a model with a colleague, and trying out a new model are all reasons why you may find yourself wanting to quickly run inference on a model.