
Optimizing memory usage in large language models fine-tuning with KAITO: Best practices from Phi-3
The Cloud Native team at Azure is working to make AI on Kubernetes more cost-effective and approachable for…
The Cloud Native team at Azure is working to make AI on Kubernetes more cost-effective and approachable for…
We're announcing the release of Hyperlight Wasm: a Hyperlight virtual machine (VM) “micro-guest.”
In this post, we’ll take the demo application and show how it demonstrates one way you can use…
The Microsoft Azure Core Upstream team is excited to announce the Hyperlight project.
As the cloud-native space keeps evolving at a rapid pace, WebAssembly is emerging as a promising alternative to…
Project Copacetic simplifies container image patching with a CLI tool and Docker Desktop extension.
On Azure, more than 50 percent of virtual machine (VM) cores run on Linux. There is no better…
The team at Pieces shares the problems and solutions evaluated for their on-device model serving stack and how…
Make large models smaller and faster with OpenVino Execution Provider, NNCF and ONNX Runtime leveraging Azure Machine Learning.
eBPF for Windows native code generation is a new mode of execution that maintains the integrity of the…
We’re excited to share the recent integration of ONNX Runtime in Apache OpenNLP! Apache OpenNLP is a Java…
This post was co-authored by Alejandro Saucedo, Director of Machine Learning Engineering at Seldon Technologies. About the co-author:…