![](https://opensource.microsoft.com/blog/wp-content/uploads/2024/06/STB13_Allen_01-450x246.png)
Journey to optimize large scale transformer model inference with ONNX Runtime
“With its resource-efficient and high-performance nature, ONNX Runtime helped us meet the need of deploying a large-scale multi-layer…