Eric Lin, Author at Microsoft Open Source Blog

AI + Machine Learning, PyTorch

June 30, 2021 • 7 min read

Journey to optimize large scale transformer model inference with ONNX Runtime

Thanks to its resource efficiency and high performance, ONNX Runtime helped us deploy GPT-C, a large-scale multi-layer generative transformer model for code, to power IntelliCode's whole-line code completion suggestions in Visual Studio and Visual Studio Code.