ONNX and ONNX Runtime

October 30, 2019
Pranav Sharma | Microsoft

What is the universal inference engine for neural networks?

Tensorflow? PyTorch? Keras? There are many popular frameworks out there for working with Deep Learning and ML models, each with their pros and cons for practical usability for product development and/or research. Once you decide what to use and train a model, now you need to figure out how to deploy it onto your platform and architecture of choice. Cloud? Windows? Linux? IOT? Performance sensitive? How about GPU acceleration? With a landscape of 1,000,001 different combinations for deploying a trained model from some chosen framework into a performant production environment for prediction, we can benefit from some standardization.

- John Langford
  
  Partner Researcher Manager
Research Area
- Artificial intelligence

Watch Next

Expanding Flows for Fast and Flexible Generation Beyond the Fixed Canvas
July 21, 2026
Sophia Tang
Microsoft AI for Good Lab - Introduction to HASTE
July 17, 2026
Juan M. Lavista Ferres,

Caleb Robinson,

Cameron Birge

, et. al.
Learning Genetic Perturbation Effects at Single-Cell Resolution for Virtual Cells
July 14, 2026
Jiaqi Zhang
Convergence Analysis for Fast High-Order ODE Solvers in Diffusion Probabilistic Models
July 7, 2026
Zhengjiang Lin
Reinforce Adjoint Matching: Scaling Diffusion RL
June 30, 2026
Andreas Bergmeister
Plenary Talk 2: Reimagining Education and Skilling for the Age of AI: Challenges & Opportunities
June 9, 2026
Manohar Swaminathan
Session on Retrieval
June 9, 2026
Lokesh Nagalapatti,

Soumen Chakrabarti
Session on Inclusive AI: Data, Models, Evaluation
June 9, 2026
Niloy Ganguly,

Danish Pruthi,

Sunayana Sitaram

, et. al.
Plenary Talk 1: Navigating the AI Horizon: Promises, Perils, and the Power of Collaboration
June 9, 2026
Ece Kamar,

Srinivasan Iyengar
Welcome Session - Microsoft Research India Academic Summit 2026
June 9, 2026
Venkat Padmanabhan,

Srinivasan Iyengar

Your Privacy Choices