🔥 Run DeepSeek R1 Distilled Models Locally with ONNX Runtime – Faster & Private 🔥
AI inference doesn’t have to rely on the cloud. With ONNX Runtime, you can run DeepSeek R1 distilled models on your CPU, GPU, or NPU, keeping your data private while achieving up to 6.3x faster performance than PyTorch.
Why run on-device?
🔹 Privacy – No data leaves your machine
🔹 Speed – Optimized performance across Intel, AMD, NVIDIA, and Qualcomm hardware
🔹 Flexibility – Works on PCs with easy deployment via Hugging Face + ORT
🔗 Check out the blog here: https://lnkd.in/gQgYwKJ8
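For readers who want to try this, below is a minimal sketch of what the local generation loop can look like with the onnxruntime-genai Python package. The model folder path and the prompt are placeholders (the blog above points to the actual ONNX builds of the R1 distilled models), and the generator calls shown match recent onnxruntime-genai releases.

```python
# Minimal local-inference sketch with the onnxruntime-genai package
# (pip install onnxruntime-genai). The model folder is a placeholder:
# point it at an ONNX export of a DeepSeek R1 distilled model from the
# blog linked above.
import onnxruntime_genai as og

model = og.Model("./deepseek-r1-distill-onnx")  # folder with genai_config.json + weights
tokenizer = og.Tokenizer(model)
stream = tokenizer.create_stream()              # incremental detokenizer for streaming output

params = og.GeneratorParams(model)
params.set_search_options(max_length=512, temperature=0.7)

generator = og.Generator(model, params)
# Plain prompt for brevity; real use should apply the model's chat template.
generator.append_tokens(tokenizer.encode("Why does on-device inference help privacy?"))

# Token-by-token loop -- nothing leaves the machine.
while not generator.is_done():
    generator.generate_next_token()
    print(stream.decode(generator.get_next_tokens()[0]), end="", flush=True)
```

Moving from CPU to GPU or NPU is then mostly a packaging choice (for example, the CUDA or DirectML builds of onnxruntime-genai); the generation loop itself stays the same.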
ONNX Runtime
Software Development
Redmond, Washington · 2,513 followers
Run fast, run anywhere: ONNX Runtime is a machine learning accelerator for cloud, edge, web and mobile
About us
ONNX Runtime is a cross-platform machine-learning model accelerator, with a flexible interface to integrate hardware-specific libraries. ONNX Runtime can be used with models from PyTorch, TensorFlow/Keras, TFLite, scikit-learn, and other frameworks.
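As a concrete illustration of that flexible interface, here is a minimal sketch of loading an exported ONNX model and selecting an execution provider; model.onnx and the input shape are placeholders.

```python
# Minimal ONNX Runtime sketch. The providers list is the "flexible
# interface" where hardware-specific libraries plug in: pass, e.g.,
# ["CUDAExecutionProvider", "CPUExecutionProvider"] to prefer the GPU
# and fall back to CPU. "model.onnx" and the input shape are placeholders.
import numpy as np
import onnxruntime as ort

session = ort.InferenceSession("model.onnx", providers=["CPUExecutionProvider"])
input_name = session.get_inputs()[0].name          # discover the model's input name
x = np.random.rand(1, 3, 224, 224).astype(np.float32)
outputs = session.run(None, {input_name: x})       # None = return all outputs
print(outputs[0].shape)
```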
- Website: https://onnxruntime.ai
- Industry: Software Development
- Company size: 51-200 employees
- Headquarters: Redmond, Washington
- Type: Public Company
Locations
- Primary: Redmond, Washington 98052, US
Updates
ONNX Runtime reposted this
You can now experiment with Phi-4 models using #ONNXRuntime for faster on-device inference. Try it now: https://lnkd.in/guKkAAmA https://lnkd.in/gJVQNUzt https://lnkd.in/g2fpic-H
Introducing Phi-4-multimodal and Phi-4-mini! Phi-4-multimodal integrates speech, vision, and text processing, while Phi-4-mini excels in text-based tasks. Discover these models on Azure AI Foundry: https://msft.it/6048U7H48
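For those who want to try the Phi-4 models on-device, here is a hedged sketch combining huggingface_hub (to fetch an ONNX build) with the same onnxruntime-genai generation loop. The repo id, folder layout, and chat template below are assumptions, so check the links in the post for the actual published model repositories.

```python
# Sketch: fetch an ONNX build of a Phi-4 model and run it locally.
# The repo id below is an assumption -- check the post's links for the
# actual published ONNX repositories on Hugging Face.
from huggingface_hub import snapshot_download
import onnxruntime_genai as og

model_dir = snapshot_download("microsoft/Phi-4-mini-instruct-onnx")

# og.Model expects the folder that contains genai_config.json; in
# Microsoft's ONNX repos that is usually a per-target subfolder, so
# adjust the path to the variant you downloaded.
model = og.Model(model_dir)
tokenizer = og.Tokenizer(model)

params = og.GeneratorParams(model)
params.set_search_options(max_length=256)

generator = og.Generator(model, params)
# Phi-style chat template; verify against the model card before relying on it.
generator.append_tokens(tokenizer.encode("<|user|>What is ONNX Runtime?<|end|><|assistant|>"))
while not generator.is_done():
    generator.generate_next_token()
print(tokenizer.decode(generator.get_sequence(0)))
```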