Ultimate ONNX for Deep Learning Optimization: Design, Optimize, and Deploy Deep Learning Models Using ONNX for Scalable Production and Edge AI Systems (English Edition)

★★★★★ 4.1 37 reviews

$16.37
Price when purchased online
Free shipping Free 30-day returns

Sold and shipped by travelasia.app
We aim to show you accurate product information. Manufacturers, suppliers and others provide what you see here.
$16.37
Price when purchased online
Free shipping Free 30-day returns

How do you want your item?
You get 30 days free! Choose a plan at checkout.
Shipping
Arrives Jun 28
Free
Pickup
Check nearby
Delivery
Not available

Sold and shipped by travelasia.app
Free 30-day returns Details

Product details

Management number 231975263 Release Date 2026/06/18 List Price $6.55 Model Number 231975263
Category

Bringing Deep Learning Models to the Edge Efficiently Using ONNX. Key Features ● Master end-to-end ONNX workflows from framework export models to edge deployment. ● Hands-on optimization techniques like quantization, pruning and knowledge distillation for real-world edge AI performance. ● Production-grade case studies across vision, speech, and language models on edge devices. Book Description ONNX has emerged as the de facto standard for deploying portable, framework-agnostic machine learning models across diverse hardware platforms. Ultimate ONNX for Deep Learning Optimization provides a structured, end-to-end guide to the ONNX ecosystem, starting with ONNX fundamentals, model representation, and framework integration. You will learn how to export models from PyTorch, TensorFlow, and Scikit-Learn, inspect and modify ONNX graphs, and leverage ONNX Runtime and ONNX Simplifier for inference optimization. Each chapter builds technical depth, equipping you with the tools required to move models beyond experimentation. The book focuses on performance-critical optimization techniques, including quantization, pruning, and knowledge distillation, followed by practical deployment on edge devices such as Raspberry Pi. Through complete, real-world case studies covering object detection, speech recognition, and compact language models, you can implement custom operators, follow deployment best practices, and understand production constraints. Thus, by the end of this book, you will be capable of designing, optimizing, and deploying efficient ONNX-based AI systems for edge environments. What you will learn ● Design and understand ONNX models, graphs, operators, and runtimes. ● Convert and integrate models from PyTorch, TensorFlow, and Scikit-Learn. ● Optimize inference using graph simplification, quantization, and pruning. ● Apply knowledge distillation to retain accuracy on constrained devices. ● Deploy and benchmark ONNX models on Raspberry Pi and edge hardware. ● Build custom ONNX operators, and extend models beyond standard layers. Who is this book for? This book is tailored for Machine Learning Engineers, AI Engineers, Data Scientists, Embedded AI Developers, and Software Engineers transitioning ONNX models from research to production. Readers should have a working knowledge of machine learning fundamentals and basic Python experience to apply the optimization and edge deployment workflows effectively. Table of Contents 1. Introduction to ONNX and Edge Computing 2. Getting Started with ONNX 3. ONNX Integration with Deep Learning Frameworks 4. Model Optimization Using ONNX Simplifier and ONNX Runtime 5. Model Quantization Using ONNX Runtime 6. Model Pruning in Pytorch and Exporting to ONNX 7. Knowledge Distillation for Edge AI 8. Deploying ONNX Models on Edge Devices 9. End to End Execution of YOLOv12 10. End to End Execution of Whisper Speech Recognition Model 11. End to End Execution of SmolLM Model 12. ONNX Model from Scratch and Custom Operators 13. Real-World Applications, Best Practices, Security, and Future Trends in ONNX for Edge AI        Index About the Author Meet Patel is a machine learning engineer with over seven years of expertise dedicated to a singular challenge, that is, making Artificial Intelligence (AI) faster, smaller, and more efficient. His passion lies in unlocking the potential of AI on resource-constrained devices, pushing models from the lab into the real world. Read more

ASIN B0GD2SJ3B3
XRay Not Enabled
Language English
File size 49.3 MB
Page Flip Enabled
Publisher Orange Education Pvt Ltd
Word Wise Not Enabled
Print length 345 pages
Accessibility Learn more
Screen Reader Supported
Publication date December 30, 2025
Enhanced typesetting Enabled

Correction of product information

If you notice any omissions or errors in the product information on this page, please use the correction request form below.

Correction Request Form

Customer ratings & reviews

4.1 out of 5
★★★★★
37 ratings | 15 reviews
How item rating is calculated
View all reviews
5 stars
77% (28)
4 stars
7% (3)
3 stars
4% (1)
2 stars
2% (1)
1 star
10% (4)
Sort by

There are currently no written reviews for this product.