Accelerating Deep Neural Networks

All titles

Author:

Ryoma Sato, National Institute of Informatics, Chiyoda, Japan

Published:

June 2026

Availability:

Available

Format:

Hardback

ISBN:

9781009687089

Looking for an examination copy?

If you are interested in the title for your course we can consider offering an examination copy. To register your interest please contact collegesales@cambridge.org providing details of the course you are teaching.

$50.00 (P) USD

Hardback

$50.00 (Z) USD

eBook

Description

Deep learning models are powerful, but are often large, slow, and expensive to run. This book is a practical guide to accelerating and compressing neural networks using proven techniques such as quantization, pruning, distillation, and fast architectures. It explains how and why these methods work, fostering a comprehensive understanding. Written for engineers, researchers, and advanced students, the book combines clear theoretical insights with hands-on PyTorch implementations and numerical results. Readers will learn how to reduce inference time and memory usage, lower deployment costs, and select the right acceleration strategy for their task. Whether you're working with large language models, vision systems, or edge devices, this book gives you the tools and intuition needed to build faster, leaner AI systems, without sacrificing performance. It is perfect for anyone who wants to go beyond intuition and take a principled approach to optimizing AI systems

Bridges the gap between research and practice by synthesizing information on acceleration techniques into a systematic and practical resource
Allows readers to go beyond theory and immediately apply the techniques to their own models with ready-to-use implementation code
Shows the trade-offs between different methods through numerical comparisons of speed, accuracy, and memory usage, helping readers more easily choose the best approach for their specific task

Reviews & endorsements

‘This book is a practical guide to DNN and LLM acceleration, bridging the gap between theory and practice. Moving beyond ‘black-box’ tricks, it pairs the latest techniques-like FlashAttention-with runnable code and empirical data. Readers will gain both the technical tools and the fundamental understanding to optimize models effectively.’ Masashi Sugiyama, RIKEN and University of Tokyo

‘This book effectively bridges theory and practice in accelerating deep learning. It offers clear insights into modern architectures such as Mamba, while also elucidating fundamental concepts and practical techniques for efficient deep learning. It will be a valuable resource for researchers and graduate students seeking a deep understanding of modern deep learning.’ Makoto Yamada, Okinawa Institute of Science and Technology

See more reviews

Product details

Published: June 2026
Format: Hardback
ISBN: 9781009687089
Length: 310 pages
Dimensions: 229 × 152 × 19 mm
Weight: 0.614kg
Availability: Available

Often bought together

This title is available for institutional purchase via Cambridge Core

Learn more

Related Journals

Also by this Author

Contents

1. Introduction
2. Overview of acceleration methods
3. Quantization and low precision
4. Pruning
5. Distillation
6. Low-rank approximation
7. Fast architectures
8. Tools for tuning
9. Efficient training
Conclusion
References
Index.
Show more

Courses

Resources

Displaying 1 - 1 of 1

Support site

Follow link

Additional Information

About the authors

Author

Ryoma Sato , National Institute of Informatics, Chiyoda, Japan

Ryoma Sato is Assistant Professor at the National Institute of Informatics, Japan, specializing in graph neural networks, optimal transport, and efficient deep learning. He is the author of 'Theory and Algorithms of Optimal Transport' (2023) and 'Graph Neural Networks' (2024). He is a former IOI Japan representative and ACM-ICPC World Finalist, as well as lead developer of Readable, an AI-powered PDF translation service.

Accessibility

Products and services

About us

Careers

Accelerating Deep Neural Networks

Reviews & endorsements