Generative AI on Kubernetes

Regular price €59.99

Product details

  • ISBN 9781098171926
  • Dimensions: 178 x 232mm
  • Publication Date: 13 Mar 2026
  • Publisher: O'Reilly Media
  • Publication City/Country: US
  • Product Form: Paperback
Generative AI is revolutionizing industries, and Kubernetes has fast become the backbone for deploying and managing these resource-intensive workloads. This book serves as a practical, hands-on guide for MLOps engineers, software developers, Kubernetes administrators, and AI professionals ready to unlock AI innovation with the power of cloud native infrastructure. Authors Roland Huss and Daniele Zonca provide a clear road map for training, fine-tuning, deploying, and scaling GenAI models on Kubernetes, addressing challenges like resource optimization, automation, and security along the way.

With actionable insights and real-world examples, readers will learn to tackle the opportunities and complexities of managing GenAI applications in production environments. Whether you're experimenting with large-scale language models or facing the nuances of AI deployment at scale, you'll uncover the expertise you need to operationalize this exciting technology effectively.

  • Learn to run GenAI models on Kubernetes for efficient scalability
  • Get techniques to train and fine-tune LLMs within Kubernetes environments
  • See how to deploy production-ready AI systems with automation and resource optimization
  • Discover how to monitor and scale GenAI applications to handle real-world demand
  • Uncover the best tools to operationalize your GenAI workloads
  • Learn how to run agent-based and AI-driven applications
Dr. Roland Huss is a seasoned software engineer with over 25 years of experience in the field. Currently working at Red Hat, he is the architect of OpenShift Serverless and a former member of the Knative TOC. Roland is a passionate Java and Golang coder and a sought-after speaker at tech conferences. An advocate of open source, he is an active contributor and enjoys growing chili peppers in his free time.

Daniele Zonca is a Senior Principal Software Engineer and Architect for model serving of Red Hat OpenShift AI, Red Hat's flagship AI product combining multiple stacks.
