Generative AI with Kubernetes

Jonathan Baier

Rs. 899

ISBN: 9789365898323
eISBN: 9789365892826
Authors: Jonathan Baier
Rights: Worldwide
Edition: 2025
Pages: 284
Dimensions: 7.5 x 9.25 inches
Book Type: Paperback

Over the past few years, we have seen great strides in ML and, most recently, generative AI. Companies and software teams are rushing to enhance, rebuild, and create new software offerings with this new intelligence. As they innovate and create delightful new experiences for their customers, new challenges arise. Understanding how these applications work and how to use state-of-the-art infrastructure tools like Kubernetes will help organizations and professionals succeed with this new technology.

The book covers essential technical implementations from ML fundamentals through advanced deployment strategies, focusing on practical patterns. Core topics include Kubernetes-native GPU scheduling and resource management, MLOps pipeline architectures using Kubeflow/MLflow, and advanced model serving patterns. It details data management architectures, vector databases, and RAG systems, alongside monitoring solutions with Prometheus/Grafana. Finally, it looks at advanced production concerns in security and data reliability.

After reading this book, you will be equipped with a broad knowledge of the end-to-end generative AI pipeline and how Kubernetes can be leveraged to run your generative AI workloads at scale in the real world.

KEY FEATURES  
● Learn how Kubernetes can help you run your generative AI workloads.
● Using hands-on examples, you will work with real-world foundational models and a variety of tools and capabilities in the K8s ecosystem.
● A broad survey of both generative AI and Kubernetes in one book.

WHAT YOU WILL LEARN
● How to evaluate and compare models for new applications and use cases.
● How Kubernetes can add reliability and scale to your AI applications.
● What an AI delivery pipeline contains and how to start one.
● How AI models encode words and work with natural language.
● How prompting and refinement techniques can improve results.
● How to use your own data to augment AI responses.

WHO THIS BOOK IS FOR
This book is for teams that are building new applications or new functionality with generative AI and want to better understand the infrastructure needed to bring their AI applications to production. It is also for shared services, infrastructure, or cybersecurity teams that provide platforms and infrastructure for application or product development.

TABLE OF CONTENTS
1. Introduction to Generative Artificial Intelligence
2. Kubernetes for Generative AI
3. Introduction to Foundational Models on Kubernetes
4. Working with Foundational Models
5. Process and Pipelines
6. Process and Pipelines on Kubernetes
7. Managing Data for Generative AI
8. Refining and Improving Results
9. Observability and Monitoring
10. Securing ML/GenAI Pipelines on K8s

ABOUT THE AUTHOR
Jonathan Baier is an emerging technology leader living in Northern Virginia and Yokohama, Japan. He has worked in technology for more than 20 years, always using his curiosity to dive into new trends and technologies as they emerge. He has helped countless businesses of all sizes, from startups to the Fortune 100, adopt new technology, including cloud computing, containers, platform engineering, and now artificial intelligence.

In 2024, he made the decision to enter the world of entrepreneurship. He is the owner of NextNext LLC as well as co-founder of Cyberify Services. At Cyberify, Jonathan works with several of his brightest colleagues from past adventures as they help companies tackle Artificial Intelligence, Cryptographic Agility, and Cyber Resilience.

Curiosity is his hobby, and he spends his free time reading and absorbing information on new technologies, global cultures, philosophy, and education.
