vLLM on EKS: Cut LLM Storage Costs by 95% with S3 Mountpoint

Intro When scaling AI models like DeepSeek or Qwen on Amazon EKS, engineering teams obsess over GPU utilization while quietly bleeding money on storage bloat. Because standard EBS volumes force a 1:1 replica-to-disk ratio, scaling a single 70GB model to 20 pods doesn’t cost 70GB, it forces you to provision 1.4 Terabytes of redundant EBS …

Inside CoreWeave Cloud: CLI & Platform Primer

Intro No invite? No quota? No problem. If youโ€™ve tried to create an account on CoreWeave, you already know the drill: thereโ€™s No open self-registration, No free tier, and No โ€œSign up with GitHubโ€โ€”without an invite. That’s why I decided to write my first CoreWeave blog post. This post shows how to get started with …

vLLM Production Stack on Azure AKS with Terraform๐Ÿง‘๐Ÿผโ€๐Ÿš€

Intro The vLLM Production Stack is designed to work across any cloud provider with Kubernetes. After covering AWS EKS, today we’re deploying vLLM production-stack on Azure AKS with the same Terraform approach. This guide shows you how to deploy the same production-ready LLM serving environment on Azure, with azure-specific optimizations. We’ll cover network architecture, certificate …

Meet Nebius: The Cloud Built for the AI Era

Intro Every once in a while, a new cloud platform shows up that doesnโ€™t just offer โ€œmore computeโ€ . It rethinks what the cloud should look like in an AI-first world. Thatโ€™s what caught my attention with Nebius, a European-born cloud designed from the ground up for high-performance, AI-centric workloads. One that just closed a …

vLLM Production Stack on Amazon EKS with Terraform๐Ÿง‘๐Ÿผโ€๐Ÿš€

Intro Deploying vLLM manually is fine for a lab, but running it in production means dealing with Kubernetes, autoscaling, GPU orchestration, and observability. Thatโ€™s where the vLLM Production Stack comes in – a Terraform-based blueprint that delivers production-ready LLM serving with enterprise-grade foundations. In this post, we’ll deploy it on Amazon EKS, covering everything from …