vLLM on EKS: Cut LLM Storage Costs by 95% with S3 Mountpoint

Intro When scaling AI models like DeepSeek or Qwen on Amazon EKS, engineering teams obsess over GPU utilization while quietly bleeding money on storage bloat. Because standard EBS volumes force a 1:1 replica-to-disk ratio, scaling a single 70GB model to 20 pods doesn’t cost 70GB; it forces you to provision 1.4 terabytes of redundant EBS …

vLLM Production Stack on Amazon EKS with Terraform 🧑🏼‍🚀

Intro Deploying vLLM manually is fine for a lab, but running it in production means dealing with Kubernetes, autoscaling, GPU orchestration, and observability. That’s where the vLLM Production Stack comes in – a Terraform-based blueprint that delivers production-ready LLM serving with enterprise-grade foundations. In this post, we’ll deploy it on Amazon EKS, covering everything from …

Terraform Pipelines for Dummies Part2: GitHub Actions AWS Deploy (OIDC)

Intro Did you know that 12 million secrets were publicly exposed on GitHub in 2023 alone? This is further evidence that leaked secrets go hand in hand with financial and reputational loss for users, organizations and even states. The worst thing to do is make it easy for hackers to infiltrate your company’s systems. This is where workload …

How to Survive Pearson Vue exam glitch nightmare

Intro Recently, I passed the AWS SAA certification. While sharing your achievement on socials after long months of preparation and grind can be gratifying, some Pearson Vue test takers didn’t have the same smooth experience and faced unexpected challenges due to OnVue software glitches. In this short post, I’ll take you through the surreal blunders of …

Terraform for dummies part 2: Launch an instance with a static website on AWS

Intro It has become a habit of mine to explore different ways of automating provisioning for each cloud provider. This time, I will try Terraform on AWS and reproduce the same deployment I completed on Oracle Cloud. As usual, we won’t just deploy an instance but also configure a website linked to its …