kv_cache Explained: How It Enhances vLLM Inference

Too often, machine learning concepts are explained like a mathematician talking to other mathematicians, leaving the rest of us scratching our heads. One of those concepts is kv_cache, a key technique that makes large language models run faster and more efficiently. This blog is my attempt to break it down simply, without drowning in dark math :). …

HashiCorp Vault for Dummies: Setup your 1st Vault with TLS (WSL)

Vault by HashiCorp is a powerful tool for managing secrets, credentials, and encrypted data. In this guide, you’ll learn how to set up a local Vault server using Raft storage and TLS in a WSL (Windows Subsystem for Linux) environment. Whether you’re just starting with secrets management, prepping for the Vault Associate exam, or …

CloudThrill Joins NVIDIA Inception

CloudThrill has joined NVIDIA Inception, a program that nurtures startups revolutionizing industries with technological advancements. What we do: We are focused on helping organizations deploy privacy-first, cost-efficient AI infrastructure with open-source LLMs and container-native technologies. Our services blend deep expertise in cloud-native architecture, MLOps, and scalable inference to empower businesses to innovate securely and …

How to Quantize AI Models with Ollama CLI

You’ve probably fired up ollama run some-cool-model tons of times, effortlessly pulling models from Ollama’s repo or even directly from Hugging Face. But have you ever wondered how those CPU-friendly GGUF quantized models actually land on places like Hugging Face in the first place? What if I told you that you could contribute back with tools you might already be …

How to pass the CKA certification

The Certified Kubernetes Administrator (CKA) exam is one of the most recognized Kubernetes certifications, proving your ability to manage, troubleshoot, and configure K8s clusters. But unlike traditional exams, CKA is 100% hands-on: no multiple-choice questions, just real-world challenges. In this post, I’ll break down: ✅ Exam structure & key domains ✅ How I prepared (resources, labs, and time …