Aiman Ismail
Latest Articles
2025-04-29
Scale down your instances, cut down your AWS bills
2024-12-14
Eliminating cross-AZ traffic cost on AWS
2024-12-12
Exploring AWS ALB for EKS
Latest Notes
2026-05-20
Supercomputing Skills to Learn
2026-05-19
KV Cache in LLM Inference
2026-04-30
Verifying a Database Implementation with TLA+
Latest Experiments
2026-05-16
Kubernetes Controller Anti-patterns: What Actually Costs You Performance
2026-04-11
Gemma 4 Quant Showdown: All Sizes, Every Format
2026-04-06
MLX Inference Throughput Gap: Where Do the Missing Tokens/sec Go?