From Prefill to Decode — LLM Inference Explained