Training a model is cooking the meal. Serving it is running the restaurant. A great recipe (model) means nothing if orders take 30 minutes (high latency) or the kitchen burns down under rush hour (no scaling).