GPU as a Service for Generative AI: Scaling LLM Workloads Efficiently | Nasscom