playbooks/gpu_inference_05_vllm.yml

---
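- name: Deploy vLLM Inference Service
  # Target only the first host in the "masters" inventory group.
  hosts: masters[0]
  become: true
  # Roles are applied in the order listed.
  roles:
    - roles/charts/vllm_runtime
    - roles/charts/vllm_service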