In this demo, Alex Sin, Senior Solutions Software Engineer at Intel, shows how to use Intel® Xeon® CPUs and Intel® Gaudi® AI accelerators within OpenShift AI to deploy and run models with LiteLLM and vLLM.
Discover how dynamic model routing can improve performance, scalability, and cost-efficiency for your AI workloads.
Date: October 13, 2025