HomeOpenShiftSupercharge LiteLLMs and vLLMs with Dynamic Routing Models

Supercharge LiteLLMs and vLLMs with Dynamic Routing Models

0 views

0%

0 0

Supercharge LiteLLMs and vLLMs with Dynamic Routing Models

In this demo, Alex Sin, Senior Solutions Software Engineer at Intel, shows how to use Intel® Xeon® CPUs and Intel® Gaudi® AI accelerators within OpenShift AI to deploy and run models with LiteLLM and vLLM.

Discover how dynamic model routing can improve performance, scalability, and cost-efficiency for your AI workloads.

Date: October 13, 2025

OpenShift Alex demo senior Solutions this

OpenShift Commons Denver: Accelerating AI: Visions, Trends, and OpenShift

OpenShift Commons Denver: Accelerating AI: Visions, Trends, and OpenShift

OpenShift Commons: Red Hat Device Edge and Microshift Jumpstarter

OpenShift Commons: Red Hat Device Edge and Microshift Jumpstarter

Ask an OpenShift Expert | Ep 166 | Red Hat Trusted Artifact Signer (RHTAS)

Ask an OpenShift Expert | Ep 166 | Red Hat Trusted Artifact Signer (RHTAS)

OpenShift Commons: SaskTel’s Telco Cloud Native Journey with Jerrad DeBolt and Aaron Chartier

OpenShift Commons: SaskTel’s Telco Cloud Native Journey with Jerrad DeBolt and Aaron Chartier

Overheard at KubeCon: Myths and Truths about Podman

Overheard at KubeCon: Myths and Truths about Podman

Shifts Happen: Installing OpenShift on Bare Metal: IPI vs UPI Overview

Shifts Happen: Installing OpenShift on Bare Metal: IPI vs UPI Overview

Keycloak: Open Source Identity and Access Management

Keycloak: Open Source Identity and Access Management

OpenShift Commons Gathering Buenos Aires: Case Study 9 – Sancor Salud (Español)

OpenShift Commons Gathering Buenos Aires: Case Study 9 – Sancor Salud (Español)