UWC26: Optimizing AI Inference Performance

#UWC26: Optimizing AI Inference Performance: Testing Networks at Scale

Check out videos from Upperside Conference's recent World Congress (formerly known as MPLS World Congress): ...

AI Inference: The Secret to AI's Superpowers

Extreme Performance Series 2026: AI Inference Performance on VCF 9

Todd Muirhead talks with Uday Kurkure and Lan Vu about recent tests of ...

LLM-D: Optimizing Distributed AI Inference with Intelligent Routing

The provided text introduces LLM-D, an open-source project designed to ...

Deploying scalable and reliable AI inference on Google Cloud

Why Your AI is Slow: Master LLM Inference Optimization

Master LLM core concepts! Explore MoE, RLHF, DPO alignment, FlashAttention, and LoRA fine-tuning. Learn about KV caching, ...

Faster LLMs: Accelerate Inference with Speculative Decoding

The Golden Triangle of Inference Optimization: Balancing Latency, Throughput, and Quality

Philip Kiely, Head of Developer Relations at Baseten, presents the “Golden Triangle” of ...

Mastering LLM Inference Optimization From Theory to Cost Effective Deployment: Mark Moyou

Databricks & Together AI on Inference, Optimization, & Hardware

Scaling AI Inference Performance in the Cloud with Nebius

Learn how NVIDIA Dynamo and Kubernetes help scale high-

AWS re:Invent 2025 - Why Your Processor Matters for AI Inference and General Compute (MAM210)

Discover how AMD powered Amazon EC2 instances are transforming cloud economics for ...

Boosting AI Performance: Networking for AI Inference

Summary: Victor Moreno, Product Manager for Cloud Networking at Google, discusses the critical role of networking in ...