AI Everyday

AI Everyday #23 - Hands on & discussion on vLLM - high speed inference engine

January 30, 2024 Matthew Wallace Season 1 Episode 23
AI Everyday #23 - Hands on & discussion on vLLM - high speed inference engine
AI Everyday
More Info
AI Everyday
AI Everyday #23 - Hands on & discussion on vLLM - high speed inference engine
Jan 30, 2024 Season 1 Episode 23
Matthew Wallace

Hands on and discussion around vLLM, high performance inference engine supporting continuous batching and paged attention.

Show Notes

Hands on and discussion around vLLM, high performance inference engine supporting continuous batching and paged attention.