
Anton Alekseev
Avito
GPU Inference in K8s: Acceleration, Sharing and Scaling Without Pain
How do you speed up GPU inference in Kubernetes without going crazy? This talk covers scaling, GPU sharing, faster startup, and choosing shaders, with examples, hacks, and conclusions from real production.
