
Anton Alekseev
Avito
If you have a ticket, log in to watch the video
LoginHow can I speed up GPU inference in Kubernetes and not go crazy? It's all about scaling, sharing, speeding up the start and choosing shaders. With examples, hacks, and conclusions from real production.
Avito
Automator, independent expert