How To Get the Most out of GPU and Ray: Our Production ML Infrastructure Pipeline

ML/AI

In Russian

Complexity -

A talk on building scalable ML infrastructure on Ray and Kubernetes, with an emphasis on efficient GPU utilization, distributed task management, and integration with external orchestrators. Using real examples, I'll show how to build a fault-tolerant production pipeline and avoid typical mistakes when scaling workloads.

Speakers

Mikhail Untura
Orion soft

Invited experts

Oleg Blokhin
VK
