Talk

How To Get the Most out of GPU and Ray: Our Production ML Infrastructure Pipeline

In Russian

A talk on building a scalable ML infrastructure based on Ray and Kubernetes with an emphasis on efficient GPU utilization, distributed task management, and integration with external orchestrators. Using real examples, I'll show you how to build a fault-tolerant production pipeline and avoid typical errors when scaling loads.

Speakers

Talks