Schedule

  • The time in the program is for your time zone .

  • The program hasn’t been finally approved yet, so there still might be some changes.

Download schedule
  • Infrastructure

    11
    • Talk

      Fast or Slow: The Story of the Struggle to Speed up Image Builds

      What should we do if the images take 30–40 minutes to assemble, the obvious solutions no longer work, and the problem cannot be fixed? There is no perfect solution, but there are techniques and tools that make it possible to speed up the most complex assemblies several times.

    • Talk

      Testing Tools for Configuration Management Systems

      Testing roles in infrastructure administration. Tools of Infrastructure as Code approach and testing practices of configuration management systems such as Ansible, Puppet, Salt, and Chef. Attendees will gain a comprehensive understanding of the available tools, their advantages and disadvantages, as well as Avito's best testing practices.

    • Talk

      Platforms and Other Adult Toys

      How to build an Internal Developer Platform in an enterprise: from the first pain to a successful pilot. I will show you how to turn Kubernetes from a "container twister" into the basis of an automated platform for developers. Honestly about lessons learned and architecture, seasoned with practical advice.

    • Talk

      Fault-Tolerant Infrastructure: From Knee-Jerk Solutions to More Expensive Ones

      How to make the system fault-tolerant without Google's budget? We will analyze solutions for each level of architecture from DNS to database in three variants: minimal, optimal and industrial. I will show you, using examples of on-premise and Russian clouds like Yandex, VK, Selectel, what mistakes to avoid and how to save money without losing reliability.

    • Talk

      Scheduling GPU Workloads in Kubernetes: From Standard Mechanisms to Custom Solutions

      The standard Kubernetes scheduler kube-scheduler was developed with general load balancing principles in mind and is not specialized for the unique characteristics of GPU workloads. I propose examining the full spectrum of possibilities: from built-in K8s scheduling mechanisms to customization of the standard scheduler and specialized schedulers such as Volcano, Apache YuniKorn, and KAI-Scheduler.

    • Talk

      Implementation of Policy as Code in Apache Kafka

      A talk on a declarative approach to managing access to Kafka, implemented on the basis of Open Policy Agent. We will find out the principle of operation of Open Policy Agent, as well as get answers to the most popular questions regarding this approach and learn about the experience of real-life operation.

  • K8s

    8
    • Talk

      Cross-Zone Traffic and Native Ways To Manage It in Kubernetes

      Inter-zone traffic can lead to increased cost of ownership and latency. For a long time it was thought that the solution is only possible through the use of Service Mesh. I will tell you how to solve these problems through native Kubernetes mechanisms.

    • Talk

      L2 Announcements in Cilium: Access to Load Balancer in Bare-Metal Kubernetes

      How Cilium’s built-in L2 announcement feature enables native Kubernetes LoadBalancer services in bare-metal clusters without external components or complex setups, leveraging modern eBPF technology. This approach provides reliable external access to services with minimal operational overhead.

    • Talk

      Platforms and Other Adult Toys

      How to build an Internal Developer Platform in an enterprise: from the first pain to a successful pilot. I will show you how to turn Kubernetes from a "container twister" into the basis of an automated platform for developers. Honestly about lessons learned and architecture, seasoned with practical advice.

    • Talk

      Scheduling GPU Workloads in Kubernetes: From Standard Mechanisms to Custom Solutions

      The standard Kubernetes scheduler kube-scheduler was developed with general load balancing principles in mind and is not specialized for the unique characteristics of GPU workloads. I propose examining the full spectrum of possibilities: from built-in K8s scheduling mechanisms to customization of the standard scheduler and specialized schedulers such as Volcano, Apache YuniKorn, and KAI-Scheduler.

  • ML/AI

    8
    • Talk

      The Perfect 'Sandbox' for ML Models: Setting Up Containerization Without Stress

      Why K8s is the best platform for deploying and testing ML models. We will demonstrate a step-by-step plan for creating high-quality machine learning environments in Kubernetes, which will allow you to automate machine learning for production environment creation and codebase management, and also make efficient use of the GPU.

    • Talk

      How To Get the Most out of GPU and Ray: Our Production ML Infrastructure Pipeline

      A talk on building a scalable ML infrastructure based on Ray and Kubernetes with an emphasis on efficient GPU utilization, distributed task management, and integration with external orchestrators. Using real examples, I'll show you how to build a fault-tolerant production pipeline and avoid typical errors when scaling loads.

    • Talk

      AI in SDLC

      A talk on the practical application of local open-source AI tools at all stages of SDLC. Based on DORA metrics and typical tasks of developers and managers, we show where AI really helps and where it hinders. We share our personal experiences, mistakes and findings.

    • Talk

      n8n + AI for DevOps processes

      How we’re implementing n8n for self-service automation without involving the Data Science team: from first steps to real use cases, pitfalls, and memes. How we ended up with an on-prem solution that fits seamlessly into our infrastructure and is understandable even beyond the developer crowd.

  • Security

    7
    • Talk

      Kubernetes Cluster Security Audit Without a Kubernetes Cluster

      Not many people know how to break Kubernetes, much less how to break Kubernetes when it doesn't even exist yet. I'll share my experience of conducting cluster audits at the design stage, when all you have on hand are the Cluster API manifests of future Kubernetes. I'll tell you what types of flaws can be detected at this stage, and which ones can't. I'll dilute all this with interesting moments and automation of the process.

    • Talk

      Rootless Kubernetes. Pros and Cons

      Methods of cluster deployment that allow to significantly reduce the attack surface of an attacker on a Kubernetes cluster. I will focus on the podsec-k8s package, which allows deploying a native Kubernetes cluster of versions 1.26 and higher in rootless mode. I will touch on the second method of reducing the attack surface — SSHless cluster (fork of the Talos@SideroLabs project)

    • Talk

      Unexploited Vulnerabilities in Kubernetes: How VEX Helps Bring Order to the Noise of CVEs

      Kubernetes is a complex ecosystem with many components and dependencies, and common vulnerabilities (CVEs) that, while not all of them are real threats, are often flagged by scanners as reliable threats, creating an overabundance of false production processes and complicating CI/CD processes. Here's how to make sense of this noise.

    • Talk

      Implementation of Policy as Code in Apache Kafka

      A talk on a declarative approach to managing access to Kafka, implemented on the basis of Open Policy Agent. We will find out the principle of operation of Open Policy Agent, as well as get answers to the most popular questions regarding this approach and learn about the experience of real-life operation.

  • Teams & Culture

    4
    • Talk

      Is There an Economic Effect From the Implementation of Engineering Practices

      Let's figure out whether it is possible to calculate the economic effect of such practices as CI/CD, monitoring, code review, and engineering culture in general. Using Raiffeisen Bank cases as an example, we will try to figure out how much it costs to implement any practice and whether it really saves money for a business.

    • Talk

      How We Created and Developed an Enabling Team

      I'll tell you about our Enabling team, how we created it and are developing it. And also about the prerequisites for the appearance, stages of development, problems and areas of work.

    • Talk

      Platforms and Other Adult Toys

      How to build an Internal Developer Platform in an enterprise: from the first pain to a successful pilot. I will show you how to turn Kubernetes from a "container twister" into the basis of an automated platform for developers. Honestly about lessons learned and architecture, seasoned with practical advice.

    • Talk

      Engineer: Where Do We Go From Here? Is There Life After the Senior Level?

      Often I see engineers running up grades to grow their skills and paychecks: from junior to middle, then to senior and... what's next? To become a team lead? What's a tech lead? Or maybe there's some kind of engineering track? I'll tell you about it! Not everyone needs to be a team leader (or maybe they do).

  • Platform

    4
    • Talk

      n8n + AI for DevOps processes

      How we’re implementing n8n for self-service automation without involving the Data Science team: from first steps to real use cases, pitfalls, and memes. How we ended up with an on-prem solution that fits seamlessly into our infrastructure and is understandable even beyond the developer crowd.

  • SRE

    4
    • Talk

      We Сounted, We Counted, Our Fingers Were Tired

      The graphs are green, the reliability is five nines, and the user is unhappy. Sound familiar? It means that somewhere in your calculations your math failed. I will tell you how we at VK calculated reliability for infrastructure products, highlighting critical user paths.

    • Talk

      Fault-Tolerant Infrastructure: From Knee-Jerk Solutions to More Expensive Ones

      How to make the system fault-tolerant without Google's budget? We will analyze solutions for each level of architecture from DNS to database in three variants: minimal, optimal and industrial. I will show you, using examples of on-premise and Russian clouds like Yandex, VK, Selectel, what mistakes to avoid and how to save money without losing reliability.

  • Community Day

    3
  • Observability

    3
    • Talk

      Observability of Distributed Systems

      A talk on ensuring a high level of observability of distributed systems using existing telemetry tools. Using examples, let's look at how to manage the growing cognitive complexity of supporting large systems in terms of monitoring and finding the root causes of degradation.

  • QA

    2
  • Off Topic

    6

We will add more talks soon.

We are actively adding to the program. Sign up for our newsletter to stay informed.

Subscribe