Doctolib

Senior Site Reliability Engineer - Observability (m/f/x)

Berlin
Full-time, With Home Office, Senior

Develop the platform reliability strategy for a healthcare platform and build scalable logging and tracing capabilities. Expertise in observability tooling and architecture is essential. Benefits include free family health insurance, up to 14 RTT days, and 1 month of additional parental leave.

Requirements

  • Experience on large-scale production platforms
  • Experience with AWS, Azure or Google Cloud
  • Understanding of Docker and Kubernetes
  • Understanding of Helm and ArgoCD
  • Expertise in observability tooling and architecture
  • Knowledge of logging tools
  • Knowledge of OpenTelemetry or proprietary APMs
  • Knowledge of Prometheus, Thanos, or Datadog
  • Proficiency in Ruby, Python, Go, or Java
  • Experience with monitoring and observability tools
  • Enjoyment of troubleshooting performance issues
  • English language skills

Tasks

  • Lead the platform-wide observability strategy
  • Build scalable logging and tracing capabilities
  • Develop developer-friendly observability tools
  • Identify large-scale cross-cutting reliability initiatives
  • Improve incident detection and response capabilities
  • Enhance postmortem analysis processes
  • Participate in the on-call rotation
  • Refine alerting to reduce noise
  • Ensure actionable telemetry across services

Work Experience

  • 3 years

Education

  • Bachelor's degree, OR
  • Master's degree

Languages

  • English: Business Fluent

Tools & Technologies

  • AWS
  • Azure
  • Google Cloud
  • Docker
  • Kubernetes
  • Helm
  • ArgoCD
  • Fluent Bit
  • OpenTelemetry
  • Loki
  • Elasticsearch
  • Logstash
  • Vector
  • Prometheus
  • Thanos
  • Datadog
  • Ruby
  • Python
  • Go
  • Java

Benefits

Healthcare & Fitness

  • Free family health insurance

More Vacation Days

  • Up to 14 RTT days

Family Support

  • Parental care program
  • School start leave

Generous Parental Leave

  • 1-month additional parental leave

Mental Health Support

  • Wellbeing program
  • Free mental health coaching

Free or Subsidized Food

  • Swile lunch vouchers

Additional Allowances

  • Work Council sport subsidy
  • Work Council creative subsidy
  • Bicycle subsidy

Nejo automatically captured this job from the Doctolib website and processed the information with the help of AI. Despite careful analysis, some information may be incomplete or inaccurate, so always verify all details in the original posting. Content and copyrights of the original posting belong to the advertising company.