The AI Job Search Engine
Senior Site Reliability Engineer - Observability(m/w/x)
Developing platform reliability strategy for healthcare, building scalable logging and tracing capabilities. Expertise in observability tooling and architecture essential. Free family health insurance, up to 14 RTT days, 1-month additional parental leave.
Requirements
- Experience on large-scale production platforms
- Experience with AWS, Azure or Google Cloud
- Understanding of Docker and Kubernetes
- Understanding of Helm and ArgoCD
- Expertise in observability tooling and architecture
- Knowledge of logging tools
- Knowledge of OpenTelemetry or proprietary APMs
- Knowledge of Prometheus, Thanos, or Datadog
- Proficiency in Ruby, Python, Go, or Java
- Experience with monitoring and observability tools
- Enjoyment of troubleshooting performance issues
- English language skills
Tasks
- Lead the platform-wide observability strategy
- Build scalable logging and tracing capabilities
- Develop developer-friendly observability tools
- Identify large-scale cross-cutting reliability initiatives
- Improve incident detection and response capabilities
- Enhance postmortem analysis processes
- Participate in the on-call rotation
- Refine alerting to reduce noise
- Ensure actionable telemetry across services
Work Experience
- 3 years
Education
- Bachelor's degreeOR
- Master's degree
Languages
- English – Business Fluent
Tools & Technologies
- AWS
- Azure
- Google Cloud
- Docker
- Kubernetes
- Helm
- ArgoCD
- Fluent Bit
- OpenTelemetry
- Loki
- Elasticsearch
- Logstash
- Vector
- Prometheus
- Thanos
- Datadog
- Ruby
- Python
- Go
- Java
Benefits
Healthcare & Fitness
- Free family health insurance
More Vacation Days
- Up to 14 RTT days
Family Support
- Parental care program
- School start leave
Generous Parental Leave
- 1-month additional parental leave
Mental Health Support
- Wellbeing program
- Free mental health coaching
Free or Subsidized Food
- Swile lunch vouchers
Additional Allowances
- Work Council sport subsidy
- Work Council creative subsidy
- Bicycle subsidy
Not a perfect match?
- DoctolibFull-timeWith HomeofficeSeniorBerlin
- Talon.One
Senior Site Reliability Engineer(m/w/x)
Full-timeWith HomeofficeSeniorBerlin - Scout24
Senior Platform Engineer - Site Reliability(m/w/x)
Full-timeWith HomeofficeManagementBerlin - fiskaly
Site Reliability Engineer(m/w/x)
Full-timeWith HomeofficeNot specifiedBerlin, Wienfrom 80,000 / year - GetYourGuide
Senior Site Reliability Engineer(m/w/x)
Full-timeWith HomeofficeSeniorBerlin
Senior Site Reliability Engineer - Observability(m/w/x)
Developing platform reliability strategy for healthcare, building scalable logging and tracing capabilities. Expertise in observability tooling and architecture essential. Free family health insurance, up to 14 RTT days, 1-month additional parental leave.
Requirements
- Experience on large-scale production platforms
- Experience with AWS, Azure or Google Cloud
- Understanding of Docker and Kubernetes
- Understanding of Helm and ArgoCD
- Expertise in observability tooling and architecture
- Knowledge of logging tools
- Knowledge of OpenTelemetry or proprietary APMs
- Knowledge of Prometheus, Thanos, or Datadog
- Proficiency in Ruby, Python, Go, or Java
- Experience with monitoring and observability tools
- Enjoyment of troubleshooting performance issues
- English language skills
Tasks
- Lead the platform-wide observability strategy
- Build scalable logging and tracing capabilities
- Develop developer-friendly observability tools
- Identify large-scale cross-cutting reliability initiatives
- Improve incident detection and response capabilities
- Enhance postmortem analysis processes
- Participate in the on-call rotation
- Refine alerting to reduce noise
- Ensure actionable telemetry across services
Work Experience
- 3 years
Education
- Bachelor's degreeOR
- Master's degree
Languages
- English – Business Fluent
Tools & Technologies
- AWS
- Azure
- Google Cloud
- Docker
- Kubernetes
- Helm
- ArgoCD
- Fluent Bit
- OpenTelemetry
- Loki
- Elasticsearch
- Logstash
- Vector
- Prometheus
- Thanos
- Datadog
- Ruby
- Python
- Go
- Java
Benefits
Healthcare & Fitness
- Free family health insurance
More Vacation Days
- Up to 14 RTT days
Family Support
- Parental care program
- School start leave
Generous Parental Leave
- 1-month additional parental leave
Mental Health Support
- Wellbeing program
- Free mental health coaching
Free or Subsidized Food
- Swile lunch vouchers
Additional Allowances
- Work Council sport subsidy
- Work Council creative subsidy
- Bicycle subsidy
About the Company
Doctolib
Industry
Healthcare
Description
Das Unternehmen transformiert und digitalisiert die Gesundheitsbranche mit einem Produkt, das einen Mehrwert für die Gesellschaft bietet.
Not a perfect match?
- Doctolib
Engineering Manager - Observability & Reliability Engineering Obsession(m/w/x)
Full-timeWith HomeofficeSeniorBerlin - Talon.One
Senior Site Reliability Engineer(m/w/x)
Full-timeWith HomeofficeSeniorBerlin - Scout24
Senior Platform Engineer - Site Reliability(m/w/x)
Full-timeWith HomeofficeManagementBerlin - fiskaly
Site Reliability Engineer(m/w/x)
Full-timeWith HomeofficeNot specifiedBerlin, Wienfrom 80,000 / year - GetYourGuide
Senior Site Reliability Engineer(m/w/x)
Full-timeWith HomeofficeSeniorBerlin