Your personal AI career agent
Senior Site Reliability Engineer - Observability(m/w/x)
Leading observability strategy for a cloud-native healthcare platform. Building scalable logging and developer-friendly tracing required. 28 vacation days, 10 days work abroad.
Requirements
- Solid hands-on experience (3y+) on large-scale production platform
- Proven experience with cloud platforms (AWS, Azure, or Google Cloud)
- Solid understanding of containerization and orchestration (Docker and Kubernetes)
- Strong understanding of Helm and ArgoCD
- Deep expertise in observability tooling and architecture
- Logging tools: Fluent Bit, OpenTelemetry, Loki, Elasticsearch, Logstash, Vector
- Tracing tools: OpenTelemetry or proprietary APMs
- Metrics tools: Prometheus, Thanos, Datadog, or equivalent
- Proficiency in at least one programming language (Ruby, Python, Go, Java, etc.)
- Deep understanding of infrastructure as code principles
- Experience with monitoring and observability tools
- Troubleshooting performance issues in complex environments
- Fluent in English
- Experience contributing to open-source observability projects
- Worked in a high-growth tech environment
- Passionate about developer experience and platform engineering
Tasks
- Lead observability strategy
- Build scalable logging capabilities
- Develop developer-friendly tracing
- Identify cross-cutting reliability initiatives
- Improve incident detection
- Enhance incident response
- Conduct postmortem analysis
- Participate in on-call rotation
- Refine alerting processes
- Reduce alert noise
- Ensure actionable telemetry
Work Experience
- 3 years
Education
- Bachelor's degreeOR
- Master's degree
Languages
- English – Native
Tools & Technologies
- AWS
- Azure
- Google Cloud
- Docker
- Kubernetes
- Helm
- ArgoCD
- Fluent Bit
- OpenTelemetry
- Loki
- Elasticsearch
- Logstash
- Vector
- Prometheus
- Thanos
- Datadog
- Ruby
- Python
- Go
- Java
Benefits
Public Transport Subsidies
- Deutschlandticket
More Vacation Days
- Additional vacation day per year
Workation & Sabbatical
- 10 days work from abroad per year
Healthcare & Fitness
- Company health insurance with supplementary benefits
- Subsidized sports membership
Retirement Plans
- Company pension scheme with employer subsidy
Generous Parental Leave
- Parental leave
Competitive Pay
- Employee value sharing plan
Mental Health Support
- Mental health and coaching services
Flexible Working
- Flexible workplace policy
Snacks & Drinks
- Healthy snacks
- Breakfast buffet
Free or Subsidized Food
- Subsidized meal benefit
Family Support
- Caregiver and disability support package
Other Benefits
- Relocation support
Modern Equipment
- Access to AI tools for coding
Learning & Development
- Dedicated training
Like this job?
BetaYour Career Agent finds similar jobs for you every day.
Not a perfect match?
- Scout24Full-timeWith HomeofficeManagementBerlin
- SysEleven GmbH
Senior Site Reliability Engineer - Kubernetes Plattform(m/w/x)
Full-time/Part-timeRemoteSeniorBerlin - flaschenpost SE
(Senior) Site Reliability Engineer / DevOps(m/w/x)
Full-timeWith HomeofficeExperiencedBerlin - Redcare Pharmacy
Senior Site Reliability Engineer(m/w/x)
Full-timeWith HomeofficeSeniorBerlin - ImmoScout24
Senior Platform Engineer - Site Reliability(m/w/x)
Full-timeWith HomeofficeManagementBerlin
Senior Site Reliability Engineer - Observability(m/w/x)
Leading observability strategy for a cloud-native healthcare platform. Building scalable logging and developer-friendly tracing required. 28 vacation days, 10 days work abroad.
Requirements
- Solid hands-on experience (3y+) on large-scale production platform
- Proven experience with cloud platforms (AWS, Azure, or Google Cloud)
- Solid understanding of containerization and orchestration (Docker and Kubernetes)
- Strong understanding of Helm and ArgoCD
- Deep expertise in observability tooling and architecture
- Logging tools: Fluent Bit, OpenTelemetry, Loki, Elasticsearch, Logstash, Vector
- Tracing tools: OpenTelemetry or proprietary APMs
- Metrics tools: Prometheus, Thanos, Datadog, or equivalent
- Proficiency in at least one programming language (Ruby, Python, Go, Java, etc.)
- Deep understanding of infrastructure as code principles
- Experience with monitoring and observability tools
- Troubleshooting performance issues in complex environments
- Fluent in English
- Experience contributing to open-source observability projects
- Worked in a high-growth tech environment
- Passionate about developer experience and platform engineering
Tasks
- Lead observability strategy
- Build scalable logging capabilities
- Develop developer-friendly tracing
- Identify cross-cutting reliability initiatives
- Improve incident detection
- Enhance incident response
- Conduct postmortem analysis
- Participate in on-call rotation
- Refine alerting processes
- Reduce alert noise
- Ensure actionable telemetry
Work Experience
- 3 years
Education
- Bachelor's degreeOR
- Master's degree
Languages
- English – Native
Tools & Technologies
- AWS
- Azure
- Google Cloud
- Docker
- Kubernetes
- Helm
- ArgoCD
- Fluent Bit
- OpenTelemetry
- Loki
- Elasticsearch
- Logstash
- Vector
- Prometheus
- Thanos
- Datadog
- Ruby
- Python
- Go
- Java
Benefits
Public Transport Subsidies
- Deutschlandticket
More Vacation Days
- Additional vacation day per year
Workation & Sabbatical
- 10 days work from abroad per year
Healthcare & Fitness
- Company health insurance with supplementary benefits
- Subsidized sports membership
Retirement Plans
- Company pension scheme with employer subsidy
Generous Parental Leave
- Parental leave
Competitive Pay
- Employee value sharing plan
Mental Health Support
- Mental health and coaching services
Flexible Working
- Flexible workplace policy
Snacks & Drinks
- Healthy snacks
- Breakfast buffet
Free or Subsidized Food
- Subsidized meal benefit
Family Support
- Caregiver and disability support package
Other Benefits
- Relocation support
Modern Equipment
- Access to AI tools for coding
Learning & Development
- Dedicated training
Like this job?
BetaYour Career Agent finds similar jobs for you every day.
About the Company
Doctolib
Industry
Healthcare
Description
Das Unternehmen transformiert und digitalisiert die Gesundheitsbranche mit einem Produkt, das einen Mehrwert für die Gesellschaft bietet.
Not a perfect match?
- Scout24
Senior Platform Engineer - Site Reliability(m/w/x)
Full-timeWith HomeofficeManagementBerlin - SysEleven GmbH
Senior Site Reliability Engineer - Kubernetes Plattform(m/w/x)
Full-time/Part-timeRemoteSeniorBerlin - flaschenpost SE
(Senior) Site Reliability Engineer / DevOps(m/w/x)
Full-timeWith HomeofficeExperiencedBerlin - Redcare Pharmacy
Senior Site Reliability Engineer(m/w/x)
Full-timeWith HomeofficeSeniorBerlin - ImmoScout24
Senior Platform Engineer - Site Reliability(m/w/x)
Full-timeWith HomeofficeManagementBerlin