The AI Job Search Engine
Engineering Manager - Observability & Reliability Engineering Obsession(m/w/x)
Leading SREs for healthcare digitalization, fostering operational excellence and psychological safety. 3+ years engineering management experience, deep observability tooling knowledge required. Free comprehensive health insurance, 1-month additional parental leave, free mental health services.
Requirements
- 5+ years software engineering or SRE experience
- 3+ years engineering management experience
- Deep understanding of observability tooling and architecture
- Experience with infrastructure as code and secrets management
- Ability to balance technical depth with leadership
- Experience scaling SRE or platform teams
- Background in high-scale telemetry pipelines
- Hands-on experience with Vault and Terraform Enterprise
- Hands-on experience with backend programming languages
- Experience driving cultural and technical transformations
Tasks
- Lead and coach Site Reliability Engineers
- Support technical development and career progression
- Foster operational excellence and psychological safety
- Conduct performance reviews and career conversations
- Recruit and onboard top SRE talent
- Define the platform-wide observability strategy
- Manage logging, metrics, tracing, and alerting
- Oversee HashiCorp Vault and Terraform Enterprise
- Drive roadmap planning for reliability initiatives
- Align team objectives with business goals
- Allocate resources to reduce technical debt
- Improve the overall developer experience
- Manage on-call and incident response processes
- Ensure high availability of transversal services
- Lead postmortem reviews and systemic improvements
- Collaborate with Product Managers and architects
- Partner with security on secrets management
- Represent the team in leadership forums
- Promote instrumentation quality and best practices
Education
- Bachelor's degreeOR
- Master's degree
Languages
- English – Business Fluent
Tools & Technologies
- AWS
- GCP
- Kubernetes
- Fluent Bit
- OpenTelemetry
- Loki
- Elasticsearch
- Prometheus
- Thanos
- Datadog
- Terraform
- OpenTofu
- Vault
- AWS Secrets Manager
- Terraform Enterprise
- Go
- Python
- Ruby
Benefits
Healthcare & Fitness
- Free comprehensive health insurance
- Sport club membership refund
Family Support
- Parent Care Program
Generous Parental Leave
- 1-month additional parental leave
Mental Health Support
- Free mental health services
- Psychological support
Mentorship & Coaching
- Free coaching services
Flexible Working
- Remote policy adaptation
- 10 flexibility days policy
More Vacation Days
- Medical leave extra days
- Up to 14 RTT days
Additional Allowances
- Work Council subsidy
Other Benefits
- Creative class refund
Free or Subsidized Food
- Lunch vouchers
Not a perfect match?
- DoctolibFull-timeWith HomeofficeSeniorBerlin
- Scout24
Engineering Manager - Platform Engineering(m/w/x)
Full-timeWith HomeofficeSeniorBerlin - Scandit
Engineering Manager, Platform(m/w/x)
Full-timeWith HomeofficeExperiencedZürich, Berlin - Talon.One
Senior Site Reliability Engineer(m/w/x)
Full-timeWith HomeofficeSeniorBerlin - Scout24
Senior Platform Engineer - Site Reliability(m/w/x)
Full-timeWith HomeofficeManagementBerlin
Engineering Manager - Observability & Reliability Engineering Obsession(m/w/x)
Leading SREs for healthcare digitalization, fostering operational excellence and psychological safety. 3+ years engineering management experience, deep observability tooling knowledge required. Free comprehensive health insurance, 1-month additional parental leave, free mental health services.
Requirements
- 5+ years software engineering or SRE experience
- 3+ years engineering management experience
- Deep understanding of observability tooling and architecture
- Experience with infrastructure as code and secrets management
- Ability to balance technical depth with leadership
- Experience scaling SRE or platform teams
- Background in high-scale telemetry pipelines
- Hands-on experience with Vault and Terraform Enterprise
- Hands-on experience with backend programming languages
- Experience driving cultural and technical transformations
Tasks
- Lead and coach Site Reliability Engineers
- Support technical development and career progression
- Foster operational excellence and psychological safety
- Conduct performance reviews and career conversations
- Recruit and onboard top SRE talent
- Define the platform-wide observability strategy
- Manage logging, metrics, tracing, and alerting
- Oversee HashiCorp Vault and Terraform Enterprise
- Drive roadmap planning for reliability initiatives
- Align team objectives with business goals
- Allocate resources to reduce technical debt
- Improve the overall developer experience
- Manage on-call and incident response processes
- Ensure high availability of transversal services
- Lead postmortem reviews and systemic improvements
- Collaborate with Product Managers and architects
- Partner with security on secrets management
- Represent the team in leadership forums
- Promote instrumentation quality and best practices
Education
- Bachelor's degreeOR
- Master's degree
Languages
- English – Business Fluent
Tools & Technologies
- AWS
- GCP
- Kubernetes
- Fluent Bit
- OpenTelemetry
- Loki
- Elasticsearch
- Prometheus
- Thanos
- Datadog
- Terraform
- OpenTofu
- Vault
- AWS Secrets Manager
- Terraform Enterprise
- Go
- Python
- Ruby
Benefits
Healthcare & Fitness
- Free comprehensive health insurance
- Sport club membership refund
Family Support
- Parent Care Program
Generous Parental Leave
- 1-month additional parental leave
Mental Health Support
- Free mental health services
- Psychological support
Mentorship & Coaching
- Free coaching services
Flexible Working
- Remote policy adaptation
- 10 flexibility days policy
More Vacation Days
- Medical leave extra days
- Up to 14 RTT days
Additional Allowances
- Work Council subsidy
Other Benefits
- Creative class refund
Free or Subsidized Food
- Lunch vouchers
About the Company
Doctolib
Industry
Healthcare
Description
Das Unternehmen transformiert und digitalisiert die Gesundheitsbranche mit einem Produkt, das einen Mehrwert für die Gesellschaft bietet.
Not a perfect match?
- Doctolib
Senior Site Reliability Engineer - Observability(m/w/x)
Full-timeWith HomeofficeSeniorBerlin - Scout24
Engineering Manager - Platform Engineering(m/w/x)
Full-timeWith HomeofficeSeniorBerlin - Scandit
Engineering Manager, Platform(m/w/x)
Full-timeWith HomeofficeExperiencedZürich, Berlin - Talon.One
Senior Site Reliability Engineer(m/w/x)
Full-timeWith HomeofficeSeniorBerlin - Scout24
Senior Platform Engineer - Site Reliability(m/w/x)
Full-timeWith HomeofficeManagementBerlin