Doctolib
8d ago

Engineering Manager - Observability & Reliability Engineering Obsession (m/w/x)

Berlin
Full-time, With Home Office, Senior
AI/ML

Description

You will lead a world-class SRE team to evolve the observability platform. By balancing people management with technical strategy, you'll ensure the infrastructure remains scalable and reliable.

Requirements

  • 5+ years software engineering or SRE experience
  • 3+ years engineering management experience
  • Deep understanding of observability tooling and architecture
  • Experience with infrastructure as code and secrets management
  • Ability to balance technical depth with leadership
  • Experience scaling SRE or platform teams
  • Background in high-scale telemetry pipelines
  • Hands-on experience with Vault and Terraform Enterprise
  • Hands-on experience with backend programming languages
  • Experience driving cultural and technical transformations

Tasks

  • Lead and coach Site Reliability Engineers
  • Support technical development and career progression
  • Foster operational excellence and psychological safety
  • Conduct performance reviews and career conversations
  • Recruit and onboard top SRE talent
  • Define the platform-wide observability strategy
  • Manage logging, metrics, tracing, and alerting
  • Oversee HashiCorp Vault and Terraform Enterprise
  • Drive roadmap planning for reliability initiatives
  • Align team objectives with business goals
  • Allocate resources to reduce technical debt
  • Improve the overall developer experience
  • Manage on-call and incident response processes
  • Ensure high availability of transversal services
  • Lead postmortem reviews and systemic improvements
  • Collaborate with Product Managers and architects
  • Partner with security on secrets management
  • Represent the team in leadership forums
  • Promote instrumentation quality and best practices

Tools & Technologies

AWS, GCP, Kubernetes, Fluent Bit, OpenTelemetry, Loki, Elasticsearch, Prometheus, Thanos, Datadog, Terraform, OpenTofu, Vault, AWS Secrets Manager, Terraform Enterprise, Go, Python, Ruby

Languages

English: Business Fluent

Benefits

Healthcare & Fitness

  • Free comprehensive health insurance
  • Sport club membership refund

Family Support

  • Parent Care Program

Generous Parental Leave

  • 1-month additional parental leave

Mental Health Support

  • Free mental health services
  • Psychological support

Mentorship & Coaching

  • Free coaching services

Flexible Working

  • Remote policy adaptation
  • 10 flexibility days policy

More Vacation Days

  • Medical leave extra days
  • Up to 14 RTT days

Additional Allowances

  • Work Council subsidy

Other Benefits

  • Creative class refund

Free or Subsidized Food

  • Lunch vouchers
Find the original job posting in its most current version here. Nejo automatically captured this job from the website of Doctolib and processed the information on Nejo with the help of AI for you. Despite careful analysis, some information may be incomplete or inaccurate. Please always verify all details in the original posting! Content and copyrights of the original posting belong to the advertising company.