Senior Site Reliability Engineer - Observability (m/f/x)
Lead the observability strategy for a cloud-native healthcare platform, building scalable logging and developer-friendly tracing. 28 vacation days and 10 days of work from abroad per year.
Requirements
- Solid hands-on experience (3+ years) on a large-scale production platform
- Proven experience with cloud platforms (AWS, Azure, or Google Cloud)
- Solid understanding of containerization and orchestration (Docker and Kubernetes)
- Strong understanding of Helm and ArgoCD
- Deep expertise in observability tooling and architecture
  - Logging tools: Fluent Bit, OpenTelemetry, Loki, Elasticsearch, Logstash, Vector
  - Tracing tools: OpenTelemetry or proprietary APMs
  - Metrics tools: Prometheus, Thanos, Datadog, or equivalent
- Proficiency in at least one programming language (Ruby, Python, Go, Java, etc.)
- Deep understanding of infrastructure as code principles
- Experience with monitoring and observability tools
- Troubleshooting performance issues in complex environments
- Fluent in English
- Experience contributing to open-source observability projects
- Worked in a high-growth tech environment
- Passionate about developer experience and platform engineering
Responsibilities
- Lead observability strategy
- Build scalable logging capabilities
- Develop developer-friendly tracing
- Identify cross-cutting reliability initiatives
- Improve incident detection
- Enhance incident response
- Conduct postmortem analysis
- Participate in on-call rotation
- Refine alerting processes
- Reduce alert noise
- Ensure actionable telemetry
Professional experience
- 3 years
Education
- Bachelor's degree, or
- Master's degree
Languages
- English – fluent
Tools & Technologies
- AWS
- Azure
- Google Cloud
- Docker
- Kubernetes
- Helm
- ArgoCD
- Fluent Bit
- OpenTelemetry
- Loki
- Elasticsearch
- Logstash
- Vector
- Prometheus
- Thanos
- Datadog
- Ruby
- Python
- Go
- Java
Benefits
Public transport tickets
- Deutschlandticket
Extra vacation days
- Additional vacation day per year
Workation & Sabbatical
- 10 days of work from abroad per year
Health & fitness offerings
- Company health insurance with supplementary benefits
- Subsidized sports membership
Company pension plan
- Company pension scheme with employer subsidy
Generous parental leave
- Parental leave
Attractive compensation
- Employee value sharing plan
Mental health support
- Mental health and coaching services
Flexible working
- Flexible workplace policy
Snacks & drinks
- Healthy snacks
- Breakfast buffet
Free or subsidized meals
- Subsidized meal benefit
Family friendliness
- Caregiver and disability support package
Other benefits
- Relocation support
Modern tech equipment
- Access to AI tools for coding
Training and development
- Dedicated training
About the company
Doctolib
Industry
Healthcare
Description
The company is transforming and digitizing the healthcare industry with a product that delivers added value for society.