Senior Engineer, Operational Excellence (m/f/x)
Reduce incident frequency and MTTR for a travel booking platform. Deep observability tooling knowledge and Java coding skills required. Work from anywhere 30 days/year, annual growth budget.
Requirements
- Deep understanding of observability tooling (Datadog)
- Proven experience reducing MTTD, MTTR, change failure rate
- Strong coding skills in Java
- Comfortable reading/contributing in Go
- Frontend context to collaborate with React/Vue teams
- Experience with Kubernetes
- Experience with AWS
- Experience with service mesh technologies (Istio/Envoy)
- Solid understanding of distributed systems
- Solid understanding of networking
- Solid understanding of container technology
- Hands-on experience with CI/CD
- Hands-on experience with automated testing strategies
- Hands-on experience with build systems
- Ability to influence engineers and teams
- Excellent written communication skills in English
- Excellent verbal communication skills in English
- Positive, proactive team player
- Passionate about operational excellence
- Led company-wide initiatives to improve DORA metrics
- Identified systemic gaps in automated testing
- Driven improvements reducing change failure rate/incidents
- Embedded operational excellence into product engineering culture
- Driven cost-reduction outcomes through improvements
Responsibilities
- Prevent incidents and enhance user trust
- Drive down incident frequency, MTTD, and MTTR
- Lead post-incident reviews and implement improvements
- Build tooling and runbooks for faster issue resolution
- Champion blameless incident handling and continuous improvement
- Participate in infrastructure on-call rotation
- Advance Datadog-based observability practices
- Ensure meaningful SLOs and actionable alerts
- Enable production debugging capabilities
- Improve change failure rate with automated test coverage
- Identify and mitigate blast radius risks and architectural gaps
- Reduce deployment costs and risks through better tooling
- Design and maintain well-documented development paths
- Collaborate with product teams on system design and testability
- Leverage Kubernetes, AWS, and Istio for infrastructure best practices
- Identify and drive cost optimization opportunities
- Leverage AI to accelerate incident response and improve workflows
Work experience
- approx. 4-6 years
Education
- Bachelor's degree OR
- Master's degree
Languages
- English: business fluent
Tools & Technologies
- Datadog
- Java
- Go
- React
- Vue
- Kubernetes
- AWS
- Istio
- Envoy
Benefits
Flexible working
- Hybrid working approach
- Work from anywhere (30 days/year)
Other allowances
- Annual personal growth budget
Mentoring & Coaching
- Mentorship programs
Team events & outings
- Quarterly team events
- Yearly company-wide events
Public transport tickets
- Monthly transportation budget
Health & fitness offers
- Monthly fitness budget
- Health and wellness benefits
Employee discounts
- Discounts on activities
Further training
- Language reimbursement program
About the company
GetYourGuide
Industry
Tourism
Description
GetYourGuide is the globally leading marketplace for unforgettable travel experiences, helping travelers discover the best things to do.