Senior Site Reliability Engineer (m/f/x)
In this role, you will ensure the reliability and performance of production systems while collaborating with various teams. Your day-to-day responsibilities will involve designing resilient architectures, automating tasks, and advocating for best practices in site reliability engineering.
Requirements
- Strong experience running production systems at scale
- Solid understanding of distributed systems and failure modes
- Proven experience with SLO-driven reliability
- Strong coding skills
- Cloud infrastructure automation experience
- Ability to debug complex cross-system issues
- Ownership mindset and strong communication skills
- Pragmatic approach to reliability, speed, and cost trade-offs
Responsibilities
- Design resilient architectures
- Define reliability standards
- Improve observability and incident response
- Reduce operational toil through automation
- Implement and maintain reliable cloud infrastructure
- Define and evolve SLIs, SLOs, and error budgets
- Enhance monitoring, alerting, and observability across services
- Lead incident response and post-mortems
- Conduct root-cause analysis
- Automate repetitive operational tasks
- Collaborate on service design and scalability
- Address failure modes with backend engineers
- Improve CI/CD pipelines and deployment strategies
- Contribute to infrastructure as code and platform tooling
- Advocate for reliability across the engineering organization
About the company
Jobtome
Industry
IT
Description
The company is an AI-powered job ad platform disrupting how high-volume employers find talent.