Your personal AI career agent
Platform (Site Reliability) Engineering Manager(m/w/x)
Leading a team of SREs focused on multi-region AWS environments, defining reliability strategy with SLIs/SLOs. Proven team leadership and deep AWS expertise required. 33 days annual vacation, 30 days remote work allowance.
Requirements
- Strong Site Reliability Engineering, DevOps, or Platform Engineering experience in AWS
- Proven experience leading and developing engineering teams
- Deep expertise in AWS services (EC2, S3, RDS, Lambda, VPC, IAM)
- Strong knowledge of Infrastructure as Code (Terraform or CloudFormation)
- Experience with container orchestration (ECS or EKS)
- Solid understanding of distributed systems and reliability engineering
- Experience designing and maintaining CI/CD pipelines
- Strong understanding of networking, security, and observability
- Experience managing incident response and operational processes
- Excellent stakeholder management and communication skills
- Fluent English
- Experience with globally distributed systems and large-scale production
- Exposure to security incident response and compliance audits
- Experience supporting AI/ML infrastructure on AWS
- Experience mentoring senior engineers or managers
- Relevant certifications (AWS, Kubernetes, Terraform)
Tasks
- Lead and develop a team of Site Reliability and Platform Engineers
- Foster a culture of ownership and continuous improvement
- Define and drive reliability strategy, including SLIs, SLOs, and error budgets
- Ensure high availability, scalability, and performance in multi-region AWS environments
- Own and improve incident management processes and on-call practices
- Drive automation and reduce operational toil through tooling and standardization
- Partner with Security and Compliance teams to adhere to standards like GDPR, ISO 27001, and SOC 2
- Provide architectural guidance on infrastructure, networking, and platform services
- Collaborate with engineering, product, data, and AI teams for reliable and scalable systems
- Communicate risks, performance metrics, and priorities to technical and non-technical stakeholders
Work Experience
- 5 - 7 years
Education
- Bachelor's degreeOR
- Master's degree
Languages
- English – Fluent
Tools & Technologies
- AWS
- EC2
- S3
- RDS
- Lambda
- VPC
- IAM
- Terraform
- CloudFormation
- ECS
- EKS
- Kubernetes
- AI/ML
Benefits
Healthcare & Fitness
- Additional health insurance coverage
- Subsidy for UrbanSportsClub membership
Flexible Working
- 30 days of remote work allowance
More Vacation Days
- 33 days annual vacation allowance
Additional Allowances
- 70 EUR monthly cashback
- Self-development days allowance
Learning & Development
- 500 EUR yearly personal learning and development budget
Team Events
- Team events
- Company events
Bonuses & Incentives
- Referral bonus
Corporate Discounts
- Corporate benefits
Other Benefits
- Humanoo for Humanoos
Mental Health Support
- Employee Assistance Programme (EAP)
Not a perfect match?
- Bonial International GmbHFull-time/Part-timeWith HomeofficeSeniorBerlin
- Scandit
Engineering Manager, Platform(m/w/x)
Full-timeWith HomeofficeExperiencedZürich, Berlin - Scout24
Senior Platform Engineer - Site Reliability(m/w/x)
Full-timeWith HomeofficeManagementBerlin - bunch
Engineering Manager - Platform(m/w/x)
Full-timeWith HomeofficeExperiencedBerlin - Taxfix
Engineering Manager - Cloud & SRE(m/w/x)
Full-timeWith HomeofficeExperiencedBerlin
Platform (Site Reliability) Engineering Manager(m/w/x)
Leading a team of SREs focused on multi-region AWS environments, defining reliability strategy with SLIs/SLOs. Proven team leadership and deep AWS expertise required. 33 days annual vacation, 30 days remote work allowance.
Requirements
- Strong Site Reliability Engineering, DevOps, or Platform Engineering experience in AWS
- Proven experience leading and developing engineering teams
- Deep expertise in AWS services (EC2, S3, RDS, Lambda, VPC, IAM)
- Strong knowledge of Infrastructure as Code (Terraform or CloudFormation)
- Experience with container orchestration (ECS or EKS)
- Solid understanding of distributed systems and reliability engineering
- Experience designing and maintaining CI/CD pipelines
- Strong understanding of networking, security, and observability
- Experience managing incident response and operational processes
- Excellent stakeholder management and communication skills
- Fluent English
- Experience with globally distributed systems and large-scale production
- Exposure to security incident response and compliance audits
- Experience supporting AI/ML infrastructure on AWS
- Experience mentoring senior engineers or managers
- Relevant certifications (AWS, Kubernetes, Terraform)
Tasks
- Lead and develop a team of Site Reliability and Platform Engineers
- Foster a culture of ownership and continuous improvement
- Define and drive reliability strategy, including SLIs, SLOs, and error budgets
- Ensure high availability, scalability, and performance in multi-region AWS environments
- Own and improve incident management processes and on-call practices
- Drive automation and reduce operational toil through tooling and standardization
- Partner with Security and Compliance teams to adhere to standards like GDPR, ISO 27001, and SOC 2
- Provide architectural guidance on infrastructure, networking, and platform services
- Collaborate with engineering, product, data, and AI teams for reliable and scalable systems
- Communicate risks, performance metrics, and priorities to technical and non-technical stakeholders
Work Experience
- 5 - 7 years
Education
- Bachelor's degreeOR
- Master's degree
Languages
- English – Fluent
Tools & Technologies
- AWS
- EC2
- S3
- RDS
- Lambda
- VPC
- IAM
- Terraform
- CloudFormation
- ECS
- EKS
- Kubernetes
- AI/ML
Benefits
Healthcare & Fitness
- Additional health insurance coverage
- Subsidy for UrbanSportsClub membership
Flexible Working
- 30 days of remote work allowance
More Vacation Days
- 33 days annual vacation allowance
Additional Allowances
- 70 EUR monthly cashback
- Self-development days allowance
Learning & Development
- 500 EUR yearly personal learning and development budget
Team Events
- Team events
- Company events
Bonuses & Incentives
- Referral bonus
Corporate Discounts
- Corporate benefits
Other Benefits
- Humanoo for Humanoos
Mental Health Support
- Employee Assistance Programme (EAP)
About the Company
TELUS Health
Industry
Healthcare
Description
TELUS Health is a global-leading health and well-being provider, improving health outcomes for consumers, patients, and healthcare professionals.
Not a perfect match?
- Bonial International GmbH
Team Lead Platform Engineering(m/w/x)
Full-time/Part-timeWith HomeofficeSeniorBerlin - Scandit
Engineering Manager, Platform(m/w/x)
Full-timeWith HomeofficeExperiencedZürich, Berlin - Scout24
Senior Platform Engineer - Site Reliability(m/w/x)
Full-timeWith HomeofficeManagementBerlin - bunch
Engineering Manager - Platform(m/w/x)
Full-timeWith HomeofficeExperiencedBerlin - Taxfix
Engineering Manager - Cloud & SRE(m/w/x)
Full-timeWith HomeofficeExperiencedBerlin