Your personal AI career agent
Site Reliability Engineer (SRE)(m/w/x)
Building AI-powered resource planner systems on cloud platforms. 4+ years SRE/DevOps experience required. Apple hardware, annual bonus, top-tier health insurance.
Requirements
- 4+ years SRE, DevOps, or System Engineering experience
- Strong cloud platform knowledge (AWS, Azure, GCP)
- Experience with observability and monitoring tools
- Proficiency in Infrastructure as Code (IaC) tools
- Hands-on containerization and orchestration experience
- Strong Linux system administration
- Strong networking fundamentals
- Experience with incident management
- Experience with debugging
- Experience with root cause analysis
- Proficiency in scripting for automation and monitoring
- Knowledge of load balancing
- Knowledge of failover strategies
- Knowledge of distributed systems
- Understanding of security best practices
- Understanding of access control
- Understanding of compliance requirements
- Strong communication skills
- Ability to collaborate with cross-functional teams
Tasks
- Ensure system reliability, availability, and scalability
- Design and implement scalable, reliable, fault-tolerant systems
- Develop and maintain observability tools
- Automate infrastructure provisioning, deployment, and incident response
- Optimize system performance, scalability, and incident response workflows
- Collaborate with development and DevOps teams on system design
- Conduct root cause analysis and implement preventative measures
- Design and maintain load balancing, failover, and disaster recovery strategies
- Improve CI/CD pipelines for faster, stable deployments
- Optimize cloud cost and resource utilization
- Participate in on-call rotations to address system failures
Work Experience
- 4 years
Education
- Vocational certificationOR
- Bachelor's degreeOR
- Master's degree
Languages
- English – Business Fluent
Tools & Technologies
- AWS
- Azure
- GCP
- Prometheus
- Grafana
- ELK
- Datadog
- New Relic
- Terraform
- CloudFormation
- Pulumi
- Docker
- Kubernetes
- Helm
- Bash
- Python
- Go
Benefits
Modern Equipment
- Apple hardware ecosystem
Bonuses & Incentives
- Annual bonus
Healthcare & Fitness
- Top-tier health insurance
- Urban Sports Club membership
Other Benefits
- Life insurance
- Air Conference participation
Additional Allowances
- Transportation budget
- Meal allowances
Corporate Discounts
- Coverflex benefits package
Mental Health Support
- Well-being support
Childcare
- Childcare support
Retirement Plans
- Pension fund
Free or Subsidized Food
- Free meals at the hub
Not a perfect match?
- FortoFull-timeOn-siteSeniorBerlin
- Nebius
Senior Site Reliability Engineer — AI Studio (Inference Platform)(m/w/x)
Full-timeOn-siteSeniorBerlin - Air Apps
DevOps Engineer Mobile(m/w/x)
Full-timeOn-siteExperiencedBerlin - Air Apps
Software Architect / Solutions Architect(m/w/x)
Full-timeOn-siteSeniorBerlin - Ivy
DevOps Engineer(m/w/x)
Full-timeOn-siteExperiencedBerlin
Site Reliability Engineer (SRE)(m/w/x)
Building AI-powered resource planner systems on cloud platforms. 4+ years SRE/DevOps experience required. Apple hardware, annual bonus, top-tier health insurance.
Requirements
- 4+ years SRE, DevOps, or System Engineering experience
- Strong cloud platform knowledge (AWS, Azure, GCP)
- Experience with observability and monitoring tools
- Proficiency in Infrastructure as Code (IaC) tools
- Hands-on containerization and orchestration experience
- Strong Linux system administration
- Strong networking fundamentals
- Experience with incident management
- Experience with debugging
- Experience with root cause analysis
- Proficiency in scripting for automation and monitoring
- Knowledge of load balancing
- Knowledge of failover strategies
- Knowledge of distributed systems
- Understanding of security best practices
- Understanding of access control
- Understanding of compliance requirements
- Strong communication skills
- Ability to collaborate with cross-functional teams
Tasks
- Ensure system reliability, availability, and scalability
- Design and implement scalable, reliable, fault-tolerant systems
- Develop and maintain observability tools
- Automate infrastructure provisioning, deployment, and incident response
- Optimize system performance, scalability, and incident response workflows
- Collaborate with development and DevOps teams on system design
- Conduct root cause analysis and implement preventative measures
- Design and maintain load balancing, failover, and disaster recovery strategies
- Improve CI/CD pipelines for faster, stable deployments
- Optimize cloud cost and resource utilization
- Participate in on-call rotations to address system failures
Work Experience
- 4 years
Education
- Vocational certificationOR
- Bachelor's degreeOR
- Master's degree
Languages
- English – Business Fluent
Tools & Technologies
- AWS
- Azure
- GCP
- Prometheus
- Grafana
- ELK
- Datadog
- New Relic
- Terraform
- CloudFormation
- Pulumi
- Docker
- Kubernetes
- Helm
- Bash
- Python
- Go
Benefits
Modern Equipment
- Apple hardware ecosystem
Bonuses & Incentives
- Annual bonus
Healthcare & Fitness
- Top-tier health insurance
- Urban Sports Club membership
Other Benefits
- Life insurance
- Air Conference participation
Additional Allowances
- Transportation budget
- Meal allowances
Corporate Discounts
- Coverflex benefits package
Mental Health Support
- Well-being support
Childcare
- Childcare support
Retirement Plans
- Pension fund
Free or Subsidized Food
- Free meals at the hub
About the Company
Air Apps
Industry
IT
Description
Air Apps creates the world's first AI-powered Personal & Entrepreneurial Resource Planner (PRP), aiming to change how people plan, work, and live.
Not a perfect match?
- Forto
Senior Site Reliability Engineer(m/w/x)
Full-timeOn-siteSeniorBerlin - Nebius
Senior Site Reliability Engineer — AI Studio (Inference Platform)(m/w/x)
Full-timeOn-siteSeniorBerlin - Air Apps
DevOps Engineer Mobile(m/w/x)
Full-timeOn-siteExperiencedBerlin - Air Apps
Software Architect / Solutions Architect(m/w/x)
Full-timeOn-siteSeniorBerlin - Ivy
DevOps Engineer(m/w/x)
Full-timeOn-siteExperiencedBerlin