Site Reliability Engineer (m/f/x)
Operate and evolve cloud and on-prem infrastructure for custom AI solutions on Kubernetes. 2-5 years of experience with large-scale production infrastructure required. Remote-first setup, 30 days vacation.
Requirements
- 2-5 years of experience with large-scale production infrastructure
- Experience with distributed or service-oriented architectures
- Working knowledge of Infrastructure as Code (Terraform preferred)
- Solid troubleshooting skills across systems
- Pragmatic mindset balancing speed, simplicity, and reliability
- Ownership and accountability for systems end-to-end
- Ability to work independently while aligned with team goals
- Experience optimizing cloud costs at scale
- Interest or experience in Machine Learning / LLM systems
- Experience improving developer experience and platform tooling using AI agents
- Contributions to SRE practices like postmortems, SLIs/SLOs, and reliability engineering culture
Responsibilities
- Build and operate real-world infrastructure
- Design, configure, and evolve cloud and on-prem environments
- Make the self-hosted platform production-ready
- Deliver a platform deployable on any Kubernetes setup
- Improve CI/CD pipelines and GitOps setups
- Optimize GitHub workflows for faster, more reliable deployments
- Simplify systems and reduce infrastructure costs
- Maintain performance and reliability during optimization
- Champion reliability, scalability, and security best practices
- Implement best practices as working systems
Professional experience
- 2-5 years
Education
- Completed vocational training, OR
- Bachelor's degree, OR
- Master's degree
Languages
- German (business fluent)
Tools & Technologies
- AWS
- Kubernetes
- ArgoCD
- Terraform
- Datadog
- Prometheus
Benefits
Attractive compensation
- Stock options
Flexible working
- Remote-first setup
- Flexible hours
Modern tech equipment
- Choice of tech
More vacation days
- 30 days vacation
Family-friendliness
- Family sick leave
Other allowances
- Monthly sports allowance
Mental health support
- Mental health support allowance
Learning and development
- Annual learning & development budget
Team events & outings
- Monthly team socials
- In-person meetups
Relaxed company culture
- Dog-friendly HQ
About the company
deepset GmbH
Industry
IT
Description
The company is an AI startup that empowers developers to build applications using natural language as the interface to data.