Your personal AI career agent
Development Architect for the Autonomous Operations Platform (AIOps)(m/w/x)
Conceptualizing autonomous operations platform and detailing Apeiro Reference Architecture. Deep expertise in Python, Java, or Go required. Flexible working models, focus on health and well-being.
Requirements
- Deep expertise in Python, Java, or Go
- Focus on distributed systems
- Focus on service integration
- Focus on cloud-native architecture at scale
- Expert programming skills
- Experience with distributed systems
- Experience with service integration
- Experience with cloud-native architecture at scale
- Experience in dedicated software architecture role
- Proven track record finding solutions to complex problems
- Kubernetes skills
- Operator development
- Custom controllers
- Production operations across multi-cloud
- Experience with Prometheus
- Experience with OpenTelemetry
- Experience with Grafana
- Experience with timeseries databases
- Experience with distributed tracing systems
- Observability skills
- Understanding of AI/ML service consumption
- Understanding of AI/ML service integration
- Understanding of interpreting AI/ML model outputs
- AI/ML integration understanding
- Proficiency in CI/CD pipelines
- Proficiency in Infrastructure-as-Code
- Proficiency in GitOps workflows
- Cloud-native practices proficiency
- Understanding of distributed systems failure modes
- Understanding of graceful degradation
- Understanding of designing for unreliable infrastructure
- Resilience patterns understanding
- Active involvement in open-source communities
- Open communication
- Giving and receiving feedback gracefully
- Enjoyment of working closely with others
- Team spirit
- Experience guiding junior engineers
- Experience guiding mid-level engineers
- Contributing to team learning culture
- Mentorship and knowledge sharing experience
- Urge to question status quo
- Urge to discover problems
- Urge to find creative solutions
- Innovation drive
Tasks
- Conceptualize autonomous operations platform
- Detail out Apeiro Reference Architecture
- Collaborate with Apeiro ecosystem and NeoNephos community
- Plan and design distributed systems
- Define integrations with AI/ML services
- Develop Kubernetes-native operators
- Influence design and implementation approaches
- Tackle telemetry correlation challenges
- Conduct automated root cause analysis
- Design knowledge graph systems
- Ensure core functionality during failures
- Support global cloud infrastructure
- Maximize automated resolution rates
- Minimize manual effort at enterprise scale
Work Experience
- approx. 4 - 6 years
Education
- Bachelor's degreeOR
- Master's degree
Languages
- English – Business Fluent
- German – Basic
Tools & Technologies
- Python
- Java
- Go
- Kubernetes
- Prometheus
- OpenTelemetry
- Grafana
- Terraform
- ArgoCD
- GitHub Actions
Benefits
Learning & Development
- Constant learning
- Skill growth
Other Benefits
- Great benefits
- Accessibility accommodations
Healthcare & Fitness
- Focus on health and well-being
Flexible Working
- Flexible working models
Like this job?
BetaYour Career Agent finds similar jobs for you every day.
Not a perfect match?
- SAPFull-time/Part-timeOn-siteSeniorWalldorf
- SAP
(Senior) DevOps Engineer for Sovereign Cloud(m/w/x)
Full-time/Part-timeOn-siteExperiencedBerlin, St. Leon-Rot - SAP
Senior Full Stack Developer for Sustainable Sovereign Cloud(m/w/x)
Full-time/Part-timeOn-siteSeniorPotsdam, Berlin, Walldorf - All for One Group
Senior Developer Cloud Integration(m/w/x)
Full-time/Part-timeOn-siteSeniorHeidelberg - SAP
Principal Consultant Business Data Strategy & Advisory - Data Architect(m/w/x)
Full-time/Part-timeOn-siteSeniorWalldorf, Berlin, Eschborn, Gerlingen, Hamburg, Garching bei München, München, Dresden, Ratingen, Hannover
Development Architect for the Autonomous Operations Platform (AIOps)(m/w/x)
Conceptualizing autonomous operations platform and detailing Apeiro Reference Architecture. Deep expertise in Python, Java, or Go required. Flexible working models, focus on health and well-being.
Requirements
- Deep expertise in Python, Java, or Go
- Focus on distributed systems
- Focus on service integration
- Focus on cloud-native architecture at scale
- Expert programming skills
- Experience with distributed systems
- Experience with service integration
- Experience with cloud-native architecture at scale
- Experience in dedicated software architecture role
- Proven track record finding solutions to complex problems
- Kubernetes skills
- Operator development
- Custom controllers
- Production operations across multi-cloud
- Experience with Prometheus
- Experience with OpenTelemetry
- Experience with Grafana
- Experience with timeseries databases
- Experience with distributed tracing systems
- Observability skills
- Understanding of AI/ML service consumption
- Understanding of AI/ML service integration
- Understanding of interpreting AI/ML model outputs
- AI/ML integration understanding
- Proficiency in CI/CD pipelines
- Proficiency in Infrastructure-as-Code
- Proficiency in GitOps workflows
- Cloud-native practices proficiency
- Understanding of distributed systems failure modes
- Understanding of graceful degradation
- Understanding of designing for unreliable infrastructure
- Resilience patterns understanding
- Active involvement in open-source communities
- Open communication
- Giving and receiving feedback gracefully
- Enjoyment of working closely with others
- Team spirit
- Experience guiding junior engineers
- Experience guiding mid-level engineers
- Contributing to team learning culture
- Mentorship and knowledge sharing experience
- Urge to question status quo
- Urge to discover problems
- Urge to find creative solutions
- Innovation drive
Tasks
- Conceptualize autonomous operations platform
- Detail out Apeiro Reference Architecture
- Collaborate with Apeiro ecosystem and NeoNephos community
- Plan and design distributed systems
- Define integrations with AI/ML services
- Develop Kubernetes-native operators
- Influence design and implementation approaches
- Tackle telemetry correlation challenges
- Conduct automated root cause analysis
- Design knowledge graph systems
- Ensure core functionality during failures
- Support global cloud infrastructure
- Maximize automated resolution rates
- Minimize manual effort at enterprise scale
Work Experience
- approx. 4 - 6 years
Education
- Bachelor's degreeOR
- Master's degree
Languages
- English – Business Fluent
- German – Basic
Tools & Technologies
- Python
- Java
- Go
- Kubernetes
- Prometheus
- OpenTelemetry
- Grafana
- Terraform
- ArgoCD
- GitHub Actions
Benefits
Learning & Development
- Constant learning
- Skill growth
Other Benefits
- Great benefits
- Accessibility accommodations
Healthcare & Fitness
- Focus on health and well-being
Flexible Working
- Flexible working models
Like this job?
BetaYour Career Agent finds similar jobs for you every day.
About the Company
SAP
Industry
IT
Description
SAP innovations help over four hundred thousand customers worldwide work together more efficiently and use business insight more effectively.
Not a perfect match?
- SAP
(Senior) Engineer - Application Performance Management (APM)(m/w/x)
Full-time/Part-timeOn-siteSeniorWalldorf - SAP
(Senior) DevOps Engineer for Sovereign Cloud(m/w/x)
Full-time/Part-timeOn-siteExperiencedBerlin, St. Leon-Rot - SAP
Senior Full Stack Developer for Sustainable Sovereign Cloud(m/w/x)
Full-time/Part-timeOn-siteSeniorPotsdam, Berlin, Walldorf - All for One Group
Senior Developer Cloud Integration(m/w/x)
Full-time/Part-timeOn-siteSeniorHeidelberg - SAP
Principal Consultant Business Data Strategy & Advisory - Data Architect(m/w/x)
Full-time/Part-timeOn-siteSeniorWalldorf, Berlin, Eschborn, Gerlingen, Hamburg, Garching bei München, München, Dresden, Ratingen, Hannover