Skip to content
New Job?Nejo!

Your personal AI career agent

INInfosys Consulting - Europe

Site Reliability Engineering (SRE) Architect(m/w/x)

München
Full-timeOn-siteSenior
AI/ML

Architecting public cloud infrastructure and observability strategies for global firms. Expert-level cloud and Kubernetes experience required. Hybrid work, 4-day work week.

Requirements

  • 10+ years software engineering, DevOps, or systems engineering experience
  • At least 5 years senior SRE or systems architecture experience
  • Expert-level knowledge of AWS, GCP, or Azure
  • Expert-level knowledge of core cloud services (compute, storage, networking, managed databases)
  • Deep, hands-on Kubernetes cluster design and management experience
  • Deep, hands-on container-based microservices architecture experience
  • Proven expertise architecting infrastructure with Terraform
  • Proficiency with Ansible, Chef, or Puppet
  • Extensive experience implementing monitoring and observability solutions
  • Experience with Prometheus, Grafana, OpenTelemetry, Jaeger, or ELK Stack
  • Experience with commercial observability tools (Datadog, New Relic)
  • Strong proficiency in Go or Python for automation
  • Strong proficiency in Go or Python for tooling
  • Strong proficiency in Go or Python for building system integrations
  • Deep understanding of distributed systems
  • Deep understanding of networking protocols (TCP/IP, HTTP)
  • Deep understanding of high-availability design patterns
  • Experience working across multiple cloud environments (multi-cloud)
  • Professional cloud certifications (e.g., AWS Certified Solutions Architect Professional, Google Professional Cloud Architect)
  • Experience with service mesh technologies like Istio or Linkerd
  • Knowledge of security best practices in cloud-native environment (DevSecOps)
  • Demonstrated experience leading large-scale technology transformations
  • Demonstrated experience influencing engineering culture

Tasks

  • Design and architect infrastructure and application services on public cloud platforms
  • Define long-term vision for system reliability and performance
  • Establish standards for SLOs, SLIs, and error budgets
  • Architect a comprehensive observability strategy
  • Design systems for logging, metrics, tracing, and alerting
  • Lead automation and IaC strategy
  • Design reusable patterns and frameworks for infrastructure provisioning
  • Identify and mitigate reliability risks
  • Design and champion resilience patterns and disaster recovery plans
  • Design and champion chaos engineering experiments
  • Act as a thought leader in reliability engineering
  • Mentor SREs and developers on reliability best practices
  • Lead architectural review sessions for reliability
  • Analyze major incidents to identify architectural weaknesses
  • Drive design changes to prevent incident recurrence
  • Evolve postmortem culture and incident response capabilities

Work Experience

  • 10 years

Education

  • Bachelor's degreeOR
  • Master's degree

Languages

  • EnglishBusiness Fluent

Tools & Technologies

  • AWS
  • GCP
  • Azure
  • Kubernetes
  • Terraform
  • Ansible
  • Chef
  • Puppet
  • Prometheus
  • Grafana
  • OpenTelemetry
  • Jaeger
  • ELK Stack
  • Datadog
  • New Relic
  • Go
  • Python
  • Istio
  • Linkerd
Find the original job posting in its most current version here. Nejo automatically captured this job from the website of Infosys Consulting - Europe and processed the information on Nejo with the help of AI for you. Despite careful analysis, some information may be incomplete or inaccurate. Please always verify all details in the original posting! Content and copyrights of the original posting belong to the advertising company.

  • Workato

    Senior Infrastructure Engineer - Observability(m/w/x)

    Full-timeOn-siteSenior
    Berlin, Frankfurt am Main, München
  • realworld one

    Senior DevOps Engineer(m/w/x)

    Full-timeOn-siteSenior
    München
  • Entrix

    Senior / Staff Cloud Engineer(m/w/x)

    Full-timeOn-siteManagement
    München
    from 135,000 / year
  • Hawk

    Customer Cloud Engineer(m/w/x)

    Full-timeOn-siteExperienced
    München
  • Helsing

    Site Reliability Engineer(m/w/x)

    Full-timeOn-siteExperienced
    München
View all 100+ similar jobs

Nejo is an AI – results may be incomplete or contain mistakes