Neuer Job?Nejo!

Dein persönlicher KI-Karriere-Agent

WOWorkato

vor 2 Monaten

Senior Infrastructure Engineer - Observability(m/w/x)

Berlin, Frankfurt am Main, München

VollzeitVor OrtSenior

Nejo KI-Zusammenfassung

Designing and scaling production logging, metrics, and tracing stacks across multiple data centers and Kubernetes clusters. 8+ years industry experience with hands-on production observability systems required. Building infrastructure for an AI-powered orchestration platform.

Anforderungen

8+ years industry experience
Solid hands-on production experience with observability systems
Strong plus: familiarity with OpenTelemetry, Kafka, Vector, VictoriaMetrics
Experience with logging pipelines: design, deployment, refactoring
Understanding of distributed tracing and SPM
Experience with Kubernetes cluster lifecycle management (EKS preferred)
Practical knowledge of storage trade-offs for observability data
Experience using AI to automate infrastructure or observability tasks
Familiarity with AI-assisted tooling selection and workflow integration
Experience with MCP (custom or open-source implementations)
Background in cloud account or environment migrations
Experience preparing infrastructure for compliance/audit processes
Understanding network architecture, troubleshooting, incident resolution, Post-mortems
Experience with containers and Kubernetes (installation, configuration of operators)
Basic knowledge of Python, Golang, Java
Good communication and collaboration skills
Interest in modern big distributed storage technologies, architectures
Good Spoken English for technical discussions
Balance of hands-on and analytical approaches

Aufgaben

Design, deploy, and maintain production observability stacks (logs, metrics, traces)
Scale observability infrastructure across multiple data centers and Kubernetes clusters
Manage logging pipeline architecture and refactoring efforts
Improve distributed tracing coverage
Drive distributed tracing adoption across engineering teams
Manage EKS upgrades, node exporters, agents, and collectors
Automate operational tasks to reduce toil and improve system stability
Ensure compliance and audit readiness for access controls, data handling, and pipeline integrity
Evaluate and adopt new observability tooling

Berufserfahrung

8 Jahre

Ausbildung

Bachelor-AbschlussODER
Master-Abschluss

Sprachen

Englisch – verhandlungssicher

Tools & Technologien

OpenTelemetry
Kafka
Vector
VictoriaMetrics
vmagent
alerting rules
Kubernetes
EKS
Containers
Python
Golang
Java
AI
MCP

Die Originalanzeige dieses Stellenangebotes in der aktuellsten Version findest du hier. Nejo hat diesen Job automatisch von der Website des Unternehmens Workato erfasst und die Informationen auf Nejo mit Hilfe von KI für dich aufbereitet. Trotz sorgfältiger Analyse können einzelne Informationen unvollständig oder ungenau sein. Bitte prüfe immer alle Angaben in der Originalanzeige! Inhalte und Urheberrechte der Originalanzeige liegen beim ausschreibenden Unternehmen.

Gefällt dir diese Stelle?

Beta

Dein Career Agent findet täglich ähnliche Jobs für dich.

Noch nicht perfekt?

Trade Republic
Observability Tech Lead(m/w/x)
Vollzeitnur vor OrtSenior
Berlin
Perplexity
Senior Backend/Infrastructure Engineer - Search(m/w/x)
Vollzeitnur vor OrtSenior
Berlin
Nebius
Senior Site Reliability Engineer — AI Studio (Inference Platform)(m/w/x)
Vollzeitnur vor OrtSenior
Berlin
emnify
Staff/Senior AWS Cloud Platform Engineer(m/w/x)
Vollzeitnur vor OrtSenior
Berlin
1GLOBAL
Senior Site Reliability Engineer (SRE)(m/w/x)
Vollzeitnur vor OrtSenior
Berlin

Alle 100+ ähnlichen Jobs ansehen

WOWorkato

vor 2 Monaten