Your personal AI career agent
Designing and evolving canonical data schemas for AI systems at iconic gaming brand Atari. End-to-end data pipeline development and ownership experience required. Building and maintaining production-grade pipelines with diverse source system integration.
Requirements
- End-to-end production data pipeline development and ownership
- Strong data schema design skills
- Experience integrating with diverse source systems
- Hands-on experience with pipeline orchestration tooling
- Data cleaning and transformation expertise
- Experience with vector databases and relational/document stores
- Working knowledge of RAG systems and context assembly
- Experience partnering with AI or ML engineers
- Ability to interrogate and profile data across sources
- Cloud platform experience (AWS, Azure, or GCP)
- Experience in the gaming industry
- Familiarity with game engine data formats, asset pipelines, or platform SDK data structures
Tasks
- Design and own canonical data schemas
- Evolving schemas based on AI system requirements
- Document schema decisions clearly
- Identify and connect to source systems
- Build and maintain production-grade data pipelines
- Keep pipelines current with source changes
- Transform raw data into structured knowledge
- Own data quality end-to-end
- Trace and fix quality issues at the root
- Monitor and maintain data pipeline health
- Run periodic data cleanup
- Manage knowledge deprecation
- Diagnose and fix knowledge gaps
- Understand RAG retrieval and context assembly
- Surface and address knowledge gaps proactively
- Extract and convert specialist knowledge
- Select and manage appropriate data storage
- Implement data lineage and audit trails
- Enforce data versioning and change management
- Build a legible and accessible data estate
- Translate between domain and engineering language
- Work with domain experts to define schema and pipeline requirements
- Inform stakeholders on pipeline health and data quality
Work Experience
- approx. 4 - 6 years
Education
- Bachelor's degreeOR
- Master's degree
Languages
- English – Business Fluent
Tools & Technologies
- Apache Airflow
- Prefect
- dbt
- Pinecone
- Weaviate
- pgvector
- SQL
- Python
- AWS
- Azure
- GCP
- Xbox GDK
- PlayStation SDK
Not a perfect match?
- XITASO GmbHFull-time/Part-timeOn-siteSeniorAugsburg, Krumbach (Schwaben), Berlin, Erlangen, Leipzig, Münster, München, Karlsruhefrom 76,000 / year
- Thalia Bücher GmbH
AI Engineer – Schwerpunkt Generative KI Systeme(m/w/x)
Full-timeOn-siteNot specifiedMünster - ISR
Senior Consultant SAP Data & Analytics(m/w/x)
Full-timeOn-siteSeniorBraunschweig, Frankfurt am Main, Hamburg, Köln, Münster - Thalia Bücher GmbH
Data Analyst Catalogue(m/w/x)
Full-timeOn-siteExperiencedMünster - OEDIV Oetker Daten- und Informationsverarbeitung KG
PreSales Engineer Data & AI(m/w/x)
Full-time/Part-timeOn-siteExperiencedBielefeld, Rostock, Augsburg, Oldenburg, Berlin, Chemnitz, Frankfurt am Main, Köln, Münster
Designing and evolving canonical data schemas for AI systems at iconic gaming brand Atari. End-to-end data pipeline development and ownership experience required. Building and maintaining production-grade pipelines with diverse source system integration.
Requirements
- End-to-end production data pipeline development and ownership
- Strong data schema design skills
- Experience integrating with diverse source systems
- Hands-on experience with pipeline orchestration tooling
- Data cleaning and transformation expertise
- Experience with vector databases and relational/document stores
- Working knowledge of RAG systems and context assembly
- Experience partnering with AI or ML engineers
- Ability to interrogate and profile data across sources
- Cloud platform experience (AWS, Azure, or GCP)
- Experience in the gaming industry
- Familiarity with game engine data formats, asset pipelines, or platform SDK data structures
Tasks
- Design and own canonical data schemas
- Evolving schemas based on AI system requirements
- Document schema decisions clearly
- Identify and connect to source systems
- Build and maintain production-grade data pipelines
- Keep pipelines current with source changes
- Transform raw data into structured knowledge
- Own data quality end-to-end
- Trace and fix quality issues at the root
- Monitor and maintain data pipeline health
- Run periodic data cleanup
- Manage knowledge deprecation
- Diagnose and fix knowledge gaps
- Understand RAG retrieval and context assembly
- Surface and address knowledge gaps proactively
- Extract and convert specialist knowledge
- Select and manage appropriate data storage
- Implement data lineage and audit trails
- Enforce data versioning and change management
- Build a legible and accessible data estate
- Translate between domain and engineering language
- Work with domain experts to define schema and pipeline requirements
- Inform stakeholders on pipeline health and data quality
Work Experience
- approx. 4 - 6 years
Education
- Bachelor's degreeOR
- Master's degree
Languages
- English – Business Fluent
Tools & Technologies
- Apache Airflow
- Prefect
- dbt
- Pinecone
- Weaviate
- pgvector
- SQL
- Python
- AWS
- Azure
- GCP
- Xbox GDK
- PlayStation SDK
About the Company
Atari, Inc.
Industry
Entertainment
Description
Atari is an innovative and rapidly growing gaming company, creating immersive experiences for players around the world.
Not a perfect match?
- XITASO GmbH
Senior AI Engineer(m/w/x)
Full-time/Part-timeOn-siteSeniorAugsburg, Krumbach (Schwaben), Berlin, Erlangen, Leipzig, Münster, München, Karlsruhefrom 76,000 / year - Thalia Bücher GmbH
AI Engineer – Schwerpunkt Generative KI Systeme(m/w/x)
Full-timeOn-siteNot specifiedMünster - ISR
Senior Consultant SAP Data & Analytics(m/w/x)
Full-timeOn-siteSeniorBraunschweig, Frankfurt am Main, Hamburg, Köln, Münster - Thalia Bücher GmbH
Data Analyst Catalogue(m/w/x)
Full-timeOn-siteExperiencedMünster - OEDIV Oetker Daten- und Informationsverarbeitung KG
PreSales Engineer Data & AI(m/w/x)
Full-time/Part-timeOn-siteExperiencedBielefeld, Rostock, Augsburg, Oldenburg, Berlin, Chemnitz, Frankfurt am Main, Köln, Münster