Your personal AI career agent
Data Engineering Intern - AI/ML Systems(m/w/x)
Designing data infrastructure and building ingestion pipelines for AI/ML systems in energy management and asset monitoring. Master's or PhD in a technical field, Python, and SQL proficiency required. Focus on feature engineering and data quality for complex datasets.
Requirements
- Master's or PhD in Computer Science, Data Science, Electrical Engineering, or related field, or recent graduation
- Strong proficiency in Python for data manipulation, transformation, and pipeline development
- Solid experience with SQL and relational or time-series data systems
- Understanding of machine learning workflows, training data, feature engineering, inference pipelines, and data quality impact
- Hands-on experience with large, complex datasets, including missing data, schema inconsistencies, temporal alignment, and high-cardinality variables
- Strong data visualization skills with matplotlib, seaborn, or Plotly; clear, informative, reproducible analysis outputs
- Familiarity with cloud data platforms, particularly Microsoft Azure (e.g., Azure Databricks, Azure Data Factory, Azure Data Lake, or Event Hubs); equivalent experience with other major cloud providers
- Exposure to large-scale or streaming data concepts, including real-time ingestion, event-driven architectures, or distributed data processing frameworks (e.g., Spark)
- Experience with relational or analytical databases (e.g., PostgreSQL); awareness of modern storage formats or database categories (columnar, vector, graph) is a plus
- Hands-on ML experience, including model training, feature engineering, or experiment tracking (e.g., MLflow); understanding of data pipeline impact on model performance
- Interest in or prior exposure to time-series sensor data, industrial telemetry, battery systems, or energy infrastructure is a plus
Tasks
- Design data infrastructure for AI/ML projects
- Build data ingestion pipelines for various data types
- Implement data transformation and validation processes
- Develop data quality and monitoring frameworks
- Construct feature engineering pipelines with AI/ML engineers
- Optimize data models for time-series and ML consumption
- Evaluate storage formats and database architectures
- Contribute to streaming data pipeline development
- Support real-time ingestion and event-driven architectures
- Generate synthetic and simulation data for digital twin platforms
- Develop exploratory data analysis and visualizations
- Test and validate data pipelines
- Verify data contracts and perform simulation testing
- Create technical documentation and reports
Education
- Currently in higher education
Languages
- English – Business Fluent
Tools & Technologies
- Python
- SQL
- matplotlib
- seaborn
- Plotly
- Microsoft Azure
- Azure Databricks
- Azure Data Factory
- Azure Data Lake
- Event Hubs
- Spark
- PostgreSQL
- MLflow
Like this job?
BetaYour Career Agent finds similar jobs for you every day.
Not a perfect match?
- Sika AGFull-timeInternshipOn-siteBaar
- EnerSys
Network & Cloud Intern- Azure Infrastructure(m/w/x)
Full-timeSchool InternshipOn-siteZug - Garda Capital Partners
.AI Intern(m/w/x)
Full-timeInternshipOn-siteGenf, Zug - Roche Diagnostics Int. AG
Internship in Innovation & Sustainability(m/w/x)
Full-timeInternshipOn-siteRisch-Rotkreuz - Roche Diagnostics Int. AG
Internship in BGE Development - Production Data Processing(m/w/x)
Full-timeInternshipOn-siteRisch-Rotkreuz
Data Engineering Intern - AI/ML Systems(m/w/x)
Designing data infrastructure and building ingestion pipelines for AI/ML systems in energy management and asset monitoring. Master's or PhD in a technical field, Python, and SQL proficiency required. Focus on feature engineering and data quality for complex datasets.
Requirements
- Master's or PhD in Computer Science, Data Science, Electrical Engineering, or related field, or recent graduation
- Strong proficiency in Python for data manipulation, transformation, and pipeline development
- Solid experience with SQL and relational or time-series data systems
- Understanding of machine learning workflows, training data, feature engineering, inference pipelines, and data quality impact
- Hands-on experience with large, complex datasets, including missing data, schema inconsistencies, temporal alignment, and high-cardinality variables
- Strong data visualization skills with matplotlib, seaborn, or Plotly; clear, informative, reproducible analysis outputs
- Familiarity with cloud data platforms, particularly Microsoft Azure (e.g., Azure Databricks, Azure Data Factory, Azure Data Lake, or Event Hubs); equivalent experience with other major cloud providers
- Exposure to large-scale or streaming data concepts, including real-time ingestion, event-driven architectures, or distributed data processing frameworks (e.g., Spark)
- Experience with relational or analytical databases (e.g., PostgreSQL); awareness of modern storage formats or database categories (columnar, vector, graph) is a plus
- Hands-on ML experience, including model training, feature engineering, or experiment tracking (e.g., MLflow); understanding of data pipeline impact on model performance
- Interest in or prior exposure to time-series sensor data, industrial telemetry, battery systems, or energy infrastructure is a plus
Tasks
- Design data infrastructure for AI/ML projects
- Build data ingestion pipelines for various data types
- Implement data transformation and validation processes
- Develop data quality and monitoring frameworks
- Construct feature engineering pipelines with AI/ML engineers
- Optimize data models for time-series and ML consumption
- Evaluate storage formats and database architectures
- Contribute to streaming data pipeline development
- Support real-time ingestion and event-driven architectures
- Generate synthetic and simulation data for digital twin platforms
- Develop exploratory data analysis and visualizations
- Test and validate data pipelines
- Verify data contracts and perform simulation testing
- Create technical documentation and reports
Education
- Currently in higher education
Languages
- English – Business Fluent
Tools & Technologies
- Python
- SQL
- matplotlib
- seaborn
- Plotly
- Microsoft Azure
- Azure Databricks
- Azure Data Factory
- Azure Data Lake
- Event Hubs
- Spark
- PostgreSQL
- MLflow
Like this job?
BetaYour Career Agent finds similar jobs for you every day.
About the Company
EnerSys
Industry
Manufacturing
Description
Das Unternehmen ist der weltweit führende Anbieter von Energiespeicherlösungen für industrielle Anwendungen.
Not a perfect match?
- Sika AG
Internship - Analytics Engineer(m/w/x)
Full-timeInternshipOn-siteBaar - EnerSys
Network & Cloud Intern- Azure Infrastructure(m/w/x)
Full-timeSchool InternshipOn-siteZug - Garda Capital Partners
.AI Intern(m/w/x)
Full-timeInternshipOn-siteGenf, Zug - Roche Diagnostics Int. AG
Internship in Innovation & Sustainability(m/w/x)
Full-timeInternshipOn-siteRisch-Rotkreuz - Roche Diagnostics Int. AG
Internship in BGE Development - Production Data Processing(m/w/x)
Full-timeInternshipOn-siteRisch-Rotkreuz