Dein persönlicher KI-Karriere-Agent
Biology Data Quality Engineer(m/w/x)
Developing data validation protocols for histology, omics, and clinical datasets for an AI biology foundation model. Strong domain expertise in biology required. Equity package included.
Anforderungen
- Team-first attitude and independence
- Curiosity and detail-orientation
- Ability to thrive in dynamic environments
- Strong domain expertise in biology
- Strong computational and hands-on skills
- Deep understanding of transcriptomics data
- Knowledge of genomics and proteomics data
- Experience in data quality control
- Familiarity with data validation tools
- Strong analytical and problem-solving skills
- Proficiency in Python
- Knowledge of data visualization libraries
- Excellent written and verbal communication
- MSc in Biology, Computational Biology, or Bioinformatics
- Experience in machine learning for histology
- Experience working with AWS
- Experience with data annotation guidelines
- Experience with data ontologies
- Experience building large-scale data collections
- Experience in spatial alignment of datasets
Aufgaben
- Develop comprehensive data validation protocols for histology, omics, and clinical datasets
- Design automated pipelines to streamline data validation and identify issues early
- Establish data standardization practices for seamless integration across different data types
- Curate biological datasets to enhance their usability for machine learning
- Collaborate with the R&D team to address specific data quality requirements
- Communicate findings and recommendations to technical and non-technical stakeholders
- Coordinate and synchronize data standards with external providers
- Document quality assessment procedures, validation results, and data specifications
- Generate regular reports on data quality metrics and trends
- Evaluate external public data sources for foundation model training suitability
- Implement improvements to data quality processes based on industry best practices
Berufserfahrung
- ca. 1 - 4 Jahre
Ausbildung
- Master-Abschluss
Sprachen
- Englisch – verhandlungssicher
Tools & Technologien
- Python
- matplotlib
- AWS
Benefits
Flexibles Arbeiten
- Flexible work arrangements
- Remote work options
Attraktive Vergütung
- Competitive salary
- Equity package
Karriere- und Weiterentwicklung
- Professional growth opportunities
- Leadership development opportunities
Sinnstiftende Arbeit
- Collaborative and mission-driven environment
Noch nicht perfekt?
- Data4LifeVollzeitmit HomeofficeBerufserfahrenBerlin, Potsdam
- Statista
Data Engineer - Healthcare(m/w/x)
Vollzeitmit HomeofficeBerufserfahrenHamburg, Berlin - Veeva Systems
Senior Data Analyst(m/w/x)
Vollzeitmit HomeofficeSeniorBerlin - Clue
(Senior) ML Engineer(m/w/x)
Vollzeitmit HomeofficeSeniorBerlin - GROPYUS
Data Engineer(m/w/x)
Vollzeitmit HomeofficeBerufserfahrenBerlin
Biology Data Quality Engineer(m/w/x)
Developing data validation protocols for histology, omics, and clinical datasets for an AI biology foundation model. Strong domain expertise in biology required. Equity package included.
Anforderungen
- Team-first attitude and independence
- Curiosity and detail-orientation
- Ability to thrive in dynamic environments
- Strong domain expertise in biology
- Strong computational and hands-on skills
- Deep understanding of transcriptomics data
- Knowledge of genomics and proteomics data
- Experience in data quality control
- Familiarity with data validation tools
- Strong analytical and problem-solving skills
- Proficiency in Python
- Knowledge of data visualization libraries
- Excellent written and verbal communication
- MSc in Biology, Computational Biology, or Bioinformatics
- Experience in machine learning for histology
- Experience working with AWS
- Experience with data annotation guidelines
- Experience with data ontologies
- Experience building large-scale data collections
- Experience in spatial alignment of datasets
Aufgaben
- Develop comprehensive data validation protocols for histology, omics, and clinical datasets
- Design automated pipelines to streamline data validation and identify issues early
- Establish data standardization practices for seamless integration across different data types
- Curate biological datasets to enhance their usability for machine learning
- Collaborate with the R&D team to address specific data quality requirements
- Communicate findings and recommendations to technical and non-technical stakeholders
- Coordinate and synchronize data standards with external providers
- Document quality assessment procedures, validation results, and data specifications
- Generate regular reports on data quality metrics and trends
- Evaluate external public data sources for foundation model training suitability
- Implement improvements to data quality processes based on industry best practices
Berufserfahrung
- ca. 1 - 4 Jahre
Ausbildung
- Master-Abschluss
Sprachen
- Englisch – verhandlungssicher
Tools & Technologien
- Python
- matplotlib
- AWS
Benefits
Flexibles Arbeiten
- Flexible work arrangements
- Remote work options
Attraktive Vergütung
- Competitive salary
- Equity package
Karriere- und Weiterentwicklung
- Professional growth opportunities
- Leadership development opportunities
Sinnstiftende Arbeit
- Collaborative and mission-driven environment
Über das Unternehmen
Bioptimus
Branche
Research
Beschreibung
The company is building the first universal AI foundation model for biology to fuel breakthrough discoveries and accelerate innovation in biomedicine.
Noch nicht perfekt?
- Data4Life
Data Engineer(m/w/x)
Vollzeitmit HomeofficeBerufserfahrenBerlin, Potsdam - Statista
Data Engineer - Healthcare(m/w/x)
Vollzeitmit HomeofficeBerufserfahrenHamburg, Berlin - Veeva Systems
Senior Data Analyst(m/w/x)
Vollzeitmit HomeofficeSeniorBerlin - Clue
(Senior) ML Engineer(m/w/x)
Vollzeitmit HomeofficeSeniorBerlin - GROPYUS
Data Engineer(m/w/x)
Vollzeitmit HomeofficeBerufserfahrenBerlin