Your personal AI career agent
PhD Research Internship – Robotics Engineer (VLM / VLA Models)(m/w/x)
Designing and developing Vision-Language-Action systems for autonomous mining and construction machines. PhD program enrollment in Robotics, CS, ML, or EE required. Relocation assistance, stock options, and regular social events.
Requirements
- PhD program enrollment in Robotics, CS, ML, EE, or related
- Strong Python and deep learning framework programming
- Solid ML, deep learning, multi-modal model understanding
- Independent research and project drive ability
- Strong analytical and problem-solving skills
- Experience with Vision-Language Models, embodied AI, robotics learning
- Familiarity with GenAI tooling (Hugging Face, Gemini, Unsloth)
- Experience with multi-modal data (vision + sensor fusion)
- Background in robotics, control systems, or real-world deployment
- Research output track record (publications, preprints)
- Experience with large-scale training, distributed systems, optimization
Tasks
- Design and develop Vision-Language-Action systems for industrial settings
- Explore scalable architectures for multi-modal reasoning and action generation
- Advance state-of-the-art methods in embodied AI and robotic autonomy
- Lead the design and analysis of large-scale multi-modal datasets
- Develop self-supervised or weakly supervised dataset generation pipelines
- Investigate data-centric approaches to improve robustness and generalization
- Build, adapt, and extend cutting-edge GenAI models
- Apply advanced fine-tuning strategies for parameter-efficient tuning
- Explore prompt optimization, reasoning augmentation, and action grounding
- Design rigorous evaluation protocols for embodied AI systems
- Run large-scale experiments, analyze performance, and iterate systematically
- Benchmark models against state-of-the-art approaches and internal baselines
- Collaborate with engineering teams to transition research prototypes into production
- Optimize models for real-time inference, robustness, and safety
- Document findings and contribute to research publications
- Present results internally and at leading conferences
- Build physical AI for off-highway machinery
- Combine cutting-edge robotics research with real-world heavy mobile equipment
- Tailor your career path as a technical specialist or team lead
- Influence the future of robotic technologies and tackle significant challenges
Education
- Currently in higher education
Languages
- English – Business Fluent
Tools & Technologies
- Python
- PyTorch
- Hugging Face ecosystem
- Gemini
- Unsloth
Benefits
Competitive Pay
- Attractive compensation package
- Stock options
Snacks & Drinks
- Beverages on-site
Team Events
- Regular social events
Other Benefits
- Relocation assistance
Like this job?
BetaYour Career Agent finds similar jobs for you every day.
Not a perfect match?
- sensmoreFull-timeOn-siteNot specifiedBerlin, Potsdam
- Helsing
AI Research Engineer - Foundation Models(m/w/x)
Full-timeOn-siteExperiencedBerlin, München - Helsing
AI Research Engineer - 3D Computer Vision(m/w/x)
Full-timeOn-siteSeniorMünchen, Berlin - Fraunhofer-Gesellschaft
Mandatory Internship: Computer Vision and AI in Robotics(m/w/x)
Full-timeInternshipOn-siteBerlin - Fraunhofer-Gesellschaft
Mandatory internship: AI-based industrial image processing(m/w/x)
Full-timeInternshipOn-siteBerlin
PhD Research Internship – Robotics Engineer (VLM / VLA Models)(m/w/x)
Designing and developing Vision-Language-Action systems for autonomous mining and construction machines. PhD program enrollment in Robotics, CS, ML, or EE required. Relocation assistance, stock options, and regular social events.
Requirements
- PhD program enrollment in Robotics, CS, ML, EE, or related
- Strong Python and deep learning framework programming
- Solid ML, deep learning, multi-modal model understanding
- Independent research and project drive ability
- Strong analytical and problem-solving skills
- Experience with Vision-Language Models, embodied AI, robotics learning
- Familiarity with GenAI tooling (Hugging Face, Gemini, Unsloth)
- Experience with multi-modal data (vision + sensor fusion)
- Background in robotics, control systems, or real-world deployment
- Research output track record (publications, preprints)
- Experience with large-scale training, distributed systems, optimization
Tasks
- Design and develop Vision-Language-Action systems for industrial settings
- Explore scalable architectures for multi-modal reasoning and action generation
- Advance state-of-the-art methods in embodied AI and robotic autonomy
- Lead the design and analysis of large-scale multi-modal datasets
- Develop self-supervised or weakly supervised dataset generation pipelines
- Investigate data-centric approaches to improve robustness and generalization
- Build, adapt, and extend cutting-edge GenAI models
- Apply advanced fine-tuning strategies for parameter-efficient tuning
- Explore prompt optimization, reasoning augmentation, and action grounding
- Design rigorous evaluation protocols for embodied AI systems
- Run large-scale experiments, analyze performance, and iterate systematically
- Benchmark models against state-of-the-art approaches and internal baselines
- Collaborate with engineering teams to transition research prototypes into production
- Optimize models for real-time inference, robustness, and safety
- Document findings and contribute to research publications
- Present results internally and at leading conferences
- Build physical AI for off-highway machinery
- Combine cutting-edge robotics research with real-world heavy mobile equipment
- Tailor your career path as a technical specialist or team lead
- Influence the future of robotic technologies and tackle significant challenges
Education
- Currently in higher education
Languages
- English – Business Fluent
Tools & Technologies
- Python
- PyTorch
- Hugging Face ecosystem
- Gemini
- Unsloth
Benefits
Competitive Pay
- Attractive compensation package
- Stock options
Snacks & Drinks
- Beverages on-site
Team Events
- Regular social events
Other Benefits
- Relocation assistance
Like this job?
BetaYour Career Agent finds similar jobs for you every day.
About the Company
sensmore
Industry
Manufacturing
Description
The company automates the world's largest machines with unprecedented intelligence, integrating robotics into a platform for productivity and safety.
Not a perfect match?
- sensmore
Robotics Engineer - Vision Language Action Model(m/w/x)
Full-timeOn-siteNot specifiedBerlin, Potsdam - Helsing
AI Research Engineer - Foundation Models(m/w/x)
Full-timeOn-siteExperiencedBerlin, München - Helsing
AI Research Engineer - 3D Computer Vision(m/w/x)
Full-timeOn-siteSeniorMünchen, Berlin - Fraunhofer-Gesellschaft
Mandatory Internship: Computer Vision and AI in Robotics(m/w/x)
Full-timeInternshipOn-siteBerlin - Fraunhofer-Gesellschaft
Mandatory internship: AI-based industrial image processing(m/w/x)
Full-timeInternshipOn-siteBerlin