## Senior AI Researcher (m/f/d) @ audEERING GmbH
###### Permanent employee, Full-time · audEERING GmbH
---
##### About the role
Agile Robots SE acquired a majority stake in audEERING GmbH in 2025. This vacancy is to be filled at audEERING GmbH, in either Munich or Berlin. Are you an expert in deep learning, LLMs, and multi-modal AI, with a passion for driving groundbreaking advancements in artificial intelligence?
Join a pioneering team shaping the future of intelligence!
As a Senior AI Researcher (m/f/d), you’ll lead the development of LLMs, push the boundaries of multi-modal AI, and translate state-of-the-art research into production-ready innovations. This role blends advanced NLP research with hands-on engineering — from data curation and model training to distributed systems and performance optimization.
##### Your Responsibilities
* **Pre-training & Specializing LLMs** – Be responsible for developing, optimizing, and adapting large-scale language models for a variety of applications, ensuring efficiency, adaptability, and high performance. Contribute to model refinement and specialization, leveraging advanced training methodologies to enhance capabilities across domains. Work on evaluation and benchmarking frameworks to assess and improve model effectiveness, enabling robust and scalable AI solutions.
* **Applied Research & Hands-On Development** – Combine applied NLP research with hands-on engineering, spanning model training, fine-tuning, prompting, and deployment. Set up and manage computing clusters, distributed training environments, and compute optimization for large-scale experiments.
* **Data Curation & Model Preparation** – Handle data curation, dataset selection, and checkpoint preparation to facilitate high-quality model training and evaluation.
* **State-of-the-Art Research & Implementation** – Stay up to date with cutting-edge AI research, identify and understand state-of-the-art techniques, and translate them into technical reports, papers, and production-ready implementations.
* **Multi-Modal AI Development** – Contribute to multi-modal AI by integrating various data modalities and developing novel large-scale multi-modal models.
##### Essential Skills
* PhD (or equivalent substantial work experience in an industrial R&D department) in Computer Science, Electrical Engineering, Artificial Intelligence, or a related field
* First-author publications at top-tier conferences (e.g., NeurIPS, ICML, ICLR, ACL, EMNLP, Interspeech, and ICASSP) or in machine learning journals, or authorship of patents
* Profound understanding of the field of AI, including fundamental concepts of Machine Learning, training algorithms, evaluation methods and statistics
* Good knowledge of recent Deep Learning architectures
* Experience with Natural Language Processing/Large Language Models, such as model training, fine-tuning, and evaluation
* High proficiency in Python and PyTorch and/or similar frameworks
* Fundamental knowledge of Linux and command-line tools
* Excellent team player, problem solver and goal achiever
* Ability to work in a fast-evolving field and a dynamic environment, and to deliver on time
##### Beneficial Skills
* Experience with reinforcement learning
* Experience in distributed model training, implementation of training parallelization, and performance optimization
* Good knowledge of C/C++ and CUDA
* Experience with major cloud computing platforms (e.g., AWS, Azure, Google Cloud, or Oracle)
##### What we offer
* A dynamic, highly qualified and diverse team in which your contributions are reflected directly in our products and used by our international customer base
* Flat hierarchies and short decision-making processes
* Exciting and varied tasks across our product portfolio
* Excellent working environment, modern office space and flexible working hours with the option of mobile working
* Close connection to academic research (EU and national projects) and a highly innovative company
* Team and company events
* Free drinks, coffee, tea, fruit, and snacks