New Job?Nejo!

The AI Job Search Engine

TE
Tether Operations Limited
16d ago

AI Research Engineer - Reinforcement Learning(m/w/x)

Lugano
Full-timeRemoteExperienced
AI/ML
Data Science

Description

In this role, you will drive innovation in reinforcement learning by developing cutting-edge algorithms and optimizing decision-making processes. Your day-to-day responsibilities will involve running experiments, curating training datasets, and collaborating with teams to enhance AI performance in real-world applications.

Let AI find the perfect jobs for you!

Upload your CV and Nejo AI will find matching job offers for you.

Requirements

  • Degree in Computer Science or related field
  • PhD in NLP, Machine Learning, or related field
  • Solid track record in AI R&D with good publications
  • Proven experience with large-scale reinforcement learning experiments
  • Experience with online RL techniques such as GRPO
  • Deep understanding of reinforcement learning algorithms
  • Expertise in enhancing policy stability, exploration, and sample efficiency
  • Strong expertise in PyTorch and relevant RL frameworks
  • Practical experience in developing RL pipelines
  • Demonstrated ability to apply empirical research to RL challenges
  • Proficiency in designing robust evaluation frameworks

Education

Bachelor's degree
OR
Doctoral / PhD

Work Experience

approx. 1 - 4 years

Tasks

  • Develop and implement advanced reinforcement learning algorithms
  • Establish performance targets for reward maximization and policy stability
  • Build, run, and monitor controlled reinforcement learning experiments
  • Track key performance indicators and document results
  • Identify and curate high-quality simulation environments and training datasets
  • Set measurable criteria to enhance learning processes
  • Debug and optimize the reinforcement learning pipeline
  • Analyze computational efficiency and learning performance metrics
  • Address issues like reward signal noise and policy divergence
  • Collaborate with cross-functional teams to integrate reinforcement learning agents
  • Define success metrics for real-world performance improvements
  • Ensure continuous monitoring and iterative refinements

Tools & Technologies

PyTorch

Languages

EnglishBusiness Fluent

Find the original job posting in its most current version here. Nejo automatically captured this job from the website of Tether Operations Limited and processed the information on Nejo with the help of AI for you. Despite careful analysis, some information may be incomplete or inaccurate. Please always verify all details in the original posting! Content and copyrights of the original posting belong to the advertising company.
Not a perfect match?
32 Similar Jobs in Lugano
  • Tether Operations Limited

    AI Research Engineer - Pre training(m/w/x)

    Full-timeRemoteSenior
    Lugano
  • Tether Operations Limited

    Lead AI Inference Engineer(m/w/x)

    Full-timeRemoteSenior
    Lugano
  • Jobtome

    Senior Site Reliability Engineer(m/w/x)

    Full-timeRemoteSenior
    Mendrisio
  • Jobtome

    Senior Backend Developer(m/w/x)

    Full-timeRemoteSenior
    Mendrisio
  • Tether Operations Limited

    Software Architect(m/w/x)

    Full-timeRemoteSenior
    Lugano
32+ View all similar jobs