New Job?Nejo!

The AI Job Search Engine

Tether Operations Limited

last mo.

AI Research Engineer - Reinforcement Learning(m/w/x)

Lugano

Full-timeRemoteExperienced

AI/ML

Data Science

Nejo AI Summary

Apply now

Description

In this role, you will drive innovation in reinforcement learning by developing cutting-edge algorithms and optimizing decision-making processes. Your day-to-day responsibilities will involve running experiments, curating training datasets, and collaborating with teams to enhance AI performance in real-world applications.

Let AI find the perfect jobs for you!

Upload your CV and Nejo AI will find matching job offers for you.

Start AI Job Search

Requirements

•Degree in Computer Science or related field
•PhD in NLP, Machine Learning, or related field
•Solid track record in AI R&D with good publications
•Proven experience with large-scale reinforcement learning experiments
•Experience with online RL techniques such as GRPO
•Deep understanding of reinforcement learning algorithms
•Expertise in enhancing policy stability, exploration, and sample efficiency
•Strong expertise in PyTorch and relevant RL frameworks
•Practical experience in developing RL pipelines
•Demonstrated ability to apply empirical research to RL challenges
•Proficiency in designing robust evaluation frameworks

Education

Bachelor's degree

Work Experience

approx. 1 - 4 years

Tasks

•Develop and implement advanced reinforcement learning algorithms
•Establish performance targets for reward maximization and policy stability
•Build, run, and monitor controlled reinforcement learning experiments
•Track key performance indicators and document results
•Identify and curate high-quality simulation environments and training datasets
•Set measurable criteria to enhance learning processes
•Debug and optimize the reinforcement learning pipeline
•Analyze computational efficiency and learning performance metrics
•Address issues like reward signal noise and policy divergence
•Collaborate with cross-functional teams to integrate reinforcement learning agents
•Define success metrics for real-world performance improvements
•Ensure continuous monitoring and iterative refinements

Tools & Technologies

PyTorch

Languages

English – Business Fluent

Find the original job posting in its most current version here. Nejo automatically captured this job from the website of Tether Operations Limited and processed the information on Nejo with the help of AI for you. Despite careful analysis, some information may be incomplete or inaccurate. Please always verify all details in the original posting! Content and copyrights of the original posting belong to the advertising company.

Not a perfect match?

66 Similar Jobs in Lugano

Tether Operations Limited
AI Research Engineer - Pre training(m/w/x)
Full-timeRemoteSenior
Lugano
Tether
AI Video Research Engineer Intern(m/w/x)
Full-timeInternshipRemote
Lugano
Tether Operations Limited
Senior AI Inference Engineer(m/w/x)
Full-timeRemoteSenior
Lugano
Tether
Lead AI Inference Engineer(m/w/x)
Full-timeRemoteSenior
Lugano
Jobtome
Senior Site Reliability Engineer(m/w/x)
Full-timeRemoteSenior
Mendrisio

66+ View all similar jobs

Tether Operations Limited

last mo.