Browse the latest jobs in tech

Senior Reinforcement Learning Engineer, Helix

Published: 2024-10-24

Figure is a financial technology company with the mission of leveraging blockchain, AI and advanced analytics to unlock new access points for consumer credit products that can transform the financial lives of our customers. We provide home equity release solutions, including home equity lines of credit, home improvement loans and home buy-lease back offering for retirement. Concurrently, we are building ...

Apply

Job details

San Jose, United States (city)

$150k - $400k

On-site

Full-time

Categories

ML Engineer

PyTorch

Reinforcement Learning

Robotics

Figure is an AI Robotics company developing a general purpose humanoid. Our Humanoid is designed for corporate tasks targeting labor shortages and jobs that are undesirable or unsafe. We are based in San Jose, CA and require 5 days/week in-office collaboration. It’s time to build.

We are looking for a Senior or Staff level Reinforcement Learning Engineer. You will own the development, training, and deployment of new reinforcement learning algorithms for our humanoid robot as well as building infrastructure to support training policies at a large scale.

Responsibilities:

Develop, train, and deploy reinforcement learning algorithms for locomotion and manipulation tasks
Build simulation infrastructure to support the training of locomotion and manipulation policies for a general purpose humanoid robot at a large scale
Collaborate with the controls team to integrate policies into the existing control stack
Define, test, and evaluate performance metrics for learned policies

Requirements:

Confident writing production quality code in PyTorch
Familiar with online and offline reinforcement learning algorithms: PPO, SAC, etc.
Experience tuning hyperparameters and cost functions for these RL algorithms
Familiarity with common RL techniques such as: domain randomization, curriculum learning, reward shaping, etc.
Familiarity with general ML evaluation tools such as TensorBoard, Weights&Biases, etc.
Strong mix of industry and research experience, ideally 5-7+ years of experience

Bonus Qualifications:

Experience transferring policies learned in simulation to robot hardware
Experience training locomotion policies for quadrupedal or bipedal robots

The US base salary range for this full-time position is between $150,000 - $400,000 annually.

The pay offered for this position may vary based on several individual factors, including job-related knowledge, skills, and experience. The total compensation package may also include additional components/benefits depending on the specific role. This information will be shared if an employment offer is extended.

Apply

Figure