π Local Job Near You
Reinforcement Learning Engineer
Appit LLC
π
montreal (administrative region), Canada
Location
montreal (administrative region)
Posted
May 24, 2026
Commute
Local Area
Local Opportunity Near You!
This job is in your area. Enjoy a short commute and work close to home.
Job Description
APPIT Software Solutions is hiring a Reinforcement Learning Engineer in Montreal, Canada . Design reinforcement learning systems at APPIT Software in Montreal, building adaptive AI agents for optimization, autonomous decision-making, and RLHF alignment of large language models.
Responsibilities
- Design and implement reinforcement learning algorithms for enterprise optimization problems
- Build RLHF and reward modeling pipelines for LLM alignment and fine-tuning
- Develop simulation environments for training and evaluating RL agents
- Implement multi-agent reinforcement learning systems for complex coordination tasks
- Optimize RL training stability and sample efficiency using state-of-the-art techniques
- Collaborate with research teams to translate RL advances into production applications
Requirements
- 5+ years of ML experience with 2+ years focused on reinforceme...