Researcher - Reinforcement Learning

Company

Huawei Technologies Canada Co., Ltd.

Location

Edmonton, Canada

Posted

March 23, 2026

About This Opportunity

Job description

Huawei Canada has an immediate 12-month contract opening for a Reinforcement Learning Researcher.


About the team:

Founded in 2012, the Noah’s Ark Lab has evolved into a prominent research organization with notable achievements in academia and industry. The lab’s mission focuses on advancing artificial intelligence and related fields to benefit the company and society. Driven by impactful, long-term projects, the lab aims to advance the state of the art in research while integrating innovations into the company's products and services, in areas including LLMs, RL, NLP, computer vision, AI theory, and autonomous driving.

About the job:

  • Enabling Large Language Models (LLMs) to learn from experience, interaction, and environment feedback, moving beyond static fine-tuning toward continual, agentic self-improvement.

  • LLM post-training paradigms (e.g., RLHF, GRPO, and reward-free methods).

  • <...