🌴 JobsLeisure.com

Where Work Meets Adventure

← Back to Leisure Jobs

Senior SRE: AI/ML HPC Infra & GPU Cluster

Hospitality Full Benefits Career Growth
Company

Boson AI

Location

toronto, Canada

Posted

May 31, 2026

Start Your Adventure

Join our team and work where others vacation

Apply Now

About This Opportunity

A technology company in Toronto seeks a Senior Site Reliability Engineer to manage and optimize its HPC infrastructure. In this role, you'll ensure smooth operations of a powerful GPU cluster, deploy infrastructure-as-code solutions, and support ML teams. Candidates should have extensive SRE experience, proficiency in Linux, and familiarity with Kubernetes and Ceph storage. This position offers the chance to work with cutting-edge technology in a collaborative environment, perfect for problem-solvers who love learning.
#J-18808-Ljbffr