π Local Job Near You
Staff SRE for AI Workloads and Observability
Sitetracker
π
winnipeg, Canada
Location
winnipeg
Posted
June 04, 2026
Commute
Local Area
Local Opportunity Near You!
This job is in your area. Enjoy a short commute and work close to home.
Job Description
Take on a critical role as a Staff Site Reliability Engineer focused on optimizing AI workloads. This position empowers you to define engineering standards and improve our reliability practices.
You will drive the organization toward a proactive reliability approach as you partner with existing engineers. Your efforts will not only enhance incident response strategies but also establish effective tools that improve system observability and response metrics, allowing for sustainable improvements.
Key Responsibilities: β’ Set SLIs and SLOs based on critical user interactions β’ Direct production incident responses and lead effective postmortems β’ Develop meaningful dashboards that enhance system understanding β’ Mentor the engineering team for technical skill enhancement β’ Lead architectural improvements for reliability tools
Requirements: β’ Profound experience with Site Reliability Engineering and AWS β’ Background in handling production incidents effectively β’ St...
You will drive the organization toward a proactive reliability approach as you partner with existing engineers. Your efforts will not only enhance incident response strategies but also establish effective tools that improve system observability and response metrics, allowing for sustainable improvements.
Key Responsibilities: β’ Set SLIs and SLOs based on critical user interactions β’ Direct production incident responses and lead effective postmortems β’ Develop meaningful dashboards that enhance system understanding β’ Mentor the engineering team for technical skill enhancement β’ Lead architectural improvements for reliability tools
Requirements: β’ Profound experience with Site Reliability Engineering and AWS β’ Background in handling production incidents effectively β’ St...