🔔 Get instant job alerts delivered to your inbox! Set up your first alert →
📍 Local Job Near You

AI Systems Reliability Engineer

🏢
Tenstorrent Inc.
📍 toronto, Canada
📍
Location toronto
📅
Posted May 29, 2026
🚗
Commute Local Area
🎯
Local Opportunity Near You!

This job is in your area. Enjoy a short commute and work close to home.

📋
Job Description

Be a part of pioneering AI technology as an AI Systems Reliability Engineer. Ensure operational health and system reliability across varied environments in a hybrid working scenario.
In this role, you'll manage the reliability of systems critical to AI infrastructure. Collaborate effectively with engineering teams and clients to resolve challenges while improving observability and alerting systems. The role requires hands-on troubleshooting in distributed environments, ensuring production readiness across all systems.
Key Responsibilities:
• Ensure operational health of AI systems
• Troubleshoot networking and compute challenges
• Collaborate with teams to manage production incidents
• Enhance monitoring and alerting frameworks
• Develop automations to reduce operational workloads
Requirements:
• Experience in systems or site reliability engineering
• Deep knowledge of Linux systems and troubleshooting
• Familiarity with Prometheus and similar tools
...

Apply for This Job

Submit Application

Quick and secure application process

📍 Location Details

🌆
City
toronto
🗺️
Country
Canada
🚗
Commute
Local Area

🔍 More Jobs Nearby

Explore other opportunities in toronto

View Local Jobs