📍 Local Job Near You

Lead Inference Platform Support Engineer - AI I

🏢

Thomson Reuters

📍 toronto, Canada

📍

Location toronto

📅

Posted June 01, 2026

🚗

Commute Local Area

🎯

Local Opportunity Near You!

This job is in your area. Enjoy a short commute and work close to home.

📋
Job Description

About The Role Lead Inference Platform Engineer – specialized experience in machine learning/deep learning domains such as model compression, hardware‑aware model optimizations, hardware accelerators architecture, GPU/ASIC architecture, ML compilers, high‑performance computing, performance optimizations, numerics, or SW/HW co‑design. 
Responsibilities Optimize LLMs and ML models for high‑performance inference using quantization, pruning, distillation, and hardware specific tuning. 
Deploy and scale inference workloads on GPUs across AWS, Azure, GCP and internal Kubernetes clusters, ensuring predictable performance during peak traffic. 
Implement routing and fail‑over strategies for OpenAI / Anthropic / Vertex AI traffic. 
Integrate models into production‑grade APIs supporting TR products and enterprise workflows. 
Develop highly optimized environments and eliminate performance bottlenecks to reduce latency. 
            

Apply for This Job

Submit Application

Quick and secure application process

📍 Location Details

🌆

City

toronto

🗺️

Country

Canada

🚗

Commute

Local Area

🔍 More Jobs Nearby

Explore other opportunities in toronto

View Local Jobs

Lead Inference Platform Support Engineer - AI I

📋 Job Description

About The Role

Responsibilities

Apply for This Job

📍 Location Details

🔍 More Jobs Nearby

📋
Job Description