
Lead Inference Platform Support Engineer - AI I

Thomson Reuters
Location: Toronto, Canada
Posted: June 01, 2026

Job Description

About The Role

This Lead Inference Platform Engineer role calls for specialized experience in machine learning and deep learning domains such as model compression, hardware-aware model optimization, hardware accelerator architecture, GPU/ASIC architecture, ML compilers, high-performance computing, performance optimization, numerics, or SW/HW co-design.

Responsibilities

  • Optimize LLMs and ML models for high-performance inference using quantization, pruning, distillation, and hardware-specific tuning.
  • Deploy and scale inference workloads on GPUs across AWS, Azure, GCP and internal Kubernetes clusters, ensuring predictable performance during peak traffic.
  • Implement routing and fail-over strategies for traffic to OpenAI, Anthropic, and Vertex AI.
  • Integrate models into production-grade APIs supporting Thomson Reuters (TR) products and enterprise workflows.
  • Develop highly optimized environments and eliminate performance bottlenecks to reduce latency.

