
HPC Network Engineering Lead for GPU Clusters

🏒
Empresa reconocida
πŸ“ Remote, Colombia
Location: Remote
Posted: June 10, 2026

Job Description

Responsibilities

  • Define and own a multi-year architectural vision and roadmap for InfiniBand/RDMA and high-speed Ethernet fabrics supporting massive GPU clusters and distributed AI/LLM workloads across the client portfolio
  • Govern evaluation and standardization of cluster network topologies such as Fat-tree, Clos, Rail-optimized, and Dragonfly, and set decision frameworks aligned to scale, performance, and cost constraints
  • Establish and enforce engineering standards for host-side networking, including NIC configuration, drivers, firmware, IRQ affinity, NUMA placement, PCIe topology, and GPU-to-NIC communication paths
  • Drive strategic performance engineering across RDMA/RoCE, NCCL/MSCCL, and collective communication for multi-node GPU training, and oversee resolution of the hardest systemic performance issues
  • Define the reference architecture for Kubernetes networking on GPU clusters, including CNI plugins, network policies, multi-NI...
