Shape cutting-edge AI initiatives as a dedicated Senior Software Development Engineer. This role focuses on model execution, optimizing training infrastructure, and enhancing inference serving on advanced GPU architectures.
In this position, you will be responsible for managing the entire model execution stack on high-performance GPU systems. Key responsibilities include refining large-scale training processes for LLMs and addressing computational challenges with cutting-edge solutions. Your expertise will directly influence the performance and efficiency of frontier models, making a significant impact in the AI domain.
Key Responsibilities:
• Optimize large-scale model training on GPU clusters
• Develop and maintain job orchestration and storage solutions
• Resolve training-specific issues across GPU architectures
• Write high-performance GPU kernels in relevant frameworks
• Collaborate on next-gen GPU integration and design
Requirements:
• Hands-on AI/ML infrastructure experience
• Familiarity with frontier models and AMD hardware
• Background in validation frameworks and proxies
• Experience with large-scale distributed GPU systems
• Advanced degree in related technical fields
Leverage your talents in GPU optimization and AI infrastructure to drive breakthrough solutions in advanced computing applications.
#J-18808-Ljbffr
Apply on Kit Job: kitjob.ca/job/2fs7q4
📌 Senior AI Systems Development Engineer (Markham)
🏢 Advanced Micro Devices
📍 Markham
Reply to this offer
Impress this employer describing Your skills and abilities, fill out the form below and leave Your personal touch in the presentation letter.