19 Apr
|
Cerebras
|
Toronto
Apply on Kit Job: kitjob.ca/job/2g912e
Elevate AI hardware performance as a Senior Software Engineer specializing in observability. Deliver scalable monitoring solutions while ensuring seamless integration with backend APIs for the Inference platform.
This position is crucial for a full-stack engineer focused on observability platforms, connecting software metrics with hardware signals. You will be responsible for designing high-performance telemetry systems, ensuring reliability, and collaborating effectively across multiple teams to enhance understanding of system health. Your innovations will make observability a streamlined aspect of service deployment across AI hardware.
Key Responsibilities:
• Own the architecture and roadmap for observability • Create high-cardinality telemetry pipelines • Collaborate with teams to define reliability metrics • Unify hardware and software observability signals • Foster improvement in engineering practices through mentorship
Requirements: • 8+ years of engineering experience, 4+ in observability • Deep experience with observability ecosystems • Solid command of Go and Python • Expertise in Kubernetes and distributed architectures • Experience in setting technical strategies
Transform the observability landscape for AI applications and elevate engineering standards through impactful mentoring. #J-18808-Ljbffr
Apply on Kit Job: kitjob.ca/job/2g912e
📌 Senior Software Engineer - Observability (Toronto)
🏢 Cerebras
📍 Toronto