Contribute to building AI-driven operations as a Senior Site Reliability Engineer. Ensure production stability while working remotely and automating processes in our AWS/Kubernetes workplace.
This remote role is perfect for individuals with 5+ years as a Senior SRE or Production Engineer. You will demonstrate your expertise in managing complex SaaS environments while leveraging skills in AWS and Kubernetes. Your role involves hands-on coding using Python or Go, driving automation, and enhancing monitoring systems with AI-related knowledge.
Key Responsibilities:
• Operate production-level Kubernetes infrastructure on AWS
• Create AI agents for proactive incident handling
• Develop CI/CD pipelines with GitOps methodology
• Build tools for internal developer services
• Maintain observability and excellence using Datadog
Requirements:
• 5+ years as a Senior SRE or Production Engineer
• Strong experience in high-traffic SaaS settings
• Proficient in AWS and Kubernetes technologies
• Advanced coding skills in Python or Go
• Understanding of AI/LLM principles
Elevate the reliability of operations while embracing innovation in a fully remote capacity.
#J-18808-Ljbffr
Apply on Kit Job: kitjob.ca/job/2fsbc8
📌 Remote Senior Site Reliability Engineer Specializing in Cloud Solutions (Toronto)
🏢 Hibob
📍 Toronto
Reply to this offer
Impress this employer describing Your skills and abilities, fill out the form below and leave Your personal touch in the presentation letter.