Site Reliability Engineer (Node.js, TypeScript, IaC & Cloud Platforms) (Canada)

17 Apr | Brainhunter Systems | Canada

Hiring Senior Site Reliability Engineer (own reliability, performance, and infrastructure across cloud, off-prem, and on-prem environments; design resilient systems, automate deployments, and ensure performance at scale). Preference will be given to candidates whose resumes clearly meet the experience and requirements outlined below. Professionals with suitable experience may send their updated resume to

Requirement Summary

- Job Role/Title: Senior Site Reliability Engineer (SRE)
- Job Type: Permanent role, Full-Time Employment.
- Job Location: Remote Work.
- Work Style: Hybrid Work Setting - 3 days in the office required
- Work Hours: 40 hours/week, typically Monday-Friday 9:00 am to 5:00 pm
- Contractor to work extra time or on-call: Yes, overtime support may be required at times.
- Rate Cap: The compensation will be negotiated between the contractor and recruiter based on experience and engagement terms.
- Interview: 2 interviews, conducted over Teams

___________________________________________________________

Position Overview: Seeking a Senior SRE Consultant who can own reliability, performance, and infrastructure across cloud, off-prem, and on-prem environments; design resilient systems; automate deployments; and ensure performance at scale. Must be hands-on with off-prem and on-prem infrastructure. We’re building next-gen sweepstakes gaming experiences that are fast, reliable, and highly scalable. As a Senior SRE, you’ll own the infrastructure that powers everything, primarily across on-prem and hybrid environments, ensuring our systems are resilient, performant, and built to scale.

Key Accountabilities:

- Design, build, and operate on-prem and hybrid infrastructure, with potential integration into cloud environments over time
- Architect and maintain highly available, resilient systems for real-time, high-traffic gaming workloads
- Automate deployments, infrastructure provisioning, and operational workflows (CI/CD, IaC where applicable)
- Monitor system performance, uptime, and reliability, proactively identifying and resolving issues
- Implement observability best practices (logging, metrics, tracing, alerting)
- Improve system resilience through redundancy, failover strategies, and disaster recovery planning
- Partner closely with backend and platform teams to optimize system performance and reliability
- Own incident response, postmortems, and continuous improvement of system stability

Qualifications and Skillset for this Role:

- Strong experience in Site Reliability Engineering, DevOps, or infrastructure engineering, with a focus on on-prem or hybrid environments
- Deep understanding of physical infrastructure, networking, and distributed systems
- Experience managing servers, virtualization, and data center environments
- Hands-on experience with automation, scripting, and deployment workflows
- Robust troubleshooting skills across systems, networking, and performance bottlenecks
- Experience with monitoring and observability tools (Prometheus, Grafana, Datadog, or similar)
- Solid understanding of security, redundancy, and system design for uptime and resilience
- Comfortable working autonomously in a fast-paced startup environment
- Exposure to cloud platforms (AWS, GCP, Azure) and hybrid infrastructure models
- Experience with containers and orchestration (Docker, Kubernetes)
- Familiarity with backend systems (Node.js / TypeScript environments)
- Experience supporting real-time or high-concurrency systems (gaming, fintech, etc.)

Why This Role

- Own and shape the core infrastructure of a rapidly scaling platform
- High ownership: define how systems are built, deployed, and operated
- Work on real infrastructure challenges beyond just cloud abstractions
- Startup speed: ship fast and see impact immediately
- Fully remote, flexible environment

___________________________________________________________

How to Apply: Please email your up-to-date Resume/CV to

We appreciate all the applicants for their interest in working with us; however, only those candidates shortlisted for the next steps in the hiring process will be contacted.

Thank you, and have a wonderful day!
