Site Reliability Engineer

About the Role

You will ensure the reliability, availability, and performance of production systems. You will design, write, and deliver tools and software primarily using Python, Bash, Terraform, or Nix to improve availability, scalability, and efficiency. You will operate and monitor services throughout their lifecycle, participate in on-call rotations, practice sustainable incident response and blameless postmortems, and develop and uphold SLOs, SLIs, and error budgets. You will analyse system performance and reliability, recommend enhancements, collaborate with development teams on scalable solutions, and maintain automations and infrastructure code.

Requirements

  • Proficiency in Python
  • Proficiency in Bash
  • Experience with Terraform
  • Experience with Nix
  • Extensive experience with AWS, including EKS and RDS
  • Familiarity with Kubernetes
  • Hands-on experience with PostgreSQL on RDS
  • Experience with monitoring tools such as Prometheus, Grafana, and Loki
  • Experience with CI/CD, including GitHub Actions, Hydra, and Earthly
  • Troubleshooting and performance tuning skills
  • Strong communication and collaboration skills
  • Ability to quickly learn new technologies and adapt
  • High attention to detail

Responsibilities

  • Design, write, and deliver tools and software using Python, Bash, Terraform, and Nix
  • Operate and monitor production services throughout their lifecycle
  • Create and maintain automations and infrastructure as code
  • Practice sustainable incident response and conduct blameless postmortems
  • Develop and uphold SLOs, SLIs, and error budgets
  • Analyse system performance and recommend improvements
  • Participate in on-call rotations and mitigate service interruptions
  • Collaborate with development teams to ensure scalable, performant solutions

Benefits

  • Remote work
  • Laptop reimbursement
  • New starter package for hardware essentials
  • Learning and development opportunities
  • Competitive PTO

Site Reliability Engineer at Input Output | JobStash