DevOps / Site Reliability Engineer
About the Role
You will design, build, and operate the cloud and platform foundations for a managed relayer and related services. You will develop and maintain CI/CD pipelines, implement Infrastructure as Code using tools like Terraform, Helm, ArgoCD and Ansible, and build monitoring, logging and alerting with Prometheus, Grafana and the ELK stack. You will collaborate with security engineers to integrate DevSecOps practices into deployment pipelines, manage cloud infrastructure on AWS/GCP/Azure, and create automation and tooling to reduce manual overhead and accelerate developer iteration.
Requirements
- Strong hands-on experience with Kubernetes
- Proven experience designing and implementing CI/CD pipelines (e.g., GitHub Actions GitLab CI CircleCI)
- Proficiency with Infrastructure as Code tools such as Terraform Helm ArgoCD Flux or Ansible
- Experience with cloud platforms AWS GCP or Azure and infrastructure optimization
- Understanding of DevSecOps principles and implementing security best practices
- Strong scripting skills such as Python and Bash
- Only candidates on the East Coast of the United States or Canada are eligible
Responsibilities
- Design build and operate cloud and platform foundations
- Develop and maintain CI/CD pipelines for automated deployment and testing
- Manage infrastructure as code using Terraform Helm ArgoCD and Ansible
- Implement monitoring logging and alerting solutions with Prometheus Grafana and ELK
- Integrate DevSecOps practices into deployment pipelines
- Manage and optimize cloud infrastructure on AWS GCP or Azure
- Create and maintain tooling and automation scripts to reduce manual overhead
- Collaborate with developers product and security teams to deliver deployment solutions
