Head of Platform Engineering
About the Role
You will serve as the technical authority for infrastructure and developer tools, leading the design, development, and optimisation of a scalable, reliable, and secure platform. You will identify and remediate infrastructure gaps, deliver critical projects, and build a high-performing DevOps/Platform team. You will mentor and onboard engineers, manage direct reports, and contribute as an individual contributor when needed. You will implement comprehensive monitoring (Prometheus, Datadog), enhance CI/CD pipelines, integrate DevSecOps and security practices (zero-trust models, vulnerability scanning), and ensure compliance with standards like SOC 2 and GDPR. You will design, deploy, and manage highly available, low-latency blockchain infrastructure, optimise costs, automate repetitive tasks, and provide self-service tools to improve developer productivity and reliability.
Requirements
- Hands-on experience in platform engineering
- Experience with cloud infrastructure (AWS, GCP)
- Experience with Infrastructure as Code (IaC) tools
- Experience with containerisation
- Experience with monitoring systems and observability
- Ability to contribute as an individual contributor while leading a team
- Proven track record delivering complex projects on time
- Ability to develop and execute long-term infrastructure strategies
- Experience mentoring and building high-performing teams
- Strong problem-solving and cross-functional collaboration skills
- Clear and effective communication with technical and non-technical stakeholders
Responsibilities
- Lead platform reliability and SRE efforts
- Maintain uptime for critical systems
- Implement and operate monitoring with Prometheus and Datadog
- Implement security practices including zero-trust and vulnerability scanning
- Integrate DevSecOps and ensure compliance with SOC 2 and GDPR
- Enhance CI/CD pipelines to increase deployment frequency
- Define and execute infrastructure strategy
- Build, scale, mentor, and onboard the Platform/DevOps team
- Manage direct reports
- Implement cost monitoring and optimise resource allocation
- Automate repetitive tasks and provide self-service developer tools
- Design, deploy, and manage highly available low-latency blockchain infrastructure
- Reduce mean time to detect and mean time to recover from incidents
