- Company Name
- Maze
- Job Title
- Principal Platform Engineer | Fintech | London | Up to £180k + Equity
- Job Description
**Job title:** Principal Platform Engineer
**Role Summary:** Lead the design, build, and operation of a globally distributed, AI‑native ledger infrastructure for a next‑generation banking platform. Own end‑to‑end platform architecture, ensuring five‑nines availability, real‑time observability, and zero‑trust security across multi‑region, hybrid cloud/on‑prem environments.
**Expectations:**
- Deliver production‑ready infrastructure from concept to launch at scale.
- Maintain high‑availability, low‑latency services for core banking workloads.
- Drive innovation in SRE practices, AI‑powered operations, and open‑source contributions.
- Be deeply hands‑on, writing code, configurations, and automation scripts rather than managing people or strategy alone.
**Key Responsibilities:**
1. Own and evolve the overall platform architecture for the Thin Ledger infrastructure.
2. Design, deploy, and scale multi‑region Kubernetes clusters across public clouds and on‑prem data centres.
3. Harden distributed data services (Kafka, Redis, CockroachDB) to meet global banking security and compliance requirements.
4. Lead AI‑enabled Site Reliability Engineering, building observability, automated remediation, and self‑healing capabilities.
5. Implement zero‑trust, multi‑tenant security controls; maintain compliance with SOC 2, ISO 27001, and other regulatory standards.
6. Define and enforce infrastructure‑as‑code practices (Terraform, GitOps, Helm) to enable repeatable, auditable deployments.
7. Collaborate with product, security, and compliance teams to translate banking requirements into technical specifications.
**Required Skills:**
- Expert Kubernetes administration and architecture (including cluster‑role design, network policies, and admission controllers).
- Deep experience building production‑scale distributed systems (Kafka, Redis, CockroachDB, etc.).
- Proven track record designing and operating multi‑region, highly available infrastructure (on‑prem and cloud).
- Strong SRE fundamentals: defining SLOs/SLIs, incident management, post‑mortems, and continuous improvement.
- Familiarity with AI‑native operations or enthusiasm to adopt agent‑powered automation.
- Proficiency with IaC tools (Terraform, GitOps workflows, Helm charts).
- Solid scripting/automation background; proficiency in Go, Python, Bash, or similar is highly desirable.
- Passion for open‑source principles and clean, modular architecture.
**Required Education & Certifications:**
- Bachelor’s or Master’s degree in Computer Science, Engineering, or related technical field (or equivalent industry experience).
- Beneficial certifications: AWS Certified DevOps Engineer, GCP Professional Cloud DevOps Engineer, Azure DevOps Engineer Expert, or Kubernetes certifications (CKA/CKAD).
---