- Company Name
- Lhasa Limited
- Job Title
- Lead Platform Engineer
- Job Description
**Job title**: Lead Platform Engineer
**Role Summary**:
Provide strategic and hands‑on leadership for a cloud‑native platform engineering team, shaping architecture, operations, and developer experience for scalable SaaS delivery.
**Expectations**:
- Lead platform engineering initiatives aligned with organisational goals.
- Balance strategic platform planning with hands‑on execution to ensure reliable, cost‑optimised cloud services.
- Foster continuous improvement, innovation, and technical excellence within the team and across tech groups.
**Key Responsibilities**:
1. Strategic & Technical Leadership
- Set vision, strategy, and roadmap for platform engineering.
- Mentor engineers in Kubernetes, Docker, Helm, GitOps, IaC, and observability.
- Design and implement GitOps workflows (ArgoCD, Helm) and self‑service portals (Backstage).
- Own production SaaS reliability; provide third‑line support and incident response.
2. Cloud & Platform Strategy
- Manage multi‑cluster Kubernetes environments and optimise cloud costs across AWS and internal infrastructure.
- Establish standards for cloud, containerisation, orchestration, security, CI/CD, and observability.
- Implement DORA & Core 4 metrics for continuous improvement.
3. Developer Experience & Service Management
- Build personalised dashboards and software templates in Backstage.
- Design robust CI/CD pipelines (Jenkins) and enforce RBAC, network policies, and disaster‑recovery plans.
4. Observability & Automation
- Maintain the Prometheus, Grafana, and Loki stack; define SLOs, alerting, and runbooks.
- Reduce toil through automation and optimise deployment pipelines.
5. Collaboration & Compliance
- Partner with security teams for compliance in high‑value data environments.
- Drive platform adoption through change management and a community of practice.
**Required Skills**:
- ≥5 years designing and owning production SaaS platforms.
- Expert‑level Kubernetes orchestration, multi‑cluster management, and resource optimisation.
- Cross‑cloud infrastructure architecture (AWS, internal) with cost accountability.
- GitOps proficiency (ArgoCD, Helm).
- IaC tools: CDK, Terraform, Ansible.
- Observability platforms: Prometheus, Grafana, Loki.
- Security best practices: RBAC, network policies, disaster recovery.
- Strong mentorship, stakeholder communication, and change‑management skills.
**Required Education & Certifications**:
- No specific certifications mandated; equivalent experience accepted.
- Relevant academic background (Computer Science, Engineering, or related field) preferred but not explicitly required.