- Company Name
- Owen Thomas
- Job Title
- Engineering Manager, MLOps, Marketplace, Ecommerce, | 35 Million Users | UK Remote OR London, Hybrid, 1 Day PW, Up to £140,000
- Job Description
-
Job Title: Engineering Manager – MLOps, Marketplace, Ecommerce
Role Summary
Lead and scale a 6‑8 engineer MLOps team, define and execute the ML infrastructure roadmap, and ensure production‑grade reliability, scalability, and cost efficiency for a global marketplace. Drive cross‑functional collaboration with data science, product, and engineering to support rapid innovation.
Expectations
- Own end‑to‑end MLOps delivery, from architecture to incident resolution.
- Set strategy aligned with enterprise data and engineering goals.
- Mentor and grow a high‑performing engineering team.
- Act as the escalation point for critical ML system incidents.
Key Responsibilities
- Manage, coach, and develop a dedicated MLOps engineering team.
- Define and deliver the MLOps roadmap, aligning with broader engineering/data strategy.
- Provide architectural guidance on ML pipelines, deployment, monitoring, and incident response.
- Partner with data science, ML, and product to align infrastructure with business needs.
- Oversee reliability, cost optimization, and vendor relationships to keep infrastructure scalable.
- Own resolution of critical ML/infra incidents and drive continuous improvement.
- Communicate progress, risk, and priorities to leadership in a clear, actionable manner.
Required Skills
- Proven leadership experience managing MLOPs/ML Engineering or Platform Engineering teams.
- Deep knowledge of cloud platforms (AWS, GCP, Azure) for large‑scale ML infra.
- Hands‑on experience with GPU training/serving, Docker, Kubernetes, Kubeflow, and CI/CD (Jenkins, GitHub Actions, GitLab CI).
- Familiarity with distributed frameworks (Spark, Ray, TensorFlow Distributed, PyTorch Distributed).
- Strong expertise in monitoring, logging, and observability for ML systems.
- Demonstrated cost optimization for compute/GPU workloads.
- Excellent people, influence, and stakeholder communication skills.
- Experience with vendor management and contract oversight.
- Knowledge of Databricks, Tecton (or Feast), Seldon, SageMaker advantageous.
Required Education & Certifications
- Bachelor’s or Master’s degree in Computer Science, Engineering, or a related technical field.
- Relevant cloud certifications (AWS Certified Solutions Architect, GCP Professional Data Engineer, Azure AI Engineer) preferred.