cover image
Fractal

Lead MLOps Engineer – AI/ML Infrastructure

Remote

New york city, United states

Senior

Full Time

05-10-2025

Share this job:

Skills

Leadership Python Java C# SQL CI/CD Docker Kubernetes Monitoring Architecture Cloud Architecture Machine Learning Programming git Organization Azure AWS C++ Analytics GCP Dataiku business development PySpark

Job Specifications

Lead MLOPS Engineer

Fractal is a strategic AI partner to Fortune 500 companies with a vision to power every human decision in the enterprise. Fractal is building a world where individual choices, freedom, and diversity are the greatest assets; an ecosystem where human imagination is at the heart of every decision. Where no possibility is written off, only challenged to get better. We believe that a true Fractalite is the one who empowers imagination with intelligence. Fractal has been featured as a Great Place to Work by The Economic Times in partnership with the Great Place to Work(r) Institute and recognized as a 'Cool Vendor' and a 'Vendor to Watch' by Gartner.

Please visit Fractal | Intelligence for Imagination for more information about Fractal.

JOB DESCRIPTION:

This leadership role is ideal for a seasoned MLOps expert with 12+ years of experience who can drive strategic initiatives, mentor teams, and architect scalable AI/ML infrastructure. The Lead MLOps Engineer will spearhead the design and implementation of enterprise-grade MLOps systems and play a pivotal role in client engagements and innovation.

Building the machine learning production System (or MLOps) is the biggest challenge most large companies currently have in making the transition to becoming an AI-driven organization. This position is an opportunity for an experienced, server-side developer to build expertise in this exciting new frontier. You will be part of a team deploying state-of-the-art AI solutions for Fractal clients.

RESPONSIBILITIES:

As Lead MLOps Engineer, you will lead cross-functional teams of Data Scientists, Engineers, and Architects to deliver robust and scalable ML solutions. You will define best practices, establish governance frameworks, and ensure operational excellence across the ML lifecycle.

As MLOps Engineer, you will work collaboratively with Data Scientists and Data engineers to deploy and operate advanced analytics machine learning models. You'll help automate and streamline Model development and Model operations. You'll build and maintain tools for deployment, monitoring, and operations. You'll also troubleshoot and resolve issues in development, testing, and production environments.

* Architect and oversee model tracking, experimentation, and automation strategies

* Lead the development of scalable, reusable ML pipelines and frameworks

* Develop MLOps components in Machine learning development life cycle using

Model Repository (either of): MLFlow, Kubeflow Model Registry

Machine Learning Services (either of): Kubeflow, DataRobot, HopsWorks, Dataiku or any relevant ML E2E PaaS/SaaS

* Guide and optimize all phases of the ML lifecycle, ensuring alignment with business goals and compliance standards

* Build the knowledge base required to deliver increasingly complex MLOPS projects on the Cloud(AWS, Azure, GCP)/On Prem

* Good experience with AWS and Azure.

* Act as a strategic advisor in client engagements, contributing to business development, solutioning, and delivery across diverse domains

REQUIRED QUALIFICATIONS:

* 12+ years of experience in software engineering with at least 5 years in MLOps leadership roles

* 6-12 years experience building production-quality software

* Strong experience in System Integration, Application Development or Data Warehouse projects across technologies used in the enterprise space

* Deep expertise in MLOps, ML lifecycle management, containerization (Docker/Kubernetes), and cloud-native architectures

* Object-oriented languages (e.g. Python, PySpark, Java, C#, C++)

* Experience developing CI/CD components for production ready ML pipeline.

* Database programming using any flavors of SQL

* Knowledge of Git for Source code management

* Ability to collaborate effectively with highly technical resources in a fast-paced environment

* Ability to solve complex challenges/problems and rapidly deliver innovative solutions

* Proven leadership in managing large teams, solving complex problems, and delivering enterprise-scale ML solutions

* Foundational Knowledge of Cloud Computing either one AWS, Azure or GCP

* Hunger and passion for learning new skills

EDUCATION:

* B.E/B.Tech/M.Tech in Computer Science or related technical degree OR Equivalent

* Advanced certifications in MLOps, Cloud Architecture, or AI/ML are a plus

Benefits:

As a full-time employee of the company or as an hourly employee working more than 30 hours per week, you will be eligible to participate in the health, dental, vision, life insurance, and disability plans in accordance with the plan documents, which may be amended from time to time. You will be eligible for benefits on the first day of employment with the Company. In addition, you are eligible to participate in the Company 401(k) Plan after 30 days of employment, in accordance with the applicable plan terms. The Company provides for 11 paid holidays and 12 weeks of Parental Leave. We also follow a "free time" PTO policy, allowing y

About the Company

Fractal is one of the most prominent providers of Artificial Intelligence to Fortune 500(r) companies. Fractal's vision is to power every human decision in the enterprise, and bring AI, engineering, and design to help the world's most admired companies. Fractal's businesses include Crux Intelligence (AI driven business intelligence), Eugenie.ai (AI for sustainability), Asper.ai (AI for revenue growth management) and Senseforth.ai (conversational AI for sales and customer service). Fractal incubated Qure.ai, a leading playe... Know more