- Company Name: Discovered MENA
- Job Title: Mid/Senior DevOps Engineer - AI/ML (Relocation to Abu Dhabi)
- Job Description:
Job Title: Mid / Senior DevOps Engineer – AI/ML
Role Summary:
Design, implement, and maintain scalable, reliable infrastructure for AI/ML workloads. Collaborate with development teams to streamline deployments, automate pipelines, and ensure high availability and security of production environments.
Expectations:
Deliver robust DevOps solutions for AI/ML projects, operating across AWS, Azure, or GCP. Maintain CI/CD pipelines, containerized services, and observability tooling to support continuous delivery and system stability.
Key Responsibilities:
- Build and manage containerized AI/ML services using Docker and Kubernetes.
- Create and maintain CI/CD pipelines with Azure DevOps, Jenkins, GitLab, or CircleCI.
- Implement IaC practices (Terraform, Pulumi) to provision cloud resources.
- Configure monitoring, logging, and alerting with Prometheus, Datadog, New Relic, Grafana, and PagerDuty.
- Optimize networking and load balancing, ensuring DNS, firewalls, and traffic routing are secure and efficient.
- Scale databases (VectorDBs, SQL/NoSQL) for high‑throughput AI workloads.
- Collaborate with data scientists and ML engineers to integrate real‑time streaming architectures.
- Resolve incidents, perform root‑cause analysis, and enforce post‑mortem improvements.
Required Skills:
- Proficiency in Python, Bash, or Go.
- Hands‑on experience with AWS, Azure, or GCP.
- Expertise in Docker, Kubernetes, and CI/CD tooling.
- Knowledge of monitoring and alerting solutions (Prometheus, Datadog, New Relic, Grafana, PagerDuty).
- Strong networking fundamentals (DNS, load balancing, firewalls).
- Understanding of real‑time AI and streaming data architectures.
- Familiarity with service meshes (Istio, Linkerd) and database scaling is a plus.
Required Education & Certifications:
Bachelor’s degree in Computer Science or Engineering, or equivalent practical experience.
---