- Company Name
- HiLabs
- Job Title
- Senior Devops Lead Engineer
- Job Description
-
Job title: Senior DevOps Lead Engineer
Role Summary: Lead and own a major DevOps charter (e.g., CI/CD modernization, cloud platform engineering, SRE, DevSecOps). Drive strategy, roadmap, and execution across diverse in‑house, cloud, hybrid, and customer‑hosted restricted environments. Mentor junior/mid engineers and influence engineering direction to maintain consistent DevOps practices organization‑wide.
Expactations: • 8–12 + years of DevOps/SRE/platform engineering experience. • Proven end‑to‑end ownership of complex, cross‑functional initiatives. • Strong communication skills for engaging technical and non‑technical stakeholders. • Ability to work under strict compliance and security constraints in customer environments.
Key Responsibilities: • Own a DevOps charter: define strategy, roadmap, KPIs, and execution plans. • Mentor and guide cross‑level DevOps staff. • Design and evolve CI/CD pipelines, IaC, observability, and automation frameworks. • Architect and operate Kubernetes clusters and IaC using Terraform, CloudFormation, or Pulumi. • Implement DevSecOps workflows: security scanning, secrets management, policy‑as‑code, and compliance automation. • Manage monitoring, logging, tracing systems (Prometheus, Grafana, ELK/EFK, OpenTelemetry). • Drive SRE practices: incident response, RCA, SLO/SLI management, and resilience engineering. • Build reusable automation templates and deployment patterns for diverse infrastructure setups.
Required Skills: • Deep expertise in Kubernetes, Docker, Linux, networking, and security fundamentals. • Strong proficiency in CI/CD tools (GitHub Actions, GitLab CI, Jenkins, ArgoCD, Azure DevOps). • IaC experience with Terraform, Pulumi, CloudFormation or equivalent. • Automation tooling knowledge (Ansible, Chef, Puppet, SaltStack). • Cloud experience in AWS (primary), Azure, GCP. • DevSecOps practices (scanning, secrets, policy‑as‑code). • Observability and monitoring stacks (Prometheus, Grafana, ELK/EFK, OpenTelemetry). • SRE principles (incident response, RCA, SLO/SLI). • Excellent stakeholder communication and leadership.
Required Education & Certifications: • Bachelor’s degree in Computer Science, Engineering, or related field (or equivalent experience). • Relevant certifications (e.g., AWS Certified Solutions Architect, Microsoft Certified: Azure DevOps Engineer Expert, Certified Kubernetes Administrator) preferred.