- Company Name
- Grafton Recruitment
- Job Title
- Senior SRE/DevOps Engineer (M/F)
- Job Description
**Job Title**
Senior SRE/DevOps Engineer – Cloud (Google Cloud)
**Role Summary**
Lead the deployment, management, and optimization of production, pre‑production, and UAT environments on Google Cloud. Engineer scalable, secure, and cost‑effective infrastructure using Kubernetes, Terraform, CI/CD pipelines, and container best practices. Serve as the primary point of contact for incident response, monitoring, and continuous improvement in a high‑availability setting.
**Expectations**
- Operate in a rapidly evolving cloud modernization program, collaborating closely with development and operations teams.
- Demonstrate a track record of maintaining and evolving production systems with minimal downtime.
- Exhibit strong ownership of infrastructure reliability, security posture, and cost efficiency.
- Provide proactive support during on‑call rotations and manage critical incidents with thorough post‑mortems.
**Key Responsibilities**
- Deploy and oversee GCP environments (UAT, Pre‑Prod, Prod) ensuring high availability and scalability.
- Design, implement, and maintain Kubernetes clusters (autoscaling, Operators, Helm, RBAC) integrated with GCP services (Cloud SQL, IAM, Secret Manager, Memorystore).
- Build and maintain CI/CD pipelines using Terraform, GitLab CI, and ArgoCD for automated, repeatable deployments.
- Enforce and evolve security controls for cloud, containers, and infrastructure as code.
- Configure monitoring and alerting with Prometheus and Grafana, along with appropriate logging solutions.
- Document architecture, runbooks, and operational procedures.
- Lead incident management, provide first‑line support, and conduct post‑mortem analyses to drive continuous improvement.
**Required Skills**
- Deep expertise in Google Cloud Platform (≥3 years), including GKE, Cloud SQL, IAM, Secret Manager, and Memorystore.
- Hands‑on experience with Kubernetes cluster operations (autoscaling, Operators, Helm, RBAC).
- Proficiency in Terraform for cloud provisioning.
- CI/CD pipeline creation (GitLab CI, ArgoCD) and scripting in Shell and Python.
- Strong monitoring, logging, and alerting skills (Prometheus, Grafana).
- Linux systems administration in containerized environments.
- Proven incident handling and post‑mortem documentation.
- Excellent communication, ownership, and customer‑service orientation.
**Required Experience, Education & Certifications**
- Minimum 5 years of DevOps/SRE/Cloud Engineering experience.
- Minimum 3 years of experience managing production incidents, monitoring, and on‑call rotations.
- Bachelor’s or Master’s degree (Bac+5) in Computer Science or equivalent.
- Google Cloud or Kubernetes certification preferred (e.g., Google Cloud Professional Cloud Architect, Certified Kubernetes Administrator).