- Company Name
- Doctolib
- Job Title
- Engineering Manager - Observability & Reliability Engineering Obsession (x/f/m)
- Job Description
-
**Job Title:** Engineering Manager – Observability & Reliability Engineering (OREO)
**Role Summary:**
Lead and grow a team of Site Reliability Engineers responsible for Doctolib’s observability platform and critical transversal services (HashiCorp Vault, Terraform Enterprise). Define strategy, drive roadmap, and ensure high availability, scalability, and debuggability of cloud‑native services while fostering a culture of operational excellence and psychological safety.
**Expactations:**
- Minimum 5 + years of software engineering or SRE experience in cloud‑native environments (AWS, GCP, Kubernetes).
- At least 3 + years of engineering management experience leading SRE, platform, or infrastructure teams.
- Strong technical depth in observability tooling, IaC, and secrets management.
- Ability to balance people leadership with hands‑on technical guidance.
**Key Responsibilities:**
1. **People Leadership** – Recruit, onboard, coach, and retain SRE talent; conduct 1:1s, performance reviews, and career development.
2. **Technical Strategy** – Define and evolve observability strategy (logging, metrics, tracing, alerting); own roadmap for Vault and Terraform Enterprise.
3. **Operational Excellence** – Manage on‑call rotations, incident response, post‑mortems, and continuous improvement of reliability practices.
4. **Cross‑functional Collaboration** – Align observability capabilities with product, security, and infrastructure teams; represent OREO in leadership forums and architectural reviews.
5. **Resource Advocacy** – Prioritize technical debt reduction and developer experience enhancements; allocate resources for large‑scale reliability initiatives.
**Required Skills:**
- Cloud‑native platforms (AWS, GCP, Kubernetes)
- Observability stack: Fluent Bit, OpenTelemetry, Loki, Elasticsearch, Prometheus/Thanos, Datadog
- Infrastructure as Code: Terraform, OpenTofu
- Secrets management: HashiCorp Vault, AWS Secrets Manager
- Strong leadership, mentoring, and communication abilities
- Experience with high‑traffic, fast‑growing environments and scaling SRE teams
**Required Education & Certifications:**
- Bachelor’s degree in Computer Science, Software Engineering, or a related field, or equivalent professional experience.
- Relevant certifications (e.g., AWS Certified Solutions Architect, Certified Kubernetes Administrator) are a plus but not mandatory.