- Company Name
- Saviynt
- Job Title
- Senior Site Reliability Engineer
- Job Description
-
**Job Title**
Senior Site Reliability Engineer
**Role Summary**
Lead the design, implementation, and maintenance of highly available, scalable cloud infrastructure for an AI‑powered identity platform. Drive continuous delivery excellence through robust CI/CD pipelines, IaC, and observability practices while collaborating cross‑functionally with development, QA, and operations teams to ensure performance, reliability, and security compliance.
**Expectations**
- Deliver end‑to‑end observability, monitoring, logging, and tracing for cloud‑native services.
- Own infrastructure automation, deployment processes, and incident response in AWS & Azure environments.
- Apply industry best practices for security, compliance, and performance across the delivery pipeline.
- Mentor and guide engineering teams on reliability principles and automation tooling.
**Key Responsibilities**
- Design, build, and maintain CI/CD pipelines (GitLab CI, Jenkins, GitHub Actions).
- Provision and manage AWS, Azure or Google Cloud resources using IaC (Terraform, CloudFormation, ARM, Ansible, Puppet).
- Configure and deploy the identity platform across cloud environments, automating install, upgrade, migration, and rollback.
- Implement, tune, and maintain observability stack (Prometheus, Grafana, ELK, Datadog, OpenTelemetry).
- Build and maintain scripts (Python, Java, Bash) to automate routine tasks and reduce manual effort.
- Troubleshoot and resolve production incidents, performing root cause analysis and implementing preventive measures.
- Write and maintain architecture, deployment, and operations documentation.
- Enforce security and quality controls across the software and infrastructure lifecycle.
**Required Skills**
- 2+ years of senior-level DevOps/Infra experience in AWS & Azure SaaS deployments.
- Deep expertise in IaC tools (Terraform, CloudFormation, ARM, Ansible, Puppet).
- Proficient with container orchestration (Docker, Kubernetes).
- Strong coding/scripting in Python, Java, Bash; version control with Git.
- Experience with CI/CD tooling, monitoring, logging, and tracing technologies.
- Solid understanding of cloud networking (BGP, routing, REST APIs) and IAM/security best practices.
- Familiarity with relational databases (MySQL) and API testing (Postman).
- Ability to work collaboratively across development, QA, and operations teams.
**Required Education & Certifications**
- Bachelor’s degree in Engineering or related field (Master’s preferred) or equivalent professional experience.
- Certifications: AWS Certified Solutions Architect/Developer, Azure Solutions Architect, Terraform Associate, or Kubernetes Administrator are highly desirable.
El segundo, United states
Hybrid
Senior
29-01-2026