- Company Name
- Quindar
- Job Title
- Site Reliability Engineer, US Gov
- Job Description
-
**Job Title**
Site Reliability Engineer – US Gov
**Role Summary**
Design, automate, test, deploy, and maintain secure, highly‑available cloud infrastructure in AWS GovCloud and AWS C2E for Quindar’s mission‑critical systems. Ensure compliance with SOC2, NIST 800‑171, and FedRAMP Moderate controls while managing on‑prem Quindar deployments at government sites. Drive readiness and continuous hardening for AWS C2E, provide incident response, 24/7 on‑call support, and collaborate with frontend, backend, and operations teams to meet performance metrics.
**Expectations**
- Deliver enterprise‑grade, self‑healing infrastructure that minimizes manual intervention.
- Achieve and maintain compliance with federal security standards and ATO processes.
- Lead incident management and continuous improvement of operational excellence.
- Maintain deep expertise in Kubernetes, serverless workloads, and GovCloud/C2E environments.
**Key Responsibilities**
- Architect, automate, and maintain AWS GovCloud and C2E environments (EKS, Rancher, serverless).
- Build and manage observability stacks (Grafana, Datadog, etc.) for performance and reliability monitoring.
- Implement IaC with Terraform (or equivalent).
- Design secure networking (VPN, NLB/ALB, HTTPS/TLS, VPC peering, CDN).
- Develop CI/CD pipelines (GitLab Workflows preferred).
- Manage identity and access (Auth0, Keycloak, AWS IAM).
- Conduct incident response, triage, and post‑mortem activities on a 24/7 on‑call rotation.
- Define and enforce best practices for availability, latency, security, and cost efficiency.
- Collaborate with engineering teams to meet system performance and reliability goals.
**Required Skills**
- Kubernetes cluster management (EKS, Rancher).
- AWS GovCloud, IL‑enclave, or C2E experience.
- Observability tools (Grafana, Datadog, etc.).
- IaC: Terraform (or similar).
- Scripting: Python.
- Network fundamentals: VPN, NLB/ALB, HTTPS/TLS, VPC, CDN.
- API design, distributed databases, caching, event‑driven architectures.
- CI/CD pipeline development (GitLab).
- Unix/Linux system administration.
- Cloud security best practices and enclave architecture.
- Identity & access management (Auth0, Keycloak, AWS IAM, ICAM).
- Git proficiency and multi‑classification deployment experience.
**Required Education & Certifications**
- Bachelor’s degree in Computer Science or related field.
- Minimum 3 years professional experience as SRE, DevOps, reliability, infrastructure, or platform engineer.
- U.S. Security Clearance (Secret or higher; TS/SCI preferred).
- U.S. citizenship required.
- Experience with ATO/authorization in federal, DoD, or IC environments preferred.
- Experience deploying in GovCloud, C2S/C2E, or IL‑enclave environments highly desirable.
Los angeles, United states
Hybrid
Junior
02-12-2025