- Company Name
- Cortex Consultants LLC
- Job Title
- Site Engineer
- Job Description
-
**Job Title**
Site Reliability Engineering (SRE) Architect
**Role Summary**
Design, build, and evolve enterprise‑level reliability infrastructure on AWS. Define SRE standards, implement observability, automate toil reduction, and lead architectural reviews to ensure highly available, secure, and cost‑effective services.
**Expactations**
- Lead technical strategy for reliability across development and operations teams.
- Mentor SREs and engineers, fostering adoption of SRE principles.
- Own end‑to‑end reliability lifecycle: design, deployment, monitoring, incident response, and postmortem improvement.
**Key Responsibilities**
1. **Reliability Strategy & Design** – Architect scalable, secure AWS infrastructure; define SLI/SLO/error‑budget policies; enhance observability maturity.
2. **Platform Architecture & Automation** – Create automation frameworks (CI/CD, IaC), evaluate & recommend tooling (chaos engineering, incident‑remediation).
3. **Technical Leadership & Consultation** – Provide shift‑left guidance on new services, conduct architectural reviews, and ensure production readiness.
4. **Resilience** – Own blameless postmortems, drive resilience patterns (circuit breaking, rate limiting), and push systemic improvements.
**Required Skills**
- Architectural experience designing scalable, high‑reliability systems.
- Deep mastery of SRE concepts: SLIs/SLOs, error budgets, toil reduction, incident management, postmortems.
- AWS: compute, networking, security, and IaC (CloudFormation, Terraform).
- Container & orchestration: Kubernetes, Docker, serverless.
- Observability stack: Prometheus, Grafana, Dynatrace, ELK/EFK, Jaeger, OpenTelemetry.
- Programming/scripting: Python, Go, Bash for automation.
- Analytical, strategic thinker with strong communication and leadership skills.
- Experience in chaos engineering best practices (preferred).
**Required Education & Certifications**
- Bachelor’s degree in Computer Science, Engineering, or related field (or equivalent experience).
- Relevant cloud certifications (AWS Certified Solutions Architect, AWS Certified DevOps Engineer) highly preferred.