- Company Name
- SS&C Technologies
- Job Title
- Site Reliability Engineer (SRE)
- Job Description
-
Job title: Site Reliability Engineer (SRE)
Role Summary: Leads cross‑functional technology teams to build and operate scalable, resilient, and secure cloud‑native infrastructure platforms. Drives application modernization, reduces tech debt, and embeds reliability, automation, and compliance across products and services.
Expactations: Demonstrates outstanding organization, project management, and attention to detail. Owns end‑to‑end reliability processes, fosters a culture of ownership, continuous learning, and blameless improvement, and supports global 24x5 coverage with smooth regional handoffs.
Key Responsibilities: • Design, deploy, and manage private cloud environments (VMware, OpenStack, OpenShift) and multi‑cluster Kubernetes architectures. • Build and maintain IaC pipelines, CI/CD workflows, and observability stacks (Prometheus, Splunk). • Implement SLOs, SLIs, and KPIs to guide prioritization and measure impact. • Automate problem detection, self‑healing, and escalation protocols to eliminate toil. • Integrate DevSecOps, zero‑trust principles, and policy‑as‑code into all pipelines. • Produce Architecture Decision Records and adopt the Cloud Well‑Architected Framework. • Conduct blameless retrospectives and disseminate lessons learned. • Ensure compliance with financial services regulations and security frameworks (ISO 27001, NIST 800‑53).
Required Skills: • 5+ years as an SRE, 3+ in regulated financial or healthcare environments. • Deep expertise in private cloud architecture and Kubernetes cluster operations. • Proficient with IaC tools, CI/CD systems, and observability platforms. • Strong grasp of reliability engineering metrics (SLAs, SLOs, KPIs). • Experience with financial‑grade network segmentation and zero‑trust architecture. • Excellent communication, collaboration, and documentation skills.
Required Education & Certifications: • Bachelor’s degree in Computer Science, Engineering, or related field. • Certifications valued: TOGAF, AWS Certified Solutions Architect, VMware VCP, Red Hat Certified Architect, ISO 27001, NIST 800‑53 related credentials.