- Company Name
- Thurn Partners
- Job Title
- Systems/SRE Engineer
- Job Description
-
Job title: Systems/SRE Engineer
Role Summary: Design, build, and operate scalable, highly available infrastructure to support a high-frequency trading platform. Work across development, IT, and trading teams to deliver zero‑downtime services, automate operations, and continuously improve system reliability.
Expactations: Demonstrate ownership of system health, deliver measurable performance gains, and maintain rigorous observability and incident response standards in a fast‑paced, mission‑critical environment.
Key Responsibilities:
- Architect and implement resilient systems for trading infrastructure.
- Proactively manage and scale Linux‑based environments, ensuring reliability, performance, and security.
- Drive automation of deployment, monitoring, and incident handling across development, IT, and trading teams.
- Develop and maintain observability solutions using Prometheus, Grafana, Thanos, ELK stack, and related tooling.
- Design and maintain containerized services with Kubernetes, Docker, and cloud platforms (AWS, GCP).
- Continuously assess and integrate emerging technologies to enhance system capabilities.
Required Skills:
- 4+ years of DevOps or SRE experience in a high‑performance setting.
- Strong proficiency in Python with knowledge of Go, Ruby, or Perl.
- Deep Linux system administration expertise.
- Hands‑on experience with Prometheus, Grafana, Thanos, and ELK stack for monitoring, logging, and alerting.
- Proficiency in Kubernetes, Docker, and cloud services (AWS, GCP).
- Ability to collaborate across multidisciplinary teams and communicate complex technical concepts effectively.
Required Education & Certifications:
- Bachelor’s degree in Computer Science, Engineering, or related field (or equivalent professional experience).
- Relevant certifications (e.g., AWS Certified Solutions Architect, Certified Kubernetes Administrator, or similar) are a plus.