- Company Name
- Autonomai Recruitment
- Job Title
- DevOps Engineer
- Job Description
Job Title: DevOps Engineer
Role Summary: Lead DevOps and SRE for high‑performance, low‑latency Linux platforms that support AI/ML‑driven trading and HPC workloads. Own reliability, performance tuning, automation, Kubernetes orchestration, and observability across bare‑metal and containerized environments.
Expectations:
• Proven experience in a top‑tier tech or elite trading environment (FAANG, hyperscale, high‑frequency trading).
• Track record building and owning technology from concept to production (0→1).
• Hands‑on expertise with Linux kernel, bare‑metal systems, and low‑latency optimization.
• Leadership in SRE/DevOps practices and strategic platform direction.
Key Responsibilities:
• Design, deploy, and scale Linux platforms for ultra‑reliable, ultra‑fast trading workloads.
• Optimize and tune kernel, networking, and system resources for minimal latency and maximum throughput.
• Own incident response, root‑cause analysis, and continuous reliability improvement.
• Automate build, deployment, and fleet management of bare‑metal Linux and container stacks.
• Manage large‑scale Kubernetes clusters, network configuration, and orchestration.
• Define observability standards; expand monitoring, alerting, and performance metrics.
• Analyze kernel‑level performance, networking, and distributed systems at scale.
• Build Python tooling for automation, reliability engineering, and performance analysis.
• Design highly distributed systems to support multi‑petabyte, multi‑cluster environments.
Required Skills:
• Deep expertise in Linux (kernel, system tuning, networking).
• Proficiency in Python scripting for automation and tooling.
• Strong DevOps and SRE practices (CI/CD, monitoring, incident management).
• Kubernetes architecture and management (cluster design, scaling, networking).
• Performance tuning, low‑latency optimization, and high‑throughput design.
• Distributed systems knowledge and experience with large‑scale HPC or simulation pipelines.
• Observability, monitoring, and alerting stack design.
• Problem‑solving and root‑cause analysis in production environments.
Required Education & Certifications:
• Bachelor’s degree (or equivalent) in Computer Science, Engineering, or related field.
• Certifications such as Certified Kubernetes Administrator (CKA), AWS Certified DevOps Engineer, or similar Linux/DevOps credentials are preferred.