Hyperbolic

1 Job

1 Employee

About the Company

After 12 months working for AI automation agencies, delivering cutting-edge full-stack and low-code solutions, we are launching an agency of our own. Our focus is voice AI for business, which we believe will revolutionize customer support, sales, and other operations.

Listed Jobs

Company Name: Hyperbolic
Job Title: Head Of Infrastructure
Job Description:

**Job Title**
Head of Infrastructure

**Role Summary**
Lead the design, scaling, and reliability of a globally distributed GPU cloud, managing a cross‑functional infrastructure organization and aligning engineering efforts with product, security, and market objectives.

**Expectations**
- Own and execute a multi‑year infrastructure roadmap.
- Build a world‑class engineering team, set standards for excellence, and mentor senior staff and managers.
- Deliver high‑availability, secure, and cost‑efficient systems that support AI workloads at scale.

**Key Responsibilities**
- Architect distributed systems, networking, resource orchestration, and global capacity strategy.
- Design and maintain peer‑to‑peer GPU marketplace, inference fabric, and core platform primitives.
- Oversee multi‑cloud, on‑prem, and edge topologies with GPU‑centric workloads.
- Lead incident response, resilience engineering, and uptime targets (99.9–99.99%).
- Implement automation, IaC, GitOps, and observability (metrics, tracing, logging).
- Drive capacity planning, load forecasting, and cost optimization.
- Ensure security‑first infrastructure: isolation, IAM, hardening, and compliance.
- Collaborate with Product, Security, Platform, and GTM leaders to translate AI workloads into infrastructure solutions.

**Required Skills**
- 10+ years in infrastructure, systems engineering, or distributed systems; 5+ years in leadership.
- Deep knowledge of distributed systems, OS internals, networking, and resource orchestration.
- Hands‑on experience with Kubernetes, Nomad, SLURM, or custom schedulers at global scale.
- Proficiency in Go, Rust, Python, or equivalent for production code.
- Expertise in IaC, automation, GitOps, observability, and incident management.
- Strong judgment balancing velocity, reliability, cost, and security.
- Proven ability to mentor and grow engineering teams across infrastructure, platform, and SRE disciplines.

**Required Education & Certifications**
- Bachelor’s or Master’s degree in Computer Science, Engineering, or related field (preferred).
- Relevant certifications (e.g., Certified Kubernetes Administrator, AWS Certified Solutions Architect, or equivalent) are a plus.

San Francisco, United States

On-site

21-12-2025