- Company Name
- STATION F
- Job Title
- SOFTWARE ENGINEER - INFRASTRUCTURE & SITE RELIABILITY ENGINEERING
- Job Description
-
**Job Title**
Software Engineer – Infrastructure & Site Reliability Engineering
**Role Summary**
Design, build, and operate the core distributed systems and APIs that power a global serverless platform. Focus on scaling infrastructure, ensuring 99.99 % availability, and enabling zero‑configuration deployment for developers.
**Expectations**
- Scale the platform to 10+ new global data center locations.
- Maintain a deployment success rate of 99.99 % for 200 k+ services.
- Participate in 24/7 on‑call rotation and meet a 99.99 % monthly Service Level Objective.
- Deliver data‑driven, reliable, high‑performance services with measurable impact.
**Key Responsibilities**
- Design and implement core networking, orchestration, and serverless features (e.g., Nomad drivers, autoscaling, block storage, GPU snapshotting).
- Develop Go‑based gRPC and REST APIs, ensuring low latencies and high throughput.
- Build, instrument, and debug production systems across the stack: BareMetal hypervisors, Nomad, MicroVMs, Envoy, and Cilium.
- Strengthen observability, logging, and performance monitoring for rapid troubleshooting.
- Improve engineering standards, tooling, and processes across the team.
- Collaborate closely with leadership and cross‑functional product stakeholders.
**Required Skills**
- Strong proficiency in Go (Golang) with experience developing distributed systems.
- Deep knowledge of gRPC, REST, and microservices architecture.
- Hands‑on experience with container orchestration (Nomad, Kubernetes), hypervisors, and MicroVMs (Kata, Firecracker).
- Networking expertise: Cilium, Envoy, Linux networking, BGP, service mesh concepts.
- Familiarity with observability stacks (Prometheus, Grafana, Loki, Jaeger, etc.) and performance instrumentation.
- Problem‑solving skills in a production environment, including debugging multi‑layer stack issues.
- Experience with CI/CD pipelines, automated deployment, and BareMetal or cloud infrastructure.
- Ability to write clear technical specifications, automate processes, and drive continuous improvement.
**Required Education & Certifications**
- Bachelor’s degree or higher in Computer Science, Software Engineering, or related field.
- Prior professional experience as a backend or infrastructure engineer; SRE or DevOps roles preferred.
---