- Company Name
- Advanced Software Talent
- Job Title
- DevOps Engineer
- Job Description
-
**Job title**: DevOps Engineer
**Role Summary**: Architect, implement, and maintain end‑to‑end deployment pipelines for AI‑driven clinical data applications. Collaborate with full‑stack developers, data scientists, and product teams to integrate generative AI models, ensuring secure, scalable, and observable production workloads on AWS.
**Expactations**:
- Drive continuous improvement of the software delivery lifecycle.
- Lead automation of data, model, and infrastructure provisioning.
- Deliver reliable, high‑performance AI services with robust monitoring and incident response.
- Act as a tech advocate, translating complex concepts to cross‑functional stakeholders.
**Key Responsibilities**:
- Design and maintain CI/CD pipelines (GitLab CI, GitHub Actions) for front‑end and back‑end components, including LLM integration.
- Deploy containerized services (Docker, Podman) to Kubernetes (EKS) and manage cluster lifecycle.
- Provision and manage cloud infrastructure using IaC (Terraform, CloudFormation) and GitOps (Argo CD).
- Integrate caching (Redis), data warehousing (Snowflake, BigQuery, Redshift), and observability stacks (Prometheus, Grafana, ELK).
- Oversee model deployment, monitoring, and versioning for generative AI workflows.
- Implement security hardening, compliance checks, and vulnerability scanning (DevSecOps).
- Conduct incident investigations, root‑cause analyses, and post‑mortems.
- Collaborate with Agile teams to define sprint goals and deliverables.
**Required Skills**:
- Proficient in front‑end frameworks (Vue.js, React) and back‑end Python or JavaScript (FastAPI, Django, Flask, Next.js).
- 2+ years AI/ML application development and deployment experience.
- 3+ years DevOps experience on AWS (EKS, EC2, RDS).
- Container orchestration (Kubernetes), CI/CD, IaC, GitOps expertise.
- Familiarity with caching, messaging, and data warehousing technologies.
- Experience with observability tools (Prometheus, Grafana, ELK).
- Strong Linux administration, security best practices, and incident response.
- Ability to communicate technical details to non‑technical audiences.
**Required Education & Certifications**:
- Bachelor’s or Master’s degree in Computer Science, Engineering, Mathematics, or related field.
- (Optional) AWS Certified DevOps Engineer – Professional, Terraform Associate, or equivalent certifications.
South san francisco, United states
On site
Junior
22-12-2025