- Company Name
- Black Duck
- Job Title
- Principal Engineer, DevOps
- Job Description
-
Job title: Principal Engineer, DevOps
Role Summary: Lead technical architect and product owner for the Platform Engineering Team, overseeing the design, development, and adoption of a self‑service Internal Developer Platform (IDP) that enhances delivery velocity and reduces cognitive load for a large engineering workforce.
Expectations:
- 8+ years in DevOps, SRE, Platform Engineering or Cloud Architecture with proven leadership and product ownership.
- Extensive experience in designing and scaling cloud‑native platforms across GCP, AWS, and Azure.
- Mastery of Kubernetes, Terraform, GitOps (ArgoCD/Flux), service mesh, RBAC/IAM, and modern observability stacks.
- Strong programming in Go, Python, or Node.js; experience building developer portals (e.g., Backstage) and AI/ML tooling.
- Demonstrated success in implementing intelligent operational systems, monitoring, distributed tracing, and automated governance.
Key Responsibilities:
- Architect, build, and scale the enterprise IDP, delivering CI/CD pipelines, runtime environments, observability, RBAC/security guardrails, and FinOps controls.
- Develop AI‑powered observability solutions with pattern recognition and predictive incident response for distributed systems.
- Lead platform reliability initiatives, including automated rollbacks, zero‑touch operations, and standard workflow automation.
- Implement enterprise networking, security policies, and cost optimization strategies with automated governance and compliance.
- Design and scale AI platform infrastructure for SaaS products, supporting internal operations and customer‑facing AI/ML features.
- Mentor and coach engineering teams, driving platform adoption through evangelism and product management principles.
Required Skills:
- Cloud architecture (GCP, AWS, Azure) with enterprise networking and distributed systems.
- Kubernetes, Terraform, GitOps (ArgoCD/Flux), service mesh, RBAC/IAM.
- Programming in Go, Python, or Node.js.
- Experience building developer portals (Backstage) and AI/ML‑powered tooling.
- Implementation of intelligent operational systems, monitoring, and distributed tracing.
- Cloud governance, FinOps optimization, networking security, compliance frameworks.
- Deployment automation (canary deployments, feature flags).
- Knowledge of DORA metrics and data‑driven platform improvement methodologies.
Required Education & Certifications:
- BS or MS in Computer Science, Engineering, or related field (or equivalent experience).
- No specific certifications required.