- Company Name
- Prodigy Education
- Job Title
- Staff Data Developer - Data Architect
- Job Description
-
Job title: Staff Data Developer – Data Architect
Role Summary: Architect and govern enterprise‑level data systems that support batch and streaming workloads, real‑time APIs, ML feature stores, and analytics. Deliver reusable, quality‑controlled, ML‑ready data pipelines and feature layers to scale AI/ML, experimentation, and personalized experiences across products and services.
Expactations: • 8+ years designing, governing, and evolving modern data platforms.
• Strong technical depth with Databricks, MLflow, dbt, Airflow, Kafka, Docker, Kubernetes.
• Proven ability to translate business requirements into scalable, secure, and governed data architectures.
• Excellent communication and stakeholder collaboration skills; mentor and lead cross‑functional teams.
Key Responsibilities:
- Lead end‑to‑end data architecture design for analytics, experimentation, and ML across product lines.
- Define infra strategy and standards for Databricks, Kafka, dbt, MLflow, Airflow, Docker, Kubernetes.
- Establish data governance, quality, lineage, naming conventions, and ML readiness practices.
- Build modular, reusable data products, standardized schemas, and feature stores for cross‑domain use.
- Collaborate with product managers, analysts, and engineers to align architecture with real workflows.
- Provide hands‑on technical guidance, code reviews, and mentorship to engineering teams.
Required Skills:
- Strategic architecture design for batch & streaming, real‑time APIs, ML feature stores, observability layers.
- Expertise in Databricks, MLflow, dbt, Airflow, Kafka, Docker, Kubernetes, and AWS cloud services.
- Strong data modeling, governance, lineage, and quality frameworks.
- Ability to design for reuse, modularity, and scalability.
- Business acumen, effective communication, and leadership for cross‑team collaboration.
Required Education & Certifications:
- Bachelor's (or equivalent) in Computer Science, Software Engineering, Information Systems, or related field.
- Relevant certifications in big data, cloud, or Kubernetes (e.g., Databricks Certified, Certified Kubernetes Administrator) are preferred.