- Company Name
- Swift Strategic Solutions Inc
- Job Title
- Principal Python Developer - Data Engineering (ML Background Welcome)
- Job Description
-
Job Title: Principal Python Developer – Data Engineering (ML Background Welcome)
Role Summary: Lead architecture and implementation of production‑grade Python services and large‑scale data pipelines for financial services projects. Drive end‑to‑end development on AWS, ensuring performance, reliability, and scalability.
Expectations: Rapidly deliver production code and reusable components; influence data‑engineering standards across the team; collaborate with cross‑functional data science and Ops groups; mentor peers; continuously improve processes and tooling.
Key Responsibilities:
- Design, develop, and maintain scalable Python‑based data pipelines using AWS services (EMR, Glue, Redshift, S3, Step Functions, Lambda).
- Build distributed data transformations with PySpark, ensuring optimal performance for large datasets.
- Write efficient SQL and optimize queries against complex schemas in Redshift and other data stores.
- Develop robust code using TDD/BDD with pytest and behave, and maintain unit and integration test suites.
- Contribute to IaC and DevOps practices (Terraform/CloudFormation, CI/CD pipelines on GitHub/GitLab).
- Create reusable libraries and apply design patterns for maintainability and extensibility.
- Contribute to open‑source projects and follow community best practices.
Required Skills:
- 10+ years of professional software development experience, including 7+ years writing production‑grade Python.
- Strong CS fundamentals: data structures, algorithms, OOP, concurrency, exception handling.
- Proficiency in AWS data services: EMR, Glue, Redshift, S3, Step Functions, Lambda.
- Deep experience with PySpark for distributed transformations.
- Advanced SQL skills and query optimization.
- Experience with TDD/BDD (pytest, behave) and testing frameworks.
- Familiarity with IaC (Terraform or CloudFormation) and DevOps tooling (Git, CI/CD).
- Knowledge of design patterns, reusable library development, and coding best practices.
- Proficiency in Python plus at least one additional language (C++, Go, Java, Scala).
- Proven background in Data Science or ML Engineering requiring substantial Python development (model building, training pipelines, ML framework development at scale).
- Active participation in open‑source or community projects.
Required Education & Certifications:
- Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field.