- Company Name
- Forvis Mazars en France
- Job Title
- Stagiaire de fin d'études - Consultant Data Engineer - 2026 - H/F
- Job Description
-
Job Title: Final Year Internship – Data Engineer Consultant (2026)
Role Summary:
Assist in strategic data projects for major clients, focusing on migrating legacy SAS scripts to R, developing end‑to‑end data pipelines, and deploying solutions across public and private cloud environments. Participate in data quality, governance, and performance optimization initiatives.
Expectations:
- Final year student (Bac+5 or equivalent) in engineering, computer science, statistics or related field.
- Strong interest in system migration, code optimization, and data‑centric application development.
- Professional level English (oral & written).
Key Responsibilities:
- Inventory, analyze, and prioritize SAS scripts for migration.
- Translate SAS Base, SAS/STAT, macros, PROC SQL into efficient R (tidyverse, data.table) code.
- Design and implement target architecture, selecting appropriate R packages.
- Develop reusable R functions, optimize performance, and document code.
- Validate results by comparing R outputs with SAS outputs on reference datasets.
- Build and maintain data pipelines (extraction, transformation, API consumption, BI/visualization).
- Deploy solutions using CI/CD pipelines (GitLab, GitLab CI, Docker, Terraform).
- Work with cloud services (AWS Lambda, Azure Functions, GCP Cloud Functions, Kubernetes) and private cloud (OpenNebula, CloudStack, CephFS).
- Collaborate with full‑stack teams (REST APIs, VueJS, ReactNative).
- Support client stakeholders through communication, training, and documentation.
Required Skills:
- SAS Base, SAS/STAT, SAS macros, PROC SQL.
- R programming: tidyverse (dplyr, tidyr), data.table, statistical packages.
- Python or R for function development and code optimization.
- Linux command line and Bash scripting.
- Version control: Git/GitLab.
- Optional: code profiling, memory optimization, SAS data format handling.
- DevOps tools: GitLab CI, Ansible, Docker, Terraform.
- Cloud platforms: Azure, AWS, GCP.
- BI tools: Power BI, Qlik, Tableau.
- Soft skills: code legacy analysis, project planning, risk management, stakeholder communication.
Required Education & Certifications:
- Current enrollment in a Master’s or engineering degree (Bac+5) with a focus on data, statistics, or computer science.
- No mandatory certifications required, but knowledge of data engineering or cloud certifications (e.g., AWS Certified Solutions Architect, Azure Data Engineer Associate) is a plus.