- Company Name
- Haulotte
- Job Title
- Data Scientist H/F
- Job Description
-
**Job Title**
Data Scientist
**Role Summary**
Support the organization’s data-driven strategy by extracting insights, building predictive models, and engineering end-to-end data pipelines. Collaborate closely with data architects, engineers, and business analysts to deliver scalable solutions that integrate into the enterprise IT ecosystem.
**Expectations**
- Translate business requirements into technical deliverables.
- Maintain high standards of data quality, governance, and security.
- Continuously evaluate emerging analytics and AI techniques.
**Key Responsibilities**
- Explore, clean, and prepare complex, big‑data sets for analysis.
- Develop, validate, and document statistical and machine‑learning models (Python, Dataiku).
- Execute feature engineering and model optimisation for predictive and prescriptive analytics.
- Design, build, and maintain ETL/ELT pipelines using Azure Data Factory, Synapse, Spark, or equivalent.
- Implement the medallion architecture to produce gold datasets for analysts and stewards.
- Pilot fine‑tuning and deep‑learning tasks with PyTorch, TensorFlow, or LangChain.
- Engineer AI components (LLMs, diffusion models, embeddings/RAG, Fabric, Microsoft Foundry) and industrialise models for production.
- Produce visualisations (Power BI, Matplotlib, Seaborn) and present concise results to stakeholders.
- Lead project coordination, manage timelines, and document all processes for reproducibility.
- Ensure compliance with data governance, security policies, and regulatory standards.
**Required Skills**
- Advanced programming in Python (Pandas, NumPy, Scikit‑learn) and SQL.
- Strong knowledge of supervised and unsupervised machine‑learning techniques.
- Experience with data‑visualisation tools (Power BI, Matplotlib, Seaborn).
- Familiarity with cloud environments, preferably Azure (Azure Data Factory, Synapse, Data Lake, Fabric).
- Ability to work with structured and semi‑structured data.
- Experience designing and maintaining scalable data pipelines (ETL/ELT, Spark).
- Exposure to AI frameworks (PyTorch, TensorFlow, LangChain) and LLM technologies.
**Soft Skills**
- Analytical mindset, curiosity, and problem‑solving rigor.
- Excellent written and verbal communication for technical and non‑technical audiences.
- Collaborative, adaptable in fast‑changing technology contexts.
**Required Education & Certifications**
- Bachelor’s or Master’s degree in Data Science, Statistics, Computer Science, or equivalent.
- Demonstrated first practical experience in data exploitation.
- Relevant certifications (e.g., Microsoft Azure Data Scientist, Azure Solutions Architect) are a plus.