- Company Name
- Trispoke Managed Services Pvt. Ltd.
- Job Title
- Data Scientist
- Job Description
-
Job Title: Data Scientist
Role Summary:
Lead end‑to‑end development, deployment, and optimization of AI/ML solutions, including large language models, across enterprise supply‑chain, operations, workforce, and logistics domains. Partner with business stakeholders to translate complex requirements into scalable data science products that deliver actionable insights and high‑quality models.
Expectations:
* 6+ years of professional data science experience, focused on generative AI and large language models.
* Proficiency in designing, training, and deploying ML solutions that meet rigorous enterprise performance and reliability standards.
* Strong communication skills to articulate technical findings to both technical and non‑technical audiences.
* Ability to work independently within a contracting framework and manage multiple concurrent projects.
Key Responsibilities:
1. Own the full lifecycle of ML projects, including those built on LLMs, transformers, and RNNs: data discovery, cleaning, feature engineering, model design, training, evaluation, and production deployment.
2. Collaborate with stakeholders to elicit requirements, define success metrics, and translate business problems into ML solutions.
3. Perform data preprocessing, validation, and transformation to ensure data quality and integrity.
4. Generate and present visualizations and insights using Tableau, Power BI, or equivalent tools.
5. Containerize models with Docker, integrate into CI/CD pipelines, and maintain build artifacts.
6. Deploy models on cloud data and big‑data platforms (e.g., AWS Glue, Azure Data Lake, Apache Spark).
7. Develop and apply agent‑based RAG systems for knowledge‑intensive tasks and for mapping data to normalization standards such as OAGIS.
8. Maintain documentation, model cards, and best‑practice guides for reproducibility.
Required Skills:
* Advanced knowledge of large language models (LLMs), generative AI, and agent‑based RAG architectures.
* Proficiency in Python‑ or R‑based tooling such as scikit‑learn, TensorFlow, PyTorch, and LangChain.
* Experience with data visualization (Tableau, Power BI).
* Familiarity with Docker, CI/CD, and cloud services (AWS, Azure).
* Strong background in data modeling, statistical inference, and ML algorithm design.
* Excellent verbal and written communication, presentation, and stakeholder‑management skills.
Required Education & Certifications:
* Bachelor’s degree in Computer Science, Statistics, Mathematics, or a related field (Master’s or Ph.D. preferred).
* Certifications in AWS/Azure Data Services, Docker, or ML platforms are advantageous.