- Company Name: Sophus IT Solutions
- Job Title: Senior Data Scientist
- Job Description:
**Job Title:** Senior Data Scientist
**Role Summary:**
Lead the design, development, and deployment of advanced AI agent architectures, large language models (LLMs), and NLP solutions for healthcare benefits, clinical, and administrative workflows. Build interoperable, context‑aware, self‑improving multi‑agent systems that comply with HIPAA and related regulations.
**Expectations:**
- Deliver end‑to‑end AI solutions from research prototype to production.
- Ensure scalability, security, and explainability of agent pipelines.
- Collaborate cross‑functionally with engineering, data architecture, and compliance teams.
- Drive innovation in memory‑based agents, RLHF, and retrieval‑augmented generation.
**Key Responsibilities:**
- Design and implement Agent‑to‑Agent (A2A) protocols for autonomous collaboration among specialized AI agents.
- Architect Model Context Protocol (MCP) pipelines to provide persistent, memory‑augmented LLM interactions.
- Fine‑tune domain‑specific LLMs/NLP models (e.g., medical BERT, BioGPT) for document understanding and intent classification.
- Build retrieval‑augmented generation (RAG) systems using structured (FHIR/ICD‑10) and unstructured (EHR notes) data.
- Develop secure, explainable, HIPAA‑compliant agentic pipelines and MLOps workflows for versioning, monitoring, and continuous improvement.
- Lead research and prototyping in memory‑based agents, RLHF, and context‑aware task planning.
- Mentor junior staff and contribute to technical documentation and knowledge sharing.
**Required Skills:**
- Advanced Python programming; expertise with PyTorch, Hugging Face Transformers, LangChain, spaCy, and comparable ML/NLP libraries.
- Proven experience with multi‑agent orchestration tools (LangGraph, AutoGen, CrewAI or equivalents).
- Hands‑on implementation of A2A protocols and the Model Context Protocol (MCP).
- Strong background in LLMs, transformers, and NLP applied to healthcare.
- Knowledge of healthcare data standards (FHIR, HL7, ICD‑10, CPT, X12 EDI).
- Cloud‑native development on AWS, Azure, or GCP; containerization (Docker, Kubernetes) and CI/CD pipelines.
- Familiarity with vector databases and retrieval systems for dynamic agent memory.
**Required Education & Certifications:**
- Master’s degree or Ph.D. in Computer Science, Machine Learning, Computational Linguistics, or a related field.
- Minimum of 7 years of applied AI/NLP experience, preferably in healthcare.
- No specific certifications required, but demonstrated expertise in relevant technologies and in HIPAA, CMS, and NCQA compliance requirements is essential.