Job Specifications
Role name: Sr Data Scientist
Work site: Overland Park, KS (Onsite)
Note : Minimum 12+years
Job Description:
Senior Data Scientist
We are seeking an experienced Senior Data Scientist with deep expertise in traditional predictive modeling, including advanced data preprocessing, feature engineering, model development, inference pipelines, and model evaluation. The ideal candidate will have strong technical skills in SQL, Databricks, Azure, and Databricks MLOps, and a proven track record of delivering production-grade machine learning models that directly support business outcomes.
________________________________________
Key Responsibilities
Predictive Modeling
• Develop, validate, and deploy traditional predictive ML models (e.g., regression, classification, time-series, survival models, uplift models).
• Build robust feature engineering pipelines that improve model accuracy, stability, and scalability.
• Conduct model evaluation and diagnostics using advanced statistical and ML evaluation techniques (ROC/AUC, F1, calibration, lift charts, PSI/KS, cross-validation).
• Implement and optimize inference pipelines for batch and real-time scoring environments.
Data Preprocessing & Feature Engineering
• Perform end to end data cleaning, wrangling, sampling, and transformation to prepare high-quality training datasets.
• Build scalable data preparation workflows using Databricks and PySpark.
• Work with structured, semi structured, and large-scale datasets in SQL, Delta Lake, and cloud storage environments.
• Partner with Data Engineering to define data quality checks, model inputs, and feature store integrations.
MLOps & Deployment
• Use Databricks MLflow, feature store, and model registry to track model experiments, versions, and deployments.
• Build automated CI/CD workflows for ML using Databricks MLOps frameworks.
• Collaborate with engineering teams to deploy, monitor, and maintain models in production.
• Conduct post deployment evaluations including drift monitoring, score stability, and operational performance reviews.
Cross-functional Collaboration
• Partner with business stakeholders to define analytical problems, modeling objectives, and success criteria.
• Work closely with Data Engineers, Product Owners, and ML Engineers to ensure reliable and scalable delivery of ML solutions.
• Communicate insights, modeling decisions, and performance metrics to technical and non-technical audiences.
________________________________________
Required Skills & Qualifications
• 8+ years of hands-on experience as a Data Scientist or Machine Learning practitioner.
• Strong expertise in traditional predictive modeling using Python & Spark (pandas, scikit-learn, statsmodels).
• Advanced skills in data preprocessing, EDA, feature engineering, and model evaluation.
• Strong proficiency in SQL for large-scale analytics and data transformation.
• Hands-on experience with Databricks (Spark, Delta Lake, notebooks, feature engineering pipelines).
• Experience deploying and managing models with Databricks MLflow and MLOps workflows.
• Proficient working with Azure cloud ecosystem (Azure Data Lake, Azure ML, ADF, Databricks on Azure).
• Solid understanding of software engineering best practices for ML—version control, unit testing, reproducibility, documentation.
• Experience working with large, complex, and high-volume datasets.
About the Company
Net2Source (N2S) is a global workforce solutions company recognized by SIA as the largest and fastest-growing Total Talent Solutions provider with a presence in 32 countries. and in-house Glo-Cal (global and local) teams to support our clients.
We carve out custom talent solutions, keeping People, Process, and Technology as the pillars of making the process simple, robust, and efficient. With over 3,500+ contractors working worldwide, we specialize in Contingent Staffing, RPO, Direct Sourcing, Payroll Solutions (EOR/AOR), ...
Know more