- Company Name
- Scribd, Inc.
- Job Title
- Machine Learning Engineer
- Job Description
-
**Job title**
Machine Learning Engineer II
**Role Summary**
Design, build, and optimize production‑grade machine learning pipelines and platform services that deliver real‑time AI features (e.g., recommendations, personalization, LLM-powered experiences) to millions of users.
**Expactations**
- Deliver end‑to‑end ML solutions with high performance, scalability, and reliability.
- Collaborate cross‑functionally with product and software engineering teams to embed models into user‑facing features.
- Conduct rigorous experimentation, A/B testing, and analytics to validate model impact.
- Maintain and enhance core ML platform components (feature store, model registry, embedding‑based retrieval).
**Key Responsibilities**
- Build and optimize data ingestion, feature engineering, training, and serving pipelines using Python, Spark, Databricks, Airflow, and CI/CD tools.
- Extend and improve the feature store, model registry, embedding-based retrieval, and model inference services.
- Integrate trained models into product features (recommendations, personalization, AskAI) via HTTP/gRPC APIs.
- Perform model experimentation, A/B testing, and runtime performance analysis.
- Refactor and tune existing systems for scalability, reliability, and cost efficiency.
- Implement automated data validation, monitoring, and alerting; maintain security and compliance.
- Participate in code reviews, enforce best practices, and document solutions.
**Required Skills**
- 3+ years professional experience in software or machine learning engineering.
- Strong proficiency in Python (or Golang); experience with Scala or Ruby acceptable.
- Hands‑on design and deployment of ML pipelines using Spark, Databricks, or similar.
- Experience with feature stores, model registries, and model serving at scale.
- Cloud expertise in AWS (Lambda, ECS, EKS, SQS, CloudWatch, Terraform).
- Familiarity with AWS SageMaker, embedding‑based retrieval (Weaviate), and large‑scale LLM integration (OpenAI, Anthropic, Gemini).
- Knowledge of PR/SQL for data extraction, ML experiment tracking, and performance tuning.
**Required Education & Certifications**
- Bachelor’s degree in Computer Science, Electrical Engineering, or related field (or equivalent professional experience).
- (Optional) Certifications such as AWS Certified Solutions Architect, TensorFlow Developer, or similar.