- Company Name
- Luxoft
- Job Title
- Data scientist with Java expertise
- Job Description
-
**Job Title**
Data Scientist with Java Expertise
**Role Summary**
Design, develop, and optimize semantic and vector‑based search and recommendation solutions for a large eCommerce platform. Apply machine learning, NLP, and deep learning to improve search relevance, build scalable data pipelines and APIs, and integrate models into production systems while ensuring high performance and reliability.
**Expectations**
- 5+ years in Data Science/ML Engineering focused on Information Retrieval or Semantic Search
- Production‑level Java and Python programming experience
- Deep expertise in Lucene, Solr, or Elasticsearch (indexing, query tuning, analyzers, scoring)
- Proficiency with vector databases, embeddings, and semantic search techniques
- Strong NLP knowledge (tokenization, embeddings, transformers)
- Experience deploying and maintaining ML/search services in production
- Solid software engineering practices (CI/CD, testing, version control, code reviews)
**Key Responsibilities**
- Architect and implement semantic search pipelines using Lucene/Solr and modern embeddings
- Develop and tune deep learning/NLP models to enhance ranking and relevance
- Build and maintain scalable data ingestion, indexing, and inference APIs
- Deploy and monitor ML/search systems in a cloud‑native environment
- Collaborate with software and data engineers to translate business requirements into technical solutions
- Continuously evaluate search performance, refine models, and report metrics
**Required Skills**
- Java, Python (production code)
- Lucene/Solr/Elasticsearch
- Vector databases, embeddings, semantic search
- NLP: tokenization, transformers, embeddings
- Machine Learning, deep learning, model deployment
- CI/CD, automated testing, version control (Git)
- Cloud fundamentals (AWS/Azure/GCP, Docker/Kubernetes preferred)
**Required Education & Certifications**
- Bachelor’s or Master’s degree in Computer Science, Data Science, Statistics, or related field (or equivalent professional experience)
- Certifications in ML, Big Data, or cloud platforms are a plus.