Job Specifications
Join our Generative AI team to work on groundbreaking projects that shape the future of AI and data science. Your contributions will directly impact the development of innovative solutions used by global industry leaders. You'll play a pivotal role in transforming how our data are seamlessly integrated with AI systems, paving the way for the next generation of customer interactions.
We are seeking a Data Scientist to join our Generative AI team. This role will focus on creating and maintaining AI-ready data, leveraging the deep technical knowledge already established within the London team. You will support text and numerical data extraction, curation, and metadata enhancements, accelerating development. You will also help ensure rapid response times, minimizing potential disruptions.
What Will You Be Doing
AI-Ready Data Development: Design, develop, and maintain high-quality AI-ready datasets, ensuring data integrity, usability, and scalability to support advanced Generative AI models.
Data Processing: Contribute to data extraction, cleansing, and curation initiatives for diverse text and numerical datasets. Implement sophisticated metadata enrichment strategies to enhance data utility and accessibility for AI systems.
Algorithm Implementation & Optimization: Implement and optimize state-of-the-art algorithms and pipelines for efficient data processing, feature engineering, and data transformation tailored for LLM and GenAI applications.
GenAI Application Development: Apply and integrate frameworks like LangChain and Hugging Face Transformers to build modular, scalable, and robust Generative AI data pipelines and applications.
Prompt Engineering Application: Apply advanced prompt engineering techniques to optimize LLM performance for specific data extraction, summarization, and generation tasks, working closely.
LLM Evaluation Support: Contribute to the systematic evaluation of Large Language Models (LLMs) outputs, analysing quality, relevance, and accuracy, and supporting the implementation of LLM-as-a-judge frameworks.
Retrieval-Augmented Generation (RAG) Contribution: Actively contribute to the implementation and optimization of RAG systems, including working with embedding models, vector databases, and, where applicable, knowledge graphs, to enhance data retrieval for GenAI.
Cross-Functional Collaboration: Collaborate effectively with global data science teams, engineering, and product stakeholders to integrate data solutions and ensure alignment with broader company objectives.
Operational Excellence: Troubleshoot and resolve data-related issues promptly to minimize potential disruptions, ensuring high operational efficiency and responsiveness.
Documentation & Code Quality: Produce clean, well-documented, production-grade code, adhering to best practices for version control and software engineering.
Skills And Experience
Academic Background: Advanced degree in AI, statistics, mathematics, computer science, or a related field.
Programming and Frameworks: Hands-on experience with Python, TensorFlow or PyTorch, and NLP libraries such as spaCy and Hugging Face.
GenAI Tools: Practical experience with LangChain, Hugging Face Transformers, and embedding models for building GenAI applications.
Prompt Engineering: Expertise in prompt engineering, including prompt tuning, chaining, and optimization techniques.
LLM Evaluation: Experience evaluating LLM outputs, including using LLM-as-a-judge methodologies to assess quality and alignment.
RAG: Familiarity with vector databases.
Cloud: Practical experience with Gemini/OpenAI models and cloud platforms such as AWS, Google Cloud, or Azure. Experience with Docker for containerization is a plus.
Data Engineering: Understanding of data extraction, curation, metadata enrichment, and AI-ready dataset creation.
Collaboration and Communication: Excellent communication skills and a collaborative mindset, with experience working across global teams.
What’s in it for you
Our rapidly growing, award-winning business offers a dynamic environment for talented, entrepreneurial professionals to achieve results and grow their careers. Argus recognizes and rewards successful performance and as an Investor in People, we promote professional development and retain a high-performing team committed to building our success.
Competitive salary and company bonus scheme
Group pension scheme
Group healthcare and life assurance scheme
Flexible working environment
25 days holiday with annual increase up to 30 days
Subsidised gym membership
Season ticket travel loans
Cycle to work scheme
Extensive internal and external training
About Argus
Argus is the leading independent provider of market intelligence to the global energy and commodity markets. We offer essential price assessments, news, analytics, consulting services, data science tools and industry conferences to illuminate complex and opaque commodity markets. Headquartered in London with over 1,500 staff, Argus is an ind
About the Company
Argus is the leading independent provider of market intelligence to the global energy and commodity markets. We offer essential price assessments, news, analytics, consulting services, data science tools and industry conferences to illuminate complex and opaque commodity markets. Headquartered in London with over 1,500 staff, Argus is an independent media organisation with 30 offices in the world’s principal commodity trading hubs. Companies, trading firms and governments in 160 countries around the world trust Argus data to...
Know more