cover image
InstaDeep

AI & Genomics Intern

On site

Paris, France

Fresher

Internship

19-09-2025

Share this job:

Skills

Python Research Machine Learning PyTorch Scikit-Learn Deep Learning Programming Databases git benchmarking Autonomy Numpy Pandas

Job Specifications

InstaDeep, founded in 2014, is a pioneering AI company at the forefront of innovation. With strategic offices in major cities worldwide, including London, Paris, Berlin, Tunis, Kigali, Cape Town, Boston, and San Francisco, InstaDeep collaborates with giants like Google DeepMind and prestigious educational institutions like MIT, Stanford, Oxford, UCL, and Imperial College London. We are a Google Cloud Partner and a select NVIDIA Elite Service Delivery Partner. We have been listed among notable players in AI, fast-growing companies, and Europe's 1000 fastest-growing companies in 2022 by Statista and the Financial Times. Our recent acquisition by BioNTech has further solidified our commitment to leading the industry.

Join us to be a part of the AI revolution!

We are seeking a motivated AI & Genomics Intern to contribute to an applied research project at the intersection of machine learning and genomics. The project will focus on designing and evaluating methods that combine genomic and phenotypic data to improve predictive modeling, benchmarking, and variant interpretation.

Responsibilities

Develop a reproducible pipeline for genomic data ingestion, cleaning, and quality control.
Collect, organize, and document datasets from public or internal sources.
Benchmark machine learning and deep learning models using JAX and PyTorch.
Integrate multi-modal features such as genomic variants, phenotype measurements, and metadata.
Evaluate models for performance, interpretability, and generalization across datasets.
Deliver clean code, reproducible pipelines, a technical report, and a final presentation.

Requirements

Strong programming skills in Python (pandas, numpy, scikit-learn).
Practical experience with deep learning frameworks (JAX and PyTorch).
Good understanding of genomics databases and handling large biological datasets.
Knowledge of genomics handling basics: FASTQ/BAM/VCF formats, SNPs, haplotypes.
Ability to work with autonomy, while communicating results effectively.
Good coding practices (Git, testing).

Preferred Qualifications

Experience with bioinformatics tools (samtools, bcftools).
Familiarity with workflow frameworks (Snakemake, Nextflow).
Statistical genetics (GWAS, mixed models, GBLUP).
Experience with multi-modal biological datasets.

What You Will Learn

Applying state-of-the-art AI frameworks (JAX, PyTorch) to genomic data.
Building end-to-end pipelines for reproducible data processing
Designing benchmarks to compare models and evaluate robustness.
Collecting and curating large-scale datasets for real-world genomics applications.
Please submit your CV/Resume in English*

Duration: 6 Months internship

Our commitment to our people

We empower individuals to celebrate their uniqueness here at InstaDeep. Our team comes from all walks of life, and we're proud to continue encouraging and supporting applicants from underrepresented groups across the globe. Our commitment to creating an authentic environment comes from our ability to learn and grow from our diversity, and how better to experience this than by joining our team? We operate on a hybrid work model with guidance to work at the office 3 days per week to encourage close collaboration and innovation. We are continuing to review the situation with the well-being of InstaDeepers at the forefront of our minds.

Right to work: Please note that you will require the legal right to work without visa sponsorship in the location you are applying for. We do not sponsor work visas.

About the Company

InstaDeep is a pioneer in decision-making AI, driven by the belief that AI can help everyone. We work with businesses and communities to help them realise the benefits of the latest in AI technology. We excel in applying Machine Learning, and, in particular, Reinforcement Learning – systems learn to excel at tasks through experience – across industry sectors, including logistics, biotech, electronics design, and beyond, empowering businesses to harness AI to tackle their most complex challenges. At InstaDeep, human values ... Know more