cover image
Enigma

Computer Vision Research Engineer

Remote

United kingdom

Junior

Full Time

03-02-2026

Share this job:

Skills

Python Docker Research Training Computer Vision git AWS cloud platforms Recruitment

Job Specifications

Computer Vision Research Engineer | Computer Vision | Diffusion Models | Python | Gaussian Splatting | Remote, UK

Computer Vision Research Engineer

Role: Computer Vision Research Engineer

Location: United Kingdom

Working: Remote, with optional occasional on-site working in a UK office

Reports to: Head of Research

Are you interested in helping to craft exceptional experiences for clients that deliver genuine social impact? Are you ready to join a small, experienced team of innovators and make a meaningful contribution to a growing technology company?

About the company

The company develops AI-powered sign language translation technology, grounded in leading academic research and focused on photorealistic video generation and neural machine translation.

In 2025, the company launched a real-time AI sign language translation product that is used by major live-streaming providers and live events. The organisation is currently expanding to support multiple sign languages and is improving both translation accuracy and video quality.

The research team is growing and is now recruiting a Computer Vision Research Engineer.

This role will expand the existing video generation pipeline to include Gaussian Splatting and Diffusion techniques to improve realism. Using approaches similar to recent academic work you will train models on proprietary sign language datasets for pose-to-video generation and video enhancement.

The work will involve open-source tooling and techniques for data preparation, fine-tuning and inference. You will work closely with other Computer Vision Engineers and will be managed by the Head of Research. This role will lead development of a key component of the organisation’s translation technology.

The company offers a remote working environment, a competitive equity package, and the opportunity to help build an early-stage technology business focused on social good.

Essential requirements

At least 2 years’ experience working with Diffusion models, including both training and inference
At least 3 years’ experience applying Computer Vision technologies in a commercial environment
At least 3 years’ experience using Python in a commercial environment
Experience deploying Computer Vision solutions in real-world applications, including infrastructure and scaling considerations
Experience training Computer Vision models on a compute cluster (for example using Condor or SLURM)

Desirable requirements

A degree in Computer Science or another science-based discipline
Experience with Gaussian Splatting models for human appearance (e.g. face or upper body)
Experience optimising Computer Vision models (for example fast inference, TensorRT, ONNX, distillation, or quantisation)
Research experience, such as published academic work or open-source contributions
An interest in current Computer Vision research and emerging technologies
Understanding of sign language or the Deaf community
Experience with Git, Docker and cloud platforms (e.g. AWS)

Example responsibilities

Design and build Computer Vision solutions using Gaussian Splatting and Diffusion models for generating realistic and accurate sign language videos. This will include:

Exploring both closed-source and open-source Diffusion and Gaussian Splatting models
Training a Diffusion video generation model on proprietary sign language data for pose-to-video generation or video enhancement
Training a Gaussian Splatting model on proprietary sign language data for 3D video generation and body movement modelling
Improving the scalability and latency of the video generation pipeline
Working with an in-house translation team to develop high-quality datasets for the target use case

Benefits

24 days’ holiday plus bank holidays, and a company pension scheme
Competitive compensation and equity packages
Opportunity to work on cutting-edge technologies within a high-growth organisation
Free sign language classes

Hours

This is a full-time role, with standard virtual office hours of 9am to 6pm. Flexibility is offered to accommodate reasonable personal circumstances. The organisation values strong collaboration, delivery against agreed milestones and high-quality work.

Applicants must have the right to work and live full-time in the United Kingdom.

Equality and diversity

The company is committed to eliminating discrimination and to building a diverse and inclusive team. The organisation aims for its workforce to be representative of all sections of society and for every employee to feel respected and supported.

Flexible working practices and supportive policies are in place to help employees balance their personal and professional lives and to support long-term career development.

Applicants who are native sign language users are guaranteed an interview.

Data protection

As part of the recruitment process, the company collects and processes personal data relating to job applicants. The organisation is committed to protecting the privacy and security of

About the Company

Here at Enigma, we specialize in Generative AI recruitment, specifically focused on Machine Learning and Software Engineering disciplines. With a combined experience of 20+ years, we understand the intricacies of finding the perfect role as well as the right talent for your team. But what sets Enigma apart? Our consultative approach. We don't just match candidates with job openings; we guide candidates, founders, and hiring managers through the recruitment process. Our value-added services go beyond traditional recruitment ... Know more