cover image
Ubique Systems

SRE – Engineer

Hybrid

Birmingham, United kingdom

Mid level

Freelance

09-10-2025

Share this job:

Skills

Python Java TypeScript GitLab CI/CD Docker Kubernetes Monitoring Jenkins Programming Azure node.js AWS cloud platforms Agile Gitlab CI Terraform Prometheus Grafana GitLab CI/CD Infrastructure as Code

Job Specifications

Platform Engineering Teams is responsible for building integrated, scalable, and robust enterprise journeys. We are currently looking for a Senior Software Engineer with deep expertise in TypeScript, Java, or another object-oriented programming (OOP) language, Kubernetes (K8s), AWS, and GitLab CI/CD. Familiarity with Dynatrace is a plus.

About the Role

As a Senior SRE Engineer, you will be a hands-on technical expert driving the reliability, scalability, and availability of the engineering platform. Working collaboratively across teams, you will develop and implement automated solutions, address operational challenges, and ensure the platform's robust performance. This role demands strong technical acumen, a proactive mindset, and the ability to influence platform improvements through technical excellence.

Job Responsibilities

Platform Stability and Reliability

Ensure the platform meets performance, availability, and reliability SLAs.
Proactively identify and resolve performance bottlenecks and risks in production environments.
Maintain and improve monitoring, logging, and alerting frameworks to detect and prevent incidents.

Incident Management

Act as the primary responder for critical incidents, ensuring rapid mitigation and resolution.
Conduct post-incident reviews and implement corrective actions to prevent recurrence.
Develop and maintain detailed runbooks and playbooks for operational excellence.

Automation and Efficiency

Build and maintain tools to automate routine tasks, such as deployments, scaling, and failover.
Contribute to CI/CD pipeline improvements for faster and more reliable software delivery.
Write and maintain Infrastructure as Code (IaC) using tools like Pulumi or Terraform to provision and manage resources.

Collaboration and Mentorship

Collaborate with SRE, CI/CD, Developer Experience, and Templates teams to improve the platform's reliability and usability.
Mentor junior engineers by sharing knowledge and best practices in SRE and operational excellence.
Partner with developers to integrate observability and reliability into their applications.

Observability and Metrics

Implement and optimize observability tools like Dynatrace, Prometheus, or Grafana for deep visibility into system performance.
Define key metrics and dashboards to track the health and reliability of platform components.
Continuously analyze operational data to identify and prioritize areas for improvement.

Qualifications

Required:

5+ years of experience in site reliability engineering, software engineering, or a related field.
Demonstrated expertise in managing and optimizing cloud-based environments, with 3+ years of experience in AWS.
Strong programming skills in one or more languages: Python, Java, Node.js, or TypeScript.
Hands-on experience with containerization and orchestration technologies (e.g., Kubernetes, Docker).
Proficiency in CI/CD practices and tools, such as GitLab, Jenkins, or similar.
Familiarity with monitoring, logging, and alerting tools; experience with Dynatrace is a plus.

Preferred:

Hands-on experience with Kubernetes (K8s) for container orchestration and deployment.
Familiarity with monitoring and observability tools like Dynatrace, Prometheus, or similar.
Exposure to agile development practices and collaborative environments.
Experience working with other cloud platforms (e.g., Azure or Google Cloud) is a plus.

About the Company

Ubique Systems is a fast growing multifaceted organization which offers a comprehensive array of outsourcing and consulting services for its customers, including recruitment, human resource management, vendor management, and outplacement services and software development on a global basis, with an objective to adopt the flexible global business practices that today enable companies to operate more efficiently and produce more value. We're a global leader in business and technology services, helping our clients bring the fut... Know more