OVHcloud

Senior Site Reliability Engineer (Overnight)

On site

Dallas, United States

$145,000/year

Senior

Full Time

10-09-2025

Skills

Python, Go, Bash, Perl, Incident Response, DevOps, Monitoring, Configuration Management, Ansible, Linux, Operating Systems, Windows, System Administration, Virtualization, Programming, Software Development, Terraform, Grafana, Microservices

Job Specifications

Job Summary

The Senior Site Reliability Engineer (SRE) will ensure the high availability, performance, monitoring, and incident response for multiple OVHcloud products and services. This role involves supporting the reliability, configuration, and deployment of existing and new products and services. The SRE will investigate and debug errors, contribute to software development for service improvement, and automate tasks using scripts and tooling.

The standard shift for this role is Sunday-Thursday, 10:00 PM-6:00 AM CST.

Base pay range: $125,000-$145,000 (based on relevant experience)

Essential Duties & Responsibilities

This role requires working overnight shifts and participating in the on-call rotation (including weekends), providing 24/7 support
Manage and maintain essential OVHcloud infrastructures, products, and services
Diagnose errors with a data-driven approach and analyze the data to reach a resolution
Read, understand, and patch existing code as needed
Create and contribute to knowledge-based articles and instructional guides as needed
Develop scripts and tooling to automate tasks
Participate in building, deploying, and/or troubleshooting microservices software applications and other underlying APIs
Monitor alerting systems and submit configuration changes on a regular basis to ensure availability of systems and services
Install, deploy, and configure OVHcloud infrastructure as new capabilities and features are developed
Analyze available data and metrics to develop meaningful automated reports for technical teams and business leaders
Write well-documented root cause analyses, with recommendations for official documentation, to prevent future critical issues
Participate in User Acceptance Testing (UAT) for new product launches

Minimum Requirements

5+ years of relevant experience in an SRE, DevOps, programming, or similar position is required
3+ years of experience performing system administration of Linux/Unix and Windows operating systems is required
Experience performing day-to-day operational (SRE/DevOps) tasks and working with microservices and multiple APIs
Experience with languages such as Perl, Python, Bash, and Go
Experience managing a distributed, highly available, high-traffic infrastructure based on Linux is preferred
Experience with maintenance and configuration of monitoring, metrics, and logging infrastructures such as Nagios, Grafana, and OpenSearch
Experience with open-source configuration management tools such as Puppet, Ansible, and Terraform is preferred
Well-versed in cloud technologies and terminology
Experience with virtualization and container technology
Ability to prioritize, organize, and execute on competing priorities; the ability to reprioritize based on company need is critical
Bachelor's degree in computer science or a related field preferred; or equivalent experience in lieu of degree
Consistently proactive and positive attitude, with a desire to help both internal and external contacts

Working Conditions

Standard office environment

Company Description - About OVHcloud

OVHcloud US is a subsidiary of OVHcloud, a global cloud provider that specializes in delivering industry-leading performance and cost-effective solutions to better manage, secure, and scale data. OVHcloud US delivers bare metal servers, hosted private cloud, hybrid and public cloud solutions. OVHcloud manages 43 data centers across 12 sites on four continents, manufacturing its own servers, building its own data centers and deploying its own fiber-optic global network to achieve maximum efficiency. Through the OVHcloud spirit of challenging the status quo, the company brings freedom, security and innovation to solve data challenges - today and tomorrow. With a 25-year heritage, OVHcloud is committed to developing responsible technology and strives to be the driving force behind the next cloud evolution. https://us.ovhcloud.com.

EEO Statement

OVHcloud is committed to providing equal employment opportunities to all employees and applicants without regard to race, ethnicity, religion, color, sex (including childbirth, breast feeding, and related medical conditions), gender identity or expression, sexual orientation, national origin, ancestry, citizenship status, uniform service member and veteran status, marital status, pregnancy, age, protected medical condition, genetic information, disability, or any other protected status in accordance with all applicable federal, state and local laws.


About the Company

OVHcloud is a global player and the European leader in cloud services, with more than 450,000 servers in its 37 data centers spread across four continents. For 20 years, the group has relied on an integrated model that gives it full control over its value chain: from the design of its servers, through the management of its data centers, to the orchestration of its fiber-optic network. This unique approach allows it to meet, with complete independence, the needs of its 1.6 million customers ...