Saxon Global

Sr. Site Reliability Engineer

Hybrid

Arlington, United States

Full Time

25-11-2025

Skills

Leadership, Java, C#, Go, Bash, PowerShell, SQL, NoSQL, CI/CD, DevOps, Docker, Kubernetes, Monitoring, Jenkins, Azure DevOps, Test, Selenium, Test Automation, Scrum, Architecture, Programming, SQL Server, Azure, Software Development, Postman, Agile, .NET, .NET Core, Maven, CI/CD Pipelines, Terraform

Job Specifications

Job Title: Sr. Site Reliability Engineer

Job Type: Contract-to-Hire. The client intends to convert contractors to full-time employees within 6 months (potentially sooner) or within 12 months; the timing may vary.

Location: 2 days onsite in Arlington, TX | 3 days remote

Interview Process:

Codility Test:
Sample Task: Given a set of integers, create a string that organizes them in tabular form.
72 hours to complete; 75 minutes allocated; average completion time: 55 minutes.
Interview Rounds (3):
30-minute video conference interview with the Hiring Manager
60-minute technical interview with two Principal Architects
Final onsite interview with SRE Leadership

JOB SUMMARY

Added Insight: This initiative is a global digital modernization effort aimed at transforming financial services platforms to be highly available, scalable, and automated. This role will focus on migrating and optimizing applications for cloud-native architectures (Azure), implementing containerization (AKS / Kubernetes / Docker), and embedding reliability and observability into software systems through SRE practices. Key objectives include establishing SLOs / SLIs, building automated CI/CD pipelines, enhancing database performance (SQL Server / Oracle / NoSQL), and enabling enterprise-wide monitoring and incident management. The environment spans multiple regions (LATAM / Europe / China / USA / Canada), but the role will focus on building tools and best practices for the SRE function across North America. It emphasizes collaboration between development, architecture, and operations teams to deliver resilient, performant, and data-driven financial services at global scale.

Currently seeking a Sr. Site Reliability Engineer / Lead (SRE) for a contract-to-hire opportunity located in Arlington, TX 76014 (hybrid, onsite 2 days per week). The SRE will join a forward-thinking, technology-driven environment that is redefining how technology supports customers, partners, and business operations. The SRE Team leads, directs, and provides accountability for building and running large-scale software systems. The SRE will identify and deliver automation solutions designed to ensure high availability (HA) and resiliency using expertise in software development, complexity analysis, and scalable system design. The SRE will work closely with engineering teams to ensure services / systems are highly stable and performant.

Job Scope:

Collaboration / Architecture / Development – Partnering with Architecture / Development Teams, Ensuring Applications Are Highly Available / Reliable / Performant at Global Scale
Reliability Guidance – Collaborating with Architecture Team, Ensuring Reliability Factors are Accounted for in Business Features / Enablers
SLO (Service-Level Objectives) / SLI (Service-Level Indicators) Management / Implementation – Guiding Development Teams in Understanding Established Service-Level Objectives / Consequences | Implementing Appropriate SLIs to Support Objectives
Troubleshooting / Problem Resolution – Collaborating with Development Team Members to Swarm / Troubleshoot / Resolve Problems
Root Cause Analysis / Solution Planning – Guiding Ad-Hoc Teams to Brainstorm Solutions | Build Implementation Plans Based on Root Cause Analysis of Production Issues
Automation / Optimization – Designing / Building Automated Solutions to Optimize Application / Service / Platform Uptime with Minimal Human Intervention
Standards / Mentorship – Implementing / Helping Create Standards / Best Practices | Mentoring Team Members to Drive Adoption Across Development Teams

Job Requirements:

Programming / Scripting Background – Java / C# (.NET MVC / .NET Core) / Go | PowerShell / Bash
Site Reliability Engineer – Identifying / Delivering Automation Solutions, Ensuring HA / Resiliency
SLO (Service-Level Objectives) / SLI (Service-Level Indicators) Management – Defining / Implementing / Evaluating SLOs / SLIs | Associated Consequences
Pipeline Automation – Azure DevOps (YAML / ARM) / Terraform / Jenkins / Chef / Octopus Deploy | Designing / Building / Optimizing Automated Pipelines with Automated Testing / Automated Security Controls
DevOps / Containerization – AKS (Azure Kubernetes Service) / Kubernetes (Open Source) / Docker
Database Design / Optimization – Oracle / MS SQL Server / NoSQL (CosmosDB) | Designing / Evolving Database Schemas | Performing Query Performance Analysis | Indexing to Deliver Scalable / Performant Services
Code Scanning – SonarQube / Checkmarx | Configurations / CI/CD Integrations / Running Scans / Triaging, etc.
Test Automation – Xamarin UITest / SpecFlow / DevTest / Selenium / Test Data Manager / Postman / Maven / TestNG / JMeter
Root Cause Analysis / Problem Management – Performing Root Cause Analysis / Managing Problems
SCRUM / Agile Leadership – Working in SCRUM / Agile Teams | Demonstrated Success Leading Improvements

Technical Skills Requirements:

Proficiency in:

C#
.NET
SQL
Azure expertise:
AKS (Azure Kubernetes Service)
Azure Mon

About the Company

Saxon Global is one of the fastest-growing Inc. 5000 Companies in the U.S., providing enhanced IT consulting and staffing solution services for the past 20 years.