Job Specifications
Job Title: SRE - Director
Location - Dallas, Tx US (ONSITE)
Roles Descriptions:
SRE - Director to lead our global SRE team in building scalable, resilient, and highly available systems. This role combines deep technical expertise with strong leadership, strategic thinking, and a passion for delivering exceptional customer experiences through operational excellence.
As SRE Director, this individual will champion automation initiatives for SRE operations, aiming to enhance the performance and reliability of infrastructure and critical services. The role involves close collaboration with engineering, product, security, and operations teams to define and implement reliability best practices organization-wide.
Key Responsibilities:
Leadership & Strategy
Build and lead a high-performing team of SREs Tools across various Business segment.
Define and execute the SRE strategy aligned with business and engineering goals.
Foster a culture of reliability, observability, and performance.
Reliability Engineering
Own SLAs/SLOs/SLIs for key services and ensure they are met consistently.
Drive incident management practices, root cause analysis (RCA), and continuous improvement.
Oversee reliability tooling, runbooks, and automation frameworks.
Platform & Infrastructure
Partner with Infrastructure, DevOps, and Cloud teams to ensure scalable platform architecture.
Guide the adoption of Infrastructure-as-Code (IaC), CI/CD pipelines, and modern observability tools.
Drive cost optimization and efficient resource utilization in cloud environments.
Collaboration & Communication
Act as a reliability evangelist across engineering teams, enabling them to own and improve their services.
Report reliability and performance metrics to leadership and stakeholders.
Collaborate closely with security, compliance, and governance teams to meet regulatory requirements.
Qualifications:
Bachelor's or master's degree in computer science, Engineering, or related field.
15+ years of experience in software engineering or infrastructure roles, with at least 5+ years in SRE or DevOps leadership.
Proven success managing high-availability, large-scale distributed systems (e.g., microservices, cloud-native apps).
Deep understanding of cloud platforms (AWS GCP), containers (Docker, Kubernetes), monitoring (Prometheus, Grafana, Datadog, new relic), and automation tools (Terraform, Ansible, etc.).
Experience with modern CI/CD tools (e.g., Jenkins, ArgoCD, GitHub Actions).
Strong leadership, communication, and team development skills.
Preferred Qualifications:
Experience in regulated industries (e.g., Telecom, communications) and Global telco leaders.
Certifications in cloud platforms (AWS Certified DevOps Engineer, Google SRE Certificate, etc.).
Experience managing hybrid or multi-cloud environments.
Worked as senior role in Top 5 Consultancy companies.
About the Company
Established in 1994, TechnoSphere is a Global IT Solutions and Services provider, specializing in Digital Transformation, Software Consulting, Business Analytics, AI, and Cloud Computing. With an impressive annual revenue of 114 million USD, we are committed to delivering innovative solutions that drive success for our customers and partners. Our strength lies in our team of creative and strategic thinkers who are spread across 15 global offices, including the United States, Canada, India, Europe, Philippines, Brazil, Singap...
Know more