Job Specifications
Job Description: Senior Network SRE (London)
Senior Network SRE with 10+ years in multi-vendor routing, switching, firewalling, wireless, automation (Ansible, Salt), and observability tools (Grafana, Splunk).
Core skills: routing, switching, firewalling, wireless Automation & monitoring tools: Ansible, Salt, Grafana, Splunk.
Role Overview
We are seeking a highly experienced Senior Network Site Reliability Engineer (SRE) to join our global network operations team. This role is critical in ensuring the reliability, scalability, and performance of our network infrastructure. You will lead incident response, troubleshoot complex issues, and drive automation initiatives to maintain world-class network services.
Required Skills
Minimum 10 years' hands-on experience in network engineering and operations.
Deep expertise in routing, switching, firewalling, and wireless across multiple vendors.
Strong troubleshooting skills, including overlay/underlay network understanding.
Proficiency in Linux/Unix environments.
Experience with automation and monitoring platforms.
Ability to work independently, set technical direction, and mentor others.
Key Responsibilities
Lead Incident Management: Own and resolve critical network incidents, manage outages, and provide expert guidance during high-pressure situations.
Advanced Troubleshooting: Diagnose and resolve complex issues across routing, switching, firewalling, and wireless domains.
Technical Leadership: Set technical direction, mentor junior engineers, and foster a culture of operational excellence.
24/7 Operations: Participate in a shift-based model to ensure continuous availability of critical network services.
Multi-Vendor Expertise: Operate across diverse environments including Arista, Cisco, Cumulus, Spectrum Ethernet, InfiniBand, Palo Alto, Check Point, Mist, Aruba, A10, Netscaler, and F5.
Security & Segmentation: Support network segmentation, policy enforcement, and VPN solutions (GlobalProtect, AnyConnect).
Automation & Observability: Utilize tools like Grafana, Big Panda, ServiceNow, ITMP, syslog, Splunk, Salt, Ansible, and Prometheus to enhance monitoring and automation.
Innovation Projects: Collaborate on wireless design and AI cluster deployments to support cutting-edge initiatives.
Preferred Skills
Experience with InfiniBand and AI cluster deployments.
Familiarity with network asset management systems (e.g., Nautobot).
Wireless design experience with Cisco, Mist, Aruba.
About the Company
W3Global is a leading provider of end-to-end consulting services, empowering businesses to achieve their strategic goals and optimize their operations. With over 15 years of experience, we have a proven track record of delivering innovative and effective solutions across a wide range of industries.
Our Mission
At W3Global, we are committed to helping businesses of all sizes achieve their full potential. We believe that by combining our deep industry expertise with our innovative approach, we can deliver exceptional results...
Know more