Job Specifications
OCI Network Availability is seeking a Senior Network Reliability Engineer to build and operate services that enhance the availability of Oracle Cloud Infrastructure (OCI) networking.
A Network Reliability Engineer (NRE) applies an engineering-driven approach to measuring and automating network reliability in alignment with organizational service-level objectives, agreements, and goals. The NRE team is responsible for responding to network disruptions, identifying root causes, and collaborating with internal and external stakeholders to fully restore functionality. The team also focuses on automating recurring operational tasks to streamline processes, improve workflow efficiency, and increase overall productivity.
As OCI operates a global, cloud-based network, this role supports hundreds of thousands of network devices and millions of servers connected through a combination of dedicated backbone infrastructure, CLoS networks, and the public Internet.
In this role, you will:
Support the design, deployment, and operation of a large-scale global cloud computing environment (Oracle Cloud Infrastructure)
Use and contribute to procedures and tools to safely develop and execute network changes
Develop solutions that enable support teams to respond effectively to network failure conditions
Mentor junior engineers
Participate in network solution and architecture design processes
Provide break-fix support for network events, act as an escalation point for remediation, and lead post-event root cause analysis
Develop scripts to automate routine tasks for teams and business units
Coordinate with network automation service teams to develop and integrate support tooling
Work with network monitoring teams to collect telemetry data and create network event alert rules
Build dashboards to visualize and analyze network performance data
Collaborate with network vendor technical account and quality assurance teams to drive bug resolution and assist with qualification of new firmware and operating systems
Participate in an on-call rotation
Preferred qualifications:
Bachelor’s degree in computer science or a related engineering field with 5+ years of network engineering experience, or a master’s degree with 2+ years of network engineering experience
Experience working in a large ISP or cloud provider environment
Experience in a network operations role
Strong knowledge of protocols and services including MPLS, BGP, OSPF, IS-IS, TCP, IPv4, IPv6, DNS, DHCP, VxLAN, and EVPN
Extensive experience with scripting, automation, and data center design; Python preferred, but expertise in other scripting or compiled languages is acceptable
Experience with networking technologies such as TCP/IP, VPN, DNS, DHCP, and SSL
Experience with network monitoring and telemetry solutions
Experience with network modeling and programmability using technologies such as YANG, OpenConfig, and NETCONF
About the Company
We're a cloud technology company that provides organizations around the world with computing infrastructure and software to help them innovate, unlock efficiencies and become more effective. We also created the world's first - and only - autonomous database to help organize and secure our customers' data.
Oracle Cloud Infrastructure offers higher performance, security, and cost savings. It is designed so businesses can move workloads easily from on-premises systems to the cloud, and between cloud and on-premises and other ...
Know more