Job Specifications
hackajob is collaborating with Comcast to connect them with exceptional tech professionals for this role.
FreeWheel, a Comcast company, provides comprehensive ad platforms for publishers, advertisers, and media buyers. Powered by premium video content, robust data, and advanced technology, we’re making it easier for buyers and sellers to transact across all screens, data types, and sales channels. As a global company, we have offices in nine countries and can insert advertisements around the world.
Job Summary
FreeWheel is seeking a Junior SRE (SRE 2) to join the FreeWheel OPS team based in Denver, CO or Chicago, IL. As a member of the Global Operations team, you will be responsible for ensuring the reliability, scalability, and performance of FreeWheel systems. Working closely with engineers and other operations sub-teams, you will manage infrastructure, optimize system reliability, automate daily operations, and resolve technical issues that impact upstream and downstream platforms.
Job Description
Key Responsibilities:
System Monitoring and Optimization: Design and implement monitoring and alerting systems to ensure the stability, reliability, and performance of data platforms. Participate in on-call shifts to quickly respond to and resolve issues.
Automation and Tool Development: Develop and maintain automation tools and scripts for deployment, monitoring, backup and disaster recovery.
Performance Optimization: Analyze and optimize data storage, query performance, and data flows to ensure efficient processing of large-scale datasets, reduce latency, and improve processing speed.
Incident Response and Troubleshooting: Respond quickly to platform failures, perform troubleshooting, and coordinate cross-team efforts to resolve issues and ensure high availability and reliability of data.
Capacity Planning and Scaling: Work with engineering teams to analyze and forecast capacity requirements, ensuring the system can handle traffic growth and scale infrastructure accordingly. Support FreeWheel-powered live events.
Cloud Access Management & Governance: Maintain consistent cloud standards and support enforcement of governance and compliance practices across cloud environments.
Documentation and Knowledge Sharing: Document the architecture, configurations, and operational procedures for platforms, ensuring knowledge is shared across the team and providing relevant training.
Security and Compliance: Ensure platforms meet security standards and compliance requirements to prevent breaches or misuse.
Cross-Team Collaboration: Collaborate with the engineering, product, and project management teams to support product design and implementation and to solve reliability-related issues.
Qualifications
1-3 years of experience as an SRE, DevOps or Operations Engineer.
Experience with cloud platforms (e.g. AWS, OCI, GCP, Azure).
Hands-on experience with Terraform and infrastructure as code (IaC) principles.
Proficiency in automation tools and frameworks (e.g. Ansible, Terraform, Kubernetes, Docker) for automating system deployment and maintenance.
Familiarity with modern data architectures and technologies, including big data platforms (e.g., Kafka, Hadoop, Spark), distributed storage (e.g., Cassandra, HDFS, AWS S3), etc.
Extensive experience in database management (e.g. NoSQL databases, MySQL, PostgreSQL).
Programming Skills: Proficient in at least one programming language, such as Python, Go, Java, or Scala, with the ability to write efficient scripts and automation tools.
System Monitoring and Log Management: Familiar with using monitoring and log management tools such as Prometheus, Grafana, ELK Stack, or other similar tools.
Troubleshooting and Debugging: Strong debugging and troubleshooting skills, with the ability to quickly identify and resolve production issues.
Team Collaboration and Communication: Excellent communication skills with the ability to convey technical information clearly and concisely to both technical and non-technical stakeholders.
Proactive learner eager to grow in operations and governance.
Education: Bachelor’s degree or higher in Computer Science, Software Engineering, or a related field.
We offer SRE positions in three areas: SRE2, SRE2-Data, and SRE2-CloudENG. While each area has a slightly different day-to-day focus depending on the development teams it supports, the core responsibilities and requirements remain consistent. For candidates who would like to focus on the SRE2-CloudENG area, the responsibilities and qualifications will focus more on cloud environment governance and Infrastructure as Code (IaC).
Employees At All Levels Are Expected To
Understand our Operating Principles; make them the guidelines for how you do your job.
Own the customer experience - think and act in ways that put our customers first, give them seamless digital options at every touchpoint, and make them promoters of our products and services.
Know your stuff - be enthusiastic learners, users and advoca