- Company Name
- Zeta
- Job Title
- Data Reliability Engineer
- Job Description
-
**Job Title**
Data Reliability Engineer
**Role Summary**
Ensure high‑performance, secure, and reliable cloud‑native PostgreSQL databases and associated data pipelines. Monitor, troubleshoot, and optimize database and stream processing systems, automate routine tasks, and support incident resolution while enforcing security and compliance standards.
**Expectations**
- Maintain 99.9% availability and optimal performance of PostgreSQL RDS instances.
- Rapidly diagnose and resolve database, connector, and data flow issues within on‑call shift.
- Automate database management and deployment processes using IaC tools.
- Collaborate with developers and data engineers to refine schemas, queries, and pipelines for scalability and efficiency.
- Implement and audit security controls, backup, and recovery strategies per regulatory mandates.
- Continuously improve monitoring, alerting, and incident response procedures.
**Key Responsibilities**
- Monitor PostgreSQL RDS clusters (CPU, memory, storage, connections) via CloudWatch, Prometheus, etc.
- Identify performance bottlenecks; tune queries, add indexes, and adjust RDS parameters.
- Oversee Debezium, Kafka Connect, and Apache NiFi for cataloguing and addressing data capture/delivery errors.
- Manage Apache Airflow DAG execution, detect failures, and trigger re‑runs.
- Develop and maintain Terraform or Crossplane IaC templates for database provisioning, patching, and scaling.
- Participate in 24/7 on‑call rotation: incident triage, root cause analysis, and post‑mortem documentation.
- Enforce database security: access control, encryption at rest/ in transit, compliance with GDPR/HIPAA, and regular security audits.
- Design backup and disaster‑recovery plans; validate restoration procedures.
- Collaborate with development teams on schema design, SQL optimization, and data partitioning for cloud environments.
- Drive continuous improvement initiatives for reliability, scalability, and performance of cloud databases and pipelines.
**Required Skills**
- PostgreSQL administration and performance tuning.
- Experience with PostgreSQL RDS (1–2 years).
- Monitoring tools: CloudWatch, Prometheus, Grafana.
- Data pipeline technologies: Debezium, Kafka Connect, Apache NiFi.
- Workflow orchestration: Apache Airflow.
- Scripting: Python, Bash (basic).
- SQL scripting proficiency.
- Familiarity with AWS services (RDS, S3, Redshift).
- Understanding of database security best practices (encryption, IAM, GDPR, HIPAA).
- Knowledge of IaC (Terraform, Crossplane).
**Required Education & Certifications**
- Bachelor’s degree in Computer Science, Information Technology, or related field.
- 3–5 years of database administration experience focusing on PostgreSQL.
- Optional: AWS Certified Database – Specialty (preferred).
New jersey, United states
On site
Junior
15-12-2025