[Remote] Data Reliability Engineer

Remote, USA Full-time Posted 2026-06-19

Note: The job is a remote job and is open to candidates in USA. Empower is a company focused on transforming financial lives and creating a flexible and inclusive work environment. They are seeking a hands-on Data Reliability Engineer to ensure the reliability and operational excellence of their AWS-based data platform, focusing on troubleshooting and improving production data systems.

Responsibilities

Own the reliability and stability of production data pipelines and data platform services
Diagnose and resolve data pipeline failures, delays, and data quality issues in production environments
Investigate issues across distributed data systems (e.g., Spark/EMR workloads, ingestion pipelines, warehouse performance)
Lead or support incident response, including triage, mitigation, and long-term resolution
Perform root cause analysis (RCA) and implement durable fixes to prevent recurrence
Define and improve data SLAs (freshness, latency, completeness) and ensure adherence
Design and enhance monitoring, alerting, and observability for data systems
Develop automation and tooling to reduce operational toil and improve system resilience
Contribute to disaster recovery (DR) and resiliency planning, including backup validation and recovery workflows
Partner with engineering teams to improve pipeline design, reliability, and operational readiness
Create and maintain runbooks, SOPs, and operational documentation
Participate in occasional off-hours support for production data systems when required

Skills

Minimum 5 years of experience working with production data platforms in AWS environments
Prior experience building data pipelines and seeing them through production, including exposure to real-world failures and operational challenges
Strong experience with Python and SQL in real data systems
Hands-on experience troubleshooting distributed data processing systems (e.g., Spark/EMR, Redshift, streaming systems)
Proven ability to debug and resolve production issues in data pipelines and data platforms
Experience with AWS data services (such as EMR, Redshift, DynamoDB, S3, or similar)
Experience handling production incidents and performing root cause analysis
Strong problem-solving mindset and ability to work through ambiguous production issues
Experience handling real-world data issues such as pipeline delays or failures
Experience with backfills and reprocessing
Experience with late-arriving or incomplete data
Experience improving observability and alerting specifically for data systems
Experience influencing or guiding data pipeline reliability and operational practices
Exposure to streaming/event-driven systems (Kafka, Kinesis, CDC patterns)
Experience with disaster recovery, backup validation, and resiliency testing
Strong communication during incidents with both technical and non-technical stakeholders

Benefits

Medical, dental, vision and life insurance
Retirement savings – 401(k) plan with generous company matching contributions (up to 6%), financial advisory services, potential company discretionary contribution, and a broad investment lineup
Tuition reimbursement up to $5,250/year
Business-casual environment that includes the option to wear jeans
Generous paid time off upon hire – including a paid time off program plus ten paid company holidays and three floating holidays each calendar year
Paid volunteer time — 16 hours per calendar year
Leave of absence programs – including paid parental leave, paid short- and long-term disability, and Family and Medical Leave (FMLA)
Business Resource Groups (BRGs) – BRGs facilitate inclusion and collaboration across our business internally and throughout the communities where we live, work and play. BRGs are open to all.
Other necessary computer equipment, will be provided

Company Overview

EMPower is a non-governmental agency for social well being. It was founded in 1891, and is headquartered in Greenwood Village, Colorado, USA, with a workforce of 10001+ employees. Its website is https://www.empower.com.

Apply To This Job

Apply Now

[Remote] Data Reliability Engineer

Similar Jobs

[Remote] Senior Renewal Account Manager

[Remote] Senior Project Manager- Renewable Energy

[Remote] Technical Account Manager

[Remote] Managing SAP EWM Consultant

[Remote] Cross-Sell Data Scientist

[Remote] Lead Actuarial & Risk Modeling Specialist – Data Science & AI (US Healthcare)

[Remote] Data Scientist

[Remote] Localization Life Sciences Business Development Manager

[Remote] Senior Sales Engineer

[Remote] Corporate Payroll Administrator - Remote

(Online Remote Job) YouTube Moderator Job (Part Time) $26/Hour -Entry Level

Financial Analyst (PTP Exception Management)

Experienced Customer Care Specialist – Credit Services and Support

Information Technology Specialist

Regulatory Associate

Experienced Customer Service Representative-Retention – Remote Opportunity at arenaflex

Data Scientist I

Retail Sales - Fashion Show

Experienced Entry-Level Data Entry Specialist (Remote) – Flexible Work Schedule and Competitive Pay

Experienced Workers' Compensation Claims Adjuster - CA Licensed, Remote Contract Opportunity with Competitive Hourly Rate