- Career Center Home
- Search Jobs
- Executive Director, Digital SRE & Operations
Results
Job Details
Explore Location
CVS Health
Austin, Texas, United States
(on-site)
Posted
1 day ago
CVS Health
Austin, Texas, United States
(on-site)
Job Type
Full-Time
Job Function
Other
Executive Director, Digital SRE & Operations
The insights provided are generated by AI and may contain inaccuracies. Please independently verify any critical information before relying on it.
Executive Director, Digital SRE & Operations
The insights provided are generated by AI and may contain inaccuracies. Please independently verify any critical information before relying on it.
Description
We're building a world of health around every individual - shaping a more connected, convenient and compassionate health experience. At CVS Health®, you'll be surrounded by passionate colleagues who care deeply, innovate with purpose, hold ourselves accountable and prioritize safety and quality in everything we do. Join us and be part of something bigger - helping to simplify health care one person, one family and one community at a time.The Executive Director, Site Reliability Engineering (SRE) will lead the strategy, execution, and evolution of enterprise-scale reliability, availability, and operational excellence across the Digital Technology organization.
This role is accountable for end-to-end reliability of web, mobile, API, platform, and AI-enabled systems that serve millions of users. The Executive Director will establish modern SRE practices, AI-driven operations (AIOps), DevOps automation, and reliability-by-design principles-ensuring platforms are resilient, scalable, secure, and cost-efficient.
You will partner closely with Digital Platform Engineering, Digital Experience, AI Platform, Client Integrations, Security, and Architecture to embed reliability into every layer of the digital ecosystem.
Responsibilities:
1. SRE Strategy & Reliability Leadership
• Define and own the enterprise SRE strategy, including SLOs, SLIs, error budgets, and reliability roadmaps.
• Establish reliability standards and practices across web, mobile, backend services, APIs, data platforms, and AI workloads.
• Drive a culture of reliability-by-design and operational excellence across engineering teams.
2. AI-Driven Operations (AIOps) & Automation
• Lead adoption of AIOps capabilities for proactive issue detection, alert noise reduction, and predictive failure prevention.
• Implement AI-assisted incident triage, automated runbooks, root-cause analysis, and self-healing systems.
• Partner with the AI Platform team to integrate LLMs and ML models into operational workflows (log summarization, anomaly detection, remediation).
3. Observability & Monitoring
• Own enterprise observability strategy across metrics, logs, traces, and user experience monitoring.
• Standardize tooling and practices using platforms such as Datadog, Splunk, Prometheus, Grafana, OpenTelemetry.
• Deliver real-time dashboards and executive reporting on uptime, performance, latency, and error budgets.
4. DevOps, CI/CD & Release Reliability
• Partner with DevOps and Platform teams to ensure safe, automated, and scalable CI/CD pipelines.
• Enable progressive delivery patterns (blue/green, canary, feature flags) to minimize blast radius.
• Ensure quality gates, rollback mechanisms, and deployment automation are embedded into delivery pipelines.
5. Incident Management & Operational Excellence
• Lead enterprise incident response, escalation, and post-incident learning (blameless postmortems).
• Reduce MTTR, MTTD, and incident frequency through automation and preventive engineering.
• Establish runbooks, on-call models, and operational readiness reviews.
6. Cloud Reliability & FinOps
• Ensure reliability and scalability across cloud environments (Azure, GCP, AWS).
• Partner with Finance and Platform Engineering to drive FinOps, cost transparency, and capacity planning.
• Optimize performance, availability, and cost across high-traffic digital workloads.
7. Leadership & Talent Development
• Build, mentor, and lead global SRE teams, managers, and technical leaders.
• Define SRE career paths, skill frameworks, and training programs.
• Foster a culture of learning, accountability, and continuous improvement.
Required Qualifications
• 18+ years of experience in software engineering, platform operations, or site reliability engineering.
• 8+ years leading large-scale SRE, DevOps, or platform reliability organizations.
• Experience leveraging AI/ML for operations, including anomaly detection, predictive alerts, log analysis, or automated remediation.
• Familiarity with AIOps tools such as Datadog Watchdog, Dynatrace Davis, Splunk AI, Elastic AIOps, or custom ML/LLM solutions.
• Understanding of how to safely operate and monitor AI-enabled production systems.
• Deep expertise in distributed systems, cloud infrastructure, and high-availability architectures.
• Strong knowledge of SRE principles, DevOps, and reliability engineering at scale.
• Experience implementing AIOps or AI-driven operational tooling.
• Executive-level communication skills with the ability to influence senior leaders and business stakeholders.
• Experience operating mission-critical digital platforms serving millions of users.
• Background in regulated industries (healthcare, financial services, insurance).
• Experience partnering with platform engineering, AI teams, and enterprise architecture.
Education
Bachelor's degree in Computer Science, Engineering, or a related technical field, or equivalent practical experience.
Master's degree preferred.
Pay Range
The typical pay range for this role is:
$175,100.00 - $334,750.00
This pay range represents the base hourly rate or base annual full-time salary for all positions in the job grade within which this position falls. The actual base salary offer will depend on a variety of factors including experience, education, geography and other relevant factors. This position is eligible for a CVS Health bonus, commission or short-term incentive program in addition to the base pay range listed above. This position also includes an award target in the company's equity award program.
Our people fuel our future. Our teams reflect the customers, patients, members and communities we serve and we are committed to fostering a workplace where every colleague feels valued and that they belong.
Great benefits for great people
We take pride in our comprehensive and competitive mix of pay and benefits - investing in the physical, emotional and financial wellness of our colleagues and their families to help them be the healthiest they can be. In addition to our competitive wages, our great benefits include:
- Affordable medical plan options, a 401(k) plan (including matching company contributions), and an employee stock purchase plan.
- No-cost programs for all colleagues including wellness screenings, tobacco cessation and weight management programs, confidential counseling and financial coaching.
- Benefit solutions that address the different needs and preferences of our colleagues including paid time off, flexible work schedules, family leave, dependent care resources, colleague assistance programs, tuition assistance, retiree medical access and many other benefits depending on eligibility.
For more information, visit https://jobs.cvshealth.com/us/en/benefits
We anticipate the application window for this opening will close on: 03/31/2026
Qualified applicants with arrest or conviction records will be considered for employment in accordance with all federal, state and local laws.
Job ID: 82487334
Jobs You May Like
Median Salary
Net Salary per month
$4,904
Cost of Living Index
67/100
67
Median Apartment Rent in City Center
(1-3 Bedroom)
$2,119
-
$3,831
$2,975
Safety Index
56/100
56
Utilities
Basic
(Electricity, heating, cooling, water, garbage for 915 sq ft apartment)
$101
-
$300
$190
High-Speed Internet
$50
-
$100
$67
Transportation
Gasoline
(1 gallon)
$2.73
Taxi Ride
(1 mile)
$2.61
Data is collected and updated regularly using reputable sources, including corporate websites and governmental reporting institutions.
Loading...