Conversation, Person, Adult, Male, Man, Head, Computer Keyboard, Face, Coat, Monitor

Senior Product Architect (Reliability & Ops)

 

Notice: Equinix is aware of scams involving fake employment offers. Read more. 

Senior Product Architect (Reliability & Ops)

  • JR-160913
  • Hybrid
  • Toronto
  • London
  • Dallas
  • Technology Enablement
  • Full time
View favorites

Who are we?

Equinix is the world’s digital infrastructure company®, shortening the path to connectivity to enable the innovations that enrich our work, life and planet. 

A place where bold ideas are welcomed, human connection is valued, and everyone has the opportunity to shape their future.

A career at Equinix means being at the center of shaping what comes next and amplifying customer value through innovation and impact. You’ll work across teams, influence key decisions, and help shape the path forward. You’ll find belonging, purpose, and a team that welcomes you—because when you feel valued, you’re empowered to do your best work.

Job Summary 

Leads the vision, strategy, and execution for the Runtime, Reliability & Operations capability domain within Equinix Engineering Excellence (E3). Owns the product portfolio and long-term roadmap for the operational platforms, reliability engineering capabilities, observability systems, and AI-assisted operational workflows that ensure resilient, scalable, and self-healing service operations across Equinix environments. 

This leader is responsible for transforming fragmented operational tooling and reactive support models into a unified, intelligent operational platform that improves system reliability, accelerates incident response, reduces operational toil, and enables autonomous operations at scale. 

The Runtime, Reliability & Operations domain is responsible for capabilities spanning observability, incident management, operational telemetry, reliability automation, service health intelligence, operational workflows, resilience engineering, AI Ops, and self-healing operational systems. 

Acts as the single-threaded product owner for the capability domain strategy, executive inspection narrative, investment priorities, operational maturity roadmap, and adoption outcomes across engineering, SRE, infrastructure, operations, and support organizations. 

The role requires balancing operational rigor and reliability engineering discipline with developer productivity, automation, scalability, and AI-native operational transformation. 

 

Responsibilities 

Capability Domain Strategy & Vision 

Defines and evolves the long-term vision, operating model, and roadmap for the Runtime, Reliability & Operations capability domain, including: 

  • Observability platforms and telemetry pipelines  

  • Incident, problem, and operational workflow automation  

  • Service health intelligence and operational analytics  

  • Reliability engineering capabilities and resilience frameworks  

  • AI Ops and event correlation systems  

  • Automated remediation and self-healing operations  

  • Operational runbooks, diagnostics, and recovery orchestration  

  • Integrated alerting, ownership, and escalation systems  

  • Synthetic monitoring and behavioral validation frameworks  

  • Runtime operational governance and operational readiness standards 

  • Reliability telemetry, SLO/SLA management, and operational reporting  

  • Establishes strategic direction aligned to Equinix reliability, operational scalability, resiliency, customer experience, and engineering productivity goals

 

Product Portfolio Ownership 

  • Owns a portfolio of operational and reliability platform products and capabilities, including roadmap prioritization, sequencing, dependency management, and adoption strategy. Ensures operational capabilities are reusable, scalable, and integrated into engineering and support workflows across the enterprise

 

Executive Inspection Leadership 

  • Partners with engineering, infrastructure, SRE, and operations leaders to shape and govern the executive inspection process for the Runtime, Reliability & Operations domain. Drives alignment between operational performance, engineering practices, infrastructure reliability, and business continuity objectives

 

Reliability & Operational Experience Leadership 

  • Represents the needs of developers, SREs, operations teams, infrastructure engineers, incident responders, and engineering managers. Partners with Voice of Developer and operational stakeholders to continuously improve usability, adoption, and operational effectiveness

 

Cross-Functional Platform Leadership 

  • Works across E3, Infrastructure, SRE, Security, Architecture, Delivery, and Operations organizations to align operational strategy and execution. Acts as the connective layer between engineering delivery, infrastructure operations, and enterprise reliability objectives

 

Financial & Investment Planning 

  • Develops business cases and investment strategies for operational and reliability platform capabilities. Understands the economics of operational scalability, incident management, reliability engineering, and AI-driven automation

 

Product & Adoption Metrics 

  • Defines and governs KPIs for Runtime, Reliability & Operations effectiveness. Uses telemetry and analytics to continuously improve reliability outcomes, operational workflows, and platform experiences

 

Agile Product & Engineering Partnership 

  • Partners closely with engineering, SRE, operations, and architecture leaders to define execution priorities and delivery sequencing. Ensures operational and reliability initiatives remain measurable, outcome-oriented, and aligned to enterprise priorities

 

Qualifications

  • Proven years of experience in product management, SRE, platform engineering, infrastructure operations, DevOps, or enterprise operational strategy  

  • Experience leading large-scale operational transformation, observability, reliability engineering, or AI Ops initiatives  

  • Strong understanding of distributed systems operations, incident management, resilience engineering, and operational automation  

  • Demonstrated experience driving cross-functional alignment across engineering, infrastructure, SRE, security, and operations organizations  

  • Experience building operational platforms that improve reliability, scalability, and engineering efficiency  

  • Strong executive communication and strategic planning skills  

  • Experience with Agile product operating models and platform portfolio management  

  • Familiarity with AI-native operational workflows, event intelligence, and autonomous remediation systems preferred  

  • Bachelor’s degree preferred 

The targeted pay range for this position in the following location is / locations are:

United States - Dallas Infomart Office DAI : 177,000 - 265,000 USD / Annual

Canada - Toronto Office TRO : 182,000 - 272,000 CAD / Annual

Our pay ranges reflect the minimum and maximum target for new hire pay for the full-time position determined by role, level, and location.The pay range shown is based on our compensation structure in place at the time of posting and may be updated periodically based on business needs. Individual pay is based on additional factors including job-related skills, experience, and relevant education and/or training.

The targeted pay range listed reflects the base pay only and does not include bonus, equity, or benefits. Employees are eligible for bonus, and equity may be offered depending on the position.

Equinix Benefits

As an employee, you become important to Equinix’s success. We ensure all your benefits are in line with our core values: competitive, inclusive, sustainable, connected and efficient. We keep them competitive within the current marketplace to ensure we’re providing you with the best package possible. So, wherever you are in your career and life, you’ll be able to enhance your experience and bring your whole self to work.

Employee Assistance Program: An Employee Assistance program is available to all employees.

US Benefits: - Insurance: You may enroll in health, life, disability and voluntary plans that are designed for you and your eligible family members. - Retirement: You and Equinix may contribute to a retirement plan to help you plan for your financial future. - Paid Time Off (PTO) and Paid Holidays: You will receive an accrued amount of PTO each pay period along with various paid holidays for you to rest and recharge. Eligibility requirements apply to some benefits. Benefits are subject to change and may be subject to specific plan or program terms. Canada Core Benefits: - Insurance: You may enroll in healthcare coverage that is designed to complement the provincial healthcare system, along with life, disability and optional benefit plans that are designed for you and your eligible family members. - Retirement: You may also enroll in Equinix-sponsored retirement or savings plans: Defined Contribution Pension Plan (DCPP), Group Retirement Savings Plan (RRSP) and Tax-Free Savings Plan (TSFA). - Vacation and Paid Holidays: Equinix offers both vacation and personal time, along with various paid holidays for you to rest and recharge. Eligibility requirements apply to some benefits. Benefits are subject to specific plan or program terms, and to change at Equinix discretion.

Equinix is committed to ensuring that our employment process is open to all individuals, including those with a disability.  If you are a qualified candidate and need assistance or an accommodation, please let us know by completing this form.

Equinix is an Equal Employment Opportunity and, in the U.S., an Affirmative Action employer.  All qualified applicants will receive consideration for employment without regard to unlawful consideration of race, color, religion, creed, national or ethnic origin, ancestry, place of birth, citizenship, sex, pregnancy / childbirth or related medical conditions, sexual orientation, gender identity or expression, marital or domestic partnership status, age, veteran or military status, physical or mental disability, medical condition, genetic information, political / organizational affiliation, status as a victim or family member of a victim of crime or abuse, or any other status protected by applicable law. 

We use artificial intelligence in our hiring process. Learn more here.

This posting is a new position within our organization.