Equinix is one of the fastest-growing data center companies, growing connectivity between clients worldwide. That’s why we're always looking for creative and visionary people who can help us achieve our goal of global interconnection. With 200 data centers in over 24 countries spanning across 5 continents, we are home to the Cloud, supporting over 1000 Cloud and IT services companies that are directly engaged in technological innovation and development. We are passionate about further evolving the specific areas of software development, software and network architecture, network operations, and complex cloud and application solutions.
At Equinix, we make the internet work faster, better, and more reliably. We hire hardworking people who thrive on solving hard problems and give them opportunities to hone new skills, try new approaches, and grow in new directions. Our culture is at the heart of our success and it’s our authentic, humble, gritty people who create The Magic of Equinix. We share a real passion for winning and put the customer at the center of everything we do.
Are you interested in solving problems and data engineering and passionate about automation for business-critical applications? Are you always excited about incubating big ideas and turning them into new products that customers would love? Do you enjoy a startup-like fast-paced environment and love working on cutting-edge innovations that can potentially disrupt the industry? Are you an ardent learner who is always looking to improve? If yes, we are interested in you and would like to discuss further details about our career opportunities.
Site Reliability Engineering Senior Manager
What is the primary need, technical challenge, and/or problem you will be responsible for?
As the leader of Global Site Reliability Engineering team, you will transition the team from a typical operations model to that of a world-class SRE team. In this role, your primary focus will be the confluence of your team and the business, which, if done effectively, will result in a highly stable and reliable and available platforms and products for our customers. Key attributes for the person in this role will include self-reflection, transparency, execution and partnership and customer centric approach.
Success in the Role:
What are the performance goals over the first 6-12 months you will work toward completing?
Success will look like the implementing Observability and achieving the SLO of 99.9 for our platforms and products. Build a team that can scale to meet the needs of the business, Site Reliability engineers who spend ~50% of their time on value added work, and a team that feels empowered, recognized, and gives everyone an opportunity to grow.
What type of work will you be doing? What assignments, requirements, or skills will you be performing on a regular basis?
You will own the driving of team processes, with SLOs and MTTRs top of mind at all times.
You will build and execute on our team’s roadmap in terms of technologies, process improvements, and team enablement.
You will partner with other leaders in Engineering teams to build the reliability engineering practice within PAE organization, Manage and Observe product line.
You will encourage the team to simplify and invest in automation or tooling that increases efficiencies and improves reliability.
You will be responsible for managing a globally distributed team, including 1:1s and leading other team meetings as needed.
What are the team values?
We are: People Centric, Customer centric, Transparent, Thorough, Continuously Improving (Product), Learning Focused (People)
A Day in the Life as a Senior Site Reliability Engineering Manager
As a Senior Site Reliability Engineering Manager of the PAE team, you will spend your days providing technical expertise to increase efficiency, reduce downtime, and optimize costs while maintaining scalability, reliability and availability at 99.9%. You’ll work closely with engineering, product, quality, security and automation teams on strategic initiatives.
Total 12 to 15 years of experience with 6 + years of people management including managing managers and globally distributed teams
Experience in operating and managing large scale distributed systems in on-premises and cloud environments
Broad and extensive proficiency of On-prem and Cloud environments, CI / CD and platform orchestration strategies, Dev-Ops and SRE methodologies, processes and tools
Demonstrated leadership in executing a short/long-term strategic vision with the ability to explore and recommend technology investment with a focus on the business’s ROI.
Must have experience developing strategic program plans, roadmaps, and estimations, including forecasting investments and projects.
Experience managing vendors, negotiating contracts, and managing PL and budgets.
Experience in systems configuration management with automation tools such as, Chef, Ansible, or Puppet
Manage technologies policies, processes, and standards to ensure consistent operations, safeguard of systems and data, and monitor compliance.
Excellent written and verbal communication and interpersonal skills with ability to motivate the team to deliver multiple projects simultaneously and meet deadlines and even ready to roll up the sleeves to analyze critical issues when required.
Ability to think critically and strategically and to collaborate effectively at all levels.
Experience with Agile methodologies and Software Development Life Cycle (SDLC)
Promote organization’s culture and values
You're now being redirected to the application website
Fill in your details
You're now being redirected to the application website
Equal Employment Opportunity:
Equinix is an Equal Employment Opportunity and Affirmative Action Employer. All qualified applicants will receive consideration for employment without regard to race, religion, color, national origin, sex, sexual orientation, gender identity, age, status as a protected veteran, or status as a qualified individual with disability.
Please click here to see the “EEO is the Law” poster and supplement.
Please click here to see our EEO Policy Statement.
Please click here to see our Pay Transparency Policy Statement.
A one-time (for each page view) session cookie to provide protection against a security attack called "Cross-site scripting (XSS)". This cookie is mandatory, short lived (one page interaction) and contains no candidate personally identifiable information.
A permanent long lived cookie that is associated with your device. This is used to associate your candidate actions to your CRM record.
A temporary session cookie (lasts for 20 minutes after your last interaction). This is used to associate your candidate actions into "visits or sessions" and is recorded against your CRM record. This includes location data (city, country) which allows us to provide more localised and relevant job recommendations and other career related content.