Senior Staff Site Reliability Engineer

Posted 8 Days Ago
Remote
Senior level
Fintech • Payments
The Role
The Senior Staff Site Reliability Engineer will lead and mentor SRE teams, design and implement scalable systems, optimize performance, manage incident responses, and ensure compliance and security within the organization. They will also focus on automation tools and collaborate closely with software development teams.
Summary Generated by Built In

About the Role
The WEX Site Reliability Engineering (SRE) team is seeking a Senior Staff SRE who is passionate about developing software and solutions focused on observability, incident response, reliability and performance, operational excellence, and compliance. The team will be part of the Benefits Reliability organization which supports our internal stakeholders and our Benefits Platform teams. As part of the Benefits Reliability organization you’ll have the opportunity to solve complex challenges and improve the quality of life of our engineering teams as well as our ability to service our customers.

 

The ideal candidate will be a technical leader with a proven track record of designing, implementing, and managing complex systems at scale. They will have a deep understanding of software development, cloud computing, and operational best practices. The Senior Staff SRE will work closely with engineering teams to ensure that our systems are reliable, performant, and secure.
 

How you’ll make an impact

  • Technical Leadership: Provide technical guidance and mentorship to other SREs and engineers. Lead the design and implementation of complex systems and solutions. Drive the adoption of SRE best practices across the organization.

  • System Design: Architect and implement highly available, scalable, and fault-tolerant systems. Optimize system performance and resource utilization. Proactively identify and mitigate risks to system reliability.

  • Incident Response: Lead incident response efforts, driving efficient resolution and post-incident analysis. Develop and implement processes to improve incident response capabilities.

  • Automation and Tooling: Design and develop automation tools to streamline operational tasks, improve system reliability, and reduce toil. Utilize monitoring and observability tools to gain deep insights into system behavior.

  • Collaboration: Work closely with development teams to ensure software design meets operational requirements. Foster a culture of collaboration and knowledge sharing across teams.

  • Capacity Planning & Performance Optimization: Forecast future capacity needs and implement strategies to ensure systems scale efficiently. Continuously identify performance bottlenecks and lead efforts to optimize system performance.

  • Security & Compliance: Champion security best practices and ensure that systems are designed and operated in compliance with industry standards and regulations.

  • Innovation: Stay current with emerging technologies and industry trends. Evaluate and introduce new tools and techniques to improve SRE practices and system reliability.

Experience you’ll bring

  • 7+ years of hands-on experience as a Site Reliability Engineer or equivalent role

  • 7+ years of development experience with at least one major programming language

  • Expert-level knowledge of Cloud Computing platforms (AWS and Azure)

  • Proven ability to lead complex technical projects and initiatives

  • Strong communication and collaboration skills, with the ability to influence and build consensus

  • Deep understanding of observability, logging, and monitoring technologies

  • Experience with a variety of RDBMS and NoSQL data stores

  • Expertise in containerization technologies such as Docker and Kubernetes

  • Expertise in infrastructure as code

  • Experience designing and building RESTful APIs

  • Extensive hands-on experience with (Datadog, Splunk, or other tooling)

  • Familiarity with Agile methodologies and practices

  • Extensive experience in providing and leading critical application support in a 24/7/365 high-availability environment.

  • Experience with GitOps

  • BA/BS degree in Computer Science or related technical field, or equivalent job experience

This Senior Staff SRE role offers a unique opportunity to make a significant impact on the reliability and performance of WEX's critical Benefits systems. You will play a key role in shaping the future of SRE at WEX and driving innovation across the organization.


 

The base pay range represents the anticipated low and high end of the pay range for this position. Actual pay rates will vary and will be based on various factors, such as your qualifications, skills, competencies, and proficiency for the role. Base pay is one component of WEX's total compensation package. Most sales positions are eligible for commission under the terms of an applicable plan. Non-sales roles are typically eligible for a quarterly or annual bonus based on their role and applicable plan. WEX's comprehensive and market competitive benefits are designed to support your personal and professional well-being. Benefits include health, dental and vision insurances, retirement savings plan, paid time off, health savings account, flexible spending accounts, life insurance, disability insurance, tuition reimbursement, and more. For more information, check out the "About Us" section.Pay Range: $156,000.00 - $208,000.00

Top Skills

C#
Go
Java
Python
The Company
HQ: Portland, ME
4,900 Employees
On-site Workplace

What We Do

We simplify complex payment systems for fleets, corporate payments, and healthcare—unlocking insights, opportunities, and efficiencies to give you greater control of your business.

Powered by the belief that complex payment systems can be made simple, WEX (NYSE: WEX) is a leading financial technology service provider across a wide spectrum of sectors, including fleet, travel and healthcare. WEX operates in more than 10 countries and in more than 20 currencies through approximately 4,900 associates around the world. WEX fleet cards offer approximately 14 million vehicles exceptional payment security and control; our travel and corporate solutions business processes over $35 billion of purchase volume annually; and the WEX Health financial technology platform helps 343,000 employers and more than 28 million consumers better manage healthcare expenses.

Similar Jobs

BlackLine Logo BlackLine

Senior Site Reliability Engineer

Cloud • Fintech • Information Technology • Machine Learning • Software • App development • Generative AI
Remote
Hybrid
Bengaluru, Karnataka, IND
1810 Employees
Easy Apply
Remote
India
100 Employees
Remote
India
740 Employees

Guidewire Software Logo Guidewire Software

Site Reliability Engineer (SRE) - Guidewire Cloud Platform Tenancy

Cloud • Information Technology • Insurance • Software • Analytics
Remote
Hybrid
Bangalore, Bengaluru, Karnataka, IND
3400 Employees

Similar Companies Hiring

CSC Thumbnail
Software • Legal Tech • Fintech • Financial Services • Data Privacy • Cybersecurity
Wilmington, DE
8000 Employees
TransUnion Thumbnail
Information Technology • Fintech • Financial Services • Cybersecurity • Business Intelligence • Big Data Analytics • Big Data
Chicago, IL
13000 Employees
Navan Thumbnail
Travel • Software • Productivity • Payments • Information Technology • Fintech • Automation
Palo Alto, CA
3000 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account