Draper, UT, USA
3 days ago
Staff Service Reliability Engineer

It's fun to work in a company where people truly BELIEVE in what they're doing!

We're committed to bringing passion and customer focus to the business.

Corporate Overview
Proofpoint is a leading cybersecurity company protecting organizations’ greatest assets and biggest risks: vulnerabilities in people. With an integrated suite of cloud-based solutions, Proofpoint helps companies around the world stop targeted threats, safeguard their data, and make their users more resilient against cyber-attacks. Leading organizations of all sizes, including more than half of the Fortune 1000, rely on Proofpoint for people-centric security and compliance solutions mitigating their most critical risks across email, the cloud, social media, and the web.
 

We are singularly devoted to helping our customers protect their greatest assets and biggest security risk: their people. That’s why we’re a leader in next-generation cybersecurity.
Protection Starts with People.  Proofpoint.

As a Staff Service Reliability Engineer at Proofpoint, you will develop a deep understanding of the various services and applications that come together to deliver Proofpoint’s next-generation security products. You will contribute to the architecture to improve scalability, operability, service reliability, capacity, and performance. You will be responsible for provisioning, maintaining, and scaling our production services and server farms across a hybrid cloud environment. You bring a ‘cloud first’ mentality to the table and will work cross-functionally to improve the automation and orchestration platforms needed to scale the business. We are looking for passion, curiosity, attention to detail, pride in your work, ownership, and ideas and opinions of your own. If you’re an enthusiastic team player who cares about the infrastructure, remains calm in a crisis, collaborates cross-functionally, and easily writes code for automation, we want to talk to you.

Your day-to-day 

Build long-lasting, effective partnerships across the organization to foster collaboration between Product, Engineering and Operations teams.

Organize and manage multiple simultaneous projects.

Lead by example, care for your team, and establish credibility with the quality of your and your team's technical execution.

Mentor junior members of the team and foster a service ownership mentality.

Manage an international 24x7, multi-site production infrastructure powering the Proofpoint services, including deployment, maintenance, troubleshooting, performance tuning, and security.

Root-cause complex scaling and performance problems that involve multiple stakeholders and span network, hardware, and software.

Ensure proper monitoring, alerting, capacity planning and reporting in the production environment.

Contribute to the evolving design and architecture of reliable and scalable infrastructure.

Develop processes, tools, and documentation in support of production operations.

Evaluate new software, hardware and infrastructure solutions.

Participate in an on-call rotation and be willing to jump on escalated issues as needed.

What you bring to the team 

Demonstrable skills and 10+ years’ experience managing, troubleshooting, and tuning Linux systems.

Demonstrable experience working in a high-volume, large-deployment, multi-datacenter / multi-cloud environment.

Experience automating management of systems and applications using common frameworks, platforms and coding languages.

Experience with observability technologies such as OpenTelemetry and Prometheus or similar.

Deep experience with industry-standard foundation technologies such as TCP/IP, HTTP, DNS, SMTP, and LDAP.

Experience in management of a large distributed computing environment.

Experience with common virtualization platforms – KVM, VMware vSphere, ESX, ESXi, and vCenter.

Experience with multiple cloud providers and technologies including AWS, GCP and Azure.

Experience with containers and container orchestration platforms such as Kubernetes.

Excellent verbal and written communication skills.

Experience with monitoring and alerting systems.

Experience with industry-standard operational practices such as change management, incident management, and working in colocation datacenters.

Extensive experience with configuration management such as Puppet or Chef.

Experience with load-balancing technologies – F5, NetScaler or similar.

If you like wild growth and working with happy, enthusiastic over-achievers, you'll enjoy your career with us!

Consistent with Proofpoint values and applicable law, we provide the following information to promote pay transparency and equity. Our compensation reflects the cost of labor across several U.S. geographic markets, and we pay differently based on those defined markets as set out below. Pay within these ranges varies and depends on job-related knowledge, skills, and experience. The actual offer will be based on the individual candidate. The range provided may represent a candidate range and may not reflect the full range for an individual tenured employee. This role may be eligible for variable pay and/or equity. We offer a competitive benefits package that includes flexible time off, a robust well-being program that provides for 4 global well-being days per year, and a 3-week work-from-anywhere option.

Base Pay Ranges:

SF Bay Area, New York City Metro Area:

Base Pay Range: 157,650.00 - 231,220.00 USD

California (excludes SF Bay Area), Colorado, Connecticut, Illinois, Washington DC Metro, Maryland, Massachusetts, New Jersey, Texas, Washington, Virginia, and Alaska:

Base Pay Range: 129,000.00 - 189,200.00 USD

All other cities and states excluding those listed above:

Base Pay Range: 117,600.00 - 172,480.00 USD