At Broadridge, we've built a culture where the highest goal is to empower others to accomplish more. If you’re passionate about developing your career, while helping others along the way, come join the Broadridge team.
Role Overview
We are seeking a dynamic Senior Site Reliability Engineer (SRE) to lead the design, implementation, and operational support of our hybrid environments, spanning on-premises, private cloud, and public cloud platforms. This role will be pivotal in setting the foundation and strategy for our SRE practices while driving their implementation across the organization. The ideal candidate will combine technical expertise with leadership skills to guide our team on the SRE journey and ensure our environments are scalable, reliable, and secure.
Responsibilities
You will manage applications running on Windows and Unix/Linux servers, perform application installations, modify configurations, and server maintenance.Create documentations, diagrams, procedures, turnover document for supporting productsEnsure the production applications are running healthy and inefficiencies or service availability gaps are addressedArchitect and participate in Disaster Recovery testingAutomate processes within the environment to achieve higher efficienciesWork directly with the business partners and development teams to provide leadership on project and task statusesPerform and participate in annual system readiness, capacity planning, and provide recommendations to ensure the production environments meets SLA’sYou will participate on an On-Call rotation which provide off hour supportCoordinate, support, and perform weekend changes when required to support project deliverablesYour Profile
5+ years' experience in SRE related roles and/or functional leadership role, managing systems and applications running on Windows and UnixIn-depth knowledge of Windows operating systems and RHEL/Unix operating systemsUnderstanding of tier 3 architecture design and conceptsExperience automating processing using various scripting languages, such as PowerShell and PythonKnowledge and management of software such as IIS, Apache, WebSphere, Tomcat, and Microsoft clustering technologiesExperience with change control and incident management processesKnowledge of Ansible, Chef, Jenkins, TerraformAbility to troubleshoot complex problems, providing root cause analysis and remediation to mitigate future risk with appropriate Technical and Operational staff to resolve issues.Broadridge is committed to fostering an inclusive workplace and encourages individuals of all backgrounds to apply. Join us in shaping the future of reliable and scalable technology solutions.
#LI-KA2 #LI-Hybrid
We are dedicated to fostering a collaborative, engaging, and inclusive environment and are committed to providing a workplace that empowers associates to be authentic and bring their best to work. We believe that associates do their best when they feel safe, understood, and valued, and we work diligently and collaboratively to ensure Broadridge is a company—and ultimately a community—that recognizes and celebrates everyone’s unique perspective.