Jersey City, NJ, US
17 days ago
Principal Site Reliability Engineer
Job Description:

The Role

As a member of the TechOps SRE team, you'll work closely with our engineering partners to help enable and drive initiatives from design to implementation. Our highly available multi-region Kubernetes (AWS EKS) environments are best-in-class and central to our enterprise-grade infrastructure strategy. These growing environments currently support numerous mission-critical workloads. In this exciting role, you’ll have the opportunity to further develop and refine your skills, collaborate across numerous Fidelity teams, and continue to grow in a fun, collaborative, and rapidly changing environment. This is a phenomenal opportunity to have a direct impact on the emerging strategies of our infrastructure and deployments, while at the same time, helping enable the expansion of our business.

The Skills and Expertise You Bring

5+ years of hands-on experience with AWS in a production environment
Experience building and deploying Docker images including Docker Compose
Production experience running Kubernetes workloads ideally on AWS EKS
Experience managing and maintaining Kubernetes Clusters on AWS EKS
Experience with Confluent or Kafka
Experience creating and deploying Helm charts & libraries
Hands-on experience with Jenkins Core, including authoring and maintaining declarative CI/CD pipelines and libraries
Experience with monitoring tools e.g., CloudWatch, Datadog & Splunk Cloud
Proficiency with UNIX operating systems and shell scripting
Experience with Amazon Web Services (AWS), having managed services and applications in a large AWS cross-account environment using IAM and federated SSO
Experience crafting and maintaining logging, monitoring, and alerting capabilities using tools like Datadog and Splunk
Ability to communicate at all levels with track record of strong written and verbal communications
See problems as opportunities to automate
Ability to work independently with minimal direction
Drive and champion the overall design of highly available, secure, scalable microservices-based applications in AWS
Track record of providing technical leadership to strong teams of Site Reliability Engineers / Cloud Engineers
Experience with configuring and deploying resilient infrastructure in multiple regions and multiple availability zones
Work multi-functionally with other organizations and collaborate with our risk, product and engineering team leaders
Leading the initiative to craft and deploy our applications to the cloud
Promoting a DevOps mentality, providing mentorship and establishing development standard methodologies for AWS infrastructure-as-code
Championing automation tools to improve software delivery and reduce risk
Production experience with infrastructure-as-code (IaC), Terraform preferred
Programming experience, e.g., Python preferred
Experience with distributed version control systems, Git preferred
Experience with Apache or Confluent Kafka a plus
Experience with the agile software development lifecycle and Kanban preferred
Experience with CDN Providers e.g., Akamai preferred

The Team

Fidelity Digital Assets℠, a Fidelity Investments Company, is developing a full-service enterprise-grade platform for storing, trading, and servicing digital assets, such as Bitcoin and Ethereum.

Fidelity Digital Assets℠ embraces an entrepreneurial culture and startup mindset while serving as one of the most innovative business units within Fidelity Investments. Our global, diverse team of hundreds of forward-thinking professionals leads with agility and creativity to build solutions that bridge the gap between traditional institutional investors and their exposure to digital assets. The firm’s tenure and experience across multiple business lines present our employees with unprecedented access to knowledge, technology, and resources that help our team reshape the future of finance.

Within Fidelity Digital Assets℠, the Technical Operations team is central to our initiative of moving to the cloud. The team uses AWS services to secure our network and scale our applications to ensure their uptime and reliability. Team members are hands-on Site Reliability Engineers who promote a DevOps approach, with a focus on infrastructure-as-code, security, and automation.

The base salary range for this position is $85,000-$179,000 per year.  

Placement in the range will vary based on job responsibilities and scope, geographic location, candidate’s relevant experience, and other factors.

Base salary is only part of the total compensation package. Depending on the position and eligibility requirements, the offer package may also include bonus or other variable compensation.   

We offer a wide range of benefits to meet your evolving needs and help you live your best life at work and at home.  These benefits include comprehensive health care coverage and emotional well-being support, market-leading retirement, generous paid time off and parental leave, charitable giving employee match program, and educational assistance including student loan repayment, tuition reimbursement, and learning resources to develop your career.  Note, the application window closes when the position is filled or unposted.

Please be advised that Fidelity’s business is governed by the provisions of the Securities Exchange Act of 1934, the Investment Advisers Act of 1940, the Investment Company Act of 1940, ERISA, numerous state laws governing securities, investment and retirement-related financial activities, and the rules and regulations of numerous self-regulatory organizations, including FINRA, among others. Those laws and regulations may restrict Fidelity from hiring and/or associating with individuals with certain criminal histories.
