Assume a critical role in defining the future of a globally recognized firm and have a direct and significant effect in a realm tailored for top achievers in site reliability.
As a Lead Site Reliability Engineer at JPMorgan Chase within the AIML Data Platforms team, you'll play a pivotal role in shaping the future of our globally recognized firm. You'll hold a leadership role in your team, demonstrating strong knowledge across multiple technical domains, and advising others on the technical and business issues they face. You'll lead resiliency design reviews, break down complex problems into manageable tasks for other engineers, act as a technical lead for medium to large-sized products, and provide advice and mentoring to other engineers. This role offers you the opportunity to directly impact our site reliability and to grow professionally within a dynamic and collaborative environment.
Job responsibilities
Conceptualize, design, and implement solutions to enhance the reliability and scalability of platforms and applications.Analyze defects, propose improvements, and implement fault-tolerant and resilient systems.Optimize the performance and utilization of AI ML platform and infrastructure.Develop observability, security, automation, and fin-ops tools and orchestration.Provide strategic technology leadership by defining and evaluating standards and architecture.Build strong cross-functional relationships and design solutions to user problems.Debug and solve issues in a production environment.Required qualifications, capabilities, and skills
Formal training or certification on Site Reliability Engineering concepts and 5+ years applied experienceExpertise in programming with Python and cutting-edge software engineering practices.Experience in designing and architecting large-scale distributed systems and cloud-native architecture.Experience with developing on Cloud, especially AWS, and knowledge in Infrastructure as Code tools such as Terraform.Systematic problem-solving and troubleshooting skills in a complex system.Excellent communication skills and ability to present business and technical concepts to stakeholders.Strong sense of ownership, urgency, and drive.Preferred qualifications, capabilities, and skills
Prior experience working in AI, ML, or Data engineering.Familiarity with modern front-end technologies.Exposure to cloud technologies.