Chicago, IL, 60684, USA
124 days ago
Senior Manager - Observability Engineering
**Description**

United's Digital Technology team designs, develops, and maintains massively scaling technology solutions brought to life with innovative architectures, data analytics, and digital solutions.

**Our Values**

At United Airlines, we believe that inclusion propels innovation and is the foundation of all that we do. Our Shared Purpose: "Connecting people. Uniting the world." drives us to be the best airline for our employees, customers, and everyone we serve, and we can only do that with a truly diverse and inclusive workforce. Our team spans the globe and is made up of diverse individuals all working together with cutting-edge technology to build the best airline in the history of aviation. With multiple employee-run "Business Resource Group" communities and world-class benefits like health insurance, parental leave, and space available travel, United is truly a one-of-a-kind place to work that will make you feel welcome and accepted. Come join our team and help us make a positive impact on the world.

**Job overview and responsibilities**

United is seeking an experienced Senior Manager of Observability with a passion for building high-performance, next-generation Observability systems. You will build and lead a team of engineers with a deep focus on delivering enterprise-wide products that operate in a highly performant and efficient way. You will help drive our Digital Operations transformation as we redefine experiences for our customers. Our Senior Manager is an engineering leader who works with the engineering staff to innovate and build new engineering solutions, improve and enhance existing distributed solutions, and leverage engineering solutions to solve critical Observability Engineering problems. The Senior Manager will lead the strategy and execution of a technical roadmap that will increase the velocity of delivering products and unlock new engineering capabilities. The ideal candidate has deep technical expertise in Python/Java coding, Kubernetes, and building cloud Observability Platform solutions.

+ Design, Develop & Drive Outcomes:
  + Understand how requirements and design choices may impact systems across multiple areas
  + Be responsible for building and mentoring Site Reliability Engineers
  + Drive the team to build solutions that serve long-term goals while ensuring that high-priority tech debt is resolved efficiently
  + Be a strong thought leader in Observability and Site Reliability Engineering principles
  + Consistently share standard methodologies and improve processes within and across teams
+ Program Management & Delivery:
  + Report on your team's progress for project and other key metrics, in addition to presenting detailed and implementable ideas for areas to further improve or influence product or project delivery
  + Drive multi-functional engagement and guidance from technology teams that support continuous improvement of the program
  + Provide regular insights on key observability metrics that highlight success and opportunities for the program
  + Maintain a cache of materials to provide status on progress, challenges, opportunities, and program roadmap to Digital Technology leaders
  + Handle internal and external partnerships to help build and maintain positive strategic partnerships and drive Observability Program success
  + Implement training programs to ensure our partners are empowered to enhance Observability programs
+ Talent Management and People Development:
  + Initiate and support performance evaluation of team members
  + Cultivate a culture that motivates all levels of performers to higher levels of achievement
  + Build and maintain relationships with your team members to support an environment of trust
  + Identify where technical or analytical skill gaps put future teamwork at risk and craft a plan to remediate; consistently challenge team members to share knowledge and learn new technologies
  + Develop and empower teams to solve sophisticated problems and be a strong advocate for open-source technologies and solutions
+ Organizational Effectiveness / People Leadership:
  + Bring strong technical expertise and leadership: you are able to lead from the trenches and have demonstrated knowledge in the area of Observability
  + Drive the build-out of multi-cloud infrastructure, lead by example, and be a role model to the team of developers and infrastructure engineers
  + Work with your Director to address project dependencies, negotiate and estimate incremental delivery dates for achievements with the stakeholder community, and deliver projects on time
  + Collaborate with the product teams to understand their difficulties around performance and resiliency, and formulate strategies to address recurring issues in a sustainable way

**Qualifications**

**Required**

+ Bachelor's degree in Information Technology, Computer Science, and/or Engineering
+ 7+ years of experience in the IT industry, including:
  + 5+ years of experience in a leadership position
  + 5+ years of leading Engineering teams
  + 4+ years of coding experience
  + 5+ years of development in large-scale, distributed systems
+ Strong expertise with Python, Java, and RESTful services, with a focus on building high-throughput, high-volume distributed systems
+ Strong expertise in Unix, container orchestration (e.g., Kubernetes), container runtimes, and optimization
+ Experience with open-source Observability tools such as Prometheus and the LGTM stack is a big plus
+ Strong understanding of columnar data stores
+ Strong understanding of Site Reliability Engineering and DevOps principles
+ Strong technical acumen in Cloud Architecture, Performance Benchmarking, and Capacity Planning
+ Solid foundation in algorithms, data structures, and core computer science concepts
+ Experience leading and growing engineers and teams
+ In-depth knowledge of CS data structures and algorithms
+ Basic UI/UX and prototype design knowledge and experience
+ Proven ability to focus, with a demonstrated capacity for learning technical concepts and adapting to new technologies quickly
+ Strong Cloud (AWS, GCP, Azure, etc.) platform knowledge
+ Proficiency in project management and work item management tools such as Azure DevOps and Portfolio
+ Strong knowledge of logging systems; experience with ELK Stack (Elasticsearch, Logstash, Kibana), Splunk, or similar platforms
+ Experience with tools like Harness, GitLab, Terraform, Ansible, or CloudFormation for managing and monitoring infrastructure
+ Ability to diagnose performance bottlenecks and other system issues using observability data
+ Must be legally authorized to work in the United States for any employer without sponsorship
+ Successful completion of interview required to meet job qualification
+ Reliable, punctual attendance is an essential function of the position

**Preferred**

+ Master's degree
+ 9+ years of relevant experience

The base pay range for this role is $137,275.00 to $187,000.00. The base salary range/hourly rate listed is dependent on job-related, non-discriminatory factors such as experience, education, and skills. This position is also eligible for bonus and/or long-term incentive compensation awards. You may be eligible for the following competitive benefits: medical, dental, vision, life, accident & disability, parental leave, employee assistance program, commuter, paid holidays, paid time off, 401(k), and flight privileges.

United Airlines is an equal opportunity employer. United Airlines recruits, employs, trains, compensates, and promotes regardless of race, religion, color, national origin, gender identity, sexual orientation, physical ability, age, veteran status, and other protected status as required by applicable law. Equal Opportunity Employer - Minorities/Women/Veterans/Disabled/LGBT.

We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process and to perform crucial job functions. Please contact JobAccommodations@united.com to request accommodation.