Plano, TX, USA
11 days ago
Lead Infrastructure Engineer

DESCRIPTION:

Duties: Deploy and configure monitoring and observability tools. Work with site reliability engineering (SRE) teams to provide advanced support for all instrumentation on public and private clouds. Integrate and automate observability tools. Drive all instrumentation for application performance agents across all commercial banking applications. Partner with vendors and cross-lines of business to provide solutions to automate deployment for agent instrumentation in the private and public cloud. Work with cross-line of business teams, service providers, and partner organizations to ensure consistent SRE strategy and practices. Instrument all AWS services with proper tools for SRE support. Analyze and reduce costs for observability tools by implementing best practices. Support all AWS services to implement monitoring, dashboards, and alerts that can proactively alert SRE teams. Develop dashboards to analyze data for application performance. Contribute to technical and product direction in light of the product roadmap and potential risks. Perform specific use cases from different groups and challenge the team to build solutions that reduce user issues and can be easily supported. Educate application developers on the best way to understand the runtime state of a product effectively through telemetry.


QUALIFICATIONS:

Minimum education and experience required: Master's degree in Applied Computer Science, Computer Engineering, or related field of study plus 3 years of experience in the job offered or as Infrastructure Engineer, Production Engineer, Systems Analyst, IT Consultant, System Administrator or related occupation. The employer will alternatively accept a Bachelor's degree in Applied Computer Science, Computer Engineering, or related field of study plus 5 years of experience in the job offered or as Infrastructure Engineer, Production Engineer, Systems Analyst, IT Consultant, System Administrator or related occupation.

Skills Required: Requires experience in the following: Public cloud; Terraform; Datadog; CloudWatch; Splunk; Python; Linux; CI/CD; Active Directory; DevOps; Site Reliability Engineering.

Job Location: 8181 Communications Pkwy, Plano, TX 75024. Telecommuting permitted up to 40% of the week.

Confirm your E-mail: Send Email