Frisco
41 days ago
Kubernetes Site Reliability Engineer

Kubernetes Site Reliability Engineer

Software Architect II

 

Who We Are:

Born digital, UST transforms lives through the power of technology. We walk alongside our clients and partners, embedding innovation and agility into everything they do. We help them create transformative experiences and human-centered solutions for a better world. 

UST is a mission-driven group of over 39,000+ practical problem solvers and creative thinkers in over 30+ countries. Our entrepreneurial teams are empowered to innovate, act nimbly, and create a lasting and sustainable impact for our clients, their customers, and the communities in which we live.

With us, you’ll create a boundless impact that transforms your career—and the lives of people across the world.

Visit us at UST.com.

 

You Are:

UST’s telecommunications practice is looking for dynamic and driven professionals to join a rapidly growing high-performance team. Site Reliability Engineer, ACE Platform Engineering will support critical API Platform, devops and other activities for the Digital Services Group.

 

The Opportunity:

·       Provide consulting services for improved system stability, availability, performance and reliability.

·       Assist in determining the impact of operational issues and provide input into their resolution via data extraction and quantification.

·       Work through day-to-day support issues, ensure effective and timely resolution of issues in production environment, troubleshoot customer impacting issues.

·       Forecast and plan for rapidly growing environment.

·       Support multiple applications, specifically running Solo Gloo/Kubernetes/PCF/GCP/Java based systems in an enterprise environment.

·       Supporting Gloo running on Kubernetes, Grafana, Prometheus, Cassandra, Postgres, Spring Boot or Java based applications running on PCF and WebLogic.

·       Apply monitoring and creating complex s and dashboards for production systems.

·       Provide capacity analysis, tuning analysis for Cloud applications in a LINUX and container platform.

·       Available to provide 24X7 on call support on a rotating basis with other team members.

·       Lead efforts in troubleshooting, recovery, and root cause investigation.

·       Perform analysis of user requirements and problems to automate or improve systems and review system capabilities, workflow, and scheduling limitations.

·       Able to follow and develop detailed work plans, schedules, project estimates, resource plans, and status reports.

·       Facilitate DR (Disaster Recovery) exercises to ensure that the team are fully prepared in any event.

·       Lead root cause analysis session to understand what causes issues in Production and come up with solutions that will prevent them from happening in the future.

·       Ensure documentation is created and remain updated for any related work.

·       Evaluates product and service solutions.

 

This position description identifies the responsibilities and tasks typically associated with the performance of the position. Other relevant essential functions may be required.

 

What you need:

·       Strong hands on experience in Kubernetes, infrastructure and support.

·       Strong experience in DevOps Practice for Micro Services using Kubernetes as Orchestrator.

·       Strong experience with Cloud configurations, services

·       Strong experience in API microservices

·       Experience with tools like: NGINX, Docker, PostMan, SOAP UI, ELK, Splunk, App Dynamics, CI/CD tools and GITLab

·       Good Experience in performance measures and tuning, capacity planning and management, contingency and disaster recovery

·       Strong scripting knowledge and experience.

·       Good understanding of networking and routing.

·       Master’s degree in Information Technology, Computer Science, Computer Information Systems, Computer Applications, related field or its equivalent and 3 years of relevant work experience. Applicant must have Bachelor’s degree in Information Technology, Computer Science, Computer Information Systems, Computer Applications, related field or its equivalent and 5 years of relevant work experience.

·       Strong understanding of UNIX operating systems and any scripting language.

 

Compensation can differ depending on factors including but not limited to the specific office location, role, skill set, education, and level of experience. As required by applicable law, UST provides a reasonable range of compensation for roles that may be hired in various U.S. markets as set forth below.

Role Location: Remote

Compensation Range:   $96,000-$144,000

 

Our full-time, regular associates are eligible for 401K matching, and vacation accrual and are covered from day 1 for paid sick time, healthcare, dental, vision, life, and disability insurance benefits.

 

 

What we believe:

We’re proud to embrace the same values that have shaped UST since the beginning. Since day one, we’ve been building enduring relationships and a culture of integrity. And today, it's those same values that are inspiring us to encourage innovation from everyone to champion diversity and inclusion, and to place people at the center of everything we do. 

Humility:

We will listen, learn, be empathetic and help selflessly in our interactions with everyone.

Humanity:

Through business, we will better the lives of those less fortunate than ourselves.

Integrity:

We honor our commitments and act with responsibility in all our relationships.

 

Equal Employment Opportunity Statement


UST is an Equal Opportunity Employer.

 

All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or status as a protected veteran.

UST reserves the right to periodically redefine your roles and responsibilities based on the requirements of the organization and/or your performance.

 

 

#UST

#CB     

#LI-AP4

#LI-Remote

Confirm your E-mail: Send Email