BANGALORE, IND
16 hours ago
Site Reliability Automation Engineer
**Introduction**

Working in IBM Cloud gives you the platform to learn, develop and use your skills every day by working on the latest cloud-related technology products and services. You'll be working in an environment where we understand that we thrive best when we play to our strengths. That's why developing our people is key to our success; the door is always open for those ready to advance their career.

Curiosity and courageous thinking are both vital when working in IBM Cloud, as we continue our dedication to staying at the forefront of cloud technology. Our renowned legacy means we are leading the way in everything from analytics and security through to unmatched hardware and software designs. We provide our clients with full end-to-end transformation as we build IBM's next-generation cloud platform, which is focused on delivering performance and predictability at a global scale.

IBM's product and technology landscape includes Research, Software, and Infrastructure. Entering this domain positions you at the heart of IBM, where growth and innovation thrive.

**Your role and responsibilities**

In this Site Reliability Engineer role, you will work closely with the entire IBM Cloud organization to maintain and operationally improve the IBM Cloud infrastructure. You will focus on the following key responsibilities:

* Implement and automate infrastructure solutions that support IBM Cloud products and services to reduce toil.
* Partner with other SRE teams and program managers to deliver mission-critical services to IBM Cloud.
* Build new tools to improve automated resolution of production issues.
* Monitor and respond promptly to production alerts, and execute changes in production through automation.
* Support the compliance and security integrity of the environment.
* Continually improve systems and processes regarding automation and monitoring.

**Required technical and professional expertise**

* 3+ years of experience handling large production system environments
* Must be extremely comfortable using and navigating within a Linux environment
* Ability to do low-level debugging and problem analysis by examining logs and running Unix commands
* Must be efficient in writing and debugging scripts
* 3-5+ years of experience in virtualization technologies and automation / configuration management
* Automation and configuration management tools/solutions: Ansible, Python, bash, Terraform, GoLang, etc. (at least one)
* Virtualization technologies: Citrix Xen Hypervisor (preferred), KVM (also preferred), libvirt, VMware vSphere, etc. (at least one)
* Monitoring technologies: Zabbix (preferred), Sysdig, Grafana, Nagios, Splunk, etc. (at least one)
* Working knowledge of container technologies: Kubernetes, Docker, etc.
* Excellent written and verbal communication skills

**Preferred technical and professional experience**

* Good experience with public cloud platforms and Kubernetes clusters, strong Linux skills for managing services across a microservices platform, and good SRE knowledge of cloud compute, storage and network services