Job Description
As a member of the Automation Engineering team, you will build mission-critical platforms, tools, and processes that ensure the highest levels of availability and reliability for all our applications. We need creative and innovative problem solvers who can partner with our application development teams to make their services more usable. Our Automation Engineering team has an outstanding opportunity to build tools, frameworks, and cloud platforms that will support our company's growth over the next decade. If you are a self-starter who jumps on new ideas to make the platform more stable, secure, and feature-rich, this is your new career.
Your Responsibilities
Participate in and contribute to the architecture for distributed systems automation
Write automation code for provisioning and operating infrastructure at massive scale
Work with development teams to make sure applications are production-ready, scalable, and reliable from the ground up
Identify and drive opportunities to improve automation for code deployment, management and visibility of application services
Develop tools and frameworks to automate operational tasks and the deployment of machines, services, and applications
Establish end-to-end monitoring and alerting on all critical components of the application
Participate in the on-call rotation supporting the platform and/or the production application
Direct root cause analysis of critical business and production issues
Develop and mentor other Automation Engineers on standard methodology, from infrastructure orchestration to troubleshooting application services in production
Represent the Automation Engineering team in design reviews and work cross-functionally with Engineering teams on operational readiness
Your Experience
Expertise in configuration management with a framework such as Salt, Ansible or Terraform. Ansible background preferred.
Experience in DevOps, Site Reliability, or infrastructure engineering
Expertise in a public cloud platform and its related services
Strong experience with Linux or networking
Proficiency with a programming language like Python and shell scripting to automate tasks
Familiarity with CI/CD pipelines and tools such as GitHub, Jenkins, and Artifactory
Deep knowledge of AWS, Azure
Ability to diagnose and troubleshoot complex distributed systems handling high-volume transactions
Strong fundamentals in HTTP including HTTP headers and web servers
BS or MS in Computer Science, related field, or equivalent professional experience
Excellent problem solving, critical thinking, communication, and teamwork skills
Excellent written and verbal communication, able to collaborate and rally support
Self-disciplined, self-managed, and self-motivated, with a strong sense of ownership, urgency, and drive
Passion for automation and monitoring instrumentation in the code
Excellent interpersonal skills and the ability to work well in a team
Passion for learning, understanding, and dissecting new technology stacks quickly and independently
8 to 10 years of overall hands-on development experience