Kuala Lumpur, MY
P-9025914 Site Reliability Engineer, Principal-1

At AIA we’ve started an exciting movement to create a healthier, more sustainable future for everyone.

As pioneering innovators for over 100 years, we’re now transforming our organisation to be faster, simpler and more connected, because we want to be even better equipped to develop digital solutions and experiences that help more people live Healthier, Longer, Better Lives.

To get there, we need people with tech/digital/analytics expertise and passion to help develop positive, sustainable change through digitally enhanced experiences that will impact the lives of millions of people and create a healthier future for everyone.

If you believe in developing a better tomorrow, read on. 

About the Role

We are looking for a Site Reliability Engineer (SRE) to ensure that our cloud application systems are reliable and available to users. The SRE will monitor application systems, establish automated detection and root cause analysis, and formulate preventive actions. They will gather and analyze metrics from operating systems and applications to assist in performance tuning and fault finding, and partner with development teams to improve services.

Functional Duties:

- Set up and maintain monitoring of infrastructure and applications
- Build alerts and auto recovery for various operational issues
- Gather and analyze metrics from operating systems as well as applications
- Advise on performance tuning and fault finding
- Partner with development teams to improve services
- Assist in formulating preventive actions where possible, lead studies of potential failure scenarios, and formulate automated recovery methods
- Be comfortable working with new tools, e.g. Azure DevOps, Grafana, ELK, and Dynatrace

People Management Duties:

- Train and coach other consultants and teammates on your specialties
- Act as an advisor to application teams and help them establish recovery processes

Requirements:

- Programming languages: Java 8 or above (must have)
- Experience in developing and optimizing stored procedures for MySQL and MSSQL databases
- OS: Linux (RHEL or SUSE) or Windows Server
- Scripting (at least one required): Shell, Bash, or PowerShell
- Knowledge of Git, the open-source distributed version control system
- Sound knowledge of how REST APIs work
- Experience with Atlassian tools (Jira, Bitbucket, and Confluence)
- Familiarity with Azure Cloud services
- Working experience with ITIL in an Agile environment

Good to have:

- Experience with the Python programming language
- Experience with containerization (Docker, AKS, ACR, EKS, ECS)
- Experience in CI/CD with Azure DevOps
- Experience in dashboard development with Grafana, Azure Monitor, or Dynatrace
- Experience in infrastructure management with Terraform or Ansible
- Azure or AWS cloud certification would be an added advantage

Build a career with us as we help our customers and the community live Healthier, Longer, Better Lives.

You must provide all requested information, including Personal Data, to be considered for this career opportunity. Failure to provide such information may influence the processing and outcome of your application. You are responsible for ensuring that the information you submit is accurate and up-to-date.
