Microsoft bets on Artificial Intelligence (AI) as the next growth opportunity for the company. OpenAI, Mistral, and other Large language Model (LLM) driven innovations are happening throughout the industry. Azure AI is focused on building a platform that makes it easy for both first party Microsoft teams and third-party customers to build cutting edge applications on top of these large language models.
The Back Plane team in Azure Machine Learning is looking for a Principal software engineer who loves to build scalable, highly available, and secure microservices that run in Kubernetes. The infrastructure team focuses on managing a large fleet of Azure Kubernetes Services (AKS) that represents the control plane for AzureML.
The team focuses on:
Managing Kubernetes Cluster Deployments at ScaleSecure Control Plane / Data Plane assets from malicious attacks and unauthorized access using industry standard tools and frameworksAutomate Monitors and critical alerts using best in class observability tools such as: Azure Monitor, Prometheus, Azure Data Explorer, GrafanaAutomate CI/CD deployments using YAML builds and releaseFor the Azure ML platform, we build tools to increase the observability of the applications running in the Kubernetes clusters, improve the speed, security, and reliability of our deployments, secure our supply chain and services, and debug production with ease. We use the best of open source, like Prometheus, Grafana, and NGINX, and build solutions to enable Azure ML to deliver a global service that handles large scale ML training and inferencing workloads