About this opportunity
We are looking for a Cloud Monitoring Engineer with hands-on experience in integration, Google Cloud Monitoring, ticketing tools, deployment, and configuration.
What you will do:
• Work on cloud-native application design and development using APIs, containers, Kubernetes, and service mesh (Istio), preferably on Google Cloud Platform
• Monitor Compute Engine instances using the Ops Agent, drawing on Linux and software development skills
• Establish enterprise-level technical strategy and architecture
• Provide hands-on leadership for cloud migration and adoption from inception to production
• Architect backup and disaster recovery solutions
• Develop enterprise cloud guidelines and automation in compliance with business needs and security best practices
• Plan, design, and deploy complex application workloads on Google Cloud Platform
• Work with thorough knowledge of the Google Cloud Operations Suite
• Work on advanced logging and analysis
• Work on alerting policies: SLIs, SLOs, and SLAs; developing an alerting strategy; creating alerts; and alerting in Google Cloud Service Monitoring
• Work on Cloud Logging: architecture; log types and collection; storing, routing, and exporting logs; querying and viewing logs; using log-based metrics; and log analytics on Google Cloud
• Investigate application performance issues using Error Reporting, Cloud Trace, and Cloud Profiler, including viewing application latency in Cloud Trace
• Optimize the costs of monitoring: costs and pricing, bill estimation, and cost control best practices
You will bring
• 2+ years of industry experience and strong, must-have knowledge of Cloud Monitoring, Google Cloud DevOps, Google Cloud Operations Suite, Google Cloud Operations for GKE, Google Cloud Operations for GCVE, and networking protocols (HTTP/S, TCP/IP, etc.)
• Experience with Google Cloud Managed Service for Prometheus
• Practical knowledge of containerization and orchestration using Docker and Kubernetes
• Proficiency in Git and Bash
• Proficiency in at least one of the following languages: Python, Golang, PHP, or TypeScript
• Proven expertise in troubleshooting and debugging across the full stack
• Experience integrating monitoring solutions with ticketing tools
• Experience with at least one third-party networking product
• Knowledge of segmentation, encryption, logging, monitoring, and other network security design principles