Site Reliability Engineer

Remote

12 hours ago

Splunk

Description

Splunk's Cloud group is looking for an experienced Site Reliability Engineer to join teams that are responsible for providing and maintaining an automated platform that enables internal and external customers to easily manage and modify Splunk Enterprise Cloud (SEC) environments. As a member of these teams, you will be responsible for maintaining and fixing Splunk's SaaS system, monitoring system stability and performance, fixing complex problems, driving projects to further enhance and automate the system, performing CSP instance maintenance and system upgrades, and managing CSP server/storage deployments, all while collaborating with various other Splunk Cloud teams. This is a fantastic opportunity to work with exceptional teams, solve exciting problems, grow your cloud experience, and help drive the growth of Splunk Cloud!

What you'll get to do

Opportunities to develop and grow as an engineer. We are always expanding into new areas and exploring new technologies.

Fantastic teams. We have exceptionally skilled and dedicated peers and individual contributors in our organization and company.

Growth and mentorship. We believe in growing engineers through ownership and leadership opportunities. We also believe that mentorship benefits both mentor and mentee.

A stable, collaborative, and inclusive work environment. The teams work together to get things done and adapt to the changing needs of the team.

Balance. We don't expect people to work 12-hour days. We trust our colleagues to be responsible with their time and commitment, and believe that balance helps cultivate a positive environment.

Fun. We are committed to having every employee want to do their best, and have fun while doing it!

Must-have Qualifications

Cloud experience. Knowledge of instance management and storage, as well as an understanding of regional centers, availability zones, and HA strategies. Proven experience in at least one of the major CSPs (AWS, GCP, Azure) is required.

Infrastructure as code experience. You are proficient with infrastructure as code solutions, such as Terraform.

Software Development and Data Structures/Algorithms. We code primarily in Golang and Ruby, and work with RESTful APIs.

Knowledge of technical excellence. You know continuous delivery, testing, security practices, performance, and disaster recovery.

Unix/Linux. You will use a command line terminal frequently.

CI/CD configuration. You should be familiar with CI/CD pipelines (GitLab, Jenkins) and automation tools to enable smooth and automated deployments.

Problem Solving. You can resolve product outages, identify performance bottlenecks, spot anomalous system behavior, and determine the root cause of incidents.

Desire to learn and adapt. Our team has many projects going on at once, and you'll have the opportunity to learn to navigate new code and features.

Passion. We want you to actively own your work and be excited about your projects.

Nice-to-have Qualifications

Proven experience as a Site Reliability Engineer, DevOps Engineer, or similar role within a cloud-native environment.

Kubernetes experience. Working in Kubernetes systems with experience in kubectl and Docker containers.

Familiarity with configuration management tools like Puppet or Ansible is recommended.

Experience working with multiple cloud providers (AWS, GCP, Azure) is a plus!

Understanding of network protocols, DNS, load balancing, and general networking concepts in a cloud environment.

Multi-tenant infrastructure experience. Experience supporting customer-facing multi-tenant infrastructure (SaaS) or similar cloud-related services.

Python or Bash scripting experience. You may develop scripts and tools in Python/Bash.

Exposure to CNCF projects, policy as code (OPA), and OpenTelemetry is a plus.

Experience working on distributed systems (databases, distributed file systems) and with related concepts such as distributed concurrency control, consistency models, and the CAP theorem is an added plus.

Familiarity with security best practices for cloud infrastructure, including identity and access management (IAM), encryption, and compliance.

Proven ability to troubleshoot issues in production environments and participate in incident mitigation.

Experience with observability and secret management tools.

Note:

Base Pay Range

Poland

Base Pay: PLN 181,200.00 - 249,150.00 per year

Splunk provides flexibility and choice in the working arrangement for most roles, including remote and/or in-office roles. We have a market-based pay structure which varies by location. Please note that the base pay range is a guideline and for candidates who receive an offer, the base pay will vary based on factors such as work location as set out above, as well as the knowledge, skills and experience of the candidate. In addition to base pay, this role is eligible for incentive compensation and may be eligible for equity or long-term cash awards.

Benefits are an important part of Splunk's Total Rewards package. This role is eligible for a comprehensive, competitive benefits package which may include healthcare and retirement plans, paid time off, wellbeing expense reimbursement, and much more! Learn more about our next-level benefits at https://splunkbenefits.com.

Thank you for your interest in Splunk!
