About Aerospike
At Aerospike, we dream big. Our focus is helping companies tackle seemingly insurmountable problems and doing what’s never been done before. That is why we developed the world's leading real-time data platform that powers mission-critical applications at the world's most innovative, category-disrupting companies. Aerospike companies have deployed extreme-scale real-time applications to fight fraud, dramatically increase shopping cart size, enable global digital payments, and deliver hyper-personalized
user experiences to tens of millions of customers.
Customers like Airtel, Experian, Nielsen, PayPal, Snap, Verizon Media, and Wayfair rely on Aerospike as the data foundation for the future to help them act in the microsecond moments that matter.
Headquartered in Mountain View, California, Aerospike has a global presence with offices in London, Bangalore, and Tel Aviv.
Job Summary - Site Reliability Engineer
As a member of our Site Reliability Engineering (SRE) team for Aerospike Cloud, you will play a vital role in ensuring the reliability, scalability, and optimal performance of Aerospike deployments and infrastructure. You’ll be responsible for ensuring the smooth operation of our Aerospike Cloud platform, which supports multiple cloud product offerings.
Key Responsibilities
Your responsibilities include designing, implementing, and managing Aerospike deployments at scale.
You will become an Aerospike expert and understand all supported cloud deployment patterns for the distributed database, failure scenarios, and remediation plans.
The SRE role places a strong emphasis on enhancing efficiency and reliability through automation and process optimization. This entails vigilant monitoring of system performance, prompt troubleshooting of issues, and automating infrastructure and service configuration tasks to streamline operations.
Additionally, you will develop and maintain robust monitoring, alerting, and observability solutions to uphold system health.
You will be part of a 24/7 on-call team, and active participation in incident response, post-mortems, and continuous improvement initiatives is essential. Collaboration with development teams is integral to comprehensively understand the Aerospike ecosystem's products slated for deployment, ensuring they meet reliability and scalability standards.
Required Experience
● Experience providing production support for cloud-based, business-critical systems
● Experience with at least one of the major public cloud providers: AWS, Google, Azure
● Experience with configuration management and infrastructure-as-code tools such as Ansible and Terraform.
● Experience with continuous integration/continuous deployment (CI/CD) pipelines.
● Strong understanding of Linux/Unix systems and networking concepts.
● Proficiency in scripting and programming languages such as Python, Bash, or Go.
● Experience with containerization and orchestration tools like Docker and Kubernetes.
● Hands-on experience with monitoring and analytics tools such as Prometheus, Grafana, Datadog, Elasticsearch, and Kibana.
● Excellent problem-solving skills.
● Strong English language communication skills, verbal and written
Preferred Skills and Qualifications
● Hands-on experience managing database deployments and services in real-world scenarios.
● Familiarity with Aerospike database or similar distributed NoSQL databases.
● Certification in relevant technologies such as AWS Certified DevOps Engineer, AWS Certified Cloud Solutions Architect, Google Professional Cloud DevOps Engineer, or their equivalents.
● Agile software methodologies such as SCRUM and Kanban
● JIRA for issue tracking
● Version control systems, preferably Git
● Secrets management systems, preferably cloud-native or Hashicorp Vault
● Vulnerability management systems, preferably Github Dependabot, Snyk, and Tenable
Aerospike is an Equal Opportunity Employer. We are committed to providing an environment free from discrimination on the basis of race, religion, color, sex, gender identity, sexual orientation, age, non-disqualifying physical or mental disability, national origin, veteran status, or any other basis covered by appropriate law.