Algolia is set to enable every company to create world-class Search and Discovery experiences with an API-first approach. Performance and Scalability is at the heart of our mission: we power 1.5 trillion searches a year, for 10K+ customers all over the world.
If you're a problem solver, able to think outside the box and eager to nurture others and learn from them, then this is your challenge!
The TeamThe Fleet team is a Site Reliability engineering team focused on one thing: the search products should always be available. To make this possible, the fleet team creates pragmatic solutions to optimize the Search Products availability and costs at scale, depending on the needs of the customer, the product teams, and the different engineering teams that deliver a unique Search Experience to our customers.
The OpportunityThe team is looking for an experienced individual who is well versed in defining, building and operating scalable architectures, with proven strengths in cross-team collaboration, taking solutions all the way through delivery to adoption by other engineering teams.
As a senior member of the Fleet team, you will have a direct impact on Algolia’s Search Products. You will play a pivotal role in architecting and implementing systems that play a central role to the reliability, scalability, and cost optimization. You will also play a role in training/mentoring and leading team members.
Your role will consist of: Operate a migration tooling orchestrator that has a huge impact on the product scalability, reliability and cost Operate the Search Products Run and improve our homemade tooling for network (such as Edge Load balancer and dns manager) Build, run and improve a backup/restore system to ensure the respect of our SLAs You might be a fit if you have: Strong knowledge of at least one programming language (Python, Golang, Ruby.) and familiar with software craftsmanship. Knowledge of Lua is a plus Experience building and operating distributed systems at scale Experience designing and building APIs Experience designing new applications with reliability, operability, and availability in mind Experience with Public Cloud Providers such as GCP, AWS or Microsoft Azure, and usage of Kubernetes Strong knowledge in Linux system administration, network system and troubleshooting Excellent communication and organization skills Team’s current stack: Golang, ruby/python, pubsub, Github Actions, CircleCI, terraform, Chef Linux, Bare Metal, Kubernetes Bare Metal, AWS, Azure We’re looking for someone who can live our values: GRIT - Problem-solving and perseverance capability in an ever-changing and growing environment TRUST - Willingness to trust our co-workers and to take ownership CANDOR - Ability to receive and give constructive feedback. CARE - Genuine care about other team members, our clients and the decisions we make in the company. HUMILITY- Aptitude for learning from others, putting ego aside.#LI-Remote