To get the best candidate experience, please consider applying for a maximum of 3 roles within 12 months to ensure you are not duplicating efforts.
Job Category
Software EngineeringJob Details
About Salesforce
We’re Salesforce, the Customer Company, inspiring the future of business with AI+ Data +CRM. Leading with our core values, we help companies across every industry blaze new trails and connect with customers in a whole new way. And, we empower you to be a Trailblazer, too — driving your performance and career growth, charting new paths, and improving the state of the world. If you believe in business as the greatest platform for change and in companies doing well and doing good – you’ve come to the right place.
We are looking for an experienced Devops engineer to manage critical infrastructure and maintain high availability of services running on this infrastructure. In this role you will have an opportunity to influence the design of the infrastructure and build it from the ground up.
Responsibilities:
Develop and maintain observability products, build tools and automation to eliminate manual touch, and respond to system failures and outages.
Gain insights into platform incidents and address repeat issues.
Leverage AIOps platforms to improve anomaly detection, automate runbooks, and meet MTTD & MTTR goals.
Oversee system monitoring, incident response, and root cause analysis in a timely manner.
Solid understanding of logging frameworks and APM tools like Splunk , Prometheus, Grafana.
Drive continuous improvement initiatives to enhance system reliability and performance.
Collaborate with development teams and drive reliability/availability improvements.
Manage deployments, oversee patching and mitigate security vulnerabilities.
Proactively plan and manage potential risks to ensure system reliability.
Prioritize tasks and projects effectively in a fast-paced environment to ensure critical issues are addressed promptly.
Design, implement, and maintain scalable, fault-tolerant cloud infrastructure.
Leverage container technologies like Kubernetes and Docker to enhance system reliability and efficiency.
Monitor, troubleshoot, and resolve production issues, ensuring system availability and performance.
Collaborate with cross-functional teams to diagnose incidents, improve observability, and drive post-mortem analysis.
Write clear technical documentation and communicate complex issues effectively with both technical and non-technical stakeholders.
Develop automation scripts and tooling using at least one object-oriented language (e.g., Java, Python, Go) and one scripting language (e.g., Bash, Python).
Manage network technologies, including DNS, Load Balancing, TCP/IP, HTTP, and tools like curl and OpenSSL to ensure seamless connectivity.
Continuously improve system performance, security, and cost efficiency through proactive monitoring and optimizations.
Proficiency with source control, continuous integration, and testing pipelines.
Required Skills:
Design, implement, and maintain scalable, fault-tolerant cloud infrastructure.
Expertise in managing large, fault-tolerant, cloud-hosted systems.
Proficiency with container technologies like Kubernetes and Docker.
Proficiency with AWS, GCP or other cloud solutions.
Excellent communication and collaboration skills.
Clear technical communication, especially about problems and incidents.
Proficiency in at least one object-oriented and one scripting language.
Understanding fundamental mesh and network technologies, e.g., DNS, Load Balancing, TCP/IP, HTTP, DNS, curl, openssl.
Strong problem-solving, troubleshooting, and analytical skills demonstrated in past projects.
Proven experience managing large-scale, cloud-hosted systems in AWS, Azure, or GCP.
Strong expertise in Kubernetes, Docker, and container orchestration.
Solid understanding of networking fundamentals, including DNS, TCP/IP, HTTP, and Load Balancing.
Proficiency in at least one object-oriented and one scripting language.
Exceptional troubleshooting, problem-solving, and analytical skills, demonstrated in past projects.
Excellent communication and collaboration skills, with the ability to clearly articulate technical challenges and solutions.
Accommodations
If you require assistance due to a disability applying for open positions please submit a request via this Accommodations Request Form.
Posting Statement
At Salesforce we believe that the business of business is to improve the state of our world. Each of us has a responsibility to drive Equality in our communities and workplaces. We are committed to creating a workforce that reflects society through inclusive programs and initiatives such as equal pay, employee resource groups, inclusive benefits, and more. Learn more about Equality at www.equality.com and explore our company benefits at www.salesforcebenefits.com.
Salesforce is an Equal Employment Opportunity and Affirmative Action Employer. Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender perception or identity, national origin, age, marital status, protected veteran status, or disability status. Salesforce does not accept unsolicited headhunter and agency resumes. Salesforce will not pay any third-party agency or company that does not have a signed agreement with Salesforce.
Salesforce welcomes all.