Build the future of the AI Data Cloud. Join the Snowflake team.
Snowflake delivers a unified platform for secure development and deployment of LLMs and ML models. Snowflake AI and ML capabilities allow you to create generative AI applications with fully managed and enterprise-grade LLMs while providing full governance.
We are seeking a skilled and experienced Infrastructure Engineer to join our AI/ML organization. In this role, you will be responsible for designing, developing, and maintaining the infrastructure that supports serving large language models with world-class performance and availability.
Responsibilities:
Contribute to the open-source vLLM inference engine.
Design and implement scalable infrastructure solutions to support the deployment of large language models.
Optimize compute, storage, and networking resources to enhance the performance and cost-efficiency of LLM operations.
Develop and maintain tools for monitoring and managing LLM performance, including resource utilization, latency, and throughput.
Implement security measures and best practices to protect sensitive data processed by LLMs.
Troubleshoot and resolve infrastructure issues in a timely manner, ensuring minimal disruption to model deployment and availability.
Stay updated with advancements in AI infrastructure technologies and contribute to the adoption of new tools and frameworks.
Document infrastructure designs, processes, and configurations for knowledge sharing and training purposes.
Provide technical guidance and mentorship to junior members of the infrastructure team.
Requirements:
2+ years of industry experience designing, building, and supporting internet-serving infrastructure, machine learning platforms, and machine learning services and frameworks.
Experience working with vLLM or similar technologies.
Proficiency in cloud computing platforms such as AWS, Azure, or GCP.
Strong programming skills in at least one of Python, Go, Java, or C++.
Experience with containerization and orchestration tools such as Docker and Kubernetes.
Solid understanding of distributed computing, parallel processing, and data storage systems (e.g., Hadoop, Spark, Elasticsearch).
Knowledge of security best practices and data protection measures in AI environments.
Excellent problem-solving skills and ability to troubleshoot complex issues in a production environment.
Strong communication skills and ability to collaborate effectively in a cross-functional team environment.
BS/MS/PhD in Computer Science, Engineering, or a related field.
Every Snowflake employee is expected to follow the company’s confidentiality and security standards for handling sensitive data. Snowflake employees must abide by the company’s data security plan as an essential part of their duties. It is every employee’s duty to keep customer information secure and confidential.
Snowflake is growing fast, and we’re scaling our team to help enable and accelerate our growth. We are looking for people who share our values, challenge ordinary thinking, and push the pace of innovation while building a future for themselves and Snowflake.
How do you want to make your impact?
The estimated base salary range for this role is $195,000 - $287,500. Additionally, this role is eligible to participate in Snowflake’s bonus and equity plan.
The successful candidate’s starting salary will be determined based on permissible, non-discriminatory factors such as skills, experience, and geographic location. This role is also eligible for a competitive benefits package that includes: medical, dental, vision, life, and disability insurance; 401(k) retirement plan; flexible spending & health savings account; at least 12 paid holidays; paid time off; parental leave; employee assistance program; and other company benefits.