Bellevue, Washington, USA
1 day ago
Research Scientist – Reward Model Training for Large Language Models

Build the future of the AI Data Cloud. Join the Snowflake team.

Snowflake delivers a unified platform for secure development and deployment of LLMs and ML models. Snowflake AI and ML capabilities allow you to create generative AI applications with fully managed and enterprise-grade LLMs while providing full governance.

We are at the forefront of AI innovation, dedicated to advancing the capabilities of large language models to transform how humans interact with technology. Join our dynamic team to work on groundbreaking projects that shape the future of AI-driven language understanding and generation.

Position Overview: We are seeking an experienced Research Scientist specializing in reward model training and data generation for large language models. In this role, you will design and implement reward models to enhance the performance, accuracy, and efficiency of our language models. Your expertise in machine learning, natural language processing (NLP), and reinforcement learning will drive the development of innovative solutions that push the boundaries of AI capabilities.

Key Responsibilities:

Design, develop, and implement reward models to optimize large language model performance.

Generate high-quality training datasets tailored for reward model training and evaluation.

Develop and refine algorithms to optimize reward functions, ensuring alignment with desired model outcomes.

Conduct iterative testing, validation, and fine-tuning of reward models to enhance reliability and accuracy.

Collaborate with cross-functional teams to integrate reward models into broader AI systems.

Stay abreast of the latest research and advancements in reinforcement learning, NLP, and large-scale machine learning frameworks.

Publish and present findings in leading conferences and journals to contribute to the broader AI research community.

Qualifications:

Ph.D. or Master’s degree in Computer Science, Machine Learning, Artificial Intelligence, or a related field.

Proven experience in training and optimizing reward models for large language models.

Strong proficiency in machine learning frameworks such as TensorFlow or PyTorch.

Deep understanding of natural language processing techniques and applications.

Hands-on experience with reinforcement learning and optimization algorithms.

Strong programming skills in Python or similar languages.

Demonstrated ability to work on complex projects, from conceptualization to deployment.

Excellent problem-solving skills and a passion for advancing AI technologies.

Preferred Qualifications:

Experience with large-scale distributed computing and cloud platforms.

Familiarity with human-in-the-loop systems for data labeling and model training.

Track record of published research in top-tier AI/ML conferences or journals.

Strong communication and collaboration skills to work effectively in a multidisciplinary team.

Join us in shaping the future of AI-powered language understanding and generation!

Every Snowflake employee is expected to follow the company’s confidentiality and security standards for handling sensitive data. Snowflake employees must abide by the company’s data security plan as an essential part of their duties. It is every employee's duty to keep customer information secure and confidential.

Snowflake is growing fast, and we’re scaling our team to help enable and accelerate our growth. We are looking for people who share our values, challenge ordinary thinking, and push the pace of innovation while building a future for themselves and Snowflake.

How do you want to make your impact?

Confirm your E-mail: Send Email