Data Engineer
Kforce
Kforce has an enterprise client seeking a Data Engineer 2 in Seattle, WA.
Summary:
As an LLM Evaluation Expert specializing in Mathematics, you will play a crucial role in assessing and improving our language models' mathematical capabilities. Your expertise will be instrumental in evaluating LLM-generated mathematical solutions, making high-level judgments, and setting the standard for what constitutes excellent AI-assisted mathematical problem-solving.
Key Responsibilities:
* Critically analyze and evaluate mathematical responses generated by our LLMs across various fields of mathematics (e.g., algebra, calculus, statistics, number theory)
* Exercise expert judgment to select the most appropriate and efficient mathematical solutions from multiple LLM-generated options
* Make informed decisions on behalf of our customers, ensuring that selected solutions meet rigorous mathematical standards, are logically sound, and address specific research or application needs
* Develop and write mathematical demonstrations to illustrate "what good looks like" in AI-generated solutions, setting benchmarks for accuracy, elegance, and insight
* Provide detailed feedback and explanations for your evaluations, helping to refine and improve the LLM's understanding and output of mathematical concepts
* Collaborate with the AI research team to identify areas for improvement in the LLM's mathematical reasoning and problem-solving capabilities
* Stay abreast of the latest developments in mathematics, mathematical software, and AI to ensure our evaluations remain cutting-edge
Confirm your E-mail: Send Email
All Jobs from Kforce