At MCG, we lead the healthcare community to deliver patient-focused care. We have a mission-driven team of talented physicians and technical experts developing our evidence-based content and innovating our products to accelerate improvements in healthcare. If you are driven to enhance the US healthcare system, MCG is eager to have you join our team. We cultivate a work environment that nurtures personal and professional growth, and this is a thrilling time to become a part of our organization. With dynamic roles that offer meaningful impact, you'll be able to fully realize your potential. Plus, you'll enjoy world-class benefits and the security, stability, and resources of our parent company, Hearst, with over 100 years of experience.
About our team: MCG develops evidence-based guidelines that help patients get the right care in a variety of healthcare settings. The data science team at MCG combines machine learning with the expert knowledge of our guidelines to increase the efficiency and accuracy of our users’ documentation workflows.
Position Summary: As Lead Data Scientist, you will serve as the technical leader for our Natural Language Processing research and development, bringing NLP features from product request to experiment to production code. You will bring experience with best practices and lead brainstorming, technical design, prioritization of experiments, and dependency management. You will keep abreast of the state of the art in LLMs while guiding the team’s research into novel techniques to solve issues unique to our application.
Essential Functions and Key Responsibilities:
- Translate desired product functionality into machine learning tasks for data scientists and machine learning engineers to solve
- Lead and establish best practices for various research activities, including data collection/annotation, literature research, experimentation, and evaluation
- Provide technical and project leadership for other Data Scientists
- Create and test novel LLM-based systems
- Stay informed on new methodologies for evaluating generative models and other complex deep learning techniques
Our tech stack
- Primarily Python jobs and applications
- OpenAI and other proprietary LLM APIs
- AzureML for PyTorch training
- Kubernetes via AKS or EKS for compute (with support from infra team)
- Databricks for data lake
- Flyte for orchestration
Minimum Qualifications Required
- 8 years of experience in Data Science or Machine Learning Engineering
- Expert-level Python: able to write reliable, extensible code
- Demonstrated experience translating high-level requirements into models in production
- Deep experience with Natural Language Processing (NLP)
- Knowledge of deep learning fundamentals and experience fine-tuning various foundation/backbone models for new tasks
- Experience using pretrained LLMs for zero- or few-shot learning tasks
- Experience generating visualizations for evaluation of complex systems
- Scientific mindset and ability to run experiments to guide model selection and hyperparameter search
Preferred Qualifications
- Strong written communication skills
- Experience putting LLM-based services into production
- Experience working with clinical data
- Experience with cloud-based machine learning training providers (SageMaker, AzureML, or similar)
Pay Range: $167,000 – $268,000
Other compensation: Bonus Eligible
Perks & Benefits: