Job Description
We are seeking a skilled and experienced ML Ops Engineer to join our team. This role will be crucial in bridging the gap between data science and production, ensuring the reliable and scalable deployment of machine learning models. You will be responsible for building and maintaining the infrastructure, automation, and processes that enable our data science teams to deliver high-quality AI/ML solutions. The ideal candidate will possess a strong understanding of AWS, infrastructure as code, CI/CD pipelines, and the operational aspects of running machine learning workloads.
ResponsibilitiesDevelop and maintain robust, scalable, and secure infrastructure on AWS to support the entire ML lifecycle, from data ingestion to model deployment and monitoring.Develop experience on AWS using Python libraries to provision resources and handle workloads.Automate the deployment of machine learning models and associated infrastructure using Infrastructure as Code (IaC) tools like Terraform or CloudFormation.Design and implement CI/CD pipelines for machine learning models, enabling continuous integration, testing, and deployment.Build and operate AWS backend applications to support ML model serving, monitoring, and data processing.Implement monitoring and alerting systems to ensure the health and performance of ML models and infrastructure.Collaborate with data scientists and engineers to understand their needs and translate them into operational requirements.Proactively identify and resolve infrastructure bottlenecks and optimize resource utilization.Contribute to the development of best practices for ML Ops within the organization.Support the troubleshooting and debugging of issues across the ML pipeline.Promote knowledge sharing and best practices within the team.Ensure the security and compliance of the ML infrastructure and data.Perform Data Engineering tasks as needed to ensure smooth data flow to ML models.Essential SkillsStrong scripting and automation skills using Python (required)Experience working with AWS services (EC2, S3, ECS/EKS, SageMaker, Lambda, IAM, etc.) (required)Experience with Infrastructure as Code tools (Terraform, CloudFormation) (required)Strong knowledge of CI/CD concepts and tools (Git, Jenkins, AWS CodePipeline) (required)Experience deploying and managing ML models in production (required)Experience working with containerization technologies such as Docker and Kubernetes (good to have)Knowledge of data science tools, packages and ML models (good to have)Experience with monitoring and alerting tools (Cloudwatch, Prometheus, Grafana) (good to have)Experience with security concepts and best practices (good to have)Additional Skills & QualificationsBS/MS in computer science, engineering, or a related field with 4-6 years of relevant experience.Proven experience in designing, building, and operating production-grade ML infrastructure and pipelines.Strong understanding of cloud computing, especially AWS.Demonstrated experience with automation, configuration management, and Infrastructure as Code.Strong knowledge of CI/CD principles and practices.Experience with deploying and managing containerized applications.Experience building and deploying AWS backend applications.Strong problem-solving and troubleshooting skills.Excellent communication and collaboration skills.Ability to work independently and in a team environment.Strong work prioritization, planning, and organizational skills.Strong documentation skills with attention to detail.Demonstrated ability to work independently with minimal oversight throughout the entire lifecycle of a project.Agile mindset.Broad knowledge and experience in the various domains of information technology.AWS Certifications (e.g., AWS Certified DevOps Engineer, AWS Certified Solutions Architect) (good to have)Work Environment
You will work in a dynamic environment that utilizes various AWS technologies and tools, such as EC2, S3, ECS/EKS, SageMaker, Lambda, IAM, Terraform, CloudFormation, Git, Jenkins, AWS CodePipeline, Docker, Kubernetes, Cloudwatch, Prometheus, and Grafana. The role involves collaborating with a team of data scientists and engineers, ensuring the reliable and scalable deployment of machine learning models. You will also engage in an agile work environment, prioritizing tasks and projects efficiently while maintaining high standards of security and compliance.
Pay and Benefits
The pay range for this position is $60.00 - $63.00
Eligibility requirements apply to some benefits and may depend on your job classification and length of employment. Benefits are subject to change and may be subject to specific elections, plan, or program terms. If eligible, the benefits available for this temporary role may include the following:
• Medical, dental & vision
• Critical Illness, Accident, and Hospital
• 401(k) Retirement Plan – Pre-tax and Roth post-tax contributions available
• Life Insurance (Voluntary Life & AD&D for the employee and dependents)
• Short and long-term disability
• Health Spending Account (HSA)
• Transportation benefits
• Employee Assistance Program
• Time Off/Leave (PTO, Vacation or Sick Leave)
Workplace Type
This is a hybrid position in Durham,NC.
Application Deadline
This position will be accepting applications until Feb 6, 2025.
About Actalent
Actalent is a global leader in engineering and sciences services and talent solutions. We help visionary companies advance their engineering and science initiatives through access to specialized experts who drive scale, innovation and speed to market. With a network of almost 30,000 consultants and more than 4,500 clients across the U.S., Canada, Asia and Europe, Actalent serves many of the Fortune 500.
Diversity, Equity & InclusionAt Actalent, diversity and inclusion are a bridge towards the equity and success of our people. DE&I are embedded into our culture through:
Hiring diverse talent Maintaining an inclusive environment through persistent self-reflection Building a culture of care, engagement, and recognition with clear outcomes Ensuring growth opportunities for our peopleThe company is an equal opportunity employer and will consider all applications without regard to race, sex, age, color, religion, national origin, veteran status, disability, sexual orientation, gender identity, genetic information or any characteristic protected by law.
If you would like to request a reasonable accommodation, such as the modification or adjustment of the job application process or interviewing process due to a disability, please email actalentaccommodation@actalentservices.com for other accommodation options.