Prague, Central Bohemian, Czechia
17 days ago
AI/ML Platform Engineer

Job Description

Join the Advanced Data Analytics (ADA) Product Line as a Platform Engineer and help shape the future of AI/ML development experience for researchers and data science teams across all divisions. Product Line mission is to enable users to focus on delivering value to stakeholders and to accelerate the development process. As a member of the Platform Engineering team, you will drive continuous delivery, site reliability and cost efficiency of AI/ML platforms hosted on AWS. You will develop new platform capabilities and collaborate with our Data Science teams to deliver accelerators for our users. Our web-based accelerators range from delivering CI/CD, MLOPs to fully customizable end-to-end Retrieval-Augmented Generation pipelines. Following year we are  planning to extend capabilities with AI agents and develop procedures to host and customize LLMs in our platforms in self-service way.

Your team will consist of internal employees and long-term contractors from multiple time zones. Prague-based engineers meet once a week in the office, but you are welcome to enjoy on-site collaboration with our customers and colleagues any time you want.  We believe in agile manifesto and utilize just small subset of scrum practices. Our team deliver in two-week sprints, we gather business requirements every week, help our colleagues to translate business requirements into engineering tasks, assess complexity to plan our delivery capacity and every two weeks you can trigger retrospective by voting. We are kind to each other and after every daily there is option to discuss engineering topics together. We strive to break product silos and develop product agnostic engineering knowledge.

Your expected career path is to grow professionally and take responsibility for the delivery leadership in one of the platforms or initiatives as subject matter expert. North star of your capabilities is ability to drive delivery of new products, ensure compliance and fulfill stakeholder requirements. 

Infrastructure: AWS, AWS China, On-premise

Product Line: Dataiku, Databricks, Domino, Posit Cloud & On-prem, SAS, AWS OpenSearch, JMP, and Alteryx, 

Continuous Delivery: Terraform, Ansible, CloudFormation, Docker, Bash, Python, GitHub Actions,

Observability: ElasticSearch, CloudWatch

Quality Engineering: X-Ray, Robot Framework, Selenium Grid

Elastic Compute: Kubernetes (Karpenter), Slurm, Databricks Clusters

Distributed Processing: Spark, Ray

Product Line Insight Database: Redshift

Operating Systems: Alma Linux, Red Hat, Amazon Linux, Bottlerocket

Responsibilities

Assess and deliver assigned tasks

Conduct root cause analysis

Participate in code reviews and technical discussions

Improve code maintainability, security, reliability, and platform cost efficiency

Collaborate to enhance practices within the engineering team

Automate and simplify the maintenance and lifecycle of platform services

Keep up with current industry trends, cloud-native concepts, best practices, and technologies

Ensure compliance with the System Development Lifecycle (SDLC) and company policy standards

Maintain up-to-date Design & Configuration Specifications.

Assist customers and colleagues

Must-Have Qualifications

Self-sufficiency

A proactive and delivery-oriented mindset

Ability to effectively work in a remote environment with a global team

Capability to review and understand system requirements and business processes

Hands-on experience with:

Git, Docker, Ansible, Terraform, Shell scripting, Python or similar (even more modern) tooling.

AWS Services (VPC, CloudWatch, ALB, Route53, S3, IAM, EKS, etc.) or Azure or Google Cloud

Networking

Linux system administration

Bachelor’s degree or equivalent in Computer Science, Computer Engineering, Information Systems or related experience.

Nice-to-Have Qualifications

Data Science Experience

AI/ML and data processing platforms

MLOPs

LLMOps

RAGs

Agent Frameworks

Vector Databases

Statistical analysis tools

Karpenter, Slurm, Dask, Github Actions or Jenkins

Software Development

Design Patterns

UI & Visualizations 

Data Engineering

Spark, Ray, Dask

What we offer:

Exciting work in a great team, global projects, international environment 

Opportunity to learn and grow professionally within the company globally. 

Hybrid working model, flexible role pattern (e.g., even 80% full-time is possible in justified cases) 

Pension and health insurance contributions 

Internal reward system plus referral program 

5 weeks annual leave, 5 sick days, 15 days of certified sick leave paid above statutory requirements annually, 40 paid hours annually for volunteering activities, 12 weeks of parental contribution. 

Cafeteria for tax free benefits according to your choice (meal vouchers, Lítačka, sport, culture, health, travel, etc.), Multisport Card 

Vodafone, Raiffeisen Bank, Foodora, and Mall.cz discount programs.

Up-to-date laptop and iPhone 

Parking in the garage, showers, refreshments, massage chairs, library, music corner 

Competitive salary, incentive pay, and many more. 

 
Ready to take up the challenge? Apply now! 
Know anybody who might be interested? Refer this job! 

Current Employees apply HERE

Current Contingent Workers apply HERE

Search Firm Representatives Please Read Carefully 
Merck & Co., Inc., Rahway, NJ, USA, also known as Merck Sharp & Dohme LLC, Rahway, NJ, USA, does not accept unsolicited assistance from search firms for employment opportunities. All CVs / resumes submitted by search firms to any employee at our company without a valid written search agreement in place for this position will be deemed the sole property of our company.  No fee will be paid in the event a candidate is hired by our company as a result of an agency referral where no pre-existing agreement is in place. Where agency agreements are in place, introductions are position specific. Please, no phone calls or emails. 

Employee Status:

Regular

Relocation:

VISA Sponsorship:

Travel Requirements:

Flexible Work Arrangements:

Remote

Shift:

Valid Driving License:

Hazardous Material(s):


Required Skills:

Availability Management, Capacity Management, Change Controls, Design Applications, High Performance Computing (HPC), Incident Management, Information Management, Information Technology (IT) Infrastructure, IT Service Management (ITSM), Release Management, Software Development, Software Development Life Cycle (SDLC), Solution Architecture, System Administration, System Designs


Preferred Skills:

Job Posting End Date:

01/31/2025

*A job posting is effective until 11:59:59PM on the day BEFORE the listed job posting end date. Please ensure you apply to a job posting no later than the day BEFORE the job posting end date.


Requisition ID:R324377

Confirm your E-mail: Send Email