Prague, Central Bohemian, Czechia
5 days ago
Technical Lead, Data Engineering (Associate Director)

Job Description

You will be part of the Scientific Data product line team. In this role, they will collaborate with multi-disciplinary teams, including product teams, technical teams, scientists, data scientists, subject matter experts, and stakeholders helping to solve complex data problems and building data products to enable AI/ML. These data products will help They will have to understand complexities of Research data and work with the product team to make data understandable and analytics ready. In this role they have to define, design, develop and maintain data products.

The Scientific Data Product Line is responsible for enabling data for the Discovery, Preclinical, Translational Medicine (DPTM) and Developmental Sciences and Clinical Sciences (DSCS) organizations to get to better and faster insights in the drug discovery process.

Primary Responsibilities

Understand user problems and pain points. Design and develop solutions to address business needs using enterprise solutions 

Lead data engineers on the team to ensure solution are fit for purpose and align to enterprise needs 

Design, develop and maintain data pipelines to extract data from a variety of sources and populate data lake, data warehouse and Lakehouse 

Develop the various data transformation rules and data modeling capabilities 

Collaborate with Product Analyst, Data Scientists, Machine Learning Engineers to identify and transform data to make data understandable 

Work with data governance team and implement data quality checks and maintain data catalogs 

Use Orchestration, logging, and monitoring tools to build resilient pipelines 

Use test driven development methodology when building ELT pipelines 

Use Git for version control and understand various branching strategies 

Work as part of an Agile team 

Describes what to solve by writing problem statements and requirements and facilitates the “how” with the development team 

Planning, designing, and conducting testing activities, and compiling critical content for training and communication for our users 

Design and implement test cases 

Create technical documentation as needed for SDLC 

Partner with business and IT leadership to formulate business cases 

Required Experience and Skills 

B.Sc. or higher degree in Computer Science or Chemistry equivalent field required 

Minimum 3-5 years working with customers to define requirements, preferably data 

Domain knowledge - Pharmaceutical drug discovery and pre-clinical development 

Hands-on experience with AWS services (S3, IAM, Redshift, SageMaker, Glue, Lambda, Step Functions, CloudWatch) 

Experience with platforms like Databricks, Dataiku, Delta Lake 

Experience with data governance, quality checks, and data catalog capabilities 

Experience with data integration & transformation tools – AWS Glue, Starburst Trino, Databricks 

Proficient in CI/CD using GitHub Actions, Jenkins, CloudFormation, Terraform, Git, Docker, Apache Airflow 

Proficient in Python, Spark – PySpark 

Demonstrates growth and product mindset. Ability to work in a cross functional team setup 

Be able to work independently, anticipating and resolving problems 

Excellent written and verbal communications skills 

Effectively engage both technical and non-technical stakeholders and users 

Preferred Experience and Skills

Any AWS developer or architect certification 

Experience in Java 

Experience with Matillion ETL for Data Transformation 

Experience working in Agile software development 

Familiarity with NoSQL Databases 

Familiarity with ontologies and use of them to create data products 

What we offer

Exciting work in a great team, global projects, international environment

Opportunity to learn and grow professionally within the company globally

Hybrid working model, flexible role pattern

Pension and health insurance contributions

Internal reward system plus referral programme

5 weeks annual leave, 5 sick days, 15 days of certified sick leave paid above statutory requirements annually, 40 paid hours annually for volunteering activities, 12 weeks of parental contribution

Cafeteria for tax free benefits according to your choice (meal vouchers, Lítačka, sport, culture, health, travel, etc.), Multisport Card

Vodafone, Raiffeisen Bank, Foodora, and Mall.cz discount programmes

Up-to-date laptop and iPhone

Parking in the garage, showers, refreshments, massage chairs, library, music corner

Competitive salary, incentive pay, and many more


Ready to take up the challenge? Apply now!
Know anybody who might be interested? Refer this job!

Current Employees apply HERE

Current Contingent Workers apply HERE

Search Firm Representatives Please Read Carefully 
Merck & Co., Inc., Rahway, NJ, USA, also known as Merck Sharp & Dohme LLC, Rahway, NJ, USA, does not accept unsolicited assistance from search firms for employment opportunities. All CVs / resumes submitted by search firms to any employee at our company without a valid written search agreement in place for this position will be deemed the sole property of our company.  No fee will be paid in the event a candidate is hired by our company as a result of an agency referral where no pre-existing agreement is in place. Where agency agreements are in place, introductions are position specific. Please, no phone calls or emails. 

Employee Status:

Regular

Relocation:

VISA Sponsorship:

Travel Requirements:

Flexible Work Arrangements:

Hybrid

Shift:

Valid Driving License:

Hazardous Material(s):


Required Skills:

Business Intelligence (BI), Database Administration, Data Engineering, Data Governance, Data Integration, Data Lake, Data Management, Data Modeling, Data Pipelines, Data Quality, Data Transformation, Data Visualization, Design, Design Applications, Git, Information Management, Python (Programming Language), Software Development, Software Development Life Cycle (SDLC), Solutions Development, System Designs, Teamwork, Version Control


Preferred Skills:

Agile Methodology, Databricks Platform

Job Posting End Date:

02/1/2025

*A job posting is effective until 11:59:59PM on the day BEFORE the listed job posting end date. Please ensure you apply to a job posting no later than the day BEFORE the job posting end date.


Requisition ID:R309248

Confirm your E-mail: Send Email