Job Description
You will be part of the Scientific Data product line team. In this role, you will collaborate with multi-disciplinary teams, including product teams, technical teams, scientists, data scientists, subject matter experts, and stakeholders, helping to solve complex data problems and building data products to enable AI/ML. You will need to understand the complexities of research data and work with the product team to make data understandable and analytics-ready. You will define, design, develop, and maintain data products.
The Scientific Data Product Line is responsible for enabling data for the Discovery, Preclinical, Translational Medicine (DPTM) and Developmental Sciences and Clinical Sciences (DSCS) organizations to get to better and faster insights in the drug discovery process.
Primary Responsibilities
Understand user problems and pain points. Design and develop solutions to address business needs using enterprise solutions
Lead data engineers on the team to ensure solutions are fit for purpose and aligned with enterprise needs
Design, develop and maintain data pipelines to extract data from a variety of sources and populate the data lake, data warehouse and lakehouse
Develop the various data transformation rules and data modeling capabilities
Collaborate with Product Analysts, Data Scientists, and Machine Learning Engineers to identify and transform data so that it is understandable
Work with the data governance team to implement data quality checks and maintain data catalogs
Use orchestration, logging, and monitoring tools to build resilient pipelines
Use test-driven development methodology when building ELT pipelines
Use Git for version control and understand various branching strategies
Work as part of an Agile team
Describe what to solve by writing problem statements and requirements, and facilitate the “how” with the development team
Plan, design, and conduct testing activities, and compile critical content for training and communication for our users
Design and implement test cases
Create technical documentation as needed for SDLC
Partner with business and IT leadership to formulate business cases
Required Experience and Skills
B.Sc. or higher degree in Computer Science, Chemistry, or an equivalent field required
Minimum of 3-5 years of experience working with customers to define requirements, preferably data requirements
Domain knowledge of pharmaceutical drug discovery and pre-clinical development
Hands-on experience with AWS services (S3, IAM, Redshift, SageMaker, Glue, Lambda, Step Functions, CloudWatch)
Experience with platforms like Databricks, Dataiku, Delta Lake
Experience with data governance, quality checks, and data catalog capabilities
Experience with data integration & transformation tools – AWS Glue, Starburst Trino, Databricks
Proficient in CI/CD using GitHub Actions, Jenkins, CloudFormation, Terraform, Git, Docker, Apache Airflow
Proficient in Python, Spark – PySpark
Growth and product mindset, with the ability to work in a cross-functional team setup
Ability to work independently, anticipating and resolving problems
Excellent written and verbal communication skills
Ability to engage effectively with both technical and non-technical stakeholders and users
Preferred Experience and Skills
Any AWS developer or architect certification
Experience in Java
Experience with Matillion ETL for Data Transformation
Experience working in Agile software development
Familiarity with NoSQL Databases
Familiarity with ontologies and their use in creating data products
What we offer
Exciting work in a great team, global projects, international environment
Opportunity to learn and grow professionally within the company globally
Hybrid working model, flexible role pattern
Pension and health insurance contributions
Internal reward system plus referral programme
5 weeks annual leave, 5 sick days, 15 days of certified sick leave paid above statutory requirements annually, 40 paid hours annually for volunteering activities, 12 weeks of parental contribution
Cafeteria for tax free benefits according to your choice (meal vouchers, Lítačka, sport, culture, health, travel, etc.), Multisport Card
Vodafone, Raiffeisen Bank, Foodora, and Mall.cz discount programmes
Up-to-date laptop and iPhone
Parking in the garage, showers, refreshments, massage chairs, library, music corner
Competitive salary, incentive pay, and much more
Ready to take up the challenge? Apply now!
Know anybody who might be interested? Refer this job!
Search Firm Representatives Please Read Carefully
Merck & Co., Inc., Rahway, NJ, USA, also known as Merck Sharp & Dohme LLC, Rahway, NJ, USA, does not accept unsolicited assistance from search firms for employment opportunities. All CVs / resumes submitted by search firms to any employee at our company without a valid written search agreement in place for this position will be deemed the sole property of our company. No fee will be paid in the event a candidate is hired by our company as a result of an agency referral where no pre-existing agreement is in place. Where agency agreements are in place, introductions are position specific. Please, no phone calls or emails.
Employee Status: Regular
Relocation:
VISA Sponsorship:
Travel Requirements:
Flexible Work Arrangements: Hybrid
Shift:
Valid Driving License:
Hazardous Material(s):
Required Skills: Business Intelligence (BI), Database Administration, Data Engineering, Data Governance, Data Integration, Data Lake, Data Management, Data Modeling, Data Pipelines, Data Quality, Data Transformation, Data Visualization, Design, Design Applications, Git, Information Management, Python (Programming Language), Software Development, Software Development Life Cycle (SDLC), Solutions Development, System Designs, Teamwork, Version Control
Preferred Skills: Agile Methodology, Databricks Platform
Job Posting End Date: 02/1/2025
*A job posting is effective until 11:59:59PM on the day BEFORE the listed job posting end date. Please ensure you apply to a job posting no later than the day BEFORE the job posting end date.
Requisition ID: R309248