Hyderabad, Telangana, India
2 days ago
Associate Director, Data Engineering

Job Description

· Based in Hyderabad, join a global healthcare biopharma company and be part of a 130-year legacy of success backed by ethical integrity, forward momentum, and an inspiring mission to achieve new milestones in global healthcare.
· Lead an organization driven by digital technology and data-backed approaches that supports a diversified portfolio of prescription medicines, vaccines, and animal health products.
· Drive innovation and execution excellence. Be a leader with a passion for using data, analytics, and insights to drive decision-making, which will allow us to tackle some of the world's greatest health threats.

Our Technology Centers focus on creating a space where teams can come together to deliver business solutions that save and improve lives. An integral part of the company's IT operating model, Tech Centers are globally distributed locations where each IT division has employees to enable our digital transformation journey and drive business outcomes. These locations, in addition to the other sites, are essential to supporting our business and strategy.

A focused group of leaders in each Tech Center helps ensure we can manage and improve each location, from investing in the growth, success, and well-being of our people, to making sure colleagues from each IT division feel a sense of belonging, to managing critical emergencies. Together, we must leverage the strength of our team to collaborate globally, optimize connections, and share best practices across the Tech Centers.

Role Overview:

As the Associate Director, Data Engineering, your role will focus on business intelligence to enhance data-driven decision-making across the organization. This role is crucial for transforming data into valuable insights that drive business performance, support strategic initiatives, and ultimately contribute to the company's mission to use science to improve and save lives around the world.

What you will do in this role:

You will develop business intelligence activities and ensure that they are efficient and effective, enabling timely access to accurate data for informed decision-making, with a focus on automation, controls, and data quality.

Design, develop, and maintain data pipelines to extract data from a variety of sources and populate the data lake and data warehouse.

Collaborate with Data Analysts, Data Scientists, and Machine Learning Engineers to identify and transform data for ingestion, exploration, and modeling.

Work with the data governance team to implement data quality checks and maintain data catalogs.

Use orchestration, logging, and monitoring tools to build resilient pipelines.

Use test-driven development methodology when building ELT/ETL pipelines.

Understand and apply concepts like data lake, data warehouse, lakehouse, data mesh, and data fabric where relevant.

Develop data models for cloud data warehouses like Redshift and Snowflake.   

Develop pipelines to ingest data into cloud data warehouses. 

You will investigate enterprise data requirements where there is some complexity and ambiguity, and plan your own data modeling and design activities, selecting appropriate techniques and the correct level of detail for meeting assigned objectives.

You will define and implement data engineering strategies that align with organizational goals and data governance standards.   

You will play a lead role in agile engineering and consulting, providing guidance on complex and unplanned data challenges.

You will collaborate in the formulation of analytics policies, standards, and best practices to ensure consistency and compliance across the organization.  

Encourage a culture of continuous learning, constructive collaboration, and innovation within the team.

What you should have:

Bachelor's degree in Computer Science/Engineering, Data Science, Bioinformatics, Biostatistics, or any other computational quantitative science.

Minimum of 5-7 years of experience developing data pipelines and data infrastructure, ideally within a drug development or life sciences context.

Expertise in software/data engineering practices (including versioning, release management, deployment of datasets, and agile and related software tools).

Strong software development skills in R, Python, SQL, and PySpark.

Agile working knowledge.  

Strong working knowledge of at least one large-scale data processing technology (e.g., high-performance computing, distributed computing), databases, and underlying technology (cloud or on-prem environments, containerization, distributed storage and databases).

Strong interpersonal and communication skills (verbal and written) that effectively bridge scientific and business needs; experience working in a matrix environment.

Proven record of delivering high-quality results in quantitative sciences and/or a solid publication track record.  

Our technology teams operate as business partners, proposing ideas and innovative solutions that enable new organizational capabilities. We collaborate internationally to deliver services and solutions that help everyone be more productive and enable innovation.  

Who we are:

We are known as Merck & Co., Inc., Rahway, New Jersey, USA in the United States and Canada and MSD everywhere else. For more than a century, we have been inventing for life, bringing forward medicines and vaccines for many of the world's most challenging diseases. Today, our company continues to be at the forefront of research to deliver innovative health solutions and advance the prevention and treatment of diseases that threaten people and animals around the world.

What we look for:

Imagine getting up in the morning for a job as important as helping to save and improve lives around the world. Here, you have that opportunity. You can put your empathy, creativity, digital mastery, or scientific genius to work in collaboration with a diverse group of colleagues who pursue and bring hope to countless people who are battling some of the most challenging diseases of our time. Our team is constantly evolving, so if you are among the intellectually curious, join us—and start making your impact today.  

#HYDIT2025

Current Employees apply HERE

Current Contingent Workers apply HERE

Search Firm Representatives Please Read Carefully 
Merck & Co., Inc., Rahway, NJ, USA, also known as Merck Sharp & Dohme LLC, Rahway, NJ, USA, does not accept unsolicited assistance from search firms for employment opportunities. All CVs / resumes submitted by search firms to any employee at our company without a valid written search agreement in place for this position will be deemed the sole property of our company.  No fee will be paid in the event a candidate is hired by our company as a result of an agency referral where no pre-existing agreement is in place. Where agency agreements are in place, introductions are position specific. Please, no phone calls or emails. 

Employee Status: Regular

Flexible Work Arrangements: Hybrid

Required Skills:

Business Intelligence (BI), Database Administration, Data Engineering, Data Management, Data Modeling, Data Visualization, Design Applications, Information Management, Software Development, Software Development Life Cycle (SDLC), System Designs


Job Posting End Date: 03/31/2025

*A job posting is effective until 11:59:59 PM on the day BEFORE the listed job posting end date. Please ensure you apply to a job posting no later than the day BEFORE the job posting end date.


Requisition ID: R329036
