Madrid, Spain
19 days ago
Cloud DevOps Infrastructure Engineer

Roche fosters diversity, equity and inclusion, representing the communities we serve. When dealing with healthcare on a global scale, diversity is an essential ingredient to success. We believe that inclusion is key to understanding people’s varied healthcare needs. Together, we embrace individuality and share a passion for exceptional care. Join Roche, where every voice matters.

The Position

As a Cloud DevOps Infrastructure Engineer at Roche, you will be at the forefront of optimizing the reliability, availability, scalability, and performance of our Cloud Platform infrastructure. Your role involves applying software engineering principles across the entire lifecycle, from inception to decommissioning. You will dive into the heart of our cloud infrastructure platforms, leveraging your specialized knowledge to optimize performance, scalability, and reliability. Be the driving force behind continuous improvements, ensuring our platforms meet and exceed the highest standards.

The Cloud Platform team serves our internal Roche customers and IT partners worldwide (Europe, NALA and APAC), designing, building and operating modern distributed systems on public cloud infrastructure.

Job responsibilities

Engages in and improves, with low guidance, the whole lifecycle of cloud platforms and services, from inception and design through deployment, operation and retirement, by applying software engineering principles to build and manage large-scale IT infrastructure products and services (both on-premises and in public cloud), optimizing:

service reliability, availability, capacity and performance, eliminating manual work through automation

software development and deployment, by abstracting away the complexity of infrastructure and providing self-service tools and APIs that allow developers to code and ship software quickly.

Implements and maintains CI/CD pipelines, enabling developers to easily build, test, and deploy their applications. 

Develop self-healing features.

Contribute to Disaster Recovery execution plans.

Ensure IT infrastructure services reach and maintain the agreed service level indicators (SLIs), objectives (SLOs) and agreements (SLAs) in compliance with QA requirements.

Contribute to the maintenance of services once they are live by measuring and monitoring availability, latency and overall system health.

Contribute to activities focused on availability, tuning, performance, efficiency, change and configuration management, monitoring, emergency response and capacity planning.

Manages ITSM processes for reporting and resolving incidents, problems, changes, requests and releases, and tracks their resolution.

Monitors and resolves incidents/problems (including major ones) with platform operations, suggesting priorities and collaborating in the resolution when required.

Practice sustainable incident response and blameless postmortems. 

Ensures implemented solutions and components comply with Quality/Regulatory standards, as applicable.

Implements cost, compliance and security best practices, ensuring that platforms and services meet the corresponding requirements.

Contribute to audit exercises by providing the required evidence to the audit teams.

Collaborate with developers, Managed Services suppliers, other teams and vendors to:

continuously improve application development velocity

optimize service reliability, availability and performance

Works closely with development teams to ensure that new features and changes are rolled out with reliability in mind, bringing a broader and more strategic understanding of reliability that spans multiple facets of development.

Act as an analyst by transforming customers' and developers' needs into specific technical requirements to be implemented by the product team or by other teams.

Maintain in-depth knowledge of current and emerging technologies within your technical infrastructure area of responsibility, to further the objectives of the team or department and to tackle complex, interdisciplinary issues.

Job requirements

Good interpersonal skills and good oral and written communication skills.

English language proficiency is required. 

Proficiency in German, Spanish or Chinese is considered a plus.

Demonstrated customer- and delivery-focused mindset.

Well-proven scripting and automation skills with strong knowledge of delivering and managing infrastructure as code.

Experience working with cloud infrastructure platforms, including their availability, administration, configuration and integration.

Familiarity with Software Engineering and DevOps principles and automated testing and CI/CD tools. 

Knowledge of agile methodologies and principles.

Ability to continuously research, learn, innovate and share knowledge.

Ability to work effectively with team members and virtual teams from different locations and different cultural backgrounds.

Strong problem-solving and decision-making skills.

Ability to function independently with low supervision and navigate ambiguity.

Customer orientation, partnership, collaboration and trust.

Technology Skills:  

Proven scripting and automation skills with expertise in delivering and managing infrastructure as code.

Recent experience or exposure designing, implementing and operating infrastructure, or designing hybrid multi-cloud solutions, on Azure public cloud platforms.

Creation of highly available, fault-tolerant and auto-scaled Dev/Test/Stage and Production environments using Infrastructure as Code tools such as ARM or Terraform.

Hands-on technical skills in automation (Python, Ansible, PowerShell, Jenkins, GitLab, Rundeck, Terraform), infrastructure as code, logging, monitoring and observability, infrastructure configuration, scripting languages and applications. Source code management (GitLab, Bitbucket, GitHub).

Experience with Infrastructure Performance Analysis, reporting and Capacity Planning is nice to have.

DevOps Pipeline Automation experience: GitLab CI/CD.

Education / Years of Experience: 

Bachelor’s degree in Computer Science/Engineering or equivalent work experience in an information technology environment (networking, infrastructure, database).

Azure certifications or equivalent working knowledge of cloud environments are nice to have.

You will bring 3+ years of relevant work experience in one or more multinational work environments (healthcare industry experience is a plus).

Moderate travel is required, as is the ability to work across multiple time zones, including on-call, maintenance and extended working hours.

Who we are

At Roche, more than 100,000 people across 100 countries are pushing back the frontiers of healthcare. Working together, we’ve become one of the world’s leading research-focused healthcare groups. Our success is built on innovation, curiosity and diversity.

Roche is an Equal Opportunity Employer.
