Expo Business Park, Romania
1 day ago
Site Reliability Engineer - WARF @ING Bank

Discover ING Bank Romania

ING believes in a world where everyone has the right to grow and progress in their own way. We express this in our global tagline, “do your thing”. Perhaps more than in any other large company, we extend our belief in the power of autonomy to our own people. But there’s a catch. In return for great freedom, we expect people to do great things for our customers, our stakeholders, and ING at large.

To work here is to be surrounded by people who are energetic, ambitious, friendly and respectful: talented specialists who take the responsibility and autonomy to make great things happen. We stay curious, thrive on change, and seek new and better ways to make it happen. Active in Romania for 30 years, ING Bank pioneered and challenged the local banking industry. Technology and innovation are at the core of what we do, making our products relevant for our customers’ lives and businesses.

ING Bank Romania is the only bank with an organic growth within the top 10 local banks by assets, without acquisitions of client portfolios or other banks. ING Bank Romania is an universal bank with more than 1.8 million customers from three business segments: individuals (retail), SME and Mid-Corporate companies and Wholesale Banking.

Join us!

Mission

The SRE team is responsible to roll-out the SRE (Site Reliability Engineering) practices to improve the reliability of Critical Business Services for ING Bank Romania. The SRE team is responsible for defining, introducing, and promoting SRE processes and practices like Observability, Incident & Problem Management, Capacity & Performance Management, IT Service Continuity, Well-Architected Review Framework, Operational Resilience & Reliability Testing, Release Procedures & Change Management, Reliability reporting & error budgeting, etc.

This role is responsible for ‘Resilience by design’ and challenges & contributes to ING’s Well-Architected Framework and underlying reliability patterns (as developed by Enterprise Architecture).

As part of the SRE team, you will:

Steer patterns to implementation. This includes design and/or development of conformity bots in the CI/CD pipeline, policy-as-code validations for infrastructure provisioning, conformity monkey or other ways to validate implementation in production and perform drift detection.

Ensure proper documentation, training material and other ways to get the knowledge to our engineers across ING.

Contribute as a reliability expert to key operational activities with a focus on services or incidents touching multiple key areas. This includes performing Critical Business Service/critical chain reviews to identify weaknesses to be solved and supporting P1 incidents and Major Incidents (as expert) by providing expertise that ensures high quality root-cause analysis and by ensuring follow-up of structural (architectural/design-related) findings with Architects & DevOps.

Your day to day

The initial focus will be to challenge and to contribute to ING’s Well-Architected Framework (WARF) and underlying reliability patterns (as developed by Enterprise Architecture). The rest of the activities include:

Ensures that the architecture of IT Services that support CBSs is designed for resilience;Prepares, facilitates, and coordinates the Well-Architected Review E2E to identify weaknesses to be solved. Organizes the review based on the process specific triggers, selects the System Experts and the Reviewers (including Lead Reviewer) that should be included in the review;Ensures that the Reviewers challenge the design and implementation of IT Services based on best practices from the Well-Architected Framework;Ensures weaknesses are identified during the Well-Architected Review if the case and documents the findings in the Review Document template. If actions are required, ensures that backlog items are created and follows-up on their resolution;Ensures accurate reporting of the Well-Architected Reviews and related improvements;Operates in strong cooperation with Architects, the rest of the SRE team, engineers and aligns with the Global Review Coordinator from Global SRE team;Supports P1 Incidents and Major Incidents as expert and provides expertise to ensure high-quality root-cause analysis and follow-up of structural (architectural/design-related) findings with Architects and DevOps teams.

What you bring to the team

Education: Bachelor's or Master's degree in computer science, information systems, or a related discipline;Experience: 10+ Years in software engineering/IT operations and/or IT architect roles;Technical skills:Knowledgeable about technology in all levels in the technology stack (from infrastructure to front-end, from CI/CD to observability tooling) with expert knowledge & hands-on experience on one or more levels (e.g. infrastructure & back-end development and/or observability & CI/CD tooling);In-depth knowledge of system design and experience with scalable and reliable infrastructure;Understanding of network protocols, security best practices, and ability to implement secure and robust solutions;Competence in using Cloud services;Tools:  ING Private Cloud or Public Cloud (Azure or Google Cloud) and related VM/container stacks & tooling; application-level technologies & tooling heavily in use at ING e.g. spring boot, ING’s API SDK, Azure DevOps, Prometheus/ELK stack/Tracing or ING’s specific implementations (e.g. RTK2, Log4All, MDPL).Proven experience or interest in the Site Reliability Engineering (SRE) methodology, IT security and compliance. Familiarity with DevOps culture and practices;Proven experience with ITIL processes and ITSM tools (ServiceNow, Azure DevOps, etc.);Strong analytical and problem-solving skills;High accuracy in performing duties;Ability to efficiently promote in the organization the SRE concepts and frameworks;Effective communication, both written and verbal, to convey complex technical concepts in a clear and understandable manner;Strong stakeholder management abilities.

What we offer

Impactful work in a fun and collaborative environment.Open-concept offices designed for both team work and relaxation.Corporate events and social gatherings.Hybrid way of working with flexible working schedule and short week options.Monthly budget on Benefit platform.Extra annual leave days depending on the total length of working experience.Growth opportunities through upskilling/ reskilling programs and a variety of learning and development platforms: ING Learning Centre, Udemy, Bookster, as well as through trainings and certifications.Possibility to access Internal roles, International Short-Term Assignments or Long-Term Assignments.Context to make an impact through Sustainability and Corporate Social Responsibility projects.
Confirm your E-mail: Send Email
All Jobs from ING Direct