SRE/DevOps Engineer
Insight Global
Job Description
Insight Global is seeking 2 SRE/DevOps Engineers to join one of our largest financial clients in New York City supporting the Markets APSE Support Team. One individual will be more focused on operations and daily support while the other will be very technical in nature and focused on heavy automation processes. The ideal candidates will possess extensive experience in scripting with Python and Shell, managing Linux servers, and automating processes using Ansible. This role is vital for overseeing and supporting significant projects, including data center migration, disaster recovery, and capacity management.
Key Responsibilities:
Scripting and Development:
o Develop and maintain scripts in Shell and Python for automation tasks.
o Collaborate with development teams to integrate new scripts into existing systems.
Linux Server Management:
o Manage and maintain a vast network of 10,000 Linux servers.
o Ensure system performance and reliability through routine maintenance and upgrades.
High-Level Automation:
o Focus on automating code, troubleshooting, and re-releasing without direct client interactions.
o Build and maintain Ansible playbooks for automated deployments and configurations.
o Continuously improve automation processes to increase efficiency and reduce manual interventions.
Data Center Migration:
o Support data center migration initiatives, including NPT upgrades.
o Provide technical solutions and troubleshooting for migration-related issues.
Large Enterprise Troubleshooting:
o Hands-on resolution of complex tickets in a large-scale monitoring enterprise environment.
o Develop and implement disaster recovery plans to ensure business continuity.
Capacity Management:
o Monitor system performance and optimize capacity management.
o Utilize monitoring tools like Splunk and Dynatrace to identify and address performance bottlenecks.
Continuous Integration/Continuous Deployment (CI/CD):
o Implement and manage Jenkins for CI/CD processes.
o Ensure smooth and efficient code deployments and releases.
ITIL Processes:
o Utilize Remedy for ITIL-based incident and problem management.
o Maintain adherence to ITIL best practices and document properly for .
Projects:
1. Leapp tool:
o Managing automation developed in the previous year and ensuring the seamless execution of upgrade processes.
o Schedule Work for Servers: Plan and coordinate upgrade activities, ensuring minimal disruption to the existing system. Conduct pre-checks to validate the readiness of servers and the environment before initiating upgrades.
o Perform post-checks to verify the success of the upgrades and ensure systems are functioning as expected.
2. Migrations:
o Support the upgrade to a platform data center migration.
o Conduct capacity management and enhance tooling solutions as needed.
3. Azure Migration:
o Plan and execute Azure migration projects later in the year.
4. GenAI Project:
o Collaborate with the CIO to develop automation tools, including scripting and monitoring solutions.
o Focus on automation using Ansible playbooks, dedicating 40-50% of the time to this maturing platform.
We are a company committed to creating diverse and inclusive environments where people can bring their full, authentic selves to work every day. We are an equal opportunity/affirmative action employer that believes everyone matters. Qualified candidates will receive consideration for employment regardless of their race, color, ethnicity, religion, sex (including pregnancy), sexual orientation, gender identity and expression, marital status, national origin, ancestry, genetic factors, age, disability, protected veteran status, military or uniformed service member status, or any other status or characteristic protected by applicable laws, regulations, and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application or recruiting process, please send a request to HR@insightglobal.com .
To learn more about how we collect, keep, and process your private information, please review Insight Global's Workforce Privacy Policy: https://insightglobal.com/workforce-privacy-policy/ .
Skills and Requirements
5+ years of experience as a SRE/DevOps Engineer
Proven experience in developing scripts with Python and Shell.
Extensive Linux server management experience.
Strong proficiency in Ansible for automation.
Familiarity with monitoring tools like ITRS Geneos, Splunk and Dynatrace.
Knowledge of CI/CD tools such as Jenkins.
Experience with ITIL processes and tools like Remedy.
Ability to work on high-level automation projects and support large enterprise environments. Previous Bank of America experience
Experience in data center migrations, capacity management, and disaster recovery.
Cloud experience is an optional plus null
We are a company committed to creating diverse and inclusive environments where people can bring their full, authentic selves to work every day. We are an equal employment opportunity/affirmative action employer that believes everyone matters. Qualified candidates will receive consideration for employment without regard to race, color, ethnicity, religion,sex (including pregnancy), sexual orientation, gender identity and expression, marital status, national origin, ancestry, genetic factors, age, disability, protected veteran status, military oruniformed service member status, or any other status or characteristic protected by applicable laws, regulations, andordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request to HR@insightglobal.com.
Confirm your E-mail: Send Email
All Jobs from Insight Global