NJ, United States
22 days ago
Platform Engineer

Overview: 

We are building out the enterprise big data platform in AWS and are looking for hands-on platform engineers to drive best practices in our cloud implementation. The platform will be used by all facets of the organization to steer the business towards data-driven decisions.

The ideal candidate will want to work in a fast-paced agile environment, designing and implementing solutions to support the data lake and data warehouse. Automation is the key. Performance, distributed computing, and asynchronous behavior are the challenges. We are problem solvers who take ideas from inception to production. The pipelines and tools we build support our community of data scientists and data analysts. Innovation is encouraged; collaboration is required. We believe in active mentoring and continuous improvement.

Responsibilities:

The Platform Engineer is responsible for designing, developing, and maintaining the foundational infrastructure and services that support application development, deployment, and operations for the data lake and the data warehouse. The role enables development teams to deliver software more efficiently, securely, and at scale. Key responsibilities include:

Writing reusable, maintainable scripts to define infrastructure.
Designing scalable infrastructure solutions.
Building continuous integration and continuous delivery (CI/CD) pipelines to automate the software build, test, and deployment processes.
Implementing tools and services for monitoring the health and performance of systems, applications, and networks.
Ensuring proper logging, alerting, and metrics collection.
Ensuring the platform follows security best practices.
Working with security teams to implement policies like least privilege access, auditing, and compliance monitoring.
Working closely with software development teams to understand their needs and build self-service tools.
Providing support and troubleshooting for developers.
Implementing containerization solutions.
Implementing efficient resource allocation.
Ensuring that the platform can handle increasing traffic and user demands while maintaining performance and reliability.
Implementing fault-tolerant architectures and disaster recovery plans.
Creating clear documentation of platform configurations, processes, and best practices.

 
