Mumbai, India
29 days ago
HPC Engineer

Position Overview- We are looking for a skilled High-Performance Computing (HPC) Infrastructure Engineer to join our team. As an HPC Infrastructure Engineer, you will be responsible for designing, implementing, and maintaining HPC-based infrastructure. You will play a crucial role in ensuring the performance, reliability, and scalability of our deep learning and AI infrastructure.

 

Key Responsibilities-

Designing, deploying, and managing HPC systems and related infrastructure.
Configuring and optimizing DGX clusters for performance, reliability, and scalability.
Collaborating with data scientists, AI engineers, and IT teams to integrate HPC systems into our overall AI and deep learning workflows.
Monitoring system performance and implementing proactive measures to maintain optimal operation.
Troubleshooting and resolving issues related to HPC systems, including hardware, software, and network components.
Implementing security measures and best practices to ensure the integrity and confidentiality of DGX-based data and workflows.
Documenting infrastructure configurations, processes, and procedures.
Providing technical guidance and training to team members on HPC-related technologies and best practices.
Staying current with HPC hardware and software advancements from OEMs and recommending upgrades or enhancements as needed.