Santa Clara, CA, USA
6 days ago
Senior Principal Software Developer - Cluster Networks (JoinOCI-SDE)

Oracle Cloud Infrastructure (OCI) Cluster Networking team is building an ultra-high performance network required to support AI/ML/HPC workloads. This is your opportunity to join the AI revolution and designing systems which allow customers to scale from tens to thousands of GPU without compromising on performance.

This team will be responsible for designing, developing and performance tuning the software+hardware stack required to run distributed AI/ML/HPC workload across thousands of GPUs leveraging libraries like NCCL on high performance network.

This is your opportunity to build innovative solutions for our customers from the ground up. These are exciting times and our team is still young and growing fast, working on ambitious new initiatives. We are looking for adaptable, self-motivated engineers with ability to learn quickly. You should be both a rock solid developer and a distributed systems generalist, able to dive deep into any part of the stack and low-level systems, as well as design broad distributed system interactions. You should value simplicity and scale, work comfortably in a collaborative, agile environment, and be excited to learn.

Career Level - IC5

Confirm your E-mail: Send Email