Santa Clara, CA, USA
56 days ago
Principal Systems Software Engineer

We are seeking expert Systems Software Engineers to join our Apache Spark Acceleration team. Data scientists spend a considerable amount of time exploring data and iterating over machine learning (ML) experiments. NVIDIA believes that data science and analytics workflows can benefit tremendously from acceleration, enabling data users to explore more and larger datasets and reach their business goals faster and more efficiently.

You will work with the open source community to accelerate Apache Spark with GPUs for data science. Apache Spark is the most popular data processing engine in data centers. We strive to significantly accelerate Apache Spark 3.x use cases without application code changes. You will work on open source libraries (such as https://nvidia.github.io/spark-rapids/) for use in both on-premises deployments and cloud services (such as Databricks, AWS EMR, Google Dataproc, and Cloudera).

What you'll be doing:

Leading the design and implementation of accelerated Apache Spark and related big-data frameworks

Creating a collection of accelerated libraries for data analytics and machine learning

Working with a team of outstanding engineers, including PMC members and committers of Apache Spark, Apache Hadoop, Apache Hive, and Apache Arrow

Engaging with open source communities (including Apache Spark, RAPIDS, and UCX) for technical discussion and contribution

Working with NVIDIA strategic partners on deploying advanced machine learning and data analytics solutions in public cloud or on-premises clusters

Presenting technical solutions at industry conferences and meetups

Providing recommendations and feedback to teams on decisions about infrastructure, continuous integration, and testing strategy

Building, testing, and optimizing CUDA/C++ libraries across different platforms

What we need to see:

BS, MS, or PhD in Computer Science, Computer Engineering, or a closely related field, or equivalent experience

15+ years of work experience in software development

5+ years of working experience as a contributor or committer to key open source big-data projects, such as Apache Spark, Apache Flink, Trino, Apache Kafka, Apache Hive, Apache Arrow, Apache Hadoop, Delta Lake, and Apache Iceberg

Outstanding technical skills in designing and implementing high-quality distributed systems

Excellent programming skills in C++, Java, and/or Scala

Ability to work successfully with multi-functional teams across organizational boundaries and geographies

Highly motivated with strong interpersonal skills

The base salary range is 272,000 USD - 419,750 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
