Nvidia logo

Senior Research Engineer, ML Data Pipelines Senior Research Engineer, ML Data Pipelines

Nvidia

Santa Clara, CA
Full Time
Senior
224k-426k
8 days ago

Job Description

About the Role

The Senior Research Engineer, ML Data Pipelines at NVIDIA is responsible for designing, implementing, and optimizing scalable machine learning data pipelines for training multimodal foundation models. The role involves close collaboration with researchers to preprocess, transform, and manage large datasets for robot model training and evaluation, as well as developing tools for labeling and curating sensor data. The position offers the opportunity to contribute significantly to research projects and product roadmaps within a leading technology company focused on developing general-purpose robots and large-scale foundation models.

Key Responsibilities

  • Design, implement, and optimize scalable ML data pipelines for training multimodal foundation models.
  • Collaborate closely with researchers to preprocess, transform, and manage large datasets for robot model training and evaluation.
  • Develop tools for labeling and curating multiple streams of sensor data.
  • Continuously monitor robot data collection processes and evaluate data quality.
  • Implement and optimize PyTorch data loading modules for video processing and robot learning on large GPU clusters.

Requirements

  • Bachelor's Degree in Computer Science, Robotics, Engineering, or a related field.
  • 10+ years of full-time industry experience working with large-scale machine learning data pipelines.
  • Proficiency in Python data processing libraries.
  • Hands-on model training experience in PyTorch, JAX, or TensorFlow.
  • Strong experience with large-scale GPU clusters, HPC environments, and job scheduling/orchestration tools (e.g., SLURM, Kubernetes).

Nice to Have

  • Master's or PhD degree in Computer Science, Robotics, Engineering, or a related field.
  • Strong experience with cloud infrastructure management (AWS, Azure, GCP) and data stores (Postgres, MySQL, ElasticSearch, Redis).
  • Experience at autonomous driving or robotics companies training machine learning models on massive datasets.
  • Demonstrated Tech Lead experience, coordinating a team of engineers and driving projects from conception to deployment.
  • Contributions to popular open-source frameworks.

Qualifications

  • Educational background in Computer Science, Robotics, Engineering, or related fields.
  • Extensive industry experience (10+ years) in ML data pipelines.

Benefits & Perks

  • Base salary range of $224,000 to $425,500, determined based on location, experience, and comparable roles.
  • Eligibility for equity and other compensation benefits.

Working at Nvidia

NVIDIA is widely considered one of the most desirable employers in the technology industry, known for forward-thinking and productive people. The company fosters a diverse work environment and is committed to equal opportunity employment, valuing diversity in current and future employees.

Apply Now

Job Details

Posted AtAug 7, 2025
Job CategoryData Engineering
Salary224k-426k
Job TypeFull Time
Work ModeOnsite
ExperienceSenior

Job Skills

AI Insights

Key skills identified from this job posting

Sign upto access all insights for this job

About Nvidia

Website

nvidia.com

Location

Santa Clara, CA

Industry

Semiconductor and Related Device Manufacturing

Get job alerts

Set up personalized alerts for your job search and get tailored job digests for close matches