Senior Software QA Test Development Engineer

Nvidia

Santa Clara, CA
Full Time
Senior
136k-265k
25 days ago

Job Description

About the Role

NVIDIA is the world leader in GPU Computing, passionate about markets including gaming, automotive, vision, HPC, datacenters, and networking. Positioned as the ‘AI Computing Company’, NVIDIA GPUs power Deep Learning frameworks, analytics, data centers, and autonomous vehicles. The company values dedicated, forward-thinking, and hard-working technical professionals who thrive in diverse environments, possess strong interpersonal skills, and are committed to continuous process improvement. The role involves enterprise server integration, Linux expertise, reliability testing, AI tools, NLP, DevOps, and CI/CD within the platform SWQA team.

Key Responsibilities

  • Develop and execute NVIDIA HGX/DGX/MGX platform test plans covering servers, OS, firmware, and the CUDA software stack, based on design documentation.
  • Install and test various systems, including OS, server firmware, and software stack.
  • Drive root cause analysis of reliability and validation test failures and follow through to mitigation.
  • Build, develop, and debug server and OS level automation front-end and back-end frameworks and tests.
  • Review partner and supplier test results and prescribe additional reliability testing on components, servers, and packaging as needed.
  • Work in an agile software development team with very high production quality standards.
  • Manage the bug lifecycle and collaborate across groups to drive solutions.

Requirements

  • Bachelor's Degree (or equivalent experience) in a STEM (Science, Technology, Engineering, Math, or Physics) field.
  • 5+ years of proven experience, or a master's degree.
  • Experience in OS and server level automation, CI/CD processes, and DevOps using Python, Shell, Ansible, Jenkins, C/C++, Java, JavaScript.
  • Troubleshooting and debugging experience in server and Linux environments (Ubuntu, Red Hat, CentOS, SUSE, Fedora, etc.) in bare-metal and virtualized environments (KVM, VMware, Hyper-V).
  • Hands-on experience with model testing, AI frameworks (TensorFlow, PyTorch, Cursor), NLP, and LLM benchmarking.
  • Experience using AI development tools for test plan creation, test case development, and automation.
  • Knowledge of firmware, BMC/OpenBMC, network protocols, enterprise storage devices, PCIe buses, CPU, memory, UEFI, Redfish.

Nice to Have

  • Experience with AI tools, LLM, and NLP.
  • Experience working with NVIDIA GPU hardware.
  • Solid understanding of Linux virtualization (KVM, Docker, Kubernetes).
  • Background in parallel programming, ideally CUDA or OpenCL.

Qualifications

  • Educational background in STEM or related fields as specified in requirements.

Benefits & Perks

  • Competitive salaries with a base salary range of $136,000 - $264,500, determined by location and experience.
  • Eligibility for equity and benefits (https://www.nvidia.com/en-us/benefits/).

Working at Nvidia

NVIDIA is committed to fostering a diverse work environment and is proud to be an equal opportunity employer. The company values diversity in current and future employees and does not discriminate based on race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability, or any other characteristic protected by law.

Job Details

Posted At: Jul 10, 2025
Job Category: QA Engineering
Salary: 136k-265k
Job Type: Full Time
Experience: Senior

About Nvidia

Website

nvidia.com

Location

Santa Clara, CA

Industry

Semiconductor and Related Device Manufacturing