Nvidia logo

Senior Generative AI Research Engineer

Nvidia

Santa Clara, CA
Full Time
Senior
224k-357k
about 1 month ago

Job Description

About the Role

At NVIDIA, we're not just building the future, we're generating it. Our Cosmos generative AI engineering team is pushing the boundaries of what's possible across multimodal learning, video generation, synthetic data, intelligent simulation, and agentic systems. We are looking for exceptionally driven engineers and applied scientists with deep experience in generative modeling to help define the next era of AI computing.

Key Responsibilities

  • Design, post-train, and optimize foundation models (e.g., LLMs, diffusion video models, VLMs, VLAs) for real world applications.
  • Contribute to highly-collaborative development on large-scale training infrastructure, high-efficiency inference pipelines, and scalable data pipelines.
  • Work with teams in research, software, and product to bring world models from idea to deployment.
  • Collaborate on open-source and internal projects, author technical papers or patents, and mentor junior engineers.
  • Prototype and iterate rapidly on experiments across cutting-edge AI domains, including agentic systems, reinforcement learning, reasoning, and video generation.
  • Design and implement model distillation algorithms for size reduction and diffusion step optimization.
  • Profile and benchmark training and inference pipelines to achieve production-ready performance requirements.

Requirements

  • Minimum 8 years industry or 5+ years research/postdoc experience building and deploying generative AI systems.
  • Proficiency in PyTorch, JAX, or other deep learning frameworks.
  • Expertise in one or more of: LLMs, coding agents, diffusion models, autoregressive models, VAE/GAN architectures, retrieval-augmented generation, neural rendering, or multi-agent systems.
  • Experience with transformer architectures and variants of attention mechanisms.
  • Hands-on experience with large scale training (e.g., ZeRO, DDP, FSDP, TP, CP) and data processing tools (e.g., Ray, Spark).
  • Production-quality software engineering skills in Python.
  • MS, PhD, or equivalent experience in Computer Science, Machine Learning, Applied Math, Physics, or related field.
  • 12+ years of relevant software development experience.

Nice to Have

  • Familiarity with high-performance computing and GPU acceleration.
  • Contributions to influential open-source libraries or conference publications (NeurIPS, ICML, CVPR, ICLR).
  • Experience working with multimodal data (e.g., vision-language, VLA, audio).
  • Prior work with NVIDIA GPU-based compute clusters or simulation environments.

Qualifications

  • Educational background with MS, PhD, or equivalent in relevant fields.

Benefits & Perks

  • Base salary range of $224,000 - $356,500 USD, determined by location, experience, and similar positions.
  • Eligibility for equity and benefits.

Working at Nvidia

NVIDIA is widely considered to be one of the technology world's most desirable employers. We have some of the most forward-thinking and hardworking people on the planet working for us. We are committed to fostering a diverse work environment and are proud to be an equal opportunity employer, valuing diversity in our current and future employees.

Apply Now

Job Details

Posted AtJul 26, 2025
Job CategoryData Science
Salary224k-357k
Job TypeFull Time
ExperienceSenior

Job Skills

AI Insights

Key skills identified from this job posting

Sign upto access all insights for this job

About Nvidia

Website

nvidia.com

Location

Santa Clara, CA

Industry

Semiconductor and Related Device Manufacturing

Get job alerts

Set up personalized alerts for your job search and get tailored job digests for close matches